S&F Ref: 472450D1
AUSTRALIA
PATENTS ACT 1990 COMPLETE SPECIFICATION FOR A STANDARD PATENT
ORIGINAL
Name and Address of Applicants: The Ludwig Institute for Cancer Research 33rd Floor, 605 Third Avenue New York New York 10158 United States of America Helsinki University Licensing Ltd. OY Viikinkaari 6 00710 Helsinki Finland Actual Inventor(s): Address for Service: Invention Title: Kari Alitalo, Vladimir Joukov Spruson Ferguson St Martins Tower,Level 31 Market Street Sydney NSW 2000 (CCN 3710000177) Vascular Endothelial Growth Factor C (VEGF-C) Protein and Gene, Mutants Thereof, and Uses Thereof The following statement is a full description of this invention, including the best method of performing it known to me/us:- 5845c Vascular Endothelial Growth Factor C (VEGF-C) Protein and Gene, Mutants Thereof, and Uses Thereof Field of the Invention The present invention generally relates to the field of genetic engineering and more particularly to growth factors for endothelial cells and growth factor genes.
Background of the Invention Developmental growth, the remodelling and regeneration of adult tissues, as well as solid tumour growth, can only occur when accompanied by blood vessel formation. Angioblasts and haematopoietic precursor cells differentiate from the mesoderm and form the blood islands of the yolk io sac and the primary vascular system of the embryo. The development of blood vessels from these early (in situ) differentiating endothelial cells is termed vasculogenesis. Major embryonic blood vessels are believed to arise via vasculogenesis, whereas the formation of the rest of the vascular tree is thought to occur as a result of vascular sprouting from pre-existing vessels, a process called angiogenesis, Risau et al., Devel. Biol., 125:441-450 (1998).
Endothelial cells give rise to several types of functionally and morphologically distinct vessels.
When organs differentiate and begin to perform their specific functions, the phenotypic heterogeneity of endothelial cells increases. Upon angiogenic stimulation, endothelial cells may re-enter the cell cycle, migrate, withdraw from the cell cycle and subsequently differentiate again to form new vessels that are [R:\LIBC]07015.doc:bav
I
-2 functionally adapted to their tissue environment. Endothelial cells undergoing angiogenesis degrade the underlying basement membrane and migrate, forming capillary sprouts that project into the perivascular stroma. Ausprunk et al., Microvasc. Rev., 14:51-65 (1977). Angiogenesis during tissue development and regeneration depends on the tightly controlled processes of endothelial cell proliferation, migration, differentiation, and survival. Dysfunction of the endothelial cell regulatory system is a key feature of many diseases. Most significantly, tumor growth and metastasis have been shown to be angiogenesis dependent. Folkman et al., J. Biol. Chem., 267:10931-10934 (1992).
Key signals regulating cell growth and differentiation are mediated by polypeptide growth factors and their transmembrane receptors, many of which are tyrosine kinases. Autophosphorylated peptides within the tyrosine kinase insert and carboxylterminal sequences of activated receptors are commonly recognized by kinase substrates involved in signal transduction for the readjustment of gene expression in responding cells.
Several families of receptor tyrosine kinases have been characterized. Van der Geer et al., Ann. Rev. Cell Biol., 10:251-337 (1994). The major growth factors and receptors transducing angiogenic stimuli are schematically shown in Fig. 1.
Fibroblast growth factors are also known to be involved in the regulation of angiogenesis. They have been shown to be mitogenic and chemotactic for cultured endothelial cells. Fibroblast growth factors also stimulate the production of proteases, 2 0 such as collagenases and plasminogen activators, and induce tube formation by endothelial cells. Saksela et al., Ann. Rev. CellBiol., 4:93-126 (1988). There are two general classes of fibroblast growth factors, FGF-I and FGF-2, both of which lack conventional signal peptides. Both types have an affinity for heparin, and FGF-2 is bound to heparin sulfate proteoglycans in the subendothelial extracellular matrix from which it may be released after injury. Heparin potentiates the stimulation of endothelial cell proliferation by angiogenic FGFs, both by protecting against denaturation and degradation and dimerizing the FGFs.
Cultured endothelial cells express the FGF-1 receptor but no significant levels of other high-affinity fibroblast growth factor receptors.
Among other ligands for receptor tyrosine kinases, the platelet derived 3 0 growth factor, PDGF-BB, has been shown to be weakly angiogenic in the chick chorioallantoic membrane. Risau et al., Growth Factors, 7:261-266 (1992). Transforming growth factor a (TGFa) is an angiogenic factor secreted by several tumor cell types and 3 by macrophages. Hepatocyte growth factor (HGF), the ligand of the c-met protooncogene-encoded receptor, also is strongly angiogenic.
Recent evidence shows that there are endothelial cell specific growth factors and receptors that may be primarily responsible for the stimulation of endothelial cell growth, differentiation and certain differentiated functions. The best studied of these is vascular endothelial growth factor (VEGF), a member of the PDGF family. Vascular endothelial growth factor is a dimeric glycoprotein of disulfide-linked 23 kD subunits.
Other reported effects of VEGF include the mobilization of intracellular calcium, the induction of plasminogen activator and plasminogen activator inhibitor-1 synthesis, stimulation of hexose transport in endothelial cells, and promotion of monocyte migration in vitro. Four VEGF isoforms, encoded by distinct mRNA splice variants, appear to be equally capable of stimulating mitogenesis in endothelial cells. However, each isoform has a different affinity for cell surface proteoglycans, which behave as low affinity receptors for VEGF. The 121 and 165 amino acid isoforms of VEGF (VEGF121 and VEGF165) are secreted in a soluble form, whereas the isoforms of 189 and 206 amino acid residues remain cell surface-associated and have a strong affinity for heparin. VEGF was originally purified from several sources on the basis of its mitogenic activity toward endothelial cells, and also by its ability to induce microvascular permeability, hence it is also called vascular permeability factor (VPF).
Two high affinity receptors for VEGF have been characterized: VEGFR- 1/Fit-1 (fms-like tyrosine kinase-1) and VEGFR-2/KDR/Flk-1 (kinase insert domain containing receptor/fetal liver kinase-1). Those receptors are classified in the PDGFreceptor family, but they have seven rather than five immunoglobulin-like loops in their extracellular domain (see Fig. and they possess a longer kinase insert than normally observed in this family. The expression of VEGF receptors occurs mainly in vascular endothelial cells, although some may be present on hematopoietic progenitor cells, monocytes, and melanoma cells. Only endothelial cells have been reported to proliferate in response to VEGF, and endothelial cells from different sources show different responses.
Thus, the signals mediated through VEGFR-1 and VEGFR-2 appear to be cell type specific. The VEGF-related placenta growth factor (PIGF) was recently shown to bind to VEGFR-I with high affinity. PIGF was able to enhance the growth factor activity of VEGF, but it did not stimulate endothelial cells on its own. Naturally occurring 4 VEGF/PIGF heterodimers were nearly as potent mitogens as VEGF homodimers for endothelial cells. Cao et al., J. Biol. Chem., 271:3154-62 (1996).
The Fit4 receptor tyrosine kinase (VEGFR-3) is closely related in structure to the products of the VEGFR-1 and VEGFR-2 genes. Despite this similarity, the mature 0 5 form of Flt4 differs from the VEGF receptors in that it is proteolytically cleaved in the 00 extracellular domain into two disulfide-linked polypeptides. Pajusola et aL, Cancer Res., 52:5738-5743 (1992). The 4.5 and 5.8 kb Flt4 mRNAs encode polypeptides which differ in their C-termini due to the use of alternative 3' exons. Isoforms of VEGF or PIGF do 0 not show high affinity binding to Flt4 or induce its autophosphorylation.
Expression of Fit4 appears to be more restricted than the expression of VEGFR-1 or VEGFR-2. The expression of Flt4 first becomes detectable by in situ hybridization in the angioblasts of head mesenchyme, the cardinal vein, and extraembryonically in the allantois of 8.5 day p.c. mouse embryos. In 12.5 day p.c.
embryos, the Flt4 signal is observed in developing venous and presumptive lymphatic endothelia, but arterial endothelia appear negative. During later stages of development, Flt4 mRNA becomes restricted to developing lymphatic vessels. The lymphatic endothelia and some high endothelial venules express Flt4 mRNA in adult human tissues and increased expression occurs in lymphatic sinuses in metastatic lymph nodes and in lymphangioma. These results support the theory of the venous origin of lymphatic vessels.
Five endothelial cell specific receptor tyrosine kinases, FIt-I (VEGFR-1), KDR/Flk-1 (VEGFR-2), Flt4 (VEGFR-3), Tie, and Tek/Tie-2 have so far been described, which possess the intrinsic tyrosine kinase activity essential for signal transduction.
Targeted mutations inactivating Fit-1, Flk-1, Tie, and Tek in mouse embryos have indicated their essential and specific roles in vasculogene'sis and angiogenesis at the molecular level. VEGFR-1 and VEGFR-2 bind VEGF with high affinity (Kd 16 pM and 760 pM, respectively) and VEGFR-1 also binds the related placenta growth factor (PIGF; Kd about 200 pM). A ligand for Tek is reported in PCT patent publication WO 96/11269.
SUMMARY OF THE INVENTION The present invention provides, in one embodiment a method of isolating cells that co-express CD34 and VEGFR-3, comprising steps of: obtaining a biological sample from a human subject that contains cells; and 4a separating cells that express both CD34 (CD34 and VEGFR-3 (VEGFR-3 from cells that do not co-express CD34 and VEGFR-3 in the sample, thereby isolating CD34+/VEGFR-3 cells from the sample.
The present invention provides, in another embodiment, a composition comprising CD34+/VEGFR-3 cells isolated according to the method of the invention.
The present invention describes a ligand, designated VEGF-C, for the Flt4 receptor 00 tyrosine kinase (VEGFR-3). Thus, the invention describes a purified and isolated O polypeptide which is capable of binding to the Flt4 receptor tyrosine kinase. Preferably, an 0
(N
[R:\LIBW]54200.doc:ake Flt4 ligand of the invention is capable of stimulating tyrosine phosphorylation of Flt4 receptor tyrosine kinase in a host cell expressing the Fit4 receptor tyrosine kinase.
Preferred ligands of the invention are mammalian polypeptides. Highly preferred ligands are human polypeptides. As explained in detail below, dimers and multimers comprising polypeptides of the invention linked to each other or to other polypeptides are specifically contemplated as aspects of the invention.
In one embodiment, an Flt4 ligand polypeptide has a molecular weight of approximately 23 kD as determined by SDS-PAGE under reducing conditions. For example, the invention includes a ligand composed of one or more polypeptides of approximately 23 kD which is purifyable from conditioned media from a PC-3 prostatic adenocarcinoma cell line, the cell line having ATCC Acc. No. CRL 1435. Amino acid sequencing of this PC-3 cell-derived ligand polypeptide revealed that the ligand ,polypeptide comprises an amino terminal amino acid sequence set forth in SEQ ID NO: The present invention also provides a new use for the PC-3 prostatic adenocarcinoma cell line which produces an Flt4 ligand. In a preferred embodiment, the ligand may be purified and isolated directly from the PC-3 cell culture medium.
In a highly preferred embodiment, the ligand polypeptide comprises a fragment of the amino acid sequence shown in SEQ ID NO: 8 which binds with high affinity to the human Flt4 receptor tyrosine kinase. It will be understood that the term "high affinity," in the context of a polypeptide ligand of a receptor tyrosine kinase, typically reflects a binding relationship characterized by sub-nanomolar dissociation constants as reported herein for VEGF-C binding to VEGFR-2 and VEGFR-3, and reported elsewhere in the art for the binding of VEGF, PIGF, PDGF, and other factors to their receptors. Exemplary fragments include: a polypeptide comprising an amino acid sequence set forth in SEQ ID NO: 8 from about residue 112 to about residue 213; a polypeptide comprising an amino acid sequence from about residue 104 to about residue 227 of SEQ ID NO: 8; and a polypeptide comprising an amino acid sequence from about residue 112 to about residue 227 of SEQ ID NO: 8. Other exemplary fragments include polypeptides comprising amino acid sequences of SEQ ID NO: 8 that span, approximately, the following residues: 31-213, 31-227, 32-227, 103-217, 103-225, 104-213, 113-213, 103-227, 113-227, 131-211, 161-211, 103-225, 227-419, 228-419, 31-419, and 1-419, as described in greater detail below.
6 The present invention also provides one or more polypeptide precursors of an Flt4 ligand, wherein one such precursor (designated "prepro-VEGF-C") comprises the complete amino acid sequence (amino acid residues 1 to 419) shown in SEQ ID NO: 8.
Thus, the invention includes a purified and isolated polypeptide having the amino acid sequence of residues 1 to 419 shown in SEQ ID NO: 8. Ligand precursors according to the invention, when expressed in an appropriate host cell, produce, via cleavage, a polypeptide which binds with high affinity to the Flt4 receptor tyrosine kinase. A putative 102 amino acid leader (prepro) peptide has been identified in the amino acid sequence shown in SEQ ID NO: 8. Thus, in a related aspect, the invention includes a purified and isolated polypeptide having the amino acid sequence of residues 103-419 shown in SEQ ID NO: 8.
In one embodiment, an expressed Flt4 ligand polypeptide precursor is proteolytically cleaved upon expression to produce an approximately 23 kD Flt4 ligand polypeptide. Thus, an Flt4 ligand polypeptide is provided which is the cleavage product of the precursor polypeptide shown in SEQ ID NO: 8 and which has a molecular weight of approximately 23 kD under reducing conditions.
Putative VEGF-C precursors/processing products consisting of polypeptides with molecular weights of about 29 and 32 kD also are considered aspects of the invention.
In another embodiment, an expressed Flt4 ligand polypeptide precursor is proteolytically cleaved upon expression to produce an approximately 21 kD VEGF-C polypeptide. Sequence analysis has indicated that an observed 21 kD form has an amino terminus approximately 9 amino acids downstream from the amino terminus of the 23 kD form, suggesting that alternative cleavage sites exist.
From the foregoing, it will be apparent that an aspect of the invention includes a fragment of the purified and isolated polypeptide having the amino acid sequence of residues 1 to 419 shown in SEQ ID NO: 8, the fragment being capable of binding with high affinity to Flt4 receptor tyrosine kinase. Preferred embodiments include fragments having an apparent molecular weight of approximately 21/23 kD and 29/32 kD 3 0 as assessed by SDS-PAGE under reducing conditions. More generally, the invention includes a purified and isolated polypeptide that is a VEGF-C of vertebrate origin, wherein the VEGF-C has a molecular weight of about 21-23 kD, as assessed by SDS-PAGE under 7 reducing conditions, and wherein the VEGF-C is capable of binding to Flt4 receptor tyrosine kinase (VEGFR-3). Vertebrate VEGF-C forms of about 30-32 kD that are capable of binding VEGFR-3 also are intended as an aspect of the invention.
Evidence suggests that the amino acids essential for retaining Flt4 ligand activity are contained within approximately amino acids 103/112-226/227 of SEQ ID NO: 8, and that a carboxy-terminal proteolytic cleavage to produce a mature, naturallyoccurring Flt4 ligand occurs at the approximate position of amino acids 226-227 of SEQ ID NO: 8. Accordingly, a preferred Flt4 ligand comprises approximately amino acids 103- 227 of SEQ ID NO: 8.
VEGF-C mutational analysis described herein indicates that a naturally occurring VEGF-C polypeptide spanning amino acids 103-227 of SEQ ID NO: 8, produced by a natural processing cleavage that defines the C-terminus, exists and is biologically active as an Flt4 ligand. A polypeptide fragment consisting of residues 104- 213 of SEQ ID NO: 8 has been shown to retain VEGF-C biological activity. Additional mutational analyses indicate that a polypeptide spanning only amino acids 113-213 of SEQ ID NO: 8 retains Flt4 ligand activity. Accordingly, preferred polypeptides comprise sequences spanning, approximately, amino acid residues 103-227, 104-213, or 113-213, of SEQ ID NO: 8.
Moreover, sequence comparisons of members of the VEGF family of polypeptides provide an indication that still smaller fragments will retain biological activity, and such smaller fragments are intended as aspects of the invention. In particular, eight highly conserved cysteine residues of the VEGF family of polypeptides define a region from residue 131 to residue 211 of SEQ ID NO: 8 (see Figures 2, 5 10); therefore, a polypeptide spanning from about residue 131 to about residue 211 is expected to retain VEGF-C biological activity. In fact, a polypeptide comprising approximately residues 161-211, which retains an evolutionarily-conserved RCXXCC motif, is postulated to retain VEGF-C activity, and therefore is intended as an aspect of the invention.
In addition to binding Flt4, VEGF-C polypeptides are shown herein to bind and activate KDR/flk-1 receptor tyrosine kinase (VEGFR-2). Thus, the invention includes a purified and isolated polypeptide that is capable of binding to at least one of KDR receptor tyrosine kinase (VEGFR-2) and Flt4 receptor tyrosine kinase (VEGFR-3), the polypeptide comprising a portion of the amino acid sequence in SEQ ID NO: 8 effective to 8 permit such binding. In one preferred embodiment, the portion of the amino acid sequence in SEQ ID NO: 8 is a continuous portion having as its amino terminal residue an amino acid between residues 102 and 161 of SEQ ID NO: 8 and having as its carboxy terminal residue an amino acid between residues 210 and 228 of SEQ ID NO: 8. In a highly preferred embodiment, the portion has, as its amino terminal residue, an amino acid between residues 102 and 131 of SEQ ID NO: 8. In a very highly preferred embodiment, the portion of the amino acid sequence in SEQ ID NO: 8 is a continuous portion having as its amino terminal residue an amino acid between residues 102 and 114 of SEQ ID NO: 8 and having as its carboxy terminal residue an amino acid between residues 212 and 228 of SEQ ID NO: 8. Polypeptides of the invention which bind to and activate a receptor VEGFR-2 or VEGFR-3) are useful for stimulating VEGF-C biological activities that are mediated through the receptor. Polypeptides of the invention which bind to but do not activate a receptor are useful for inhibiting VEGF-C activities mediated through that receptor.
The definition of polypeptides of the invention is intended to include within its scope variants thereof. The polypeptide variants contemplated include purified and isolated polypeptides having amino acid sequences that differ from the exact amino acid sequences of such polypeptides VEGF-C, VEGF-C precursors and VEGF-C fragments) by conservative substitutions, as recognized by those of skill in the art, that are compatible with the retention of at least one VEGF-C biological activity or VEGF-Cinhibitory activity of the polypeptide. The term "variants," when used to refer to polypeptides, also is intended to include polypeptides having amino acid additions, including but not limited to additions of a methionine and/or leader sequence to promote translation and/or secretion; additions of peptide sequences to facilitate purification polyhistidine sequences and/or epitopes for antibody purification); and additions of polypeptide-encoding sequences to produce fusion proteins with VEGF-C. The term "variants" also is intended to include polypeptides having amino acid deletions at the amino terminus, the carboxy terminus, or internally of amino acids that are non-conserved amongst the human, mouse, and quail VEGF-C sequences taught herein, and that are compatible with the retention of the VEGF-C or VEGF-C-inhibitory activity of the polypeptide to which the deletions have been made.
-9 The term "variant" also is intended to include polypeptides having modifications to one or more amino acid residues that are compatible with retaining VEGF-C or VEGF-C inhibitory activity of the polypeptide. Such modifications include glycosylations (identical or different to glycosylations of native VEGF-C); and the addition of other substituents labels, compounds to increase serum half-life polyethylene glycol), and the like.
Additional polypeptides of the invention include certain fragments that have been observed to result from the processing of prepro-VEGF-C into mature VEGF-C. For example, the invention includes a purified and isolated polypeptide having a molecular weight of about 29 kD as assessed by SDS-PAGE under reducing conditions and having an amino acid sequence consisting essentially of a portion of SEQ ID NO: 8 having residue 228 ofSEQ ID NO: 8 as its amino terminal amino acid residue; and a purified and isolated polypeptide having a molecular weight of about 15 kD as assessed by SDS-PAGE under reducing conditions and having an amino acid sequence consisting essentially of a portion of SEQ ID NO: 8 having residue 32 of SEQ ID NO: 8 as its amino terminal amino acid.
residue. Such polypeptides are expected to modulate VEGF-C biological activity through their interactions with VEGF-C receptors and/or interactions with biologically active
VEGF-C.
Some of the conserved cysteine residues in VEGF-C participate in interchain disulfide bonding to make homo- and heterodimers of the various naturally occurring VEGF-C polypeptides. Beyond the preceding considerations, evidence exists that VEGF-C polypeptides lacking interchain disulfide bonds retain VEGF-C biological activity. Consequently, the materials and methods of the invention include all VEGF-C fragments that retain at least one biological activity of VEGF-C, regardless of the presence or absence of interchain disulfide bonds. The invention also includes multimers (including dimers) comprising such fragments linked to each other or to other polypeptides.
Fragment linkage may be by way of covalent bonding disulfide bonding) or noncovalent bonding of polypeptide chains hydrogen bonding, bonding due to stable or induced dipole-dipole interactions, bonding due to hydrophobic or hydrophilic interactions, combinations of these bonding mechanisms, and the like). Thus, the invention includes a purified and isolated polypeptide multimer, wherein at least one monomer thereof is a polypeptide that is capable of binding to VEGFR-2 and/or VEGFR-3, the polypeptide 10 comprising a portion of the amino acid sequence in SEQ ID NO: 8 effective to permit such binding, and wherein the multimer itself is capable of binding to VEGFR-2 and/or VEGFR-3. In a preferred embodiment, the multimer has at least one VEGF-C biological activity as taught herein.
In one embodiment, at least one monomer of the multimer is a polypeptide from another member of the PDGF/VEGF family of proteins, a vascular endothelial growth factor (VEGF) polypeptide, a vascular endothelial growth factor B (VEGF-B) polypeptide, a platelet derived growth factor A (PDGF-A) polypeptide, a platelet derived growth factor B (PDGF-B) polypeptide, a c-fos induced growth factor (FIGF) polypeptide, or a placenta growth factor (PIGF) polypeptide.
In a highly preferred embodiment, the multimer of the invention is a dimer of two monomer polypeptides. For example, the invention includes a dimer wherein each monomer thereof is capable of binding to at least one of VEGFR-2 and VEGFR-3 and has an amino acid sequence comprising a portion of SEQ ID NO: 8 effective to permit such binding. Dimers having covalent attachments and dimers wherein the two monomers are free of covalent attachments to each other are contemplated.
In yet another aspect, the invention includes analogs of the polypeptides of the invention. The term "analog" refers to polypeptides having alterations involving one or more amino acid insertions, internal amino acid deletions, and/or non-conservative amino acid substitutions (replacements). The definition of analog is intended to include within its scope variants of analog polypeptides embodying such alterations. The term "mutant," when used with respect to polypeptides herein, is intended to refer generically to VEGF-C variants, VEGF-C analogs, and variants of VEGF-C analogs. Preferred analogs possess at least 90% amino acid sequence similarity to the native peptide sequence from which the analogs were derived. Highly preferred analogs possess 95%, 96%, 97%, 98%, 99%, or greater amino acid sequence similarity to the native peptide sequence.
For example, in one embodiment, the invention includes a polypeptide analog of a VEGF-C of vertebrate origin that is capable of binding to VEGFR-3 an analog of a vertebrate VEGF-C of about 21-23 kD as assessed by SDS-PAGE under 3 0 reducing conditions), wherein an evolutionarily conserved cysteine residue in the VEGF-C 11 has been deleted or replaced, and wherein the analog is capable of binding to VEGFR-3 and has reduced VEGFR-2 binding affinity relative to the wildtype VEGF-C. For analogs 11 according to this embodiment of the invention, the determination that a residue is "evolutionarily conserved" is made solely by reference to the alignment of human, mouse, and quail VEGF-C sequences provided herein and aligned to show similarity in Fig. The presence of the same residue in all three sequences indicates that the residue is evolutionarily conserved, notwithstanding the fact that VEGF-C from other species may lack the residue. In a preferred embodiment, the conserved cysteine residue corresponds to the cysteine at position 156 of SEQ ID NO: 8. "Correspondence to the cysteine at position 156" is readily determined from an analysis of the vertebrate VEGF-C sequence of interest, since the cysteine at position 156 of SEQ ID NO: 8 (human VEGF-C) falls within an evolutionarily conserved portion of VEGF-C (see Fig. 5, comparing human, mouse, and quail VEGF-C polypeptides). Alignment of human VEGF-C allelic variants, other mammalian VEGF-C polypeptides, and the like with the three VEGF-C forms in Fig. 5 will identify that cysteine which corresponds to the cysteine at position 156 of SEQ ID NO: 8, even if the allelic variant has greater or fewer than exactly 155 residues preceding the cysteine of interest.
In another embodiment, the invention includes a purified polypeptide that is an analog of human VEGF-C and that is capable of binding to at least one of Flt-1 receptor tyrosine kinase (VEGFR-1), KDR receptor tyrosine kinase (VEGFR-2), and Flt4 receptor tyrosine kinase (VEGFR-3).
Specifically contemplated is an analog of human VEGF-C that binds VEGFR-3 but has reduced VEGFR-2 binding affinity, as compared to the VEGFR-2 binding affinity of a wildtype human VEGF-C as compared to the VEGFR-2 binding affinity of a human VEGF-C having an amino acid sequence consisting essentially of amino acids 103-227 of SEQ ID NO: One such family of human VEGF-C analogs are VEGF-C As 6 s polypeptides. By "VEGF-C AC 5 6 polypeptide" is meant an analog wherein the cysteine at position 156 of SEQ ID NO: 8 has been deleted or replaced by another amino acid. A VEGF-C AC,, 5 polypeptide analog can be made from any VEGF-C polypeptide of the invention that comprises all of SEQ ID NO: 8 or a portion thereof that includes position 156 of SEQ ID NO: 8. Preferably, the VEGF-C ACi, 5 polypeptide analog comprises a portion of SEQ ID NO: 8 effective to permit binding to VEGFR-3.
For example, the invention includes a VEGF-C ACts6 polypeptide that binds VEGFR-3, has reduced VEGFR-2 binding affinity, and has an amino acid sequence 12 which includes amino acids 131 to 211 of SEQ ID NO: 8, wherein the cysteine residue at position 156 of SEQ ID NO: 8 has been deleted or replaced. In a preferred embodiment, the VEGF-C AC, 5 polypeptide comprises a continuous portion of SEQ ID NO: 8, the portion having as its amino terminal residue an amino acid between residues 102 and 114 of SEQ ID NO: 8, and having as its carboxy terminal residue an amino acid between residues 212 and 228 of SEQ ID NO: 8, wherein the cysteine residue at position 156 of SEQ ID NO: 8 has been deleted or replaced. In an embodiment exemplified herein, the cysteine residue at position 156 of SEQ ID NO: 8 has been replaced by a serine residue.
A second family of human VEGF-C analogs that bind VEGFR-3 but have reduced VEGFR-2 binding affinity are VEGF-C AR 22 6
AR
22 7 polypeptides. By "VEGF-C
AR
2 26
AR
22 polypeptide" is meant an analog wherein the arginine residues at positions 226 and 227 of SEQ ID NO: 8 have been deleted or replaced by other amino acids, for the purpose of eliminating a proteolytic processing site of the carboxy terminal pro-peptide of VEGF-C. Preferably, the VEGF-C AR 2 26 AR2 7 polypeptide comprises a portion of SEQ ID NO: 8 effective to permit binding of VEGFR-3. For example, the invention includes a VEGF-C AR 2 26
AR
2 27 polypeptide having an amino acid sequence comprising amino acids 112-419 of SEQ ID NO: 8, wherein the arginine residues at positions 226 and 227 of SEQ ID NO: 8 have been deleted or replaced. Specifically exemplified herein is a VEGF-C AR,2 6
AR
22 7 polypeptide wherein the arginine residues at positions 226 and 227 of SEQ ID NO: 8 have been replaced by serine residues.
Another family of VEGF-C analogs of the invention are human VEGF-C~' polypeptides. By "VEGF-Ca"ic polypeptide" is meant a VEGF-C analog wherein at least one amino acid having a basic side chain has been introduced into the VEGF-C coding sequence, to emulate one or more basic residues in VEGF residues Arglo 0 Lysno 0 and His,, 2 in the VEGF165 precursor shown in Fig. 2) that have been implicated in VEGF receptor binding. Preferably, two or three basic residues are introduced into VEGF-C.
Based on the VEGF/VEGF-C polypeptide alignment provided herein, positions 187, 189, and 191 of SEQ ID NO: 8 are preferred positions to introduce basic residues. For example, the invention includes a VEGF-Cbi polypeptide that is capable of binding to at least one of VEGFR-1, VEGFR-2, and VEGFR-3, and that has an amino acid sequence comprising residues 131 to 211 of SEQ ID NO: 8, wherein the glutamic acid residue at position 187, the threonine residue at position 189, and the proline residue at position 191 13 of SEQ ID NO: 8 have been replaced by an arginine residue, a lysine residue, and a histidine residue, respectively.
In yet another aspect of the invention, VEGF-C structural information is employed to create useful analogs of VEGF. For example, mature VEGF-C contains an unpaired cysteine (position 137 of SEQ ID NO: 8) and is able to form non-covalently bonded polypeptide dimers. In one embodiment, a VEGF analog is created wherein this unpaired cysteine residue from mature VEGF-C is introduced at an analogous position of VEGF introduced in place of Leus5 of the human VEGF165 precursor (Fig. 2, Genbank Acc. No. M32977). Such VEGF analogs are termed VEGF^ polypeptides.
Thus, the invention includes a human VEGF analog wherein a cysteine residue is introduced in the VEGF amino acid sequence at a position selected from residues 53 to 63 of the human VEGF165 precursor having the amino acid sequence set forth in SEQ ID NO: 56. At least four naturally occurring VEGF isoforms have been described, and VEGFc y polypeptide analogs of each isoform are contemplated. Most preferably, the cysteine is introduced at a position in a VEGF isoform which corresponds to position 58 of the VEGF165 precursor having the amino acid sequence set forth in SEQ ID NO: 56.
The present invention also provides purified and isolated polynucleotides nucleic acids) encoding all of the polypeptides of the invention, including but not limited to cDNAs and genomic DNAs encoding VEGF-C precursors, VEGF-C, and biologically active fragments thereof, and DNAs encoding VEGF-C variants and VEGF-C analogs. A preferred nucleic acid of the invention comprises a DNA encoding amino acid residues 1 to 419 of SEQ ID NO: 8 or one of the aforementioned fragments or analogs thereof. Due to the degeneracy of the genetic code, numerous such coding sequences are possible, each having in common the coding of the amino'acid sequence shown in SEQ ID NO: 8 or the fragment or analog thereof. Distinct polynucleotides encoding any polypeptide of the invention by virtue of the degeneracy of the genetic code are within the scope of the invention.
A preferred polynucleotide according to the invention comprises the human VEGF-C cDNA sequence set forth in SEQ ID NO: 7 from nucleotide 352 to 1611. Other polynucleotides according to the invention encode a VEGF-C polypeptide from, e.g., mammals other than humans, birds avian quails), and others. Still other 14 polynucleotides of the invention comprise a coding sequence for a VEGF-C fragment, and allelic variants of those DNAs encoding part or all of VEGF-C.
Still other polynucleotides of the invention comprise a coding sequence for a VEGF-C variant or a VEGF-C analog. Preferred variant-encoding and analog-encoding polynucleotides comprise the human, mouse, or quail VEGF-C cDNA sequences disclosed herein nucleotides 352-1611 ofSEQ ID NO: 7 or continuous portions thereof) wherein one or more codon substitutions, deletions, or insertions have been introduced to create the variant/analog-encoding polynucleotide. For example, a preferred polynucleotide encoding a VEGF-C ACs, 5 polypeptide comprises all or a portion of SEQ ID NO: 7 wherein the cysteine codon at positions 817-819 has been replaced by a codon encoding a different amino acid a serine-encoding TCC codon).
The invention further comprises polynucleotides that hybridize to the aforementioned polynucleotides under standard stringent hybridization conditions.
Exemplary stringent hybridization conditions are as follows: hybridization at 42'C in formamide, 5X SSC, 20 mM Na-PO 4 pH 6.8; and washing in 0.2X SSC at 55 0 C. It is understood by those of skill in the art that variation in these conditions occurs based on the length and GC nucleotide content of the sequences to be hybridized. Formulas standard in the art are appropriate for determining appropriate hybridization conditions. See Sambrook et al., Molecular Cloning: A Laboratory Manual (Second ed., Cold Spring Harbor Laboratory Press, 1989) 9.47-9.51. These polynucleotides, capable of hybridizing to polynucleotides encoding VEGF-C, VEGF-C fragments, or VEGF-C analogs, are useful as nucleic acid probes for identifying, purifying and isolating polynucleotides encoding other (non-human) mammalian forms of VEGF-C and human VEGF-C allelic variants. Additionally, these polynucleotides are useful in screening methods of the invention, as described below.
Preferred nucleic acids useful as probes of the invention comprise nucleic acid sequences of at least about 16 continuous nucleotides of SEQ ID NO: 7. More preferably, these nucleic acid probes would have at least about 20 continuous nucleotides found in SEQ ID NO: 7. In using these nucleic acids as probes, it is preferred that the nucleic acids specifically hybridize to a portion of the sequence set forth in SEQ ID NO: 7.
Specific hybridization is herein defined as hybridization under standard stringent hybridization conditions. To identify and isolate other mammalian VEGF-C genes 15 specifically, nucleic acid probes preferably are selected such that they fail to hybridize to genes related to VEGF-C fail to hybridize to human VEGF or to human VEGF-B genes).
Thus, the invention comprehends polynucleotides comprising at least about 16 nucleotides wherein the polynucleotides are capable of specifically hybridizing to a gene encoding VEGF-C, a human gene. The specificity of hybridization ensures that a polynucleotide of the invention is able to hybridize to a nucleic acid encoding a VEGF-C under hybridization conditions that do not support hybridization of the polynucleotide to nucleic acids encoding, VEGF or VEGF-B. In one embodiment, polynucleotides of at least about 16 nucleotides, and preferably at least about 20 nucleotides, are selected as continuous nucleotide sequences found in SEQ ID NO: 7 or the complement of the nucleotide sequence set forth in SEQ ID NO: 7.
In another embodiment, the invention includes polynucleotides having at least 90 percent (preferably at least 95 percent, and more preferably at least 97, 98, or 99 percent) nucleotide sequence identity with a nucleotide sequence encoding a polypeptide of the invention. In a highly preferred embodiment, the polynucleotides have at least percent sequence identity with a nucleotide sequence encoding a human VEGF-C precursor (such as the VEGF-C precursor in SEQ ID NO: 8 and allelic variants thereof), human VEGF-C, or biologically active VEGF-C fragments.
Additional aspects of the invention include vectors which comprise nucleic acids of the invention; and host cells transformed or transfected with nucleic acids or vectors of the invention. Preferred vectors of the invention are expression vectors wherein nucleic acids of the invention are operatively connected to appropriate promoters and other control sequences that regulate transcription and/or subsequent translation, such that appropriate prokaryotic or eukaryotic host cells transformed or transfected with the vectors are capable of expressing the polypeptide encoded thereby the VEGF-C, VEGF-C fragment, VEGF-C variant, or VEGF-C analog encoded thereby). A preferred vector of the invention is plasmid pFLT4-L, having ATCC accession no. 97231. Such vectors and host cells are useful for recombinantly producing polypeptides of the invention, including VEGF-C, and fragments, variants, and analogs thereof.
In a related aspect of the invention, host cells such as procaryotic and eukaryotic cells, especially unicellular host cells, are modified to express polypeptides of 16 the invention. Host cells may be stably transformed or transfected with isolated DNAs of the invention in a manner allowing expression of polypeptides of the invention therein.
Thus, the invention further includes a method of making polypeptides of the invention. In a preferred method, a nucleic acid or vector of the invention is expressed in a host cell, and a polypeptide of the invention is purified from the host cell or the host cell's growth medium.
Similarly, the invention includes a method of making a polypeptide capable of specifically binding to VEGFR-1, VEGFR-2 and/or VEGFR-3, comprising the steps of: transforming or transfecting a host cell with a nucleic acid of the invention; (b) cultivating the host cell to express the nucleic acid; and purifying a polypeptide capable of specifically binding to VEGFR-1, VEGFR-2, and/or VEGFR-3 from the host cell or from the host cell's growth media. The invention also includes purified and isolated polypeptides produced by methods of the invention. In one preferred embodiment, the invention includes a human VEGF-C polypeptide or biologically active fragment, variant, or analog thereof that is substantially free of other human polypeptides.
Alternatively, host cells may be modified by activating an endogenous VEGF-C gene that is not normally expressed in the host cells or that is expressed at a lower rate than is desired. Such host cells are modified by homologous recombination) to express the VEGF-C by replacing, in whole or in part, the naturallyoccurring VEGF-C promoter with part or all of a heterologous promoter so that the host cells express VEGF-C. In such host cells, the heterologous promoter DNA is operatively linked to the VEGF-C coding sequences, controls transcription of the VEGF-C coding sequences. See, for example, PCT International Publication No. WO 94/12650; PCT International Publication No. WO 92/20808; and PCT International Publication No.
WO 91/09955. The invention also contemplates that, in addition to heterologous promoter DNA, amplifiable marker DNA ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydro-orotase) and/or intron DNA may be recombined along with the heterologous promoter DNA into the host cells. If linked to the VEGF-C coding sequences, amplification of the marker DNA by standard selection methods results in co-amplification of the VEGF-C coding sequences in such host cells. Thus, the invention includes, for example, a cell comprising a nucleic acid having a sequence encoding human VEGF-C and 17 further comprising a non-VEGF-C promoter sequence a heterologous promoter sequence) or other non-VEGF-C control sequence that increases RNA transcription in the cell of the sequence encoding human VEGF-C.
The DNA sequence information provided by the present invention also makes possible the development, by homologous recombination or "knockout" strategies [see, Capecchi, Science, 244: 1288-1292 (1989)], of rodents that fail to express functional VEGF-C or that express a VEGF-C fragment, variant, or analog. Such rodents are useful as models for studying the activities of VEGF-C and VEGF-C modulators in vivo.
In another aspect, the invention includes an antibody that specifically binds to one or more polypeptides of the invention, and/or binds to polypeptide multimers of the invention. In the context of antibodies of the invention, the term "specifically binds" is intended to exclude antibodies that cross-react with now-identified, related growth factors, such as VEGF, VEGF-B, PDGF-A, PDGF-B, FIGF, and PIGF. However, due to the high level of amino acid similarity shared by VEGF-C polypeptides of different species, it will be understood that antibodies that specifically bind to human VEGF-C polypeptides of the invention will, in many instances, also bind non-human mouse, quail) VEGF-C polypeptides of the invention. Antibodies, both monoclonal and polyclonal, may be made against a polypeptide of the invention according to standard techniques in the art. See, Harlow and Lane, Antibodies: A Laboratory Manual (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1988)). Standard protein manipulation techniques and recombinant techniques also may be employed to generate humanized antibodies and antigen-binding antibody fragments and other chimeric antibody polypeptides, all of which are considered antibodies of the invention. The invention further includes hybridoma cells that produce antibodies of the invention or other cell types that have been genetically engineered to express antibody polypeptides of the invention.
Antibodies of the invention may be used in diagnostic applications to monitor angiogenesis, vascularization, lymphatic vessels and their disease states, wound healing, or certain tumor cells, hematopoietic, or leukemia cells. The antibodies also may be used to block the ligand from activating its receptors; to purify polypeptides of the invention; and to assay fluids for the presence of polypeptides of the invention. The invention further includes immunological assays (including radio-immuno assays, enzyme linked 18 immunosorbent assays, sandwich assays and the like) which employ antibodies of the invention.
Ligands according to the invention may be labeled with a detectable label and used to identify their corresponding receptors in situ. Labeled Flt4 ligand and anti- Flt4 ligand antibodies may be used as imaging agents in the detection of lymphatic vessels, high endothelial venules and their disease states, and Flt4 receptors expressed in histochemical tissue sections. The ligand or antibody may be covalently or non-covalently coupled to a suitable supermagnetic, paramagnetic, electron dense, echogenic, or radioactive agent for imaging. Other, non-radioactive labels, such as biotin and avidin, may also be used.
A related aspect of the invention is a method for the detection of specific cells, endothelial cells. These cells may be found in vivo, or in ex vivo biological tissue samples. The method of detection comprises the steps of contacting a biological tissue comprising, endothelial cells, with a polypeptide according to the invention which is capable of binding to VEGFR-2 and/or VEGFR-3, under conditions wherein the polypeptide binds to the cells, optionally washing the biological tissue, and detecting the polypeptide bound to the cells in the biological tissue, thereby detecting the cells. It will be apparent that certain polypeptides of the invention are useful for detecting and/or imaging cells that express both VEGFR-2 and VEGFR-3, whereas other polypeptides VEGF-C AC, 56 polypeptides) are useful for imaging specifically those cells which express VEGFR-3.
The many biological activities described herein for VEGF-C (including but not limited to affecting growth and migration of vascular endothelial cells; promoting growth of lymphatic endothelial cells and lymphatic vessels; increasing vascular permeability; and affecting myelopoiesis growth of neutrophilic granulocytes)) support numerous diagnostic and in vitro and in vivo clinical utilities for polypeptides and antibodies of the invention, for modulating (stimulating or inhibiting) these biological activities. Generally, VEGF-C and precursor, fragment, variant, and analog polypeptides that retain one or more VEGF-C biological activities are useful agonists for stimulating the desired biological activity; whereas precursor, fragment, variant, and analog polypeptides that are capable of binding to VEGFR-2 and/or VEGFR-3 (either alone or as a homo- or hetero-dimer with other polypeptides) without stimulating receptor-mediated VEGF-C 19 activity without activating the receptor) are useful as antagonists (inhibitors) of VEGF-C. Similarly, antibodies of the invention that bind biologically active VEGF-C forms and thereby interfere with VEGF-C-receptor interactions are useful as inhibitors of VEGF-C. Antisense oligonucleotides comprising a portion of the VEGF-C coding sequence and/or its complement also are contemplated as inhibitors of the invention. Both biologically active polypeptides and inhibitor polypeptides of the invention have utilities in various imaging applications.
For example, the biological effects of VEGF-C on vascular endothelial cells indicate in vivo uses for polypeptides of the invention for stimulating angiogenesis during wound healing, in tissue transplantation, in eye diseases, in the formation of collateral vessels around arterial stenoses and into injured tissues after infarction) and for inhibiting angiogenesis to inhibit tumor growth and/or metastatic cancer). The biological effects on vascular endothelial cells indicate in vitro uses for biologically active forms of VEGF-C to promote the growth of (including proliferation of) cultured vascular endothelial cells and precursors thereof.
The biological effects of VEGF-C on lymphatic endothelia indicate in vivo uses for polypeptides of the invention for stimulating lymphangiogenesis to promote re-growth or permeability of lymphatic vessels in, for example, organ transplant patients; to mitigate the loss of axillary lymphatic vessels following surgical interventions in the treatment of cancer breast cancer); to treat aplasia of the lymphatic vessels or lymphatic obstructions) and for inhibiting it to treat lymphangiomas). Additional in vivo uses for polypeptides of the invention include the treatment or prevention of inflammation, edema, elephantiasis, and Milroy's disease. The biological effects on lymphatic endothelial cells indicate in vitro uses for biologically active forms of VEGF-C to promote the growth of cultured lymphatic endothelial cells and precursors thereof.
Thus, the invention includes a method of modulating (stimulating/increasing or inhibiting/decreasing) the growth of vertebrate endothelial cells or vertebrate endothelial precursor cells comprising contacting such endothelial cells or precursor cells with a polypeptide or antibody (or antigen-binding portion thereof) of the invention, in an amount effective to modulate the growth of the endothelial or endothelial precursor cells.
Mammalian endothelial cells and their precursors are preferred. Human endothelial cells are highly preferred. In one embodiment, the endothelial cells are lymphatic endothelial 20 cells. In another embodiment, the cells are vascular endothelial cells. The method may be an in vitro method for cultured endothelial cells) or an in vivo method. The in vitro growth modulation of CD34+ endothelial precursor cells [see, Asahara et al., Science, 275:964-967 (1997)] isolated from peripheral blood, bone marrow, or cord blood is specifically contemplated. For in vivo methods, it is highly preferable to administer a pharmaceutical composition (comprising the polypeptide formulated in a pharmaceutically acceptable diluent, adjuvant, excipient, carrier, or the like) to the subject, in an amount effective to modulate the growth of lymphatic endothelial cells in vivo.
In one preferred embodiment, the endothelial cells are lymphatic endothelial cells, and the polypeptide is one that has reduced effect on the permeability of mammalian blood vessels compared to a wildtype VEGF-C polypeptide compared with VEGF-C having an amino acid sequence set forth in SEQ ID NO: 8 from residue 103 to residue 227). VEGF-C ACm 1 polypeptides are contemplated for use in this embodiment.
In modulating the growth of endothelial cells in vivo, the invention contemplates the modulation of endothelial cell-related disorders. Endothelial cell disorders contemplated by the invention include, but are not limited to, physical loss of lymphatic vessels surgical removal of axillary lymph tissue), lymphatic vessel occlusion elephantiasis), and lymphangiomas. In a preferred embodiment, the subject, and endothelial cells, are human. The endothelial cells may be provided in vitro or in vivo, and they may be contained in a tissue graft. An effective amount of a polypeptide is defined herein as that amount of polypeptide empirically determined to be necessary to achieve a reproducible change in cell growth rate (as determined by microscopic or macroscopic visualization and estimation of cell doubling time, or nucleic acid synthesis assays), as would be understood by one of ordinary skill in the art.
Polypeptides of the invention may be used to stimulate lymphocyte production and maturation, and to promote or inhibit trafficking of leukocytes between tissues and lymphatic vessels or to affect migration in and out of the thymus.
The biological effects of VEGF-C on myelopoiesis indicate in vivo and in vitro uses for polypeptides of the invention for stimulating myelopoiesis (especially growth of neutrophilic granuloctyes) or inhibiting it. Thus, the invention includes a method for modulating myelopoiesis in a mammalian subject comprising administering to a mammalian subject in need of modulation of myelopoiesis an amount of a polypeptide or antibody (or 21 antigen-binding portion thereof) of the invention that is effective to modulate myelopoiesis. In one embodiment, a mammalian subject suffering from granulocytopenia is selected, and the method comprises administering to the subject an amount of a polypeptide effective to stimulate myelopoiesis. In particular, a polypeptide of the invention is administered in an amount effective to increase the neutrophil count in blood of the subject. Preferred subjects are human subjects. An effective amount of a polypeptide is an amount of polypeptide empirically determined to be necessary to achieve a reproducible change in the production of neutrophilic granulocytes (as determined by microscopic or macroscopic visualization and estimation of cell doubling time, or nucleic acid synthesis assays), as would be understood by one of ordinary skill in the art.
In a related embodiment, the invention includes a method of increasing the number of neutrophils in the blood of a mammalian subject comprising the step of expressing in a cell in a subject in need of an increased number of blood neutrophils a DNA encoding a VEGF-C protein, the DNA operatively linked to a non-VEGF-C promoter or other non-VEGF-C control sequence that promotes expression of the DNA in the cell.
Similarly, the invention includes a method of modulating the growth of neutrophilic granulocytes in vitro or in vivo comprising the step of contacting mammalian stem cells with a polypeptide or antibody of the invention in an amount effective to modulate the growth of mammalian endothelial cells.
More generally, the invention includes a method for modulating the growth of CD34+ progenitor cells (especially hematopoietic progenitor cells and endothelial progenitor cells) in vitro or in vivo comprising the step of contacting mammalian CD34+ progenitor cells with a polypeptide or antibody of the invention in an amount effective to modulate the growth of mammalian endothelial cells. For in vitro methods, CD34+ progenitor cells isolated from cord blood or bone marrow are specifically contemplated.
It will be apparent from the Detailed Description below that in vitro and in vivo methods of the invention for stimulating the growth of CD34+ precursor cells also include methods wherein polypeptides of the invention are employed together (simultaneously or sequentially) with other polypeptide factors for the purpose of modulating hematopoiesis/myelopoiesis or endothelial cell proliferation. Such other factors include, but are not limited to colony stimulating factors ("CSFs," e.g., 22 granulocyte-CSF (G-CSF), macrophage-CSF (M-CSF), and granulocyte-macrophage-CSF (GM-CSF)), interleukin-3 (IL-3, also called multi-colony stimulating factor), other interleukins, stem cell factor (SCF), other polypeptide factors, such as VEGF, and their analogs that have been described and are known in the art. See generally The Cytokine Handbook, Second Ed., Angus Thomson (editor), Academic Press (1996); Callard and Gearing, The Cytokine FactsBook, Academic Press Inc. (1994); and Cowling and Dexter, TIBTECH, 10(10):349-357 (1992). The use of a polypeptide of the invention as a progenitor cell or myelopoietic cell growth factor or co-factor with one or more of the foregoing factors may potentiate previously unattainable myelopoietic effects and/or potentiate previously attainable myelopoietic effects while using less of the foregoing factors than would be necessary in the absence of a polypeptide of the invention.
In addition to methods, the invention includes compositions comprising polypeptides of the invention in admixture with one or more of the factors identified in the previous paragraph. Preferred compositions further comprise a pharmaceutically acceptable diluent, adjuvant, excipient, or carrier. The invention also includes kits comprising at least one polypeptide of the invention packaged with one or more of the foregoing polypeptides in unit dosage form, but not in admixture with each other).
For methods which involve the in vivo administration of polypeptides or antibodies of the invention, it is contemplated that the polypeptides or antibodies will be administered in any suitable manner using an appropriate pharmaceutically-acceptable vehicle, a pharmaceutically-acceptable diluent, adjuvant, excipient or carrier. Thus, the invention further includes compositions, pharmaceutical compositions, comprising one or more polypeptides or antibodies of the invention. By pharmaceutical composition is meant a composition that may be administered to a mammalian host, orally, topically, parenterally (including subcutaneous injections, intravenous, intramuscular, intracisternal injection or infusion techniques), by inhalation spray, or rectally, in unit dosage formulations containing conventional non-toxic carriers, diluents calcium carbonate, sodium carbonate, lactose, calcium phosphate, sodium phosphate, kaolin, water), adjuvants, vehicles, and the like, including but not limited to flavoring agents, preserving agents; granulating and disintegrating agents; binding agents; time delay 23 materials; oils; suspending agents; dispersing or wetting agents; anti-oxidants; emulsifiers, etc.
The invention further provides a method of using a polypeptide of the invention for the manufacture of a medicament for use in any of the foregoing methods.
Similarly, the invention further provides a method of using a polypeptide of the invention for the manufacture of a medicament for the treatment of any of the foregoing indicated conditions and disease states. Such methods optionally involve the use of additional biologically active ingredients VEGF, PIGF, G-CSF, etc.) for the manufacture of the medicament.
Effective amounts of polypeptides for the foregoing methods are empirically determined using standard in vitro and in vivo dose-response assays. In addition, experimental data provided herein provide guidance as to amounts of polypeptides of the invention that are effective for achieving a desired biological response.
For example, the dissociation constants determined for one form of mature VEGF-C (KD=13 5 pM for VEGFR-3 and KD=410 pM for VEGFR-2) provide an indication as to the concentration of VEGF-C necessary to achieve biological effects, because such dissociation constants represent concentrations at which half of the VEGF-C polypeptide is bound to the receptors through which VEGF-C biological effects are mediated. Results from in vivo Miles assays, wherein 0 8 picomoles of VEGF-C was injected intradermally, provide an indication that picomole quantities of mature VEGF-C are sufficient to induce localized biological effects. In vitro analysis of 3 H-thymidine incorporation into bovine capillary endothelial cells treated with a mature VEGF-C form showed increasing VEGF-C effects on cell proliferation at concentrations of 10 1000 pM. Collectively, this data suggests that localized concentrations of 100 1000 pM of fully-processed VEGF-C have VEGF-C biological activity in vivo. Effective concentrations of other polypeptides of the invention are generally expected to correlate with the dissociation constant of the polypeptides for the relevant receptors. Pharmacokinetic and pharmacological analyses reveals the preferred dosages, dosage formulations, and methods of administration to achieve the desired local or systemic concentration of a polypeptide of the invention.
Polypeptides of the invention also may be used to quantify future metastatic risk by assaying biopsy material for the presence of active receptors or ligands in a binding assay. Such a binding assay may involve the use of a detectably labeled polypeptide of the 24 invention or of an unlabeled polypeptide in conjunction with a labeled antibody, for example. Kits comprising such substances are included within the scope of the invention.
The present invention also provides methods for using the claimed nucleic acids polynucleotides) in screening for endothelial cell disorders. In a preferred embodiment, the invention provides a method for screening an endothelial cell disorder in a mammalian subject comprising the steps of providing a sample of endothelial cell nucleic acids from the subject, contacting the sample of endothelial cell nucleic acids with a polynucleotide of the invention which is capable of hybridizing to a gene encoding VEGF- C (and preferably capable of hybridizing to VEGF-C mRNA), determining the level of hybridization between the endothelial cell nucleic acids and the polynucleotide, and correlating the level of hybridization with a disorder. A preferred mammalian subject, and source of endothelial cell nucleic acids, is a human. The disorders contemplated by the method of screening with polynucleotides include, but are not limited to, vessel disorders such as the aforementioned lymphatic vessel disorders, and hypoxia.
Purified and isolated polynucleotides encoding other (non-human) VEGF-C forms also are aspects of the invention, as are the polypeptides encoded thereby, and antibodies that bind to non-human VEGF-C forms. Preferred non-human forms of VEGF- C are forms derived from other vertebrate species, including avian and mammalian species.
Mammalian forms are highly preferred. Thus, the invention includes a purified and isolated mammalian VEGF-C polypeptide, and also a purified and isolated polynucleotide encoding such a polypeptide.
In one embodiment, the invention includes a purified and isolated polypeptide having the amino acid sequence of residues 1 to 415 of SEQ ID NO: 11, which sequence corresponds to a putative mouse VEGF-C precursor. The putative mouse VEGF-C precursor is believed to be processed into a mature mouse VEGF-C in a manner analogous to the processing of the human prepro-polypeptide. Thus, in a related aspect, the invention includes a purified and isolated polypeptide capable of binding with high affinity to an Flt4 receptor tyrosine kinase a human or mouse Flt-4 receptor tyrosine kinase), the polypeptide comprising a fragment of the purified and isolated polypeptide having the amino acid sequence of residues 1 to 415 of SEQ ID NO: 11, the fragment being capable of binding with high affinity to the Flt4 receptor tyrosine kinase. The invention further includes multimers of the foregoing polypeptides and purified and 25 isolated nucleic acids encoding the foregoing polypeptides, such as a nucleic acid comprising all or a portion of the sequence shown in SEQ ID NO: In another embodiment, the invention includes a purified and isolated quail VEGF-C polypeptide, biologically active fragments and multimers thereof, and polynucleotides encoding the foregoing polypeptides.
It is also contemplated that VEGF-C polypeptides from other species may be altered in the manner described herein with respect to human VEGF-C variants, in order to alter biological properties of the wildtype protein. For example, elimination of the cysteine at position 152 of SEQ ID NO: 11 or position 155 of SEQ ID NO: 13 is expected to alter VEGFR-2 binding properties in the manner described below for human VEGF-C
AC,
5 6 mutants.
In yet another embodiment, the invention includes a DNA comprising a VEGF-C promoter, that is capable of promoting expression of a VEGF-C gene or another operatively-linked, protein-encoding gene in native host cells, under conditions wherein VEGF-C is expressed in such cells. Thus, the invention includes a purified nucleic acid comprising a VEGF-C promoter sequence. Genomic clone lambda 5 described herein comprises more than 5 kb of human genomic DNA upstream of the VEGF-C translation initiation codon, and contains promoter DNA of the invention. Approximately 2.4 kb of this upstream sequence is set forth in SEQ ID NO: 48. Thus, in one embodiment, the invention includes a purified nucleic acid comprising a portion of SEQ ID NO: 48, wherein the portion is capable of promoting expression of a protein encoding gene operatively linked thereto under conditions wherein VEGF-C is expressed in native host cells.
Similarly, the invention includes a chimeric nucleic acid comprising a VEGF-C promoter nucleic acid according to the invention operatively connected to a sequence encoding a 2 5 protein other than a human VEGF-C.
Additional aspects and embodiments of the invention will be apparent from the detailed description which follows.
BRIEF DESCRIPTION OF THE DRAWING Figure 1 schematically depicts major endothelial cell receptor tyrosine 3 0 kinases and growth factors involved in vasculogenesis and angiogenesis. Major structural domains are depicted, including immunoglobulin-like domains (IGH), epidermal growth 26 factor homology domains (EGFH), fibronectin type III domains (FNIII), transmembrane (TM) and juxtamembrane (JM) domains, tyrosine kinase (TK1, TK2) domains, kinase insert domains and carboxy-terminal domains (CT).
Figure 2 shows a comparison of the deduced amino acid sequences of PDGF-A (SEQ ID NO: 53), PDGF-B (SEQ ID NO: 54), PIGF-1 (SEQ ID NO:
VEGF-B,
6 (SEQ ID NO: 56), VEGFI65 (SEQ ID NO: 57), and Fit4 ligand (VEGF-C, (SEQ ID NO: Figure 3 schematically depicts the VEGF-C promoter-reporter constructs and their activities in transfected HeLa cells. A restriction map of a portion of a genomic clone that includes the VEGF-C initiation codon and about 6 kb of upstream sequence is depicted above the constructs. Constructs were made linking putative VEGF-C promoter to the Luciferase reporter gene in pGL3 vector (Promega) and introduced into HeLa cells by calcium phosphate-mediated transfection method. The Luciferase activity obtained was compared to the level using the promoterless pGL3basic construct to obtain a measure of relative promoter activity. Luciferase activity is expressed graphically as a ratio of activity of the constructs versus this control. Also shown are numerical ratios of Luciferase activity in experiments where the constructs were transfected into HeLa cells and cells were starved 24 hours followed by serum stimulation for four hours (Luciferase activity is expressed as a ratio of activity in serum-stimulated versus serum-starved cells).
Figures 4A-4B graphically depict the results of a competitive binding assay.
The ability ofVEGF165 (filled triangles: wildtype VEGF-C (filled circles: and three VEGF-C mutants [VEGF-C R226,227S (open boxes: VEGF-C ANACHis (open circles: and VEGF-C ANACHisC156S (open triangles: to compete with VEGF-CANACHis for binding to VEGFR-2 (Fig. 4B) and VEGFR-3 (Fig. 4A) is shown.
Figure 5 depicts the amino acid sequences of human (SEQ ID NO: 8), murine (SEQ ID NO: 11), and quail (SEQ ID NO: 13) VEGF-C polypeptides, aligned to show similarity. Residues conserved in all three species are depicted in bold.
Figures 6A-C depict electrophoretic fractionations of the various forms of recombinant VEGF-C produced by transfected 293 EBNA cells. Figure 6B depicts the electrophoretic fractionation, under non-reducing conditions, of polypeptides produced from mock transfected cells, cells transfected with wild type (wt) VEGF-C cDNA, and cells transfected with a cDNA encoding the VEGF-C mutant VEGF-C-R102S. Each 27 of the bands identified in Figure 6B was excised and electrophoretically fractionated in a separate lane under reducing conditions. Fractionation of bands corresponding to wt VEGF-C are depicted in Figure 6A; fractionation of bands corresponding to the R102S mutant are depicted in Figure 6C.
Figures 7A-B depict the forms and sizes of wild type and mutant recombinant VEGF-Cs, as revealed by non-reducing gel electrophoresis. Figure 7A shows the VEGF-C forms secreted into the media; Figure 7B shows the VEGF-C forms retained by the cells. Mock transfected cells served as a control.
Figures 8A-B present a comparison of the pattern of immunoprecipitated, labeled VEGF-C forms using antisera 882 and antisera 905. Adjacent lanes contain immunoprecipitates that were (lanes marked or were not (lanes marked subjected to reduction and alkylation.
Fig. 9 is a schematic model of the proteolytic processing of VEGF-C. The regions of the VEGF-C polypeptide are depicted as follows: signal sequence dark shaded box; VEGF-homology domain medium shaded box; N-terminal and C-terminal propeptides dotted and open boxes, respectively. Conserved cysteine residues in the VEGF-homology domain are depicted with dots (for clarity, cysteine residues in the Cterminal propeptide are not marked). Putative sites of N-linked glycosylation are shown with Y symbols. Numbers indicate approximate molecular mass (kDa) of the corresponding polypeptide as measured by SDS-PAGE in reducing conditions. Disulfide bonds are marked as non-covalent bonds are depicted as dotted lines. A question mark indicates the presence of a possible non-covalent bond. The proteolytic generation of a small fraction of disulfide-linked 21 kDa forms is not indicated in the figure. Several intermediate forms also are omitted to simplify the scheme. Particularly, only one precursor polypeptide is cleaved initially. The figure is not intended to suggest that other intermediate forms, for example 21 kDa 31 kDa, 31 kDa 31 kDa 29 kDa, do not exist.
Figure 10 presents a comparison of the human and mouse VEGF-C amino acid sequences. The amino acid sequence of mouse VEGF-C is presented on the top line 3 0 and differences in the human sequence are marked below it. An arrow indicates the putative cleavage site for the signal peptidase; BR3P motifs, as well as a CR/SC motif, are 28 boxed; and conserved cysteine residues are marked in bold above the sequence. Arginine residue 158 is also marked in bold. The numbering refers to mouse VEGF-C residues.
Figures 11 A and 11B depict the genomic structure of the human (11 IA) and murine (11B) VEGF-C genes. Sequences of exon-intron junctions are depicted together with exon and intron lengths. Intron sequences are depicted in lower case letters.
Nucleotides of the open reading frame observed in VEGF-C cDNAs are indicated as upper case letters in triplets (corresponding to the codons encoded at the junctions).
Figure 12 depicts the exon-intron organization of the human VEGF-C gene. Seven exons are depicted as open boxes, with exon size depicted in base pairs.
Introns are depicted as lines, with intron size (base pairs) depicted above the lines. 5' and 3' untranslated sequences of a putative 2.4 kb mature mRNA are depicted as shaded boxes.
The location of genomic clones used to characterize the VEGF-C gene are depicted below the map of the gene.
DETAILED DESCRIPTION OF THE INVENTION Described herein is the isolation of a novel vascular endothelial growth factor and the cloning of a DNA encoding this novel growth factor from a cDNA library prepared from the human prostatic adenocarcinoma cell line PC-3. The isolated cDNA encodes a protein which is proteolytically processed and secreted to cell culture medium.
The processing is described in detail below. The secreted protein, designated VEGF-C, binds to the extracellular domain and induces tyrosine autophosphorylation of both Flt4 (VEGFR-3) and KDR/flk-1 (VEGFR-2). In contrast, neither VEGF nor PlGF show high affinity binding to VEGFR-3 or induced its autophosphorylation. VEGF-C also stimulates the migration of endothelial cells in collagen gel and induces vascular permeability in vivo. In transgenic mice, VEGF-C induces proliferation of the lymphatic endothelium and an causes an increase in neutrophilic granulocytes. Based on studies of VEGF-C variants and analogs and studies of VEGF precursors, it is anticipated that one or more VEGF-C precursors (the largest putative native VEGF-C precursor, excluding signal peptide, having the complete amino acid sequence from residue 32 to residue 419 of SEQ ID NO: 8) is capable of stimulating VEGFR-3.
In addition to providing a cDNA sequence encoding prepro-VEGF-C, the present application also provides significant guidance concerning portions of the VEGF-C 29 amino acid sequence which are necessary for biological activity and portions (of one or more amino acids) which, when altered, will modulate (up-regulate or inhibit) VEGF-C biological activities. Such alterations are readily achieved through recombinant DNA and protein techniques, such as site-directed mutagenesis of a VEGF-C encoding cDNA and recombinant expression of the resultant modified cDNA. The skilled artisan also understands that, in recombinant production of proteins, additional sequence may be expressed along with a sequence encoding a polypeptide having a desired biological activity, while retaining a desired biological activity of the protein. For example, additional amino acids may be added at the amino terminus, at the carboxy-terminus, or as an insertion into the polypeptide sequence. Similarly, deletion variants of a protein with a desired biological activity can be recombinantly expressed that lack certain residues of the endogenous/natural protein, while retaining a desired biological activity. Moreover, it is well-known that recombinant protein variants may be produced having conservative amino acid replacements (including but not limited to substitution of one or more amino acids for other amino acids having similar chemical side-chains (acidic, basic, aliphatic, aliphatic hydroxyl, aromatic, amide, etc.)) which do not eliminate the desired biological activity of the protein. Accordingly, it is anticipated that such alterations of VEGF-C are VEGF-C equivalents within the scope of the invention.
As set forth in greater detail below, the putative prepro-VEGF-C has a deduced molecular mass of 46,883; a putative prepro-VEGF-C processing intermediate has an observed molecular weight of about 32 kD; and mature VEGF-C isolated from conditioned media has a molecular weight of about 23 kD as assessed by SDS-PAGE under reducing conditions. A major part of the difference in the observed molecular mass of the purified and recombinant VEGF-C and the deduced molecular mass of the prepro- VEGF-C encoded by the VEGF-C open reading frame (ORF) is attributable to proteolytic removal of sequences at the amino-terminal and carboxyl-terminal regions of the prepro- VEGF-C polypeptide. Extrapolation from studies of the structure of PDGF (Heldin et al., Growth Factors, 8:245-52 (1993)) suggests that the region critical for receptor binding and activation by VEGF-C is contained within amino acids residues 104-213, which are found in the secreted form of the VEGF-C protein the form lacking the putative prepro leader sequence and some carboxyterminal sequences). The 23 kD polypeptide binding VEGFR-3 corresponds to a VEGF-homologous domain of VEGF-C. After 30 biosynthesis, the nascent VEGF-C polypeptide may be glycosylated at three putative Nlinked glycosylation sites identified in the deduced VEGF-C amino acid sequence.
Polypeptides containing modifications, such as N-linked glycosylations, are intended as aspects of the invention.
The carboxyl terminal amino acid sequences, which increase the length of the VEGF-C polypeptide in comparison with other ligands of this family, show a pattern of spacing of cysteine residues reminiscent of the Balbiani ring 3 protein (BR3P) sequence (Dignam et al., Gene, 88:133-40 (1990); Paulsson et al., J. Mol. Biol., 211:331-49 (1990)). This novel C-terminal silk protein-like structural motif of VEGF-C may fold into an independent domain, which is cleaved off after biosynthesis. Interestingly, at least one cysteine motif of the BR3P type is also found in the carboxyl terminus of VEGF. As explained in detail below, putative precursors and putative fully-processed VEGF-C were both detected in the cell culture media, suggesting cleavage by cellular proteases. The determination of amino-terminal and carboxy-terminal sequences of VEGF-C isolates was performed to identify the proteolytic processing sites. Antibodies generated against different parts of the pro-VEGF-C molecule were used to determine the precursor-product relationship and ratio, their cellular distribution, and the kinetics of processing and secretion.
VEGF-C has a conserved pattern of eight cysteine residues, which may participate in the formation of intra- and interchain disulfide bonds, creating an antiparallel, dimeric, biologically active molecule, similar to PDGF. Mutational analysis of the cysteine residues involved in the interchain disulfide bridges has shown that, in contrast to PDGF, VEGF dimers need to be held together by these covalent interactions in order to maintain biological activity. Disulfide linking of the VEGF-C polypeptide chains was evident in the analysis of VEGF-C in nonreducing conditions, although recombinant protein also contained "fully processed" ligand-active VEGF-C forms which lacked disulfide bonds between the polypeptides. (See Fig. 9.) VEGFR-3, which distinguishes between VEGF and VEGF-C, is closely related in structure to VEGFR-1 and VEGFR-2. Finnerty et al., Oncogene, 8:2293-98 (1993); Galland et al., Oncogene, 8:1233-40 (1993); Pajusola et al., Cancer Res., 52:5738-43 (1992). Besides VEGFR-3, VEGFR-2 tyrosine kinase also is activated in response to VEGF-C. VEGFR-2 mediated signals cause striking changes in the 31 morphology, actin reorganization and membrane ruffling of porcine aortic endothelial cells over-expressing this receptor. In these cells, VEGFR-2 also mediated ligand-induced chemotaxis and mitogenicity. Waltenberger et al., J. Biol. Chem., 269:26988-95 (1994).
Similarly, the receptor chimera CSF- R/VEGFR-3 was mitogenic when ectopically expressed in NIH 3T3 fibroblastic cells, but not in porcine aortic endothelial cells (Pajusola et al., 1994). Consistent with such results, the bovine capillary endothelial (BCE) cells, which express VEGFR-2 mRNA but very little or no VEGFR-1 or VEGFR-3 mRNAs, showed enhanced migration when stimulated with VEGF-C. Light microscopy of the BCE cell cultures in collagen gel also suggested that VEGF-C stimulated the proliferation of these cells. The data thus indicate that the VEGF family of ligands and receptors show a great specificity in their signaling, which may be cell-type-dependent.
The expression pattern of the VEGFR-3 (Kaipainen et al., Proc. Natl.
Acad. Sci. (USA), 92:3566-70 (1995)) suggests that VEGF-C may function in the formation of the venous and lymphatic vascular systems during embryogenesis.
Constitutive expression of VEGF-C in adult tissues shown herein further suggests that this gene product also is involved in the maintenance of the differentiated functions of the lymphatic and certain venous endothelia where VEGFR-3 is expressed (Kaipainen et al., 1995). Lymphatic capillaries do not have well-formed basal laminae and an interesting possibility exists that the silk-like BR3P motif is involved in producing a supramolecular structure which could regulate the availability of VEGF-C in tissues. However, as shown here, VEGF-C also activates VEGFR-2, which is abundant in proliferating endothelial cells of vascular sprouts and branching vessels of embryonic tissues, but not so abundant in adult tissues. Millauer et al., Nature, 367:576-78 (1993). These data have suggested that VEGFR-2 is a major regulator of vasculogenesis and angiogenesis. VEGF-C may thus have a unique effect on lymphatic endothelium and a more redundant function, shared with VEGF, in angiogenesis and possibly in regulating the permeability of several types of endothelia. Because VEGF-C stimulates VEGFR-2 and promotes endothelial migration, VEGF-C may be useful as an inducer of angiogenesis of blood and lymphatic vessels in wound healing, in tissue transplantation, in eye diseases, and in the formation of collateral 3 0 vessels around arterial stenoses and into injured tissues after infarction.
Previously-identified growth factors that promote angiogenesis include the fibroblast growth factors, hepatocyte growth factor/scatter factor, PDGF and TGF-a.
32 (See Folkman, Nature Med., 1:27-31 (1995); Friesel et al., FASEB 9:919-25 (1995); Mustonen et al., J. Cell. Biol., 129:895-98 (1995). However, VEGF has been the only growth factor relatively specific for endothelial cells. The newly identified factors VEGF-B [Olofsson et al., Proc. Natl. Acad. Sci., 93:2578-81 (1996)] and VEGF-C thus increase our understanding of the complexity of the specific and redundant positive signals for endothelial cells involved in vasculogenesis, angiogenesis, permeability, and perhaps also other endothelial functions. Expression studies using Northern blotting show abundant VEGF-C expression in heart and skeletal muscle; other tissues, such as placenta, ovary, small intestine, thyroid gland, kidney, prostate, spleen, testis and large intestine also express this gene. Whereas P1GF is predominantly expressed in the placenta, the expression patterns of VEGF, VEGF-B and VEGF-C overlap in many tissues, which suggests that members of the VEGF family may form heterodimers and interact to exert their physiological functions.
Targeted mutagenesis leading to inactivation of the VEGF receptor loci in the mouse genome has shown that VEGFR-1 is necessary for the proper organization of endothelial cells forming the vascular endothelium, while VEGFR-2 is necessary for the generation of both endothelial and hematopoietic cells. This suggests that the four genes of the VEGF family can be targets for mutations leading to vascular malformations or cardiovascular diseases.
The following Examples illustrate preferred embodiments of the invention, wherein the isolation, characterization, and function of VEGF-C, VEGF-C variants and analogs, VEGF-C encoding nucleic acids, and anti-VEGF-C antibodies according to the invention are shown.
Example 1 Production of pLTRFlt41 expression vector The identification and isolation of two forms of Flt4 receptor tyrosine kinase (VEGFR-3) cDNA (Flt4 short form (Flt4s), Genbank Accession No. X68203, SEQ ID NO: 1; and Flt4 long form, (Flt41), Genbank Accession Nos. X68203 and S66407, SEQ ID NO:2) was reported in United States Patent Application Serial Number 08/340,011, filed November 14, 1994 Patent No. 5,776,755), incorporated by reference herein. An Flt4 expression vector designated pLTRFlt41 (encoding the long form of Flt4) was constructed using the pLTRpoly expression vector reported in Makela et al., Gene, 118: 293-294 (1992) (Genbank accession number X60280, SEQ ID NO:3) and the Flt4 cDNAs, in the manner described in commonly-owned PCT patent application PCT/FI96/00427, filed August 01, 1996, published as PCT publication No. WO 97/05250 [R:\LIBZZ]05153.doc:mrr on 13 February 1997, and commonly-owned United States Patent Application Serial Nos.
08/671,573, filed June 28, 1996; 08/601,132, filed February 14, 1996 Patent No.
6,403,088); 08/585,895, filed January 12, 1996 Patent No. 6,245,530); and 08/510,133, filed August 1, 1995 Patent No. 6,221,839), all of which are incorporated by reference in their entirety.
EXAMPLE 2 Production and analysis of Flt4I transfected cells NIH 3T3 cells (60% confluent) were co-transfected with 5 micrograms of the pLTRFlt41 construct and 0.25 micrograms of the pSV2neo vector containing the neomycin phosphotransferase gene (Southern et al., J. Mol. Appl. Genet., 1:327 (1982)), using the DOTAP liposome-based transfection reagents (Boehringer-Mannheim, Mannheim, Germany). One day after transfection, the cells were transferred into selection media containing 0.5 mg/ml geneticin (GIBCO, Grand Island, Colonies of geneticin-resistant cells were isolated and analyzed for expression of the Flt4 proteins.
Cells were lysed in boiling lysis buffer containing 3.3% SDS and 125 mM Tris, pH 6.8.
Protein concentrations of the samples were measured by the BCA method (Pierce, Rockford, IL). About 50 micrograms of protein from each lysate were analyzed for the presence of Flt4 by 6% SDS-polyacrylamide gel electrophoresis (SDS-PAGE) and immunoblotting using antisera against the carboxyl terminus of Flt4. Signals on Western blots were revealed using the ECL method (Amersham).
For production of anti-Flt4 antiserum, the Flt4 cDNA fragment encoding the carboxy-terminal amino acid residues of the Flt4 short form: NH2-PMTPTTYKG SVDNQTDSGM VLASEEFEQI ESRHRQESGFR-COOH (SEQ ID NO:4) was cloned as a 657 bp EcoRI-fragment into the pGEX-IXT bacterial expression vector (Pharmacia- LKB, Inc., Uppsala, Sweden) in frame with the glutathione-S-transferase coding region.
The resultant GST-Flt4S fusion protein was produced in E. coli and purified by affinity chromatography using a glutathione-Sepharose 4B column. The purified protein was lyophilized, dissolved in phosphate-buffered saline (PBS), mixed with Freund's adjuvant and used for immunisation of rabbits at bi-weekly intervals using methods standard in the art (Harlow et al., Antibodies: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1988)). Antisera were used, after the fourth booster immunization, for immunoprecipitation of Flt4 from transfected cells. Cells clones expressing Flt4 were also used for ligand stimulation analysis.
[R:\LIBZZ]05153.doc:mrr 34 EXAMPLE 3 Construction of a FIt4 EC baculovirus vector and expression and purification of its product Using the pVTBac plasmid described in Tessier et al., Gene 98:177-183 (1991), and the Flt4 cDNAs described in Example 1, a baculovirus expression vector was constructed to facilitate expression of the extracellular domain of Flt4 (Flt4 EC), as described in commonly-owned PCT patent application PCT/FI96/00427, filed August 01, 1996, published as PCT publication No. WO 97/05250 on 13 February 1997, and commonly-owned United States Patent Application Serial Nos. 08/671,573, filed June 28, 1996; 08/601,132, filed February 14, 1996 Patent No. 6,403,088); 08/585,895, filed January 12, 1996 Patent No. 6,245,530); and 08/510,133, filed August 1, 1995 (U.S.
Patent No. 6,221,839), all of which are incorporated by reference herein. A nucleotide sequence encoding a 6xHis tag was operatively connected to the Flt4 EC coding sequence to facilitate purification.
The Flt4EC construct was transfected together with baculovirus genomic DNA into SF-9 cells by lipofection. Recombinant virus was purified, amplified and used for infection of High-Five cells (Invitrogen, San Diego, CA) using methods standard in the art. The Flt4 extracellular domain (Flt4EC) was purified from the culture medium of the infected High-Five cells using Ni-NTA affinity chromatography according to manufacturer's instructions (Qiagen) for binding and elution of the 6xHis tag encoded in the COOH-terminus of the recombinant Flt4 extracellular domain.
EXAMPLE 4 Isolation of an Flt4 Ligand from Conditioned Media A human Flt4 ligand according to the invention was isolated from media conditioned by a PC-3 prostatic adenocarcinoma cell line (ATCC CRL 1435) in serumfree Ham's F-12 Nutrient mixture (GIBCO) (containing 7% fetal calf serum (FCS)).
Cells were reseeded and grown in this medium, which was subsequently changed to serum-free medium. The preparation of the conditioned media, and the identification of a component therein which stimulated Flt4 tyrosine phosphorylation, are described in detail in commonly-owned PCT patent application PCT/FI96/00427, filed August 01, 1996, and commonly-owned United States Patent Application Serial Nos. 08/671,573, filed June 28, 1996; 08/601,132, filed February 14, 1996 Patent No. 6,403,088); 08/585,895, filed January 12, 1996 Patent No. 6,245,530); 08/510,133, filed August 1, 1995 (U.S.
Patent No. 6,221,839); and 08/340,011, filed November 14, 1994 Patent No.
5,776,755), all of which are incorporated by reference herein in their entirety. The ability [R:\LIBZZ]05153.doc:mrr of the conditioned medium to stimulate Flt4 phosyphorylation was considerably increased when the PC-3 conditioned medium was concentrated four-fold using a concentrator (Amicon). Pretreatment of the concentrated PC-3 conditioned medium with microliters of Flt4 extracellular domain coupled to CNBr-activated sepharose CL-4B (Pharmacia; about 1 mg of Flt4EC domain/ml sepharose resin) completely abolished Flt4 tyrosine phosphorylation. Similar pretreatment of the conditioned medium with unsubstituted sepharose CL-4B did not affect stimulatory activity. Also, the flow through obtained after concentration, which contained proteins of less than 10,000 molecular weight, did not stimulate Flt4 phosphorylation.
In another experiment, a comparison of Flt4 autophosphorylation in transformed NIH 3T3 cells expressing LTRFlt41 was conducted, using unconditioned medium, medium from PC-3 cells expressing the Flt4 ligand, or unconditioned medium containing either 50 ng/ml of VEGF165 or 50 ng/ml of PIGF-1. The cells were lysed, immunoprecipitated using anti-Flt4 antiserum and analyzed by Western blotting using anti-phosphotyrosine antibodies. Only the PC-3 conditioned medium expressing the Flt4 ligand (lane Flt-4L) stimulated Flt4 autophosphorylation.
These experiments showed that PC-3 cells produce a ligand which binds to the extracellular domain of Flt4 and activates this receptor.
EXAMPLE Purification of the Flt4 Ligand The ligand expressed by human PC-3 cells as characterized in Example 4 was purified and isolated using a recombinantly-produced Flt4 extracellular domain (Flt4EC) in affinity chromatography.
[R:\LIBZZ]05153.doc:mn 36 Two harvests of serum-free conditioned medium, comprising a total of 8 liters, were collected from 500 confluent 15 cm diameter culture dishes containing confluent layers of PC-3 cells. The conditioned medium was clarified by centrifugation at 10,000 x g and concentrated 80-fold using an Ultrasette Tangential Flow Device (Filtron, Northborough, MA) with a 10 kD cutoff Omega Ultrafiltration membrane according to the manufacturer's instructions. Recombinant Flt4 extracellular domain was expressed in a recombinant baculovirus cell system and purified by affinity chromatography on Niagarose (Ni-NTA affinity column obtained from Qiagen). The purified extracellular domain was coupled to CNBr-activated Sepharose CL-4B at a concentration of 5 mg/ml and used as an affinity matrix for ligand affinity chromatography.
Concentrated conditioned medium was incubated with 2 ml of the recombinant Flt4 extracellular domain-Sepharose affinity matrix in a rolling tube at room temperature for 3 hours. All subsequent purification steps were at +4 The affinity matrix was then transferred to a column with an inner diameter of 15 mm and washed successively with 100 ml of PBS and 50 ml of 10 mM Na-phosphate buffer (pH 6.8).
Bound material was eluted step-wise with 100 mM glycine-HCI, successive 6 ml elutions having pHs of 4.0, 2.4, and 1.9. Several 2 ml fractions of the eluate were collected in tubes containing 0.5 ml 1 M Na-phosphate (pH Fractions were mixed immediately and dialyzed in 1 mM Tris-HCI (pH Aliquots of 75 pl each were analyzed for their ability to stimulate tyrosine phosphorylation of Flt4. The ultrafiltrate, 100 pl aliquots of the concentrated conditioned medium before and after ligand affinity chromatography, as well as 15-fold concentrated fractions of material released from the Flt4 extracellular domain-Sepharose matrix during the washings were also analyzed for their ability to stimulate Flt4 tyrosine phosphorylation.
The concentrated conditioned medium induced prominent tyrosine phosphorylation of Flt4 in transfected NIH 3T3 cells over-expressing Flt4. This activity was not observed in conditioned medium taken after medium was exposed to the Flt4 Sepharose affinity matrix. The specifically-bound Flt4-stimulating material was retained on the affinity matrix after washing in PBS, 10 mM Na-phosphate buffer (pH and at 3 0 pH 4.0. It was eluted in the first two 2 ml aliquots at pH 2.4. A further decrease of the pH of the elution buffer did not cause release of additional Flt4-stimulating material. No Flt4 phosphorylation was observed in a control wherein Flt4-expressing cells were treated 37 with unconditioned medium; similarly, no phosphorylation was observed following treatment of Flt4-expressing cells with the ultrafiltrate fraction of conditioned medium containing polypeptides of less than 10 kD molecular weight.
Small aliquots of the chromatographic fractions were concentrated in a SpeedVac concentrator (Savant, Farmingdale, and subjected to SDS-PAGE under reducing conditions with subsequent silver staining of the gel, a standard technique in the art. The major polypeptide, having a molecular weight of approximately 23 kD (reducing conditions), was detected in the fractions containing Flt4 stimulating activity. That polypeptide was not found in the other chromatographic fractions. On the other hand, besides these bands and a very faint band having a 32 kD mobility, all other components detected in the two active fractions were also distributed in the starting material and in small amounts in the other washing and eluting steps after their concentration. Similar results were obtained in three independent affinity purifications, indicating that the 23 kD polypeptide binds with high affinity to Flt4 and induces tyrosine phosphorylation of Flt4.
Fractions containing the 23 kD polypeptide were combined, dried in a SpeedVac concentrator and subjected to SDS-PAGE in a 12.5% gel. The proteins from the gel were then electroblotted to Immobilon-P (PVDF) transfer membrane (Millipore, Marlborough, MA) and visualized by staining of the blot with Coomassie Blue R-250.
The region containing only the stained 23 kD band was cut from the blot and subjected to 2 0 N-terminal amino acid sequence analysis in a Prosite Protein Sequencing System (Applied Biosystems, Foster City, CA). The data were analyzed using a 610A Data Analysis System (Applied Biosystems). Analysis revealed a single N-terminal sequence of NHz- XEETIKFAAAHYNTEILK-COOH (SEQ ID NO: EXAMPLE 6 Construction of PC-3 cell cDNA library in a eukaryotic expression vector Human poly(A)* RNA was isolated from five 15 cm diameter dishes of confluent PC-3 cells by a single step method using oligo(dT) (Type III, Collaborative Biomedical Products, Becton-Dickinson Labware, Bedford, MA) cellulose affinity chromatography (Sambrook et al., 1989). The yield was 70 micrograms. Six micrograms of the Poly(A)* RNA were used to prepare an oligo(dT)-primed cDNA library in the 38 mammalian expression vector pcDNA I and the Librarian kit of Invitrogen according to the instructions included in the kit. The library was estimated to contain about 10 6 independent recombinants with an average insert size of approximately 1.8 kb.
EXAMPLES 7-9 Amplification of a cDNA encoding the FIt4 ligand amino terminus The procedures used to isolate a cDNA encoding the Flt4 ligand are described in detail in commonly-owned PCT patent application PCT/FI96/00427, filed August 01, 1996, and commonly-owned United States Patent Application Serial Nos. 08/671,573, filed June 28, 1996; 08/601,132, filed February 14, 1996 Patent No. 6,403,088); 08/585,895, filed January 12, 1996 Patent No. 6,245,530); and 08/510,133, filed August 1, 1995 Patent No. 6,221,839), all of which are incorporated by reference herein. Initially, degenerate oligonucleotides were designed based on the N-terminal amino acid sequence of the isolated human Flt4 ligand (see Example 5) and were used as primers in a polymerase chain reaction (PCR) to amplify a partial cDNA encoding the (fully-processed) Flt4 ligand amino terminus from the PC-3 cDNA library. The amplified cDNA fragment was cloned into a pCR II vector (Invitrogen) using the TA cloning kit (Invitrogen) and sequenced using the radioactive dideoxynucleotide sequencing method of Sanger. Six clones were analyzed and all six clones contained the sequence encoding the expected peptide (amino acid residues 104-120 of the Flt4 ligand precursor, SEQ ID NO:8). Nucleotide sequence spanning the region from the third nucleotide of codon 6 to the third nucleotide of codon 13 (the extension region between the PCR primers) was identical in all six clones and thus represented an amplified product from the unique sequence encoding part of the amino terminus of the Flt4 ligand.
Based on the unique nucleotide sequence encoding the N-terminus of the isolated human Flt4 ligand, two pairs of nested primers were designed to amplify, in two nested PCR reactions, the complete 5'-end of the corresponding cDNAs from one microgram of DNA of the above-described PC-3 cDNA library. One major product of about 220 bp and three minor products of about 270 bp, 150bp, and 100bp were obtained.
The amplified fragment of approximately 220 bp was excised from an agarose gel, cloned into a pCRII vector using the TA cloning kit, and sequenced, Three (R:\LIBZZ]05153.doc:mrr 39 recombinant clones were analyzed and they contained the sequence
TCACTATAGGGAGACCCAAGCTTGGTACCGAGCTCGGATCCACTAGTAACGGC
CGCCAGTGTGGTGGAATTCGACGAACTCATGACTGTACTCTACCCAGAATATT
GGAAAATGTACAAGTGTCAGCTAAGGCAAGGAGGCTGGCAACATAACAGAGA
ACAGGCCAACCTCAACTCAAGGACAGAAGAGACTATAAAATTCGCTGCAGCA
CACTACAAC- 3' (SEQ ID NO: The beginning of the sequence represents the vector and the underlined sequence represents the amplified product of the 5'-end of the cDNA insert.
Based upon the amplified 5'-sequence of the clones encoding the amino terminus of the 23 kD human Flt4 ligand, two pairs of non-overlapping nested primers were designed to amplify the 3'-portion of the Flt4-ligand-encoding cDNA clones via PCR.
Two DNA fragments were obtained, having sizes of 1350 bp and 570 bp. Those fragments were cloned into a pCRII vector and the inserts of the clones were sequenced.
Both of these fragments were found to contain sequences encoding an amino acid sequence homologous to the VEGF sequence.
EXAMPLE Screening the PC-3 cell cDNA library using the PCR fragment of Flt4 ligand cDNA A 153 bp fragment encoding the 5' end of the Flt4 ligand was labeled with 3 2 P]-dCTP using the Klenow fragment of E. coli DNA polymerase I (Boehringer Mannheim). That fragment was used as a probe for hybridization screening of the amplified PC-3 cell cDNA library.
Filter replicas of the library were hybridized with the radioactively labeled probe at 420C for 20 hours in a solution containing 50% formamide, 5x SSPE, Denhardt's solution, 0.1% SDS and 0.1 mg/mi denatured salmon sperm DNA. Filters were washed twice in lx SSC, 0.1% SDS for 30 minutes at room temperature, then twice for 30 minutes at 65"C and exposed overnight.
On the basis of autoradiography, 10 positive recombinant bacterial colonies hybridizing with the probe were chosen from the library. Plasmid DNA was purified from 3 0 these colonies and analyzed by EcoRI and NotI digestion and agarose gel electrophoresis followed by ethidium bromide staining. The ten plasmid clones were divided into three 40 groups on the basis of the presence of insert sizes of approximately 1.7, 1.9 and 2.1 kb, respectively. Inserts of plasmids from each group were sequenced using the T7 oligonucleotide as a primer and walking primers for subsequent sequencing reactions.
Sequence analysis showed that all clones contain the open reading frame encoding the NH2-terminal sequence of the 23 kD human Flt4 ligand. Dideoxy sequencing was continued using walking primers in the downstream direction. A complete human cDNA sequence and deduced amino acid sequence from a 2 kb clone is set forth in SEQ ID NOs: 7 and 8, respectively. A putative cleavage site of a "prepro" leader sequence is located between residues 102 and 103 of SEQ ID NO: 8. When compared with sequences in the GenBank Database, the predicted protein product of this reading frame was found to include a region homologous with the predicted amino acid sequences of the PDGF/VEGF family of growth factors, as shown in Fig. 2.
Plasmid pFLT4-L, containing the 2.1 kb human cDNA clone in pcDNAI vector, has been deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville, MD 20852 as accession number 97231.
EXAMPLE 11 Stimulation of Flt4 autophosphorylation by the protein product of the Flt4 ligand vector The 2.1 kb human cDNA insert of plasmid pFlt4-L, which contains the open reading frame encoding the sequence shown in SEQ ID NOs: 7 and 8; human prepro- VEGF-C, see below), was cut out from the pcDNAI vector using HindIII and NotI restriction enzymes, isolated from a preparative agarose gel, and ligated to the corresponding sites in the pREP7 expression vector (Invitrogen). The pREP7 vector containing the pFlt4-L insert was transfected into 293-EBNA cells (Invitrogen) using the calcium phosphate transfection method (Sambrook el al., 1989). About 48 hours after transfection, the medium of the transfected cells was changed to DMEM medium lacking fetal calf serum and incubated for 36 hours. The conditioned medium was then collected, centrifuged at 5000 x g for 20 minutes, the supernatant was concentrated 5-fold using Centriprep 10 (Amicon) and used to stimulate NIH 3T3 cells expressing LTRFlt41 (the Flt4 receptor), as in Example 4. The cells were lysed, immunoprecipitated using anti-Fit4 antiserum and analyzed by Western blotting using anti-phosphotyrosine antibodies.
41 The conditioned medium from two different dishes of the transfected cells stimulated Flt4 autophosphorylation in comparison with the medium from mocktransfected cells, which gave only background levels of phosphorylation of the Flt4 receptor. When the concentrated conditioned medium was pre-absorbed with microliters of a slurry of Flt4EC domain coupled to Sepharose (see example no phosphorylation was obtained, showing that the activity responsible for Flt4 autophosphorylation was indeed the Flt4 ligand. Thus, these results demonstrate that an expression vector having an approximately 2.1 kb insert and containing an open reading frame as shown in SEQ ID NO: 7 is expressed as a biologically active Fit4 ligand (VEGF- C) in transfected cells. The sequence encoded by that open reading frame is shown in SEQ ID NO: 8.
The deduced molecular weight of a polypeptide consisting of the complete amino acid sequence in SEQ ID NO: 8 (residues 1 to 419) is 46,883. The deduced molecular weight of a polypeptide consisting of amino acid residues 103 to 419 of SEQ ID NO: 8 is 35,881. The Flt4 ligand purified from PC-3 cultures had an observed molecular weight of about 23 kD as assessed by SDS-PAGE under reducing conditions. Thus, it appeared that the Flt4 ligand mRNA was translated into a precursor polypeptide, from which the mature ligand was derived by proteolytic cleavage. Also, the Flt4 ligand may be glycosylated at three putative N-linked glycosylation sites conforming to the consensus which can be identified in the deduced Flt4 ligand amino acid sequence (N-residues underlined in Fig. 2).
The carboxyl terminal amino acid sequences, which increase the predicted molecular weight of the Flt4 ligand subunit in comparison with other ligands of this family, show a pattern of spacing of cysteine residues reminiscent of the Balbiani ring 3 protein (BR3P) sequence (Dignam et al., Gene, 88:133-140 (1990)). Such a sequence may encode an independently folded domain present in a Flt4 ligand precursor and it may be involved, for example, in the regulation of secretion, solubility, stability, cell surface localization or activity of the Flt4 ligand. Interestingly, at least one cysteine motif of the BR3P type is also found in the VEGF carboxy terminal amino acid sequences.
Thus, the Flt4 ligand mRNA appears first to be translated into a precursor from the mRNA corresponding to the cDNA insert of plasmid FLT4-L, from which the mature ligand is derived by proteolytic cleavage. To define the mature Flt4 ligand 42 polypeptide, one first expresses the cDNA clone (which is deposited in the pcDNAI expression vector) in cells, such as COS cells. One uses antibodies generated against encoded polypeptides, fragments thereof, or bacterial Flt4 fusion proteins, such as a GSTfusion protein, to raise antibodies against the VEGF-homologous domain and the aminoand carboxyl-terminal propeptides of Flt4 ligand. One then follows the biosynthesis and processing of the Flt4 ligand in the transfected cells by pulse-chase analysis using radioactive cysteine for labeling of the cells, immunoprecipitation, and gel electrophoresis.
Using antibodies against the three domains of the product encoded by the cDNA insert of plasmid FLT4-L, material for radioactive or nonradioactive amino-terminal sequence analysis is isolated. The determination of the amino-terminal sequence of the mature VEGF-C polypeptide allows for identification of the amino-terminal proteolytic processing site. The determination of the amino-terminal sequence of the carboxyl-terminal propeptide will give the carboxyl-terminal processing site. This is confirmed by sitedirected mutagenesis of the amino acid residues adjacent to the cleavage sites, which would prevent the cleavage.
The Fit4 ligand is further characterizeable by progressive 3' deletions in the 3' coding sequences of the Flt4 ligand precursor clone, introducing a stop codon resulting in carboxy-terminal truncations of its protein product. The activities of such truncated forms are assayed by, for example, studying Flt4 autophosphorylation induced by the truncated proteins when applied to cultures of cells, such as NIH 3T3 cells expressing LTRFlt41. By extrapolation from studies of the structure of the related platelet derived growth factor (PDGF, Heldin et al., Growth Factors, 8:245-252 (1993)) one determines that the region critical for receptor activation by the Flt4 ligand is contained within the first approximately 180 amino acid residues of the secreted VEGF-C protein lacking the putative 102 amino acid prepro leader (SEQ ID NO: 8, residues 103-282), and apparently within the first approximately 120 amino acid residues (SEQ ID NO: 8, residues 103-223).
On the other hand, the difference between the molecular weights observed for the purified ligand and deduced from the open reading frame of the Flt4 ligand clone may be due to the fact that the soluble ligand was produced from an alternatively spliced mRNA which would also be present in the PC-3 cells, from which the isolated ligand was derived. To isolate such alternative cDNA clones one uses cDNA fragments of the deposited clone and PCR primers made according to the sequence provided as well as 43 techniques standard in the art to isolate or amplify alternative cDNAs from the PC-3 cell cDNA library. One may also amplify using reverse transcription (RT)-PCR directly from the PC-3 mRNA using the primers provided in the sequence of the cDNA insert of plasmid FLT4-L. Alternative cDNA sequences are determined from the resulting cDNA clones.
One can also isolate genomic clones corresponding to the Flt4 ligand mRNA transcript from a human genomic DNA library using methods standard in the art and sequence such clones or their subcloned fragments to reveal the corresponding exons. Alternative exons can then be identified by a number of methods standard in the art, such as heteroduplex analysis of cDNA and genomic DNA, which are subsequently characterized.
EXAMPLE 12 Expression of the Gene Encoding VEGF-C in Human Tumor Cell Lines Expression of transcripts corresponding to the Flt4 ligand (VEGF-C) was analyzed by hybridization of Northern blots containing isolated poly(A) RNA from HT- 1080 and PC-3 human tumor cell lines. The probe was the radioactively labeled insert of the 2.1 kb cDNA clone (pFlt4-L/VEGF-C, specific activity 10'-10' cpm/mg of DNA). The blot was hybridized overnight at 42*C using 50% formamide, 5x SSPE buffer, 2% SDS, x Denhardt's solution, 100 mg/ml salmon sperm DNA and I x 10' cpm of the labeled probe/ml. The blot was washed at room temperature for 2 x 30 minutes in 2x SSC containing 0.05% SDS, and then for 2 x 20 minutes at 52*C in 0.lx SSC containing 0.1% SDS. The blot was then exposed at -70"C for three days using intensifying screens and Kodak XAR film. Both cell lines expressed an Flt4 ligand mRNA of about 2.4 kb, as well as VEGF and VEGF-B mRNAs.
EXAMPLE 13 VEGF-C Chains Are Proteolytically Processed after Biosynthesis and Disulfide Linked The predicted molecular mass of a secreted human VEGF-C polypeptide, as deduced from the VEGF-C open reading frame, is 46,883 kD, suggesting that VEGF-C mRNA may be first translated into a precursor, from which the observed ligands of 21/23 kD and 29/32 kD are derived by proteolytic cleavage.
This possibility was explored by metabolic labeling of 293 EBNA cells expressing VEGF-C. Initially, 293 EBNA cells were transfected with the VEGF-C cDNA construct.
Expression products were labeled by the addition of 100 pCi/ml of Pro-mixTM L-[ 35 S] in vitro cell labeling mix ((containing 35 S-methionine and 35 S-cysteine) Amersham, Buckinghamshire, England) to the culture medium devoid of cysteine and methionine.
After two hours, the cell layers were washed twice with PBS and the medium was then replaced with DMEM-0.2% BSA. After 1, 3, 6, 12 and 24 hours of subsequent incubation, the culture medium was collected, clarified by centrifugation, and concentrated, and human VEGF-C was bound to 30 p1 of a slurry of Flt4EC-Sepharose overnight at +4 0 C, followed by three washes in PBS, two washes in 20 mM Tris-HCl (pH alkylation, SDS-PAGE and autoradiography. Alkylation was carried out by treatment of the samples with 10 mM 1,4 Dithiothreitol (Boehringer-Mannheim, Mannheim, Germany) for one hour at 25°C, and subsequently with 30 mM iodoacetamide (Fluka, Buchs, Switzerland).
These experiments demonstrated that a putative precursor polypeptide of 32 kD apparent molecular mass was bound to the Flt4EC affinity matrix from the conditioned medium of metabolically labeled cells transfected with the human VEGF-C expression vector, but not from mock transfected cells. Increased amounts of a 23 kD receptor binding polypeptide accumulated in the culture medium of VEGF-C transfected cells during a subsequent chase period of three hours, but not thereafter, suggesting that the 23 kD form is produced by proteolytic processing, which is incomplete, at least in the transiently transfected cells. Subsequent experiments showed that the 32 kD VEGF-C form contains two components migrating in the absence of alkylation as polypeptides of 29 and 32 kD (Figs. 6-8).
In a related experiment, human VEGF-C isolated using Flt4EC-Sepharose after a 4 hour continuous metabolic labeling was analyzed by polyacrylamide gel electrophoresis in nonreducing conditions. Higher molecular mass forms were observed under nonreducing conditions, suggesting that the VEGF-C polypeptides can form disulfidelinked dimers and/or multimers. Gel photographs depicting these experimental results are set forth in Figures 13A-B of PCT application PCT/FI96/00427 (publication WO 97/05250) and Figures 3A-B of U.S. Patent Application Serial No. 08/795,430 (U.S.
Patent No. 6,130,078), which are incorporated herein by reference.
[R:\LIBZZ]05153.doc:mn 45 Additional experiments have shown that higher molecular mass forms of VEGF-C (about 58 kD and about 43 kD) are observed under reducing conditions as well.
(See below and Fig. 6A.) EXAMPLE 14 Stimulation Of VEGFR-2 Autophosphorylation By VEGF-C Conditioned medium (CM) from 293 EBNA cells transfected with the human VEGF-C vector also was used to stimulate porcine aortic endothelial (PAE) cells expressing VEGFR-2 (KDR). Pajusola et al., Oncogene, 9:3545-55 (1994); Waltenberger et al., J. Biol. Chem., 269:26988-26995 (1994). The cells were lysed and immunoprecipitated using VEGFR-2 specific antiserum (Waltenberger et al., 1994).
PAE-KDR cells (Waltenberger et al., 1994) were grown in Ham's F12 fetal calf serum (FCS). Confluent NIH 3T3-Flt4 cells or PAE-KDR cells were starved overnight in DMEM or Ham's F12 medium, respectively, supplemented with 0.2% bovine serum albumin (BSA), and then incubated for 5 minutes with the analyzed media. Recombinant human VEGF (R&D Systems) and PDGF-BB, functional as stimulating agents, were used as controls. The cells were washed twice with ice-cold Tris-Buffered Saline (TBS) containing 100 mM sodium orthovanadate and lysed in RIPA buffer containing 1 mM phenylmethylsulfonyl fluoride (PMSF), 0.1 U/ml aprotinin and I mM sodium orthovanadate. The lysates were sonicated, clarified by centrifugation at 16,000 x g for 20 minutes and incubated for 3-6 hours on ice with 3-5 pl of antisera specific for Flt4 (Pajusola et al., 1993), VEGFR-2 or PDGFR-0 (Claesson-Welsh et al., J.
Biol. Chem., 264:1742-1747 (1989); Waltenberger et al., 1994). Immunoprecipitates were bound to protein A-Sepharose, washed three times with RIPA buffer containing ImM PMSF, ImM sodium orthovanadate, washed twice with 10 mM Tris-HCI (pH 7.4), and subjected to SDS-PAGE using a 7% gel. Polypeptides were transferred to nitrocellulose by Western blotting and analyzed using PY20 phosphotyrosine-specific monoclonal antibodies (Transduction Laboratories) or receptor-specific antiserum and the ECL detection method (Amersham Corp.).
PAE cells expressing VEGFR-2 were treated with 10- or 2-fold concentrated medium from mock-transfected 293-EBNA cells, or with 5- or concentrated medium from 293-EBNA cell cultures expressing the recombinant VEGF-C.
46 VEGFR-2 was immunoprecipitated with specific antibodies and analyzed by SDS-PAGE and Western blotting using phosphotyrosine antibodies. For comparison, the treatments were also carried out with non-conditioned medium containing 50 ng/ml of purified recombinant VEGF. Additional cells were also treated with VEGF-C- or VEGFcontaining media pretreated with FIt4EC.
The results of this experiment were as follows. A basal level of tyrosine phosphorylation of VEGFR-2 was detected in cells stimulated by CM from the mocktransfected cells. A further concentration of this medium resulted in only a slight enhancement of VEGFR-2 phosphorylation. CM containing recombinant VEGF-C stimulated tyrosine autophosphorylation of VEGFR-2 and the intensity of the autophosphorylated polypeptide band was increased upon concentration of the VEGF-C CM. Furthermore, the stimulating effect was abolished after pretreatment of the medium with the Flt4EC affinity matrix. The maximal effect of VEGF-C in this assay was comparable to the effect of recombinant VEGF added to unconditioned medium at concentration of 50 ng/ml. Pretreatment of the medium containing VEGF with Flt4EC did not abolish its stimulating effect on VEGFR-2. These results suggest that the VEGF-C expression vector encodes a ligand not only for Flt4 (VEGFR-3), but also for KDR/Flk-1 (VEGFR-2).
In order to further confirm that the stimulating effect of VEGF-C on tyrosine phosphorylation of VEGFR-3 and VEGFR-2 was receptor-specific, we analyzed the effect of VEGF-C on tyrosine phosphorylation of PDGF receptor 0 (PDGFR-P) which is abundantly expressed on fibroblastic cells. PDGFR-P-expressing NIH 3T3 cells were treated with non-conditioned medium, 5-fold concentrated CM from mock-transfected or VEGF-C- transfected cells, or with non-conditioned medium containing 50 ng/ml of recombinant human PDGF-BB. Medium containing VEGF-C was also pretreated with recombinant Flt4EC (lane PDGFR-P was immunoprecipitated with specific antibodies and analyzed by SDS-PAGE and Western blotting using phosphotyrosine antibodies with subsequent stripping and reprobing of the membrane with antibodies specific for PDGFR-p. A weak tyrosine phosphorylation of PDGFR-P was detected upon stimulation of Flt4-expressing NIH 3T3 cells with CM from the mock-transfected cells. A similar low level of PDGFR-P phosphorylation was observed when the cells were incubated with CM from the VEGF-C transfected cells, with or without prior treatment with FIt4EC. In 47 contrast, the addition of 50 ng/ml of PDGF-BB induced a prominent tyrosine autophosphorylation of PDGFR-P.
EXAMPLE VEGF-C Stimulates Endothelial Cell Migration In Collagen Gel Conditioned media (CM) from cell cultures transfected with the VEGF-C expression vector was placed in a well made in collagen gel and used to stimulate the migration of bovine capillary endothelial (BCE) cells in the three-dimensional collagen gel as follows.
BCE cells (Folkman et al., Proc. Natl. Acad. Sci. (USA), 76:5217-5221 (1979)) were cultured as described in Pertovaara et al., J. Biol. Chem., 269:6271-74 (1994). The collagen gels were prepared by mixing type I collagen stock solution mg/ml in 1 mM HCI) with an equal volume of 2x MEM and 2 volumes of MEM containing 10% newborn calf serum to give a final collagen concentration of 1.25 mg/ml.
The tissue culture plates (5 cm diameter) were coated with about 1 mm thick layer of the solution, which was allowed to polymerize at 37°C. BCE cells were seeded on top of this layer. For the migration assays, the cells were allowed to attach inside a plastic ring (1 cm diameter) placed on top of the first collagen layer. After 30 minutes, the ring was removed and unattached cells were rinsed away. A second layer of collagen and a layer of growth medium newborn calf serum solidified by 0.75% low melting point agar (FMC BioProducts, Rockland, ME), were added. A well (3 mm diameter) was punched through all the layers on both sides of the cell spot at a distance-of4 mm, and the sample or control media were pipetted daily into the wells. Photomicrographs of the cells migrating out from the spot edge were taken after six days through an Olympus CK 2 inverted microscope equipped with phase-contrast optics. The migrating cells were counted after nuclear staining with the fluorescent dye bisbenzimide (1 mg/ml, Hoechst 33258, Sigma).
The number of cells migrating at different distances from the original area of attachment towards wells containing media conditioned by the non-transfected (control) or transfected (mock; VEGF-C; VEGF) cells were determined 6 days after addition of the media. The number of cells migrating out from the original ring of attachment was 48 counted in five adjacent 0.5 mm x 0.5 mm squares using a microscope ocular lens grid and lOx magnification with a fluorescence microscope. Cells migrating further than 0.5 mm were counted in a similar way by moving the grid in 0.5 mm steps. The experiments were carried out twice with similar results. At each distance, VEGF-C-containing CM stimulated cell migration more than medium conditioned by the non-transfected or mock-transfected cells but less than medium from cells transfected with a VEGF expression vector. Daily addition of 1 ng of FGF2 into the wells resulted in the migration of approximately twice the number of cells when compared to the stimulation by CM from VEGF-transfected cells.
In related experiments, a "recombinantly-matured" VEGF-C polypeptide (VEGF-C ANACHis, described below) was shown to stimulate the incorporation of 3
'H-
thymidine into the DNA of BCE cells in a dose dependent manner (VEGF-C concentrations of 0, 10, 100, and 1000 pM tested). This data tends to confirm the observation, under light microscopy, that VEGF-C stimulates proliferation of these cells.
EXAMPLE 16 VEGF-C Is Expressed In Multiple Tissues Northern blots containing 2 micrograms of isolated poly(A)* RNA from multiple human tissues (blot from Clontech Laboratories, Inc., Palo Alto, CA) were probed with radioactively labeled insert of the 2.1 kb VEGF-C cDNA clone. Northern blotting and hybridization analysis showed that the 2.4 kb RNA and smaller amounts of a kb mRNA are expressed in multiple human tissues, most prominently in the heart, placenta, muscle, ovary and small intestine, and less prominently in prostate, colon, lung, pancreas, and spleen. Very little VEGF-C RNA was seen in the brain, liver, kidney, testis, or thymus and peripheral blood leukocytes (PBL) appeared negative. A similar analysis of RNA from human fetal brain, lung, liver, and kidney tissues showed that VEGF-C is highly expressed in the kidney and lung and to a lesser degree in the liver, while essentially no expression is detected in the brain. Interestingly, VEGF expression correlates with VEGF-C expression in these tissues, whereas VEGF-B is highly expressed in all four fetal tissues analyzed.
49 EXAMPLE 17 The VEGF-C Gene Localizes To Chromosome 4q34 A DNA panel of 24 interspecies somatic cell hybrids, which had retained one or two human chromosomes, was used for the chromosomal localization of the VEGF-C gene (Bios Laboratories, Inc., New Haven, CT). DNAs from human rodent somatic cell hybrids containing defined sets of human chromosomes were analyzed by Southern blotting and hybridization with a VEGF-C cDNA probe. Among 24 DNA samples on the hybrid panel, representing different human chromosomes, human-specific signals were observed only in hybrids which contained human chromosome 4. The results were confirmed by PCR of somatic cell hybrid DNAs using VEGF-C specific primers, where amplified bands were obtained only from DNAs containing human chromosome 4.
A genomic P plasmid for VEGF-C was isolated using specific primers and PCR and verified by Southern blotting and hybridization using a VEGF-C specific cDNA probe. The chromosomal localization of VEGF-C was further studied using metaphase FISH. Using the P1 probe for VEGF-C in FISH, a specific hybridization to the 4q34 chromosomal band was detected in 40 out of 44 metaphases. Double-fluorochrome hybridization using a cosmid probe specific for the aspartylglucosaminidase (AGA) gene showed that VEGF-C is located just proximal to the AGA gene previously mapped to the 4q34-35 chromosomal band.
Biotin-labeled VEGF-C PI and digoxigenin-labeled AGA cosmid probes were hybridized simultaneously to metaphase chromosomes. This experiment demonstrated that the AGA gene is more telomerically located than the VEGF-C gene.
The foregoing example demonstrates the utility of polynucleotides of the invention as chromosomal markers and for the presence or absence of the VEGF-C gene region in normal or diseased cells. The VEGF-C locus at 4q34 is a candidate target for mutations leading to vascular malformations or cardiovascular diseases.
EXAMPLE 18 Effect of glucose concentration and hypoxia on VEGF, VEGF-B and VEGF-C mRNA levels in C6 glioblastoma cells Confluent cultures of C6 cells (ATCC CCL 107) were grown on 10 cm diameter tissue culture plates containing 2.5 ml of DMEM and 5% fetal calf serum plus 50 antibiotics. The cultures were exposed for 16 hours to normoxia in a normal cell culture incubator containing 5% CO, or hypoxia by closing the culture plates in an airtight glass chamber and burning a piece of wood inside until the flame was extinguished due to lack of oxygen. Polyadenylated RNA was isolated (as in the other examples), and 8 micrograms of the RNA was electrophoresed and blot-hybridized with a mixture of the VEGF, VEGF-B and VEGF-C probes. The results show that hypoxia strongly induces VEGF mRNA expression, both in low and high glucose, but has no significant effect on the VEGF-B mRNA levels. The VEGF-C mRNA isolated from hypoxic cells runs slightly faster in gel electrophoresis and an extra band of faster mobility can be seen below the upper mRNA band. This observation suggests that hypoxia affects VEGF-C RNA processing. One explanation for this observation is that VEGF-C mRNA splicing is altered, affecting the VEGF-C open reading frame and resulting in an alternative VEGF-C protein being produced by hypoxic cells. Such alternative forms of VEGF-C and VEGF- C-encoding polynucleotides are contemplated as an aspect of the invention. This data indicates screening and diagnostic utilities for polynucleotides and polypeptides of the invention, such as methods whereby a biological sample is screened for the hypoxiainduced form of VEGF-C and/or VEGF-C mRNA. The data further suggests a therapeutic indication for antibodies and/or other inhibitors of the hypoxia-induced form of VEGF-C or the normal form of VEGF-C.
EXAMPLE 19 Pulse-chase labeling and immunoprecipitation of VEGF-C polypeptides from 293 EBNA cells transfected with VEGF-C expression vector.
The following VEGF-C branched amino-terminal peptide, designated PAMI26, was synthesized for production of anti-VEGF-C antiserum:
NH
2 -E-E-T-I-K-F-A-A-A-H-Y-N-T-E-I-L-K-COOH (SEQ ID NO: 9).
In particular, PAM126 was synthesized as a branched polylysine structure K3PA4 having four peptide acid (PA) chains attached to two available lysine residues. The synthesis was performed on a 433A Peptide Synthesizer (Applied Biosystems) using Fmoc-chemistry and TentaGel S MAP RAM10 resin mix (RAPP Polymere GmbH, Tubingen, Germany), yielding both cleavable and resin-bound peptides. The cleavable 51 peptide was purified via reverse phase HPLC and was used together with the resin-bound peptide in immunizations. The correctness of the synthesis products were confirmed using mass-spectroscopy (Lasermatt).
The PAM126 peptide was dissolved in phosphate buffered saline (PBS), mixed with Freund's adjuvant, and used for immunization of rabbits at bi-weekly intervals using methods standard in the art (Harlow and Lane, Antibodies, a laboratory manual, Cold Spring Harbor Laboratory Press (1988)). Antisera obtained after the fourth booster immunization was used for immunoprecipitation of VEGF-C in pulse-chase experiments, as described below.
For pulse-chase analysis, 293 EBNA cells transfected with a VEGF-C expression vector the FLT4-L cDNA inserted into the pREP7 expression vector as described above) were incubated for 30 minutes in methionine-free, cysteine-free, serum-free DMEM culture medium at 37"C. The medium was then changed, and 200 pCi of Pro-mixTM (Amersham), was added. The cell layers were incubated in this labeling medium for two hours, washed with PBS, and incubated for 0, 15, 30, 60, 90, 120, or 180 minutes in serum-free DMEM (chase). After the various chase periods, the medium was collected, the cells were again washed two times in PBS, and lysed in immunoprecipitation buffer. The VEGF-C polypeptides were analyzed from both the culture medium and from the cell lysates by immunoprecipitation, using the VEGF-C-specific antiserum raised against the NH2-terminal peptide (PAM126) of the 23 kD VEGF-C form.
Immunoprecipitated polypeptides were analyzed via SDS-PAGE followed by autoradiography.
The resultant autoradiograms demonstrated that immediately after a 2 hour labeling (chase time the VEGF-C vector-transfected cells contained a radioactive polypeptide band of about 58kD (originally estimated to be about 55 kD, and re-evaluated to be about 58 kD using different size standards), which was not observed in mock-transfected cells Most of this -58 kD precursor undergoes dimerization. This -58 kD polypeptide band gradually diminished in intensity with increasing chase periods.
A 32 kD polypeptide band also is observed in VEGF-C transfected cells (but not mocktransfected cells). This 32 kD band disappears from cells with similar kinetics to that of the -58 kD band. Additional analysis indicated that the 32 kD band was a doublet of 29 kD and 31-32 kD forms, held together by disulfide bonds. Simultaneously, increasing 52 amounts of 32 kD and subsequently 23 kD and 14-15 kD polypeptides appeared in the medium.
Collectively, the data from the pulse-chase experiments indicate that the -58 kD intracellular polypeptide represents a pro-VEGF-C polypeptide, which is proteolytically cleaved either intracellularly or at the cell surface into the 29 kD and 31-32 kD polypeptides. The 29/31 kD form is secreted and simultaneously further processed by proteolysis into the 23 kD and 14-15 kD forms. In additional experiments, disulfide linked dimers of the 29 kD and 15 kD forms were observed. Without intending to be limited to a particular theory, it is believed that processing of the VEGF-C precursor occurs as removal of a signal sequence, removal of the COOH-terminal domain (BR3P), and removal of an amino terminal polypeptide, resulting in a VEGF-C polypeptide having the TEE... amino terminus.
At high resolution, the 23 kD polypeptide band appears as a closely-spaced polypeptide doublet, suggesting heterogeneity in cleavage or glycosylation.
EXAMPLE Isolation of Mouse and Quail cDNA Clones Encoding VEGF-C A. Murine VEGF-C To clone a murine VEGF-C, approximately 1 x 106 bacteriophage lambda clones of a commercially-available 12 day mouse embryonal cDNA library (lambda EXIox library, Novagen, catalog number 69632-1) were screened with a radiolabeled fragment of human VEGF-C cDNA containing nucleotides 495 to 1661 of SEQ ID NO: 7. One positive clone was isolated.
A 1323 bp EcoRI/HindIII fragment of the insert of the isolated mouse cDNA clone was subcloned into the corresponding sites of the pBluescript SK+ vector (Stratagene) and sequenced. The cDNA sequence of this clone was homologous to the human VEGF-C sequence reported herein, except that about 710 bp of 5'-end sequence present in the human clone was not present in the mouse clone.
For further screening of mouse cDNA libraries, a Hindlll-BstXI (HindII site is from the pBluescript SK+ polylinker) fragment of 881 bp from the coding region of 3 0 the mouse cDNA clone was radiolabeled and used as a probe to screen two additional mouse cDNA libraries. Two additional cDNA clones from an adult mouse heart ZAP II 53 cDNA library (Stratagene, catalog number 936306) were identified. Three additional clones also were isolated from a mouse heart 5'-stretch-plus cDNA library in Agtl 1 (Clontech Laboratories, Inc., catalog number ML5002b). Of the latter three clones, one was found to contain an insert of about 1.9 kb. The insert of this cDNA clone was subcloned into EcoRI sites of pBluescript SK+ vector and both strands of this clone were completely sequenced, resulting in the nucleotide and deduced amino acid sequences shown in SEQ ID NOs: 10 and 11.
It is contemplated that the polypeptide corresponding to SEQ ID NO: 11 is processed into a mature mouse VEGF-C protein, in a manner analogous to the processing of the human VEGF-C prepropeptide. Putative cleavage sites for the mouse protein are identified using procedures outlined above for identification of cleavage sites for the human VEGF-C polypeptide.
The foregoing results demonstrate the utility of polynucleotides of the invention for identifying and isolating polynucleotides encoding other non-human mammalian VEGF-C proteins. Such identified and isolated polynucleotides, in turn, can be expressed (using procedures similar to those described in preceding examples) to produce recombinant polypeptides corresponding to non-human mammalian forms of
VEGF-C.
B. Quail VEGF-C The mouse and human VEGF-C sequences were used to design probes for isolating a quail VEGF-C cDNA from a quail cDNA library. A fragment of the human VEGF-C cDNA comprising nucleotides 495-1670 of SEQ ID NO: 7 was obtained by PCR amplification, cloned into the pCRII vector (Invitrogen) according to the manufacturer's instructions, and amplified. The insert was isolated by EcoRI digestion and preparative gel electrophoresis and then labeled using radioactive dCTP and random priming. A cDNA library made from quail embryos of stage E-4 in pcDNA-1 vector (Invitrogen) was then screened using this probe. About 200,000 colonies were plated and filter replicas were hybridized with the radioactive probe. Nine positive clones were identified and secondarily plated. Two of the nine clones hybridized in secondary screening. The purified clones (clones I and 14) had approximately 2.7 kb EcoRI inserts. Both clones were amplified and then sequenced using the T7 and SP6 primers (annealing to the vector). In addition, an internal Sphl restriction endonuclease cleavage site was identified about 1.9 kb from the 54 T7 primer side of the vector and used for subcloning and SphI fragments, followed by sequencing from the SphI end of the subclones. The sequences obtained were identical from both clones and showed a high degree of similarity to the human VEGF-C coding region. Subsequently, walking primers were made in both directions and double-stranded sequencing was completed for 1743 base pairs, including the full-length open reading frame.
The cDNA sequence obtained includes a long open reading frame and untranslated region. The DNA and deduced amino acid sequences for the quail cDNA are set forth in SEQ ID NOs: 12 and 13, respectively. Studies performed with the putative quail VEGF-C cDNA have shown that its protein product is secreted from transfected cells and interacts with avian VEGFR-3 and VEGFR-2, further confirming the conclusion that the cDNA encodes a quail VEGF-C protein. The proteins secreted from 293-EBNA cells transfected with quail VEGF-C cDNA were analyzed in immunoprecipitation studies using the VEGF-C-specific polyclonal antisera generated against the PAM 126 polypeptide (Example 19). A doublet band of about 30-32 kD, and a band of about 22-23 kD, were immunoprecipitated from the transfected cells but not from control cells. These immunoprecipitation studies thus provide a further indication that VEGF-C from nonhuman species is processed (from a prepro-VEGF-C form) in a manner analogous to the processing of human VEGF-C. As shown in Fig. 5, the human, murine, and avian (quail) VEGF-C precursor amino acid sequences share a significant degree of conservation. This high degree of homology between species permits the isolation of VEGF-C encoding sequences from other species, especially vertebrate species, and more particularly mammalian and avian species, using polynucleotides of the present invention as probes and using standard molecular biological techniques such as those described herein.
EXAMPLE 21 N-terminal peptide sequence analyses of recombinant VEGF-C Cells (293 EBNA) transfected with VEGF-C cDNA (see Example 13) secrete several forms of recombinant VEGF-C (Fig. 6A, lane IP). In the absence of alkylation, the three major, proteolytically-processed forms of VEGF-C migrate in SDS- PAGE as proteins with apparent molecular masses of 32/29 kD (doublet), 21 kD and kD. Two minor polypeptides exhibit approximate molecular masses of 63 and 52 kD, 55 respectively. One of these polypeptides is presumably a glycosylated and non-processed form; the other polypeptide is presumably glycosylated and partially processed. More precise size measurements (using SDS-PAGE under reducing conditions) revealed that the molecular masses of the VEGF-C forms that were initially estimated as 63, 52, 32, 23, and 14 kD (using SDS-PAGE under reducing conditions and a different set of size standards) are approximately 58, 43, 31, 29, 21, and 15 kD, respectfully (the initial measurements in most cases falling within acceptable 10% error of the more precise measurements).
To determine sites of proteolytic cleavage of the VEGF-C precursor, an immunoaffinity column was used to purify VEGF-C polypeptides from the conditioned medium of 293 EBNA cells transfected with VEGF-C cDNA. To prepare the immunoaffinity column, a rabbit was immunized with a synthetic peptide corresponding to amino acids 104-120 of SEQ ID NO: 8: H 2 N-EETIKFAAAHYNTEILK (see PAMI26 in Example 19). The IgG fraction was isolated from the serum of the immunized rabbit using protein A Sepharose (Pharmacia). The isolated IgG fraction was covalently bound to CNBr-activated Sepharose CL-4B (Pharmacia) using standard techniques at a concentration of 5 mg IgG/ml of Sepharose. This immunoaffinity matrix was used to isolate processed VEGF-C from 1.2 liters of the conditioned medium (CM).
The purified material eluted from the column was analyzed by gel electrophoresis and Western blotting. Fractions containing VEGF-C polypeptides were combined, dialyzed against 10 mM Tris HCI, vacuum-dried, electrotransferred to Immobilon-P (polyvinylidene difluoride or PVDF) transfer membrane (Millipore, Marlborough, MA) and subjected to.N-terminal amino acid sequence analysis.
The polypeptide band of 32 kD yielded two distinct sequences: NH 2 FESGLDLSDA... and NH 2 -AVVMTQTPAS... (SEQ ID NO: 14), the former corresponding to the N-terminal part of VEGF-C after cleavage of the signal peptide, starting from amino acid 32 (SEQ ID NO: and the latter corresponding to the kappachain of IlgG, which was present in the purified material due to "leakage" of the affinity matrix during the elution procedure.
In order to obtain the N-terminal peptide sequence of the 29 kD form of VEGF-C, a construct (VEGF-C NHis) encoding a VEGF-C mutant was generated. In particular, the construct encoded a VEGF-C mutant that fused a 6xHis tag to the Nterminus of the secreted precursor between amino acids 31 and 33 in SEQ ID NO: 56 The phenylalanine at position 32 was removed to prevent possible cleavage of the tag sequence during secretion of VEGF-C. The VEGF-C NHis construct was cloned into pREP7 as a vector; the construction is described more fully in Example 28, below.
The calcium phosphate co-precipitation technique was used to transfect VEGF-C NHis into 293 EBNA cells. Cells were incubated in DMEM/10% fetal calf serum in 15 cm cell culture dishes (a total of 25 plates). The following day, the cells were reseeded into fresh culture dishes (75 plates) containing the same medium and incubated for 48 hours. Cell layers were then washed once with PBS and DMEM medium lacking FCS was added. Cells were incubated in this medium for 48 hours and the medium was collected, cleared by centrifugation at 5000 x g and concentrated 500X using an Ultrasette Tangential Flow Device (Filtron, Northborough, MA), as described in Example 5 above.
VEGF-C NHis was purified from the concentrated conditioned medium using TALON'T Metal Affinity Resin (Clontech Laboratories, Inc.) and the manufacturer's protocol for native protein purification using imidazole-containing buffers. The protein was eluted with a solution containing 20 mM Tris-HCI (pH 100 mM NaCI, and 200 mM imidazole.
The eluted fractions containing purified VEGF-C NHis were detected by immunoblotting with Antiserum 882 (antiserum from rabbit 882, immunized with the PAM-126 polypeptide). Fractions containing VEGF-C NHis were combined, dialyzed and vacuumdried. Due to to the presence of the 6xHis tag at the N-terminus of this form of VEGF-C, the upper component of the major doublet of the VEGF-C NHis migrates slightly slower than the 32 kD form of wild type VEGF-C, thereby improving the separation of the VEGF-C NHis 32 kD mutant from the 29 kD band using SDS-PAGE. Approximately jig of the purified VEGF-C were subjected to SDS-PAGE under reducing conditions, electrotransferred to Immobilon-P (PVDF) transfer membrane (Millipore, Inc., Marlborough, MA) and the band at 29 kD was subjected to N-terminal amino acid sequence analysis. This sequence analysis revealed an N-terminal sequence of H 2
N-
SLPAT corresponding to amino acids 228-232 of VEGF-C (SEQ ID NO: 8).
The polypeptide band of 21 kD yielded the sequence H 2 N-AHYNTEILKS corresponding to an amino-terminus starting at amino acid 112 of SEQ ID NO: 8.
Thus, the proteolytic processing site which results in the 21 kD form of VEGF-C produced by transfected 293 EBNA cells apparently occurs nine amino acid residues 57 downstream of the cleavage site which results in the 23 kD form of VEGF-C secreted by PC-3 cells.
The N-terminus of the 15 kD form was identical to the N-terminus of the 32 kD form (NH 2 -FESGLDLSDA...). The 15 kD form was not detected when recombinant VEGF-C was produced by COS cells. This suggests that production of this form is cell lineage specific.
Example 22 Dimeric and monomeric forms of VEGF-C The composition of VEGF-C dimers was analyzed as follows. Cells (293 EBNA cells), transfected with the pREP7 VEGF-C vector as described in Example 11, were metabolically labeled with Pro-mix L-[ 3 S] labeling mix (Amersham Corp.) to a final concentration of 100 tCi/ml.
In parallel, a VEGF-C mutant, designated "R102S", was prepared and analyzed. To prepare the DNA encoding VEGF-C-RI 02S, the arginine codon at position 102 of SEQ ID NO: 8 was replaced with a serine codon. This VEGF-C-R102S-encoding DNA, in a pREP7 vector, was transfected into 293 EBNA cells and expressed as described above. VEGF-C polypeptides were immunoprecipitated using antisera 882 (obtained by immunization of a rabbit with a polypeptide corresponding to residues 104-120 of SEQ ID NO: 8 (see previous Example)) and antisera 905 (obtained by immunization of a rabbit with a polypeptide corresponding to a portion of the pro-VEGF-C leader: H 2 N-ESGLDLSDAEPDAGEATAYASK (residues 33 to 54 of SEQ ID NO: 8).
The immunoprecipitates from each cell culture were subjected to SDS- PAGE under non-denaturing conditions (Fig. 6B). Bands 1-6 were cut out from the gel, soaked for 30 minutes in lx gel-loading buffer containing 200 mM B-mercaptoethanol, and individually subjected to SDS-PAGE under denaturing conditions (Figs. 6A and 6C, lanes 1-6).
As can be seen from Figures 6A-C, each high molecular weight form of VEGF-C (Fig. 6B, bands 1-4) consists of at least two monomers bound by disulfide bonds (Compare Figs. 6A and 6C, lanes 1-4, in the reducing gels). The main component of bands 1-3 is the doublet of 32/29 kD, where both proteins are present in an equimolar 58 ratio. The main fraction of the 21 kD form is secreted as either a monomer or as a homodimer connected by means other than disulfide bonds (bands 6 and lanes 6 in Figs. 6A-C).
The RI02S mutation creates an additional site for N-linked glycosylation in VEGF-C at the asparagine residue at position 100 in SEQ ID NO: 8. Glycosylation at this additional glycosylation site increases the apparent molecular weight of polypeptides containing the site, as confirmed in Figures 6A-C and Figures 7A-B. The additional glycosylation lowers the mobility of forms of VEGF-C-R I 02S that contain the additional glycosylation site, when compared to polypeptides of similar primary structure corresponding to VEGF-C. Figures 6A-C and Figures 7A-B reveal that the VEGF-C- RI02S polypeptides corresponding to the 32 kD and 15 kD forms of wt VEGF-C exhibit increased apparent molecular weights, indicating that each of these polypeptides contains the newly introduced glycosylation site. In particular, the VEGF-C-R102S polypeptide corresponding to the 15 kD polypeptide from VEGF-C comigrates on a gel with the 21 kD form of the wild type (wt) VEGF-C, reflecting a shift on the gel to a position corresponding to a greater apparent molecular weight. (Compare lanes 4 in Figures 6A and 6C). The mobility of the 58 kD form of VEGF-C was slowed to 64 kD by the R102S mutation, indicating that this form contains the appropriate N-terminal peptide of VEGF- C. The mobilities of the 21, 29, and 43 kD forms were unaffected by the RIO02S mutation, suggesting that these polypeptides contain peptide sequences located C-terminally of Ro 2 In a related experiment, another VEGF-C mutant, designated "R226,227S," was prepared and analyzed. To prepare a DNA encoding VEGF-C-R226,227S, the arginine codons at positions 226 and 227 of SEQ ID NO: 8 were replaced with serine codons by site-directed mutagenesis. The resultant DNA was transfected into 293 EBNA cells as described above and expressed and analyzed under the same conditions as described for VEGF-C and VEGF-C-R102S. In the conditioned medium from the cells expressing VEGF-C-R226,227S, no 32 kD form of VEGF-C was detected. These results indicate that a C-terminal cleavage site of wild-type VEGF-C is adjacent to residues 226 and 227 of SEQ ID NO: 8, and is destroyed by the mutation of the arginines to serines.
Again, the mobility of the 29 kD component of the doublet was unchanged (Figures 7A-
B).
59 Taken together, these data indicate that the major form of the processed VEGF-C is a heterodimer consisting of(1) a polypeptide of 32 kD containing amino acids 32-227 of the prepro-VEGF-C (amino acids 32 to 227 in SEQ ID NO: 8) attached by disulfide bonds to a polypeptide of 29 kD beginning with amino acid 228 in SEQ ID NO: 8. These data are also supported by a comparison of the pattern of immunoprecipitated, labeled VEGF-C forms using antisera 882 and antisera 905.
When VEGF-C immunoprecipitation was carried out using conditioned medium, both antisera (882 and 905) recognized some or all of the three major processed forms of VEGF-C (32/29 kD, 21 kD and 15 kD). When the conditioned medium was reduced by incubation in the presence of 10 mM dithiothreitol for two hours at room temperature with subsequent alkylation by additional incubation with 25 mM iodoacetamide for 20 minutes at room temperature, neither antibody precipitated the 29 kD component, although antibody 882 still recognized polypeptides of 32 kD, 21 kD and kD. In subsequent experiments it was observed that neither antibody was capable of immunoprecipitating the 43 kD form. These results are consistent with the nature of the oligopeptide antigen used to elicit the antibodies contained in antisera 882, an oligopeptide containing amino acid residues 104-120 of SEQ ID NO: 8. On the other hand, antisera 905 recognized only the 32 kD and 15 kD polypeptides, which include sequence of the oligopeptide (amino acids 33 to 54 of SEQ ID NO: 8) used for immunization to obtain antisera 905. Taking into account the mobility shift of the 32 kD and 15 kD forms, the immunoprecipitation results with the R102S mutant were similar (Figs. 8A-B). The specificity of antibody 905 is confirmed by the fact that it did not recognize a VEGF-C AN form wherein the N-terminal propeptide spanning residues 32-102 of the unprocessed polypeptide had been deleted (Fig. 8B).
The results of these experiments also demonstrate that the 21 kD polypeptide is found in heterodimers with other molecular forms (see Figs. 6A-C and Figs. 7A-B), and secreted as a monomer or a homodimer held by bonds other than disulfide bonds (Figs. 6A and 6B, lanes 6).
The experiments disclosed in this example demonstrate that several forms of VEGF-C exist. A variety of VEGF-C monomers were observed and these monomers can vary depending on the level and pattern of glycosylation. In addition, VEGF-C was observed as a multimer, for example a homodimer or a heterodimer. The processing of 60 VEGF-C is schematically presented in Fig. 9 (disulfide bonds not shown). All forms of VEGF-C are within the scope of the present invention.
Example 23 In situ Hybridization of Mouse Embryos To analyze VEGF-C mRNA distribution in different cells and tissues, sections of 12.5 and 14.5-day post-coitus mouse embryos were prepared and analyzed via in situ hybridization using labeled VEGF-C probes. In situ hybridization of tissue sections was performed as described in Vastrik et al., J. Cell Biol., 128:1197-1208 (1995). A mouse VEGF-C antisense RNA probe was generated from linearized pBluescript II SK+ plasmid (Stratagene Inc., La Jolla, CA), containing a cDNA fragment corresponding to nucleotides 499-979 of a mouse VEGF-C cDNA (SEQ ID NO: Radiolabeled RNA was synthesized using T7 polymerase and ["S]-UTP (Amersham).
Mouse VEGF-B antisense and sense RNA probes were synthesized in a similar manner from linearized pCRII plasmid containing the mouse VEGF-B cDNA insert as described Olofsson elt al., Proc. Natl. Acad. Sci. (USA), 93:2576-2581 (1996). The high stringency wash was for 45 minutes at 65*C in a solution containing 30 mM dithiothreitol (DTT) and 4 x SSC. The slides were exposed for 28 days, developed and stained with hematoxylin.
For comparison, similar sections were hybridized with a VEGFR-3 probe and the 12.5-day p.c. embryos were also probed for VEGF-B mRNA.
Darkfield and lightfield photomicrographs from these experiments are presented in commonly-owned PCT patent application PCT/FI96/00427, filed August 01, 1996, published as WO 97/05250, incorporated by reference herein. Observations from the photomicrographs are summarized below. In a 12.5 day p.c. embryo, a parasagittal section revealed that VEGF-C mRNA was particularly prominent in the mesenchyme around the vessels surrounding the developing metanephros. In addition, hybridization signals were observed between the developing vertebrae, in the developing lung mesenchyme, in the neck region and developing forehead. The specificity of these signals was evident from the comparison with VEGF-B expression in an adjacent section, where the myocardium gave a very strong signal and lower levels of VEGF-B mRNA were detected in several other tissues. Both genes appear to be expressed in between the 61 developing vertebrae, in the developing lung, and forehead. Hybridization of the VEGF-C sense probe showed no specific expression within these structures.
Studies also were conducted of the expression patterns of VEGF-C and VEGFR-3 in 12.5 day p.c. mouse embryos in the jugular region, where the developing dorsal aorta and cardinal vein are located. This is the area where the first lymphatic vessels sprout from venous sac-like structures according to the long-standing theory of Sabin, Am. J. Anat., 9:43-91 (1909). An intense VEGF-C signal was detected in the mesenchyme surrounding the developing venous sacs which also were positive for VEGFR-3.
The mesenterium supplies the developing gut with blood and contains developing lymphatic vessels. The developing 14.5 day p.c. mesenterium is positive for VEGF-C mRNA, with particularly high expression in connective tissue surrounding certain vessels. The adjacent mesenterial VEGFR-3 signals that were observed originate from small capillaries of the mesenterium. Therefore, there appears to be a paracrine relationship between the production of the mRNAs for VEGF-C and its receptor. This data indicates that VEGF-C is expressed in a variety of tissues. Moreover, the pattern of expression is consistent with a role for VEGF-C in venous and lymphatic vessel development. Further, the data reveals that VEGF-C is expressed in non-human animals.
Example 24 Analysis of VEGF, VEGF-B, and VEGF-C mRNA Expression in Fetal and Adult Tissues A human fetal tissue Northern blot containing 2 ,g of polyadenylated RNAs from brain, lung, liver and kidney (Clontech Inc.) was hybridized with a pool of the following probes: a human full-length VEGF-C cDNA insert (Genbank Ace. No. X94216), a human VEGF-Bi 67 cDNA fragment (nucleotides 1-382, Genbank Ace. No. U48800) obtained by PCR amplification; and a human VEGF 581 bp cDNA fragment covering base pairs 57-638 (Genbank Ace. No. X15997). Blots were washed under stringent conditions, using techniques standard in the art.
Mouse embryo multiple tissue Northern blot (Clontech Inc.) containing 2 zg of polyadenylated RNAs from 7, 11, 15 and 17 day postcoital embryos was hybridized with mouse VEGF-C cDNA fragment (base pairs 499-656). A mouse adult 62 tissue Northern blot was hybridized with the probes for human VEGF, VEGF-B 6 7 VEGF-C and with a VEGFR-3 cDNA fragment (nucleotides 1-595; Genbank Acc. No.
X68203).
In adult mouse tissues, both 2.4 kb and 2.0 kb mRNA signals were observed with the VEGF-C probe, at an approximately 4:1 ratio. The most conspicuous signals were obtained from lung and heart RNA, while kidney, liver, brain, and skeletal muscle had lower levels, and spleen and testis had barely visible levels. As in the human tissues, VEGF mRNA expression in adult mice was most abundant in lung and heart RNA, whereas the other samples showed less coordinate regulation with VEGF-C expression.
Skeletal muscle and heart tissues gave the highest VEGF-B mRNA levels from adult mice, as previously reported Olofsson et al., Proc. Natl. Acad Sci. (USA), 93:2576-2581 (1996). Comparison with VEGFR-3 expression showed that the tissues where VEGF-C is expressed also contain mRNA for its cognate receptor tyrosine kinase, although in the adult liver VEGFR-3 mRNA was disproportionally abundant.
To provide a better insight into the regulation of the VEGF-C mRNA during embryonic development, polyadenylated RNA isolated from mouse embryos of various gestational ages 11, 15, and 17 day was hybridized with the mouse VEGF-C probe. These analyses showed that the amount of 2.4 kb VEGF-C mRNA is relatively constant throughout the gestational period.
Example Regulation of mRNAs for VEGF family members by serum, interleukin-1 and dexamethasone in human fibroblasts in culture Human IMR-90 fibroblasts were grown in DMEM medium containing FCS and antibiotics. The cells were grown to 80% confluence, then starved for 48 hours in 0.5 FCS in DMEM. Thereafter, the growth medium was changed to DMEM containing 5% FCS, with or without 10 ng/ml interleukin-1 (IL-1) and with or without 1 mM dexamethasone. The culture plates were incubated with these additions for the times indicated, and total cellular RNA was isolated using the TRIZOL kit (GIBCO-BRL).
About 20 pg of total RNA from each sample was electrophoresed in 1.5% formaldehydeagarose gels as described in Sambrook et al., supra (1989). The gel was used for 63 Northern blotting and hybridization with radiolabeled insert DNA from the human VEGF clone (a 581 bp cDNA covering bps 57-638, Genbank Acc. No. 15997) and a human
VEGF-B
67 cDNA fragment (nucleotides 1-382, Genbank Acc. No. U48800).
Subsequently, the Northern blots were probed with radiolabeled insert from the VEGF-C cDNA plasmid. Primers were labeled using a standard technique involving enzymatic extension reactions of random primers, as would be understood by one of ordinary skill in the art.
The Northern blot analyses revealed that very low levels of VEGF-C and VEGF are expressed by the starved IMR-90 cells as well as cells after 1 hour of stimulation. In contrast, abundant VEGF-B mRNA signal was visible under these conditions. After 4 hours of serum stimulation, there was a strong induction of VEGF-C and VEGF mRNAs, which were further increased in the IL-I treated sample. The effect of IL-I seemed to be abolished in the presence of dexamethasone. A similar pattern of enhancement was observed in the 8 hour sample, but a gradual down-regulation of all signals was observed for both RNAs in the 24 hour and 48 hour samples. In contrast, VEGF-B mRNA levels remained constant and thus showed remarkable stability throughout the time period. The results are useful in guiding efforts to use VEGF-C and its fragments, its antagonists, and anti-VEGF-C antibodies in methods for treating a variety of disorders.
Example 26 Expression and analysis of recombinant murine VEGF-C The mouse VEGF-C cDNA was expressed as a recombinant protein and the secreted protein was analyzed for its receptor binding properties. The binding of mouse VEGF-C to the human VEGFR-3 extracellular domain was studied by using media from Bosc23 cells transfected with mouse VEGF-C cDNA in a retroviral expression vector.
The 1.8 kb mouse VEGF-C cDNA was cloned as an EcoRl fragment into the retroviral expression vector pBabe-puro containing the SV40 early promoter region [Morgenstern et al., Nucl. Acids Res., 18:3587-3595 (1990)], and transfected into the Bosc23 packaging cell line [Pearet et al., Proc. Natl. Acad Sci. (USA), 90:8392-8396 (1994)] by the calcium-phosphate precipitation method. For comparison, Bosc23 cells 64 also were transfected with the previously-described human VEGF-C construct in the pREP7 expression vector. The transfected cells were cultured for 48 hours prior to metabolic labeling. Cells were changed into DMEM medium devoid of cysteine and methionine, and, after 45 minutes ofpreincubation and medium change, Pro-mix' in vitro cell labeling mix (Amersham Corp.), in the same medium, was added to a final concentration of about 120 jsCi/ml. After 6 hours of incubation, the culture medium was collected and clarified by centrifugation.
For immunoprecipitation, 1 ml aliquots of the media from metabolicallylabeled Bosc23 cells transfected with empty vector or mouse or human recombinant VEGF-C, respectively, were incubated overnight on ice with 2 pl of rabbit polyclonal antiserum raised against an N-terminal 17 amino acid oligopeptide of mature human VEGF-C (H- 2 N-EETIKFAAAHYNTEILK) (SEQ ID NO: 8, residues 104-120).
Thereafter, the samples were incubated with protein A sepharose for 40 minutes at 46C with gentle agitation. The sepharose beads were then washed twice with immunoprecipitation buffer and four times with 20 mM Tris-HCI, pH 7.4. Samples were boiled in Laemmli buffer and analyzed by 12.5% sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE).
Immunoprecipitation of VEGF-C from media of transfected and metabolically-labeled cells revealed bands of approximately 30-32x10 3 M, (a doublet) and 22-23x 103 M, in 12.5% SDS-PAGE. These bands were not detected in samples from nontransfected or mock-transfected cells. These results show that antibodies raised against human VEGF-C recognize the corresponding mouse ligand, and provide an indication that the proteolytic processing that occurs to produce murine VEGF-C is analogous to the processing that occurs to produce human VEGF-C.
For receptor binding experiments, 1 ml aliquots of media from metabolically-labeled Bosc23 cells were incubated with VEGFR-3 extracellular domain (see Example covalently coupled to sepharose, for 4 hours at 4'C with gentle mixing.
The sepharose beads were washed four times with ice-cold phosphate buffered saline (PBS), and the samples were analyzed by gel electrophoresis as described in Joukov et al., EMBO 15:290-298 (1996).
Similar 30-32 x 103 M r doublet and 22-23 x 103 M, polypeptide bands were obtained in the receptor binding assay as compared to the immunoprecipitation assay.
65 Thus, mouse VEGF-C binds to human VEGFR-3. The slightly faster mobility of the mouse VEGF-C polypeptides that was observed may be caused by the four amino acid residue difference observed in sequence analysis (residues H88-E91, Fig. The capacity of mouse recombinant VEGF-C to induce VEGFR-3 autophosphorylation was also investigated. For the VEGFR-3 receptor stimulation experiments, subconfluent NIH 3T3-Flt4 cells, Pajusola et al., Oncogene, 9:3545-3555 (1994), were starved overnight in serum-free medium containing 0.2% BSA. In general, the cells were stimulated with the conditioned medium from VEGF-C vector-transfected cells for 5 minutes, washed three times with cold PBS containing 200 /M vanadate, and lysed in RIPA buffer for immunoprecipitation analysis. The lysates were centrifuged for minutes at 16000 x g and the resulting supernatants were incubated for 2 hours on ice with the specific antisera, followed by immunoprecipitation using protein A-sepharose and analysis in 7% SDS-PAGE. Polypeptides were transferred to nitrocellulose and analyzed by immunoblotting using anti-phosphotyrosine (Transduction Laboratories) and antireceptor antibodies, as described by Pajusola et al., Oncogene, 9:3545-3555 (1994). Filter stripping was carried out at 50'C for 30 minutes in 100 mM 2-mercaptoethanol, 2% SDS, 62.5 mM Tris-HCI, pH 6.7, with occasional agitation. The results of the experiment demonstrated that culture medium containing mouse VEGF-C stimulates the autophosphorylation of VEGFR-3 to a similar extent as human baculoviral VEGF-C or the 2 0 tyrosyl phosphatase inhibitor pervanadate.
Mouse VEGF-C appeared to be a potent inducer of VEGFR-3 autophosphorylation, with the 195x10 3 M, precursor and proteolytically-cleaved 125 x 103 M, tyrosine kinase polypeptides of the receptor (Pajusola et al., Oncogene, 9:3545-3555 (1994)), being phosphorylated.
VEGFR-2 stimulation was studied in subconfluent porcine aortic endothelial (PAE) cells expressing KDR (VEGFR-2) (PAE-VEGFR-2) [Waltenberger et al., J. Biol. Chem., 269:26988-26995 (1994)], which were starved overnight in serum-free medium containing 0.2% BSA. Stimulation was carried out and the lysates prepared as described above. For receptor immunoprecipitation, specific antiserum for VEGFR-2 [Waltenberger et al., J. Biol. Chem., 269:26988-26995 (1994)] was used. The immunoprecipitates were analyzed as described for VEGFR-3 in 7% SDS-PAGE followed by Western blotting with anti-phosphotyrosine antibodies, stripping of the filter, and re- 66 probing it with anti-VEGFR-2 antibodies (Santa Cruz). VEGFR-2 stimulation was first tried with unconcentrated medium from cells expressing recombinant VEGF-C, but immunoblotting analysis did not reveal any receptor autophosphorylation.
To further determine whether mouse recombinant VEGF-C can also induce VEGFR-2 autophosphorylation as observed for human VEGF-C, PAE cells expressing VEGFR-2 were stimulated with tenfold concentrated medium from cultures transfected with mouse VEGF-C expression vector and autophosphorylation was analyzed. For comparison, cells treated with tenfold concentrated medium containing human recombinant VEGF-C (Joukov et al., (1996)), unconcentrated medium from human VEGF-C baculovirus infected insect cells, or pervanadate (a tyrosyl phosphatase inhibitor) were used. In response to human baculoviral VEGF-C as well as pervanadate treatment, VEGFR-2 was prominently phosphorylated, whereas human and mouse recombinant VEGF-C gave a weak and barely detectable enhancement of autophosphorylation, respectively. Media from cell cultures transfected with empty vector or VEGF-C cloned in the antisense orientation did not induce autophosphorylation of VEGFR-2. Therefore, mouse VEGF-C binds to VEGFR-3 and activates this receptor at a much lower concentration than needed for the activation of VEGFR-2. Nevertheless, the invention comprehends methods for using the materials of the invention to take advantage of the interaction of VEGF-C with VEGFR-2, in addition to the interaction between VEGF-C and VEGFR-3.
Example 27 VEGF-C E104-S213 fragment expressed in Pichia yeast stimulates autoph9sphorylation of FIt4 (VEGFR-3) and KDR (VEGFR-2) A truncated form of human VEGF-C cDNA was constructed wherein (1) the sequence encoding residues of a putative mature VEGF-C amino terminus H 2
N-
E(104)ETIK (SEQ ID NO: 8, residues 104 et seq.) was fused in-frame to the yeast PHO1 signal sequence (Invitrogen Pichia Expression Kit, Catalog #K 1710-01), and a stop codon was introduced after amino acid 213 (H 2 N- RCMS; after codon 213 of SEQ ID NO: The resultant truncated cDNA construct was then inserted into the Pichia pastoris expression vector pHIL-S I (Invitrogen). For the cloning, an internal BgllI site in 67 the VEGF-C coding sequence was mutated without change of the encoded polypeptide sequence.
This VEGF-C expression vector was then transfected into Pichia cells and positive clones were identified by screening for the expression of VEGF-C protein in the culture medium by Western blotting. One positive clone was grown in a 50 ml culture, and induced with methanol for various periods of time from 0 to 60 hours. About 10 pl of medium was analyzed by gel electrophoresis, followed by Western blotting and detection with anti-VEGF-C antiserum, as described above. An approximately 24 kD polypeptide (band spreading was observed due to glycosylation) accumulated in the culture medium of cells transfected with the recombinant VEGF-C construct, but not in the medium of mocktransfected cells or cells transfected with the vector alone.
The medium containing the recombinant VEGF-C protein was concentrated by Centricon 30 kD cutoff ultrafiltration and used to stimulate NIH 3T3 cells expressing Flt4 (VEGFR-3) and porcine aortic endothelial (PAE) cells expressing KDR (VEGFR-2).
The stimulated cells were lysed and immunoprecipitated using VEGFR-specific antisera and the immunoprecipitates were analyzed by Western blotting using anti-phosphotyrosine antibodies, chemiluminescence, and fluorography. As a positive control for maximal autophosphorylation of the VEGFRs, vanadate (VO 4 treatment of the cells for 10 minutes was used. Medium from Pichia cultures secreting the recombinant VEGF-C polypeptide induced autophosphorylation of both Flt41 polypeptides of 195 kD and 125 kD as well as the KDR polypeptide of about 200 kD. Vanadate, on the other hand, induces heavy tyrosyl phosphorylation of the receptor bands in addition to other bands probably coprecipitating with the receptors.
These results demonstrate that a VEGF-homologous domain of VEGF-C consisting of amino acid residues 104E 213S (SEQ ID NO: 8, residues 104-213) can be recombinantly produced in yeast and is capable of stimulating the autophosphorylation of Flt4 (VEGFR-3) and KDR (VEGFR-2). Recombinant VEGF-C fragments such as the fragment described herein, which are capable of stimulating Flt4 or KDR autophosphorylation are intended as aspects of the invention; methods of using these fragments are also within the scope of the invention.
68 Example 28 Properties of the differentially processed forms of VEGF-C The following oligonucleotides were used to generate a set of VEGF-C variants and analogs: TCTCTTCTGTGCTTGAGTTGAG -3'(SEQ ID NO: 15), used to generate VEGF-C R102S (arginine mutated to serine at position 102 (SEQ ID NO: (SEQ ID NO: 16), used to generate VEGF-C R I 02G (arginine mutated to glycine at position 102 (SEQ ID NO: GGCGGCGQGCGCCTCGCGAGGACC (SEQ ID NO: 17), used to generate VEGF- C AN (deletion of N-terminal propeptide corresponding to amino acids 32-102 (SEQ ID NO: CTGQCAGGGAACTGCTAATAATGGAATGAA 3' (SEQ ID NO: 18), used to generate VEGF-C R226,227S (arginine codons mutated to serines at positions 226 and 227 (SEQ IID NO: GATGTGATCGGCGGCGGCGGCGCTCGGAGACC (SEQ ID NO: 19), used to generate VEGF-C NHis (this construct encodes a polypeptide with a 6xHis tag fuised to approximately the N-termninus of the secreted precursor, as described in 2 0 Example 21 (amino acid 33 of SEQ ID NO: Some of the foregoing VEGF-C mutant constructs were fu~rther modified to obtain additional constructs. For example, VEGF-C RI102G in pALTER (Promega) and oligonucleotide 5'-GTATTATAATGTCCTCCACCAJLATTTTATAG (SEQ ID NO: were used to generate VEGF-C 4G, which encodes a polypeptide with four point mutations: RIO2G, Al 10G, AllI 1G, and Al 12G (alanines mutated to glycines at positions 110- 112 (SEQ lID NO: These four mutations are adjacent to predicted sites of cleavage of VEGF-C expressed in PC-3 and recombinantly expressed in 293 EBNA cells.
Another construct was created using VEGF-C AN and oligonucleotide 3 0 GTTCGCTGCCTGACACTGTGGTAGTGTcCGG
GGCCGCTAGTGATGGTGATGTGATGATGTTGAACTTGTCTGTAAAC
ATCCAG (SEQ ID NO: 21) to generate VEGF-C ANAC~is. This construct encodes 69 a polypeptide with a deleted N-terminal propeptide (amino acids 32-102); a deleted Cterminal propeptide (amino acids 226-419 of SEQ ID NO: and an added 6xHis tag at the C-terminus (see SEQ ID NO: 59).
All constructs were further digested with HindIl and NotI, subcloned into HindllINotI digested pREP7 vector, and used to transfect 293 EBNA cells. About 48 hours after transfection, the cells were either metabolically labelled with Pro-mix M as described above, or starved in serum-free medium for 2 days. Media were then collected and used in subsequent experiments. Wild type (wt) VEGF-C, VEGF-C NHis and VEGF- C ANACHis were expressed to similar levels in 293 EBNA cells. At the same time, expression of the VEGF-C 4G polypeptide was considerably lower, possibly due to the changed conformation and decreased stability of the translated product. However, all the above VEGF-C mutants were secreted from the cells.
The conditioned media from the transfected and starved cells were concentrated 5-fold and used to assess their ability to stimulate tyrosine phosphorylation of Flt4 (VEGFR-3) expressed in NIH 3T3 cells and KDR (VEGFR-2) expressed in PAE cells. Wild type (wt) VEGF-C, as well as all three mutant polypeptides, stimulated tyrosine phosphorylation of VEGFR-3. The most prominent stimulation observed was by the short mature VEGF-C ANACHis. This mutant, as well as VEGF-C NHis, also stimulated tyrosine phosphorylation of VEGFR-2. Thus, despite the fact that a major component of secreted recombinant VEGF-C is a dimer of 32/29 kD, the active part of VEGF-C responsible for its binding to VEGFR-3 and VEGFR-2 is localized between amino acids 102 and 226 (SEQ ID NO: 8) of the VEGF-C precursor. Analysis and comparison of binding properties and biological activities of these VEGF-C proteins and mutants, using assays such as those described herein, will provide data concerning the significance of the observed major 32/29 kD and 21-23 kD VEGF-C processed forms.
The data indicate that constructs encoding amino acid residues 103-225 of the VEGF-C precursor (SEQ ID NO: 8) generate a recombinant ligand that is functional for both VEGFR-3 and VEGFR-2.
The data from this and preceding examples demonstrate that numerous fragments of the VEGF-C polypeptide retain biological activity. A naturally occurring VEGF-C polypeptide spanning amino acids 103-226 (or 103-227) of SEQ ID NO: 8, produced by a natural processing cleavage defining the C-terminus, has been shown to be 70 active. Example 27 demonstrates that a fragment with residues 104-213 of SEQ ID NO: 8 retains biological activity.
In addition, data from Example 21 demonstrates that a VEGF-C polypeptide having its amino terminus at position 112 of SEQ ID NO: 8 retains activity.
Additional experiments have shown that a fragment lacking residues 1-112 of SEQ ID NO: 8 retains biological activity.
In a related experiment, a stop codon was substituted for the lysine at position 214 of SEQ ID NO: 8 (SEQ ID NO: 7, nucleotides 991-993). The resulting recombinant polypeptide still was capable of inducing Flt4 autophosphorylation, indicating that a polypeptide spanning amino acid residues 113-213 of SEQ ID NO: 8 is biologically active.
Sequence comparisons of members of the VEGF family of polypeptides provides an indication that still smaller fragments of the polypeptide depicted in SEQ ID NO: 8 will retain biological activity. In particular, eight highly conserved cysteine residues of the VEGF family of polypeptides define a region from residues 131 211 of SEQ ID NO: 8 (see Figure 10) of evolutionary signficance; therefore, a polypeptide spanning from about residue 131 to about residue 211 is expected to retain VEGF-C biological activity.
In fact, a polypeptide which retains the conserved motif RCXXCC a polypeptide comprising from about residue 161 to about residue 211 of SEQ ID NO: 8 is postulated to retain VEGF-C biological activity. To maintain native conformation of these fragments, it may be preferred to retain about 1-2 additional amino acids at the carboxy-terminus and 1- 2 or more amino acids at the amino terminus.
Beyond the preceding considerations, evidence exists that smaller fragments and/or fragment analogs which lack the conserved cysteines nonetheless will retain VEGF-C biological activity. Consequently, the materials and methods of the invention include all VEGF-C fragments, variants, and analogs that retain at least one biological activity of VEGF-C, regardless of the presence or absence of members of the conserved set of cysteine residues.
71 Example 29 Expression of human VEGF-C under the human K14 keratin promoter in transgenic mice induces abundant growth of lymphatic vessels in the skin The Flt4 receptor tyrosine kinase is relatively specifically expressed in the endothelia of lymphatic vessels. Kaipainen et al., Proc. Natl. Acad Sci. (USA), 92: 3566- 3570 (1995). Furthermore, the VEGF-C growth factor stimulates the Flt4 receptor, showing less activity towards the KDR receptor of blood vessels (Joukov et al., EMBO J., 290-298 (1996); See Example 26).
Experiments were conducted in transgenic mice to analyze the specific effects of VEGF-C overexpression in tissues. The human K14 keratin promoter is active in the basal cells of stratified squamous epithelia (Vassar et al., Proc. Natl. Acad. Sci.
(USA), 86:1563-1567 (1989)) and was used as the expression control element in the recombinant VEGF-C transgene. The vector containing the K 14 keratin promoter is described in Vassar et al., Genes Dev., 5:714-727 (1991) and Nelson et al., J. Cell Biol.
97:244-251 (1983).
The recombinant VEGF-C transgene was constructed using the human full length VEGF-C cDNA (GenBank Acc. No. X94216). This sequence was excised from a pCI-neo vector (Promega) with XhoI/NotIl, and the resulting 2027 base pair fragment containing the open reading frame and stop codon (nucleotides 352-1611 of SEQ ID NO: 7) was isolated. The isolated fragment was then subjected to an end-filling reaction using the Klenow fragment of DNA polymerase. The blunt-ended fragment was then ligated to a similarly opened BamHl restriction site in the K14 vector. The resulting construct contained the EcoRI site derived from the polylinker of the pCI-neo vector. This EcoRI site was removed using standard techniques (a Klenow-mediated fill-in reaction following partial digestion of the recombinant intermediate with EcoRI) to facilitate the subsequent excision of the DNA fragment to be injected into fertilized mouse oocytes. The resulting clone, designated K14-VEGF-C, is illustrated in Fig. 20 of commonly-owned PCT patent application PCT/FI96/00427, filed August 01, 1996, published as WO 97/05250.
The EcoRI-HindIII fragment from clone K 14 VEGF-C containing the K 14 promoter, VEGF-C cDNA, and K14 polyadenylation signal was isolated and injected into fertilized oocytes of the FVB-NIH mouse strain. The injected zygotes were transplanted 72 to oviducts of pseudopregnant C57BL/6 x DBA/2J hybrid mice. The resulting founder mice were analyzed for the presence of the transgene by polymerase chain reaction of tail DNA using the primers: 5'-CATGTACGAACCGCCAG-3' (SEQ ID NO: 22) and AATGACCAGAGAGAGGCGAG-3' (SEQ ID NO: 23). In addition, the tail DNAs were subjected to EcoRV digestion and subsequent Southern analysis using the EcoRI-HindIII fragment injected into the mice. Out of 8 pups analyzed at 3 weeks of age, 2 were positive, having approximately 40-50 copies and 4-6 copies of the transgene in their respective genomes.
The mouse with the high copy number transgene was small, developed more slowly than its litter mates and had difficulty eating suckling). Further examination showed a swollen, red snout and poor fur. Although fed with a special liquid diet, it suffered from edema of the upper respiratory and digestive tracts after feeding and had breathing difficulties. This mouse died eight weeks after birth and was immediately processed for histology, immunohistochemistry, and in situ hybridization.
Histological examination showed that in comparison to the skin of littermates, the dorsal dermis ofKl4-VEGF-C transgenic mice was atrophic and connective tissue was replaced by large lacunae devoid of red cells, but lined with a thin endothelial layer. These distended vessel-like structures resembled those seen in human lymphangiomas. The number of skin adnexal organs and hair follicles were reduced. In the snout region, an increased number of vessels was also seen. Therefore, VEGF-C overexpression in the basal epidermis is capable of promoting the growth of extensive vessel structure in the underlying skin, including large vessel lacunae. The endothelial cells surrounding these lacunae contained abundant Flt4 mRNA in in situ hybridization (see Examples 23 and 30 for methodology). The vessel morphology indicates that VEGF-C stimulates the growth of vessels having features of lymphatic vessels. The other K14- VEGF-C transgenic mouse had a similar skin histopathology.
Nineteen additional pups were analyzed at 3 weeks of age for the presence of the VEGF-C transgene, bring the number of analyzed pups to twenty-seven. A third transgene-positive pup was identified, having approximately 20 copies of the transgene in its genome. The 20 copy mouse and the 4-6 copy mouse described above transmitted the gene to 6 out of 11 and 2 out of 40 pups, respectively. The physiology of these additional transgenic mice were further analyzed.
73 The adult transgenic mice were small and had slightly swollen eyelids and poorly developed fur. Histological examination showed that the epidermis was hyperplastic and the number of hair follicles was reduced; these effects were considered unspecific or secondary to other phenotypic changes. The dermis was atrophic (45% of the dermal thickness, compared to 65% in littermate controls) and its connective tissue was replaced by large dilated vessels devoid of red cells, but lined with a thin endothelial cell layer. Such abnormal vessels were confined to the dermis and resembled the dysfunctional, dilated spaces characteristic of hyperplastic lymphatic vessels. See Fossum, et al., J. Vet. Int. Med., 6: 283-293 (1992). Also, the ultrastructural features were reminiscent of lymphatic vessels, which differ from blood vessels by having overlapping endothelial junctions, anchoring filaments in the vessel wall, and a discontinuous or even partially absent basement membrane. See Leak, Microvasc. Res., 2: 361-391 (1970).
Furthermore, antibodies against collagen types IV, XVIII [Muragaki et al., Proc. Nail.
Acad. Sci. USA, 92: 8763-8776 (1995)] and laminin gave very weak or no staining of the vessels, while the basement membrane staining of other vessels was prominent. The endothelium was also characterized by positive staining with monoclonal antibodies against desmoplakins I and II (Progen), expressed in lymphatic, but not in vascular endothelial cells. See Schmelz et al., Differentiation, 57: 97-117 (1994). Collectively, these findings strongly suggested that the abnormal vessels were of lymphatic origin.
In Northern hybridization studies, abundant VEGF-C mRNA was detected in the epidermis and hair follicles of the transgenic mice, while mRNAs encoding its receptors VEGFR-3 and VEGFR-2 as well as the Tie-1 endothelial receptor tyrosine kinase [Korhonen et al., Oncogene, 9: 395-403 (1994)] were expressed in endothelial cells lining the abnormal vessels. In the skin of littermate control animals, VEGFR-3 could be detected only in the superficial subpapillary layer of lymphatic vessels, while VEGFR-2 was found in all endothelia, in agreement with earlier findings. See Millauer et al., Cell, 72: 1-20 (1993); and Kaipainen et al., Proc. Natl. Acad. Sci. USA, 92: 3566-3570 (1995).
The lymphatic endothelium has a great capacity to distend in order to adapt to its functional demands. To determine whether vessel dilation was due to endothelial distension or proliferation, in vitro proliferation assays were conducted. Specifically, to measure DNA synthesis, 3mm x 3mm skin biopsies from four transgenic and four control mice were incubated in D-MEM with 10 micrograms/ml BrdU for 6 hours at 37°C, fixed 74 in 70% ethanol for 12 hours, and embedded in paraffin. After a 30 minute treatment with 0.1% pepsin in 0.1 M HCI at room temperature to denature DNA, staining was performed using mouse monoclonal anti-BrdU antibodies (Amersham). It appeared that the VEGF- C-receptor interaction in the transgenic mice transduced a mitogenic signal, because, in contrast to littermate controls, the lymphatic endothelium of the skin from young K14- VEGF-C mice showed increased DNA synthesis as demonstrated by BrdU incorporation followed by staining with anti-BrdU antibodies. This data further confirms that VEGF-C acts as a true growth factor in mammalian tissues.
In related experiments, a similar VEGF transgene did not induce lymphatic proliferation, but caused enhanced density of hyperpermeable, tortuous blood microvessels instead.
Angiogenesis is a multistep process which includes endothelial proliferation, sprouting, and migration. See Folkman et al., J. Biol. Chem., 267: 10931- 10934 (1992). To estimate the contribution of such processes to the transgenic phenotype, the morphology and function of the lymphatic vessels was analysed using fluorescent microlymphography using techniques known in the art. See Leu et al., Am. J.
Physiol., 267: 1507-1513 (1994); and Swartz et al., Am. J. Physiol., 270: 324-329 (1996).
Briefly, eight-week old mice were anesthetized and placed on a heating pad to maintain a 37 0 C temperature. A 30-gauge needle, connected to a catheter filled with a solution of FITC-dextran 2M (8 mg/mi in PBS), was injected intradermally into the tip of the tail.
The solution was infused with a constant pressure of 50 cm water (averaging roughly 0.01 microliters per minute flow rate) until the extent of network filling remained constant (approximately 2 hours). Flow rate and fluorescence intensity were monitorerd continuously throughout the experiment. In these experiments, a typical honeycomb-like 2 5 network with similar mesh sizes was observed in both control and transgenic mice, but the diameter of lymphatic vessels was about twice as large in the transgenic mice, as summarized in the table below. (The intravital fluorescence microscopy of blood vessels 75 was performed as has been described in the art. See Fukumura et al., Cancer Res., 4824-4829 (1995).) Structural parameters of lymphatic and blood vessel networks transgenic control P-value** diameter 142.3±26.2 68.2±21.7 .0143 horizontal 1003±87.1 960.8±93.1 .2207 lymphatic mesh size*** vessels* Vertical mesh 507.3±58.9 488.8±59.9 .5403 size (n=6) median 8.3±0.6 7.6±1.1 .1213 diameter blood vessels vessel density, 199.2 ±6.6 216.4±20.0 .3017 vessels cm/cm' n=number of aminals mean (tm)±SD **Mann-Whitney test ***mesh size describes vessel density Some dysfunction of the abnormal vessels was indicated by the fact that it took longer for the dextran to completely fill the abnormal vessels. Injection of FITC-dextran into the tail vein, followed by fluorescence microscopy of the ear, showed that the blood vascular morphology was unaltered and leukocyte rolling and adherence appeared normal in the transgenic mice. These results suggest that the endothelial proliferation induced by VEGF-C leads to hyperplasia of the superficial lymphatic network but does not induce the sprouting of new vessels.
These effects of VEGF-C overexpression are unexpectedly specific, especially since, as described in other examples, VEGF-C is also capable of binding to and activating VEGFR-2, which is the major mitogenic receptor of blood vessel endothelial cells. In culture, high concentrations of VEGF-C stimulate the growth and migration of bovine capillary endothelial cells which express VEGFR-2, but not significant amounts of VEGFR-3. In addition, VEGF-C induces vascular permeability in the Miles assay [Miles, 76 A. and Miles, E. J. PhysioL, 118:228-257 (1952); and Udaka, et Proc. Soc.
Exp. Biol. Med., 133:1384-1387 (1970)], presumably via its effect on VEGFR-2. VEGF- C is less potent than VEGF in the Miles assay, 4- to 5-fold higher concentrations of VEGF-C ANACHis being required to induce the same degree of permeability. In vivo, the specific effects of VEGF-C on lymphatic endothelial cells may reflect a requirement for the formation of VEGFR-3xVEGFR-2 heterodimers for endothelial cell proliferation at physiological concentrations of the growth factor. Such possible heterodimers may help to explain how three homologous VEGFs exert partially redundant, yet strikingly specific biological effects.
The foregoing in vivo data indicates utilities for both VEGF-C polypeptides and polypeptide variants and analogs having VEGF-C biological activity, and (ii) anti-VEGF-C antibodies and VEGF-C antagonists that inhibit VEGF-C activity by binding VEGF-C or interfering with VEGF-C/receptor interactions. For example, the data indicates a therapeutic utility for VEGF-C polypeptides in patients wherein growth of lymphatic tissue may be desirable in patients following breast cancer or other surgery where lymphatic tissue has been removed and where lymphatic drainage has therefore been compromised, resulting in swelling; or in patients suffering from elephantiasis). The data indicates a therapeutic utility for anti-VEGF-C antibody substances and VEGF-C antagonists for conditions wherein growth-inhibition of lymphatic tissue may be desirable treatment of lymphangiomas). Accordingly, methods of administering VEGF-C and VEGF-C variants, analogs, and antagonists are contemplated as methods and materials of the invention.
Example Expression of VEGF-C and Flt4 in the Developing Mouse Embryos from a 16-day post-coitus pregnant mouse were prepared and fixed in 4% paraformaldehyde (PFA), embedded in paraffin, and sectioned at 6 Wrm. The sections were placed on silanated microscope slides and treated with xylene, rehydrated, fixed for 20 minutes in 4% PFA, treated with proteinase K (7mg/ml; Merck, Darmstadt, Germany) for 5 minutes at room temperature, again fixed in 4% PFA and treated with acetic anhydride, dehydrated in solutions with increasing ethanol concentrations, dried and used for in situ hybridization.
77 In situ hybridization of sections was performed as described (Vastrik et al., J. Cell BioL, 128:1197-1208 (1995)). A mouse VEGF-C antisense RNA probe was generated from linearized pBluescript II SK+ plasmid (Stratagene Inc.), containing a fragment corresponding to nucleotides 499-979 of mouse VEGF-C cDNA, where the noncoding region and the BR3P repeat were removed by Exonuclease III treatment. The fragment had been cloned into the EcoRI and HindII sites of pBluescript II SK+.
Radiolabeled RNA was synthesized using T7 RNA Polymerase and ["S]-UTP (Amersham, Little Chalfont, UK). About two million cpm of the VEGF-C probe was applied per slide.
After an overnight hybridization, the slides were washed first in 2x SSC and 20-30 mM DDT for 1 hour at 50"C. Treatment continued with a high stringency wash, 4x SSC and mM DTT and 50% deionized formamide for 30 minutes at 65"C followed by RNase A treatment (20 pg/mi) for 30 minutes at 37"C. The high stringency wash was repeated for minutes. Finally, the slides were dehydrated and dried for 30 minutes at room temperature. The slides were dipped into photography emulsion and exposed for 4 weeks.
Slides were developed using Kodak D-16 developer, counterstained with hematoxylin and mounted with Permount (FisherChemical).
For in situ hybridizations of Flt4 sequences, a mouse Flt4 cDNA fragment covering bp 1-192 of the published sequence (Finnerty et al., Oncogene, 8:2293-2298 (1993)) was used, and the above-described protocol was followed, with the following exceptions. Approximately one million cpm of the Flt4 probe were applied to each slide.
The stringent washes following hybridization were performed in lx SSC and 30 mM DTT for 105 minutes.
Darkfield and lightfield photomicrographs from these experiments are presented in commonly-owned PCT patent application PCT/FI96/00427, filed August 01, 1996, incorporated by reference herein. Observations from the photomicrographs are summarized below.
The most prominently Flt4-hybridizing structures appeared to correspond to the developing lymphatic and venous endothelium. A plexus-like endothelial vascular structure surrounding the developing nasopharyngeal mucous membrane was observed.
The most prominent signal using the VEGF-C probe was obtained from the posterior part of the developing nasal conchae, which in higher magnification showed the epithelium surrounding loose connective tissue/forming cartilage. This structure gave a strong in situ 78 hybridization signal for VEGF-C. With the VEGF-C probe, more weakly hybridizing areas were observed around the snout, although this signal is much more homogeneous in appearance. Thus, the expression of VEGF-C is strikingly high in the developing nasal conchae.
The conchae are surrounded with a rich vascular plexus, important in nasal physiology as a source for the mucus produced by the epithelial cells and for warming inhaled air. It is suggested that VEGF-C is important in the formation of the concheal venous plexus at the mucous membranes, and that it may also regulate the permeability of the vessels needed for the secretion of nasal mucus. Possibly, VEGF-C and its derivatives, and antagonists, could be used in the regulation of the turgor of the conchal tissue and mucous membranes and therefore the diameter of the upper respiratory tract, as well as the quantity and quality of mucus produced. These factors are of great clinical significance in inflammatory (including allergic) and infectious diseases of the upper respiratory tract.
Accordingly, the invention contemplates the use of the materials of the invention, including VEGF-C, Flt4, and their derivatives, in methods of diagnosing and treating inflammatory and infectious conditions affecting the upper respiratory tract, including nasal structures.
Example 31 Characterization of the exon-intron organization of the human VEGF-C gene Two genomic DNA clones covering exons 1, 2, and 3 of the human VEGF-C gene were isolated from a human genomic DNA library using VEGF-C cDNA fragments as probes. In particular, a human genomic library in bacteriophage EMBL-3 lambda (Clontech) was screened using a PCR-generated fragment corresponding to nucleotides 629-746 of the human VEGF-C cDNA (SEQ ID NO: One positive clone, designated "lambda was identified, and the insert was subcloned as a 14 kb Xhol fragment into the pBluescript II (pBSK II) vector (Stratagene). The genomic library also was screened with a labeled 130 bp Notl-Sacd fragment from the 5'-noncoding region of the VEGF-C cDNA (the NotI site is in the polylinker of the cloning vector; the SacI site corresponds to nucleotides 92-97 of SEQ ID NO: Two positive clones, designated "lambda 5" and "lambda were obtained. Restriction mapping analysis showed that 79 clone lambda 3 contains exons 2 and 3, while clone lambda 5 contains exon I and the putative promoter region.
Three genomic fragments containing exons 4, 5, 6 and 7 were subcloned from a genomic VEGF-C PI plasmid clone. In particular, purified DNA from a genomic P plasmid clone 7660 (Paavonen et al., Circulation, 93: 1079-1082 (1996)) was used.
EcoRI fragments of the P1 insert DNA were ligated into pBSK II vector. Subclones of clone 7660 which contained human VEGF-C cDNA homologous sequences were identified by colony hybridization, using the full-length VEGF-C cDNA as a probe. Three different genomic fragments were identified and isolated, which contained the remaining exons 4-7.
To determine the genomic organization, the clones were mapped using restriction endonuclease cleavage. Also, the coding regions and exon-intron junctions were partially sequenced. The result of this analysis is depicted in Figures 11 IA and 12.
The sequences of all intron-exon boundaries (Fig. 11 A, SEQ ID NOs: 24-35) conformed to the consensus splicing signals (Mount, Nucl. Acids Res., 10: 459-472 (1982)). The length of the intron between exon 5 and 6 was determined directly by nucleotide sequencing and found to be 301 bp. The length of the intron between exons 2 and 3 was determined by restriction mapping and Southern hybridization and was found to be about 1.6 kb. Each of the other introns is over 10 kb in length.
A similar analysis was performed for the murine genomic VEGF-C gene.
The sequences of murine VEGF-C intron-exon boundaries are depicted in Figure 11 B and SEQ ID NOs: 36-47.
The restriction mapping and sequencing data indicated that the VEGF-C signal sequence and the first residues of the N-terminal propeptide are encoded by exon 1.
The second exon encodes the carboxy-terminal portion of the N-terminal propeptide and the amino terminus of the VEGF homology domain. The most conserved sequences of the VEGF homology domain are distributed in exons 3 (containing 6 conserved cysteine residues) and 4 (containing 2 cys residues). The remaining exons encode cysteine-rich motifs of the type C-6X-C-10X-CRC (exons 5 and 7) and a fivefold repeated motif of type C-6X-B-3X-C-C-C, which is typical of a silk protein.
To further characterize the human VEGF-C gene promoter, the lambda clone was further analyzed. Restriction mapping of this clone using a combination of 80 single- and double-digestions and Southern hybridizations indicated that it includes: an approximately 6 kb region upstream of the putative initiator ATG codon, exon 1, and at least 5 kb of intron I of the VEGF-C gene.
A 3.7 kb Xba I fragment of clone lambda 5, containing exon I and 5' and 3' flanking sequences, was subcloned and further analyzed. As reported previously, a major VEGF-C mRNA band migrates at a position of about 2.4 kb. Calculating from the VEGF-C coding sequence of 1257 bp and a 391 bp 3' noncoding sequence plus a polyA sequence of about 50-200 bp, the mRNA start site was estimated to be about 550-700 bp upstream of the translation initiation codon.
RNase protection assays were employed to obtain a more precise localization of the mRNA start site. The results of these experiments indicated that the RNA start site in the human VEGF-C gene is located 539 bp upstream of the ATG translational initiation codon.
To further characterize the promoter of the human VEGF-C gene, a genomic clone encompassing about 2.4 kb upstream of the translation initiation site was isolated, and the 5' noncoding cDNA sequence and putative promoter region were sequenced. The sequence obtained is set forth in SEQ ID NO: 48. (The beginning of the VEGF-C cDNA sequence set forth in SEQ ID NO: 7 corresponds to position 2632 of SEQ ID NO: 48; the translation initiation codon corresponds to positions 2983-2985 of 2 0 SEQ ID NO: 48.) Similar to what has been observed with the VEGF gene, the VEGF-C promoter is rich in G and C residues and lacks consensus TATA and CCAAT sequences.
Instead, it has numerous putative binding sites (5'-GGGCGG-3' or 5'-CCGCCC-3') for Sp 1, a ubiquitous nuclear protein that can initiate transcription of TATA-less genes. See Pugh and Tjian, Genes andDev., 5:105-119 (1991). In addition, sequences upstream of the VEGF-C translation start site were found to contain frequent consensus binding sites for the AP-2 factor (5'-GCCN 3 GCC-3') and binding sites for the AP-1 factor TKASTCA-3'). Binding sites for regulators of tissue-specific gene expression, like NFkB and GATA, are located in the distant part of VEGF-C promoter. This suggests that the cAMP-dependent protein kinase and protein kinase C, as activators of AP-2 transcription factor [Curran and Franza, Cell, 55:395-397 (1988)], mediate VEGF-C transcriptional regulation.
81 The VEGF-C gene is abundantly expressed in adult human tissues, such as heart, placenta, ovary and small intestine, and is induced by a variety of factors. Indeed, several potential binding sites for regulators of tissue-specific gene expression, like NFkB (5'-GGGRNTYYC-3') and GATA, are located in the distal part of the VEGF-C promoter.
For example, NFkB is known to regulate the expression of tissue factor in endothelial cells. Also, transcription factors of the GATA family are thought to regulate cell-type specific gene expression.
Unlike VEGF, the VEGF-C gene does not contain a binding site for the hypoxia-inducible factor, HIF-1 (Levy et al., J. Biol. Chem., 270: 13333-13340 (1995)).
This finding suggests that if the VEGF-C mRNA is regulated by hypoxia, the mechanism would be based mainly on the regulation of mRNA stability. In this regard, numerous studies have shown that the major control point for the hypoxic induction of the VEGF gene is the regulation of the steady-state level of mRNA. See Levy et al., J. Biol. Chem., 271: 2746-2753 (1996). The relative rate of VEGF mRNA stability and decay is considered to be determined by the presence of specific sequence motifs in its 3' untranslated region (UTR), which have been demonstrated to regulate mRNA stability.
(Chen and Shyu, Mol. Cell Biol., 14: 8471-8482 (1994)). The 3'-UTR of the VEGF-C gene also contains a putative motif of this type (TTATTT), at positions 1873-1878 of SEQ ID NO: 7.
To identify DNA elements important for basal expression of VEGF-C in transfected cells, a set of luciferase reporter plasmids containing serial 5' deletions through the promoter region was constructed. Restriction fragments of genomic DNA containing portions of the first exon were cloned into the polylinker of the pGL3 reporter vector (Promega) and confirmed by sequencing. About 10 gg of the individual constructs in combination with 2 1g of pSV2-1-galactosidase plasmid (used as a control of transfection efficiency) were transfected into HeLa cells using the calcium phosphate-mediated transfection method. Two days after transfection, the cells were harvested and subjected to the luciferase assay. The luciferase activity was normalized to that of the pGL3 control vector driven by SV40 promoter/enhancer.
As depicted in Fig. 3, the 5.5 kb Xhol-RsrlI fragment of clone lambda gave nearly 9-fold elevated activity when compared with a promoterless vector. Deletion of a 5' XhoI-HindIII fragment of 2 kb had no effect on the promoter activity. The activity 82 of the 1.16 kb XbaI-RsrII fragment was about twice that of the pGL3. basic vector, while the activity of the same fragment in the reverse orientation was at background level.
Further deletion of the XbaI-SacI fragment caused an increase in the promoter activity, suggesting the presence of silencer elements in the region from -1057 to -199 199 to 1057 bp upstream from the transcription initiation site). The shortest fragment (SacII- Rsrll) yielded only background activity, which was consistent with the fact that the mRNA initiation site was not present in this construct.
To determine whether further sequences in the first exon of human VEGF- C are important for basal expression, an RsrII fragment spanning nucleotides 214-495 214-495 bp downstream from the transcription initiation site) was subcloned in between of Xbal-RsrII fragment and the luciferase reporter gene. Indeed, the obtained construct showed an 50 increase in activity when compared with the Xbal-RsrII construct.
The VEGF gene has been shown to be up-regulated by a number of stimuli including serum derived growth factors. To find out whether the VEGF-C gene also can be stimulated by serum, RNA from serum-starved and serum-stimulated HT1080 cells was subjected to primer extension analysis, which demonstrated that VEGF-C mRNA is upregulated by serum stimulation.
Additional serum stimulation experiments indicated that the serum stimulation leads to increased VEGF-C promoter activity. Cells were transfected as described above and 24 h after transfection changed into medium containing 0.5% bovine serum albumin. Cells were then stimulated with 10 fetal calf serum for 4 hours and analyzed. The XbaI-RsrII promoter construct derived from lambda 5 yielded a twofold increased activity upon serum stimulation, while the same fragment in the reverse orientation showed no response. All other promoter constructs also showed upregulation, ranging from 1.4 to 1.6 fold (Fig. 3).
Example 32 Identification of a VEGF-C splice variant As reported in Example 16, a major 2.4 kb VEGF-C mRNA and smaller amounts of a 2.0 kb mRNA are observable. To clarify the origin of these RNAs, several additional VEGF-C cDNAs were isolated and characterized. A human fibrosarcoma cDNA library from HT1080 cells in the lambda gtl 1 vector (Clontech, product 83 #HL 048b) was screened using a 153 bp human VEGF-C cDNA fragment as a probe as described in Example 10. See also Joukov el al., EMBO 15:290-298 (1996). Nine positive clones were picked and analyzed by PCR amplification using oligonucleotides 5'-CACGGCTTATGCAAGCAAAG-3' (SEQ ID NO: 49) and 5'-AACACAGTTTTCCATAATAG-3' (SEQ ID NO: 50) These oligonucleotides were selected to amplify the portion of the VEGF-C cDNA corresponding to nucleotides 495-1661 of SEQ ID NO: 7. PCR was performed using an annealing temperature of and 25 cycles.
The resultant PCR products were elefctrophoresed on agarose gels. Five clones out of the nine analyzed generated PCR fragments of the expected length of 1147 base pairs, whereas one was slightly shorter. The shorter fragment and one of the fragments of expected length were cloned into the pCRTMII vector (Invitrogen) and analyzed by sequencing. The sequence revealed that the shorter PCR fragment had a deletion of 153 base pairs, corresponding to nucleotides 904 to 1055 of SEQ ID NO: 7.
These deleted bases correspond to exon 4 of the human and mouse VEGF-C genes, schematically depicted in Figs. 13A and 13B. Deletion of exon 4 results in a frameshift, which in turn results in a C-terminal truncation of the full-length VEGF-C precursor, with fifteen amino acid residues translated from exon 5 in a different frame than the frame used to express the full-length protein. Thus, the C-terminal amino acid sequence of the resulting truncated polypeptide would be -Leu (181)-Ser-Lys-Thr-Val-Ser-Gly-Ser-Glu- Gln-Asp-Leu-Pro-His-Glu-Leu-His-Val-Glu(199) (SEQ ID NO: 51). The polypeptide encoded by this splice variant would not contain the C-terminal cleavage site of the VEGF-C precursor. Thus, a putative alternatively spliced RNA form lacking conserved exon 4 was identified in HT-1080 fibrosarcoma cells and this form is predicted to encode a protein of 199 amino acid residues, which could be an antagonist of VEGF-C.
Example 33 VEGF-C is similarly processed in different cell cultures in vitro To study whether VEGF-C is similarly processed in different cell types, 293 EBNA cells, COS-1 cells and HT-1080 cells were transfected with wild type human VEGF-C cDNA and labelled with Pro-Mix M as described in Example 22. The conditioned media from the cultures were collected and subjected to immunoprecipitation using 84 antiserum 882 (described in Example 21, recognizing a peptide corresponding to amino acids 104-120 of SEQ ID NO: The immunoprecipitated polypeptides were separated via SDS-PAGE, and detected via autoradiography. The major form of secreted recombinant VEGF-C observed from all cell lines tested is a 29/32 kD doublet. These two polypeptides are bound to each other by disulfide bonds, as described in Example 22. A less prominent band of approximately 21 kD also was detected in the culture media.
Additionally, a non-processed VEGF-C precursor of 63 kDa was observed. This form was more prominent in the COS-I cells, suggesting that proteolytic processing of VEGF-C in COS cells is less efficient than in 293 EBNA cells. Endogenous VEGF-C (in nontransfected cells) was not detectable under these experimental conditions in the HT-1080 cells, but was readily detected in the conditioned medium of the PC-3 cells. Analysis of the subunit polypeptide sizes and ratios in PC-3 cells and 293 EBNA cells revealed strikingly similar results: the most prominent form was a doublet of 29/32 kDa and a less prominent form the 21 kD polypeptide. The 21 kD form produced by 293 EBNA cells was not recognized by the 882 antibody in the Western blot, although it is recognized when the same antibody is used for immunoprecipitation (see data in previous examples). As reported in Example 21, cleavage of the 32 kD form in 293 EBNA cells occurs between amino acid residues 111 and 112 (SEQ ID NO: downstream of the cleavage site in PC-3 cells (between residues 102 and 103). Therefore, the 21 kD form produced in 293 EBNA cells does not contain the complete N-terminal peptide used to generate antiserum 882. In a related experiment, PC-3 cells were cultured in serum-free medium for varying periods of time (1 8 days) prior to isolation of the conditioned medium. The conditioned medium was concentrated using a Centricon device (Amicon, Beverly, USA) and subjected to Western blotting analysis using antiserum 882. After one day of culturing, a prominent 32 kD band was detected. Increasing amounts of a 21-23 kD form were detected in the conditioned media from 4 day and 8 day cultures. The diffuse nature of this polypeptide band, which is simply called the 23 kD polypeptide in example 5 and several subsequent examples, is most likely due to a heterogenous and variable amount of glycosylation. These results indicate that, initially, the cells secrete a 32 kD polypeptide, which is further processed or cleaved in the medium to yield the 21-23 kD form. The microheterogeneity of this polypeptide band would then arise from the variable glycosylation degree and, from microheterogeneity of the processing cleavage sites, such 85 as obtained for the amino terminus in PC-3 and 293 EBNA cell cultures. The carboxyl terminal cleavage site could also vary, examples of possible cleavage sites would be between residues 225-226, 226-227 and 227-228 as well as between residues 216-217.
Taken together, these data suggest the possibility that secreted cellular protease(s) are responsible for the generation of the 21-23 kD form of VEGF-C from the 32 kD polypeptide. Such proteases could be used in vitro to cleave VEGF-C precursor proteins in solution during the production of VEGF-C, or used in cell culture and in vivo to release biologically active VEGF-C.
Example 34 Differential binding of VEGF-C forms by the extracellular domains of VEGFR-3 and VEGFR-2 In two parallel experiments, 293 EBNA cells were transfected with a construct encoding recombinant wild type VEGF-C or a construct encoding VEGF-C ANACHis (Example 28) and about 48 hours after transfection, metabolically labelled with Pro-Mix m as described in previous examples. The media were collected from mocktransfected and transfected cells and used for receptor binding analyses.
Receptor binding was carried out in binding buffer (PBS, 0.5% BSA, 0.02% Tween 20, 1 microgram/ml heparin) containing approximately 0.2 microgram of either a fusion protein comprising a VEGFR-3 extracellular domain fused to an immunoglobulin sequence (VEGFR-3-1g) or a fusion protein comprising VEGFR-2 extracellular domain fused to an alkaline phosphatase sequence (VEGFR-2-AP; Cao et al., J. Biol. Chem. 271:3154-62 (1996)). As a control, similar aliquots of the 293 EBNA conditioned media were mixed with 2 il of anti-VEGF-C antiserum (VEGF-C IP).
After incubation for 2 hours at room temperature, anti-VEGF-C antibodies and VEGFR-3-Ig protein were adsorbed to protein A-sepharose (PAS) and VEGFR-2-AP was immunoprecipitated using anti-AP monoclonal antibodies (Medix Biotech, Genzyme Diagnostics, San Carlos, CA, USA) and protein G-sepharose. Complexes containing VEGF-C bound to VEGFR-3-Ig or VEGFR-2-AP were washed three times in binding buffer, twice in 20 mM Tris-HCl (pH 7.4) and VEGF-C immunoprecipitates were washed three times in RIPA buffer and twice in 20 mM tris-HCl (pH 7.4) and analyzed via SDS- PAGE under reducing and nonreducing conditions. As a control, the same media were 86 precipitated with antiAP and protein G-sepharose (PGS) or with PAS to control for possible nonspecific adsorption.
These experiments revealed that VEGFR-3 bound to both the 32/29 kD and 21-23 kD forms of recombinant VEGF-C, whereas VEGFR-2 bound preferentially to the 21-23 kD component from the conditioned media. In addition, small amounts of 63 kD and 52 kD VEGF-C forms were observed binding with VEGFR-3. Further analysis under nonreducing conditions indicates that a great proportion of the 21-23 kD VEGF-C bound to either receptor does not contain interchain disulfide bonds. These findings reinforce the results that VEGF-C binds VEGFR-2. This data suggests a utility for recombinant forms of VEGF-C which are active towards VEGFR-3 only or which are active towards both VEGFR-3 and VEGFR-2. On the other hand, these results, together with the results in Example 28, do not eliminate the possibility that the 32/29 kD dimer binds VEGFR-3 but does not activate it. The failure of the 32/29 kD dimer to activate VEGFR-3 could explain the finding that conditioned medium from the N-His VEGF-C transfected cells induced a less prominent tyrosine phosphorylation of VEGFR-3 than medium from VEGF-C ANACHis transfected cells, even though expression of the former polypeptide was much higher. Stable VEGF-C polypeptide mutants that bind to a VEGF- C receptor but fail to activate the receptor are useful as VEGF-C antagonists.
Example Discovery of VEGF-C analogs that selectively bind to and activate VEGFR-3, but not VEGFR-2 To further identify the cysteine residues of VEGF-C that are critical for retaining VEGF-C biological activities, an additional VEGF-C mutant, designated VEGF- CANACHisC156S, was synthesized, in which the cysteine residue at position 156 of the 419 amino acid VEGF-C precursor (SEQ ID NO: 8; Genbank accession number X94216) was replaced with a serine residue.
The mutagenesis procedure was carried out using the construct of VEGF- CANACHis (see Example 28), cloned in the pALTER vector, and the Altered sites II in vitro mutagenesis system of Promega. An oligonucleotide GACGGACACAGATGGAGGTTTAAAG-3' (SEQ ID NO: 52) was used to introduce the desired mutation in the cDNA encoding VEGF-CANACHis. The resulting mutated 87 VEGF-C cDNA fragment was subcloned into the HindllI/NotI sites of the pREP-7 vector (Invitrogen), and the final construct was re-sequenced to confirm the C156S mutation.
The resultant clone has an open reading frame encoding amino acids 103-225 of SEQ ID NO: 8 (with a serine codon at position 156), and further encoding a 6xHis tag.
The wildtype VEGF-C cDNA and three VEGF-C mutant constructs (VEGF-C R226,227S, VEGF-C ANACHis, and VEGF-C ANACHisC156S) were used to transfect 293 EBNA cells, which were subcultured 16 hours after transfection. About 48 hours after transfection, the media were changed to DMEM/0.1% BSA, and incubation in this medium was continued for an additional 48 hours. The resultant conditioned media were concentrated 30-fold using Centriprep-10 (Amicon), and the amount of VEGF-C in the media was analyzed by Western blotting using the anti-VEGF-C antiserum 882 for immunodetection. Different amounts of the recombinant VEGF-C ANACHis, purified from a yeast expression system, were analyzed in parallel as reference samples to measure and equalize the VEGF-C concentrations in the conditioned media. The conditioned medium from mock-transfected cells was used to dilute the VEGF-C conditioned media to achieve equal concentrations.
An aliquot of the transfected cells were metabolically labelled for 6 hours with 100 microcuries/ml of the PRO-MIX T m
L-[
S
in vitro cell labelling mix (Amersham). The conditioned media were collected, and binding of the radioactively labelled VEGF-C proteins to the extracellular domains of VEGFR-3 and VEGFR-2 was analyzed using recombinantly produced VEGFR-3EC-Ig and VEGFR-2EC-Ig constructs (containing seven and three Ig loops of the extracellular domains of the respective receptors, fused to an immunoglobulin heavy chain constant region).
All processed VEGF-C forms secreted to the culture medium bound to VEGFR-3EC domain, with preferential binding of the 21 kDa form. When present at high concentrations, the VEGF-C forms of 58 kDa and 29/31 kDa bound to some extent nonspecifically to protein A Sepharose.
The VEGFR-2EC domain preferentially bound the mature 21 kDa form of wildtype VEGF-C and VEGF-CANACHis. Significantly, VEGF-CANACHisC156S failed to bind the VEGFR2-EC.
Next, the ability of the above-described VEGF-C polypeptides to compete with the 125 s-VEGF-CANACHis for binding to VEGFR-2 and VEGFR-3 was analyzed.
88 Scatchard analysis using VEGF-C ACANHis provided indications of the VEGF-C binding affinity for VEGFR-3 (K,=135 pM) and VEGFR-2 (KD=410 pM). Ten micrograms of the purified yeast VEGF-C ANACHis was labeled using 3 mCi of Iodine-125, carrier-free (Amersham), and an lodo-Gen lodination Reagent (Pierce), according to the standard protocol of Pierce. The resulting specific activity of the labeled VEGF-CANACHis was 1.25x10 5 cpm/ng.
To study receptor binding, PAE/VEGFR-2 and PAE/VEGFR-3 cells were seeded into 24-well tissue culture plates (Nunclon), which had been coated with 2% gelatin in PBS. The 25 1-VEGF-C ANACHis (2x10 5 cpm) and different amounts of media containing equal concentrations of the non-labeled VEGF-C (wildtype and mutants) were added to each plate in Ham's F12 medium, containing 25 mM HEPES (pH 0.1% BSA, and 0.1% NaN 3 The binding was allowed to proceed at room temperature for minutes. The plates were then transferred onto ice and washed three times with ice-cold PBS containing 0.1% BSA. The cells were then lysed in 1 M NaOH, the lysates were collected, and the radioactivity was measured using a y-counter. Binding in the presence of VEGF-C-containing conditioned medium was calculated as a percentage of binding observed in parallel control studies wherein equal volumes of medium from mocktransfected cells were used instead of VEGF-C conditioned media.
As shown in Fig. 4A, all VEGF-C mutants displaced 2 sI-VEGF- CANACHis from VEGFR-3. The efficiency of displacement was as follows: VEGF- CANACHisC 156S VEGF-CANACHis wildtype VEGF-C VEGF-CR226,227S.
These results indicate that enhanced binding to VEGFR-3 was obtained upon "recombinant maturation" of VEGF-C. Recombinant VEGF165 failed to displace VEGF- C from VEGFR-3.
VEGF, VEGF-CANACHis, and wildtype VEGF-C all efficiently displaced labeled VEGF-CANACHis from VEGFR-2, with VEGF-CANACHis being more potent when compared to wildtype VEGF-C (Fig. 4B). The non-processed VEGF-C R226,227S showed only weak competition of 2 I-VEGF-CANACHis.
Surprisingly, VEGF-CANACHisRI56S failed to displace VEGF- CANACHis from VEGFR-2, thus confirming the above described results obtained using a soluble extracellular domain of VEGFR-2.
89 The ability of the above mentioned VEGF-C forms to stimulate tyrosine phosphorylation of VEGFR-3 and VEGFR-2 was also investigated. Importantly, identical dilutions of the conditioned media were used for these experiments and for the competitive binding experiments described above. A Western blot analysis of the conditioned media using anti-VEGF-C antiserum 882 was performed to confirm the approximately equal relative amounts of the factors present.
The stimulation of VEGFR-3 and VEGFR-2 autophosphorylation by the different VEGF-C forms in general correlated with their binding properties, as well as with the degree of "recombinant processing" of VEGF-C. The VEGF-CANACHisC 156S appeared to be at least as potent as VEGF-CANACHis in stimulating VEGFR-3 autophosphorylation. VEGF-CANACHis showed a higher potency when compared to wildtype VEGF-C in its ability to stimulate tyrosine autophosphorylation of both VEGFR- 2 and VEGFR-3. The VEGF-CR226,227S conditioned medium possessed a considerably weaker effect on autophosphorylation of VEGFR-3, and almost no effect on VEGFR-2 autophosphorylation.
Stimulation of VEGFR-2 tyrosine phosphorylation by VEGF- CANACHisC 56S did not differ from that of conditioned medium from the mock transfected cells, thus confirming the lack of VEGFR-2-binding and VEGFR-2-activating properties of this mutant.
The ability of VEGF-C ANACHisCI56S to alter vascular permeability in vivo was analyzed using the Miles assay (see Example 29). The recombinant VEGF-C forms assayed (ANACHis, ANACHisCI 56S) were produced by 293 cells, purified from conditioned media using Ni-NTA Superflow resin (QIAGEN) as previously described, and pretreated with 15 pg/ml of anti-human VEGF neutralizing antibody (R&D systems) to neutralize residual amounts of co-purified, endogenously produced VEGF. Eight picomoles of the various VEGF-C forms, as well as 2 pmol of recombinant human VEGF165 (R&D systems) and approximately 2 pmol of VEGF165 from the conditioned medium which were either non-treated or pretreated with the above mentioned VEGFneutralizing antibody were injected subcutaneously to the back region of a guinea pig. The 3 0 area of injection was analyzed 20 minutes after injections. Both VEGF and VEGF-C ANACHis caused increases in vascular permeability, whereas ANACHisC 156S did not affect vascular permeability. The neutralizing antibody completely blocked permeability 90 activity of VEGF but did not affect VEGF-C activity. Residual permeability activity observed for the VEGF-containing conditioned medium even after its treatment with VEGF neutralizing antibody was presumably caused by permeability factors other than VEGF that are produced by 293 cells.
In yet another assay, the ability of VEGF-CANACHis and VEGF- CANACHisC I 56S to stimulate migration of bovine capillary endothelial cells in a collagen gel was analyzed. The ANACHis form dose-dependently stimulated migration, whereas the ANACHisC 1 56S form had no significant activity in the assay.
The Miles assay also was used to assay the ability of VEGF-C R226,227S (8 pM, pretreated with anti-VEGF antibody) to induce vascular permeability. The results indicated that the ability of VEGF-C R226,227S to induce vascular permeability was much weaker when compared to wildtype and ANACHis forms of VEGF-C. Collectively, this Miles assay data is consistent with the VEGFR-2 binding and autophosphorylation data described above, and indicates that VEGF-C effect on vascular permeability is mediated via VEGFR-2.
Mitogenic signals from growth factor receptors are frequently relayed via the extracellular signal regulated kinases/mitogen activated protein kinases (ERK/MAPK) pathway into the nucleus. Purified recombinant VEGF-CANACHis and VEGF-C ANAC156S produced by a Pichia expression system were used to determine MAPK pathway activation of cells expressing either VEGFR-2 or VEGFR-3. The growth factor treated cells were lysed, and activated MAPK was detected using Western blotting with antibodies against the phosphorylated forms of ERKI and ERK2. At a concentration of 100 ng/ml, VEGF-CANACHis showed rapid activation of the ERK1 and ERK2 MAPK in both VEGFR-2- and VEGFR-3-expressing cells. In contrast, VEGF-CANAC156S activated ERKI and ERK2 exclusively in the VEGFR-3-expressing cells. At the concentrations used, both VEGF-CANACHis and VEGF-C ANAC156S appeared to be equally potent in activating the MAPK through VEGFR-3. The amounts of total MAPK protein were confirmed to be similar in the treated and untreated cells, as shown by staining of the filter with p44/p42 MAPK antibodies made against a synthetic peptide of rat p42.
The foregoing data indicates that proteolytic processing of VEGF-C results in an increase in its ability to bind and to activate VEGFR-3 and VEGFR-2. Non- 91 processed VEGF-C is a ligand and an activator of preferentially VEGFR-3, while the mature 21/23 kDa VEGF-C form is a high affinity ligand and an activator of both VEGFR- 3 and VEGFR-2.
Moreover, replacement of the cysteine residue at position 156 (of prepro- VEGF-C, SEQ ID NO: 8) creates a selective ligand and activator of VEGFR-3. This alteration inactivates the ability of processed VEGF-C to bind to VEGFR-2 and to activate VEGFR-2. Importantly, it is believed that the elimination of the cysteine at position 156 is the alteration responsible for this unexpected alteration in VEGF-C selectivity, and not the substitution of a serine per se. It is expected that replacement of the cysteine at position 156 with other amino acids, or the mere deletion of this cysteine, will also result in VEGF- C analogs having selective biological activity with respect to VEGFR-3. All such replacement and deletion analogs (collectively referred to as VEGF-C ACI, polypeptides) are contemplated as aspects of the present invention. Thus, "VEGF-C AC, 56 polypeptides" of the invention derived from human VEGF-C include polypeptides depicted in SEQ ID NO: 58, fragments of those polypeptides (especially fragments having an amino terminus anywhere between residues 102 and 161 of SEQ ID NO: 58 and a carboxy-terminus anywhere between residues 210 and 228 of SEQ ID NO: 58). "VEGF-C AC, 5 6 polypeptides" of the invention also include the corresponding polypeptides derived from murine, quail, and other wildtype VEGF-C polypeptides.
VEGF-C polypeptides that have the C156S mutation (or functionally equivalent mutations at position 156) and that retain biological activity with respect to VEGFR-3, such as VEGF-C ANACHisC156S, are useful in all of the same manners described above for wildtype VEGF-C proteins and biologically active fragments thereof where VEGFR-3 stimulation is desired. It is contemplated that most biologically active VEGF-C fragments and processing variants, including but not limited to the biologically active fragments and variants identified in preceding examples, will retain VEGF-C biological activity (as mediated through VEGFR-3) when a mutation is introduced.
All such biologically active VEGF-C ACs 6 polypeptides are intended as an aspect of the present invention.
Moreover, VEGF-C forms containing the C 156S mutation or equivalent mutations can be used to distinguish those effects of VEGF-C mediated via VEGFR-3 and VEGFR-2 from those obtained via only VEGFR-3. The ability of such VEGF-C 92 polypeptides to selectively stimulate VEGFR-3 are also expected to be useful in clinical practice, it being understood that selectivity of a pharmaceutical is highly desirable in many clinical contexts. For example, the selectivity of VEGF-C AC, 5 polypeptides for VEGFR- 3 binding suggests a utility for these peptides to modulate VEGF-C biological activities mediated through VEGFR-3, without significant concomitant modulation of blood vessel permeability or other VEGF-C activities that are modulated through VEGFR-2.
The data presented herein also indicates a utility for AC,6 polypeptides that are capable of binding VEGFR-3, but that do not retain biological activity mediated through VEGFR-3. Specifically, such forms are believed to be capable of competing with wildtype VEGF-C for binding to VEGFR-3, and are therefore contemplated as molecules that inhibit VEGF-C-mediated stimulation of VEGFR-3. Because of the AC,, alteration, such polypeptides (especially covalent or noncovalent dimers of such polypeptides) are not expected to bind VEGFR-2. Thus, certain AC, 5 6 polypeptides and polypeptide dimers are expected to have utility as selective inhibitors of VEGF-C biological activity mediated through VEGFR-3 without substantially altering VEGF-C mediated stimulation of VEGFR-2).
In another embodiment of the invention, heterodimers comprising a biologically active VEGF-C polypeptide in association with a ACm polypeptide are contemplated. It is contemplated that such heterodimers can be formed in vitro, as described below in Example 37, or formed in vivo with endogenous VEGF-C following administration of a AC 1 polypeptide. Such heterodimers are contemplated as modulators of VEGF-C mediated effects in cells where the biological effects of VEGF-C are mediated through VEGFR-2/VEGFR-3 heterodimers. VEGF-C AC,, polypeptides in homodimers or in heterodimers with wt VEGF-C might selectively inhibit the ability of the latter to induce VEGF-like effects, particularly to increase the vascular permeability.
Replacement of the second and/or the fourth of the eight conserved cysteine residues of VEGF abolishes VEGF dimer formation and VEGF biological activity.
The analogous effect was investigated for VEGF-C, wherein the cysteines at positions 156 and 165 of SEQ ID NO: 8 correspond to the second and fourth conserved cysteines. No homodimers were obtained when VEGF-CANACHisCl56,165S Cys,s 6 and Cys, 6 both replaced with serine residues) or in VEGF-CANACHisC165S were chemically crosslinked. On the other hand, about half of both crosslinked VEGF-CANACHis and 93 VEGF-CANACHisC156S migrated as dimers. This data indicates that VEGF- CANACHisC156S forms homodimers. Moreover, unlike VEGF-CANACHis, which forms preferentially non-covalently bound dimers, a fraction of VEGF-CANACHisC156S was disulfide bonded, as detected by SDS-PAGE in non-reducing conditions. In receptor binding studies (using procedures such as those described above), the C165S and C156,165S forms were both unable to bind VEGFR-3 or VEGFR-2. Collectively, these data suggest that homodimerization is required for VEGFR-3 activation by VEGF-C, and indicate that the inability of ANAC156S to activate VEGFR-2 and to induce VEGF-like effects is not due to an inability of this mutant to form homodimers.
Example 36 Utility for VEGF-C in promoting myelopoiesis The effects of VEGF-C on hematopoiesis were also analyzed. Specifically, leukocytes populations were analyzed in blood samples taken from the F1 transgenic mice described in Example 29, and from their non-transgenic littermates. Leukocyte population data from these mice and from non-transgenic FVB-NIH control mice the strain used to generate the transgenic mice) are set forth in the tables below.
FVB/NIH MICE male male female male 5.5 9.5 Cell Type months months months months mean±o Lymphocytes 72.20% 82.17% 84.25% 74.25% 78.22±5.10 Neutrophils 23.00% 15.17% 14.25% 22.25% 18.67±3.98 Monocytes 0.65% 1.00% 0.25% 0.50% 0.60±0.27 Eosinophils 2.15% 1.70% 1.25% 3.00% 2.03±0.65 Basophils 0.00% 0.00% 0.00% 0.00% 0-0 94 VEGF-C TRANSGENIC MICE male male male Cell Type 2 months 3.5 months 7 months mean*uY Lymphocytes 41.3% 41.50% 18.70% 33.83±10.70 Neutrophils 55.3% 53.80% 80.17% 63.09-+12.09 Monocytes 2.16% 1.30% 0.67%h 1.38±0.61 Eosinophils 3.50% .50% 1.72±1.29 Baohlsg 0.1 00% 0.000% 0+0 VEGF-C NEGATIVE CONTROL -MICE (NON-TRANSGENIC L1T1'ERMATES OF VEGF-C TRANSGENIC CE male male male male 2 months 2 months 3.5 7 Cell Tye___ ___months months mean~a Lymphocytes 89.00% 67.50% 9 1. 00%r 71.30% 79.7*10.41 Neutrophils 7.75% 23.00% 7.00% 23.75% 15.38W.01 Monocytes 1.50%/ 0.50%/ 0.83% 0.75% 0.9010.37 Eosinophils 1.50% 9.00%h 0.67% 14.00% 3.79*3.25 Basophils 1 0.00% 0.00%/0 0.50% 0. 50 0 h/ 0.25+0.2 As the foregoing data indicates, the overexpression of VEGF-C in the skin of the transgenic mice correlates with a distinct alteration in leukocyte populations. Notably, the measured populations of neutrophils were markedly increased in the transgenic mice. One.
explanation for the marked increase in neutrophils is a myelopoietic activity attributable to VEGF-C. A VEGF-C influence on leukocyte trafficking in and out of tissues also may effect observed neutrophil populations. Fluorescence-activated cell sorting analysis, performed on isolated human bone marrow and umbilical cord blood CD34-positiVe hematopoictic cells, demonstrated that a fraction of these cells are positive for Flt4 (VEGFR-3). Thus, the VEGF-C effect on myelopoiesis may be.exerted through this VBGFR-3-posifive cell population and its receptors. In any case, the foregoing data indicates a use for VEFG-C polypeptides to increase granulocyte (and, in particular, 95 neutrophil) counts in human or non-human subjects, in order to assist the subject fight infectious diseases. The exploitation of the myelopoietic activity of VEGF-C polypeptides is contemplated both in vitro in cell culture) and in vivo, as a sole myelopoietic agent and in combination with other effective agents granulocyte colony stimulating factor
(G-CSF)).
Additional studies of the myelopoietic effect of VEGF-C, using VEGF-C mutants VEGF-C AC, polypeptides, VEGF-C ANACHis, VEGF-C R226,227S) having altered VEGFR-2 binding affinities, will elucidate whether this effect is mediated through VEGFR-2, VEGFR-3, or both receptors, for example. The results of such analysis will be useful in determining which VEGF-C mutants have utility as myelopoietic agents and which have utility as agents for inhibiting myelopoiesis.
Example 37 Generation of Heterodimers consisting of members of the VEGF family of growth factors Both naturally-occurring and recombinantly-produced heterodimers of polypeptides of the PDGF/VEGF family of growth factors have been shown to exist in nature and possess mitogenic activities. See, Cao et al., J. Biol. Chem., 271:3154-62 (1996); and DiSalvo, et al., J.Biol.Chem., 270:7717-7723 (1995). Heterodimers comprising a VEGF-C polypeptide may be generated essentially as described In Cao et al.
(1996), using recombinantly produced VEGF-C polypeptides, such as the VEGF-C polypeptides described in the preceding examples. Briefly, a recombinantly produced VEGF-C polypeptide is mixed at an equimolar ratio with another recombinantly produced polypeptide of interest, such as a VEGF, VEGF-B, PIGF, PDGFa, PDGFP, or c-fos induced growth factor polypeptide. (See, Cao et al. (1990); Collins et al., Nature, 316:748-750 (1985) (PDGF-P, GenBank Ace. No. X0281 Claesson-Welsh et al., Proc.
Natl. Acad. Sci. USA, 86(13):4917-4921 (1989) (PDGF-a, GenBank Ace. No. M22734);.
Claesson-Welsh et al., Mol. Cell. Biol. 8:3476-3486 (1988) (PDGF-P, GenBank Ace. No.
M21616); Olofsson et al., Proc. Natl. Acad. Sci. (USA), 93:2576-2581 (1996) (VEGF-B, GenBank Ace. No. U48801); Maglione et al., Proc. Natl. Acad. Sci. (USA), 88(20):9267- 9271 (1996) (PIGF, GenBank Ace. No. X54936); Heldin et al., Growth Factors, 8:245- 252 (1993); Folkman, Nature Med., 1:27-31 (1995); Friesel et al., FASEBJ., 9:919-25 96 (1995); Mustonen et al., J. Cell. BioL, 129:895-98 (1995); Orlandini, Proc. Nail.
Acad. Sci. USA, 93(21):11675-11680 (1996); and others cited elsewhere herein. The mixed polypeptides are incubated in the presence of guanidine-HCI and DTT. The thiol groups are then protected with S-sulfonation, and the protein is dialyzed overnight, initially against urea/glutathione-SH, glutathione-S-S-glutathione, and subsequently against 20 mM Tris-HCl.
In a preferred embodiment, a variety of differently processed VEGF-C forms and VEGF-C variants and analogs, such as the ones described in the preceding examples, are employed as the VEGF-C polypeptide used to generate such heterodimers.
Thereafter, the heterodimers are screened to determine their binding affinity with respect to receptors of the VEGF/PDGF family (especially VEGFR-1, VEGFR-2, and VEGFR-3), and their ability to stimulate the receptors assaying for dimer-stimulated receptor phosphorylation in cells expressing the receptor of interest on their surface). The binding assays may be competitive binding assays such as those described herein and in the art. In the initial binding assays, recombinantly produced proteins comprising the extracellular domains of receptors are employable, as described in preceding examples for VEGFR-2 and VEGFR-3. Heterodimers that bind and stimulate receptors are useful as recombinant growth factor polypeptides. Heterodimers that bind but do not stimulate receptors are useful as growth factor antagonists. Heterodimers that display agonistic or antagonistic activities in the screening assays are further screened using, endothelial cell migration assays, vascular permeability assays, and in vivo assays. It will also be apparent from the preceding examples that dimers comprising two VEGF-C polypeptides dimers of identical VEGF-C polypeptides as well as dimers of different VEGF-C polypeptides) are advantageously screened for agonistic and antagonistic activities using the same assays.
In one preferred embodiment, VEGF-C ACI, 5 polypeptide is employed to make the dimers. It is anticipated that agonists and antagonists comprising a VEGF-C ACi 56 polypeptide will have increased specificity for stimulating and inhibiting VEGFR-3, without concomitant stimulation or inhibition of VEGFR-2.
In another preferred embodiment, VEGF-C polypeptides wherein the Cterminal proteolytic cleavage site has been altered to reduce or eliminate C-terminal processing VEGF-C R226,227S) is employed to make dimers for screening for inhibitory activity.
97 In yet another preferred embodiment, VEGF-C polypeptides comprising amino-terminal fragments the VEGF-C .15 kD form described herein) of VEGF-C are employed to make dimers.
It is further contemplated that inactivation of only one polypeptide chain in a dimer could be enough to generate an inhibitory molecule, which is demonstrated by the generation of PDGF inhibitory mutant as reported in Vassbotn, Mol. Cell.Bio., 13:4066-4076 (1993). Therefore, in one embodiment, inhibition is achieved by expression in vivo of a polynucleotide a cDNA construct) encoding the heterodimerization partner which is unable to bind (or binds inefficiently) to the receptor, or by direct administration of that monomer in a pharmaceutical composition.
Example 38 Formation and Screening of Useful Recombinant VEGF/VEGF-C genes and polypeptides Amino acid sequence comparison reveals that mature VEGF-C bears structural similarity to VEGFl21 [Tischer etal., Biol. Chem., 266(18): 11947-54 (1991)], with certain noteworthy structural differences. For example, mature VEGF-C contains an unpaired cysteine (position 137 of SEQ ID NO: 8) and is able to form noncovalently bonded polypeptide dimers. In one embodiment of the invention, a VEGF analog is created wherein the unpaired cysteine residue from mature VEGF-C is introduced at an analogous position of VEGF introduced at Leus,, of the human VEGF165 precursor (Fig. 2, Genbank Acc. No. M32977) to generate a VEGF*' mutant designated VEGF L58C). Such an alteration is introduced into the VEGFi65 coding sequence using site-directed mutagenesis procedures known in the art, such as the procedures described above in preceding examples to generate various VEGF-C mutant forms. This VEGF" mutant is recombinantly expressed and is screened (alone and as a heterodimer with other VEGF and VEGF-C forms) for VEGFR-2 and/or VEGFR-3 binding, stimulatory, and inhibitory activities, using in vitro and in vivo activity assays as described elsewhere herein. To form another VEGF analog of the invention, a VEGF 1 mutant is altered to remove a conserved cysteine corresponding to cys of the VEGF165 precursor. Elimination of this cysteine from the VEGF L58C would result in a VEGF analog resembling VEGF-CANACHisCI56S. This VEGF analog is screened for its 98 VEGF-inhibitory activities with respect to VEGFR-2 and/or VEGFR-I and for VEGF-C like stimulatory or inhibitory activities.
Another noteworthy structural difference between VEGF and VEGF-C is the absence in VEGF-C of several basic residues found in VEGF residues Arg 8 s, Lysno and Hisl 2 in the VEGF165 precursor shown in Fig. 2) that have been implicated in VEGF receptor binding. See Keyt et al., Biol. Chem., 271(10):5638-46 (1996). In another embodiment of the invention, codons for basic residues (lys, arg, his) are substituted into the VEGF-C coding sequence at one or more analogous positions by sitedirected mutagenesis. For example, in a preferred embodiment, Glu 7 Thr,,, and Pro,,, in VEGF-C (SEQ ID NO: 8) are replaced with Arg, Lys, and His residues, respectively.
The resultant VEGF-C analogs (collectively termed "VEGF-C'""si polypeptides) are recombinantly expressed and screened for VEGFR-1, VEGFR-2, and VEGFR-3 stimulatory and inhibitory activity. The foregoing VEGF and VEGF-C analogs that have VEGF-like activity, VEGF-C-like activity, or that act as inhibitors of VEGF or VEGF-C, are contemplated as additional aspects of the invention. Polynucleotides encoding the analogs also are intended as aspects of the invention.
EXAMPLE 39 EFFECTS OF VEGF-C ON GROWTH AND DIFFERENTIATION OF HUMAN CD34+ PROGENITOR CELLS IN VITRO Human CD34+ progenitor cells (HPC, 10 x 103) were isolated from bone marrow or cord blood mononuclear cells using the MACS CD34 Progenitor cell Isolation Kit (Miltenyi Biotec, Bergish Gladbach, Germany), according to the instructions of the manufacturer and cultured in RPMI 1640 medium supplemented with L-glutamine mM), penicillin (125 IE/ml), streptomycin (125 pg/ml) and pooled 10 umbilical cord blood (CB) plasma at 37 °C in a humidified atmosphere in the presence of 5% CO 2 for seven days, with or without VEGF-C and with or without one of the combinations of growth factors described below. Each experiment was performed in triplicate. After seven days, total cell number was evaluated in each culture.
In a first set of experiments, VEGF-C was added, at concentrations ranging from 10 ng/ml to 1 pg/m, to the cultures of CB CD34+ HPCs. Cell numbers were evaluated at day 7 of culture. When added as a single factor, 100 ng/ml of VEGF-C was 99 found support the survival and proliferation of only a few CD34+ HPCs under serum-free conditions. With medium alone, most of the cells died within a culture period of 7 days.
However, there were consistently more cells in the cultures provided with the VEGF-C.
A subsequent set of experiments investigated the co-stimulatory effect of VEGF-C in cultures either supplemented with recombinant human stem cell factor (rhSCF, ng/nml PreproTech, Rocky Hill, NY) alone or a combination of granulocyte macrophage colony stimulating factor (rhGM-CSF, 100 ng/ml, Sandoz, Basel, Switzerland) plus SCF.
Addition of VEGF-C to SCF-supplemented cultures resulted in a slight co-stimulatory effect on cell growth of CD34+ cells, and this effect was already observable at a VEGF-C concentration of 10 ng/ml. Addition of VEGF-C to GM-CSF- plus SCF-supplemented cultures clearly increased cell yields after 7 days of culture, with an optimum VEGF-C concentration of 100 ng/ml. Additional experiments were conducted to analyze the costimulatory effects of 100 ng/ml VEGF-C on total cell yields of serum-free cultures of CB CD34+ HPC cells supplemented with either GM-CSF alone, IL-3 (rhIL-3, 100 U/ml, Behring AG, Marburg, Germany) alone; or a combination of GM-CSF plus IL-3. The results are shown below in the following table:
S.'
100 Total cell number (E x 10') after a culture period of 7 days in RPMI CBPL, specified growth factors with or without ()VEGF-C.
(Cell number at day 0 Growth experiment -VEGF-C
VEGF-C
Factor(s) number 1 I11 GM-CSF 2 10 17 3 19 mean±SE 13.3±2.8 19.0*3.1* 1 113 130 2 107 113 11-3 3 200 433 4. 45 mean±SE 116.2-131.9 191.5±80.9 GM-CSF 1150 160 2 130 140 IL-3 3 140 155 mean±SE 140.0±5.7 151.7+L6.0* GM-CSF 1 31 37 2 60 227 SCIF 3 47 mean±SE 46.0*8.3 104.7±61.3 *mean±SE; p=0.02 As depicted in the table, VEGF-C led to a consistent enhancement of cell growth when added as a supplement to each growth factor or combination of growth factors tested.
Effect of VEGF C on anulonocgy ic differentiation of CD34+ progenitors Using cells from the (7 day) plasma-supplemented cultures described above, imniunofluorescence triple st'ainings were performed to analyze the expression of 2 0 the early granulomonocytic marker molecules lysozyme (LZ) and myeloperoxidase
(WPO)
as well as the lipopolysaccharide (LPS) receptor CD 14. The table below depicts the percentages and numbers of cells expressing MPG and/or LZ: 101 Percentages and numbers of cells expressing the markers MPO and LZ after 7 days of culture with or without VIEGF-C and specified growth factors percent of cells positive for cell marker numbers of cells positive factor marker exp. no. VEGF-C VEGF-C -VEGF-C VEGF-C 1 57 69 6 11 MPO 2 45 53 5 9 GM-CSF 3 18 24 10 13 mean*SE 40.0±1 1.0 49.0±13- 7.0±1.5 11.0±1.5* 1 54 70 6 11 LZ 2 16 16 2 3 3 15 23 9 13 ____mean±SE 28.0±12.8 36.0±16.7 5.7±2.0 9.0±3.0 1 20 28 23*6 MPO 2 37 42 39 48 IL-3 3 5 9 10 ____mean±SE 21.0±9.0 26.0±9.0 24.0±8.3 39.,7±4.2 115 22 17 29 LZ 2 3 3 3 3 3 3 5 6 22 _____mean±SE 7.0±4.0 10.3±5.8 8.7±4.0 18.0±7.0 1 29 37 46. 56 GM-CSF MPO 2 38 40 49 56 3 6 10 3 6 II1-3 mean±SE 24.0±9.0 39.3±16.6 32.7±14.8 39.3±+16.6 118 20 29 LZ 2 2 3 3 3 3 1 2 1 2 ____mean±SE 7.0±5.5 8.3±+5.8 11.0±+9.0 12.0±9.0 1 50 51 15 19 GM-CSF MPO 2 16 21 10 48 SCF mean±SE 33.0±17.0 36.0±15.0 12.5±2.5 33.5±14.5 1 15 15 5 6 LZ 2 9 20 5 mean±SE 12.0±3.0 18.0±2.0 15.0±0.0 25.51.
102 Among the granulomonocytic markers tested, VEGF-C led to an increase in the proportion of LZ+ cells under all culture conditions. In comparison, LZ+CDI4+ cells, which represent differentiated monocytic cells only very slightly increased upon addition of VEGF-C (data not shown). Co-stimulation of the cells with VEGF-C increased the expression of MPO, an early granulocytic marker molecule, only modestly, except in combination with both GM-CSF and IL-3, where the increase in the proportion of MPO+ cells was more pronounced.
VEGF-C exerts co-stimulatory effects in combination with M-CSF In another series of experiments, CD34+ cells were cultured in medium supplemented with 50 ng/ml M-CSF, with or without 100 ng/ml VEGF-C, for seven days.
Culture of CD34+ cells in the presence of M-CSF leads to the generation of CD14+ monocytes within 7 days. After seven days, the cultures were analyzed to determine the percentages of CD14+ cells and mean fluorescence intensity. The results are summarized in the table below: Percentages of CD14* cells and mean fluorescence intensity (MFI) of cells cultured with M-CSF in the absence or in the presence of VEGF-C M-CSF alone M-CSF VEGF-C exp no CD14+ MFI CD14+ MFI 1 37 20 47 2 42 44. 54 74 3 32 6 36 7 mean*SE 36.8±2.9 23.3111.1 45.7±5.2 40.3±19.3 As shown in the table, addition of VEGF-C to these cultures increased both the proportion of CD14+ cells (37% CD14+ cells vs. 46%) and the fluorescence intensity of CD14 expression (MFI 23.3 vs. 40.3). However, cell numbers did not increase upon addition of VEGF-C to M-CSF supplemented cultures. Thus, VEGF-C had a small effect on the differentiation of monocytic cells, but not on their growth.
In the foregoing experiments the presence of VEGF-C was associated with enhanced numbers of cells in cultures of cord blood CD34+ cells. Under all conditions tested (GM-CSF, IL-3, GM-CSF IL-3; GM-CSF SCF), co-culture with VEGF-C led 103 to an enhancement of proportions of myeloid cells. These results indicate an application for VEGF-C in the stimulation and/or differentiation of CD34+ progenitor cells in vitro or in vivo. Furthermore, the use of VEGF-C alone also slightly increased the number of surviving cells. The results thus indicate uses for compositions comprising VEGF-C prepared in admixture with the aforementioned or other growth factors, such as VEGF-C, and unit dose formulations comprising VEGF-C packaged together with the aforementioned or other growth factors. Such compositions, unit dose formulations, and methods of their use are intended as further aspects of the present invention.
Deposit of Biological Materials: Plasmid FLT4-L has been deposited with the American Type Culture Collection (ATCC), 12301 Parklawn Dr., Rockville MD 20952 (USA), pursuant to the provisions of the Budapest Treaty, and has been assigned a deposit date of 24 July 1995 and ATCC accession number 97231.
While the present invention has been described in terms of specific embodiments, it is understood that variations and modifications will occur to those in the art. Accordingly, only such limitations as appear in the appended claims should be placed on the invention.
104 SEQUENCE LISTING GENERAL INFORMATION: APPLICANT: Ludwig Institute for Cancer Research Helsinki University Licensing Alitalo, Kari(U.S. only) Joukov, Vladimir(U.S. only) (ii) TITLE OF INVENTION: Vascular Endothelial Growth Factor C (VEGF-C) Protein and Gene, Mutants Thereof, and Uses Thereof (iii) NUMBER OF SEQUENCES: 59 (iv) CORRESPONDENCE
ADDRESS:
ADDRESSEE: Marshall, O'Toole, Gerstein, Murray Borun STREET: 6300 Sears Tower, 233 South Wacker Drive CITY: Chicago STATE: Illinois COUNTRY: United States of America ZIP: 60606-6402 COMPUTER READABLE FORM: MEDIUM TYPE: Floppy disk COMPUTER: IBM PC compatible OPERATING SYSTEM: PC-DOS/MS-DOS SOFTWARE: PatentIn Release Version #1.30 (vi) CURRENT APPLICATION
DATA:
APPLICATION
NUMBER:
FILING DATE:
CLASSIFICATION:
(vii) PRIOR APPLICATION
DATA:
APPLICATION NUMBER:08/795,430 FILING DATE: 05-FEB-1997 (vii) PRIOR APPLICATION
DATA:
APPLICATION NUMBER: PCT/FI96/00427 FILING DATE: 01-AUG-1996 (vii) PRIOR APPLICATION
DATA:
APPLICATION NUMBER: 08/671,573 FILING DATE: 28-JUN-1996 (vii) PRIOR APPLICATION
DATA:
APPLICATION NUMBER: 08/601,132 FILING DATE: 14-FEB-1996 (vii) PRIOR APPLICATION
DATA:
APPLICATION NUMBER: 08/585,895 FILING DATE: 12-JAN-1996 (vii) PRIOR APPLICATION
DATA:
APPLICATION NUMBER: 08/510,133 FILING DATE: 01-AUG-1995 105 (vii) PRIOR APPLICATION DATA: APPLICATION NUMBER: 08/340,011 FILING DATE: 14-NOV-1994 (viii) ATrORNEY/AGENT INFORMATION: NAME: Gass, David A.
REGISTRATION NUMBER: 38,153 REFERENCE/DOCKET NUMBER: 28967/34140 (ix) TELECOMMUNICATION INFORMATION: TELEPHONE: 312/474-6300 TELEFAX: 312/474-0448 TELEX: 25-3856 INFORMATION FOR SEQ ID NO:1: SEQUENCE CHARACTERISTICS: LENGTH: 4416 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: CCACGCGCAG CGGCCGGAGA TGCAGCGGGG
CCTGGGACTC
CACGGAGGAG
CTGGACGGCC TGGTGAGTGG TCACACGTCA TCGACACCGG GCACCCCCTC GAGTGGGCTT GGCCAGGAGC CAGCGAGGAC ACGGGGGTGG GGTGITGCTG CTGCACGAGG GTACATCAAG GCACGCATCG CTTTGAGCAG CCATTCATCA GTGGGTGCCC TGTCTGGTGT GGTGCTGTGG CCAGACGGGC CACGCCACTG CTGCACGATG CTTCCTTTCC AACCCCTTCC GTTGCCCAGG AAGTCGCTGG GTGGGCTGAG TTTAACTCAG
TGCGAGACTG
TACATGCCAA
AGGGCACCAC
ACAAGCCTGA
CCATCCCCGG
AGGAGGTGGT
CCCTGTACCT
TGGTGCACAT
CGCCGCGCTG
CTACTCCATG
TGACAGCCTG
TCAGGAGGCG
CGAGGGCACA
CGACACAGGC
GGCCGCCAGC
CACGCTCTTG
CCTCAATGTC
GTGGGATGAC
GCAGTGCGAG
CACAGGCAAC
TGCCTGCGAC
ACCCCCCCGA
TCCATCTCCT
CCAGCCACCG
GACGCCAGGC
AGCTACGTCT
TCCTACGTGT
GTCAACAGGA
ACGCTGCGCT
CGGCGGGGCA
ACCACCTGGG
GAGCTCTATG
CTGGTCCTGA
TACCCAGGGA.
TGTGGCTCTG
CTGAACAT
GCAGGGGACA
GAGACAAGGA
CCTACTIGCAA
GCTACTACAA
TCGTGAGAGA
AGGACGCCAT
CGCAAAGCTC
TGCTCGTGTC
GAGACCAGGA
ACATCCAGCT
ACTGCACCGT
AGCAGGCAGA
120 180 240 300 360 420 480 540 600 660 720 780 840 AGCTGCTGGT AGGGGAGAAG GTGTCACCTT TGACTGGGAC GCGGGGTAAG TGGGTGCCCG CCTGACCATC CACAACGTCA CGGCATCCAG CGATTTCGGG CGTCGAGTGG CTCAAAGGAC GCCCGTGAAG CTGGCAGCOT ACTGTCCGGG CGCCACAGTC AflGCP.CCTAC ACCCTCGCCC GGAG3C7UGTO GTGAATGTCC CTACTCGCGT CACAGCCC CAGCATCCAG TGGCACTGGC CCGGCGGCGG CAGCAGCAAG GCAGGATGCC GTGAACCCCA GAPATAPGACT GTGA.GCAAGC TGTGGTCTCC AACAAL3GTGG 106 AGCGACGCTC CCAGCAGACC GCCAGCACGA CCTGGGCTCG AGAGCACCGA GGTCATIT CCATCCTGGA GGCCACGGCA ACCCCCCGCC CGADTTCCAG CACATGCCCT GGTGCTCAAG TGTGGAACTC CGCTGCTGGC CCCCCCAGAT ACATGAGAAG AGGCCCTCAC CTGCACG3GCC GGCCCTIGGAC ACCCTGCAAG ACCTCATGCC ACAG3TGCCGT TC-GAGAOCCT CvGACACCTG TUGTGATCCA GAATGCCAAC GCCAGGATGxA GCGGCTCATC CCCCGACGGC TrCACCATCG AATCCAAGCC ATCCGA3GAG CACACAGAAC TCTCCAGCAT TATGTIGTGCA AGGCCAACAA CATGAAAATC CCrCrATCAG GGAGACGAGC TGGTGAAGCT TGGTACAAGG ATGGAAAGGC GAfGTACAG .AGGCCAGCAC CTGAfIGCGCA ACATCAGCCT GArjGCCTCCT CCCCCADCAT TACOGGGTGC CCCTGCCTCT ATGTTTGCCC ALGCGTAGTCr GACTGGAGGG CGGTGACCAC ACCV2AGTTT(G T13GAGGGAAA GTGTCTGCA= TGTACAAGTG TACTrTATG TGACCACCAT CTACTAGAGG GCCAGCCGGT C1'GCGCTGGT ACCGCCTCAA CTCGACTGCA AAACGTGCA GCACCTGGrGG CGCGCCACGC GAGGGCCACT ATGTGTGCGA AAGTACCI'GT CGGTGCAGGC C'CGTGAACG TGAGcOiACrC AGCATCGTIGT GGTACAAAGA GACTCCAACC AGAAGCTGmAG TGCAGCGTIGT GCAACGCCA6A TCCGAGGATA AGGGCAGCAT TrcIrGGG Tcc1'ccTcCT ATCAAGACWGGcCTACCTGTC TGCGAATACC TGTCCTACGA GGG.AGAGTGC TCGGCrACGG 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 GCTCCTGAGC TGCCAAGCCG ACAG3CTACAA CCTG;TCCACG CTOCAcGATO CGCACGGGAA TCTGTI'CGCC ACCCCTCTGG CACGCTCAOC CTGALTATCC AG;TGCAAGAC CGGCGCAGCC CCTrGGAAGCC CCTCGGCTCA GCTGGAQATG CAGTGCTTGG CGAGAGGCTG CTGGAGGAAA CATCCAGCGC GTGCGCQAGG GGGCTGCGTC AACTCCTCCG GGAGATCGTG ATCCTTGTCG CCTCATC'rrC TGTAACATGA CATCATCATG GACCCCGGGG TGCCAGCCAG TavGGAATTCC
CCGCCACCT
CCCGCGTCGC
ATGACAALGCA
CGCAGAAC7T
TGGCCGGAGC
AGTCTGGAGT
AGGATGCQGG
CCAGCGTGGC
GTACCGGCGT
GGAGGCCGGC
AGGT'GCCTCT
CCCGAGAGCG
GTACGACCAT
CCCGCTra
GGAGGAGGTG
GCCCGAGCAC
CTGCCACAAG
GACCGACC1'C
GCACGCGCCC
CGACTTGGCG
ACGCTATCTG
CG7GGAAGGC
CATCGCTGTC
CCACGCAGAC
CGGAGGAGCAA
GCTGCALCCTG
107 CGCC7TCGGG
CACCGTGGCC
GTCGGAGCTC
GGCGTGCACC
CCTrCTCCAAC
CGAGCAGCGC
GGGGAGCAGC
GGCTTCTCCA
TGTCTGCTAC
CCACAGAGAC
TGACTTTGGC
AAGGTGGTGG
GTGAAAATGC
AAGATCCTCA
AAGCCGCAGG
TTCCTGCGCG
GGACGCrrCC
GACAGGGTCC
GACCAAGAAG
AGCTTCCAGG
CTGGCTGCTC
CTTGCCCGGG
AAGCCTCCGC
TGAAAGAGGG
TTCACATCGG
GCCCCCTCAT
CCAAGCGGGA
GCGCCATGGT
TCTTCGCGCG
CTGAGGACCT
TGGCCAGAGG
TTTCGGCATC
CGCCACGGCC
CAACCACCTC
GGTGATCGTG
CGCCTTCALGC
GGAGCTCGCC
GTTCTCGAAG
GTGGCTGAGC
GATGCGAGTTC
CACAAGGGCA GCAGCTGTGA AGCGAGCACC GCGCGCTGAT AACGTGGTCA ACCTCCTCGG GAGTTCTGCA AGTACGGCAA
CCCTGCGCGG
AGGCTGGATC
ACCGAGGGCG
CCGCTGACCA
CTGGCTTCCC
AGCGACGTGG
TACGTCCGCA
GGAACATTCT GCTGTCGGAA ACATCTACAA AGACCCTGAC CCGGCTGCCC CTGAAGTGGA GAGTGACGTG TGGTCCTTITG GTACCCTGGG GTGCAGATCA GAGGGCCCCG GAGCTGGCCA AGACCCCAAG GCGAGACCTG GGGCAGGGGC CTGCAAGAGG3 AGAAGAGGGC AGCTTCTCGC TGAGGACAGC CCGCCAAGCC GTCCT=TCCC GGGTGCCTGG AI-rGAGGAA
TTCCCCATGA
CAGTGGGATG GTGCTGGCCT AAGCGGCTI2C AGGTAGCTGA.
TCTGCAC7TA TAAGAiAAGAT TACTACAA3AC TI'CAAAGAGG.
GTGACCACTG'AAGCACCACA
GATAATATCC AGCCTCCCAC TGGCCCCTGA AAGCATCTTC GACAAGGTGT
GGGTGCTTCT
ATGAGGAGTT
CTCCCGCCAT
CATTrCGGA
AAGAGGAGGT
AGGTGTCCAC
TGCAGCGCCA
CCAGAGGGGC
CCCCAACGAC
CGGAGGAGTT
AGCAGAGAGA
CAAAGACTT
PLACCAGGAGG
GGGAAGGGGT
PIAGAAGCTGG
CTGGGAGATC
CTGCCAGCGG
ACGCCGCATC
GCTGGTGGAG
CTGCATGGCC
CATGGCCCTA
CAGCCTGGCC
TGAGACCCGT
CTACAAAGGC
TGAGCAGATA
GAGAAGGCAG
AAGACTCG
ACAAGAGGAG
TAGGCCTCCG
TGGAGCAGAG
AGTAACAGGT
CCTTATGCCA
TTCTCTCTGG
CTGAGAGACG
ATGCTGAACT
ATCCTGGGGG
CCGCGCAGCT
CACATCGCCC
GCCAGGTATT
GGTTCCTCCA
TCTGTGGACA
GAGAGCAGGC
CATACGTCAG
CTXTCTTC
CATGAAAGTG
AGAAGTCTCC
GGAGGCGGCC
GAGCGAGGCG
TGGAAGATCT
GAAAGTGCAT
TGAAGATCTG
AGGGCAGTGC
ACACCACGCA
GGGCCTCCCC
GCACAAGGAT
GCTGGTCCGG
ACCTGCTCCA
CTCAGAGCTC
AGGCTGACGC
ACAACTGGGT
GGATGAAGAC
ACCAGACAGA
NTAGACAAGA
C:ATTTTCrrC rACTGCTATC 3ACAAGGAGT 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 3660 3720 3780 3840 3900 3960 4020 4080 4140 4200 4260 4320 GATGACTGCG GGCAGGCCTG TGTTCCCTGA CTCCTCCAAG GCNTTCCCAG ACACTGGCGT GCGTGACAGA GGGCTCACCT GAAAGGGAGA CGCCCTTTCA TGGTCTGCTG TACTGCTTGA CCAAAGAGCC CTCAAGCGGC 108 CTrGCCTTCT AGGTCACTTC TCACACAATG TCCCTTCAGC ACCTGACCCT GTGCCCGCCA GTTA'rrCCTr GGTAATATGA GTAATACATC AAAGAG INFORMATION FOR SEQ ID 140:2: SEQUENCE CHARACTERISTICS: LENGTH: 216 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: CAAGAAAGCG GCTTCAGCTG TAAAGGACCT GGCCAGAATG TGGCTGTGAC CAGGGCACAC CCTGACTCCC AAGGGAGGCG GCGGCGGCCT GAGCGGGGGG CCCGAGGAGG CCAGGTG~r TACAACAGCG AGTATGGGGA GCTGTCGGAG CCAAGCGAGG AGGACCACTO CTCCCCGTCT GCCCGCGTGA CTI-rCTrCAC AGACAACAGC TACTAA INFORMATION FOR SEQ ID NO:3: SEQUENCE CHARACTERISTICS: LENGTH: 4273 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA 4380 4416 120 180 216 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: AAGCTTATCG ATTTCGAACC CGGGGGTACC GAATTCCTCG AGTCTAGAGG CAGGTCGACC GGGCTCGATC CCCTCGCGAG TTGGTTCAGC TGCTGCCTGA CCTCGCGGAG TtCTACCGGC AGTGCAAATC CGTCGGCATC CAGGAAACCA TCCGCGCATC CATGCCCCCG AATCAG GTGGGGAGG3C ACGATGGCCG GGATCTTTGT GAAGGAIACCT TACTTCTGTG GTGTGACATA ATTGGACAAA AGATTTAAAG CTCTAAGGTA AATATAAAAT TTTTAAGTGT ATAATGTGTT TTCTAATTGT TTGTGTATT TAGATTCCAA CCTATGGAAC TGATGAATGG GGAATGCCT TAATGAGA AACCTGTrrr GCTCAGAAGA AATGCCATCT
AGCATGCCTG
GGCTGGACGA
GCAGCGGCTA
CT11'GGTCCC
CTACCTACAG
AAACTACTGA
120 240 300 360 420 480
AGTGATGATG
AIGGCTACTGC TGACTCTCAA CATI'=TACTC CCAAGGAC7I' TCC7TCAGAA TrGCTAAGTr CTCTriCTTG CrTGCTATT TACACCACAA.
TI'ATGGAAAA ATATI'CTOTA ACC~I?1ATAA.
TG~rrrcTTACTCCACAC AGGCATAGAG TGTGTACCTT TAGC==~rA AT1TGTAA CCTTGACTAG AGATCATAAT CADCCATACC AACCTCCCAC ACCTCCCCCT GAACCTGAAA TrGTTTATrG CAI3CTTATAA TG=TACAAA AAA3CATTT TrCACTGCA TTCTAGTYT13 CATG GGA TCTGCCGGTC TCCCTATAGT ACCTIGCATTA ATGaAhTCGGC CAACGCQCGG CCGCTTCCTC GCJCACTGAC TCGCTGCGCT CTCACTCAAA GGCGGTAATA CGGTTATCCA 7TrGAGCAAA AGGCCAGCAA AAGGCCAL3GA TCCATAGGCT CCGCCCCCCr GACGAGCATC GAAACCCGAC AGGACTATAA AGATACCAGG CTCC7UM~C GACCCTGCCG CTIrACCGGAT TGGCCTTC TCAATGCTCA CGCTGTAGGT AGCTGGGCTG TGTGCACGAA CCCCCCGTTC ATCGTCTTGA GTCCAACCCG GTAAGACACG ACAGGA7TAG CAGAGCGAGG TATG;TAGCG ACTACGGCTA CACTAG3AAGG ACAGTATG TCGGAAAAAG AG3TTGGTAGC TCrIrGkTCCG TrF-lrGTTTG CAAGCAGCAG ATTACGCGCIL TCTFTTCTAC GGGGTCTGAC GCTCAGTGGA TGAaATTATC AAAAAGGATC 11'CACCTW3A CAATCTAAA13 TATATATGAI3 TAAACTI7GOT CACCTATCTC AGCGATCTG;T CTATT1TCGTT CTCCAAAAAA GAAGAG3AAAG 7TGAGTCA TGCTGTrGTT AGGAAAAAGC TGCACTGCTA
GTAGAAGACC
AGTAATAGAA
TACAAGAAAA
GTAGGCATAA CAGTTATAAT CATA3ACATAC TSTCTG;CTAT TAATAACTAT GCTCAAAAAT GGGTTAATAA GGAATAT77G ATGTATAGTG.
ACATTTGTAG AGGTTlTACr TGCTITAAAAA cATAAAA7GA ATGCAATTGT Tvrrd7AAc TAAAGCAATA GCATCACAAA TrYCACAAAT GG7TTGTCCA AACTCATCAA TGTATCTTAT GAGTCGTAr AATTMCGATA AGCCAGGTTA GGAGAGGCGG Tr=CGTA7T GGGCGCTCTT CGGTCGTTCG GCTGvCGGCGA GCGGTATCAG CAGAATCAGG GGATAACGCA GGAAAGAACA J.CCGTAAAAA~ GGACGCGTITG CTGGCGT-rr ACAAAAATCG AGCTCAAGT CAGAZGT3GC CGTrrCCCCC TGGAAG=TC CTCGTGCGCT ACTGTCCGC CTCTCCCr TCGGGAAGCG ATCTCAGTrC GGTGTAGGTC GTrCGCTCCA AGCCCGACCG CTGCGCCTrA TCCGGTAACT ACTrATCG;CC ACTGGCAGCA GCCACTGGTA GTGCTACLGA GTTCTrC-%AA TGGTGGCCTA GTATCTGCGC TCTrGCTGAAG3 CCAGTTACCT GCAAACAAAC CACCGCTGGT AGCGGTGG~ GAAAAAAAGG ATCTCAAt3AA GATCCT1rrA ACGAAAACTC ACGTTAAGGG ATrTTTGGTCA TCCTT-rrAAA TrAAAAATGA AGTT'rIAAkLT CTGACAG=ZA CCAATGCTTA ATCAI3TGAGG CATCCATAGT TGCC7GACTC CCCGTCGTGT 540 600 660 '720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1690 1740 1 BOO 1860 1920 1980 2040 2100 2160 2220 AGATAACTAC GATACGGGAG GGCTTACCAT ACCCACGCTC ACCGGCTCCA GATTTATCAG GCAGAAGTGG TCCTGCAACT 7TATCCGCCT CTAGAGTAAG TAGTI'CGCCA GTTAATAGTT TCGTGGTGTC ACGCTCGTCG TTTGGTATGG GGCGAG'ITAC ATGATCCCCC ATGTTGTGC-A TCGTTGTCAG AAGTAAG7TG GCCGCAGTGT ATTCTC7TAC TGTCATGCCA TCCGTAAGAT AGTCA7TCTG AGAATAGTGT ATGCGGCGAC ATAATACCGC GCCACATAGC AGAACTTTAA GGCGAAAACT CTCAAGGATC TTACCGCTGT CACCCAACTG ATC7TCAGC-A TC~rrTACr GAAGGCAAAA TGCCGCAAAA AAGGGAATAA TCTrCCI-iz- TCAATATrAT TGAAGCATT TATrGAATG TATrrAGAAA AATAAACAAA 110 CTGGCCCCAG TGCTGCAATG CAATAAACCA GCCAGCCGGA TGCCACCTGA CGTCTAAGAA
TCACGAGGCC
AGCTCCCGGA
AGGGCGCGTC
AGATTGTACT
ACATAACC~r
TTCCAAATTG
GTI-rGAGGGA ACCCCTCA7T
CCAATGTT
CCAGCTGAAG
GAA.AAAGGGG
cTF1.LCGTCTC
GACGGTCACA
AGCGGGTGrr
GAGAGTGCAC
ATGTATCATA
AGAGAGAGGC
CTGTrrAACA
GTACTCCTAA
AAAGACAGGA
CCTATAGAGT
GGAATGAAAG
ACCA7TATA
GCGCGTTTCG
GCTTGTCTGT
GGCGGGTGTC
CATATGGACA
CACATACGAT
TI'AATCAGAG
GATCCCCTTG
TGAIrrTGCT
TATCAGTGGT
ACGAGCCATA
ACCCCACCTG
C
C
C
C
CCATCCAGTC
TGCGCAACG'I
CTTCATCAG
AAAAAGCGG7
TATCACTCA'I
GCTTTTCTGT
CGAGTTGCTC
AAGTGCTCAT
TGAGATCCAG
TCACCAGCGT
GGGCGACACG
ATCAGGGTA
TAGGGGTTCC
TCATGACATT
GTGATGACGG
AiAGCGGATGC
GGGCTGGCT
rATTGTCG7T
Z'TAGGTGACA
4.CAGAAACTG 3'TTIACCACC
TCGGACCC
:CAGGCTCTA
3ATAAAATAA 7AGGTTTr7GC
;AGAATAGAG
!AGGATATCT
TTGTCTCATG AGCGGATACA GCGCACATTr CCCCGAAAAG AACCTATAAA AATAGGCGTA TA7TAATTGT TG7TGCCATT CTCCNG7TCc TAA3CTCC7TC GGWrATGGCA
GACTGGTGAG
ITGCCCGGCG
CA77GGAAAA
TTCGATGTAA
'ITCTGGGTGA
GAAATGTTGA
ATACCGCGAG
AGGGCCGAGC
TGCCGGGAAG
GCTACAGGCA
CAACGATCAA
GGTCCTCCGA
GCACTGCATA
TACTCAACCA
TCAATACGGG
CGTrCTTCGG
CCCACTCGTG
GCAAAA.ACAG
ATACTCATAC
2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 3660 3720 3780 3840 3900 3960
TGAAAACCTC
CGGGAGCAGA
TAACTATGCG
AGAACGCGGC
CTATAGAACT
77ITGAGTCAA
TTGATATCTA
TGCA7TC7A
GTI'MGCTC
AAGATTAT
AAGCTAGCTT
AJTCAGAT
GTGGTAAGCA
TGACACATGC
CAAGCCCGTC
GCATCAGAGC
TACAATTAT
CGAGCAGAGC
CTCAAGGATG
CCA7TATGGG
ATCGATTAGT
AACAATATCA
TTAGTCTCCA
AAGTAACGCC
CAAGGTCAGG
c3TTCCTGCCC A7GCAAG GCATGGAAAA ATACATAACT AACAGATGGA ACAGCTGAT ATGGGCCAAA ill CGGCTCAGGG CCAAGAACAG ATGGAACAGC TGAATATGGG CCAAACAGGA TATCTGTGGT 4020 AAGCAGTTCC TGCCCCGGCT CAGGGCCAAG AACAGATGGT CCCCAGATGC GGTCCAGCCC 4080 TCAGCAGTrT CTAGAGAACC ATCAGATGTT TCCAGGGTGC CCCAAGGACC TGAAATGACC 4140 CTGTGCC7TA TTGAACTAA CCAATCAGTr CGCT'rCTCGC TTCTGTTCGC GCGCTICTGC 4200 TCCCCGAGCT CAATAAAAGA GCCCACAACC CCTCACTCGG GGCGCCAGTC CTCCGATTGA 4260 CTGAGTCGCC CGG 4273 INFORMATION FOR SEQ ID NO:4: SEQUENCE CHARACTERISTICS: LENGTH: 40 amino acids TYPE: amino acid STRANDEDNESS: not relevant TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: Pro Met Thr Pro Thr thr Tyr Lys Giy Ser Val Asp Asn Gin Thr Asp 1 5 10 Ser Gly Met Val Leu Ala 5cr Glu Glu Phe Giu Gln Ile Giu Ser Arg 25 His Axg Gin Glu Ser Gly Phe Arg INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 18 amino acids TYPE: amino acid STRZAqDEDNESS: not relevant TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID Xaa Glu Glu Thr Ile Lys Phe Ala Ala Ala His Tyr Asn Thr Giu Ile 1 5 10 i Leu Lys 112 INFORMATION FOR SEQ ID NO:6: SEQUENCE CHARACTERISTICS: LENGTH: 219 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLO0GY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: TCACTATAGG GAGACCCAAG CTrGGTACCG AGCTCGGATC CACTAGTAAC GGCCGCCAGT GTGGTGGAAT TCGACGAACT CATGACTGTA CTCTACCCAG AATATrGGAA AATGTACAAG TGTCAGCTAA GGCAAGGAGG CTGGCAACAT AACAGAGAAC AGGCCAACCT CAACTCAAGG ACAGAAGAGA CTATAAAATr CGCTGCAGCA, CACTACAAC INFORMATION FOR SEQ ID NO:?: SEQUENCE CHARACTERISTICS: LENGTH: 1997 base pairs TYPE: nucleic acid STR.ANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: NAME/KEY: CDS LOCATION: 352..3.608 (xi) SEQUENCE DESCRIPTION: SEQ ID NO;: CCCGCCCCGC CTCTCCAA AGCTACACCG ACGCGGACCG CTCGCTTCAC CTCGCGGGCT CCGAATGCGG GGAGCTCGGA TTTTACCTGA CACCCGCCGC C-1-rTCCCCGG CACTGGCTGG GGAACGCGA GCCCCGGACC CGCTCCCGCC GCCTCCGGCT GAGGAGCCCG GGGGAGAGGG ACCAGGAGG GCCCGCGGCC CCACCCCTGC CCCCGCCAGC GGACCGGTCC
CCCACCCCCG
CGGCGGCGTC
TGTCCGGTT
GAGGGCGCCC
CGCCCAGGGG
TCGCAGGGGC
GTCCTTCCAC
CTCCCTCGCC
CCTGTGAGGC
TGCAAAGTI'G
GGGTCGCCGG
GCCCGCGCCC
C ATG CAC Met His
I
TTG CTG GGC TT Leu Leu Gly Ph CTC CCG GGT CC Leu Pro Gly Pr GGA CTC GAC CT Gly Leu Asp Lei TAT GCA AGC AX~ Tyr Ala Ser Lyi GAT GAA CTC AT( Asp Giu Leu Mel 7c TGT CAG CTA AGC Cys Gin Leu Arc CTC AAC TCA AGC Leu Asn Ser Arc 100 AAT ACA GAG ATC Asn Thr Glu Ile 115 TGC ATG CCA CGG Cys Met Pro Arg GCG ACA AAC ACC Ala Thr Asn Thr 150 GGG GGT TGC TGC Gly Gly Cys Cys 165 AGC TAC CTC AGC Ser Tyr Leu Ser 180 GGC CCC AAA CCA Gly Pro Lys Pro 195 TGC ATG TCT AAA Cys met Ser Lys 113 'C TTC TCT GTG GCG TGT TCT CTG CTC e Phe Ser Val Ala Cys Ser Leu Leu 10 TCGC GAG GCG CCC GCC GCC GCC GCC o Arg Giu Ala Pro Ala Ala Ala Ala 25 C TCG GAC GCG GAG CCC GAC GCG GGC ui Ser Asp Ala Glu Pro Asp Ala Gly' 40 45 SGAT CTG GAG GAG CAG TTA CGG TCT S Asp Leu Glu Giu Gin Leu Arg Ser 60 2ACT GTA CTC TAC CCA GAA TAT TG Thr Val Leu Tyr Pro Glu Tyr Trp 75 AAA GGP. GGC TGG CAA CAT AAC AGA Lys Gly Gly Trp Gin His Asn Arg 90 ACA GAA GAG ACT ATA AAA TIT GCT Thr Glu Glu Thr Ile Lys Phe Ala 105 110 TTG AAA AGT ATT GAT AAT GAG TIGG Leu Lys Ser Ile Asp Asn Giu Trp 120 125 GAG GTG TGT ATA GAT GTG GGG AAG Giu Val CysIle Asp Val Gly Lys 135 140 TITC TTT AAA CCT CCA TGT*GTG TCC Phe Phe Lys Pro Pro Cys Val Ser 155 AAT AGT GAG COG CTG CAG TGC ATG Asn Ser Giu Gly Leu Gin Cys met 1702 AAG ACG TIA TI'T GAA ATI ACA GTG C Lys Thr Leu Phe Glu Ile Thr Val P 185 190 GTA ACA ATC AGT Tn' GCC AAT CAC Val Thr Ile Ser Phe Ala Aen HisT 200 205 CTG GAT GTT TAC AGA CAA GTT CAT T Leu Asp Val Tyr Arg Gin Val His S 215 220 GCC GCT GCG CTG Ala Ala Ala Leu GCC TIC GAG TCC Ala Phe Glu Ser GAG GCC ACG OCT Glu Ala Thr Ala OTG TCC AGT GTA Val Ser Ser Val AAA ATG TAC AAG Lys Met Tyr Lys GAA CAG GCC AAC Giu Gin Ala Asn GCA GCA CAT TAT Ala Ala His Tyr PiC AAG ACT CAA krg Lys Thr Gin 130 3AG TnT GOA GTC 1lu Phe Gly Val 145 ;TC TAC AGA TGT fai Tyr Arg Cys 160 LAC ACC AGC ACG Lsn Thr Ser Thr CT CTC TCT CAA 'ro Leu Ser Gin ,CT TCC TG CGA hr Ser Cys Arg 210 CC ATI ATI AGA er Ile Ile Arg 225 405 453 501 549 597 645 693 741 789 837 885 933 981 1029 114 C-AG TGT C-AG C-GT TCC CTG CCA GCA ACA CTA CCA GCA GCG AAC AAG ACC 1077 Arg Ser Leu Pro 230 Ala Thr Leu Pro Gin C-ye Gin Ala Ala Asn Lys Thr 235 240 TGC Ccc ACC AAT TAC ATG TGG AAT AAT C-AC ATC Cys Pro Thr Asn Tyr Met Trp Asn Aen His Ile 245 250
C-AG
Gin
GGA
Gly 275
TGT
C-ye
C-AC
His
CTC
Leu
TGC
Cys
CCT
Pro 355 TA2 Leu
C-CAI
GAA
Glu 260
TC
Phe
C-AG
Gin
AAA
Lys TrC Phe
CAG
340
GGA
%AA
-ys
~GT
GAT
Asp
C-AT
His
TGT
C-ye
GAA
Giu
CCC
Pro 325
TGT
C-ye
AAA
Lys
GGA
Gly ACG2 rrr Phe
GAC
Asp
GTC
Val
C-TA
Leu 310
AGC
Ser
GTA
Val
TGT
Cys
RAG.
Lys
NAC
ATG TTT TCC TCG GAT GC-T GGA Met Phe Ser Ser Asp Ala Gly 265
TGC
C-ye
GAT
Asp 270 AGA TGC C-TG Arg C-ys Leu 255 GAC TCA ACA Asp Ser Thr CAT GAA GAG Asp Glu Giu AGC TGT GGA
ATC
Ile
TGC
C-ye 295
GAC
Asp
CAA
Gin
TGT
C-Ye
GCC
Ala
AAG
Lye 375
CGC
TGT GGA Cys. Gly 280 AGA GC-G .Arg Ala AGA AAC Arg Asn TGT GGG Cys Gly AAA AGA Lys Arg 345 TGT GAA C-ye Giu 360 TIC CAC Phe His CAG AAG
*C-CA
Pro
GGG
Gly
TCA
Ser
GCC
Ala 330
ACC
Thr
TGT
C-ye
C-AC
His
GCT
AAC AAG GAG C-TG Asn Lye Glu Leu 285 C-TI C-G C-CT GCC Leu Arg Pro Ala 300 TGC C-AG TGT GTC Cys Gin C-ys Val 315 AAC C-CA GAP. TTI- Asn Arg Glu Phe TGC C-CC AGA AAT Cys Pro Arg Asn 350 ACA GAA AGT C-CA Thr Glu Ser Pro 365 C-AA AC-A TGC AGC Gin Thr C-ye Ser 380 TGT GAG C-CA GGA C-ye Giu Pro Gly 395 TCA TAT TGG AAA
GCT
Ala
GAT
Asp
ACC
Thr 290
CCC
Pro
AAA
Lye
ACA
Thr Ser
TGT
C-ye
CAT
Asp 335
CAA
Gin
CAG
Gin
TGT
C-ye Tr Phe Gly 305
AAC
Asn
AAC
Asn Pro Leu Asn 1125 1173 1221 1269 1317 1365 1413 1461 1509 1557 1605 1658 1718 1778 1838 TGC TTG C-ye Leu 370 AGA C-GG Arg Arg 385 TAT AGT Tyr Ser Pro C-ye Thr Asn Arg Gin Lye Ala 390 GAP. GAP. GTG Glu Glu Val 405 TGT C-GT TGT C-ye Axg C-ye GTC C-CT Val Pro Ser Tyr Trp Lye Arg Pro Gin Met 410 AGC TAAGATTGTA C-TG3.rTCCA GTICATCGAT TTTCTATI'AT GGAAAAC-TGT Ser GTTGCCACAG TAGAAC-TGTC TGTGAACAGA GAGACCCTIG TGGGTCCATG CTAACAAAGA CAAAAGTCTG TC'-IrCCTGA ACCATGTGGA TAACITTTACA GAAATGGACT GGAGCTCATC TGCAAAAGGC CTCTGTA GACTGGTTI' CTGCCAATGA CC-AAP.CAGCC AAGATTTTCC 115 TCTTGTGATT TCTTrAA4AG AATGACTATA TAATrI'AIT CCACTAAAAA TATTG'TTT GCATTCATTT TTATAGCAAC AACAATTGGT AAAACTCACT GTGATCAATA TT TATATC ATGCAAAATA TGTITAAAAT AAAATGAAAA TTGTATTAT INFORMATION FOR SEQ ID NO:B: SEQUENCE CHARACTERISTICS: LENGTH: 419 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: Met His Leu Leu Gly Phe Phe Ser Val Ala Cys Ser Leu Leu Ala Ala 1. 5 10 Ala Leu Leu Pro Gly Pro Arg Glu Ala Pro Ala Ala Ala Ala Ala Phe 1898 1958 1997 Giu Ser Gly Leu Asp Leu Ser Asp Ala Glu Pro Asp Ala Gly Glu Ala Thr Ala Tyr Ala Ser Lys Asp Ser Tyr Ala His Thr Gly 145 Arg Val Asp Lys Cys Asn Leu Tyr Asn 115 Gin Cys 130 Val Ala Cys Gly Glu Gin Asn 100 Thr Met Thr Gly Leu met 70 Leu Arg Ser Arg Glu Ile Pro Arg Asn Thr 150 Cys Cys 165 55 Thr Lys Thr Leu Glu 135 Phe Asn Leu Val Gly Glu Lys 120 Val Phe Ser Giu Leu Gly Giu 105 Ser Cys Lys Glu Leu 185 Tyr Trp 90 Thr Ile Ile Pro Gly 170 Phe Pro 75 Gin Ile Asp Asp Pro Leu Glu Glu Gin Leu Arg Ser Val Ser Glu His Lys Asn Val 140 Cys Gin Ile Tyr Asn Phe Giu 125 Gly Val Cys Thr Trp Lys Arg Giu Ala Ala 110 Trp Arg Lys Giu Ser Val Met Asn 175 Val Pro Met s0 Gin Ala Lys Phe Tyr 1.60 Thr Leu Ser Thr Ser Tyr 180 Leu Ser Lys Thr 190 His Thr Ser Ser Gin Gly 195 Pro Lys Pro Vai Thr 200 Ile Ser Phe Ala 116 Cys Arg Cys Met Ser Lys Leu Asp Val Tyr Arg Gin Val His Ser Ile 210 215 Ile 225 Lys Leu Thr Glu Gly 305 Asn Asn Leu Cys Arg 385 Tyr Gin Arg Thr Ala Asp Thr 290 Pro Lys Thr Asn Leu 370 Arg Ser Met Ser Leu Pro Thr 245 Giu Asp 260 Phe His Gin Cys Lys Giu Phe Pro 325 Gin Cys 340 Gly Lys Lys Gly Cys Thr Giu Val 405 Ala Tyr Met Ile Cys 295 Asp Gin Cys Ala Lys 375 Arg Arg Pro Asn 250 Ser Pro Gly Ser Ala 330 Thr Cys His Ala Pro 410 220 Cys His Ala Lys Arg 300 Gin Arg Pro Giu Thr 380 Giu Tyr Gin Ala Ala Ile Cys Arg 255 Gly Asp Asp 270 Glu Leu Asp 285 Pro Ala Ser Cys Val Cys Giu Phe Asp 335 Arg Asn Gin 350 Ser Pro Gin 365 Cys Ser Cys Pro Gly Phe Trp, Lys Arg 415 INFORMATION FOR SEQ ID NO:9: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 17 amino acids TYPE: amino acid STRAZNDEDNESS: not reievant TOPOLOGY: linear (ii) MOLECULE TYPE: peptide 117 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: Glu Glu Thr Ile Lys Phe Ala Ala Ala His Tyr Asn Thr Glu Ile Leu 10 Lys INFORMATION FOR SEQ ID Wi SEQUENCE CHARACTERISTICS: LENGTH: 1836 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: NAME/KEY: CDS LOCATION: 168. .1412 (xi) SEQUENCE DESCRIPTION: SEQ ID GCGGCCGCGT CGACGCAAAA GTrGCGAGCC GCCGAGTCCC GGGAGACGCT CGCCCAGGGG GGTCCCCGGG AGGAAACCAC GGGACAGGGA CCAGGAGAGG ACCTCAGCCT CACGCCCCAG CCTGCGCCAG CCAACGGACC GGCCTCCCTG CTCCCGGTCC ATCCACC ATG CAC TTG; Met His Leu 1 CTG TGC TTC TTG TCT CTG GCG TGT TCC CTG CTC GCC GCT GCG CTG ATC Leu Cys Phe Leu Ser Leu Ala Cys Ser Leu Leu Ala Ala Mla Leu Ile AGT CCG CGC GAG Ser Pro Arg Glu CCC GCC ACC GTC Pro Ala Thr Val GCC TTC GAG TCG Ala Phe Glu Ser CTG GGC TTC TCG Leu Gly Phe Ser GCG GAG CCC GAC GGG GGC GAG GTC AAG Ala Glu Pro Asp Gly Gly Glu Val Lys GCT TTT 320 Ala Phe GAA GGC AAA Glu Gly Lys CTG GAG GAG CAG Leu Glu Glu Gin CGG TCT GTG TCC .Arg Ser Val Ser AGC GTA GAT Ser Val Asp GAG CTG ATG TCT GTC Glu Leu Met Ser Val CTG TAC CCA GAC TAC TOG AAA ATG TAC AAG WGC Leu Tyr Pro Asp Tyr Trp Lys Met Tyr Lys Cys CAG CTG Gin Leu CGG AAA GGC GGC Arg Lys Gly Gly TGG CAG CAG CCC ACC CTC AAT ACC AGG ACA Trp Gin Gin Pro Thr Leu Asn Thr Arg Thr 90 118 000 GAC AGT GTA Gly Asp Ser Val 100 AAA TIT GCT Lys Phe Ala 105 OCT OCA CAT TAT AAC ACA GAG ATC Ala Ala His Tyr Asn Thr Glu Ile 1.10 AAA AMT ATfl OAT AAT GAG TOG AGA AAG Lys Ser Ile Asp Asn Giu Trp Arg Lys 120 CAA TGC ATG CCA Gin Cys Met Pro COT GAG Arg Giu 130 GTG TOT ATA OAT Val Cys Ile Asp 135 TnT AAA CCT CCA Phe Lys Pro Pro 150 GTG GGG AAG GAG Val Gly Lys Glu GA GCA GCC Gly Ala Ala AGA TOT 000 Arg Cys Gly TOT GTG TCC Cys Val Ser GTC TAC Val Tyr 155 ACA AAC ACC 7TC Thr Asn Thr Phe 145 GOT TOC TGC AAC Gly Cys Cys Asn 160 TAC CTC AGC AAG Tyr Leu Ser Lys AGC GAG Ser Giu 165 000 CT0 CAG Gly Leu Gin TOC ATG Cys Met 170 AAC ACC AGC ACA Asn Thr Ser Thr 77M Tnr GAA ATT ACA GTG Leu Phe Giu Ile Thr Val ccT CTC TCA Pro Leu Ser CAA GGC Gin Gly 190 CCC AAA CCA Pro Lys Pro ACA ATC AGT =I GCC AAT CAC ACT TCC Thr Ile Ser Phe Ala Asn His 7hr Ser 200 COO TGC ATO TCT Arg Cys Met ser AAA CTG Lys Leu 210 OAT GTT TAC AGA CAA Mr CA Asp Val Tyr ACA TrA CCA Thr Leu Pro 230 Gin Val His TCA ATT Ser le 220 GCr iAAC Ala Asn 235 AT AGA COT TCT le Arg Arg Ser Cro CCA 'OCA Leu Pro Ala 225 ACA AAC TAT Thr Asn Tyr CAG.TOT CAG GCA Gin Cys Gin Ala AAG ACA TOT Lys Thr Cys OTO TG Val Trp 245 AAT AAC TAC ATO TGC Asn Asri Tyr Met Cys 250 CGA TGC CIG GCT Arg -Cys Leu Ala CAG OAT TTT ATC Gin Asp Phe le TAT TCA AAT Tlyr Ser Asn O'TT GAA GAT Val Oiu Asp .265 GAC TCA ACC Asp Ser Thr GGA TTC CAT OAT Gly Phe His Asp TGT GGA CCC AAC AAG GAG =TGOAT GAA Cys Gly Pro Aen Lys Oiu Leu Asp Giu 280 ACC TOT CAG T= Thr Cys Gin Cys OTC TOC Val Cys 290 992 1040 1088 1136 AAG 000 000 Lys Gly Oiy AGA GAC TCA Arg Asp Ser 310 COO CCA TCT AGT Arg Pro Ser Ser GGA CCC CAC Gly Pro His AAA GAA CTA OAT Lys Giu Leu Asp .305 TTC CCT AAT TCA Phe Pro Aen Ser 320 TOT CAG TOT GTC Cys. Gin Cys Val .AAA AAC AAA CTT Lys Asn Lys Leu 119 TGT GGA GCC AAC AGG Cys Gly Ala Asn Arg 325 AAA AGA ACG TGT CCA Lys Arg Thr Cys Pro 340 TGT GAA TGT ACA GAA Cys Glu Cys Thr Glu 360 TTC CAC CAT CAA ACA Phe His His Gin Thr 375 GAA TTT Glu Phe 330 AGA AAT Arg Asn 345 AAC ACA Asn Thr TGC AGT Cys Ser GAT GAG AAT Asp Glu Aen CAG CCC CTG Gin Pro Leu CAG AAG TGC Gin Lys Cys 365 TGT TAC AGA Cys Tyr Arg ACA TGT CAG TGT GTA TGT Thr Cys 335 AAT CCT Asn Pro
CTT
Leu Gin Cys Val Cys GGG AAA TGT GCC Gly Lys Cys Ala 355 AAA GGG AAG AAG Lys Giy Lys Lys 370 TGT GCG AAT CGA Cys Ala Asn Arg 385 AGA CCG Arg Pro 380
LCC
;er CTG AAG CAT Leu Lys His 390 TGT GTC CCA Cys Val Pro TGT GAT Cys Asp Cc P2 TCG TAT TC Ser Tyr T3 CCAGTTTTCA GTCAGTCACA ACTGTCTATG CACAGAAAGA TrATrGAACC ATGTGGATI'A TTCAAAGACT GGTTTTCTGC AAAAGAATGA CTATATAAT1' GCAATAACAA TTGGTAAAGC TAAAATAAAA TGAAAATTGT :AGGA CTG~ :o Gly Leu 395 ;G AAA AGG .p Lys Arg 410
GTCATTTACT
CTCTGTGGGA
CTGCGGGAGA
CAGGGACCAG
TATTTCCACT
TCACTGTGAT
AT1'ATAAAAA TTT AGT GAA Phe Ser Glu GAA GTA TG( Glu Val Cyi 400 :CA CAT CTG ?ro His Leu
CTCTTGAAGA
CCACATGGTA
GGACTGGCAC
ACAGCTGAGG
AAAAATATI'G
CAGTATTT
AAAAAAAAAA
AAC TAAGATCATA Asn 415 CTGTTGGAAC AGCACTTAGC ACAGAGGCCC AAGTCTGTGT TCATGTGCAA AAAAAACCTC TTITCTCTT GTGATTTAAA TrCCTGCAT' CATTTT1ATA ATAACATGCA AAACTATGTT AAAAAAAAAA GCTT
CGC
Arg 1184 1232 1280 1328 1376 1422 1482 1542 1602 1662 1722 1782 1836 INFORMATION FOR SEQ ID NO:11: SEQUENCE CHARACTERISTICS: LENGTH: 415 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: Met His Leu Leu Cys Phe Leu Ser Leu Ala Cys Ser Leu Leu Ala Ala 1 5 10 Ala Leu Ile Pro Ser Pro Arg dlu Ala Pro Ala Thr Val Ala Ala Phe 25 120 Glu Ser Gly Leu Gly Phe Ser Giu Ala Glu Pro Asp Gly Gly Giu Val Lys Ser Tyr Thr Glu Pro Asn 145 Cys Leu Lys Ser Leu 225 Thr Asp His Cys Giu 305 Ala Val Lys Arg Ile Arg 130 Thr Cys Ser Pro Lys 210 Pro Asn Phe Asp Val 290 Leu Phe Giu Asp Giu Cys Gin Thr Gly 100 Leu Lys 115 Giu Val Phe Phe Asn Ser Lys Thr 180 Val Thr 195 Leu Asp Ala Thr Tyr Val Ile Phe 260 Val Cys 275 Cys Lys Asp Arg Gly Lys Leu Met 70 Leu Arg Asp Ser Ser Ile Cys Ile Lys Pro 150 Glu Gly 165 Leu Phe Ile Ser Val Tyr Leu Pro 230 Trp Asn 245 Tyr Se~r Gly Pro Gly Gly Asp Ser 310 Asp 55 Ser Lys Val Asp Asp 135 Pro Leu Giu Phe Arg 215 Gin Asn Asn Asn Leu 295 Cys 40 Leu Giu Val Leu Gly Gly Lys Phe 105 Asn Giu 120 Val Gly Cys Val Gin Cys Ile Thr 185 Ala Asn 200 Gin Vai Cys Gin Tyr Met Val Giu 265 Lys Giu 280 Arg Pro Gin Cys Glu Gin Tyr Pro 75 Trp Gin Ala Ala Trp Arg Lys Giu Ser Vai 155 Met Asn 170 Val Pro His Thr His Ser Ala Ala .235 Cys Arg 250 Asp Asp Leu Asp Ser Ser Val Cys 315 Leu Asp Gin Ala Lys Phe 140 Tyr Thr Leu Ser le 220 Asn Cys Ser Glu Cys 300 Lys Arg Ser Tyr Trp Pro Thr His Tyr 110 Thr Gin 125 Gly Ala Arg Cys Ser Thr Ser Gin 190 Cys Arg 205 le Arg Lys Thr Leu Ala Thr Asn 270 Asp Thr 285 Gly Pro Asn Lys Ser met Asn Thr Met Thr Gly 160 Tyr Pro Met Ser Pro 240 Gin Phe Gin Lys Phe 320 Pro Asn Ser Cys Gly Ala Asn Arg Glu Phe Asp Glu Asn Thr Cys Gin 335 Cys Val Cys Lys 340 Lys Cys Ala Cys 355 Gly Lys Lys Phe 370 121 Arg Thr Cys Pro Arg Asn Gin Pro 345 Glu Cys Thr Glu Asn Thr Gin Lys 360 His His Gin Thr Cys Ser Cys Tyr 375 390 Lys His Cys Asp Pro Gly Leu Ser Leu Asn Pro Giy 350 Phe Leu Lys Arg Pro Cys Asn Arg Leu Phe Ser Glu Val Cys Arg Cys Val Pro Ser Tyr Trp Ll 405 4 INFORMATION FOR SEQ ID NO:12: Wi SEQUENCE CHARACTERISTICS: LENGTH: 1741. base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDXA Pro His Lou Asn 415 (ix) FEATURE: NAME/KEY: CDS LOCATION: 453. .1706 (xi) SEQUENCE DESCRIPTION: SEQ ID VO:12: GCCCCCGCCG AGCGCTCCGC GCGCAGCCGC CGGGCCGGGC CGGCCCGCGG AGGGCGCGCr GCGAGCGGCC ACTGGGTCCT GCTTCCTCC TICCTCTCCC TCCTCCTCCT CCTCCTTCTC TCTGCGCTTT CCACCGCTCC CGAGCGAGCG CACOCTCGGA TGTCCGGTrr CCTGGTGGGT 7M"TTACCTG GCAAAGTCCG GATAACTrCG GTGAGAATTT GCAAAGAGGC TGGGAGCTCC CC!TGCAGGCG TCTGGGAGCT GcTGCCGCCG TCGCATCTIC TCCATCCCGC GGATT~AcT GCCTTGGATA TYGCGAGGGG AGGGAGGGGG GTGAGGACAG CAAAAAGAAA GGGGTGGGGG GOGGAGAGA AAAGGAAAAG AAGGAGCCTC GGAAT'T%3C CCGCATTCC T GCGCTGCCCC GCGGCCCCCC TCCGCTCrGC CATCTCCGCA CA ATG CAC Ti'G CTG GAG ATG CTC Met His Leu Lou Glu Met Lou TCC CTG GGC TGC TGC CTC GCr GCT GGC GCC GM~ CTC CMTG GA CCC COG Ser Lou Gly Cys Cys Lou Ala Ala Gly Ala Val Lou Lou Gly Pro Arg 15
CAG
Gln
GAG
Giu
GAA
Giu CTr Leu
GGT
Gly
CCG
Pro
GAG
Giu
GAG
Giu
TAC
Tyr
TGG
Trp Ccc Pro
CCC
Pro
CAG
Gin
CCA
Pro
CAA
Gin GTC GCC Val Ala GGT GCC Gly Ala TTG CGA Leu Arg GAA TAC Glu Tyr CAC AAC His Asn GCC GCC Ala Ala 30 GGG GAA Gly Giu 45 TCT GTG Ser Val TGG AAA Trp Lys AGG GAA Arg Glu GAT TCA TTG AAA TTT GCC GCA Asp Sex 105 AGT ATI Ser Ile 120 TGT GTG Cys Val AAA CCC Lys Pro GAA GGA Glu Gly TTG rrr Leu Phe 185 GTC AGT Val Ser 200 GTT TAC Val Tyr Leu
GAT
Asp
GAT
Asp
CCG
Pro
CTC
Leu 170
GAG
Giu Phe Lys
ACT
Thr
TTG
Leu
TGT
Cys 155
CAG
Gin
AT.
Ile
CC
Ala.
Phe
GAA
Giu
GGG
Gly 140
GTA
Val
TGT
Cys
ACA
Thr
AAT
Asn Ala Ala 1 110 TGG AGA Trp Arg 125 AAA GAG Lys Giu TCC ATC Ser Ile ATG AAT Met Asn GTG CCT Val Pro 19 0 CAC ACG His Thr 122 TAC GAG TCC GGG CAC Tyr Giu Ser Gly His CCC AAG GCT CAT GCA Pro Lys Ala His Ala 50 TCC ACT CTG CAT GAA Ser Ser Val Asp Giu 65 ATG TTC AAA TGT CAG Met Phe Lys Cys Gin 80 CAC TCC AGC TCT GAT His Ser Ser Ser Asp 95 GCA CAT TAT AAT CCA Ala His Tyr Asn Ala 115 AAA ACC CAG GGC ATG Lys Thr Gin Gly Met 130 rrr GGA GCA ACT ACA.
Phe Gly Ala Thr Thr.
145 TAC AGA TGT GGA GGT Tyr .Arg Cys Gly Gly 160 ATC AGC ACA AAT TAC Ile Ser Thr Asn Tyr 175 CTC TCT CAT CCC CCC2 Leu Ser His Gly Pro 195 TCC TGC CGA TGC ATG I Ser Cys Arg Cys Met 210 ATC ATA AGA CGT TCC1 Ile Ile Arg Arg Ser I GGC TA Gly Ty: AGC AAN Ser Lyi CTC AT( Leu, Met TTG AGC Leu Arc ACA AGP Thr Arg 100 GAG ATC Giu Ile CCA CGT Pro Arg AAC ACC Asn Thr TGC TGC Cys Cys 165 W.C AGC Ile Ser 180 %AA CCT ys Pro ~CT AAG er Lys .TG CCA ~eu Pro TAC GAG rTyr Giu k. GAC CTG Asp Leu ACA GTA -Thr Vai AAA GGA rLys Giy LTCA CAT fSer Asp CTG AAA Leu Lys GAA GTG Giu Val 135 TTC TrT Phe Phe 150 AAT ACT Asn Ser AAG ACA Lys Thr GTA ACA Val Thr TTG CAT Leu Asp 215 GCA ACA Ala Thr 230 569 617 665 713 761 809 857 905 953 1001 1049 1097 1145 1193 205 AGA CAA CTT CAT TCT.
Arg Gin Val His Ser 220 CAA ACT CAG Gin Thr Gin TCT CAT Cys His 235 GTG GCA AAC AAG ACC TGT CCA AAA .AAT CAT GTC Val Aia Asn Lys Thr Cys Pro Lys Asn His Vai 240 245 123 TGC TTA GCA CAG TGG AAT AAT CAG ATL' TGC AGA Trp Asn Asn Gin Ile Cys Arg 250 CAC GAT TTT GOT TTC Cys Leu 255 Ala Gin His Asp Phe Gly Phe TCT TCT CAC CTT GGA GAT TCT GAC Al~ Ser Ser His Leu Giy Asp Ser Asp Thr 265 270 GOG CCC AAC AAA GAG CTG GAT GAA GAA Gly Pro Asn Lys Giu Leu Asp Giu Giu 280 285 GGA GGT GTG CGG CCC ATA AGC TOT GGC Giy Gly Val Arg Pro Ile Ser Cys Gly 300 GCA TCA TOT CAG TGC ATG TGC AAA AAC Aia Ser Cys Gin Cys Met Cys Lys Asn 315 320 COG CCT AAC AAA GAA TTT GAT GAA GAA Gly Pro Asn Lys Giu Phe Asp Glu Giu 330 335 AAG ACC TOT CCC AAA CAT CAT CCA CTA Lys Thr Cys Pro Lys His His Pro Leu 345 350 GAA TGT ACA GAA TCT CCC AAT AAA TOT Giu Cys Thr Giu Ser Pro Asn Lys Cys 360 .365 CAT CAC CAG ACA TGC AGT TOT TAC AGA His His Gin Thr Cys Ser Cys Tyr Arg 380 AAA CGC TOT GAT OCT GGA TIT CTG TTA Lys Arg Cys Asp Ala Giy Phe Leu Leu 395 400
TCT
Ser
ACC
Thr
CCT
Pro 305
AAA
Lys
AAG
Lys
AAT
Asn
TTC
Phe
CCA
Pro 385
GCT
Ala
GAA
Giu
TOT
Cys 290
CAC
His
CTG
Leu
TGC
Cys
CCT
Pro
TTA
Leu 370
CCA
Pro GGA TTC Gly Phe 275 CAA TOC Gin Cys AAA GAA Lys Giu CTC CCC Leu Pro CAG TOT Gin Cys 340 OCA AAA Ala Lys 355 AAA OGA Lys Giy TOT ACA Cys Thr CAT ATT TOT His Ile Cys OTC TOC AAA Val Cys Lye 295 CTA GAC AGO Leu Asp Arg 310 AGT TCC TOT Ser Ser Cys 325 OTA TOT AAA Val Cys Lye TGC ATC TOC Cys Ile Cys AAA AGA TTr Lys Arg Phe 375 GTC CGA ACO Val Arg Thr 390 1241 1289 1337 1385 1433 1481 1529 1577 1625 1673 1726 1741 GAA GAA OTO TOC CCC TOT Glu Clu Val Cys Arg Cys 405 GTA COC ACA TCT TOG AAA AGA CCA CTI' ATG AAT TAAGCGAAGA AAGCACTACT Vai Arg Thr Ser Trp Lys Arg Pro Leu Met Asn 410 415 CGCTATATAG TGTCG INFORMATION FOR SEQ, ID NO:13: SEQUENCE CHARACTERISTICS: LENGTH: 418 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein 124 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: Met His Leu Leu Giu Met Leu Ser Leu G
I
Ala Ser Ala Val Lys Ser Tyr Gin Ala 145 Cys Thr His Arg Arg 225 Thr Ala Val Gly His Asp Cys Ser Asn Gly 130 Thr Gl y Asn Gly Cys 210 FArg ys Gln Leu His Ala Giu Gin Asp Ala 115 Met Thr Gly Tyr Pro 195 Met Ser Pro His Leu Gly Ser Leu Leu Thr 100 Glu Pro Asn Cys Ile 180 Lys Ser Leu Lys ksp 260
S
Gly Tyr Lys Met Arg Arg Ile Arg Thr Cys 165 Ser Pro Lys Pro Asn 245 Phe Pro Tyr Asp Thr 70 Lys Ser Leu Giu Phe 150 Asn Lys Val Leu Ala 230 His fly Arg Giu Leu 55 Val Gly Asp Lys Val 135 Phe Ser Thr Thr Asp 215 Thr Val Phe Gir Git 40 Git Leu Gly Asp Ser 120 Cys Lys Giu Leu Val 200 Val Gin Trp Ser Pro 25 Giu Giu Tyr Trp, Ser 105 Ile Val Pro Gly Phe 185 Ser Tyr Thr Asn Ser 265
P
P
G
P
G
L
A
A
1 25 ,ly Cys 10 ro Vai 'ro, Gly in Leu ro Giu 75 in His eu Lys sp Thr sp Leu ro Cys 155 eu Gin 70 lu Ile ie Ala :g Gin .n Cys 235 n5 Gin 0 Cys Al a Ala Arg Tyr Asn Phe Giu Gly 140 Val Cys Thr Asn V'al 220 H{is Ile *Leu AlMa *Gly Ser Trp Arg Ala Trp 125 Lys Ser met Val His 205 His Val Cys Ala Ala Glu Val Lys Giu Ala 110 Arg Glu Ile Asn Pro 190 Thr Ser rda Ala Tyr Pro Ser Met His Ala Lys Phe Tyr Ile 175 Leu Ser Ile Asn Cys 255 Gly Glu Lys Ser Phe Ser His Thr Gly Arg 160 Ser Ser Cys Ile Lys 240 Leu 'B Leu Gly Asp Ser Asp Thr 270 Ser Giu Gly Phe His Ile Cys Gly Pro Asn Lys Giu Leu Asp Giu Giu 125 Thr Cys Gin Cys Val Cys Lys Gly Gly Val Arg Pro Ile Ser Cys Gly 290 295 300 Pro His Lys Giu Leu Asp Arg Ala Ser Cys Gin Cys Met Cys Lys Asn 305 310 315 320 Lys Leu Leu Pro Ser Ser Cys Gly Pro Asn Lys Giu Phe Asp Glu Giu 325 330 335 Lys Cys Gin Cys Val Cys Lys Lys Thr Cys Pro Lys His His Pro Leu 340 345 350 Asn Pro Ala Lys Cys Ile Cys Giu Cys Thr Giu Ser Pro Asn Lys Cys 355 360 365 Phe Leu Lys Gly Lys Arg Phe His His Gin Thr Cys Ser Cys Tyr Arg 370 375 380 Pro Pro Cys Thr Val Arg Thr Lys Arg Cys Asp Ala Gly Phe Leu Leu 385 390 395 400 Ala Glu Giu Val Cys Arg Cys Val Arg Thr Ser Trp Lys Arg Pro Leu 405 410 415 Met Asn INFORMATION FOR SEQ ID N0.:14: SEQUENCE CHAR~ACTERISTICS: LENGTH: 10 amnino acids TYPE: amino acid STRA2NDEDNESS: not relevant TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:.
Ala Val Val Met Thr Gin Thr Pro Ala Ser 1 5 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 22,base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA 126 (xi) SEQUENCE DESCRIPTION: SEQ ID TCTCTTCTGT GCTTGAGTTG AG 22 INFORMATION FOR SEQ ID NO:l6: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l6: TCTCTTCTGT CCCTGAGTTG AG 22 INFORMATION FOR SEQ ID NO:17: SEQUENCE CHARACTERISTICS: LENGTH: 65 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l7: TGTGCTGCAG CAAAT7TTAT AGTCTCTTCT GTGGCGGCGG CGGCGGCGGG CGCCTCGCGA GGACC INFORMATION FOR SEQ ID NO:18: Wi SEQUENCE CHARACTERISTICS: LENGTH: 30 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTIO N: SEQ ID NO:18: CTGGCAGGGA ACTGCTAATA ATGGAATGAA 1 127 INFORMATION FOR SEQ ID NO:19: SEQUENCE CHARACTERISTICS: LENGTH: 84 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: GGGCTCCGCG TCCGAGAGGT CGAGTCCGGA CTCGTGATGG TGATGGTGAT GGGCGGCGGC GGCGGCGGGC GCCTCGCGAG GACC 84 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 31 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID GTATTATAAT GTCCTCCACC AAATTTTATA G 31 INFORMATION FOR SEQ ID NO:21: SEQUENCE CHARACTERISTICS: LENGTH: 93 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: GTTCGCTGCC TGACACTGTG GTAGTGTTGC TGGCGGCCGC TAGTGATGGT GATGGTGATG AATAATGGAA TGAACTTGTC TGTAAACATC CAG 93 INFORMATION FOR SEQ ID NO:22: 128 SEQUENCE CHARACTERISTICS: LENGTH: 18 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: CATGTACGAA CCGCCAGG 18 INFORMATION FOR SEQ ID NO:23: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: AATGACCAGA GAGAGGCGAG INFORMATION FOR SEQ ID NO:24: SEQUENCE CHARACTERISTICS: LENGTH: 18 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: GCCACGGTAG GTCTGCGT 18 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 18 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear 129 (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID TTTCTTTGAC AGGCTTAT 18 INFORMATION FOR SEQ ID NO:26: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: ATCTTGAAAA GTAAGTATGG G 21 INFORMATION FOR SEQ ID NO:27: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: ATGACTTGAC AGGTATTGAT INFORMATION FOR SEQ ID NO:28: SEQUENCE
CHARACTERISTICS:
LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 130 AGCAAGACGG TGGGTAT1'GT INFORMATION FOR SEQ ID NO:29: Wi SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRAN~DEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: CCCTrCTITG TAGTTATTTG AA INFORM4ATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANflEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID CCACAGTGAG TATGAATTAA INFORMATION FOR SEQ ID NO:31: Ci SEQUENCE CHARACTERISTICS: LENGTH: 18 base pairs TYPE: nucleic acid CC) STRANDEDNESS: single CD) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: TTCTI'CCAAA GGTGTCAG INFORMATION FOR SEQ ID NO:32: Ci SEQUENCE CHARACTERISTICS: LENGTH: 18 base pairs 131 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: GGAGATGGTA GCAGAATG 18 INFORMATION FOR SEQ ID NO:33: SEQUENCE CHARACTERISTICS: LENGTH: 23 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: CTATTTGTCT AGACTCAACA GAT 23 INFORMATION FOR SEQ ID N0:34: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: CAAACATGCA GGTAAGAGAT CC 22 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) 132 (xi) SEQUENCE DESCRIPTION: SEQ ID TGTTCTCCTA GCTGTTACAG A 21 INFORMATION FOR SEQ ID NO:36: SEQUENCE CHARACTERISTICS: LENGTH: 24 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: GGCGAGGTCA AGGTAGGTGC AAGG 24 INFORMATION FOR SEQ ID NO:37: SEQUENCE CHARACTERISTICS: LENGTH: 26 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: ATTGTCTTTG ACAGGCTTTT TGAAGG 26 INFORMATION FOR SEQ ID NO:38: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: GAGATCCTGA AAAGTAAGTA
G
133 INFORMATION FOR SEQ ID NO:39: SEQUENCE CHARACTERISTICS: LENGTH: 24 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE:. DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: TGTGACTCGA CAGGTATTGA TAAT INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID CTCAGCAAGA CGGTAGGTAT INFORMATION FOR SEQ ID NO:41: SEQUENCE CHARACTERISTICS: LENGTH: 25 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: TTGTCCCTTG TAGTTGTTTG
AAATT
INFORMATION FOR SEQ ID NO:42: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: single 134 TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: ACATTACCAC AGTGAGTATG INFORMATION FOR SEQ ID NO:43: SEQUENCE CHARACTERISTICS: LENGTH: 26 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: GTCTCCCCAA AAGGTGTCAG GCAGCT 26 INFORMATION FOR SEQ ID NO:44: SEQUENCE CHARACTERISTICS: LENGTH: 23 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: AATGTTGAAG ATGGTAAGTA AAA 23 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 16 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) 135 (xi) SEQUENCE DESCRIPTION: SEQ ID TCTAGACTCA ACCAAT 1 INFORMATION FOR SEQ ID NO:46: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: CAAACATGCA GGTAAGGAGT GT 22 INFORM4ATION FOR SEQ ID NO:47: Wi SEQUENCE CHARACTERISTICS: LENGTH: 24 base, pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: TTTTCCCCTA GTI'G1-ACAG AAGA 24 INFORMATION FOR SEQ ID NO:4B: Wi SEQUENCE CHARACTERISTICS: LENGTH: 2991 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: G7 =rAAGTA GAGACGGGGT TTCACCAACG GTTGAAAATA TFTATCATGG TCTCCCTAAG ATGGACGGTG TTAGCTAGG.A TGGTCTCGAT cTCCTGACCT cATGATCCAC CCCCTCGGC 120 CTCCCAAjAGT GCTGGGATA CAGGCGTGAG CCACCGTGTC CGACCAACCT TAAGACAAAC 180 136 AACTACTGCA TGATTGTTTT TGGAGACC'fl 7"ITIATTC TTCTGACTC AAAGTATAGC AGCAGGAAGA TAACACTTrr CAGCTTrACTG CTGTXTIAA ATGAAACAGT AGTTAATATG
TATTTGAGT
TGCCTGTGT1'
TTAGACCAGT
CC1I-r'rGTGA
GGAAACTCCA
AGATCTATCA
GACAGCAAGA
TTGTTGATTT TCCAGTCT C TCTCAATrr GTITGCCTAT TAAGCCAGAA AGGCAGAAGG TGCCAAGTGC AATCAAAGTT CCTTCTATTC AAATCCTACC ATGTCTGAAG ATAACTATGG GAGTGATACA CTGACCATGT
ACCCGCTGCT
TAGAATCCTG
TGTACTCAAG
TAGAATCATT
CCAGTCTGCC
CAGGCTGATC
TCCAAATCAC
AAATAAATTT TTGCCAGCAT GTGAGAAAAA AGTrGAATA ATATTAATAT ArTrGGATA Ac3GCCTGTGG GTGTTGGAAA ATGTCCAAGC CTTACTCCAG CATCTGT7T TrCAAALATCT GTAATAGCAA ATGGTTGAAT CTAGCTGTr CTCTTrr'rAc AAATATGCAT AGAGCAGGAA AAAACATCTC AACAGGCTAG 240 300 360 420 480 540 600 660 720 780 ATCATGGACC GAGTCTGATG GGATGGAATr TACAGTAAAC TTAACTCCAC ATc3AAAAATr ATTGAGGAAT TGGCTGCCAG AAAATAAACT GCATGCTCAT TACAGCACTA GGAAAGCTAA CAGTATGGTT GATATTTAGC CAGGAGTAAA TGAAGGACTC AG'IIGTGGAA GCAATAGAGA GATAGAAGGA ACAGCCACAC AGAATGGAAG ATGGGTGAGG AGTTTCAAAT GATATTATGG ACCTATAACA TTTGATGTTC TGTACACTGT GCCCCAAGGA GGCAGCTAGC TGGGGAAATe3 CAGTCACTT CACATCTCTC ?rATCACCC~ TCATAAAGAT ACATAAAAAA GCATCT'rGGA GAATTTAGAC GTGACTAAGT AGCAGTACAT
AAATACAGGG
TTrGTrGACT
CAATCCAAGG
AGTACCGAGC
AAGGGGGGAA
G7TAGCI-rAG
TTAAGGGTAG
CTGTATCAAC
CATGTTCTCA
ACTGGAACTG
GGAGACCATG
TGTGAGTCAA CACTGTGATT AAAGGCATAC TGTCCATTCT ACCGCATACT TCATGAACAT TCATCT'rGGG GAATAAAAGG ATAAATATIT TCAAAATATT TAAAA~CTCCA GGAGTCAAAA CTCATCAATA AAGGGAGTGA TTAAGCCTT CAAGTAAATT GTGAGA7TAG ATTTATTGTG AATGCTrTTC AGCAGTGTrG GTTCCCTGTC CTrTTAGATA CTAGAGTTGA ACCAGATAAG TCTGTGTACT TAAAAGTAGA TGAAACCCCT TAGCGTGGTC CAATGCCACC AGCCCAACGG TCTTTGGATG CTACACATCC CCIIrTTTCAA
GTAAACCAAC
840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 AAAGTCTCTr
TTGGGAGTGA
CTCTGTAACC
AAACCCCAGT
TITCTGGAAT
TTCTTGCAGA
CACCTGTCAG
TGCATCCTGA
GAGAGAGGGG
CTTCCGGTAA
AAGGCAGACT
TGCTCACCCT
GCTTrCCAA
ATGTCTCACA
AGCTGACAAT
AAGTTTAGGA
GAACTGCGGT
ACCCCTCCGA
GTGTCTCTT
TTCCAAAGGG
TCTTGGATA
ACTCTCCACG AAGATTCTGG CCCTTCTC7T ATCA7TAGCA TCCATCCCAA CAGCCTGCAC TCAGGCAACT TTCAACTAC4
CAGACCAAGG
137
GGTCCCATAC
GGGGCCAGGC
CTCCCTCCAC
CACCTCTAAA
CCCGCTCCCC
TGGGGCCGCC
AGAGTGAGAG
ACATAAGCGC
CGCCGCCGCT
CCGCCGCCAG
CCGCAGCGCC
GGGCTCTGGC
CCTCTCCAAA
CCTCGCGGGC
ACACCCGCCG
AGCCCCGGAC
GGGGGAGAGG
CCCCCGCCAG
GGTrCTCTWI
GTGCGGGAGG.
CCCACGGTGC
GCCGGTCCCG
GCAGGGGACA
GGGGAGGAGG
GGGAGGGCAG
AGGCAGAGGG
CAGCCCGGCG
CCGCCCCGGC
CGCGGCCCGG
GGGTTTGGAG
AAGCTACACC
TCCGAATGCG
Ccl-rrCCCCG
CCGCTCCCGC
GACCAGGAGG
CGGACCGGTC
LCATAGTGATG
GAGGACAAGA
CCAGTI'TCTC
CCAACCGCCA
GGGGCGGGGA
CGAGGGAAAC
CCCGGGCTCG
CGCGTCAGTC
CGCTCTGGAG
GGCCCTCCTC
CTCCTCTCAC
GGGCTGAACA
GACGCGGACC
GGGAGCTCGG
GCACTGGCTG
CGCCTCCGGC
GGCCCGCGGC
CCCCACCCCC
ACC Irr'rr-ICC
ACTCGGGAGI
CCCGCTGCAC
GCCCCGGGAC
GGGAGAGATC
GGGGAGCTCC
GCACGCTCCC
ATGCCCTGCC
GATCCTGCGC
CCGCCCCCGG
TTCGGGGAAG
TCGCGGGGTG
GCGGCGGCGT
ATGTCCGGTr
GGAGGGCGCC
TCGCCCAGGG
CTCGCAGGGG
AAACTITGAG CAGGGCGCTG GGCCGAGGAT AAAGCGGGGG
*GTGGTCCAGG
TGAACTrGCC
CAGAGGGGGG
AGGGZ4GACGG
TCCCTCGGCC
CCTGCGCCCG
CGCGGCGCTC
CACCGCCGCC
GGGAGGGAGG
TCTGGTGTC
CCTCCCTCGC
TCCTGTGAGG
CTGCAAAGTT
GGGGTCGCCG
CGCCCGCGCC
GTGGTCGCAT
CCTCCGGCCG
CTGGGGGAGG
CTTCCGAGGG
GCTI CTCTC
CCGCCGCCGC
CCGGGCCCCG
AGCGCCCCCG
AGGGGGACGA
CCCCGCCCCG
CCTCGCTTCA
CTTTTACCTG
GGGAACGCGG
GGAGGAGCCC
CCACCCCTG
1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 2991 GGTCCTTCC.A CCATGCACTr G INFORMATION FOR SEQ ID NO:49: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYP E: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: CACGGCTrAT
GCAAGCAAAG
INFORMATION FOR SEQ ID SEQUENCE
CHARACTERISTICS:
LENGTH: 20 base pairs 138 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID AACACAGTTT TCCATAATAG INFORMATION FOR SEQ ID NO:51: SEQUENCE CHARACTERISTICS: LENGTH: 19 amino acids TYPE: amino acid STRANDEDNESS: not relevant TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: Leu Ser Lys Thr Val Ser Gly Ser Glu Gln Asp Leu Pro His Glu Leu 1 5 10 His Val Glu INFORMATION FOR SEQ ID NO:52: SEQUENCE CHARACTERISTICS: LENGTH: 25 base pairs TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: GACGGACACA GATGGAGGTT TAAAG INFORMATION FOR SEQ ID NO:53: SEQUENCE CHARACTERISTICS: LENGTH: 196 amino acids TYPE: amino acid STRANDEDNESS: not relevant 139 TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: met Arg Thr Leu Ala Cys Leu Leu Leu Leu Gly 1 5 10 His Val Leu Ala Glu Giu Ala Giu Ile Pro Arg 25 Leu Ala Arg Ser Gin Ile His Ser Ile Arg Asp 40 Glu Ile Asp Ser Val Gly Ser Glu Asp Ser Leu 55 Ala His Gly Val His Ala Thr Lys His Val Pro 70 75 Pro Ile Arg Arg Lys Arg Ser Ile Glu Glu Ala 90 Lys Thr Arg Thr Val Ile Tyr Giu Ile Pro Arg 100 105 Thr Ser Ala Asn Phe Leu Ile Trp Pro Pro Cys 115 120 Cys Thr Gly Cys Cys Asn Thr Ser Ser Val LysC 130 1351 Val His His Arg Ser Val Lys Val Ala Lys Val G 145 150 155 Lys Pro Lys Leu Lys Glu Val Gln Val Arg Leu G 165 170 Cys Ala Cys Ala Thr Thr Ser Leu Asn Pro AspT 180 185 Thr Asp Val Arg 195 INFORMATION FOR SEQ ID NO:54: i)SEQUENCE
CHARACTERISTICS:
LENGTH: 241 amino acids TYPE: amino acid STRAINDEDNESS: not relevant TOPOLOGY: linear (ii) MOLECULE TYPE: protein Cys Gl Giu Va Leu Gli Asp Th2 Glu Lys Val Pro Ser Gin lai Giu 125 .ys Gin ,iu Tyr lu Glu yr Arg Y Tyr Leu Ala 1 Ile Glu .Arg 2i Arg Leu Leu rSer Leu Arg Arg Pro Leu s0 Ala Val Cys Val Asp Pro 110 Val Lys Arg Pro Ser Arg Val Arg Lys 160 His Leu Giu 175 Giu Giu Asp 190 140 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: Met Asn Arg Cys Trp Ala Leu Phe Leu Ser Leu Cys Cys Tyr Leu Arg Leu Leu His Thr Arg Cys Arg Axg Gin 145 Lys Ala Pro Thr Val Ser Giy so Arg Ser Lys Thr Cys 130 Val Lys Cys Gly Ile 210 Ser Asp Asp Ser Leu Thr Asn 115 Se r Gin Pro Lys Giy 195 Arg Giu Ser Gly Ser Ser as Thr Asn Cys Arg Phe 165 Giu Gin Val Gly Asp Ile Arg Giu Giu 55 Gly Gly Leu Thr Giu Vai Phe Leu Cys Asn 135 Pro Val 150 Lys Lys Thr Val Giu Gin Arg Val 215 pro Ser 40 Asp Glu Ile Phe Val 120 Asn Gin Ala Pro Asp Ala Giu Giu 90 Ile Pro Asn Arg Val 170 Ala Lys Pro Giu Leu Leu Leu Ala Arg Cys Gin 140 le Leu Pro Pro Lys 220 Leu Gin Asp Ala met Arg Val 125 Cys Glu Glu Val Gin 205 Gly Tyr Glu Arg Leu Leu Asn A rg Gly Ile Ala Leu Ile 110 Giu Val Arg Pro Ile Val Asp His 175 Thr Arg 190 Thr Arg Lys His Ala Ala 185 Arg Ala 200 Arg Arg Lys Phe Lys His Thr His Asp Lys Thr Ala 225 230 Ala INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: Lys Glu Thr Leu 141 LENGTH: 149 amino acids TYPE: amino acid STRANDEDNESS: not relevant TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID Met Pro Val Met Arg Leu Phe Pro Cys Phe Leu Gin Leu Leu Ala Gly Leu Asn Arg Tyr Leu Val Asp Glu Ala Gly Ser Pro Arg Glu Arg Cys Leu Ser Tyr Ser Cys Thr Pro 115 Pro Ala Val Ser Glu Val Cys Arg Ala Glu Val Glu Thr Gly Cys Ala Asn Val 100 Ser Tyr Val Pro Glu Lou 55 His Cys Thr Glu Pro Gin 25 Val Val Glu Arg Met Phe Gly Asp Met Gln 105 Leu Thr 120 Gln Pro Leu Ser Glu 90 Leu Phe Ala Gin Asp Ser Leu Lys Gin Ser Ala Val Trp Val Ser Val Ser Cys Val Arg Ser 110 Val Arg Arg Pro Leu .Arg Glu Lys Met Lys Pro Glu Arg Cys Gly Asp 130 Ala Val Pro Arg Arg 145 INFOR~MATION FOR SEQ ID NO:56: i)SEQUENCE CHARACTERISTICS: LENGTH: 191 amino acids TYPE: amino acid STRANDEDNESS: not relevant TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 142 Met Ann Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Gly Gly Gin Asn His His Giu Arg Ser Tyr Cys His Pro Ile so 55 Tyr Pro Asp Giu Ile Glu Tyr 70 Met Arg Cys Gly Gly Cys Cys Thr Glu Giu Ser An Ile Thr 100 Gin Giy Gin His Ile Gly Giu 115 Glu Cys Arg Pro Lys Lys Asp 130 135 Pro Cys Ser Glu Arg Arg Lys 145 150 Cys Lys Cys Ser Cys Lys Asn 165 Ser Val 40 Glu le Asn Met met 120 Arg His Thr Ala Lys Leu Lys Giu 90 Ile Phe Arg Phe Ser 170 Cys Ala Pro Met Ala Glu Phe Met Asp Val Tyr Val Asp Ile Phe Giln Pro Ser Cys Val Pro Gly Leu Glu Cys Vai Met Arg Ile Lys Pro 110 Leu Gin His Aen Lys 125 Gin Giu Asn Pro Cys 140 Val Gin Asp Pro Gin 155 .Arg Cys Lye Ala Arg 175 Asp Lys Pro Arg Arg 190 Leu Glu Leu Asn Glu Arg Thr Cys
ISO
INFORMATION FOR SEQ ID NO:57: SEQUENCE CHARACTERISTICS: LENGTH: 188 amino acids TYPE: amino acid STRANDEDNESS: not relevant TOPOLOGY: linear (ii) MOLECULE TYPE: protein (Xi) met 1 Ala SEQUENCE DESCRIPTION: SEQ ID 110:57: Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin Leu 5 10 Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His Gin .25 -143 Asp Val Tyr 40 Arg Lys Val Vai Ser Trp Ile Thr Arg Ala Thr Cys Gin Pro Arg Giu Ala Lys Gin Val Pro Pro Ser Cys Vai Giu Lys Cys 145 Cy r Asn Asp Gin 100 Leu Ala His Arg Thr 180 Asp Gly Leu Ile Leu Met Giu Glu His Val Lys Pro 135 His Gin Arg 150 Ser Phe Leu 165 Cys Arg Cys Glu Cys Ile Arg 105 Ser Gin 120 Asp Ser Pro Asp Val Glu Thr Val 75 Val Pro 90 Tyr Pro Cys Giu Pro Arg Pro Arg 155 Gin Gly 170 Leu Arg Thr Gly Ser Ser Cys Arg 125 Pro Leu 140 Thr Cys Thr Val Gly Gly His Gin Leu Giy Lys Lys Pro Arg Cys Arg 160 Giu Leu Arg Arg Pro Asp Arg Gly Leu 175 Arg INFORMATION FOR SEQ ID NO:58: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 419 amino acids TYPE: amino acid STRANDEDNESS: not relevant TOPOLOGY: linear (ii) MOLECULE TYPE: protein (ix) FEATURE: NAME/KEY: other LOCATION: 156 OTHER INFORMATION: /note= "codon 156 can be anything other than cysteine, or can be nothing" (xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: Met His Leu Leu Gly Phe Phe Ser Val Aia Cys Ser Leu Leu Ala Ala 1 5 10 144 Ala Leu Leu Pro Gly Pro Arg Glu Ala Pro Ala Ala-Ala Ala Ala Phe 25 Glu Ser Gly Leu Asp Leu Ser Asp Ala Giu Pro Asp Ala Gly Giu Ala 40 Thr Ala Tyr Ala Ser Lys Asp Leu Glu Glu Gln Leu Axg Ser Val Ser 55 Ser Val Asp Glu Leu Met Thr Val Leu Tyr Pro Glu Tyr Trp Lys Met 70 75 Tyr Lys Cys Gin Leu Axg Lys Gly Gly Trp Gln His Asn Arg Giu Gin 90 Ala Asn Leu Asn Ser Arj Thr Giu Giu Thr Ile Lys Phe Ala Ala Ala 100 105 110 His Tyr Asn Thr Glu Ile Leu Lys Ser Ile Asp Asn Giu Trp Arg Lys 115 120 125 Thr Gin Cys Met Pro Arg Glu Val Cys Ile Asp Val Gly Lys Glu Phe 130 135 140 Gly Val. Ala Thr Asn Thr Phe Phe Lys Pro Pro Xaa Val Ser Val Tyr 145 150 155 160 Arg Cys Gly Gly Cys Cys Asn Ser Glu Gly Leu Gln Cys Met Asn Thr 165 170 175 Ser Thr Ser Tyr Leu Ser Lys Thr Leu Phe Giu Ile Thr Val Pro Leu 180 185 190 Ser Gin Gly Pro Lys Pro Val Thr Ile Ser Phe Ala Asn His Thr Ser 195 200 205 Cys Arg Cys Met Ser Lys Leu Asp Val Tyr Arg Gin Val. His Ser Ile 210 215 220 Ile Arg Arg Ser Leu Pro Ala Thr Leu Pro Gin Cys Gin Ala Ala Asn 225 230 235 240 Lys Thr Cys Pro Thr Asn Tyr Met Trp Asn Asn His Ile Cys Arg Cys 245 250 255 Leu Ala Gin Glu Asp Phe Met Phe Ser Ser Asp Ala Gly Asp Asp Ser 260 265 270 Thr Asp Gly Phe His Asp Ile Cys Gly Pro Asn Lys Glu Leu Asp Glu 275 280 285 Glu Thr Cys Gin Cys Val. Cys Arg Ala Gly Leu .Arg Pro Ala Ser Cys 290 295 300 Gly Pro His Lys Giu Leu Asp Arg Asn Ser Cys Gin Cys Val. Cys Lys 305 310 z-2 -14S Asn Lys Leu Phe Pro Ser Gin Cys Gly Ala 325 330 Asn Thr Cys Gin Cys Val Cys Lys Arg Thr 340 345 Leu Asn Pro Gly Lys Cys Ala Cys Glu Cys 355 360 Cys Leu Leu Lys Gly Lys Lys Phe His His 370 375 Axg Arg Pro Cys Thr Ann Arg Gin Lys Ala 385 390 Tyr Ser Glu Glu Val Cys Arg Cys Val Pro 405 410 Gin Met Ser INFORM4ATION FOR SEQ ID NO:59: Wi SEQUENCE CHARACTERISTICS: LENGTH: 160 amino acids TYPE: amino acid STRANDEDNESS: not relevant TOPOLOGY: linear (ii) MOLECULE TYPE: protein Arg Giu Pro Arg Giu Ser 365 Thr Cys 380 Glu Pro Tyr Trp Phe Asp 335 Asn Gin 350 Pro Gin Ser Cys Gly Phe Lys Arg 415 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: Met His Lou Leu Gly Phe Phe Ser Val Ala 10 Ala Leu Lou Pro Gly Pro Arg Giu Ala Pro 25 Giu Giu Thr Ile Lys Phe Ala Ala Ala His 40 Lys Ser Ile Asp Ann Giu Trp Arg Lys Thr 55 Val Cys Ile Asp Val Gly Lys Glu Phe Gly Phe Lys Pro Pro Cys Val Ser Val Tyr Arg 90 Ser Glu Gly Lou Gin Cys Met Asn Thr Ser 100 10S Ser Leu Leu Ala is Ala Ala Ala Ala Asn Thr Giu Ile Cyr. Met Pro Arg Ala Thr Ann Thr Gly Gly Cys Cys Ser Tyr Lou Ser 110 -146- Thr Leu Phe Giu Ile Thr Val Pro Leu Ser Gin Gly Pro Lys Pro Val 115 120 .125 Thr Ile Ser Phe Ala Asn His Thr Ser Cys Arg Cys Met Ser Lys Leu 130 135 140 Asp Val Tyr Arg Gin Val His Ser Ile Ile His His His His His His 145 150 155 160