US5821056A

US5821056A - Growth differentiation factor-9

Info

Publication number: US5821056A
Application number: US08/491,835
Authority: US
Inventors: Se-Jin Lee
Original assignee: School of Medicine of Johns Hopkins University
Current assignee: School of Medicine of Johns Hopkins University
Priority date: 1993-01-12
Filing date: 1994-01-12
Publication date: 1998-10-13
Anticipated expiration: 2015-10-13
Also published as: US6191261B1

Abstract

Growth differentiation factor-9 (GDF-9) is disclosed along with its polynucleotide sequence and amino acid sequence. Also disclosed are diagnostic and therapeutic methods of using the GDF-9 polypeptide and polynucleotide sequences.

Description

This application is a 371 application of PCT US94/00685, filed Jan. 12, 1994, which is a continuation-in-part of U.S. Ser. No. 08/003,303, filed Jan. 12, 1993, now abandoned.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The invention relates generally to growth factors and specifically to a new member of the transforming growth factor beta (TGF-β) superfamily, which is denoted, growth differentiation factor-9 (GDF-9).

2. Description of Related Art

The transforming growth factor β (TGF-β) superfamily encompasses a group of structurally-related proteins which affect a wide range of differentiation processes during embryonic development. The family includes, Mullerian inhibiting substance (MIS), which is required for normal male sex development (Behringer, et al., Nature, 345:167, 1990), Drosophila decapentaplegic (DPP) gene product, which is required for dorsal-ventral axis formation and morphogenesis of the imaginal disks (Padgett, et al., Nature, 325:81-84, 1987), the Xenopus Vg-1 gene product, which localizes to the vegetal pole of eggs ((Weeks, et al., Cell, 51:861-867, 1987), the activins (Mason, et al., Biochem, Biophys. Res. Commun., 135:957-964, 1986), which can induce the formation of mesoderm and anterior structures in Xenopus embryos (Thomsen, et al., Cell, 63:485, 1990), and the bone morphogenetic proteins (BMPs, osteogenin, OP-1) which can induce de novo cartilage and bone formation (Sampath, et al., J. Biol. Chem., 265:13198, 1990). The TGF-βs can influence a variety of differentiation processes, including adipogenesis, myogenesis, chondrogenesis. hematopoiesis, and epithelial cell differentiation (for review, see Massague, Cell 49:437, 1987).

The proteins of the TGF-β family are initially synthesized as a large precursor protein which subsequently undergoes proteolytic cleavage at a cluster of basic residues approximately 110-140 amino acids from the C-terminus. The C-terminal regions of the proteins are all structurally related and the different family members can be classified into distinct subgroups based on the extent of their homology. Although the homologies within particular subgroups range from 70% to 90% amino acid sequence identity, the homologies between subgroups are significantly lower, generally ranging from only 20% to 50%. In each case, the active species appears to be a disulfide-linked dimer of C-terminal fragments. For most of the family members that have been studied, the homodimeric species has been found to be biologically active, but for other family members, like the inhibins (Ung, et al., Nature, 321:779, 1986) and the TGF-,βs (Cheifetz, et al., Cell, 48:409, 1987), heterodimers have also been detected, and these appear to have different biological properties than the respective homodimers.

The inhibins and activins were originally purified from follicular fluid and shown to have counteracting effects on the release of follicle-stimulating hormone by the pituitary gland. Although the mRNAs for all three inhibin/activin subunits (αa, βA and βB) have been detected in the ovary, none of these appear to be ovary-specific (Meunier, et al., Proc.Natl.Acad.Sci. USA, 85:247,1988). MIS has also been shown to be expressed by granulosa cells and the effects of MIS on ovarian development have been documented both in vivo in transgenic mice expressing MIS ectopically (Behringer, supra) and in vitro in organ culture (Vigier, et al., Development, 100:43, 1987).

Identification of new factors that are tissue-specific in their expression pattern will provide a greater understanding of that tissue's development and function.

SUMMARY OF THE INVENTION

The present invention provides a cell growth and differentiation factor, GDF-9, a polynucleotide sequence which encodes the factor and antibodies which are immunoreactive with the factor. This factor appears to relate to various cell proliferative disorders, especially those involving ovarian tumors, such as granulosa cell tumors.

Thus, in one embodiment, the invention provides a method for detecting a cell proliferative disorder of ovarian origin and which is associated with GDF-9. In another embodiment, the invention provides a method of treating a cell proliferative disorder associated with abnormal levels of expression of GDF-9, by suppressing or enhancing GDF-9 activity.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows expression of GDF-9 mRNA in adult tissues.

FIG. 2a-c shows nucleotide and predicted amino acid sequence of murine GDF-9 (SEQ ID NO:3 and SEQ ID No:4, respectively). Consensus N-glycosylation signals are denoted by plain boxes. The putative tetrabasic processing sites are denoted by stippled boxes. The in-frame termination codons upstream of the putative initiating ATG and the consensus polyadenylation signals are underlined. The poly A tails are not shown. Numbers indicate nucleotide position relative to the 5' end.

FIG. 3a and FIG. 3b shows the alignment of the C-terminal sequences of GDF-9 with other members of the TGF-β family (SEQ ID NO:5-25). The conserved cysteine residues are shaded. Dashes denote gaps introduced in order to maximize alignment.

FIG. 4a and FIG. 4b shows amino acid homologies among the different members of the TGF-β superfamily. Numbers represent percent amino acid identities between each pair calculated from the first conserved cysteine to the C-terminus. Boxes represent homologies among highly-related members within particular subgroups.

FIG. 5 shows the immunohistochemical localization of GDF-9 protein. Adjacent sections of an adult ovary were either stained with hematoxyiin and eosin (FIG. 5a) or incubated with immune (FIG. 5b) or pre-immune (FIG. 5c) serum at a dilution of 1:500. Anti-GDF-9 antiserum was prepared by expressing the C-terminal portion of murine GDF-9 (residues 308-441) in bacteria, excising GDF-9 protein from preparative SOS gels, and immunizing rabbits. Sites of antibody binding were visualized using the Vectastain ABC kit (Vector Labs).

FIG. 6 shows a comparison of the predicted amino acid sequences of murine (top lines) and human (bottom lines) GDF-9 (SEQ ID NO:4 and SEQ ID NO:26, respectively). Numbers represent amino acid positions relative to the N-termini. Vertical lines represent sequence identities. Dots represent gaps introduced in order to maximize the alignment. The clear box shows the predicted proteolytic processing sites. The shaded boxes show the cysteine residues in the mature region of the proteins. The bars at the bottom show a schematic of the pre-(clear) and mature (shaded) regions of GDF-9 with the percent sequence identities between the murine and human sequences shown below.

FIG. 7 shows in situ hybridization to adult ovary sections using a GOF-9 RNA probe. ³⁵ S!-labeled anti-sense (FIGS. 7a and 7c) or sense (FIGS. 7b and 7d) GDF-9 RNA probes were hybridized to adjacent paraffin-embedded sections of ovaries fixed in 4% paraformaldehyde. Sections were dipped in photographic emulsion, exposed, developed, and then stained with hematoxylin and eosin. Two representative fields are shown.

FIG. 8 shows in situ hybridization to a postnatal day 4 ovary section using an antisense GDF-9 RNA probe. Sections were prepared as described for FIG. 7. Following autoradiography and staining, the section was photographed under bright-field (FIG. 8a) or dark-field (FIG. 8b) illumination.

FIG. 9 shows in situ hybridization to postnatal day 8 ovary sections using an antisense (FIG. 9a) or sense (FIG. 9b) GDF-9 RNA probe. Sections were prepared as described for FIG. 7.

FIG. 10 shows in situ hybridization to adult oviduct sections using an antisense (FIG. 10a) or sense (FIG. 10b) GDF-9 RNA probe. Sections were prepared as described for FIG. 7.

FIG. 11 shows in situ hybridization to an adult oviduct (0.5 days following fertilization) section using an antisense GDF-9 RNA probe. Sections were prepared as described for FIG. 7. Following autoradiography and staining, the section was photographed under bright-field (FIG. 11a) or dark-field (FIG. 11b) illumination.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides a growth and differentiation factor, GDF-9 and a polynucleotide sequence encoding GDF-9. Unlike other members of the TGF-β superfamily, GDF-9 expression is highly tissue specific, being expressed in cells primarily in ovarian tissue. In one embodiment, the invention provides a method for detection of a cell proliferative disorder of the ovary, which is associated with GDF-9 expression. In another embodiment, the invention provides a method for treating a cell proliferative disorder associated with abnormal expression of GDF-9 by using an agent which suppresses or enhances GDF-9 activity.

The TGF-β superfamily consists of multifunctionaly polypeptides that control proliferation, differentiation, and other functions in many cell types. Many of the peptides have regulatory, both positive and negative, effects on other peptide growth factors. The structural homology between the GDF-9 protein of this invention and the members of the TGF-β family, indicates that GDF-9 is a new member of the family of growth and differentiation factors. Based on the known activities of many of the other members, it can be expected that GDF-9 will also possess biological activities that will make it useful as a diagnostic and therapeutic reagent.

For example, another regulatory protein that has been found to have structural homology with TGF-β is inhibin, a specific and potent polypeptide inhibitor of the pituitary secretion of FSH. Inhibin has been isolated from ovarian follicular fluid. Because of its suppression of FSH, inhibin has potential to be used as a contraceptive in both males and females. GDF-9 may possess similar biological activity since it is also an ovarian specific peptide. Inhibin has also been shown to be useful as a marker for certain ovarian tumors (Lappohn, et al., N. Engl. J. Med., 321:790, 1989). GDF-9 may also be useful as a marker for identifying primary and metastatic neoplasms of ovarian origin. Similarly, GDF-9 may be useful as an indicator of developmental anomalies in prenatal screening procedures.

Another peptide of the TGF-β family is MIS, produced by the testis and responsible for the regression of the Mullerian ducts in the male embryo. MIS has been show to inhibit the growth of human ovarian cancer in nude mice (Donahoe, et al., Ann. Surg., 194:472, 1981). GDF-9 may function similarly and may, therefore, be useful as an anti-cancer agent, such as for the treatment of ovarian cancer.

GDF-9 may also function as a growth stimulatory factor and, therefore, be useful for the survival of various cell populations in vitro. In particular, if GDF-9 plays a role in oocyte maturation, it may be useful in in vitro fertilization procedures, e.g., in enhancing the success rate. Many of the members of the TGF-β family are also important mediators of tissue repair. TGF-β has been shown to have marked effects on the formation of collagen and causes a striking angiogenic response in the newborn mouse (Roberts, et al., Proc. Natl. Acad. Sci. USA, 83:4167, 1986). GDF-9 may also have similar activities and may be useful in repair of tissue injury caused by trauma or burns for example.

The term "substantially pure" as used herein refers to GDF-9 which is substantially free of other proteins, lipids, carbohydrates or other materials with which it is naturally associated. One skilled in the art can purify GDF-9 using standard techniques for protein purification. The substantially pure polypeptide will yield a single major band on a non-reducing polyacrylamide gel. The purity of the GDF-9 polypeptide can also be determined by amino-terminal amino acid sequence analysis. GDF-9 polypeptide includes functional fragments of the polypeptide, as long as the activity of GDF-9 remains. Smaller peptides containing the biological activity of GDF-9 are included in the invention.

The invention provides polynucleotides encoding the GDF-9 protein. These polynucleotides include DNA, cDNA and RNA sequences which encode GDF-9. It is understood that all polynucleotides encoding all or a portion of GDF-9 are also included herein, as long as they encode a polypeptide with GDF-9 activity. Such polynucleotides include naturally occurring, synthetic, and intentionally manipulated polynucleotides. For example, GOF-9 polynucleotide may be subjected to site-directed mutagenesis. The polynucleotide sequence for GDF-9 also includes antisense sequences. The polynucleotides of the invention include sequences that are degenerate as a result of the genetic code. There are 20 natural amino acids, most of which are specified by more than one codon. Therefore, all degenerate nucleotide sequences are included in the invention as long as the amino acid sequence of GDF-9 polypeptide encoded by the nucleotide sequence is functionally unchanged.

Specifically disclosed herein is a cDNA sequence for GDF-9 which is 1712 base pairs in length and contains an open reading frame beginning with a methionine codon at nucleotide 29. The encoded polypeptide is 441 amino acids in length with a molecular weight of about 49.6 kD, as determined by nucleotide sequence analysis. The GDF-9 sequence contains a core of hydrophobic amino acids near the N-terminus, suggestive of a signal sequence for secretion. GDF-9 contains four potential N-glycosylation sites at asparagine residues 163, 229, 258, and 325 and a putative tetrabasic proteolytic processing site (RRRR) at amino acids 303-306. The mature C-terminal fragment of GDF-9 is predicted to be 135 amino acids in length and have an unglycosylated molecular weight of about 15.6 kD, as determined by nucleotide sequence analysis. One skilled in the art can modify, or partially or completely remove the glycosyl groups from the GDF-9 protein using standard techniques. Therefore, the functional protein or fragments thereof of the invention includes glycosylated, partially glycosylated and unglycosylated species of GDF-9.

The degree of sequence identity of GDF-9 with known TGF-β family members ranges from a minimum of 21% with Mullerian inhibiting substance (MIS) to a maximum of 34% with bone morphogenetic protein-4 (BMP-4). GDF-9 specifically disclosed herein differs from the known family members in its pattern of cysteine residues in the C-terminal region. GDF-9 lacks the fourth cysteine of the seven cysteines present in other family members; in place of cysteine at this position, the GDF-9 sequence contains a serine residue. This GDF-9 does not contain a seventh cysteine residue elsewhere in the C-terminal region.

Minor modifications of the recombinant GDF-9 primary amino acid sequence may result in proteins which have substantially equivalent activity as compared to the GDF-9 polypeptide described herein. Such modifications may be deliberate, as by site-directed mutagenesis, or may be spontaneous. All of the polypeptides produced by these modifications are included herein as long as the biological activity of GDF-9 still exists. Further, deletion of one or more amino acids can also result in a modification of the structure of the resultant molecule without significantly altering its biological activity. This can lead to the development of a smaller active molecule which would have broader utility. For example, one can remove amino or carboxy terminal amino acids which are not required for GDF-9 biological activity.

The nucleotide sequence encoding the GDF-9 polypeptide of the invention includes the disclosed sequence and conservative variations thereof. The term "conservative variation" as used herein denotes the replacement of an amino acid residue by another, biologically similar residue. Examples of conservative variations include the substitution of one hydrophobic residue such as isoleucine, valine, leucine or methionine for another, or the substitution of one polar residue for another, such as the substitution of arginine for lysine glutamic for aspartic acids, or glutamine for asparagine, and the like. The term "conservative variation" also includes the use of a substituted amino acid in place of an unsubstituted parent amino acid provided that antibodies raised to the substituted polypeptide also immunoreact with the unsubstituted polypeptide.

DNA sequences of the invention can be obtained by several methods. For example, the DNA can be isolated using hybridization techniques which are well known in the art. These include, but are not limited to: 1) hybridization of genomic or cDNA libraries with probes to detect homologous nucleotide sequences and 2) antibody screening of expression libraries to detect cloned DNA fragments with shared structural features.

Preferably the GDF-9 polynucleotide of the invention is derived from a mammalian organism, and most preferably from a mouse, rat, or human. Screening procedures which rely on nucleic acid hybridization make it possible to isolate any gene sequence from any organism, provided the appropriate probe is available. Oligonucleotide probes, which correspond to a part of the sequence encoding the protein in question, can be synthesized chemically. This requires that short, oligopeptide stretches of amino acid sequence must be known. The DNA sequence encoding the protein can be deduced from the genetic code, however, the degeneracy of the code must be taken into account. It is possible to perform a mixed addition reaction when the sequence is degenerate. This includes a heterogeneous mixture of denatured double-stranded DNA. For such screening, hybridization is preferably performed on either single-stranded DNA or denatured double-stranded DNA. Hybridization is particularly useful in the detection of cDNA clones derived from sources where an extremely low amount of mRNA sequences relating to the polypeptide of interest are present. In other words, by using stringent hybridization conditions directed to avoid non-specific binding, it is possible, for example, to allow the autoradiographic visualization of a specific cDNA clone by the hybridization of the target DNA to that single probe in the mixture which is its complete complement (Wallace, et al., Nucl. Acid Res., 9:879, 1981).

The development of specific DNA sequences encoding GDF-9 can also be obtained by: 1) isolation of double-stranded DNA sequences from the genomic DNA; 2) chemical manufacture of a DNA sequence to provide the necessary codons for the polypeptide of interest; and 3) in vitro synthesis of a double-stranded DNA sequence by reverse transcription of mRNA isolated from a eukaryotic donor cell. In the latter case, a double-stranded DNA complement of mRNA is eventually formed which is generally referred to as cDNA.

Of the three above-noted methods for developing specific DNA sequences for use in recombinant procedures, the isolation of genomic DNA isolates is the least common. This is especially true when it is desirable to obtain the microbial expression of mammalian polypeptides due to the presence of introns.

The synthesis of DNA sequences is frequently the method of choice when the entire sequence of amino acid residues of the desired polypeptide product is known. When the entire sequence of amino acid residues of the desired polypeptide is not known, the direct synthesis of DNA sequences is not possible and the method of choice is the synthesis of cDNA sequences. Among the standard procedures for isolating cDNA sequences of interest is the formaton of plasmid- or phage-carrying cDNA libraries which are derived from reverse transcription of mRNA which is abundant in donor cells that have a high level of genetic expression. When used in combination with polymerase chain reaction technology, even rare expression products can be cloned. In those cases where significant portions of the amino acid sequence of the polypeptide are known, the production of labeled single or double-stranded DNA or RNA probe sequences duplicating a sequence putatively present in the target cDNA may be employed in DNA/DNA hybridization procedures which are carried out on cloned copies of the cDNA which have been denatured into a single-stranded form (Jay, et al., Nucl. Acid Res., 11:2325, 1983).

A cDNA expression library, such as lambda gt11, can be screened indirectly for GDF-9 peptides having at least one epitope, using antibodies specific for GDF-9. Such antibodies can be either polyclonally or monoclonally derived and used to detect expression product indicative of the presence of GDF-9 cDNA.

DNA sequences encoding GDF-9 can be expressed in vitro by DNA transfer into a suitable host cell. "Host cells" are cells in which a vector can be propagated and its DNA expressed. The term also includes any progeny of the subject host cell. It is understood that all progeny may not be identical to the parental cell since there may be mutations that occur during replication. However, such progeny are included when the term "host cell" is used. Methods of stable transfer, meaning that the foreign DNA is continuously maintained in the host, are known in the art.

In the present invention, the GDF-9 polynucleotide sequences may be inserted into a recombinant expression vector. The term "recombinant expression vector" refers to a plasmid, virus or other vehicle known in the art that has been manipulated by insertion or incorporation of the GDF-9 genetic sequences. Such expression vectors contain a promoter sequence which facilitates the efficient transcription of the inserted genetic sequence of the host. The expression vector typically contains an origin of replication, a promoter, as well as specific genes which allow phenotypic selection of the transformed cells. Vectors suitable for use in the present invention include, but are not limited to the T7-based expression vector for expression in bacteria (Rosenberg, et al., Gene ,56:125, 1987), the pMSXND expression vector for expression in mammalian cells (Lee and Nathans, J. Biol. Chem., 263:3521, 1988) and baculovirus-derived vectors for expression in insect cells. The DNA segment can be present in the vector operably linked to regulatory elements, for example, a promoter (e.g., T7, metallothionein I, or polyhedrin promoters).

Polynucleotide sequences encoding GDF-9 can be expressed in either prokaryotes or eukaryotes. Hosts can include microbial, yeast, insect and mammalian organisms. Methods of expressing DNA sequences having eukaryotic or viral sequences in prokaryotes are well known in the art. Biologically functional viral and plasmid DNA vectors capable of expression and replication in a host are known in the art. Such vectors are used to incorporate DNA sequences of the invention.

Transformation of a host cell with recombinant DNA may be carried out by conventional techniques as are well known to those skilled in the art. Where the host is prokaryotic, such as E. coli, competent cells which are capable of DNA uptake can be prepared from cells harvested after exponential growth phase and subsequently treated by the CaCl₂ method using procedures well known in the art. Alternatively, MgCl₂ or RbCl can be used. Transformation can also be performed after forming a protoplast of the host cell if desired.

When the host is a eukaryote, such methods of transfection of DNA as calcium phosphate co-precipitates, conventional mechanical procedures such as microinjection, electroporation, insertion of a plasmid encased in liposomes, or virus vectors may be used. Eukaryotic cells can also be cotransformed with DNA sequences encoding the GDF-9 of the invention, and a second foreign DNA molecule encoding a selectable phenotype, such as the herpes simplex thymidine kinase gene. Another method is to use a eukaryotic viral vector, such as simian virus 40 (SV40) or bovine papilloma virus, to transiently infect or transform eukaryotic cells and express the protein. (see for example, Eukaryotic Viral Vectors, Cold Spring Harbor Laboratory, Gluzman ed., 1982).

Isolation and purification of microbial expressed polypeptide, or fragments thereof, provided by the invention, may be carried out by conventional means including preparative chromatography and immunological separations involving monoclonal or polyclonal antibodies.

The invention includes antibodies immunoreactive with GDF-9 polypeptide or functional fragments thereof. Antibody which consists essentially of pooled monoclonal antibodies with different epitopic specificities, as well as distinct monoclonal antibody preparations are provided. Monoclonal antibodies are made from antigen containing fragments of the protein by methods well known to those skilled in the art (Kohler, et al., Nature, 256:495, 1975). The term antibody as used in this invention is meant to include intact molecules as well as fragments thereof, such as Fab and F(ab')₂, which are capable of binding an epitopic determinant on GDF-9.

The term "cell-proliferative disorder" denotes malignant as well as non-malignant cell populations which often appear to differ from the surrounding tissue both morphologically and genotypically. The GDF-9 polynucleotide that is an antisense molecule is useful in treating malignancies of the various organ systems, particularly, for example, the ovaries. Essentially, any disorder which is etiologically linked to altered expression of GDF-9 could be considered susceptible to treatment with a GDF-9 suppressing reagent.

The invention provides a method for detecting a cell proliferative disorder of the ovary which comprises contacting an anti-GDF-9 antibody with a cell suspected of having a GDF-9 associated disorder and detecting binding to the antibody. The antibody reactive with GDF-9 is labeled with a compound which allows detection of binding to GDF-9. For purposes of the invention, an antibody specific for GDF-9 polypeptide may be used to detect the level of GDF-9 in biological fluids and tissues. Any specimen containing a detectable amount of antigen can be used. A preferred sample of this invention is tissue of ovarian origin, specifically tissue containing granulosa cells or ovarian follicular fluid. The level of GDF-9 in the suspect cell can be compared with the level in a normal cell to determine whether the subject has a GDF-9-associated cell proliferative disorder. Preferably the subject is human.

The antibodies of the invention can be used in any subject in which it is desirable to administer in vitro or in vivo immunodiagnosis or immunotherapy. The antibodies of the invention are suited for use, for example, in immunoassays in which they can be utilized in liquid phase or bound to a solid phase carrier. In addition, the antibodies in these immunoassays can be detectably labeled in various ways. Examples of types of immunoassays which can utilize antibodies of the invention are competitive and non-competitive immunoassays in either a direct or indirect format. Examples of such immunoassays are the radioimmunoassay (RIA) and the sandwich (immunometric) assay. Detection of the antigens using the antibodies of the invention can be done utilizing immunoassays which are run in either the forward, reverse, or simultaneous modes, including immunohistochemical assays on physiological samples. Those of skill in the art will know, or can readily discern, other immunoassay formats without undue experimentation.

The antibodies of the invention can be bound to many different carriers and used to detect the presence of an antigen comprising the polypeptide of the invention. Examples of well-known carriers include glass, polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, polyacrylamides, agaroses and magnetite. The nature of the carrier can be either soluble or insoluble for purposes of the invention. Those skilled in the art will know of other suitable carriers for binding antibodies, or will be able to ascertain such, using routine experimentation.

There are many different labels and methods of labeling known to those of ordinary skill in the art. Examples of the types of labels which can be used in the present invention include enzymes, radioisotopes, fluorescent compounds, colloidal metals, chemiluminescent compounds, phosphorescent compounds, and bioluminescent compounds. Those of ordinary skill in the art will know of other suitable labels for binding to the antibody, or will be able to ascertain such, using routine experimentation.

Another technique which may also result in greater sensitivity consists of coupling the antibodies to low molecular weight haptens. These haptens can then be specifically detected by means of a second reaction. For example, it is common to use such haptens as biotin, which reacts with avidin, or dinitrophenyl, puridoxal, and fluorescein, which can react with specific antihapten antibodies.

In using the monoclonal antibodies of the invention for the in vivo detection of antigen, the detectably labeled antibody is given a dose which is diagnostically effective. The term "diagnostically effective" means that the amount of detectably labeled monoclonal antibody is administered in sufficient quantity to enable detection of the site having the antigen comprising a polypeptide of the invention for which the monoclonal antibodies are specific.

The concentration of detectably labeled monoclonal antibody which is administered should be sufficient such that the binding to those cells having the polypeptide is detectable compared to the background. Further, it is desirable that the detectably labeled monoclonal antibody be rapidly cleared from the circulatory system in order to give the best target-to-background signal ratio.

As a rule, the dosage of detectably labeled monoclonal antibody for in vivo diagnosis will vary depending on such factors as age, sex, and extent of disease of the individual. Such dosages may vary, for example, depending on whether multiple injections are given, antigenic burden, and other factors known to those of skill in the art.

For in vivo diagnostic imaging, the type of detection instrument available is a major factor in selecting a given radioisotope. The radioisotope chosen must have a type of decay which is detectable for a given type of instrument. Still another important factor in selecting a radioisotope for in vivo diagnosis is that deleterious radiation with respect to the host is minimized. Ideally, a radioisotope used for in vivo imaging will lack a particle emission, but produce a large number of photons in the 140-250 keV range, which may readily be detected by conventional gamma cameras.

For in vivo diagnosis radioisotopes may be bound to immunoglobulin either directly or indirectly by using an intermediate functional group. Intermediate functional groups which often are used to bind radioisotopes which exist as metallic ions to immunoglobulins are the bifunctional chelating agents such as diethylenetriaminepentacetic acid (DTPA) and ethylenediaminetetraacetic acid (EDTA) and similar molecules. Typical examples of metallic ions which can be bound to the monoclonal antibodies of the invention are ¹¹¹ In, ⁹⁷ Ru, ⁶⁷ Ga, ⁶⁸ Ga, ⁷² As, ⁸⁹ Zr, and ²⁰¹ TI.

The monoclonal antibodies of the invention can also be labeled with a paramagnetic isotope for purposes of in vivo diagnosis, as in magnetic resonance imaging (MRI) or electron spin resonance (ESR). In general, any conventional method for visualizing diagnostic imaging can be utiuized. Usually gamma and positron emitting radioisotopes are used for camera imaging and paramagnetic isotopes for MRI. Elements which are particularly useful in such techniques include ¹⁵⁷ Gd, ⁵⁵ Mn, ¹⁶² Dy, ⁵³ Cr, and ⁵⁶ Fe.

The monoclonal antibodies of the invention can be used in vitro and in vivo to monitor the course of amelioration of a GDF-9-associated disease in a subject. Thus, for example, by measuring the increase or decrease in the number of cells expressing antigen comprising a polypeptide of the invention or changes in the concentration of such antigen present in various body fluids, it would be possible to determine whether a particular therapeutic regimen aimed at ameliorating the GDF-9-associated disease is effective. The term "ameliorate" denotes a lessening of the detrimental effect of the GDF-9-associated disease in the subject receiving therapy.

The present invention identifies a nucleotide sequence that can be expressed in an altered manner as compared to expression in a normal cell, therefore, it is possible to design appropriate therapeutic or diagnostic techniques directed to this sequence. Thus, where a cell-proliferative disorder is associated with the expression of GDF-9, nucleic acid sequences that interfere with GDF-9 expression at the translational level can be used. This approach utilizes, for example, antisense nucleic acid and ribozymes to block translation of a specific GDF-9 mRNA, either by masking that mRNA with an antisense nucleic acid or by cleaving it with a ribozyme.

Antisense nucleic acids are DNA or RNA molecules that are complementary to at least a portion of a specific mRNA molecule (Weintraub, Scientific American, 262:40, 1990). In the cell, the antisense nucleic acids hybridize to the corresponding mRNA, forming a double-stranded molecule. The antisense nucleic acids interfere with the translation of the mRNA, since the cell will not translate a mRNA that is double-stranded. Antisense oligomers of about 15 nucleotides are preferred, since they are easily synthesized and are less likely to cause problems than larger molecules when introduced into the target GDF-9-producing cell. The use of antisense methods to inhibit the in vitro translation of genes is well known in the art (Marcus-Sakura, Anal. Biochem., 172:289, 1988).

Ribozymes are RNA molecules possessing the ability to specifically cleave other single-stranded RNA in a manner analogous to DNA restriction endonucleases. Through the modification of nucleotide sequences which encode these RNAs, it is possible to engineer molecules that recognize specific nucleotide sequences in an RNA molecule and cleave it (Cech, J. Amer. Med. Assn., 260:3030, 1988). A major advantage of this approach is that, because they are sequence-specific, only mRNAs with particular sequences are inactivated.

There are two basic types of ribozymes namely, tetrahymena-type (Hasselhoff, Nature, 334:585,1988) and "hammerhead"-type. Tetrahymena-type ribozymes recognize sequences which are four bases in length, while "hammerhead"-type ribozymes recognize base sequences 11-18 bases in length. The longer the recognition sequence, the greater the likelihood that the sequence will occur exclusively in the target mRNA species. Consequently, hammerhead-type ribozymes are preferable to tetrahymena-type ribozymes for inactivating a specific mRNA species and 18-based recognition sequences are preferable to shorter recognition sequences.

The present invention also provides gene therapy for the treatment of cell proliferative disorders which are mediated by GDF-9 protein. Such therapy would achieve its therapeutic effect by introduction of the GDF-9 antisense polynucleotide into cells having the proliferative disorder. Delivery of antisense GDF-9 polynucleotide can be achieved using a recombinant expression vector such as a chimeric virus or a colloidal dispersion system.

Especially preferred for therapeutic delivery of antisense sequences is the use of targeted liposomes.

Various viral vectors which can be utilized for gene therapy as taught herein include adenovirus, herpes virus, vaccinia, or, preferably, an RNA virus such as a retrovirus. Preferably, the retroviral vector is a derivative of a murine or avian retrovirus. Examples of retroviral vectors in which a single foreign gene can be inserted include, but are not limited to: Moloney murine leukemia virus (MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine mammary tumor virus (MuMTV), and Rous Sarcoma Virus (RSV). A number of additional retroviral vectors can incorporate multiple genes. All of these vectors can transfer or incorporate a gene for a selectable marker so that transduced cells can be identified and generated. By inserting a GDF-9 sequence of interest into the viral vector, along with another gene which encodes the ligand for a receptor on a specific target cell, for example, the vector is now target specific. Retroviral vectors can be made target specific by inserting, for example, a polynucleotide encoding a sugar, a glycolipid, or a protein. Preferred targeting is accomplished by using an antibody to target the retroviral vector. Those of skill in the art will know of, or can readily ascertain without undue experimentation, specific polynucleotide sequences which can be inserted into the retroviral genome to allow target specific delivery of the retroviral vector containing the GDF-9 antisense polynucleotide.

Since recombinant retroviruses are defective, they require assistance in order to produce infectious vector particles. This assistance can be provided, for example, by using helper cell lines that contain plasmids encoding all of the structural genes of the retrovirus under the control of regulatory sequences within the LTR. These plasmids are missing a nucleotide sequence which enables the packaging mechanism to recognize an RNA transcript for encapsidation. Helper cell lines which have deletions of the packaging signal include, but are not limited to ψ2, PA317 and PA12, for example. These cell lines produce empty virions, since no genome is packaged. If a retroviral vector is introduced into such cells in which the packaging signal is intact, but the structural genes are replaced by other genes of interest, the vector can be packaged and vector virion produced.

Alternatively, NIH 3T3 or other tissue culture cells can be directly transfected with plasmids encoding the retroviral structural genes gag, pol and env, by conventional calcium phosphate transfection. These cells are then transfected with the vector plasmid containing the genes of interest. The resulting cells release the retroviral vector into the culture medium.

Another targeted delivery system for GDF-9 antisense polynucleotides is a colloidal dispersion system. Colloidal dispersion systems include macromolecule complexes, nanocapsules, microspheres, beads, and lipid-based systems including oil-in-water emulsions, micelles, mixed micelles, and liposomes. The preferred colloidal system of this invention is a liposome. Liposomes are artificial membrane vesicles which are useful as delivery vehicles in vitro and in vivo. It has been shown that large unilamellar vesicles (LUV), which range in size from 0.2-4.0 μm can encapsulate a substantial percentage of an aqueous buffer containing large macromolecules. RNA, DNA and intact virions can be encapsulated within the aqueous interior and be delivered to cells in a biologically active form (Fraley, et al., Trends Biochem. Sci., 6:77, 1981). In addition to mammalian cells, liposomes have been used for delivery of polynucleotides in plant, yeast and bacterial cells. In order for a liposome to be an efficient gene transfer vehicle, the following characteristics should be present: (1) encapsulation of the genes of interest at high efficiency while not compromising their biological activity; (2) preferential and substantial binding to a target cell in comparison to non-target cells; (3) delivery of the aqueous contents of the vesicle to the target cell cytoplasm at high efficiency; and (4) accurate and effective expression of genetic information (Mannino, et al., Biotechniques, 6:682, 1988).

The composition of the liposome is usually a combination of phospholipids, particularly high-phase-transition-temperature phospholipids, usually in combination with steroids, especially cholesterol. Other phospholipids or other lipids may also be used. The physical characteristics of liposomes depend on pH, ionic strength, and the presence of divalent cations.

Examples of lipids useful in liposome production include phosphatidyl compounds, such as phosphatidylglycerol, phosphatidylcholine, phosphatidylserine, phosphatidylethanolamine, sphingolipids, cerebrosides, and gangliosides. Particularly useful are diacylphosphatidylglycerols, where the lipid moiety contains from 14-18 carbon atoms, particularly from 16-18 carbon atoms, and is saturated. Illustrative phospholipids include egg phosphatidylcholine, dipalmitoylphosphatidylcholine and distearoylphosphatidylcholine.

The targeting of liposomes can be classified based on anatomical and mechanistic factors. Anatomical classification is based on the level of selectivity, for example, organ-specific, cell-specific, and organelle-specific. Mechanistic targeting can be distinguished based upon whether it is passive or active. Passive targeting utilizes the natural tendency of liposomes to distribute to cells of the reticulo-endothelial system (RES) in organs which contain sinusoidal capillaries. Active targeting, on the other hand, involves alteration of the liposome by coupling the liposome to a specific ligand such as a monoclonal antibody, sugar, glycolipid, or protein, or by changing the composition or size of the liposome in order to achieve targeting to organs and cell types other than the naturally occurring sites of localization.

The surface of the targeted delivery system may be modified in a variety of ways. In the case of a liposomal targeted delivery system, lipid groups can be incorporated into the lipid bilayer of the liposome in order to maintain the targeting ligand in stable association with the liposomal bilayer. Various linking groups can be used for joining the lipid chains to the targeting ligand.

Due to the expression of GDF-9 in the reproductive tract, there are a variety of applications using the polypeptide, polynucleotide and antibodies of the invention, related to contraception, fertility and pregnancy. GDF-9 could play a role in regulation of the menstrual cycle and, therefore, could be useful in various contraceptive regimens.

The following examples are intended to illustrate but not limit the invention. While they are typical of those that might be used, other procedures known to those skilled in the art may alternatively be used.

EXAMPLE 1 IDENTIFICATION AND ISOLATION OF A NOVEL TGF-β FAMILY MEMBER

To identify a new member of the TGF-β superfamily, degenerate oligonucleotides were designed which corresponded to two conserved regions among the known family members: one region spanning the two tryptophan residues conserved in all family members except MIS and the other region spanning the invariant cysteine residues near the C-terminus. These primers were used for polymerase chain reactions on mouse genomic DNA followed by subcloning the PCR products using restriction sites placed at the 5' ends of the primers, picking individual E. coli colonies carrying these subcioned inserts, and using a combination of random sequencing and hybridization analysis to eliminate known members of the superfamily.

GDF-9 was identified from a mixture of PCR products obtained with the primers SJL160 (5'-CCGGAATTCGGITGG(G/C/A)A(G/A/T/C)(G/C/A)A(G/A/T/C) TGG(A/G)TI(A/G)TI(T/G)CICC-3') (SEQUENCE ID NO. 1) and SJL153 (5'-CCGGAATTC(A/G)CAI(G/C)C(A/G)CAIC(T/C)(G/A/T/C)(C/G/T)TIG(T/C)I(G/A)(TC)CAT-3') (SEQUENCE ID NO. 2). PCR using these primers was carried out with 2 μg mouse genomic DNA at 94° C. for 1 min, 50° C. for 2 min, and 72° C. for 2 min for 40 cycles.

PCR products of approximately 280 bp were gel-purified, digested with Eco RI, gel-purified again, and subcloned in the Bluescript vector (Stratagene, San Diego, Calif.). Bacterial colonies carrying individual subclones were picked into 96 well microtiter plates, and multiple replicas were prepared by plating the cells onto nitrocellulose. The replicate filters were hybridized to probes representing known members of the family, and DNA was prepared from non-hybridizing colonies for sequence analysis.

The primer combination of SJL160 and SJL153, yielded three known sequences (inhibin βB, BMP-2, and BMP-4) and one novel sequence (designated GDF-9) among 145 subclones analyzed.

RNA isolation and Northern analysis were carried out as described previously (Lee, S. J., Mol. Endocrinol. 4:1034, 1990). An oligo dT-primed cDNA library was prepared from 2.5-3 μg of ovary poly A-selected RNA in the lambda ZAP II vector according to the instructions provided by Stratagene. The ovary library was not amplified prior to screening. Filters were hybridized as described previously (Lee, S.-J., Proc. Natl. Acad. Sci. USA., 88:4250-4254, 1991). DNA sequencing of both strands was carried out using the dideoxy chain termination method (Sanger, et al., Proc. Natl. Acad. Sci., USA, 74:5463-5467, 1977) and a combination of the S1 nuclease/exonuclease III strategy (Henikoff, S., Gene, 28:351-359, 1984) and synthetic oligonucleotide primers.

EXAMPLE 2 EXPRESSION PATTERN AND SEQUENCE OF GDF-9

To determine the expression pattern of GDF-9, RNA samples prepared from a variety of adult tissues were screened by Northern analysis. Five micrograms of twice polyA-selected RNA prepared from each tissue were electrophoresed on formaldehyde gels, blotted and probed with GDF-9. As shown in FIG. 1, the GDF-9 probe detected a 1.7 kb mRNA expressed exclusively in the ovary.

A mouse ovary cDNA library of 1.5×10⁶ recombinant phage was constructed in lambda ZAP II and screened with a probe derived from the GDF-9 PCR product. The nucleotide sequence of the longest of nineteen hybridizing clones is shown in FIG. 2. Consensus N-glycosylation signals are denoted by plain boxes. The putative tetrabasic processing sites are denoted by stippled boxes. The in-frame termination codons upstream of the putative initiating ATG and the consensus polyadenylation signals are underlined. The poly A tails are not shown. Numbers indicate nucleotide position relative to the 5' end. The 1712 bp sequence contains a long open reading frame beginning with a methionine codon at nucleotide 29 and potentially encoding a protein 441 amino acids in length with a molecular weight of 49.6 kD. Like other TGF-β family members, the GDF-9 sequence contains a core of hydrophobic amino acids near the N-terminus suggestive of a signal sequence for secretion. GDF-9 contains four potential N-glycosylation sites at asparagine residues 163, 229, 258, and 325 and a putative tetrabasic proteolytic processing site (RRRR) (amino acid residues 303-306 of SEQ ID NO:4) at amino acids 303-306. The mature C-terminal fragment of GDF-9 is predicted to be 135 amino acids in length and have an unglycosylated molecular weight of 15.6 kD.

Although the C-terminal portion of GDF-9 clearly shows homology with the other family members, the sequence of GDF-9 is significantly diverged from those of the other family members (FIGS. 3 and 4). FIG. 3 shows the alignment of the C-terminal sequences of GDF-9 with the corresponding regions of human GDF-1 (Lee, Proc. Natl. Acad. Sci. USA, 88:4250-4254, 1991), Xenopus Vg-1 (Weeks, et al., Cell, 51:861-867, 1987), human Vgr-1 (Celeste, et al., Proc. Natl. Acad. Sci. USA, 87:9843-9847, 1990), human OP-1 (Ozkaynak, et al., EMBO J., 9:2085-2093, 1990), human BMP-5 (Celeste, et al., Proc. Natl. Acad. Sci. USA, 87:9843-9847, 1990), Drosophila 60A (Wharton, et al., Proc. Natl. Acad. Sci. USA, 88:9214-9218, 1991), human BMP-2 and 4 (Wozney, et al., Science, 242:1528-1534, 1988), Drosophila DPP (Padgett, et al., Nature, 325:81-84, 1987), human BMP-3 (Wozney, et al., Science, 242:1528-1534, 1988), human MIS (Cate, et al., Cell, 45:685-698,1986), human inhibin, βA, and βB (Mason, et al., Biochem, Biophys. Res. Commun., 135:957-964, 1986), human TGF-β1 (Derynck, et al., Nature, 316:701-705, 1985), humanTGF-β2 (deMartin, et al., EMBO J., 6:3673-3677, 1987), human TGF-β3 (ten Dijke, et al., Proc. Natl. Acad. Sci. USA, 85:4715-4719, 1988), chicken TGF-β4 (Jakowlew, et al., Mol. Endocrinol., 2:1186-1195, 1988), and Xenopus TGF-β5 (Kondaiah, et al., J. Biol. Chem., 265:1089-1093, 1990). The conserved cysteine residues are shaded. Dashes denote gaps introduced in order to maximize the alignment.

FIG. 4 shows the amino acid homologies among the different members of the TGF-β superfamily. Numbers represent percent amino acid identities between each pair calculated from the first conserved cysteine to the C-terminus. Boxes represent homologies among highly-related members within particular subgroups.

The degree of sequence identify with known family members ranges from a minimum of 21% with MIS to a maximum of 34% with BMP-4. Hence, GDF-9 is comparable to MIS in its degree of sequence divergence from the other members of this superfamily. Moreover, GDF-9 shows no significant sequence homology to other family members in the pro-region of the molecule. GDF-9 also differs from the known family members in its pattern of cysteine residues in the C-terminal region. GDF-9 lacks the fourth cysteine of the seven cysteines that are present in all other family members; in place of cysteine at this position, the GDF-9 sequence contains a serine residue. In addition, GDF-9 does not contain a seventh cysteine residue elsewhere in the C-terminal region.

EXAMPLE 3 IMMUNOCHEMICAL LOCALIZATION OF GDF-9 IN THE ZONA PELLUCIDA

To determine whether GDF-9 mRNA was translated, sections of adult ovaries were incubated with antibodies directed against recombinant GDF-9 protein. In order to raise antibodies against GDF-9, portions of GDF-9 cDNA spanning amino acids 30 to 295 (pro-region) or 308 to 441 (mature region) were cloned into the T7-based pET3 expression vector (provided by F. W. Studier, Brookhaven National Laboratory), and the resulting plasmids were transformed into the BL21 (DE3) bacterial strain. Total cell extracts from isopropyl β-D-thiogalactoside-induced cells were electrophoresed on SDS/polyacrylamide gels, and the GDF-9 protein fragments were excised, mixed with Freund's adjuvant, and used to immunize rabbits by standard methods known to those of skill in the art. All immunizations were carried out by Spring Valley Lab (Sykesville, Md). The presence of GDF-9-reactive antibodies in the sera of these rabbits was assessed by Western analysis of bacterially-expressed protein fragments. The resulting serum was shown to react with the bacterially-expressed protein by Western analysis.

For immunohistochemical studies, ovaries were removed from adult mice, fixed in 4% paraformaidehyde, embedded in paraffin, and sectioned. Sites of antibody binding were detected by using the Vectastain ABC kit, according to the instructions provided by Vector Laboratories. FIG. 5 shows the immunohistochemical localization of GDF-9 protein. Adjacent sections of an adult ovary were either stained with hematoxylin and eosin (FIG. 5a) or incubated with immune (FIG. 5b) or pre-immune (FIG. 5c) serum at a dilution of 1:500. As shown in FIG. 5b, the antiserum detected protein solely in oocytes. No staining was detected using pre-immune serum (FIG. 5c). Hence, GDF-9 protein appears to translated in vivo by oocytes.

EXAMPLE 4 ISOLATION OF HUMAN GDF-9

In order to isolate a cDNA clone encoding human GDF-9, a cDNA library was constructed in lambda ZAP II using poly A-selected RNA prepared from an adult human ovary. From this library, a cDNA clone containing the entire human GDF-9 coding sequence was identified using standard screening techniques as in Example 1 and using the murine GDF-9 clone as a probe. A comparison of the predicted amino acid sequences of murine (top lines) and human (bottom lines) GDF-9 is shown in FIG. 6. Numbers represent amino acid positions relative to the N-termini. Vertical lines represent sequence identities. Dots represent gaps introduced in order to maximize the alignment. The clear box shows the predicted proteolytic processing sites. The shaded boxes show the cysteine residues in the mature region of the proteins. The bars at the bottom show a schematic of the pre-(clear) and mature (shaded) regions of GDF-9 with the percent sequence identities between the murine and human sequences shown below.

Like murine GDF-9, human GDF-9 contains a hydrophobic leader sequence, a putative RXXR proteolytic cleavage site, and a C-terminal region containing the hallmarks of other TGF-β family members. Murine and human GDF-9 are 64% identical in the pro- region and 90% identical in the predicted mature region of the molecule. The high degree of homology between the two sequences suggests that human GDF-9 plays an important role during embryonic development and/or in the adult ovary.

EXAMPLE 5 NUCLEIC ACID DETECTION OF EXPRESSION OF GDF-9 IN OOCYTES

In order to localize the expression of GDF-9 in the ovary, in situ hybridization to mouse ovary sections was carried out using an antisense GDF-9 RNA probe. FIG. 7 shows in situ hybridization to adult ovary sections using a GDF-9 RNA probe. ³⁵ S!-labeled anti-sense (FIGS. 7a and 7c) or sense (FIGS. 7b and 7d) GDF-9 RNA probes were hybridized to adjacent paraffin-embedded sections of ovaries fixed in 4% paraformaldehyde. Sections were dipped in photographic emulsion, exposed, developed, and then stained with hematoxylin and eosin. Two representative fields are shown.

As shown in FIGS. 7a and 7c, GDF-9 mRNA was detected primarily in oocytes in adult ovaries. Every oocyte (regardless of the stage of follicular development) examined showed GDF-9 expression, and no expression was detected in any other cell types. No hybridization was seen using a control GDF-9 sense RNA probe (FIGS. 7b and 7d). Hence, GDF-9 expression appears to be oocyte-specific in adult ovaries.

To determine the pattern of expression of GDF-9 mRNA during ovarian development, sections of neonatal ovaries were probed with a GDF-9 RNA probe. FIG. 8 shows in situ hybridization to a postnatal day 4 ovary section using an antisense GDF-9 RNA probe. Sections were prepared as described for FIG. 7. Following autoradiography and staining, the section was photographed under bright-field (FIG. 8a) or dark-field (FIG. 8b) illumination.

GDF-9 mRNA expression was first detected at the onset of follicular development. This was most clearly evident at postnatal day 4, where only oocytes that were present in follicles showed GDF-9 expression (FIG. 8); no expression was seen in oocytes that were not surrounded by granulosa cells. By postnatal day 8, every oocyte appeared to have undergone follicular development, and every oocyte showed GDF-9 expression (FIG. 9).

To determine whether GDF-9 was also expressed following ovulation, sections of mouse oviducts were examined by in situ hybridization. FIG. 10 shows in situ hybridization to adult oviduct sections using an antisense (FIG. 10a) or sense (FIG. 10b) GDF-9 RNA probe. Sections were prepared as described for FIG. 7.

As shown in FIG. 10, GDF-9 was expressed by oocytes that had been released into the oviduct. However, the expression of GDF-9 mRNA turned off rapidly following fertilization of the oocytes; by day 0.5 following fertilization, only some embryos (such as the one shown in FIG. 11) expressed GDF-9 mRNA, and by day 1.5, all embryos were negative for GDF-9 expression.

Although the invention has been described with reference to the presently preferred embodiment, it should be understood that various modifications can be made without departing from the spirit of the invention. Accordingly, the invention is limited only by the following claims.

SUMMARY OF SEQUENCES

Sequence ID No. 1 is the nucleotide sequence for the primer, SJL160, for GDF-9 (page 24, lines 15 and 16);

Sequence ID No. 2 is the nucleotide sequence for the primer, SJL153, for GDF-9 (page 24, lines 17 and 18);

Sequence ID No. 3 is the nucleotide and deduced amino acid sequence for GDF-9 (FIG. 2);

Sequence ID No. 4 is the deduced amino acid sequence for GDF-9 (FIG. 2);

Sequence ID No. 5 is the amino acid sequence of the C-terminus of GDF-3 (FIG. 3);

Sequence ID No. 6 is the amino acid sequence of the C-terminus of GDF-9 (FIG. 3);

Sequence ID No. 7 is the amino acid sequence of the C-terminus of GDF-1 (FIG. 3);

Sequence ID No. 8 is the amino acid sequence of the C-terminus of Vg-1 (FIG. 3);

Sequence ID No. 9 is the amino acid sequence of the C-terminus of Vgr-1 (FIG. 3);

Sequence ID No. 10 is the amino acid sequence of the C-terminus of OP-1 (FIG. 3);

Sequence ID No. 11 is the amino acid sequence of the C-terminus of BMP-5 (FIG. 3);

Sequence ID No. 12 is the amino acid sequence of the C-terminus of 60A (FIG. 3);

Sequence ID No. 13 is the amino acid sequence of the C-terminus of BMP-2 (FIG. 3);

Sequence ID No. 14 is the amino acid sequence of the C-terminus of BMP-4 (FIG. 3);

Sequence ID No. 15 is the amino acid sequence of the C-terminus of DPP (FIG. 3);

Sequence ID No. 16 is the amino acid sequence of the C-terminus of BMP-3 (FIG. 3);

Sequence ID No. 17 is the amino acid sequence of the C-terminus of MIS (FIG. 3);

Sequence ID No. 18 is the amino acid sequence of the C-terminus of inhibin α (FIG. 3);

Sequence ID No. 19 is the amino acid sequence of the C-terminus of inhibin βA (FIG. 3);

Sequence ID No. 20 is the amino acid sequence of the C-terminus of inhibin βB (FIG. 3);

Sequence ID No.21 is the amino acid sequence of the C-terminus of TGF-β1 (FIG. 3);

Sequence ID No. 22 is the amino acid sequence of the C-terminus of TGF-β2 (FIG. 3);

Sequence ID No. 23 is the amino acid sequence of the C-terminus of TGF-β3 (FIG. 3);

Sequence ID No. 24 is the amino acid sequence of the C-terminus of TGF-β4 (FIG. 3);

Sequence ID No. 25 is the amino acid sequence of the C-terminus of TGF-β5 (FIG. 3); and

Sequence ID No. 26 is the amino acid sequence of human GDF-9 (FIG. 6).

__________________________________________________________________________
SEQUENCE LISTING                                                          
(1) GENERAL INFORMATION:                                                  
(iii) NUMBER OF SEQUENCES: 26                                             
(2) INFORMATION FOR SEQ ID NO:1:                                          
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 35 base pairs                                                 
(B) TYPE: nucleic acid                                                    
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: DNA (genomic)                                         
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: SJL160                                                         
(ix) FEATURE:                                                             
(A) NAME/KEY: CDS                                                         
(B) LOCATION: 1..35                                                       
(D) OTHER INFORMATION: /note= "Where "B"occurs, B =                       
inosine"                                                                  
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                   
CCGGAATTCGGBTGGVANVANTGGRTBRTBKCBCC35                                     
(2) INFORMATION FOR SEQ ID NO:2:                                          
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 33 base pairs                                                 
(B) TYPE: nucleic acid                                                    
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: DNA (genomic)                                         
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: SJL153                                                         
(ix) FEATURE:                                                             
(A) NAME/KEY: CDS                                                         
(B) LOCATION: 1..33                                                       
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                   
CCGGAATTCRCADSCRCADCYNBTDGYDRYCAT33                                       
(2) INFORMATION FOR SEQ ID NO:3:                                          
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 1712 base pairs                                               
(B) TYPE: nucleic acid                                                    
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: DNA (genomic)                                         
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: GDF-9                                                          
(ix) FEATURE:                                                             
(A) NAME/KEY: CDS                                                         
(B) LOCATION: 29..1351                                                    
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                   
ATGCGTTCCTTCTTAGTTCTTCCAAGTCATGGCACTTCCCAGCAACTTCCTG52                    
MetAlaLeuProSerAsnPheLeu                                                  
15                                                                        
TTGGGGGTTTGCTGCTTTGCCTGGCTGTGTTTTCTTAGTAGCCTTAGC100                       
LeuGlyValCysCysPheAlaTrpLeuCysPheLeuSerSerLeuSer                          
101520                                                                    
TCTCAGGCTTCTACTGAAGAATCCCAGAGTGGAGCCAGTGAAAATGTG148                       
SerGlnAlaSerThrGluGluSerGlnSerGlyAlaSerGluAsnVal                          
25303540                                                                  
GAGTCTGAGGCAGACCCCTGGTCCTTGCTGCTGCCTGTAGATGGGACT196                       
GluSerGluAlaAspProTrpSerLeuLeuLeuProValAspGlyThr                          
455055                                                                    
GACAGGTCTGGCCTCTTGCCCCCCCTCTTTAAGGTTCTATCTGATAGG244                       
AspArgSerGlyLeuLeuProProLeuPheLysValLeuSerAspArg                          
606570                                                                    
CGAGGTGAGACCCCTAAGCTGCAGCCTGACTCCAGAGCACTCTACTAC292                       
ArgGlyGluThrProLysLeuGlnProAspSerArgAlaLeuTyrTyr                          
758085                                                                    
ATGAAAAAGCTCTATAAGACGTATGCTACCAAAGAGGGGGTTCCCAAA340                       
MetLysLysLeuTyrLysThrTyrAlaThrLysGluGlyValProLys                          
9095100                                                                   
CCCAGCAGAAGTCACCTCTACAATACCGTCCGGCTCTTCAGTCCCTGT388                       
ProSerArgSerHisLeuTyrAsnThrValArgLeuPheSerProCys                          
105110115120                                                              
GCCCAGCAAGAGCAGGCACCCAGCAACCAGGTGACAGGACCGCTGCCG436                       
AlaGlnGlnGluGlnAlaProSerAsnGlnValThrGlyProLeuPro                          
125130135                                                                 
ATGGTGGACCTGCTGTTTAACCTGGACCGGGTGACTGCCATGGAACAC484                       
MetValAspLeuLeuPheAsnLeuAspArgValThrAlaMetGluHis                          
140145150                                                                 
TTGCTCAAATCGGTCTTGCTATACACTCTGAACAACTCTGCCTCTTCC532                       
LeuLeuLysSerValLeuLeuTyrThrLeuAsnAsnSerAlaSerSer                          
155160165                                                                 
TCCTCCACTGTGACCTGTATGTGTGACCTTGTGGTAAAGGAGGCCATG580                       
SerSerThrValThrCysMetCysAspLeuValValLysGluAlaMet                          
170175180                                                                 
TCTTCTGGCAGGGCACCCCCAAGAGCACCGTACTCATTCACCCTGAAG628                       
SerSerGlyArgAlaProProArgAlaProTyrSerPheThrLeuLys                          
185190195200                                                              
AAACACAGATGGATTGAGATTGATGTGACCTCCCTCCTTCAGCCCCTA676                       
LysHisArgTrpIleGluIleAspValThrSerLeuLeuGlnProLeu                          
205210215                                                                 
GTGACCTCCAGCGAGAGGAGCATTCACCTGTCTGTCAATTTTACATGC724                       
ValThrSerSerGluArgSerIleHisLeuSerValAsnPheThrCys                          
220225230                                                                 
ACAAAAGACCAGGTGCCAGAGGACGGAGTGTTTAGCATGCCTCTCTCA772                       
ThrLysAspGlnValProGluAspGlyValPheSerMetProLeuSer                          
235240245                                                                 
GTGCCTCCTTCCCTCATCTTGTATCTCAACGACACAAGCACCCAGGCC820                       
ValProProSerLeuIleLeuTyrLeuAsnAspThrSerThrGlnAla                          
250255260                                                                 
TACCACTCTTGGCAGTCTCTTCAGTCCACCTGGAGGCCTTTACAGCAT868                       
TyrHisSerTrpGlnSerLeuGlnSerThrTrpArgProLeuGlnHis                          
265270275280                                                              
CCCGGCCAGGCCGGTGTGGCTGCCCGTCCCGTGAAAGAGGAAGCTACT916                       
ProGlyGlnAlaGlyValAlaAlaArgProValLysGluGluAlaThr                          
285290295                                                                 
GAGGTGGAAAGATCTCCCCGGCGCCGTCGAGGGCAGAAAGCCATCCGC964                       
GluValGluArgSerProArgArgArgArgGlyGlnLysAlaIleArg                          
300305310                                                                 
TCCGAAGCGAAGGGGCCACTTCTTACAGCATCCTTCAACCTCAGCGAA1012                      
SerGluAlaLysGlyProLeuLeuThrAlaSerPheAsnLeuSerGlu                          
315320325                                                                 
TACTTCAAACAGTTTCTTTTCCCCCAAAACGAGTGTGAACTCCATGAC1060                      
TyrPheLysGlnPheLeuPheProGlnAsnGluCysGluLeuHisAsp                          
330335340                                                                 
TTCAGACTGAGTTTTAGTCAGCTCAAATGGGACAACTGGATCGTGGCC1108                      
PheArgLeuSerPheSerGlnLeuLysTrpAspAsnTrpIleValAla                          
345350355360                                                              
CCGCACAGGTACAACCCTAGGTACTGTAAAGGGGACTGTCCTAGGGCG1156                      
ProHisArgTyrAsnProArgTyrCysLysGlyAspCysProArgAla                          
365370375                                                                 
GTCAGGCATCGGTATGGCTCTCCTGTGCACACCATGGTCCAGAATATA1204                      
ValArgHisArgTyrGlySerProValHisThrMetValGlnAsnIle                          
380385390                                                                 
ATCTATGAGAAGCTGGACCCTTCAGTGCCAAGGCCTTCGTGTGTGCCG1252                      
IleTyrGluLysLeuAspProSerValProArgProSerCysValPro                          
395400405                                                                 
GGCAAGTACAGCCCCCTGAGTGTGTTGACCATTGAACCCGACGGCTCC1300                      
GlyLysTyrSerProLeuSerValLeuThrIleGluProAspGlySer                          
410415420                                                                 
ATCGCTTACAAAGAGTACGAAGACATGATAGCTACGAGGTGCACCTGT1348                      
IleAlaTyrLysGluTyrGluAspMetIleAlaThrArgCysThrCys                          
425430435440                                                              
CGTTAGCATGGGGGCCACTTCAACAAGCCTGCCTGGCAGAGCAATGCTGTGGG1401                 
Arg                                                                       
CCTTAGAGTGCCTGGGCAGAGAGCTTCCTGTGACCAGTCTCTCCGTGCTGCTCAGTGCAC1461          
ACTGTGTGAGCGGGGGAAGTGTGTGTGTGTGGATGAGCACATCGAGTGCAGTGTCCGTAG1521          
GTGTAAAGGGCACACTCACTGGTCGTTGCCATAAACCAAGTGAAATGTAACTCATTTGGA1581          
GAGCTCTTTCTCCCCACGAGTGTAGTTTTCAGTGGACAGATTTGTTAGCATAAGTCTCGA1641          
GTAGAATGTAGCTGTGAACATGTCAGAGTGCTGTGGTTTTATGTGACGGAAGAATAAACT1701          
GTTGATGGCAT1712                                                           
(2) INFORMATION FOR SEQ ID NO:4:                                          
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 441 amino acids                                               
(B) TYPE: amino acid                                                      
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                   
MetAlaLeuProSerAsnPheLeuLeuGlyValCysCysPheAlaTrp                          
151015                                                                    
LeuCysPheLeuSerSerLeuSerSerGlnAlaSerThrGluGluSer                          
202530                                                                    
GlnSerGlyAlaSerGluAsnValGluSerGluAlaAspProTrpSer                          
354045                                                                    
LeuLeuLeuProValAspGlyThrAspArgSerGlyLeuLeuProPro                          
505560                                                                    
LeuPheLysValLeuSerAspArgArgGlyGluThrProLysLeuGln                          
65707580                                                                  
ProAspSerArgAlaLeuTyrTyrMetLysLysLeuTyrLysThrTyr                          
859095                                                                    
AlaThrLysGluGlyValProLysProSerArgSerHisLeuTyrAsn                          
100105110                                                                 
ThrValArgLeuPheSerProCysAlaGlnGlnGluGlnAlaProSer                          
115120125                                                                 
AsnGlnValThrGlyProLeuProMetValAspLeuLeuPheAsnLeu                          
130135140                                                                 
AspArgValThrAlaMetGluHisLeuLeuLysSerValLeuLeuTyr                          
145150155160                                                              
ThrLeuAsnAsnSerAlaSerSerSerSerThrValThrCysMetCys                          
165170175                                                                 
AspLeuValValLysGluAlaMetSerSerGlyArgAlaProProArg                          
180185190                                                                 
AlaProTyrSerPheThrLeuLysLysHisArgTrpIleGluIleAsp                          
195200205                                                                 
ValThrSerLeuLeuGlnProLeuValThrSerSerGluArgSerIle                          
210215220                                                                 
HisLeuSerValAsnPheThrCysThrLysAspGlnValProGluAsp                          
225230235240                                                              
GlyValPheSerMetProLeuSerValProProSerLeuIleLeuTyr                          
245250255                                                                 
LeuAsnAspThrSerThrGlnAlaTyrHisSerTrpGlnSerLeuGln                          
260265270                                                                 
SerThrTrpArgProLeuGlnHisProGlyGlnAlaGlyValAlaAla                          
275280285                                                                 
ArgProValLysGluGluAlaThrGluValGluArgSerProArgArg                          
290295300                                                                 
ArgArgGlyGlnLysAlaIleArgSerGluAlaLysGlyProLeuLeu                          
305310315320                                                              
ThrAlaSerPheAsnLeuSerGluTyrPheLysGlnPheLeuPhePro                          
325330335                                                                 
GlnAsnGluCysGluLeuHisAspPheArgLeuSerPheSerGlnLeu                          
340345350                                                                 
LysTrpAspAsnTrpIleValAlaProHisArgTyrAsnProArgTyr                          
355360365                                                                 
CysLysGlyAspCysProArgAlaValArgHisArgTyrGlySerPro                          
370375380                                                                 
ValHisThrMetValGlnAsnIleIleTyrGluLysLeuAspProSer                          
385390395400                                                              
ValProArgProSerCysValProGlyLysTyrSerProLeuSerVal                          
405410415                                                                 
LeuThrIleGluProAspGlySerIleAlaTyrLysGluTyrGluAsp                          
420425430                                                                 
MetIleAlaThrArgCysThrCysArg                                               
435440                                                                    
(2) INFORMATION FOR SEQ ID NO:5:                                          
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 117 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: GDF-3                                                          
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..117                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                   
LysArgArgAlaAlaIleSerValProLysGlyPheCysArgAsnPhe                          
151015                                                                    
CysHisArgHisGlnLeuPheIleAsnPheGlnAspLeuGlyTrpHis                          
202530                                                                    
LysTrpValIleAlaProLysGlyPheMetAlaAsnTyrCysHisGly                          
354045                                                                    
GluCysProPheSerMetThrThrTyrLeuAsnSerSerAsnTyrAla                          
505560                                                                    
PheMetGlnAlaLeuMetHisMetAlaAspProLysValProLysAla                          
65707580                                                                  
ValCysValProThrLysLeuSerProIleSerMetLeuTyrGlnAsp                          
859095                                                                    
SerAspLysAsnValIleLeuArgHisTyrGluAspMetValValAsp                          
100105110                                                                 
GluCysGlyCysGly                                                           
115                                                                       
(2) INFORMATION FOR SEQ ID NO:6:                                          
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 118 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: GDF-9                                                          
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..118                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                   
PheAsnLeuSerGluTyrPheLysGlnPheLeuPheProGlnAsnGlu                          
151015                                                                    
CysGluLeuHisAspPheArgLeuSerPheSerGlnLeuLysTrpAsp                          
202530                                                                    
AsnTrpIleValAlaProHisArgTyrAsnProArgTyrCysLysGly                          
354045                                                                    
AspCysProArgAlaValArgHisArgTyrGlySerProValHisThr                          
505560                                                                    
MetValGlnAsnIleIleTyrGluLysLeuAspProSerValProArg                          
65707580                                                                  
ProSerCysValProGlyLysTyrSerProLeuSerValLeuThrIle                          
859095                                                                    
GluProAspGlySerIleAlaTyrLysGluTyrGluAspMetIleAla                          
100105110                                                                 
ThrArgCysThrCysArg                                                        
115                                                                       
(2) INFORMATION FOR SEQ ID NO:7:                                          
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 122 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: GDF-1                                                          
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..122                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                   
ProArgArgAspAlaGluProValLeuGlyGlyGlyProGlyGlyAla                          
151015                                                                    
CysArgAlaArgArgLeuTyrValSerPheArgGluValGlyTrpHis                          
202530                                                                    
ArgTrpValIleAlaProArgGlyPheLeuAlaAsnTyrCysGlnGly                          
354045                                                                    
GlnCysAlaLeuProValAlaLeuSerGlySerGlyGlyProProAla                          
505560                                                                    
LeuAsnHisAlaValLeuArgAlaLeuMetHisAlaAlaAlaProGly                          
65707580                                                                  
AlaAlaAspLeuProCysCysValProAlaArgLeuSerProIleSer                          
859095                                                                    
ValLeuPhePheAspAsnSerAspAsnValValLeuArgGlnTyrGlu                          
100105110                                                                 
AspMetValValAspGluCysGlyCysArg                                            
115120                                                                    
(2) INFORMATION FOR SEQ ID NO:8:                                          
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 118 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: Vg-1                                                           
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..118                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                   
ArgArgLysArgSerTyrSerLysLeuProPheThrAlaSerAsnIle                          
151015                                                                    
CysLysLysArgHisLeuTyrValGluPheLysAspValGlyTrpGln                          
202530                                                                    
AsnTrpValIleAlaProGlnGlyTyrMetAlaAsnTyrCysTyrGly                          
354045                                                                    
GluCysProTyrProLeuThrGluIleLeuAsnGlySerAsnHisAla                          
505560                                                                    
IleLeuGlnThrLeuValHisSerIleGluProGluAspIleProLeu                          
65707580                                                                  
ProCysCysValProThrLysMetSerProIleSerMetLeuPheTyr                          
859095                                                                    
AspAsnAsnAspAsnValValLeuArgHisTyrGluAsnMetAlaVal                          
100105110                                                                 
AspGluCysGlyCysArg                                                        
115                                                                       
(2) INFORMATION FOR SEQ ID NO:9:                                          
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 118 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: Vgr-1                                                          
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..118                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                   
ArgValSerSerAlaSerAspTyrAsnSerSerGluLeuLysThrAla                          
151015                                                                    
CysArgLysHisGluLeuTyrValSerPheGlnAspLeuGlyTrpGln                          
202530                                                                    
AspTrpIleIleAlaProLysGlyTyrAlaAlaAsnTyrCysAspGly                          
354045                                                                    
GluCysSerPheProLeuAsnAlaHisMetAsnAlaThrAsnHisAla                          
505560                                                                    
IleValGlnThrLeuValHisLeuMetAsnProGluTyrValProLys                          
65707580                                                                  
ProCysCysAlaProThrLysLeuAsnAlaIleSerValLeuTyrPhe                          
859095                                                                    
AspAspAsnSerAsnValIleLeuLysLysTyrArgAsnMetValVal                          
100105110                                                                 
ArgAlaCysGlyCysHis                                                        
115                                                                       
(2) INFORMATION FOR SEQ ID NO:10:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 118 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: OP-1                                                           
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..118                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                  
ArgMetAlaAsnValAlaGluAsnSerSerSerAspGlnArgGlnAla                          
151015                                                                    
CysLysLysHisGluLeuTyrValSerPheArgAspLeuGlyTrpGln                          
202530                                                                    
AspTrpIleIleAlaProGluGlyTyrAlaAlaTyrTyrCysGluGly                          
354045                                                                    
GluCysAlaPheProLeuAsnSerTyrMetAsnAlaThrAsnHisAla                          
505560                                                                    
IleValGlnThrLeuValHisPheIleAsnProGluThrValProLys                          
65707580                                                                  
ProCysCysAlaProThrGlnLeuAsnAlaIleSerValLeuTyrPhe                          
859095                                                                    
AspAspSerSerAsnValIleLeuLysLysTyrArgAsnMetValVal                          
100105110                                                                 
ArgAlaCysGlyCysHis                                                        
115                                                                       
(2) INFORMATION FOR SEQ ID NO:11:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 118 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: BMP-5                                                          
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..118                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                  
ArgMetSerSerValGlyAspTyrAsnThrSerGluGlnLysGlnAla                          
151015                                                                    
CysLysLysHisGluLeuTyrValSerPheArgAspLeuGlyTrpGln                          
202530                                                                    
AspTrpIleIleAlaProGluGlyTyrAlaAlaPheTyrCysAspGly                          
354045                                                                    
GluCysSerPheProLeuAsnAlaHisMetAsnAlaThrAsnHisAla                          
505560                                                                    
IleValGlnThrLeuValHisLeuMetPheProAspHisValProLys                          
65707580                                                                  
ProCysCysAlaProThrLysLeuAsnAlaIleSerValLeuTyrPhe                          
859095                                                                    
AspAspSerSerAsnValIleLeuLysLysTyrArgAsnMetValVal                          
100105110                                                                 
ArgSerCysGlyCysHis                                                        
115                                                                       
(2) INFORMATION FOR SEQ ID NO:12:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 118 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: 60A                                                            
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..118                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                  
SerProAsnAsnValProLeuLeuGluProMetGluSerThrArgSer                          
151015                                                                    
CysGlnMetGlnThrLeuTyrIleAspPheLysAspLeuGlyTrpHis                          
202530                                                                    
AspTrpIleIleAlaProGluGlyTyrGlyAlaPheTyrCysSerGly                          
354045                                                                    
GluCysAsnPheProLeuAsnAlaHisMetAsnAlaThrAsnHisAla                          
505560                                                                    
IleValGlnThrLeuValHisLeuLeuGluProLysLysValProLys                          
65707580                                                                  
ProCysCysAlaProThrArgLeuGlyAlaLeuProValLeuTyrHis                          
859095                                                                    
LeuAsnAspGluAsnValAsnLeuLysLysTyrArgAsnMetIleVal                          
100105110                                                                 
LysSerCysGlyCysHis                                                        
115                                                                       
(2) INFORMATION FOR SEQ ID NO:13:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 117 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: BMP-2                                                          
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..117                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                  
GluLysArgGlnAlaLysHisLysGlnArgLysArgLeuLysSerSer                          
151015                                                                    
CysLysArgHisProLeuTyrValAspPheSerAspValGlyTrpAsn                          
202530                                                                    
AspTrpIleValAlaProProGlyTyrHisAlaPheTyrCysHisGly                          
354045                                                                    
GluCysProPheProLeuAlaAspHisLeuAsnSerThrAsnHisAla                          
505560                                                                    
IleValGlnThrLeuValAsnSerValAsnSerLysIleProLysAla                          
65707580                                                                  
CysCysValProThrGluLeuSerAlaIleSerMetLeuTyrLeuAsp                          
859095                                                                    
GluAsnGluLysValValLeuLysAsnTyrGlnAspMetValValGlu                          
100105110                                                                 
GlyCysGlyCysArg                                                           
115                                                                       
(2) INFORMATION FOR SEQ ID NO:14:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 117 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: BMP-4                                                          
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..117                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                  
ArgSerProLysHisHisSerGlnArgAlaArgLysLysAsnLysAsn                          
151015                                                                    
CysArgArgHisSerLeuTyrValAspPheSerAspValGlyTrpAsn                          
202530                                                                    
AspTrpIleValAlaProProGlyTyrGlnAlaPheTyrCysHisGly                          
354045                                                                    
AspCysProPheProLeuAlaAspHisLeuAsnSerThrAsnHisAla                          
505560                                                                    
IleValGlnThrLeuValAsnSerValAsnSerSerIleProLysAla                          
65707580                                                                  
CysCysValProThrGluLeuSerAlaIleSerMetLeuTyrLeuAsp                          
859095                                                                    
GluTyrAspLysValValLeuLysAsnTyrGlnGluMetValValGlu                          
100105110                                                                 
GlyCysGlyCysArg                                                           
115                                                                       
(2) INFORMATION FOR SEQ ID NO:15:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 118 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: DPP                                                            
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..118                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                  
LysArgHisAlaArgArgProThrArgArgLysAsnHisAspAspThr                          
151015                                                                    
CysArgArgHisSerLeuTyrValAspPheSerAspValGlyTrpAsp                          
202530                                                                    
AspTrpIleValAlaProLeuGlyTyrAspAlaTyrTyrCysHisGly                          
354045                                                                    
LysCysProPheProLeuAlaAspHisPheAsnSerThrAsnHisAla                          
505560                                                                    
ValValGlnThrLeuValAsnAsnMetAsnProGlyLysValProLys                          
65707580                                                                  
AlaCysCysValProThrGlnLeuAspSerValAlaMetLeuTyrLeu                          
859095                                                                    
AsnAspGlnSerThrValValLeuLysAsnTyrGlnGluMetThrVal                          
100105110                                                                 
ValGlyCysGlyCysArg                                                        
115                                                                       
(2) INFORMATION FOR SEQ ID NO:16:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 119 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: BMP-3                                                          
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..119                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                  
GlnThrLeuLysLysAlaArgArgLysGlnTrpIleGluProArgAsn                          
151015                                                                    
CysAlaArgArgTyrLeuLysValAspPheAlaAspIleGlyTrpSer                          
202530                                                                    
GluTrpIleIleSerProLysSerPheAspAlaTyrTyrCysSerGly                          
354045                                                                    
AlaCysGlnPheProMetProLysSerLeuLysProSerAsnHisAla                          
505560                                                                    
ThrIleGlnSerIleValArgAlaValGlyValValProGlyIlePro                          
65707580                                                                  
GluProCysCysValProGluLysMetSerSerLeuSerIleLeuPhe                          
859095                                                                    
PheAspGluAsnLysAsnValValLeuLysValTyrProAsnMetThr                          
100105110                                                                 
ValGluSerCysAlaCysArg                                                     
115                                                                       
(2) INFORMATION FOR SEQ ID NO:17:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 115 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: MIS                                                            
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..115                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                  
ProGlyArgAlaGlnArgSerAlaGlyAlaThrAlaAlaAspGlyPro                          
151015                                                                    
CysAlaLeuArgGluLeuSerValAspLeuArgAlaGluArgSerVal                          
202530                                                                    
LeuIleProGluThrTyrGlnAlaAsnAsnCysGlnGlyValCysGly                          
354045                                                                    
TrpProGlnSerAspArgAsnProArgTyrGlyAsnHisValValLeu                          
505560                                                                    
LeuLeuLysMetGlnAlaArgGlyAlaAlaLeuAlaArgProProCys                          
65707580                                                                  
CysValProThrAlaTyrAlaGlyLysLeuLeuIleSerLeuSerGlu                          
859095                                                                    
GluArgIleSerAlaHisHisValProAsnMetValAlaThrGluCys                          
100105110                                                                 
GlyCysArg                                                                 
115                                                                       
(2) INFORMATION FOR SEQ ID NO:18:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 121 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: Inhibin alpha                                                  
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..121                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                  
LeuArgLeuLeuGlnArgProProGluGluProAlaAlaHisAlaAsn                          
151015                                                                    
CysHisArgValAlaLeuAsnIleSerPheGlnGluLeuGlyTrpGlu                          
202530                                                                    
ArgTrpIleValTyrProProSerPheIlePheHisTyrCysHisGly                          
354045                                                                    
GlyCysGlyLeuHisIleProProAsnLeuSerLeuProValProGly                          
505560                                                                    
AlaProProThrProAlaGlnProTyrSerLeuLeuProGlyAlaGln                          
65707580                                                                  
ProCysCysAlaAlaLeuProGlyThrMetArgProLeuHisValArg                          
859095                                                                    
ThrThrSerAspGlyGlyTyrSerPheLysTyrGluThrValProAsn                          
100105110                                                                 
LeuLeuThrGlnHisCysAlaCysIle                                               
115120                                                                    
(2) INFORMATION FOR SEQ ID NO:19:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 121 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: Inhibin betaA                                                  
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..121                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                  
ArgArgArgArgArgGlyLeuGluCysAspGlyLysValAsnIleCys                          
151015                                                                    
CysLysLysGlnPhePheValSerPheLysAspIleGlyTrpAsnAsp                          
202530                                                                    
TrpIleIleAlaProSerGlyTyrHisAlaAsnTyrCysGluGlyGlu                          
354045                                                                    
CysProSerHisIleAlaGlyThrSerGlySerSerLeuSerPheHis                          
505560                                                                    
SerThrValIleAsnHisTyrArgMetArgGlyHisSerProPheAla                          
65707580                                                                  
AsnLeuLysSerCysCysValProThrLysLeuArgProMetSerMet                          
859095                                                                    
LeuTyrTyrAspAspGlyGlnAsnIleIleLysLysAspIleGlnAsn                          
100105110                                                                 
MetIleValGluGluCysGlyCysSer                                               
115120                                                                    
(2) INFORMATION FOR SEQ ID NO:20:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 120 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: Inhibin betaB                                                  
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..120                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                  
ArgIleArgLysArgGlyLeuGluCysAspGlyArgThrAsnLeuCys                          
151015                                                                    
CysArgGlnGlnPhePheIleAspPheArgLeuIleGlyTrpAsnAsp                          
202530                                                                    
TrpIleIleAlaProThrGlyTyrTyrGlyAsnTyrCysGluGlySer                          
354045                                                                    
CysProAlaTyrLeuAlaGlyValProGlySerAlaSerSerPheHis                          
505560                                                                    
ThrAlaValValAsnGlnTyrArgMetArgGlyLeuAsnProGlyThr                          
65707580                                                                  
ValAsnSerCysCysIleProThrLysLeuSerThrMetSerMetLeu                          
859095                                                                    
TyrPheAspAspGluTyrAsnIleValLysArgAspValProAsnMet                          
100105110                                                                 
IleValGluGluCysGlyCysAla                                                  
115120                                                                    
(2) INFORMATION FOR SEQ ID NO:21:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 114 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: TGF-beta1                                                      
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..114                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                  
ArgArgAlaLeuAspThrAsnTyrCysPheSerSerThrGluLysAsn                          
151015                                                                    
CysCysValArgGlnLeuTyrIleAspPheArgLysAspLeuGlyTrp                          
202530                                                                    
LysTrpIleHisGluProLysGlyTyrHisAlaAsnPheCysLeuGly                          
354045                                                                    
ProCysProTyrIleTrpSerLeuAspThrGlnTyrSerLysValLeu                          
505560                                                                    
AlaLeuTyrAsnGlnHisAsnProGlyAlaSerAlaAlaProCysCys                          
65707580                                                                  
ValProGlnAlaLeuGluProLeuProIleValTyrTyrValGlyArg                          
859095                                                                    
LysProLysValGluGlnLeuSerAsnMetIleValArgSerCysLys                          
100105110                                                                 
CysSer                                                                    
(2) INFORMATION FOR SEQ ID NO:22:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 114 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: TGF-beta2                                                      
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..114                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                  
LysArgAlaLeuAspAlaAlaTyrCysPheArgAsnValGlnAspAsn                          
151015                                                                    
CysCysLeuArgProLeuTyrIleAspPheLysArgAspLeuGlyTrp                          
202530                                                                    
LysTrpIleHisGluProLysGlyTyrAsnAlaAsnPheCysAlaGly                          
354045                                                                    
AlaCysProTyrLeuTrpSerSerAspThrGlnHisSerArgValLeu                          
505560                                                                    
SerLeuTyrAsnThrIleAsnProGluAlaSerAlaSerProCysCys                          
65707580                                                                  
ValSerGlnAspLeuGluProLeuThrIleLeuTyrTyrIleGlyLys                          
859095                                                                    
ThrProLysIleGluGlnLeuSerAsnMetIleValLysSerCysLys                          
100105110                                                                 
CysSer                                                                    
(2) INFORMATION FOR SEQ ID NO:23:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 114 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: TGF-beta3                                                      
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..114                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                  
LysArgAlaLeuAspThrAsnTyrCysPheArgAsnLeuGluGluAsn                          
151015                                                                    
CysCysValArgProLeuTyrIleAspPheArgGlnAspLeuGlyTrp                          
202530                                                                    
LysTrpValHisGluProLysGlyTyrTyrAlaAsnPheCysSerGly                          
354045                                                                    
ProCysProTyrLeuArgSerAlaAspThrThrHisSerThrValLeu                          
505560                                                                    
GlyLeuTyrAsnThrLeuAsnProGluAlaSerAlaSerProCysCys                          
65707580                                                                  
ValProGlnAspLeuGluProLeuThrIleLeuTyrTyrValGlyArg                          
859095                                                                    
ThrProLysValGluGlnLeuSerAsnMetValValLysSerCysLys                          
100105110                                                                 
CysSer                                                                    
(2) INFORMATION FOR SEQ ID NO:24:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 116 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: TGF-beta4                                                      
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..116                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                                  
ArgArgAspLeuAspThrAspTyrCysPheGlyProGlyThrAspGlu                          
151015                                                                    
LysAsnCysCysValArgProLeuTyrIleAspPheArgLysAspLeu                          
202530                                                                    
GlnTrpLysTrpIleHisGluProLysGlyTyrMetAlaAsnPheCys                          
354045                                                                    
MetGlyProCysProTyrIleTrpSerAlaAspThrGlnTyrThrLys                          
505560                                                                    
ValLeuAlaLeuTyrAsnGlnHisAsnProGlyAlaSerAlaAlaPro                          
65707580                                                                  
CysCysValProGlnThrLeuAspProLeuProIleIleTyrTyrVal                          
859095                                                                    
GlyArgAsnValArgValGluGlnLeuSerAsnMetValValArgAla                          
100105110                                                                 
CysLysCysSer                                                              
115                                                                       
(2) INFORMATION FOR SEQ ID NO:25:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 114 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: TGF-beta5                                                      
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..114                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:                                  
LysArgGlyValGlyGlnGluTyrCysPheGlyAsnAsnGlyProAsn                          
151015                                                                    
CysCysValLysProLeuTyrIleAsnPheArgLysAspLeuGlyTrp                          
202530                                                                    
LysTrpIleHisGluProLysGlyTyrGluAlaAsnTyrCysLeuGly                          
354045                                                                    
AsnCysProTyrIleTrpSerMetAspThrGlnTyrSerLysValLeu                          
505560                                                                    
SerLeuTyrAsnGlnAsnAsnProGlyAlaSerIleSerProCysCys                          
65707580                                                                  
ValProAspValLeuGluProLeuProIleIleTyrTyrValGlyArg                          
859095                                                                    
ThrAlaLysValGluGlnLeuSerAsnMetValValArgSerCysAsn                          
100105110                                                                 
CysSer                                                                    
(2) INFORMATION FOR SEQ ID NO:26:                                         
(i) SEQUENCE CHARACTERISTICS:                                             
(A) LENGTH: 454 amino acids                                               
(B) TYPE: amino acid                                                      
(C) STRANDEDNESS: single                                                  
(D) TOPOLOGY: linear                                                      
(ii) MOLECULE TYPE: protein                                               
(vii) IMMEDIATE SOURCE:                                                   
(B) CLONE: HUMAN GDF-9                                                    
(ix) FEATURE:                                                             
(A) NAME/KEY: Protein                                                     
(B) LOCATION: 1..454                                                      
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                                  
MetAlaArgProAsnLysPheLeuLeuTrpPheCysCysPheAlaTrp                          
151015                                                                    
LeuCysPheProIleSerLeuGlySerGlnAlaSerGlyGlyGluAla                          
202530                                                                    
GlnIleAlaAlaSerAlaGluLeuGluSerGlyAlaMetProTrpSer                          
354045                                                                    
LeuLeuGlnHisIleAspGluArgAspArgAlaGlyLeuLeuProAla                          
505560                                                                    
LeuPheLysValLeuSerValGlyArgGlyGlySerProArgLeuGln                          
65707580                                                                  
ProAspSerArgAlaLeuHisTyrMetLysLysLeuTyrLysThrTyr                          
859095                                                                    
AlaThrLysGluGlyIleProLysSerAsnArgSerHisLeuTyrAsn                          
100105110                                                                 
ThrValArgLeuPheThrProCysThrArgHisLysGlnAlaProGly                          
115120125                                                                 
AspGlnValThrGlyIleLeuProSerValGluLeuLeuPheAsnLeu                          
130135140                                                                 
AspArgIleThrThrValGluHisLeuLeuLysSerValLeuLeuTyr                          
145150155160                                                              
AsnIleAsnAsnSerValSerPheSerSerAlaValLysCysValCys                          
165170175                                                                 
AsnLeuMetIleLysGluProLysSerSerSerArgThrLeuGlyArg                          
180185190                                                                 
AlaProTyrSerPheThrPheAsnSerGlnPheGluPheGlyLysLys                          
195200205                                                                 
HisLysTrpIleGlnIleAspValThrSerLeuLeuGlnProLeuVal                          
210215220                                                                 
AlaSerAsnLysArgSerIleHisMetSerIleAsnPheThrCysMet                          
225230235240                                                              
LysAspGlnLeuGluHisProSerAlaGlnAsnGlyLeuPheAsnMet                          
245250255                                                                 
ThrLeuValSerProSerLeuIleLeuTyrLeuAsnAspThrSerAla                          
260265270                                                                 
GlnAlaTyrHisSerTrpTyrSerLeuHisTyrLysArgArgProSer                          
275280285                                                                 
GlnGlyProAspGlnGluArgSerLeuSerAlaTyrProValGlyGlu                          
290295300                                                                 
GluAlaAlaGluAspGlyArgSerSerHisHisArgHisArgArgGly                          
305310315320                                                              
GlnGluThrValSerSerGluLeuLysLysProLeuGlyProAlaSer                          
325330335                                                                 
PheAsnLeuSerGluTyrPheArgGlnPheLeuLeuProGlnAsnGlu                          
340345350                                                                 
CysGluLeuHisAspPheArgLeuSerPheSerGlnLeuLysTrpAsp                          
355360365                                                                 
AsnTrpIleValAlaProHisArgTyrAsnProArgTyrCysLysGly                          
370375380                                                                 
AspCysProArgAlaValGlyHisArgTyrGlySerProValHisThr                          
385390395400                                                              
MetValGlnAsnIleIleTyrGluLysLeuAspSerSerValProArg                          
405410415                                                                 
ProSerCysValProAlaLysTyrSerProLeuSerValLeuThrIle                          
420425430                                                                 
GluProAspGlySerIleAlaTyrLysGluTyrGluAspMetIleAla                          
435440445                                                                 
ThrLysCysThrCysArg                                                        
450                                                                       
__________________________________________________________________________

Claims

I claim:

1. Substantially pure growth differentiation factor-9 (GDF-9) having the amino acid sequence as set forth in SEQ ID NO:4 or SEQ ID NO:26.

2. An isolated polynucleotide encoding GDF-9 polypeptide tide having an amino acid sequence as set forth in SEQ ID NO:4 or SEQ ID NO:26.

3. The polynucleotide of claim 2, wherein the polynucleotide is isolated from a mammalian cell.

4. The polynucleotide of claim 3, wherein the mammalian cell is selected from the group consisting of mouse, rat, and human cell.

5. An expression vector including the polynucleotide of claim 2.

6. The vector of claim 5, wherein the vector is a plasmid.

7. The vector of claim 5, wherein the vector is a viral vector.

8. A host cell containing the vector of claim 5.

9. The host cell of claim 8, wherein the cell is prokaryotic.

10. The host cell of claim 8, wherein the cell is eukaryotic.

11. An isolated polynucleotide selected from the group consisting of:

a) SEQ ID NO:3;

b) SEQ ID NO:3, wherein T can also be U;

c) nucleic sequences complementary to SEQ ID NO:3; and

d) fragments of a), b), or c) that are at least 15 bases in length and that will hybridize to DNA which encodes the GDF-9 protein of SEQ ID NO:4 or SEQ ID NO:26.

12. A method for detecting an oocyte in a sample comprising contacting the sample with a nucleic acid probe which is a fragment as defined in part d) of claim 11 or an antibody which binds to a GDF-9 polypeptide as set forth in SEQ ID NO:4 or SEQ ID NO:26, wherein binding is indicative of the presence of an oocyte.

13. The method of claim 12, wherein the contacting is in vitro.

14. The method of claim 12, wherein the contacting is in situ.