EP1248798A2 - Humane dns sequenzen - Google Patents

Humane dns sequenzen

Info

Publication number
EP1248798A2
EP1248798A2 EP00966368A EP00966368A EP1248798A2 EP 1248798 A2 EP1248798 A2 EP 1248798A2 EP 00966368 A EP00966368 A EP 00966368A EP 00966368 A EP00966368 A EP 00966368A EP 1248798 A2 EP1248798 A2 EP 1248798A2
Authority
EP
European Patent Office
Prior art keywords
htes3
hfbr2
hutel
hfkd2
nucleic acid
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP00966368A
Other languages
English (en)
French (fr)
Inventor
Stefan Wiemann
Jens Tampe
Gabriele Nordsiek
Klaus Hornischer
Michael Boecher
Helmut Bloecker
Thoams Regiert
Claudia Rittmueller
Wilhelm Ansorge
Sabine Glassl
H.W. Mewes
Norman Strack
Christian Gruber
Johann Gassenhuber
Karl Koehrer
Andreas Beyer
Rolf Wambutt
Dagmar Heubner
Juergen Lauber
Christoph Koenig
Andreas Duesterhoeft
André Bahr
Birgit Ottenwaelder
Brigitte Obermaier
Helmut Blum
Ruth Wellenreuther
Annemarie Poustka
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Original Assignee
Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV filed Critical Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Publication of EP1248798A2 publication Critical patent/EP1248798A2/de
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals

Definitions

  • Current techniques tend to utilize only a single or limited number of markers, thus answering only very simple questions that are of questionable medical import. For example, a typical in vitro assay may ask whether a lead compound binds a particular receptor, which has been implicated in a certain disorder. It is presumed that such binding is indicative of therapeutic usefulness, but it does not even purport to address systemic effects.
  • Toxicity is usually measured by animal testing. Aside from the complications related to in vivo versus in vitro testing, such screens are insufficient because of differences in metabolism, uptake, etc., relative to humans. Thus, improved methods would be not only be in v.tr ⁇ -based, they would also be more "human. "
  • arrays containing large numbers of these targets that can be assayed simultaneously. If such an array contains a large enough population of targets, it can be used to essentially mimic the systemic response. In other words, the array becomes an in vitro surrogate for the human body. The more refined the array, the more accurate the predictive capability. In theory, an array could be constructed that can detect all of the known human expression products simultaneously, thereby, providing a very reliable indicator of the human response to a given compound. These arrays offer advantages over the present in vitro screening systems in that they can assay large numbers of responses simultaneously. They are superior to animal testing because they are more "human” and, thus, more predictive of human responses.
  • such targets will be provided with additional physiologically relevant information, such as whether the target is expressed in a particular tissue and whether it is related to a known functional class of targets.
  • additional physiologically relevant information such as whether the target is expressed in a particular tissue and whether it is related to a known functional class of targets.
  • the artisan can focus as needed, for example, on tissue-specific effects or target class-specific effects, thereby providing information useful in evaluating efficacy and/or toxicity.
  • the present invention responds to the aforementioned and other needs in the field by providing a population of novel targets useful, inter alia, in the profiling and medicinal contexts described above.
  • the invention provides sequences of human cDNA clones that were isolated from libraries generated from different human tissues.
  • assemblages comprising different populations of human nucleic acids, proteins and antibodies are provided.
  • cDNA library-specific assemblages and target-family-specific targets are provided.
  • a database of human nucleotide and protein sequences are provided.
  • novel human nucleotide and protein sequences are provided in electronic form. In one embodiment, one or more of these sequences is provided in a searchable database.
  • the invention provides biologically active target molecules useful in treating or detecting human disorders. Further to this object, the invention provides nucleic acid and protein molecules that have the capacity to affect disease etiology or symptoms or correlate with known disease states. Also further to this object, a database is provided which comprises the disclosed molecules in electronic form.
  • compositions which comprise an effective amount of a pharmaceutical agent, wherein the pharmaceutical agent is selected from the group consisting of one or more polypeptides contemplated by the invention, variants or functional derivatives thereof, and antibodies thereto; and a physiologically acceptable carrier or excipient.
  • the invention results from a need in the art for new human nucleic acids and proteins. This need arises in several contexts. First, there is a need to identify targets for therapeutic intervention. Second, there is a need to identify molecules that may be adversely affected in a therapeutic context, thereby resulting in toxicity. Knowledge of these molecules will aid in the design of new medicaments with enhanced efficacy and decreased toxicity. Finally, the need encompasses human nucleic acids and proteins that have medicinal applicability in their own right.
  • the present inventors set out to isolate and sequence human cDNAs from tissue-specific libraries. In this way, they represent subsets of molecules likely to be targets for therapeutic intervention or for avoiding toxicity. In addition, the inventors divided the molecules into various sub-categories, based on suspected functionality, structural similarity etc, which are of interest from a pharmacological perspective. These molecules are disclosed in provisional application serial nos. 60/149,499 and 60/156,503, filed August 18, 1999, and September 28, 1999, respectively, both of which are hereby incorporated by reference in their entirety.
  • the present invention provides novel polynucleotide molecules that, in some instances, have similarities with known molecules.
  • inventive DNAs were cloned from five different human cDNA libraries.
  • the invention provides their protein translations and antibodies derived from them.
  • inventive DNA and protein sequences are show individually, below.
  • inventive nucleic acids also include the complements of these DNA sequences, as well as their RNA counterparts. Methods of producing the molecules also are provided. Further, the invention provides methods for detecting all or part of the molecules and of detecting polynucleotides encoding all or part of the molecules.
  • the inventive molecules derive from five cDNA libraries: human fetal brain; human fetal kidney; human mammary carcinoma; human testis; and human uterus.
  • each sequence bears a designation that indicates from which library it is derived.
  • these designations are: "hfpbr” for human fetal brain; “hfkd” for human fetal kidney; “hmcf” for human mammary carcinoma; “htes” for human testis; and “hute” for human uterus.
  • the individual libraries were constructed and screened as described below in the examples.
  • the protein and DNA molecules of the invention are variously described herein as "target” molecules or “inventive” molecules.
  • the sequences and other information pertinent to the nucleic acid and protein molecules of the invention are shown, below. Interpreting the data disclosed with the Table and cDNA sequences, below:
  • the clones were assigned to the following fourteen functional and/or tissue-derived groups:
  • the individual clone files are structured in the same pattern.
  • the Sections are separated by paragraphs.
  • DKFZ producer of library
  • _ an underscore to separate library information from plate information plate number (e.g. "16") plate coordinates (letter first; e.g. "fl4")
  • Short Information specifications about the cDNA (who sequenced, completeness of the cDNA, similarity, who sequenced, chromosomal localisation, length of cDNA, localisation of poly A tail and polyadenylation signal)
  • Pedant Information output of fully automated annotation summarises peptide information, homologies, patterns as follows:
  • Blocks are multiply aligned ungapped segments corresponding to the most highly conserved regions of proteins.
  • the blocks for the Blocks Database are made automatically by looking for the most highly conserved regions in groups of proteins documented in the Prosite Database.
  • the Prosite pattern for a protein group is not used in any way to make the Blocks Database and the pattern may or may not be contained in one of the blocks representing a group.
  • These blocks are then calibrated against the SWISS-PROT database to obtain a measure of the chance distribution of matches. It is these calibrated blocks that make up the Blocks Database.
  • the WWW versions of the Prosite and SWISS-PROT Databases that are used on this server are located at the ExPASy World Wide Web (WWW) Molecular Biology Server of the Geneva University Hospital and the University of Geneva. World Wide Web URL http://blocks.fhcrc.org/blocks/about_blocks.html/ is the entry point to the database.
  • the scop database provides a detailed and comprehensive description of the structural and evolutionary relationships between all proteins whose structure is known, including all entries in Brookhaven National Laboratory's Protein Data Bank (PDB). It is available as a set of tightly linked hypertext documents which make the large database comprehensible and accessible. In addition, the hypertext pages offer a panoply of representations of proteins, including links to PDB entries, sequences, references, images and interactive display systems. World Wide Web URL http://scop.mrc-lmb.cam.ac.uk/scop/ is the entry point to the database. Existing automatic sequence and structure comparison tools cannot identify all structural and evolutionary relationships between proteins.
  • the scop classification of proteins has been constructed manually by visual inspection and comparison of structures, but with the assistance of tools to make the task manageable and help provide generality. Proteins are classified to reflect both structural and evolutionary relatedness. Many levels exist in the hierarchy, but the principal levels are family, superfamily and fold. The exact position of boundaries between these levels are to some degree subjective. Scop evolutionary classification is generally conservative: where any doubt about relatedness exists, we made new divisions at the family and superfamily levels.
  • ENZYME is a repository of information relative to the nomenclature of enzymes. It is primarily based on the recommendations of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (IUBMB) and it describes each type of characterized enzyme for which an EC (Enzyme Commission) number has been provided. World Wide Web URL http://www.expasy.ch/enzyme/ is the entry point to the database.
  • PIR Protein Information Resource
  • PIR Protein Information Resource
  • PREDATOR is a secondary structure prediction program. It takes as input a single protein sequence to be predicted and can optimally use a set of unaligned sequences as additional information to predict the query sequence.
  • the mean prediction accuracy of PREDATOR is 68% for a single sequence and 75% for a set of related sequences.
  • PREDATOR does not use multiple sequence alignment. Instead, it relies on careful pairwise local alignments of the sequences in the set with the query sequence to be predicted.
  • PROSITE is a database of protein families and domains. It consists of biologically significant sites, patterns and profiles that help to reliably identify to which known protein family (if any) a new sequence belongs. World Wide Web URL http://www.expasy.ch/prosite/ is the entry point to the database. A description of the prosite consensus patterns is also provided, below.
  • PFAM protein families
  • World Wide Web URL http://www.sanger.ac.uk/Pfam/ is the entry point to the database.
  • Clones were deposited as a pool with the American Type Culture Collection under accession number , from which each clone comprising a particular polynucleotide is obtainable. Each clone has been transfected into separate bacterial cells (E. coli) in this composite deposit.
  • the clones may also be obtained from the Resource Center of the German Human Genome Project (Heubner Weg 6, 14059 Berlin, GERMANY).
  • the Resource Center library numbers are slightly different that those presented here, but may be readily obtained by the following key or with the assistance of Resource Center personnel.
  • the library name becomes a number: brain (hfbr2) becomes 564; kidney (hfkd2) becomes 566; mammary carcinoma (hmcfl) becomes 727; testis (htes3) becomes 434;and uterus (hutel) becomes 586.
  • the plate number is converted to two digits (e.g., "2" becomes "02") and is moved behind the plate coordinate, and the underscore is dropped. The following examples are helpful:
  • the libraries were constructed using two commercially available vectors.
  • the brain (hfbr2 designations) and kidney (hfkd2 designations) libraries utilize pAMP 1 from Life Technologies and are maintained in XL-2Blue (Strategene); the uterus (hutel), testes (htes3) and mammary carcinoma (hmcfl) libraries are constructed in pSPORTl, also from Life Technologies, and are maintained in DH10B (LifeTechnologies).
  • All inserts may be excised with a Notl/Sall digestion.
  • universal primers, flanking the cloning region may be used to amplify the inserts using PCR methods.
  • Bacterial cells containing a particular clone can be obtained from the composite deposit as follows:
  • oligonucleotide probe or probes should be designed to the sequence that is known for that particular clone. This sequence can be derived from the sequences provided herein, or from a combination of those sequences. Methods of probe design are presented below.
  • Oligonucleotide probes may be labeled with ⁇ - 2 P ATP (specific activity 6000 Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for labeling oligonucleotides. Other, non-radioactive labeling techniques can also be used. Unincorporated label typically is removed by gel filtration chromatography or other established methods. The amount of radioactivity incorporated into the probe can be quantified by measurement in a scintillation counter. Preferably, specific activity of the resulting probe generally should be approximately 4X10 6 dmp/pmole.
  • the bacterial culture containing the pool of full-length clones should preferably be thawed and 100 ⁇ l of the stock used to inoculate a sterile culture flask containing 25 ml of sterile L-broth containing ampicillin at 50 - 100 ⁇ g/ml (for XL-2Blue strains 25 ⁇ g/ml tetracycline should also be used).
  • the culture should preferably be grown to saturation at 37°C, and the saturated culture should preferably be diluted in fresh L-broth.
  • Aliquots of these dilutions should preferably be plated to determine the dilution and volume which will yield approximately 5000 distinct and well-separated colonies on solid bacteriological media containing L-broth containing ampicillin at 100 ⁇ g/ml (for XL-2Blue strains 25 ⁇ g/ml tetracycline should also be used)and agar at l.5% in a l50 mm petri dish when grown overnight at 37°C. Other known methods of obtaining distinct, well-separated colonies can also be employed.
  • Standard colony hybridization procedures should then be used to transfer the colonies to nitrocellulose filters and lyse, denature and bake them.
  • the filter is then preferably incubated at 65°C. for 1 hour with gentle agitation in 6 x SSC (20 x stock is 175.3 g NaCl/liter, 88.2 g Na citrate/liter, adjusted to pH 7.0 with NaOH) containing 0.5% SDS, 100 ⁇ g/ml of yeast RNA, and 10 mM EDTA (approximately 10 mL per 150 mm filter).
  • the probe is then added to the hybridization mix at a concentration greater than or equal to 1X10 6 dpm/mL.
  • the filter is then preferably incubated at 65°C.
  • the filter is then preferably washed in 500 mL of 2 x SSC/0.5% SDS at room temperature without agitation, preferably followed by 500 mL of 2 x SSC/0.1% SDS at room temperature with gentle shaking for 15 minutes. A third wash with 0.1 x SSC/0.5% SDS at 65°C. for 30 minutes to 1 hour is optional.
  • the filter is then preferably dried and subjected to autoradiography for sufficient time to visualize the positives on the X-ray film. Other known hybridization methods can also be employed.
  • the positive colonies are picked, grown in culture, and plasmid DNA isolated using standard procedures.
  • the clones can then be verified by restriction analysis, hybridization analysis, or DNA sequencing.
  • clones may be grown as described above, and PCR used to isolate the insert DNAs. Methods of PCR are described below and are otherwise well known .
  • DNA sequences found herein derive from individual clones, which are publicly available, as noted above.
  • any specific sequence disclosed herein readily can be screened for errors by resequencing a particular fragment, in both directions (i.e. , by sequencing both strands).
  • error screening can be performed by amplifying and/or cloning any of the inventive DNAs, using for example RT- PCR, and sequencing the resulting amplified product. In the event that there is a sequencing error, reference should be made to the deposited clone as the correct sequence.
  • inventive molecules and their derivatives are susceptible to a wide variety of uses, based on functional and/or structural properties.
  • the skilled worker will appreciate, based on the biological activities detailed below, and discussed with regard to the individual sequences disclosed below, that the inventive molecules will find usefulness in numerous therapeutic and diagnostic applications.
  • the DNA molecules can be used as fertilizer supplements due to their high nitrogen and phosphorus contents. Since the DNAs are of defined length, they are also useful in gel electrophoresis as molecular weight markers. Due to their similarity with known molecules, certain of the DNA molecules and their variants and derivatives may be used in any number of different diagnostic procedures and therapeutic applications. They may also be used to make the encoded proteins. The proteins themselves have many possible uses. They may be used as a nutritional supplement for humans, animals and even for laboratory use as, for example, medium for bacterial cultures. Moreover, since the proteins are of defined, known sizes, they may be used as molecular weight markers for gel electrophoresis and gel filtration. Because they are of defined sequences, they also have use in microsequencing and protein fingerprinting applications.
  • Expression profiling generally entails constructing an array of indicators that signal the presence of a particular RNA or protein expression product. Such arrays can be used to evaluate, for example, pharmacological effectiveness and toxicity.
  • expression profiles from such arrays can be generated from cells treated with known compounds, having known properties, and these profiles can be compared to profiles of unknowns to evaluate similarities and differences, which can be correlated with efficacy or toxicity.
  • profiling includes diagnosis, tracking development, and ascertaining signaling and metabolic pathways.
  • references describing profiling and its uses see Farr et ⁇ /., U.S. Patent 5,811,231 (1998); Seilhamer et al. , U.S. Patent 5,840,484 (1998); Rine et al, U.S. Patent No. 5,777,888 (1998); WO 97/27317; WO 99/05323; WO 99/09218; and WO 99/14369.
  • Lipshutz et al U.S. Patent No. 5,856,174 (1999) and Anderson et al., U.S. Patent No. 5,922,591 (1999).
  • a subset of the inventive DNAs will be arrayed on a substrate, like a gene chip, a filter or a 96-well plate.
  • Test samples containing cells are maintained in the presence of a label capable of incorporation into nascent mRNA.
  • Samples are treated with test and control compounds, which will induce mRNA expression in the sample, resulting in incorporation of label.
  • Whole mRNA is isolated and applied to the array such that it hybridizes with the DNAs contained therein. After washing, the amount of hybridization is quantified and a profile is generated. These steps are repeated with various control and test compounds, thereby generating a library of profiles, which can be used to ascertain the relationships relevant to pharmacological efficacy or toxicity.
  • the matrices used in such profiling need not be limited to those utilizing DNAs. Rather, other nucleic acids, like RNAs and protein nucleic acids (PNAs), as well as the inventive proteins and antibodies corresponding to the inventive proteins may also be employed. Hence, for example, antibodies could form the array and the samples could be treated in order to label nascent proteins. Whole proteins then would be isolated and applied to the antibody matrix. Developing the resulting signal would result in a protein expression profile, which is useful in essentially the same manner as the nucleic acid profile. A protein matrix could be used, for example, in evaluating antibody responses to pharmaceutical agents in order to eliminate possible cross-reactivity.
  • PNAs protein nucleic acids
  • nucleic acids are used in the matrix
  • variants as defined below
  • This can be used to account for genetic variations that are of little or no consequence to the function of the resultant gene product.
  • they can account for wobble or conservative amino acid variations that do not perturb function, like variations in some of the protein motifs elucidated below.
  • each position in the matrix can employ multiple nucleic acid probes that account for a series of variants.
  • Expression profiling may also be done, in another embodiment, using two- dimensional protein gels in which the inventive proteins are detected.
  • the resultant profiles can be used in the same way as described.
  • Matrices useful for profiling may be constructed based on different criteria. Of course, the more relevant profiles will take into account expression of most human genes, preferably all of them. In certain situations, however, it is advantageous to look at a smaller subset. For example, if one were concerned about fetal neural toxicity, a fetal brain-specific matrix might be chosen. On the other hand, if one were interested in targeting mammary carcinoma tissue, a corresponding matrix could be used. Thus, matrices may be constructed using all of the sequences available from a tissue-specific library.
  • a proliferating cell must coordinate replication and chromosomal separation to ensure that the genome is replicated completely, and that a single copy, is correctly inherited by each daughter cell.
  • the cell cycle is the coordinated series of events that achieves these aims. Many of the key events are initiated by a family of conserved Seiren/threonine protein kinases, the cyclin-dependent kinases (CDKs), that are activated by the cyclin family of proteins (cyclins A-H).
  • CDKs cyclin-dependent kinases
  • the cyclin-CDK complexes are modulated by other protein kinases or phosphatases, and by binding specific inhibitor proteins.
  • CDK activity can be regulated allows the cell to respond to internal signals generated by preceding events in the cell cycle and to external growth signals.
  • the somatic cell cycle is divided into four phases: DNA replication (S phase) and chromosome separation (M phase) are separated by gap phases (GI and G2). At specific control points the decision to begin the next stage (DNA synthesis or mitosis) is carefully regulated.
  • Cdc2 the primary kinase
  • Cdc4 and Cdc6 are involved at the restriction point, where the cell can decide to proliferate or arrest (GK->G0) and Cdc7 is a CDK activating kinase (CAK) as well as a subunit of TFIIH.
  • CAK CDK activating kinase
  • Cyclin-CDK complexes are regulated in various ways. One is through phosphorylation by CDK activating kinases (CAK), like the Y15 kinase (Weel) and dephosphorylation by CDK associated phosphatases (CAP), like Cdc25A a member of the Cdc25 family (Cdc25A, B and C).
  • CAK CDK activating kinases
  • CAP CDK associated phosphatases
  • CKI CDK inhibitors
  • the cell cycle is also regulated through ubiquitin-mediated proteolysis involving the destruction of both cyclins and CDK inhibitors by the 26S proteasome, that requires an ubiquitin conjugating enzyme (UBC) and an ubiquitin ligase.
  • UBC ubiquitin conjugating enzyme
  • ULC ubiquitin conjugating enzyme
  • ubiquitin ligase The instability is conferred by PEST regions (cyclin D and E) or a ten amino acid region in the amino terminus (degradation box) in the A- and B-type cyclins. All these modifications play an important role for the cellular localization, because only the nuclear CDK-cyclin complexes are functional for cell cycle.
  • cyclines A, E and D are synthesized and bind to their cyclin-dependent kinase (CDK) partners.
  • CDK cyclin-dependent kinase
  • CDK complexes containing cyclins A, E and Dl are then imported into and concentrated within nuclei.
  • Cdk6- cyclin D3 has been localized to both cytoplasmic and nuclear compartments, although only the nuclear complex is active.
  • cyclin A and cyclin E complexes remain within the nucleus, whereas cyclin Dl relocalizes to the cytoplasm for proteolysis at the onset of S phase.
  • Cdc2-cyclin A is nuclear and remains so until it is degraded during mitosis.
  • cyclin Bl which binds to Cdc2 upon synthesis during S phase, is predominantly cytoplasmic.
  • Cdc2-cyclin B2 is also cytoplasmic, although this might occur through anchoring of the complex to some cytoplasmic constituent.
  • phosphorylation of cyclin Bl promotes accumulation of Cdc2 -cyclin Bl in the nucleus, whereas cyclin B2 remains in the cytoplasm until nuclear envelope breakdown.
  • Cdc2-cyclin B-Weel and Cdc25C Two crucial regulators of Cdc2-cyclin B-Weel and Cdc25C exist and are responsible for the G2 to M control point. Weel is a nuclear protein throughout the cell cycle, whereas Cdc25C binds to 14-3-3 proteins during interphase and remains predominantly cytoplasmic. In some systems Cdc25C, like cyclin Bl, rushes precipitously into the nucleus just before entry into mitosis.
  • the 110-kDa retinoblastoma (tumor suppressor) protein (RB), a pRB-family member is an important regulator of cell-cycle progression and differentiation.
  • RB suppresses inappropriate proliferation by arresting cells in GI by repressing the transcription of genes required for the transition into S phase.
  • E2F1-5 or DP family (DP 1-3) of transcription activators RB suppresses inappropriate proliferation by arresting cells in GI by repressing the transcription of genes required for the transition into S phase.
  • RB Before the cell proceeds into S phase, RB becomes phosphorylated at multiple sites by the cyclin dependent protein kinases (CDKs) and loses its transcriptional repressing activity.
  • CDKs cyclin dependent protein kinases
  • Cyclin E is the evolutionary conserved target for E2F and interacts together with CDC2 in late GI .
  • the kinase responsible for phosphorylating the unidentified kinetochore component in metaphase may be a member of the MAP kinase family and appears to be the proto oncogene c-MOS, a cytostatic factor (CSF) in meiosis.
  • CSF cytostatic factor
  • Tumor suppressors e.g. N33
  • Tumour-suppressor genes are known to be involved in the control of cell growth and division, interacting with proteins which control the cell cycle.
  • the N33 gene is significantly methylated in tumour cells, a mechanism by which tumor- suppressor genes are inactivated in cancer.
  • the N33 gene has been reported by OMIN OMIN (Online Mendelian Inheritance in Man at http://www.ncbi.nlm.nih.gov/htbin-post/Omin) to be associated (as potentially diagnostic, therapeutic, causative, and or related, etc..) with the following diseases: 1) prostate cancer suppression (OMIN *601385). Clones in this category include: fbr2_2kl4.
  • Cdc25C is a protein kinase that controls entry into mitosis by dephosphorylation of Cdc2.
  • Cdc25C function is regulated by phosphorylation, too.
  • Serine 216 phosphorylation of Cdc25C mediates the binding of 14-3-3 protein to Cdc25C.
  • C-TAK1 (Cdc twenty-five C associated protein kinase) phosphorylates Cdc25C on serine 216 in vitro. Alterations in the gene coding for the above protein kinase has been reported by OMIN to be associated (as potentially diagnostic, therapeutic, causative, and or related, etc%) with Pancreatic cancer (OMIN *60278). Clones in this category include: tes3_7j3.
  • prokaryotes and eukaryotes are the ability of the eukaryotic cell to adopt very different shapes dependent on its function during the differentiation process.
  • Animal cells vary from being round to extended cylindric forms like motorneurons or muscle cells. In humans, more than 100 different cell types can be distinguished, each having a characteristic shape. The form of a cell often is closely related to its capacity to move. Some completely differentiated cells like fibroblasts can still change their form actively, thereby migrating. Other cell types serve as motor elements - "macroscopically” like muscle cells or “microscopically” like ciliated epithelia.
  • Such tasks are fulfilled by a big class of proteins; on the one hand responsible for maintenance of cell structure and contacting neighbor cells or the intercellular matrix and on the other hand for cell motility.
  • the motility apparatus e.g. must be fixed in the cytoskeleton.
  • Three different types of filaments can be distinguished: Actin filaments, tubulin filaments and intermediate filaments, each present in almost all types of cells.
  • Actin filaments are built up of monomers (G- Actin).
  • actin myosin, for both of which several paralogous genes are known, as well as many more proteins are constituents of the contractile apparatus.
  • the "thin” and “thick filaments” in a muscle cell consist mainly of actin and myosin, respectively.
  • Length of the sarcomere is controlled by the giant protein titin.
  • actomyosin system is responsible for many other motions at cellular level, e.g. the amoeboid movement of pseudopodia or the fission of cells at the end of mitosis by a contractile ring.
  • actin fibers fulfill structural tasks like maintenance of the shape of stereocilia or microvilli.
  • actin filaments are connected by proteins like fimbrin.
  • actin fibers There is a network covering the complete cell volume with F-actin as a major constituent.
  • F-actin is highly dynamic. Management of the network structure and turnover is achieved by connecting proteins like alpha-actinin, fimbrin or fill-in; turnover is regulated by gelsolin, villin, and different capping- and fragmentation-proteins.
  • Microtubules are built up of alpha-beta tubulin heterodimers. Turnover of filaments is achieved by building-in and releasing of monomers with different time constant rates at both ends. The resulting cycle is called "treadmilling". Thirteen strings of tubulin duplets build up one subfiber, whereas one fiber contains two or three of those. A complete axoneme consists of 9 radial and 2 central fibers. This "9+2" - structure is the basis both of flagella, their basal bodies and centrioles. In flagella, several additional structures like radial elements exist. Nexin connects the fibers and dyneine is the motor ATPase which shifts the fibers relative to each other. Several genetic diseases like the Cartageneric syndrome are caused by deficiencies of distinct proteins in cilia.
  • microtubules are abundant in all types of cells. They are part of a delivery system for organelles, e.g. in the golgi apparatus. A further very important system based on microtubules is the mitotic spindle, it is organized by the centrosomes. Besides many other components, the major part of a centrosome are two centrioles which are built up of nine microtubule-triplets. Most remarkably, new centrioles are not synthesized de novo but generated by duplication of old ones.
  • Cytoplasmic microtubules are associated with many different proteins. Two major classes are known: The MAPs ("microtubule-associated proteins", with molecular masses between 200 and 300 kD) and the much smaller tau-Proteins with a MW between 60 and 70 kD. These proteins regulate the treadmill-process and the interaction with other structures in the cell.
  • MAPs microtubule-associated proteins
  • tau-Proteins with a MW between 60 and 70 kD.
  • intermediate filaments constitute a third class of filaments.
  • they do not participate in motility, nor are they dynamic structures subject to a vivid turnover.
  • the most important ones are neurofilaments (in neurons), keratin filaments (mainly in epithelial cells), and vimentin filaments (in many sorts different cell types).
  • the extracellular matrix consists of a network of proteins, glycoproteins and polysaccharides. Different proteins are present in relation to different mechanical demands:. Elastin is found in tissues with high elasticity (lungs, heart) whereas collagen, a more hard- wearing protein, is found in tendons and ligaments. Fibronectin is an extracellular protein highly important for cell adhesion.
  • Collagen alpha chain proteins Proteins with the typical (xxG)n repeat of collagen proteins and Pfam von Willebrand factor type A domain(s) suggest they are collagen alpha chains. These proteins can find application in modulation of connective tissue, bone and cartilage development and maintainance.
  • Radial spokehead protein Radial spokehead proteins, e.g., Chlamydomonas reinhardtii radial spokehead protein of flagella or axoneme and the Strongylocentrotus purpuratus sea urchin spermatozoa protein p63, and human proteins with similarity thereto are important for the maintenance of a planar form of sperm flagellar beating.
  • the human protein(s) can find application in modulating the structure of the human spermatozoa radial spoke head and modulation of sperm motility in men (e.g., in sterility). Clones in this category include: tes3_15i5.
  • Ankyrins are peripheral membrane proteins which interconnect integral proteins with the spectrin-based membrane skeleton. Thus these proteins are involved in coupling of cyto skeleton and cell membrane.
  • FGD1 -related F-actin binding protein FGD1 -related F-actin binding protein (Farbin/FGDl): FGD1 -related F-actin-binding protein (Farbin/FGDl) is a novel F-actin-binding protein. The gene locus fgdl seems to be responsible for faciogenital dysplasia or Aarskog-Scott syndrome. (OMIN 305400). Frabin binds F-actin and shows F-actin-cross-linking activity. Overexpression of frabin in Swiss 3T3 cells and COS7 cells induces cell shape change and c-Jun N-terminal kinase activation, as described for FGD1.
  • Cdc42p is an esin yeast, Cdc42p transduces signals to the actin cytoskeleton to initiate and maintain polarized growth and to mitogen-activated protein morphogenesis.
  • Cdc42p regulates a variety of actin-dependent events and induces the JNK/SAPK protein kinase cascade, which leads to the activation of transcription factors within the nucleus.
  • Clones in this category include: tes3_72kl5.
  • Paramyosins are a major structural component of thick filaments and invertebrate muscle. Paramyosins are promising antigens for immunization against several parasites, such as Schistosoma mansoni. Clones in this category include: tes3_7b22.
  • Tuftelin/enamelin are matrix proteins of the teeth. As other proteins involved in calcification, these proteins are also expressed in the uterus matrix. The new protein can find application in modulation of tissue-calcification, especially the uterus. As reported by OMIN, tuftelin has been associated (as potentially diagnostic, therapeutic, causative, and/or related, etc%) with amelogenesis imperfecta (OMIN *600087). Clones in this category include: utel_19g22.
  • CAR1 is involved in the regulation of cell-cell adhesion. OMIN reports the association (as potentially diagnostic, therapeutic, causative, and/or related, etc%) of CAR1 with tumor suppression by the reduction of tumor invasion (OMIN *116935). Clones in this category include: utel_24j6.
  • An animal cell that has achieved a certain level of development is said to be determined.
  • This differentiation of a cell may be irreversible and in that case the cell may be renewed only by simple duplication.
  • Other cells are renewed by means of stem cells which are immortal (e.g. stem cells of the bone marrow, epidermal stem cells).
  • stem cells which are immortal (e.g. stem cells of the bone marrow, epidermal stem cells).
  • the genetic control of development is extensively studied in non- vertebrates and vertebrates.
  • the classical animal model is the fruit fly Drosophilia and the modern model is the transgenic mouse. Animal transgenesis has proven to be useful for physiological as well as physiopathological studies. Besides the approach based on the random integration of a DNA construct in the mouse genome, gene targeting can be achieved using totipotent embryonic stem cells for targeted transgenesis. Transgenic mice are than derived from the embryonic stem cells.
  • mice can be created that express wild-type genes, mutant genes, marker genes or cell lethal genes in a tissue specific manner. These animal models allow to follow changes in tissue and organ development and lead to a better understanding of the cellular function of many genes or to the generation of animal models for human diseases. Fundamental problems in immunology, onset and development of cancer, regulation in fatty acid metabolism, aspects of cardiovascular function, control of the central nervous system development, analysis of reproductive development and function are only some examples of research interests.
  • Apoptosis is of importance for development and homeostasis of animals.
  • the key components of this program have been conserved in evolution from worms (C. elegans) to insects (Drosophilia) to humans.
  • the roles of apoptosis include the sculpting of structures during development, deletion of unneeded cells and tissues, regulation of growth and cell number, and the elimination of abnormal and potentially dangerous cells.
  • apoptosis provides "quality control mechanism" that limits the accumulation of harmful cells, such as virus-infected cells and tumor cells.
  • harmful apoptosis is associated with a wide variety of diseases, including AIDS, neuro-degenerative disorders and ischemic stroke.
  • Inducers of cell differentiation, cell cycle arrest and apoptosis might be the novel molecular targets for new anticancer agents in addition to the signaling pathways for growth factors and cytokines.
  • Proteins, factors, receptors and genes of importance in apoptosis Proteins, factors, receptors and genes of importance in apoptosis:
  • - Caspase-1 to Caspase-11 a family of proteases synthesized as an inactive proenzyme.
  • Targets of the activated enzymes include: poly(ADP-ribose) polymerase, DNA-dependent protein kinase, Ul ribonucleoprotein, nuclear laminins and cytoskeleton components (actin).
  • TNF - CD 95 (synonyms: Fas, APO-1), a receptor protein of the TNF -receptor family which includes TNF-R1 and TNF-R2 with the common characteristic of a 70 amino acid cytoplasmic domain.
  • Cytokine response modifier A a cowpox virus gene whose gene product inhibits caspases.
  • CAD Caspase-activated DNase
  • ICAD inhibitor
  • JNK proline-directed kinase
  • - p53 protein is essential for the induction of apoptosis as a response to chromosomal damage.
  • RIP Receptor interacting protein
  • TNF Tumor necrosis factor
  • TNF-receptor associated factor is an accessory protein that can bind to both TNF-R1 and TNF-R2.
  • Interleukins e.g. Interleukin-7: Interleukin precursors related to interleukin-7, for example, are expected to act as new growth factors for human B lineage cells. Additionally, these proteins should induce the gene rearrangement of the T-cell receptor repertoire, leading to thymocyte commitment, and subsequently induce both cytotoxic T-cell- and lymphocyte- activated killer cells These interleukins could find clinical application in a variety of conditions of hematolymphopoietic failure and different tumours, because of its recruitment of B cell lineage cells, cytotoxic T-cell- and lymphocyte-activated killer cells. (OMIN * 146660). Clones in this category include: tes3_35e21.
  • Testis-specific Y-encoded proteins The TSPY genes are arranged in clusters on the Y chromosome of many mammalian species. TSPY is believed to function in early spermatogenesis and is a candidate for GBY, the putative gonadoblastoma-inducing gene on the Y. Proteins of the TSPY-SET-NAP1L1 family represent proteins closely related to TSPY. These proteins seem to be involved in early spermatogenesis. Clones in this category include: fbr2_2dl5.
  • Eukaryotic cells rely for their viability on the partitioning of many basic cellular processes into membrane-bounded organelles. These are the nucleus, endoplasmic reticulum (ER), Golgi apparatus, endosomes, lysosomal compartments, mitochondria and peroxisomes. Most molecules destined for the lysosome, cell surface and outside the cell are routed through the ER and Golgi, which together with the vesicular intermediates between them, comprise the secretory pathway (Palade 1975). In the ER and Golgi compartments proteins are sorted, modified and often assembled into complexes en route to their final destination. Incorrectly assembled proteins are retained in the ER until they fold correctly or are targeted for degradation.
  • Additional proteins are translocated into and function within the lumenal spaces of organelles or are secreted.
  • proteins synthesized require targeting to membranes either for insertion into or transport across them.
  • a major purpose of this is growth.
  • the secretory pathway is dependent on an intact cytoskeleton and also closely linked to general metabolism by affecting ribosome biogenesis (Mizuta and Warner, 1994).
  • a huge number of proteins is required for targeting, translocation and sorting of newly synthesized proteins.
  • the first step in sorting is the recognition of cis-acting targeting or signal sequences that organelle-targeted proteins contain. This is carried out by cytosolic targeting factors and/or receptors on the membrane to which the protein is targeted. In some cases the primary sequences are extremely degenerate, with only the overall character being conserved (hydrophobicity for an ER signal sequence, helical amphiphilicity for mitochondrial targeting sequence (Kaiser et al, 1987; Lemire et al, 1989). Following the targeting step, proteins are either inserted into or transported across the membrane (translocated) through a proteinaceous apparatus (termed the translocon). The translocon include or recruit motors to drive the translocation process in the correct direction (Schatz and Dobberstein, 1996). Defined intracellular protein transport steps:
  • the compartmentalisation of processes is a prerequisite for a tight regulation of processes and activities.
  • the cells contain a highly dynamic set of membrane compartments that are responsible for packaging, sorting, secreting, and recycling proteins and other molecules. Trafficking between organelles within the secretory pathway occurs as vesicles derived from a donor compartment fuse with specific acceptor membranes, resulting in the directional transfer of cargo molecules.
  • This process is tightly controlled by the Rab/Ypt family of proteins (reviewed by Novick and Zerial, 1997 ), a branch of the superfamily of small GTPases.
  • Rab proteins regulate a variety of functions, including vesicle translocation and docking at specific fusion sites. Rabs may also play critical roles in higher order processes such as modulating the levels of neurotransmitter release in neurons, a likely mechanism in synaptic plasticity that underlies learning and memory (Geppert and Sudhof, 1998).
  • GTPases share a common three-dimensional fold that, in the GTP bound state, can bind a variety of downstream effector proteins.
  • GTP hydrolysis leads to a conformational change in the "switch" regions that renders the GTPase unrecognizable to its effectors. In this way, by localizing and activating a select set of effectors, a common structural motif is used to control a wide array of distinct cellular processes.
  • Rab proteins undergo a intricate cycle of membrane and protein interactions. Rabs are posttranslationally modified at C-terminal cysteines by the addition of two geranylgeranyl groups, which mediate membrane association when the Rab is in the GTP-bound state. After guanine nucleotide hydrolysis occurs, the Rab is extracted from the membrane upon forming a complex with a cytosolic GDP-dissociation inhibitor (GDI). This cytosolic intermediate is then recycled onto a newly forming vesicle, most likely through a secondary factor termed a GDI dissociation factor (GDF), which displaces GDI.
  • GDI cytosolic GDP-dissociation inhibitor
  • a guanidine nucleotide exchange factor promotes release of GDP and the subsequent loading of GTP.
  • the Rab is then free to associate with its specific set of effectors, which can in turn trigger events leading to the eventual fusion of the vesicle with a target membrane.
  • GTPase activating protein accelerates nucleotide hydrolysis, switching off the GTPase. The remaining GDP-bound Rab can then participate in a new round of fusion.
  • Rab interactions with effectors are likely to regulate vesicle targeting and membrane fusion in three ways.
  • a Rab may specifically facilitate vectorial vesicle transport. Vesicles are transported from their site of origin to acceptor compartments likely through associations with cytoskeletal elements and transport motors.
  • a protein has been identified with a domain structure that suggests a connection between the cytoskeleton and the Rabs. This protein, called Rabkinesin-6, contains a kinesin-like ATPase motor domain followed by a coiled-coil stalk region and a RBD that specifically binds Rab6 (Echard et al., 1998 ).
  • An additional link with the cytoskeleton is provided by the Rab effector, Rabphilin-3A.
  • Rabphilin-3A has been shown in vitro to interact with -actinin, an actin-bundling protein, but only when not bound to Rab3A (Kato et al., 1996 ). These results raise the intriguing possibility that Rab proteins regulate vesicle interactions with the cytoskeleton and thereby play an active role in targeting vesicles to their appropriate destinations.
  • Rab proteins may regulate membrane trafficking at the vesicle docking step.
  • a number of Rab effectors including Rabaptin-5, EEA1, Rabphilin-3A, and Rim, may serve as molecular tethers.
  • Each effector protein contains a RBD, followed by a linker region (some having the potential to form elongated coiled-coil structures), and a domain capable of interacting with a second Rab or the target membrane.
  • Rabaptin-5 for example, contains two RBDs, one near the N terminus that specifically recognizes Rab4 and a second near the C terminus that binds Rab5 (Vitale et al., 1998 ).
  • Rim which is localized to the target membrane
  • Rabphilin-3 A which is localized to the vesicle
  • N-terminal RBDs and C-terminal Ca2+-binding C2 domains implicating these effectors in synaptic vesicle localization or docking in response to Ca2+ influx (Wang et al., 1997 ).
  • Tethering effectors may also recognize protein complexes on the acceptor membrane.
  • Sec4p a yeast Rab3A homolog, interacts with the exocyst (Guo et al., 1999 ), a complex of seven or more subunits that is assembled at sites of vesicle fusion along the plasma membrane.
  • the exocyst complex may therefore function as a landmark for Rab/effector-mediated vesicle docking.
  • Rab proteins may selectively activate the SNARE fusion machinery.
  • the mechanism of this activation is unknown but may involve direct interactions of Rabs or, more likely, their effectors with SNAREs.
  • Hrs-2 is a protein that binds to SNAP-25 and contains a Zn2+-f ⁇ nger motif characteristic of Rab-binding proteins such as Rabphilin-3 A, Rim, EEA1, and Noc2, suggesting that Hrs-2 may form a physical link between Rabs and SNAREs (Bean et al., 1997).
  • Rab IB is essential for the intracellular transport of nascent low density lipoprotein (LDL) receptor. It is discussed as a universal mediator of endoplasmatic reticulum to Golgi transport of membrane glycoproteins in mammalian cells. .
  • Clones in this category include: fbr2 2il7, fbr2 3b 16.
  • Rab 10 appear concentrated on membranes in the perinuclear region. Rab 10 has been associated (as potentially diagnostic, therapeutic, causative, and or related, etc..) with the following diseases as reported by OMIN: 1) Choroideremia (OMIN *303199); and 2)RETT Syndrome (OMIN 312750).
  • Clones in this category include: fbr2_62119.
  • Rab 17 shows epithelial cell specificity.
  • Rab 17 is discussed as candidate gene for the mouse mutations In (leaden), Tw (twirler), and ax (ataxia).
  • the new putative Rab-protein is expected to be involved in vesicle trafficking within neuronal cells. These proteins can find application in modulating the transport of vesicles inside neuronal cells, which are essential for development of functional dendritic processes. . . Clones in this category include: fbr2_41ml5.
  • Ankyrin G The ankyrin 3 gene encodes a novel ankyrin, which is expressed in multiple tissues, with very high expression at the axonal initial segment and nodes of Ranvier of neurons in the central and peripheral nervous systems.
  • Ankyrin G shows several tissue- specific alternative mRNA processing.
  • the different ankyrin G proteins participate in maintenance/targeting of ion channels and cell adhesion molecules to nodes of Ranvier and axonal initial segments.
  • Ankyrin G has been associated (as potentially diagnostic, therapeutic, causative, and/or related, etc ..) with Werner disease (OMIN *277700). Clones in this category include: fkd2_24p5.
  • Zn-T-transporters are membrane proteins that facilitates sequestration of zinc in endosomal vesicles.
  • ZnT-3 mRNA seems to be involved in the accumulation of zinc in synaptic vesicles.
  • Zinc (Zn) is an essential element in normal development and metabolism. Recent studies show that in Alzheimer's disease, Zn functions as a double-edged sword, affording protection against Alzheimer's amyloid beta peptide (the major component of senile plaques) at low concentrations and enhancing toxicity at high concentrations by accelerated aggregation of the amyloid beta peptide.
  • Clones in this category include: fbr2_62fl0.
  • This group includes proteins which are involved in the uptake and consumption of nutrients, and enzymes which are part of the biochemical pathways for energy metabolism or which are involved in the supply of building blocks of nucleic acids, proteins (NTPs, dNTPs, amino acids) for DNA/RNA and protein synthesis, and fatty acids (membranes), to allow for the generation of higher order structures.
  • This group constitutes the most important and largest group in prokaryotes and lower eukaryotes. The higher the evolutionary level of an organism is, however, the more other protein classes like 'signal transduction', 'cell cycle' and 'differentiation and development' increase in importance and number of representatives.
  • Proteins involved in the metabolism of energy and compounds are usually the products of house keeping genes, they are often constitutively and/or ubiquitously expressed.
  • ARD1 In yeast, ARD1 and NAT1, are required for the expression of an N- terminal protein acetyltransferase 1. NAT1 controls full repression of the silent mating type locus HML, sporulation and entry into GO. ARD1 is involved in the assembly of the NAT 1- complex. These can find application modulating NAT assembly and action and therefore could be important in metabolism of drugs and environmental mutagens.(OMIN * 108345). Clones in this category include: fbr2_3g8.
  • Apolipoprotein E receptor In LDL-receptors the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. These proteins can find application in modulation of cholesterol binding and transport by LDL- receptors and LDL-binding proteins. In normal individuals, chylomicron remnants and very low density lipoprotein (VLDL) remnants are rapidly removed from the circulation by receptor-mediated endocytosis in the liver.
  • VLDL very low density lipoprotein
  • HLP III type III hyperlipoproteinemia
  • increased plasma cholesterol and triglycerides are the consequence of impaired clearance of chylomicron and VLDL remnants because of a defect in apolipoprotein E. Accumulation of the remnants can result in xanthomatosis and premature coronary and or peripheral vascular disease.
  • OMIN reports that apolipoprotein has associations (as potentially diagnostic, therapeutic, causative, and/or related, etc..) with the following diseases: 1) Familial hypercholesterolemia (OMIN 143890); 2) Familial combined hyperiipidemia (OMIN 144250); and 3) Alzheimer disease. (OMIN #104300).
  • Ubiquitin carboxyl-terminal hydrolases Ubiquitin carboxyl-terminal hydrolases (EC 3.1.2.15) (UCH) (deubiquitinating enzymes) are thiol proteases that recognize and hydrolyze the peptide bond at the C-terminal glycine of ubiquitin. These enzymes are involved in the processing of poly-ubiquitin precursors as well as that of ubiquinated proteins.
  • Ubiquitin-specific proteases have associations (as potentially diagnostic, therapeutic, causative, and or related, etc..) with the following diseases: 1) Lung carcinoma (OMIN *603486); 2) x-linked retinal diseases (OMIN *300050); 3) oncogenesis (OMIN *300050);4) ovarian cancer (OMIN *300050).
  • Clones in this category include: fbr2_78k24; htes3_27dl.
  • Phosphoserine signature phosphoglucomutases. phosphomannomutase: These proteins take part in the conversion of hexose phosphates. OMIN reports that these proteins have associations (as potentially diagnostic, therapeutic, causative, and/or related, etc%) with the following disease: Fanconi-Bickel Syndrome (OMIN #227810). Clones in this category include: fkd2_24bl5.
  • NADH ubiquinone oxidoreductase is the first enzyme in the respiratory electron transport chain of mitochondria. It is a a membrane-bound multi-subunit protein.
  • the bovine heart enzyme contains about 40 different polypeptides. OMIN reports that these proteins have associations (as potentially diagnostic, therapeutic, causative, and/or related, etc ..) with the following disease: Brancio-oto-renal syndrome (OMIN *6601445). Clones in this category include: fkd2_3ol7.
  • Transketolases require thiamin pyrophosphate as cofactor and shows a wide specificity for both reactants, e.g. converts hydroxypyruvate and R-CHO into CO(2) and R-CHOH-CO-CH(2)OH. OMIN reports that these proteins have associations (as potentially diagnostic, therapeutic, causative, and or related, etc..) with the following diseases: Wernicke-Korsakoff Syndrome (OMIN *277730). Clones in this category include: tes3_17117.
  • Fatty acid-Co A svnthetases/li gases These proteins contain AMP-binding domain signature(s), which is present in enzymes which act via an ATP-dependent covalent binding of AMP to their substrate. This domain is found in several CoA synthetases, such as acetate- CoA ligase (EC 6.2.1.1), long-chain-fatty-acid-CoA ligase (EC 6.2.1.3), bile acid-CoA ligase.
  • OMIN reports that these proteins have associations (as potentially diagnostic, therapeutic, causative, and or related, etc..) with the following diseases: 1) Alport syndrome , mental retardation and elliptocytosis (OMIN *300157); 2) Adrenoleukodystrophy (OMIN *300100). Clones in this category include: tes3_35kl7.
  • ADP/ATP or Adenine Nucleotide Translocataors These proteins contain mitochondrial energy transfer signature(s) and are most abundant in mitochondria. In its functional state, it is a homodimer of 30-kD subunits embedded asymmetrically in the inner mitochondrial membrane. The dimer forms a gated pore through which ADP is moved from the matrix into the cytoplasm.. OMIN reports that these proteins have associations (as potentially diagnostic, therapeutic, causative, and/or related, etc%) with the following diseases: 1) cardiomyopathy (OMIN * 103220); 2) myopathy (OMIN * 103220); 3)Progressive external ophthalmoplegia (OMIN *601227). Clones in this category include: tes3_35nl2.
  • Carboxylesterases OMIN reports that these proteins have associations (as potentially diagnostic, therapeutic, causative, and or related, etc%) with the following diseases: l)hepatic carboxylesterase with detoxification of foreign compounds (OMIN *114835); 2) non-Hodgkin lymphoma (OMIN *114835); 3) B-cell chronic lymphocytic leukemia (OMIN * 114835); 4) rheumatoid arthritis (OMIN * 114835). Clones in this category include: tes3_35n9.
  • Heat shock proteins OMIN reports that these proteins have associations (as potentially diagnostic, therapeutic, causative, and or related, etc ..) with the following diseases: 1)27 kd heat shock protein has been correlated with thermotolerance in response to environmental challenges and developmental transitions. (OMIN *6021295). Clones in this category include: utell_23el3.
  • the genetic information is stored in the form of nucleic acids in all organisms. Two kinds of nucleic acids exist, DNA and RNA. Whereas the more stable DNA in most organisms constitutes the storage form of the genetic information, the labile RNA and in particular mRNA is an intermediate used for the temporal expression of specific genes.
  • DNA is usually a double stranded linear molecule consisting of two antiparallel strands and made up of a deoxyribose, a phosphorus backbone and the four bases A, C, G, and T.
  • the DNA of some organisms has a ring structure. The structure of DNA was unraveled years ago by Watson and Crick. DNA is directional molecule determined by the C- atoms of the sugar.
  • replication e.g. DNA polymerases, Telomerase
  • RNA nucleic acid
  • ribozymes e.g. RNase, Ribosomal proteins
  • the DNA of a cell is replicated in the S-phase of the cell cycle.
  • Several enzymes carry out the task of doubling this nucleic acid.
  • the process of replication is tightly regulated.
  • the enzyme DNA polymerase and several other proteins are involved in this process.
  • many prokaryotes do have only one origin of replication (i.e., the starting point of the replication cycle)
  • eukaryotic DNAs (chromosomes) multiple such start points exist.
  • the switch from the synthesis (S) phase to the subsequent G2 or M phases of the cell cycle are dependent on the completion of the replication. This makes clear, that a number of proteins are involved in the replication itself as well as in the control of the process.
  • telomerase Since most eukaryotic chromosomes are linear structures, additional proteins and enzymes are necessary to make sure that the structure is maintained through successive generations. This includes those proteins necessary to build the three dimensional structure of chromosomes (e.g. histones) and the structural network of the nucleus and nucleolus (including the defined localization of transcriptionally active genes in the vicinity of nucleoli) but also such enzymes as telomerase which guarantees the integrity of the chromosomal ends.
  • mRNA messenger RNA
  • mRNA messenger RNA
  • the regulation of transcription is discussed under the separate heading 'transcription factors', but also the classes 'signal transduction', 'development', 'cell cycle' and others are affected as the expression of certain genes determines the fate of a cell or organism.
  • the primary transcript (hnRNA - heterogeneous nuclear RNA) is a single stranded one-to-one copy of the gene as it is located on the chromosome.
  • a 5' cap structure is enzymatically and covalently added to the RNA, blocking the 5' end of the RNA.
  • the enzyme poly A polymerase adds varying numbers of adenine residues to the 3' end of the transcript. This enzyme recognizes the sequence AAUAAA or AUUAAA (+ some minor variations), cuts the RNA 10 - 30 nucleotides downstream and adds the A residues.
  • the size of the poly A sequence affects the stability of the RNA.
  • the introns present on the genomic level and also present in the hnRNA are spliced out by a multi-protein complex consisting of several proteins and RNAs.
  • the finally maturated mRNA is exported to the cytoplasm where it is translated with help of the ribozymes.
  • RNA The half life of RNA is usually much shorter than that of DNA.
  • the mRNA is degraded shortly after synthesis, to guarantee a very defined window of expression of a given gene. This regulation is necessary to specifically maintain or change the set of proteins present at any time in a cell.
  • Specific regions in the 3'UTR determine the stability of the mRNA in the cytoplasm before it is degraded by RNases, enzymes consisting both of protein and RNA.
  • Nucleic acid managemenf'and Several categories of proteins are coded for by clones of the invention within the overall group of "Nucleic acid managemenf'and include, among others, the following:
  • RNA helicases including DEAD/H box helicases: RNA helicases comprise a large family of proteins that are involved in basic biological systems such as nuclear and mitochondrial splicing processes, RNA editing, rRNA processing, translation initiation, nuclear mRNA export, and mRNA degradation. RNA helicases are essential factors in cell development and differentiation, and some of them play a role in transcription and replication of viral single-stranded RNA genomes. The members of the largest subgroup, the DEAD and DEAH box proteins, exhibit a strong dependence of the unwinding activity on ATP hydrolysis.
  • DEAD box proteins have been associated (as potentially diagnostic, therapeutic, causative, and/or related, etc..) as reported by with the following disease processes and/or genes: 1) ataxia-telangiectasia gene: "A human gene (DDX10) encoding a putative DEAD- box RNA helicase at 1 Iq22-q23" Genomics 33:199-206, 1996, Savitsky et al., (OMIN *601235); 2) hematopoetic tumors: "Cloning and expression of a murine cDNA homologous to the human RCK P54, a lymphoma-linked chromosomal breakpoint 1 lq23", Gene 166:293- 6, 1995, Seto et al.
  • Inorganic pyrophosphatase Inorganic pyrophosphatase (EC 3.6.1.1) (PPase) is the enzyme responsible for the hydrolysis of pyrophosphate (PPi) which is formed as the product of the many biosynthetic reactions that utilize ATP. All known PPases require the presence of divalent metal cations, with magnesium conferring the highest activity. Clones in this category include: fbr2_64al5.
  • dinP DNA-damage -inducible protein
  • the dinB/P pathway is a second SOS-pathway in E.coli. Genes related to this seem to be involved in modulating DNA repair and mutagenesis. Clones in this category include: fbr2_72M8 i
  • Cytosolic ribosomal proteins L36 L36 seems to be part of the eukaryotic ribosomal peptidyl transferase center and can find application in modulation of ribosome assembly, maintenance and activity. Clones in this category include: fkd2_3b2.
  • Ribonuclease H proteins are RNA modifiering proteins and have been associated (as potentially diagnostic, therapeutic, causative, and/or related, etc%) with the following diseases as reported by OMIN: 1) Adenomatous Polyposis of the Colon (OMIN * 175100); 2) Retinoblastoma (OMIN * 180200) ; and 3) Von Hippel-Lindau Syndrome (OMIN * 193300). Clones in this category include: phtes3_15j3.
  • the largest known family of cell-surface receptors is that of the G-protein-coupled receptors, which mediate the transmission of diverse stimuli such as neurotransmitters, glycopeptides, hormones, peptides, odorant molecules, and photons.
  • the functional unit of these receptors is composed of the receptor molecule itself (GPCR) which is anchored in the cytoplasma membrane with seven membrane spanning domains, the heterotrimeric G-protein which is composed of ⁇ and ⁇ -subunits (G ⁇ and G ⁇ ), and the effectors that interact with G ⁇ and / or G ⁇ .
  • the dissociated G ⁇ and G ⁇ can regulate the activities of a number of effector molecules such as adenylate cyclases, phopholipase C isoforms, ion channels, and tyrosine kinases, resulting in a variety of cellular functions.
  • effector molecules such as adenylate cyclases, phopholipase C isoforms, ion channels, and tyrosine kinases.
  • the process of signal transduction must be tightly regulated and reversible in order to avoid overstimulation, to achieve signal termination, and render the receptor responsive to subsequent stimuli [Iacovelly L. et al., (1999) FASEB J. 13, 1-8, Hamm, H.E. (1998) J. Biol. Chem. 273, 669- 672].
  • G-proteins are GTPases that, upon binding of GTP change their conformation which in return unmasks structural motives, in particular the so called effector loop, which can mediate the interactions to target proteins, or effectors, for the GTPases.
  • This ability enables the GTPases to cycle between active, GTP-bound and inactive, GDP bound conformations and in the process to function as molecular traffic lights in a multitude of signal transduction pathways.
  • the most important of these signal transduction pathways that are regulated with help of G-proteins are that of the phospholipase C / protein kinase C and that of the adenylate cyclase / protein kinase A.
  • GTPases The cycling of GTPases is tightly regulated by three main classes of proteins: The exchange of hydrolyzed GDP for a fresh GTP is facilitated by guanosine nucleotide exchange factors (GEFs), the hydrolysis of GTP to GDP is sped up by GTPase-activating proteins (GAPs), and the dissociation of GDP from the GTPases is inhibited by GDP dissociation inhibitors (GDIs) [Tapon and Hall (1997) Curr.Opin. Cell. Biol. 9, 86-92, Van Aelst and D- Souza-Schorey (1997) Genes Dev. 11, 2295-2322].
  • GEFs guanosine nucleotide exchange factors
  • GAPs GTPase-activating proteins
  • GDP dissociation inhibitors GDIs
  • SOCS-l-SOCS-7 and CIS The class where the SOCS box was originally described contains several members (SOCS-l-SOCS-7 and CIS). In addition to the SOCS box, these proteins also contain a SH2 (Src-homology 2) domain and a variable N-terminus. These SOCS proteins appear to form part of a classical negative feedback loop that regulates cytokine signal transduction. Upon cytokine stimulation, expression of SOCS proteins is rapidly induced and the proteins inhibit further cytokine action. The mode of action of the SOCS proteins is variable. While SOCS-1 binds and inhibits the JAK (Janus kinases) family of cytoplasmic protein kinases [Narahzaki M. et al (1998) Proc. Natl. Acad. Sci.
  • JAK Janus kinases
  • CIS appears to act by competing with signaling molecules such as the STATs (Transducers and Activators of Transcription) family for binding to phosphorylated receptor cytoplasmic domains [Yoshimura, A. et al. (1995) EMBO J. 14, 2816-2826; Matsumoto, A. et al (1997) BloodS9, 3148-3154].
  • STATs Transducers and Activators of Transcription
  • a second class of SOCS box protein contains additionally WD-40 repeats which were initially identified in the mouse WSB-1 and -2 proteins.
  • the functions of WD-40 proteins are not completely understood but seem to be rather divergent.
  • Cdc4p the WD-40 repeats probably are necessary for binding the substrate for Cdc34p [Mathias, N. et al. (1999) Mol. Cell Biol. 19, 1759-1767].
  • Cdc4p is a component of a ubiquitin ligase that tethers the ubiquitin-conjugating enzyme Cdc34p to its substrates.
  • the posttranslational modification of a protein by ubiquitin usually results in rapid degradation of the ubiquitinated protein by the proteasome.
  • the transfer of ubiquitin to substrate is a multistep process where WD-40 repeats might play an important function.
  • WD-40 containing proteins e.g. the retino blastoma binding protein RbAp48
  • RbAp48 retino blastoma binding protein
  • Zinc metal ions
  • These proteins are involved in the RAS-cAMP pathway that regulates cellular growth [Ach R.A. et al. (1997) Plant Cell 9, 1595-1606].
  • the SPRY domain has been identified in pyrin or marenostrin, a protein which is mutated in patients with Mediterranean fever and which is similar to the butyrophilin family. While butyrophilins seem to be involved in the lactation process in mammals, the function pyrin is unknown. Three proteins (SSB-1 to -3) have been identified to contain both SPRY and SOCS box motifs. The function of these proteins is also not known.
  • Ankyrin repeat containing proteins share a 33-residue repeating motif, an L-shaped structure with protruding ⁇ -hairpin tips which mediate specific macromolecular interactions with cytoskeletal, membrane, and regulatory proteins. These proteins play fundamental roles in diverse biological activities including growth and development, intracellular protein trafficking, the establishment and maintenance of cellular polarity, cell adhesion signal transduction, and mRNA transcription. Three proteins that contain ankyrin repeats (ASB-1 to -3) have been identified to contain a C-terminal SOCS box additionally to the ankyrin repeats. The function of these proteins or the individual domains remains to be discovered [Hilton, D.J. et al (1998) Proc. Natl. Acad. Sci. USA 95, 114-119].
  • GTPases are involved in signal transduction during cellular communication.
  • the function of the SOCS box in this type of proteins is currently unclear [Hilton, D.J. et al (1998) Proc. Natl. Acad. Sci. USA 95, 114-119].
  • the bivalent cation Ca 2+ is, besides cAMP, one of the two major second messengers in eukaryotic cells. Its intracellular concentration is tightly regulated and usually kept very low compared to the cell's environment. Ca 2+ binding proteins and transporters (Gap junction, Voltage-gated, second messenger-gated) help to sequester huge amounts of the ion in various organelles from where Ca 2+ can be released upon extracellular stimuli. E.g. the contraction of the muscle is dependent on the presence of Ca 2+ ions which are readily transported back into the organelles in order for the muscle to relax.
  • Ca 2+ functions as a second messenger that activates Ca 2+ dependent processes through the activation of Ca 2 7calmodulin dependent protein kinases (CaM kinases) which are the major effector molecules of Ca 2+ .
  • CaM kinases Ca 2 7calmodulin dependent protein kinases
  • the CaM dependent kinases activate phospholipases (e.g. phospholipase C) that in return activate other protein kinases such as protein kinase C.
  • phospholipases e.g. phospholipase C
  • the cyclic AMP is produced by the enzyme adenylate cyclase in response to extracellular signals.
  • Certain G-proteins stimulate the activity of adenylate cyclase which converts ATP to cAMP and PPi.
  • Two molecules of cAMP bind to each of two regulatory subunits of cAMP dependent protein kinase which in turn dissociate from the two catalytic subunits of the heterotetramer R 2 C 2 . Upon release of the C-subunits, they become active and phosphorylate substrate proteins at Ser and Thr residues.
  • TGF ⁇ transforming growth factor ⁇
  • Ligand induces formation of heteromeric complexes of these receptors, and signaling is initiated when receptor I is phosphorylated and activated by the constitutively active kinase of receptor II (Wrana et al., 1994 ). The activated type I receptor kinase then propagates the signal to a family of intracellular signaling mediators known as Smads (contraction of the
  • C.elegans Sma and Drosophila Mad genes which were the first identified members of this class of signaling effectors).
  • Smads Three classes of Smads with distinct functions have been defined: the receptor- regulated Smads, which include Smadl, 2, 3, 5, and 8; the common mediator Smad, Smad4; and the antagonistic Smads, which include Smad6 and 7 (Heldin et al., 1997; Attisano and Wrana, 1998 ; Kretzschmar and Massague, 1998 ).
  • R-Smads Receptor-regulated Smads
  • the proteins act as direct substrates of specific type I receptors, and the proteins are phosphorylated on the last two serines at the carboxyl terminus within a highly conserved SSXS motif (Macias-Silva et al., 1996 ; Abdollah et al., 1997 ; Kretzschmar et al., 1997 ; Liu et al., 1997b ; Souchelnytskyi et al., 1997 ). Regulation of R-Smads by the receptor kinase provides an important level of specificity in this system.
  • Smad2 and Smad3 are substrates of TGF ⁇ or activin receptors and mediate signaling by these ligands (Macias-Silva et al., 1996 ; Liu et al., 1997b ; Nakao et al., 1997 ), whereas Smadl, 5, and 8 are targets of BMP receptors and propagate BMP signals (Hoodless et al., 1996 ; Chen et al., 1997b ; Kretzschmar et al., 1997 ; Nishimura et al., 1998 ).
  • R-Smads associate with the common Smad, Smad4 (Lagna et al., 1996 ; Zhang et al., 1997 ), and mediate nuclear translocation of the heteromeric complex.
  • Smad complexes then activate specific genes through cooperative interactions with DNA and other DNA-binding proteins such as FASTI, FAST2, and Fos/Jun (Chen et al., 1996 , Chen et al., 1997a ; Liu et al., 1997a ; Labbe et al., 1998 ; Zhang et al., 1998 ; Zhou et al., 1998 ).
  • the antagonistic Smads, Smad ⁇ and 7 appear to function by blocking ligand-dependent signaling (reviewed in Heldin et al., 1997 ).
  • Smad2/Smad3 interacting protein has been described (Tsukazaki T. et al., 1998 ) that contains a double zinc finger, or FYVE domain, and which has been called SARA (Smad anchor for receptor activation).
  • SARA Smad anchor for receptor activation
  • TGF ⁇ signaling induces dissociation of Smad2 from SARA with concomitant formation of Smad2/Smad4 complexes and nuclear translocation. Moreover, deletion of the FYVE domain in SARA causes mislocalization of Smad2 and inhibits TGF ⁇ -dependent transcriptional responses. Thus, SARA defines a component of TGF ⁇ signaling that functions to recruit Smad2 to the receptor by controlling the subcellular localization of Smad. References: Abdollah et al. (1997) J. Biol. Chem. 272, 27678-27685; Attisano et al. (1998) Curr. Opin. Cell Biol. 10, 188-194; Chen et al. (1996) Nature 383, 691-696; Chen et al.
  • the bivalent cation Ca 2+ is, along with cAMP, one of the two major second messengers in eukaryotic cells. Its intracellular concentration is tightly regulated and usually kept very low compared to the cell's environment. Ca 2+ binding proteins and transporters
  • Ca 2+ functions as a second messenger that activates Ca 2+ dependent processes through the activation of Ca 2 7calmodulin dependent protein kinases (CaM kinases) which are the major effector molecules of Ca 2+ .
  • CaM kinases Ca 2 7calmodulin dependent protein kinases
  • the CaM dependent kinases activate phospholipases (e.g. phospholipase C) that in return activate other protein kinases such as protein kinase C.
  • the compartmentalization of processes is a prerequisite for a tight regulation of processes and activities.
  • the cells contain a highly dynamic set of membrane compartments that are responsible for packaging, sorting, secreting, and recycling proteins and other molecules. Trafficking between organelles within the secretory pathway occurs as vesicles derived from a donor compartment fuse with specific acceptor membranes, resulting in the directional transfer of cargo molecules.
  • This process is tightly controlled by the Rab/Ypt family of proteins (reviewed by Novick and Zerial, 1997 ), a branch of the superfamily of small GTPases.
  • Rab proteins regulate a variety of functions, including vesicle translocation and docking at specific fusion sites. Rabs may also play critical roles in higher order processes such as modulating the levels of neurotransmitter release in neurons, a likely mechanism in synaptic plasticity that underlies learning and memory (Geppert and Sudhof, 1998 ).
  • GTPases share a common three-dimensional fold that, in the GTP bound state, can bind a variety of downstream effector proteins.
  • GTP hydrolysis leads to a conformational change in the "switch" regions that renders the GTPase unrecognizable to its effectors. In this way, by localizing and activating a select set of effectors, a common structural motif is used to control a wide array of distinct cellular processes.
  • Rab proteins undergo a intricate cycle of membrane and protein interactions. Rabs are posttranslationally modified at C-terminal cysteines by the addition of two geranylgeranyl groups, which mediate membrane association when the Rab is in the GTP-bound state. After guanine nucleotide hydrolysis occurs, the Rab is extracted from the membrane upon forming a complex with a cytosolic GDP-dissociation inhibitor (GDI). This cytosolic intermediate is then recycled onto a newly forming vesicle, most likely through a secondary factor termed a GDI dissociation factor (GDF), which displaces GDI.
  • GDI cytosolic GDP-dissociation inhibitor
  • a guanidine nucleotide exchange factor promotes release of GDP and the subsequent loading of GTP.
  • the Rab is then free to associate with its specific set of effectors, which can in turn trigger events leading to the eventual fusion of the vesicle with a target membrane.
  • GTPase activating protein accelerates nucleotide hydrolysis, switching off the GTPase. The remaining GDP-bound Rab can then participate in a new round of fusion.
  • Rab interactions with effectors are likely to regulate vesicle targeting and membrane fusion in three ways.
  • a Rab may specifically facilitate vectorial vesicle transport. Vesicles are transported from their site of origin to acceptor compartments likely through associations with cytoskeletal elements and transport motors.
  • a protein has been identified with a domain structure that suggests a connection between the cytoskeleton and the Rabs. This protein, called Rabkinesin-6, contains a kinesin-like ATPase motor domain followed by a coiled-coil stalk region and a RBD that specifically binds Rab6 (Echard et al., 1998 ).
  • An additional link with the cytoskeleton is provided by the Rab effector, Rabphilin-3A.
  • Rabphilin-3A has been shown in vitro to interact with -actinin, an actin-bundling protein, but only when not bound to Rab3A (Kato et al., 1996 ). These results raise the intriguing possibility that Rab proteins regulate vesicle interactions with the cytoskeleton and thereby play an active role in targeting vesicles to their appropriate destinations.
  • Rab proteins may regulate membrane trafficking at the vesicle docking step.
  • a number of Rab effectors including Rabaptin-5, EEA1, Rabphilin-3 A, and Rim, may serve as molecular tethers.
  • Each effector protein contains a RBD, followed by a linker region (some having the potential to form elongated coiled-coil structures), and a domain capable of interacting with a second Rab or the target membrane.
  • Rabaptin-5 for example, contains two RBDs, one near the N terminus that specifically recognizes Rab4 and a second near the C terminus that binds Rab5 (Vitale et al., 1998 ).
  • Rim which is localized to the target membrane
  • Rabphilin-3 A which is localized to the vesicle
  • N-terminal RBDs and C-terminal Ca2+-binding C2 domains implicating these effectors in synaptic vesicle localization or docking in response to Ca2+ influx (Wang et al., 1997 ).
  • Tethering effectors may also recognize protein complexes on the acceptor membrane.
  • Sec4p a yeast Rab3A homolog, interacts with the exocyst (Guo et al., 1999 ), a complex of seven or more subunits that is assembled at sites of vesicle fusion along the plasma membrane.
  • the exocyst complex may therefore function as a landmark for Rab/effector-mediated vesicle docking.
  • Rab proteins may selectively activate the SNARE fusion machinery.
  • the mechanism of this activation is unknown but may involve direct interactions of Rabs or, more likely, their effectors with SNAREs.
  • Hrs-2 is a protein that binds to SNAP-25 and contains a Zn2+-f ⁇ nger motif characteristic of Rab-binding proteins such as Rabphilin-3 A, Rim, EEA1, and Noc2, suggesting that Hrs-2 may form a physical link between Rabs and SNAREs (Bean et al., 1997).
  • Phosphatases regulate key positions e.g. in the processes of cell proliferation, differentiation and communication/signaling. These processes must be tightly regulated in order to maintain a steady state level of cellular fate. Mis-regulation of kinase activities (or that of phosphatases) is made responsible for a multitude of disease processes such as oncogenesis, inflammatory processes, arteriosclerosis, and psoriasis.
  • Protein kinases constitute the largest protein family that is currently known. Several hundred kinases have been identified already. Classically, kinases are subdivided into two classes based on the amino acid residues in their substrates that are phosphorylated by the particular enzymes. The kinases specifically add phosphoryl groups from adenosine triphosphate (ATP) or, less frequently, guanosine triphosphate (GTP), either to serine and/or threonine or to tyrosine residues of substrate proteins. An estimated 1,000 to 10,000 proteins present in a typical mammalian cell are believed to be regulated also by the action of protein kinases.
  • ATP adenosine triphosphate
  • GTP guanosine triphosphate
  • Protein kinases are frequently integral parts of signaling cascades that transmit extracellular stimuli (e.g. hormones, neurotransmitters, growth- or differentiation factors) into the cell and result in various responses by the cells.
  • the kinases play key roles in these cascades as they constitute a sort of 'molecular switches' turning on or off the activities of other enzymes and proteins, e.g. metabolic, regulatory, channels and pumps, receptors, cytoskeletal, transcription factors.
  • PKA cAMP-dependent protein kinase
  • C catalytic
  • R regulatory subunits
  • cAMP second messenger
  • Both of the catalytic and the regulatory subunits several isoforms exist.
  • the combination of catalytic and regulatory subunits determines the localization of the holoenzyme and also the substrate spectrum that is available for phosphorylation.
  • the consensus pattern necessary to be present in the substrate for PKA action is RRXS/T where X can be any amino acid.
  • the casein kinase II comprises another examples for holoenzymes that consist of catalytic and regulatory subunits.
  • Other kinases that are activated by second messengers are cGMP-dependent protein kinase and Protein kinase C (PKC) which is activated by diacylglycerol, which in turn is produced by phospholipases by cleavage of phosphatidylcholine.
  • PKC Protein kinase C
  • Receptor kinases usually consists of an extracellular domain which can bind effector molecules (e.g. growth factors and hormones) and transfer the stimulus to the intracellular domain of these proteins which usually is a protein tyrosine kinase.
  • tyrosine kinases lack an extracellular domain but are associated with receptors which transfer the signal after effector binding by activating the associated protein kinase enzyme (e.g. Src kinase family; Src, Blk, Fgr, Fyn, Lck Lyn, Yes and Janus kinase family; Jakl-3, Tyk2).
  • Src kinase family Src, Blk, Fgr, Fyn, Lck Lyn, Yes and Janus kinase family
  • Jakl-3, Tyk2 protein kinase enzyme
  • Dysfunction of kinases can be the cause of inflammatory diseases and uncontrolled proliferation.
  • v-Src which is a truncated version of the C-Src protooncogene tyrosine kinase is a classical example for this process as v-Src does not contain the regulatory domain of the cellular gene and is thus constitutively active.
  • Neurocalcin is a Ca(2+)-binding protein with three putative Ca(2+)-binding domains (EF -hands). In cattle, 6 isoforms are differentially expressed in the central nervous system, retina and adrenal gland. Homology with recoverin indicates involvement in Ca2+ dependent activation of guanylate cyclase.. These proteins can find application in modulating/blocking the guanylate cyclase-pathway.
  • OMIN 1 autosomal dominant cone dystrophy
  • OMIN *600364 cone dystrophy 3
  • OMIN *600364 cancer associated retinopathy
  • Clones in this category include: fbr2_23b21.
  • Proteins with a WW Domain Proteins that contain a WW domain which has been originally described as a short conserved region in a number of unrelated proteins, among them dystrophin, the gene responsible for Duchenne muscular dystrophy. The domain, which spans about 35 residues, is repeated up to 4 times in some proteins. It has been shown to bind proteins with particular proline-motifs, [AP]-P-P-[AP]-Y, and thus resembles somewhat SH3 domains. This domain is frequently associated with other domains typical for proteins in signal transduction processes.
  • proteins containing the WW domain are Dystrophin, Utrophin, vertebrate YAP protein (binds the SH3 domain of the Yes oncoprotein), murine NEDD-4 (embryonic development and differentiation of the central nervous system), IQGAP (human GTPase activating protein acting on ras). Therefore these proteins should be involved in intracellular signal transduction.
  • Diseases associated (as potentially diagnostic, therapeutic, causative, and/or related, etc...) with these proteins include as reported by OMIN 1) Muscular Dystrophy, Pseudohypertrophic Progressive Duchenne and Becker Types (OMIN *310200). Clones in this category include: fbr2_23nl6.
  • Protein substrates for cAMP-dependent protein kinase Acting as a choride channel or chloride channel inhibitor these proteins have been associated (as potentially diagnostic, therapeutic, causative, and/or related, etc..) as reported by OMIN with Cystic Fibrosis (OMIN #219700). Clones in this category include fbr2_82il7.
  • Sphingosine kinase is a new type of lipid kinase, which is regulated by growth factors. The enzyme phosphorylates sphingosine, which subsequently exerts intracellular and extracellular actions. Intracellulary, sphingosine 1 -phosphate (SPP) promotes proliferation and inhibits apoptosis. In yeast, survival of cells exposed to heat shock indicates is dependent on SPP. Extracellulary, SPP inhibits cell motility and influences cell morphology, effects that appear to be mediated by the G protein-coupled receptor EDG1. These proteins have been associated (as potentially diagnostic, therapeutic, causative, and/or related, etc%) as reported by OMIN with Gaucher Disease, Type I (OMIN *230800). Clones in this category include fbr2_82m6.
  • Vanilloid Receptors seems to play an important role in the activation and sensitization of nociceptors. It is the receptor for e.g. capsaicin, a selective activator of nociceptors, a natural product of capsicum peppers. Related can find application as a target for the development of new nociception-modulating drugs. Clones in this category include tes3_20k2.
  • RCCl (Regulator of chromosome condensation): RCCl (regulator of chromosome condensation) is a eukaryotic protein which binds to chromatin and interacts with ran, a nuclear GTP-binding protein. RCCl promotes the exchange of bound GDP with GTP, acting as a guanine-nucleotide dissociation stimulator. These proteins can find application in the regulation of gene expression by activition of nuclear GTP-binding proteins.
  • the X-linked retinitis pigmentosa is a result of a defect GTPase regulator, which contains a RCCl -type repeat. OMIN also reports that RCCl has associations (as potentially diagnostic, therapeutic, causative, and or related, etc ..) with retinitis pigmentosa (OMIN *312610). Clones in this category include tes3_21d4.
  • Ras inhibitor proteins Ras is a signal transducting molecule involved in the receptor tyrosine kinase/RAS/Map kinase signalling cascade. Ras proteins bind GDP/GTP and show intrinsic GTPase activity. Mutations in ras, which change aa 12, 13 or 61 activate the potential of ras to transform cultured cells and are implicated in a variety of human tumours.
  • Ras inhibitor proteins have been associated (as potentially diagnostic, therapeutic, causative, and/or related, etc%) with many disease processes as reported by OMIN including: 1) Tumors of the lung, breast, brain, pituitary, pancrase, bone, skin, bladder, kidney, ovary, prostate and lymphocyte, Melanoma (OMIN *600160); 2) X-linked non-specific mental retardation (OMIN * 300104); 3)adenomatouspolyposis of the colon (OMIN * 175100); 4) Beckwith-Wieddemann Syndrome (#130650); and 5) Major affective disorder 1 (OMIN * 125480). Clones in this category include utel_22g21.
  • Mammalian proteins cornicon involving the EGF-receptor Cornicon proteins are part of a signal transduction pathway involving the EGF-receptor.
  • the EGF-receptor has been reported by OMIN to be associated (as potentially diagnostic, therapeutic, causative, and/or related, etc%) with the following diseases: 1) Familial hypercholesterolemia (OMIN 143890); 2) Leprechaunism (OMIN #246200); 3) Hemophilia B (OMIN *306900); 4) Ectodermal dysplasia 1; 5) Kartagenerer syndrome (OMIN *244400) and 6) Glioma of the brain (OMIN * 137800). ). Clones in this category include utel_22el2.
  • Membrane region prediction was effected using the ALOM2 software (Klein et al., 1985; version 2 by K. Nakai). Similar to many other methods, the Kyte & Doolitle (1982) amino acid hydrophobicity scale is used in ALOM2 as the primary variable for classifying sequences in terms of their localization. High prediction accuracy is achieved through the system of intelligent decision rules and the utilization of a carefully selected training data set. The method also generates reliability estimates which makes it possible to distinguish between membrane-spanning proteins (I, intrinsic) and globular proteins with regions of high hydrophobicity buried in the core.
  • H represents the hydrophobicity of an individual residue.
  • P(I/max ⁇ ) and P(E/maxH) be the conditional probabilities that a protein is integral or peripheral, respectively, given its value of maximal hydrophobicity maxH, and let P(I) and P(E) be the prior probabilities of intrinsic and extrinsic membrane proteins estimated from the training set. Then a sequence is assigned to E if
  • conditional probabilities P(maxH E) and P(maxH I) can be determined based on the estimates of probability distributions of maxH in both groups.
  • the odds parameter can be made more or less stringent. For example, one can require odds at least 1 : 10 for a protein to be classified as integral. This leads to higher selectivity but less sensitivity.
  • GTFs general transcription factors
  • TBP TATA-binding Protein
  • TFIIE TFIIE
  • TFIIF TFI IH
  • RNAPII complexes containing the entire set of GTFs or a subset of GTFs together with other proteins have been isolated from mammalian and yeast cells. Although purified RNAPII and GTFs are sufficient for promoter-specific initiation, this system fails to respond to activators. This is mediated by a further complex termed mediator complex which associates with the carboxy-terminal heptapeptide domain (CTD) of the largest subunit of RNAPII.
  • CTD carboxy-terminal heptapeptide domain
  • RNAPII complexes Purification of human RNAPII complexes resulted in two distinct forms of human RNAPII after analysis of functional properties.
  • One complex contained chromatin remodeling activities but was devoid of GTFs.
  • the other complex did not contain factors that modify chromatin but contained a subset of SRB/mediator subunits and GTFs and other polypeptides that mediate transcriptional activation, a scenario similar to that reported for yeast.
  • a complex designated NAT ( ⁇ 2O SU) for negative regulator of transcription contains RNAPII, Cdk8, homologs of the yeast mediator complex as well as Rgrl and SrblO/11 known as negative regulators of transcription.
  • SMCC ⁇ 15 SU
  • SRB/mediator coactivator complex A complex with striking similar structural and functional properties to NAT has been identified designated SMCC ( ⁇ 15 SU) (SRB/mediator coactivator complex), that can also mediate transcriptional activation.
  • the SMCC complex includes all reported NAT subunits including subunits of the TRAP complex.
  • TRAP is a coactivator complex isolated on the basis of its interaction with the thyroid hormone receptor.
  • Another coactivator complex DRIP isolated on the basis of its ability to interact with the vitamin D3 receptor, contains novel subunits as well as subunits of NAT/SMCC and TRAP complexes.
  • RNAIIP holoenzyme Beside the huge amount of transcription factors which can be part of the RNAIIP holoenzyme or the coactivator complexes there is an even larger quantity of specific transcription factors binding to promoter elements within the DNA sequences of a given gene leading to activation or repression of transcription.
  • a broad range of cellular responses like differentiation, proliferation, cell death and others are elicited through activating or repressing the transcription of target genes.
  • Leucine zipper factors where the basic domain is followed by a leucine zipper of repeated leucine residues at every seventh position. The zipper mediates protein dimerization as a prerequisite for DNA-binding.
  • Helix-loop-helix factors contain a DNA-binding basic region followed by a motif of two potential amphipathic alpha-helices connected by a loop of variable length also mediating dimerization.
  • NF-1 NF-1
  • RF-X RF-X
  • bHSH like proteins Further members of this superclass are NF-1, RF-X, and bHSH like proteins.
  • Superclass comprises factors containing zinc-coordinating DNA-binding domains.
  • Proteins with Cys4 zinc finger of nuclear receptor type where two such motifs differing in size, composition and function are present in each receptor molecule.
  • Each finger comprises 4 cysteine residues coordinating one zinc ion.
  • the second half including the second cysteine pair has alpha-helix conformation and the helix of the first finger binds to the DNA through the major groove.
  • the sequence between the first two cysteines of the second finger mediates dimerization upon DNA-binding.
  • This class includes the steroid hormone receptors and the thyroid hormone receptor-like factors.
  • Other diverse cys4 zinc fingers have a motif of GATA-type.
  • the zinc ion is essential for DNA-binding.
  • Zinc fingers of alternating composition Zinc fingers of alternating composition.
  • Helix 3 contacts mainly the major groove of the DNA, some contacts at the minor groove are observed as well. Helix 2 and 3 resemble the helix-turn-helix structure of prokaryotic regulators.
  • the tryptophan clusters comprise several tryptophan residues with a spacing of 12-21 amino acid residues; the subclass of myb-type DNA-binding domains typically exhibit a spacing of 19-21 amino acid residues.
  • the TEA domain has been identified as a region which is conserved among the transcription factors TEF-1, TEC1 and abaA. This domain in TEF-1 has been shown to interact with DNA, although two additional regions may also contribute to DNA-binding. It is predicted to fold into three alpha-helices, with a randomly coiled region of 16-18 amino acid residues between helices 1 and 2, and a short stretch between helices 2 and 3 of 3-8 residues.
  • the structure of the Rel-type DBD exhibits a bipartite subdomain structure, each subdomain comprising a beta-barrel with five loops that form an extensive contact surface to the major groove of the DNA.
  • the first loop of the N-terminal subdomain (the highly conserved recognition loop) performs contacts with the recognition element on the DNA, but other loops are involved.
  • the fact that the main DNA-contacts are made through loops has been suggested to provide a high degree of flexibility in binding to a range of different target sequences. Augmenting interactions are achieved by two alpha-helices within the N-terminal Part that form strong minor groove contacts to the A/T-rich center of the B- element. In p65, the sequence between both alpha-helices is much shorter and even helix 2 is truncated.
  • the second, C-terminal domain is necessary mainly for protein dimerization.
  • p53 proteins MADS MCMl-agamous-deficiens-SRF box proteins. Proteins of this class comprise a region of homology.
  • the DNA-binding domain also comprises the dimerization capability.
  • two antiparallel amphipathic alpha-helices shown for SRF
  • alpha- I two antiparallel amphipathic alpha-helices
  • the bound DNA is bent and wrapped around the protein. It exhibits a compressed minor groove in the center and widened minor groove in the flanks.
  • Beta-Barrel alpha-helix transcription factors are Beta-Barrel alpha-helix transcription factors.
  • Proteins of this class comprise a region of homology with the chromosomal non- histone HMG proteins such as HMG1.
  • This region comprises the DNA-binding domain which in some instances such as HMG1 mediates sequence-unspecific, in other cases such LEF-1 sequence-specific binding to DNA.
  • This domain exhibits a typical L-shaped conformation made up of 3 alpha-helices and an extended N-terminal extension of the first helix. The latter together with helix 1, which contains a kink, form the long arm of the L, whereas helices 1 and 2 form the short arm. Binding to the minor groove induces a sharp bending of the DNA by more than 90 degree, away from the bound protein.
  • the overall topology of the DNA-protein complexes resembles somewhat that of the TBP-TATA box complex.
  • Cold-shock domain factors are characterized by a highly conserved region first found in prokaryotic cold-shock proteins. This domain is a single- stranded nucleic acid-binding structure interacting with DNA or RNA. It consists of an antiparallel five-stranded beta-barrel, the strands of which are connected by turns and loops. Within this structure, a three-stranded beta-strand contains a conserved RNA-binding motif, RNP1. Not all CSD proteins are transcription factors. Those which specifically bind to a certain sequence are termed Y-box proteins. Proteins of this class were previously called protamine-like domain proteins because of having a highly positively charged domain with interspersed proline residues.
  • the members of this transcription factor class have been identified on the basis of their homology to a defined region within the Drosophilia protein Runt.
  • the runt domain is part of the DNA-binding domain of these factors. It consists mainly of beta-strands, does not contain alpha-helical regions and seems to be most similar to the palm domain found in DNA polymerase beta (rat).
  • Superclass contains other transcription factors like Copper fist proteins. HMGKY). STAT. Pocket domain proteins and Ap2/EREBP-related factors.
  • Dcoh is a bifunctional protein, complexed with biopterin. It serves as dimerization cofactor of hepatocyte nuclear factor- 1 and catalyzes the dehydration of the biopterin cofactor of phenylalanine hydroxylase.
  • the Dcoh protein has been reported by OMIN to be associated (as potentially diagnostic, therapeutic, causative, and/or related, etc%) with the following diseases: 1) hyperphenylalanemia (OMIN 126090, #264070). Clones in this category include fkd2_46kl2.
  • Beta-transducin subunits of G-proteins contain WD-40 repeats. The beta subunits seem to be required for the replacement of GDP by GTP as well as for membrane anchoring and receptor recognition. Due to the zinc finger the novel protein seems to be a new molecule involved in signal transduction and transcription. These proteins have been reported by OMIN to be associated (as potentially diagnostic, therapeutic, causative, and or related, etc%) with the following diseases: 1) essential hypertension (OMIN *139130). Clones in this category include utel_H2. * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * *
  • the invention therefore, specifically contemplates the following assemblages of materials, which track the above- identified fourteen functional groupings, that are useful in practicing the profiling aspects of the invention.
  • One type of assemblage is nucleic acid- based and can include the following groupings of sequences and their derivatives: all sequences; human fetal brain sequences; brain derived sequences; human fetal kidney library sequences; kidney derived sequences; human mammary carcinoma library sequences; mammary carcinoma derived sequences; human testis library sequences; testes derived sequences; cell cycle genes; cell structure and motility genes; differentiation and development genes; intracellular transport and trafficking genes; metabolism genes; nucleic acid management genes; signal transduction genes; transmembrane protein genes; and transcription factor genes.
  • Other assemblages contain proteins or their corresponding antibodies or antibody fragments, divided along the same groupings.
  • inventive molecules are useful as members of a database.
  • a database may be used, for example, in drug discovery and rationale drug design or in testing the novelty and non-obviousness of newly sequenced materials.
  • they are particularly suited in designing variants for the profiling (and other) applications described herein.
  • the following discussion of electronic embodiments applies equally to such variants, which, naturally, will be generated and stored using a computer using known methodologies.
  • one aspect of the invention contemplates a database of at least one of the inventive sequences stored on computer readable media.
  • the individual sequences may be grouped with regard to the individual functional and structural groups mentioned above.
  • the individual sequences of a database may exist in printed form, they are preferably in electronic form, as in an ascii or a text file. They may also exist as word processing files or they may be stored in database applications like DB2, Sybase, Oracle, GCG and GenBank.
  • database applications like DB2, Sybase, Oracle, GCG and GenBank.
  • Computer readable media refers to any medium which can be read and accessed by a computer. These include: magnetic storage media, like floppy discs, hard drives and magnetic tape; optical storage media, like CD-ROM; electrical storage media, like RAM and ROM; and hybrids of these categories, like magnetic/optical storage media.
  • magnetic storage media like floppy discs, hard drives and magnetic tape
  • optical storage media like CD-ROM
  • electrical storage media like RAM and ROM
  • hybrids of these categories like magnetic/optical storage media.
  • a protein of the present invention may exhibit cytokine, cell proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may induce production of other cytokines in certain cell populations.
  • cytokine cytokine
  • cell proliferation either inducing or inhibiting
  • cell differentiation either inducing or inhibiting
  • the activity of a protein of the present invention is evidenced by any one of a number of routine factor dependent cell proliferation assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3, MC9/G, M + (preB M + ), 2E8, RB5, DAI, 123, T1165, HT2, CTLL2, TF-1, Mo7e and CMK.
  • the activity of a protein of the invention may, among other means, be measured by the following methods:
  • Assays for T-cell or thymocyte proliferation include without limitation those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol.
  • Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp.
  • Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; deVries et al., J. Exp. Med. 173: 1205-1211, 1991; Moreau et al., Nature 336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A.
  • Assays for T-cell clone responses to antigens include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al., Proc. Natl. Acad. Sci.
  • a protein of the present invention may also exhibit immune stimulating or immune suppressing activity, including without limitation the activities for which assays are described herein.
  • a protein may be useful in the treatment of various immune deficiencies and disorders (including severe combined immunodeficiency (SOD)), e.g., in regulating (up or down) growth and proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells and other cell populations.
  • SOD severe combined immunodeficiency
  • These immune deficiencies may be genetic or be caused by vital (e.g., HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders.
  • infectious diseases causes by viral, bacterial, fungal or other infection may be treatable using a protein of the present invention, including infections by HIV, hepatitis viruses, herpesviruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such as candidiasis.
  • a protein of the present invention may also be useful where a boost to the immune system generally may be desirable, i.e., in the treatment of cancer.
  • Autoimmune disorders which may be treated using a protein of the present invention include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease.
  • a protein of the present invention may also to be useful in the treatment of allergic reactions and conditions, such as asthma (particularly allergic asthma) or other respiratory problems.
  • Other conditions, in which immune suppression is desired may also be treatable using a protein of the present invention.
  • T cells may be inhibited by suppressing T cell responses or by inducing specific tolerance in T cells, or both.
  • Immunosuppression of T cell responses is generally an active, non-antigen-specific, process which requires continuous exposure of the T cells to the suppressive agent.
  • Tolerance which involves inducing non-responsiveness or anergy in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence of the tolerizing agent.
  • Down regulating or preventing one or more antigen functions (including without limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and organ transplantation and in graft-versus-host disease (GVHD).
  • B lymphocyte antigen functions such as, for example, B7
  • GVHD graft-versus-host disease
  • blockage of T cell function should result in reduced tissue destruction in tissue transplantation.
  • rejection of the transplant is initiated through its recognition as foreign by T cells, followed by an immune reaction that destroys the transplant.
  • a molecule which inhibits or blocks interaction of a B7 lymphocyte antigen with its natural ligand(s) on immune cells such as a soluble, monomeric form of a peptide having B7-2 activity alone or in conjunction with a monomeric form of a peptide having an activity of another B lymphocyte antigen (e.g., B7- 1, B7-3) or blocking antibody), prior to transplantation can lead to the binding of the molecule to the natural ligand(s) on the immune cells without transmitting the corresponding costimulatory signal.
  • Blocking B lymphocyte antigen function in this matter prevents cytokine synthesis by immune cells, such as T cells, and thus acts as an immunosuppressant.
  • the lack of costimulation may also be sufficient to anergize the T cells, thereby inducing tolerance in a subject.
  • Induction of long-term tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration of these blocking reagents.
  • the efficacy of particular blocking reagents in preventing organ transplant rejection or GVHD can be assessed using animal models that are predictive of efficacy in humans.
  • appropriate systems which can be used include allogeneic cardiac grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci USA, 89:11102-11105 (1992).
  • murine models of GVHD see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp.
  • Blocking antigen function may also be therapeutically useful for treating autoimmune diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that are reactive against self tissue and which promote the production of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may reduce or eliminate disease symptoms.
  • Administration of reagents which block costimulation of T cells by disrupting receptor: ligand interactions of B lymphocyte antigens can be used to inhibit T cell activation and prevent production of autoantibodies or T cell-derived cytokines which may be involved in the disease process.
  • blocking reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to long-term relief from the disease.
  • the efficacy of blocking reagents in preventing or alleviating autoimmune disorders can be determined using a number of well-characterized animal models of human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 840-856).
  • Upregulation of an antigen function (preferably a B lymphocyte antigen function), as a means of up regulating immune responses, may also be useful in therapy. Upregulation of immune responses may be in the form of enhancing an existing immune response or eliciting an initial immune response. For example, enhancing an immune response through stimulating B lymphocyte antigen function may be useful in cases of viral infection. In addition, systemic viral diseases such as influenza, the common cold, and encephalitis might be alleviated by the administration of stimulatory forms of B lymphocyte antigens systemically.
  • anti-vital immune responses may be enhanced in an infected patient by removing T cells from the patient, costimulating the T cells in vitro with viral antigen- pulsed APCs either expressing a peptide of the present invention or together with a stimulatory form of a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the patient.
  • Another method of enhancing anti-viral immune responses would be to isolate infected cells from a patient, transfect them with a nucleic acid encoding a protein of the present invention as described herein such that the cells express all or a portion of the protein on their surface, and reintroduce the transfected cells into the patient.
  • the infected cells would now be capable of delivering a costimulatory signal to, and thereby activate, T cells in vivo.
  • up regulation or enhancement of antigen function may be useful in the induction of tumor immunity.
  • Tumor cells e.g., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma, carcinoma
  • a nucleic acid encoding at least one peptide of the present invention can be administered to a subject to overcome tumor-specific tolerance in the subject. If desired, the tumor cell can be transfected to express a combination of peptides.
  • tumor cells obtained from a patient can be transfected ex vivo with an expression vector directing the expression of a peptide having B7-2-like activity alone, or in conjunction with a peptide having B7-l-like activity and/or B7-3-like activity.
  • the transfected tumor cells are returned to the patient to result in expression of the peptides on the surface of the transfected cell.
  • gene therapy techniques can be used to target a tumor cell for transfection in vivo.
  • tumor cells which lack MHC class I or MHC class II molecules, or which fail to reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected with nucleic acid encoding all or a portion of (e.g.
  • a cytoplasmic-domain truncated portion of an MHC class I alpha chain protein and beta 2 microglobulin protein or an MHC class II alpha chain protein and an MHC class II beta chain protein to thereby express MHC class I or MHC class II proteins on the cell surface.
  • Expression of the appropriate class I or class II MHC in conjunction with a peptide having the activity of a B lymphocyte antigen induces a T cell mediated immune response against the transfected tumor cell.
  • a gene encoding an antisense construct which blocks expression of an MHC class II associated protein, such as the invariant chain can also be cotransfected with a DNA encoding a peptide having the activity of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce tumor specific immunity.
  • a T cell mediated immune response in a human subject may be sufficient to overcome tumor-specific tolerance in the subject.
  • the activity of a protein of the invention may, among other means, be measured by the following methods:
  • Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann et al., J. Immunol. 128: 1968-1974, 1982; Handa et al., J. Immunol.
  • T-cell-dependent immunoglobulin responses and isotype switching (which will identify, among others, proteins that modulate T-cell dependent antibody responses and that affect Thl/Th2 profiles) include, without limitation, those described in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994.
  • MLR Mixed lymphocyte reaction
  • Dendritic cell-dependent assays (which will identify, among others, proteins expressed by dendritic cells that activate naive T-cells) include, without limitation, those described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of Experimental Medicine 173:549-559, 1991; Macatonia et al.
  • lymphocyte survival/apoptosis (which will identify, among others, proteins that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research 53: 1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal of Immunology 145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International Journal of Oncology 1:639-648, 1992.
  • Assays for proteins that influence early steps of T-cell commitment and development include, without limitation, those described in: Antica et al., Blood 84:111-117, 1994; Fine et al., Cellular Immunology 155:111-122, 1994; Galy et al., Blood 85:2770-2778, 1995; Toki et al., Proc. Nat. Acad Sci. USA 88:7548-7551, 1991.
  • a protein of the present invention may be useful in regulation of hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even marginal biological activity in support of colony forming cells or of factor-dependent cell lines indicates involvement in regulating hematopoiesis, e.g.
  • erythroid progenitor cells in supporting the growth and proliferation of erythroid progenitor cells alone or in combination with other cytokines, thereby indicating utility, for example, in treating various anemias or for use in conjunction with irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and monocytes/macrophages (i.e., traditional CSF activity) useful, for example, in conjunction with chemotherapy to prevent or treat consequent myelo- suppression; in supporting the growth and proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or treatment of various platelet disorders such as thrombocytopenia, and generally for use in place of or complimentary to platelet transfusions; and/or in supporting the growth and proliferation of hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell
  • the activity of a protein of the invention may, among other means, be measured by the following methods:
  • Assays for embryonic stem cell differentiation include, without limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81:2903- 2915, 1993.
  • Assays for stem cell survival and differentiation include, without limitation, those described in: Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. Freshney, et al.
  • a protein of the present invention also may have utility in compositions used for bone, cartilage, tendon, ligament and/or nerve tissue growth or regeneration, as well as for wound healing and tissue repair and replacement, and in the treatment of burns, incisions and ulcers.
  • a protein of the present invention which induces cartilage and/or bone growth in circumstances where bone is not normally formed, has application in the healing of bone fractures and cartilage damage or defects in humans and other animals.
  • Such a preparation employing a protein of the invention may have prophylactic use in closed as well as open fracture reduction and also in the improved fixation of artificial joints. De novo bone formation induced by an osteogenic agent contributes to the repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and also is useful in cosmetic plastic surgery.
  • a protein of this invention may also be used in the treatment of periodontal disease, and in other tooth repair processes. Such agents may provide an environment to attract bone-forming cells, stimulate growth of bone-forming cells or induce differentiation of progenitors of bone-forming cells.
  • a protein of the invention may also be useful in the treatment of osteoporosis or osteoarthritis, such as through stimulation of bone and/or cartilage repair or by blocking inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) mediated by inflammatory processes.
  • tissue regeneration activity that may be attributable to the protein of the present invention is tendon/ligament formation.
  • a protein of the present invention which induces tendon/ligament-like tissue or other tissue formation in circumstances where such tissue is not normally formed, has application in the healing of tendon or ligament tears, deformities and other tendon or ligament defects in humans and other animals.
  • Such a preparation employing a tendon ligament-like tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and in repairing defects to tendon or ligament tissue.
  • compositions of the present invention contributes to the repair of congenital, trauma induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments.
  • the compositions of the present invention may provide environment to attract tendon- or ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or ligament-forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair.
  • the compositions of the invention may also be useful in the treatment of tendonitis, carpal tunnel syndrome and other tendon or ligament defects.
  • the compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is well known in the art.
  • the protein of the present invention may also be useful for proliferation of neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a protein may be used in the treatment of diseases of the peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager syndrome. Further conditions which may be treated in accordance with the present invention include mechanical and traumatic disorders, such as spinal cord disorders, head trauma and cerebro vascular diseases such as stroke. Peripheral neuropathies resulting from chemotherapy or other medical therapies may also be treatable using a protein of the invention.
  • Proteins of the invention may also be useful to promote better or faster closure of non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular insufficiency, surgical and traumatic wounds, and the like.
  • a protein of the present invention may also exhibit activity for generation or regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular endothelium) tissue, or for promoting the growth of cells comprising such tissues.
  • organs including, for example, pancreas, liver, intestine, kidney, skin, endothelium
  • muscle smooth, skeletal or cardiac
  • vascular including vascular endothelium
  • a protein of the present invention may also be useful for gut protection or regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and conditions resulting from systemic cytokine damage.
  • a protein of the present invention may also be useful for promoting or inhibiting differentiation of tissues described above from precursor tissues or cells; or for inhibiting the growth of tissues described above.
  • the activity of a protein of the invention may, among other means, be measured by the following methods:
  • Assays for tissue generation activity include, without limitation, those described in: International Patent Publication No. WO95/ 16035 (bone, cartilage, tendon); International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. WO91/07491 (skin, endothelium).
  • Assays for wound healing activity include, without limitation, those described in: Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), Year Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 71:382-84 (1978).
  • a protein of the present invention may also exhibit activin- or inhibin-related activities. Inhibins are characterized by their ability to inhibit the release of follicle stimulating hormone (FSH), while activins and are characterized by their ability to stimulate the release of follicle stimulating hormone (FSH).
  • FSH follicle stimulating hormone
  • a protein of the present invention alone or in heterodimers with a member of the inhibin alpha family, may be useful as a contraceptive based on the ability of inhibins to decrease fertility in female mammals and decrease spermatogenesis in male mammals. Administration of sufficient amounts of other inhibins can induce infertility in these mammals.
  • the protein of the invention may be useful as a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885.
  • a protein of the invention may also be useful for advancement of the onset of fertility in sexually immature mammals, so as to increase the lifetime reproductive performance of domestic animals such as cows, sheep and pigs.
  • the activity of a protein of the invention may, among other means, be measured by the following methods:
  • Assays for activin/ inhibin activity include, without limitation, those described in: Vale et al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci. USA 83:3091-3095, 1986.
  • a protein of the present invention may have chemotactic or chemokinetic activity (e.g., act as a chemokine) for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells.
  • Chemotactic and chemokinetic proteins can be used to mobilize or attract a desired cell population to a desired site of action.
  • Chemotactic or chemokinetic proteins provide particular advantages in treatment of wounds and other trauma to tissues, as well as in treatment of localized infections. For example, attraction of lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved immune responses against the tumor or infecting agent.
  • a protein or peptide has chemotactic activity for a particular cell population if it can stimulate, directly or indirectly, the directed orientation or movement of such cell population.
  • the protein or peptide has the ability to directly stimulate directed movement of cells. Whether a particular protein has chemotactic activity for a population of cells can be readily determined by employing such protein or peptide in any known assay for cell chemotaxis.
  • the activity of a protein of the invention may, among other means, be measured by the following methods:
  • Assays for chemotactic activity consist of assays that measure the ability of a protein to induce the migration of cells across a membrane as well as the ability of a protein to induce the adhesion of one cell population to another cell population.
  • Suitable assays for movement and adhesion include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest.
  • a protein of the invention may also exhibit hemostatic or thrombolytic activity. As a result, such a protein is expected to be useful in treatment of various coagulation disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events in treating wounds resulting from trauma, surgery or other causes.
  • a protein of the invention may also be useful for dissolving or inhibiting formation of thromboses and for treatment and prevention of conditions resulting therefrom (such as, for example, infarction of cardiac and central nervous system vessels (e.g., stroke).
  • the activity of a protein of the invention may, among other means, be measured by the following methods:
  • Assay for hemostatic and thrombolytic activity include, without limitation, those described in: Linet et al., J. Clin. Pharmacol. 26: 131-140, 1986; Burdick et al., Thrombosis Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 35:467-474, 1988.
  • a protein of the present invention may also demonstrate activity as receptors, receptor ligands or inhibitors or agonists of receptor/ligand interactions.
  • receptors and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions and their ligands (including without limitation, cellular adhesion molecules (such as selectins, integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen recognition and development of cellular and humoral immune responses).
  • Receptors and ligands are also useful for screening of potential peptide or small molecule inhibitors of the relevant receptor/ligand interaction.
  • a protein of the present invention may themselves be useful as inhibitors of receptor/ligand interactions.
  • the activity of a protein of the invention may, among other means, be measured by the following methods:
  • Suitable assays for receptor-ligand activity include without limitation those described imCurrent Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static conditions 7.28.1-7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et al., Cell 80:661- 670, 1995.
  • Proteins of the present invention may also exhibit anti-inflammatory activity.
  • the anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production of other factors which more directly inhibit or promote an inflammatory response.
  • Proteins exhibiting such activities can be used to treat inflammatory conditions including chronic or acute conditions), including without limitation intimation associated with infection (such as septic shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin lethality, arthritis, complement- mediated hyperacute rejection, nephritis, cytokine or chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from over production of cytokines such as TNF or IL-1. Proteins of the invention may also be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material.
  • infection such as septic shock, sepsis or systemic inflammatory response syndrome (SIRS)
  • ischemia-reperfusion injury such as septic shock, sepsis or systemic inflammatory response syndrome (SIRS)
  • ischemia-reperfusion injury such as septic shock, sepsis or systemic inflammatory response syndrome (SIR
  • a protein of the invention may exhibit other anti-tumor activities.
  • a protein may inhibit tumor growth directly or indirectly (such as, for example, via ADCC).
  • a protein may exhibit its tumor inhibitory activity by acting on tumor tissue or tumor precursor tissue, by inhibiting formation of tissues necessary to support tumor growth (such as, for example, by inhibiting angiogenesis), by causing production of other factors, agents or cell types which inhibit tumor growth, or by suppressing, eliminating or inhibiting factors, agents or cell types which promote tumor growth.
  • a protein of the invention may also exhibit one or more of the following additional activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape (such as, for example, breast augmentation or diminution, change in bone form or shape); effecting biorhythms or caricadic cycles or rhythms; effecting the fertility of male or female subjects; effecting the metabolism, catabolism, anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional factors or component(s); effecting behavioral characteristics, including, without limitation, appetite, libido, stress, cognition (including cognitive disorders), depression (including depressive disorders) and violent behaviors; providing analgesic effects or other pain reducing effects; promoting differentiation and
  • htes3_l 5c24 The new protein can find application in modulation of 2-hydroxyacid dehydrogenases-dependent pathways and as a new enzyme for biotechnologic production processes.
  • htes3_15i5 The new protein can find application in modulating the structure of the human spermatozoa radia spoke head and modulation of sperm motility in men.
  • htes3_15kl 1 The novel protein contains a protein kinase ATP-binding region signature and a serine/threonine protein kinase active-site signature. The new protein can find application in modulation of intracellular signal pathways dependent on this kinase.
  • htes3_17nl2 The new protein can find application in modulating/blocking the expression of SOX-controlled genes.
  • htes3_20k2 The new protein can find application as a target for the development of new nociception-modulating drugs.
  • htes3_20ml8 The new protein can find application in modulation of mitochondrial DNA replication and maintenance.
  • htes3_20d4 The new protein can find application in the regulation of gene expression by activition of nuclear GTP-binding proteins.
  • the X-linked retinitis pigmentosa is a result of a defect GTPase regulator, which contains a RCCl -type repeat.
  • htes3_21jl5 NY-CO-33 is a protein recognised by autologous antibodies of human colon cancer patients.
  • the novel protein contains 4 C2H2 Zinc fingers and is a new putativ transcription factor. The new protein can find application in modulating/blocking the expression of genes controlled by this transcription factor.
  • the new protein can find application in modulating chromosome transport in mitosis and meiosis and modulation of cell division.
  • htes3_26g22 The new protein can find application in modulating chromosome transport in mitosis and meiosis and modulation of cell division.
  • the novel TBP- binding protein is considered to participate in transcription regulation through the interaction with TBP.
  • the new protein can find application in modulation of gene transcription.
  • htes3_21116 The new protein can find application in modulation of protein translocation into the endoplasmic reticulum.
  • htes3_27dl The novel protein can find application in modulation of ubiquitin- and protein metabolism in cells.
  • htes3_2ml8 The novel protein can find application as multifunctional nuclease / exoribonuclease.
  • htes3_35b4 The new protein can find application in modulation of the mitotic spindle.
  • htes3_35b5 The novel protein can find application in modulating the v-ATPase activity in endocytic and secretory organelles.
  • htes3_35e21 Due to the close relationship to human interleukin-7, the novel interleukin is expected to act as a new growth factor for human B lineage cells.
  • the protein should induce the gene rearrangement of the T-cell receptor repertoire, leading to thymocyte commitment, and subsequently induce both cytotoxic T-cell- and lymphocyte-activated killer cells.
  • This new interleukin could find clinical application in a variety of conditions of hematolymphopoietic failure and different tumours, because of its recruitment of B cell lineage cells, cytotoxic T-cell- and lymphocyte-activated killer cells.
  • htes3_35kl6 Therefore it is a new fatty acid-Co A synthetasese/ligase with unknown substrate.
  • the new protein can find application in modulation of fatty acid metabolism and as a new enzyme for biotechnologic production processes.
  • htes3_35nl2 The new protein can find application in modulation of ADP-transport and energy metabolism in cells/mitochondria.
  • htes3_35n9 The new protein can find application in modulation of carboxylester metabolism and as a new enzyme for biotechnologic production processes.
  • htes3_35p22 The novel protein is closely raleted to human tre-2 and other enzymes involved in the degradation of ubiquitinated proteins.
  • the human tre-2 oncogene encodes a deubiquitinating enzyme, indicating a role for the ubiquitin system in mammalian growth control.
  • the novel protein can find application in cancer diagnostics and treatment, and in regulating protein stability and growth control via regulation of ubiquitination.
  • htes3_4h6 The novel kinesin protein can find application in modulating the function of kinesin and modulating intracellular transport via/on microtubules.
  • htes3_72kl5 FGDl -related F-actin-binding protein (Farbin/FGDl) is a novel F-actin- binding protein. The gene locus fgdl seems to be responsible for faciogenital dysplasia or Aarskog-Scott syndrome. Frabin binds F-actin and shows F-actin-cross- linking activity. Overexpression of frabin in Swiss 3T3 cells and COS7 cells induces cell shape change and c-Jun N-terminal kinase activation, as described for FGDl.
  • Cdc42p is an esin yeast
  • Cdc42p transduces signals to the actin cytoskeleton to initiate and maintain polarized growth and to mitogen-activated protein morphogenesis.
  • Cdc42p regulates a variety of actin- dependent events and induces the JNK/SAPK protein kinase cascade, which leads to the activation of transcription factors within the nucleus.
  • the novel protein seems to be the human orthologue of rat frabin.
  • the new protein can find application in modulating of cell structure and motility as well as modulation of the JNK/SAPK pathway.
  • htes3_72pl6 As Mem3, the novel protein is similar to yeast VPS (vacuolar protein sorting) 35. The null allele of VPS35 results in yeast in a differential defect in the sorting of vacuolar carboxypeptidase Y (CPY), proteinase A (PrA), proteinase B (PrB), and alkaline phosphatase (ALP).
  • the new protein can find application in modulation the sorting of proteins into different compartments.
  • htes3_7b22 The novel protein is related to paramyosin, a major structural component of thick filaments and invertebrate muscle.
  • htes3_7j3 The new protein is closely related to C-Takl and therefore should be involved in cell-cycle regulation, too. The new protein can find application in modulating/blocking the cell cycle.
  • htes3_7p9 The nuclear domain (ND)10 also described as POD or Kr bodies is involved in the development of acute promyelocytic leukemia and virus-host interactions. The NDP52 protein is part of this complex structure.
  • NDP52 is transcribed in all human tissues, but is redistributed upon viral infection and interferon treatment.
  • ND10 plays an important role in the viral life cycle.
  • the novel protein is similar to NDP52. It contains three leucine zippers and a RGD cell attachment site. This protein seems to be a novel part of the ND819) complex.
  • the new protein can find application in modulation of viral infections and tumour events.
  • htes3_8ml0 The poly(A)-binding protein (PABP) binds to the messenger (mRNA) 3'-poly(A) tail found on most eukaryotic mRNAs and together with the poly(A) tail has been implicated in governing the stability and the translation of mRNA.
  • PABP poly(A)-binding protein
  • Kidney hfkd2_24bl5 The new protein can find application in modulation of hexose metabolism pathways and as a new enzyme for biotechnologic production processes.
  • hfkd2_24n20 The new protein seems to be part of the signalling pathway between tyrosine kinases and the membrane/cyto skeleton. The new protein can find application in modulating cell adhesion/motility and membrane/cyto skeleton structure and dynamics.
  • hfkd2_3ol7 The new protein can find application in modulation of the respiratory electron transport chain pathways of mitochondria.
  • hfkd2_46j20 The new protein can find application in modulating the homoprotocatechuate degradative pathway and as a enzyme for biotechnologic production processes.
  • hfkd2_46kl9 The new protein can find application in modulating/blocking the expression of genes controlled by the hepatocyte nuclear factor- 1.
  • hfkd2_46m4 SARI proteins are involved in vesicular transport between the endoplasmic reticulum and the Golgi apparatus.
  • hfkd2_46kl4 rab6 is a ubiquitous ras-like GTPase involved in intra-Golgi transport.
  • the new protein can find application in modulating the transport of vesicles inside the Golgi apparatus.
  • hutel_18il9 The SREBP-2 protein is embedded in the membranes of the nucleus and endoplasmic reticulum. In cholesterol-depleted cells the proteins are cleaved to release soluble NH2 -terminal fragments that enter the nucleus and activate genes encoding the low density lipoprotein receptor and enzymes of cholesterol synthesis.
  • the new protein is a putative transcription factor capable of protein-protein interaction via a lim domain and additionally shows similarity to the common sunflower transcription factor SF3.
  • hute 1_1811 The novel protein is similar to several 40S ribosomal proteins and therefore seems to part of the corresponding ribosome sub-unit.
  • hute l_19g22 The new protein can find application in modulation of tissue- calcification, especially the uterus.
  • hutel_19hl7 The new protein can find application in modulating the response of cells to oxysterols.
  • hute l_20bl9 The novel protein seems to be a novel enzyme with sarcosine oxidase activity.
  • the new protein can find application in modulation of sarcosine metabolism and as a new enzyme for biotechnologic production processes.
  • hute l_20g21 The novel protein seems to be a new ras inhibitor protein.
  • the new protein can find application in modulating/blocking ras dependent signal transduction pathways.
  • hute l_20hl3 The novel protein is a new human alpha-adaptin.
  • the new protein can find application in modulating endocytosis and vesicle trafficking in cells.
  • hute l_20ml 1 The new protein can find application in modulating/blocking the activity of protein phosphatase- 1 and in modulating the cell cycle.
  • hute l_20m24 This protein is a putative mannosyl transferase that is involved in the assembly of the core oligosaccharide Glc3Man9GlcNAc2.
  • the new protein can find application in modulation of glycosylation of proteins and as a new enzyme for biotechnologic production processes.
  • hute l_22el2 The new protein can find application in modulating the cornichon modulated signal transduction way and also the EGF receptor signaling processes.
  • hute l_23el3 The novel protein contains a serine protease of the subtilase family with an aspartic acid-containing active site. The new protein can find application in modulation of proteinase activity in cells and as a new enzyme for proteomics and biotechnologic production processes.
  • hutel_24j6 The new protein can find application in modulation of cell-cell-adhesion.
  • hutel_24h3 The new protein can find application as a useful marker for chondro- osteogenic cell differentiation and for the modulation of chondro-osteogenic cell differentiation.
  • hfbr2_16cl6 The new protein can find application in modulating/blocking of cyto skeleton-membrane protein interaction.
  • hfbr2_23b21 The new protein can find application in modulating/blocking the guanylate cyclase-pathway.
  • hfbr2_23bl0 The new protein can find application in modulation of splicing.
  • hfbr2_2b5 The novel protein contains the typical (xxG)n repeat of collagen proteins and a Pfam von Willebrand factor type A domain. Therefore, the protein seems to be a new collagen alpha chain.
  • the new protein can find application in modulation of connective tissue, bone and cartilage development and maintainance.
  • hfbr2_2cl7 The new protein can find application in modulating/blocking G-protein- dependent pathways.
  • hfbr2_2dl5 The new protein can find application in modulating early spermatogenesis.
  • hfbr2_2il7 The new protein can find clinical application in modulating the transport of glycoproteins inside cells, especially of the LDL receptor.
  • hfbr2_2kl4 Tumour-suppressor genes are known to be involved in the control of cell growth and division, interacting with proteins which control the cell cycle.
  • the N33 gene is significantly methylated in tumour cells, a mechanism by which tumor- suppressor genes are inactivated in cancer.
  • the novel protein contains a RGD cell attachment site.
  • RNA helicases comprise a large family of proteins that are involved in basic biological systems such as nuclear and mitochondrial splicing processes, RNA editing, rRNA processing, translation initiation, nuclear mRNA export, and mRNA degradation. RNA helicases are essential factors in cell development and differentiation, and some of them play a role in transcription and replication of viral single-stranded RNA genomes.
  • the members of the largest subgroup, the DEAD and DEAH box proteins exhibit a strong dependence of the unwinding activity on ATP hydrolysis.
  • the novel protein contains a DEAD-box and is a new member of this subgroup.
  • hfbr_3g8 The new protein can find application modulating NAT assembly and action and therefore be important in metabolism of drugs and environmental mutagens.
  • hfbr2_62bl 1 The rac small GTPase is associated with type-I phosphatidylinositol 4- phosphate 5-kinase and regulating the production of phosphatidylinositol 4,5- bisphosphate.
  • the new protein is expected to activate p21 rac -related small GTPases.
  • hfbr2_62ol7 The new protein can find application in modulation of cholesterol binding and transport by LDL-receptors and LDL-binding proteins.
  • hfbr_6b24 The new protein can find application in modulation of rhamnose metabolism and as a new enzyme for biotechnologic production processes.
  • hfbr_72bl8 The new protein can find application in modulating DNA repair and mutagenesis.
  • hfbr_78c4 The new protein can find application in modulating/blocking the response of cells to interferons.
  • hfbr_78k24 These enzymes are involved in the processing of poly-ubiquitin precursors as well as that of ubiquinated proteins.
  • the new protein can find application in modulation of protein stability /degradation in cells.
  • hfbr_82e4 The new protein can find clinical application in modulating/blocking calmodulin-mediated pathways in human neuronal cells.
  • Variants include DNA and/or protein molecules that resemble, structurally and/or functionally, those set forth in herein. Variants may be isolated from natural sources (“homologs”), may be entirely synthetic or may be based in part on both natural and synthetic approaches.
  • Preferred molecules are characterized by a combination of one or more of these characteristics. For instance, some preferred molecules are described with reference to at least two structural characteristics, while others may be described with reference to at least one structural and at least one functional characteristic.
  • eukaryotic structural genes are comprised of both protein coding and non-coding portions.
  • messenger RNA When the messenger RNA is transcribed from the DNA template, it contains introns, which are non-coding, and exons, which are coding.
  • the introns In order to form a translation competent mRNA, the introns must be "spliced" out of this initial pre mRNA.
  • splice junctions that direct the cellular splicing machinery to the appropriate position.
  • the splice junctions are loosely conserved sequence regions of the pre mRNA, which almost invariably begin with GT and end with AG (DNA perspective).
  • the 5' end of the splice junction typically contains about nine somewhat conserved residues, for example, C/AAGTA/G AGT .
  • the 3' end usually contains a pyrimidine rich stretch of at least about 11 nucleotides, followed by NC/TAGG. Splicing occurs before the GT and after the AG. Mount, Nucleic Acids Res. 10:459-72 (1982).
  • exons often correspond to discrete functional domains of the protein product.
  • the intron exon arrangement thus creates a linear array of nucleotides which can be correlated to discrete, and often interchangeable, functional protein fragments. Go, Nature 291:90-92 (1981); Branden et al , EMBO J. 3:1307-10 (1984).
  • This linear arrangement creates the possibility of generating multiple different full length proteins by rearranging the order of the different functional portions in the array. For example, if a set of exons are arranged 1-2-3-4, where (-) represents the introns separating the exons, a splicing event need not simply produce 1234, but may produce 123, 134, 124 and so on. Production of different mRNA products in this way is commonly called “alternative splicing. " Andreadiset al. , Ann. Rev. Cell Biol. 3:207-42 (1987).
  • DNA molecules can be represented in modular fashion in terms of their coding regions. Essentially, these modules are exons (though each "exon” may in fact be made up of several exons), which may be combined in different ways to form a variety of different DNA molecules, each encoding a different functional protein. Splicing variants are indicated below.
  • a “degenerate variant” is a nucleotide fragment which differs from those of inventive molecules by nucleotide sequence, but due to the degeneracy of the genetic code, encodes an identical polypeptide sequence.
  • degenerate variants typically are described by reference to this relationship. It is well known that the degeneracy of the genetic code results in many possible DNA sequences which encode a particular protein. Indeed, of the three bases which comprise an amino acid- encoding triplet, the third position, and often the second, almost always may vary. This fact alone allows for a class of variant DNA molecules which encode protein sequences identical to those disclosed herein, yet have about 30% sequence variation. In other words, the variant DNA molecules are about 70% identical to the inventive DNAs, having no additional or deleted sequences. Thus, one aspect of the invention provides degenerate variant DNA molecules encoding the inventive protein sequences.
  • these variants have at least about 70% sequence identity with the DNA molecules described herein. In a preferred embodiment, these variants have at least about 80% sequence identity to the inventive molecules. In a more preferred embodiment these variants have at least about 90% sequence identity with the inventive molecules.
  • Variants according to the invention also may be made that conserve the overall molecular structure of the encoded proteins. Given the properties of the individual amino acids comprising the disclosed protein products, some rational substitutions will be recognized by the skilled worker. Amino acid substitutions, i.e. "conservative substitutions,” may be made, for instance, on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues involved.
  • nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine
  • polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine
  • positively charged (basic) amino acids include arginine, lysine, and histidine
  • negatively charged (acidic) amino acids include aspartic acid and glutamic acid. Substitutions typically may be made within groups (a)-(d).
  • glycine and proline may be substituted for one another based on their ability to disrupt ⁇ -helices.
  • certain amino acids such as alanine, cysteine, leucine, methionine, glutamic acid, glutamine, histidine and lysine are more commonly found in ⁇ -helices
  • valine, isoleucine, phenylalanine, tyrosine, tryptophan and threonine are more commonly found in ⁇ -pleated sheets.
  • Glycine, serine, aspartic acid, asparagine, and proline are commonly found in turns.
  • sequence identity between two polypeptide sequences indicates the percentage of amino acids that are identical between the sequences.
  • sequence similarity indicates the percentage of amino acids that either are identical or that represent conservative amino acid substitutions.
  • DNA variants within the scope of the invention may be described with reference to the product they encode.
  • some of the inventive DNA molecules encode a protein having a degree of homology with known proteins, or protein domains. It is expected, therefore, that they will have some or all of the requisite functional features of such molecules.
  • These "functionally equivalent variants" products are characterized by the fact that they are functionally equivalent, with respect to biological activity, to certain known molecules.
  • the instant invention provides information on common structural motifs, including consensus sequences that will guide the artisan in constructing functionally equivalent variants. It will be understood that the motifs, identified for each inventive protein, may be modified within the identified consensus sequences. Thus, the invention contemplates the proteins disclosed herein that contain variability in the consensus sequences identified, and the invention further contemplates the full range of nucleic acids encoding them, and the complements of those nucleic acids.
  • DNA variants within the invention also may be described by reference to their physical properties in hybridization.
  • DNA can be used to identify its complement and, since DNA is double stranded, its equivalent or homolog, using nucleic acid hybridization techniques. It will also be recognized that hybridization can occur with less than 100% complementarity.
  • hybridization techniques can be used to differentiate among DNA sequences based on their structural relatedness to a particular probe. For guidance regarding such conditions see, for example, Sambrook et al , 1989, MOLECULAR CLONING, A LABORATORY MANUAL, Cold Spring Harbor Press, N.Y. ; and Ausubel et al, 1989, CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Green Publishing Associates and Wiley Interscience, N.Y.
  • Structural relatedness between two polynucleotide sequences can be expressed as a function of "stringency" of the conditions under which the two sequences will hybridize with one another.
  • stringency refers to the extent that the conditions disfavor hybridization. Stringent conditions strongly disfavor hybridization, and only the most structurally related molecules will hybridize to one another under such conditions. Conversely, non-stringent conditions favor hybridization of molecules displaying a lesser degree of structural relatedness. Hybridization stringency, therefore, directly correlates with the structural relationships of two nucleic acid sequences. The following relationships are useful in correlating hybridization and relatedness (where T m is the melting temperature of a nucleic acid duplex):
  • T m of a duplex DNA decreases by 1°C with every increase of 1 % in the number of mismatched base pairs.
  • Hybridization stringency is a function of many factors, including overall DNA concentration, ionic strength, temperature, probe size and the presence of agents which disrupt hydrogen bonding. Factors promoting hybridization include high DNA concentrations, high ionic strengths, low temperamres, longer probe size and the absence of agents that disrupt hydrogen bonding.
  • Hybridization usually is done in two stages. First, in the "binding" stage, the probe is bound to the target under conditions favoring hybridization. Stringency is usually controlled at this stage by altering the temperature. For high stringency, the temperature is usually between 65°C and 70°C, unless short ( ⁇ 20 nt) oligonucleotide probes are used.
  • a representative hybridization solution comprises 6X SSC, 0.5% SDS, 5X Denhardt's solution and lOO ⁇ g of non-specific carrier DNA. See Ausubel et al , supra, section 2.9, supplement 27 (1994). Of course many different, yet functionally equivalent, buffer conditions are known. Where the degree of relatedness is lower, a lower temperature may be chosen. Low stringency binding temperatures are between about 25°C and 40°C. Medium stringency is between at least about 40°C to less than about 65°C. High stringency is at least about 65°C.
  • washing solutions typically contain lower salt concentrations.
  • One exemplary medium stringency solution contains 2X SSC and 0.1 % SDS.
  • a high stringency wash solution contains the equivalent (in ionic strength) of less than about 0.2X SSC, with a preferred stringent solution containing about 0. IX SSC.
  • the temperatures associated with various stringencies are the same as discussed above for "binding. "
  • the washing solution also typically is replaced a number of times during washing. For example, typical high stringency washing conditions comprise washing twice for 30 minutes at 55° C. and three times for 15 minutes at 60° C.
  • the present invention includes nucleic acid molecules that hybridize to the inventive molecules under high stringency binding and washing conditions. More preferred molecules (from an mRNA perspective) are those that are at least 50 % of the length of any one of those depicted in below. Particularly preferred molecules are at least 75 % of the length of those molecules.
  • the preferred DNA variants of the invention are those that retain the closest relationship, as described by "sequence identity" to the inventive DNA molecules. According to another aspect of the invention, therefore, substitutions, insertions, additions and deletions of defined properties are contemplated. It will be recognized that sequence identity between two polynucleotide sequences, as defined herein, generally is determined with reference to the protein coding region of the sequences. Thus, this definition does not at all limit the amount of DNA, such as vector DNA, that may be attached to the molecules described herein. Preferred DNA sequence variants include molecules encoding proteins sharing some or all of any relevant biological activity of the native molecule.
  • insertions and deletions in any recognized functional domain generally should be avoided, except as noted below in the section entitled "Proteins," where this domain is discussed in detail. Alterations in such domains usually will be limited to conservative amino acid substitutions. In addition, where insertions and deletions are desired, this may be accomplished at the N- and/or C-terminus of the protein molecule (or the corresponding coding regions of the DNA). If insertions or deletions are made within the protein, deletions of major structural features usually should be avoided. Thus, a preferred place to make insertion or deletion variants is in non-structural regions, such as linker regions between two alpha helices.
  • substitutions generally refer to alterations in the DNA sequence which do not change its overall length, but only alter one or more nucleotide positions, substituting one for another in the common sense of the word.
  • One class of preferred substitutions “degenerate substitutions, " are those that do not alter the encoded amino acid sequence. Some subsitutions retains 50%, 55%, 60% or 65% identity. Preferred substitutions retain at least about 70% identity, more preferably at least 70% or 75% identity, with the inventive DNAs. Some more preferred molecules have at least about 80% identity, more preferably at least 80% or 85% identity. Particularly preferred DNAs share at least about 90% identity, more preferably at least 90% or 95% identity.
  • Insertions unlike substitutions, alter the overall length of the DNA molecule, and thus sometimes the encoded protein. Insertions add extra nucleotides to the interior (not the 5' or 3' ends) of the subject DNAs. Preferred insertions are made with reference to the protein sequence encoded by the DNA. Thus, it is most preferred to provide an insertion in the DNA at a location that corresponds to an area of the encoded protein which lacks structure. For instance, it typically would not be beneficial, if the preservation of biological activity is desired, to provide an insertion within an alpha-helical region or a beta-pleated sheet. Accordingly, non-structural areas, such as those containing helix-breaking glycines and proline residues, are most preferred sites of insertion. Other preferred sites of insertion are the splice sites, which are indicated above in the description of the inventive DNA molecules.
  • the optimal size of insertions will vary depending upon the site of insertion and its effect on the overall conformation of the encoded protein, some general guides are useful. Generally, the total insertions (irrespective of their number) should not add more than about 30% (or preferably not more than 30%) to the overall size of the encoded protein. More preferably, the insertion adds less than about 10-20% (yet more preferably 10-20%) in size, with less than about 10% being most preferred. The number of insertions is limited only by the number of suitable insertions sites, and secondarily by the foregoing size preferences.
  • Additional sizes like insertions, also add to the overall size of the DNA molecule, and usually the encoded protein. However, instead of being made within the molecule, they are made on the 5' or 3' end, usually corresponding to the N- or C- terminus of the encoded protein. Unlike deletions, additions are not very size-dependent. Indeed, additions may be of virtually any size. Preferred additions, however, do not exceed about 100% of the size of the native molecule. More preferably, they add less than about 60 to 30% to the overall size, with less than about 30% being most preferred.
  • “Deletions” diminish the overall size of the DNA and, therefore, also reduce the size of the protein encoded by that DNA. Deletions may be made from either end of the molecule or internal to it. Typical preferred deletions remove discrete structural features of the encoded protein. For example, some deletions will comprise the deletion of one or more exons which may define a structural feature. Preferred deletions remove less than about 30% of the size of the subject molecule. More preferred deletions remove less than about 20% and most preferred deletions remove less than about 10% .
  • sequence identity refers to a comparison made between two molecules using, for example, the standard Smith- Waterman algorithm that is well known in the art.
  • Some molecules have at lease about 50%, 55% or 60% identity. Preferred molecules are those having at least about 65% sequence identity, more preferably at least 65% or 70% sequence identity. Other preferred molecules have at least about 80%, more preferably at least 80% or 85%, sequence identity. Particularly preferred molecules have at least about 90% sequence identity, more preferably at least 90% sequence identity. Most preferred molecules have at least about 95 % , more preferably at least 95 % , sequence identity. As used herein, two nucleic acid molecules or proteins are said to "share significant sequence identity" if the two contain regions which possess greater than 85 % sequence (amino acid or nucleic acid) identity.
  • Sequence identity is defined herein with reference the Blast 2 algorithm, which is available at the NCBI (http://www.ncbi.nlm.nih.gov/BLAST), using default parameters. References pertaining to this algorithm include: those found at http://www.ncbi.nlm.nih.gov/BLAST/blast_references.html; Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. (1990) "Basic local alignment search tool.” J. Mol. Biol. 215:403-410; Gish, W. & States, D.J. (1993) "Identification of protein coding regions by database similarity search.” Nature Genet.
  • variants of the inventive molecules can be constructed in several different ways. For example, they may be constructed as completely synthetic DNAs. Methods of efficiently synthesizing oligonucleotides in the range of 20 to about 150 nucleotides are widely available. See Ausubel et al , supra, section 2.11, Supplement 21 (1993). Overlapping oligonucleotides may be synthesized and assembled in a fashion first reported by Khorana et al, J. Mol. Biol. 72:209-217 (1971); see also Ausubel et al, Section 8.2. The synthetic DNAs are designed with convenient restriction sites engineered at the 5' and 3 ' ends of the gene to facilitate cloning into an appropriate vector.
  • An alternative method of generating variants is to start with one of the inventive DNAs and then to conduct site-directed mutagenesis. See Ausubel et al , supra, chapter 8, Supplement 37 (1997).
  • a target DNA is cloned into a single-stranded DNA bacteriophage vehicle.
  • Single-stranded DNA is isolated and hybridized with a oligonucleotide containing the desired nucleotide alteration(s).
  • the complementary strand is synthesized and the double stranded phage is introduced into a host.
  • Some of the resulting progeny will contain the desired mutant, which can be confirmed using DNA sequencing.
  • various methods are available that increase the probability that the progeny phage will be the desired mutant. These methods are well known to those in the field and kits are commercially available for generating such mutants.
  • homologs are essentially naturally-occurring variants and include allelic, species-specific and tissue-specific variants.
  • Region-specific primers or probes derived from the nucleotide sequence(s) provided can be used to prime DNA synthesis and PCR amplification, as well as to identify colonies containing cloned DNA encoding a homolog using known methods (Innis et al, PCR Protocols, Academic Press, San Diego, CA (1990)). Such an application is useful in diagnostic methods, as described in more detail below, as well as in preparing full-length DNAs from various sources.
  • primers derived from the inventive sequences When using primers derived from the inventive sequences, one skilled in the art will recognize that by employing high stringency conditions (e.g. , annealing at 50-60°C), only sequences with greater than 75% sequence identity to the primer will be amplified. By employing lower stringency conditions (e.g., annealing at 35-37°C), sequences which have greater than 40-50% sequence identity to the primer also will be amplified.
  • high stringency conditions e.g. , annealing at 50-60°C
  • lower stringency conditions e.g., annealing at 35-37°C
  • the PCR product may be subcloned and sequenced to confirm that it indeed displays the expected sequence identity.
  • the PCR fragment may then be used to isolate a full length cDNA clone by a variety of methods.
  • the amplified fragment may be labeled and used to screen a bacteriophage cDNA library.
  • the labeled fragment may be used to screen a genomic library.
  • RNA may be isolated, following standard procedures, from an appropriate cellular or tissue source.
  • a reverse transcription reaction may be performed on the RNA using an oligonucleotide primer specific for the most 5 ' end of the amplified fragment for the priming of first strand synthesis.
  • the resulting RNA/DNA hybrid may then be "tailed" with guanines using a standard terminal transferase reaction, the hybrid may be digested with RNAase H, and second strand synthesis may then be primed with a poly-C primer.
  • cDNA sequences upstream of the amplified fragment may easily be isolated.
  • DNA probes derived from the inventive sequences for colony/plaque hybridization When using DNA probes derived from the inventive sequences for colony/plaque hybridization, one skilled in the art will recognize that by employing medium to high stringency conditions (e.g., hybridizing at 50-65°C in 5X SSPC and 50% formamide, and washing at 50-65°C in 0.5X SSPC), sequences having regions with greater than 90% sequence identity to the probe can be obtained, and that by employing lower stringency conditions (e.g., hybridizing at 35-37°C in 5X SSPC and 40-45% formamide, and washing at 42°C in SSPC), sequences having regions with greater than 35-45% sequence identity to the probe will be obtained.
  • medium to high stringency conditions e.g., hybridizing at 50-65°C in 5X SSPC and 50% formamide, and washing at 50-65°C in 0.5X SSPC
  • lower stringency conditions e.g., hybridizing at 35-37°C in 5X SSPC
  • genomic or cDNA libraries can be constructed and screened in accord with the previous paragraph.
  • the libraries should be derived from a tissue or organism that is known to express the gene of interest, or that is suspected of expressing the gene.
  • the clone containing the homolog may then be purified through methods routinely practiced in the art, and subjected to sequence analysis.
  • an expression library can be constructed utilizing DNA isolated from or cDNA synthesized from a tissue or organism that is known to express the gene of interest, or that is suspected of expressing the gene. In this manner, clones may be induced and screened using standard antibody screening techniques in conjunction with antibodies raised against the normal gene product, as described herein. (For screening techniques, see, for example, Harlow, E. and Lane, eds., 1988, ANTIBODIES: A LABORATORY MANUAL, Cold Spring Harbor Press, Cold Spring Harbor Press.)
  • Any organism or tissue can be used as the source for homologs of the present invention so long as the organism or tissue naturally expresses such a protein or contains genes encoding the same.
  • the most preferred organism for isolating homologs is human.
  • proteins included within the invention is encoded by the inventive DNA molecules presented.
  • Other proteins according to the invention are those encoded by the DNA variants described above. As noted, these variants are designed with the encoded proteins in mind.
  • a preferred class of protein fragments includes those fragments which retain any biological activity. These molecules share functional features common the family of proteins, although these characteristics may vary in degree.
  • fragments of the inventive proteins are contemplated. Some preferred fragments are those which are capable of eliciting an immune response. Generally these "antigenic" fragments will be from about five amino acids in length to about fifty amino acids in length. Some preferred antigenic fragments are from five to about twenty amino acids long.
  • Antigenic response may refer to a T cell response, a B cell response or a response by cells of the macrophage/monocyte lineages. In most cases, however, it will refer to the immune response involved in the generation of antibodies. In other words, the relevant immune response is that of helper T cells and/or B cells. These preferred molecules comprise one or more T cell and /or B cell epitopes.
  • Antibodies raised against the proteins and protein fragments of the invention also are contemplated by the invention. Described below are antibody products and methods for producing antibodies capable of specifically recognizing one or more epitopes of the presently described proteins and their derivatives.
  • Antibodies include, but are not limited to polyclonal antibodies, monoclonal antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies including single chain Fv (scFv) fragments, Fab fragments, F(ab') 2 fragments, fragments produced by a Fab expression library, anti-idiotypic (anti-Id) antibodies, epitope-binding fragments, and humanized forms of any of the above.
  • mAbs monoclonal antibodies
  • Fab fragments fragments
  • F(ab') 2 fragments fragments produced by a Fab expression library
  • anti-idiotypic antibodies anti-idiotypic antibodies
  • epitope-binding fragments and humanized forms of any of the above.
  • these antibodies may be used, for example, in the detection of a target protein in a biological sample. They also may be utilized as part of treatment methods, and/or may be used as part of diagnostic techniques whereby patients may be tested for abnormal levels or for the presence of abnormal forms of the such proteins.
  • Polyclonal antibodies are heterogeneous populations of antibody molecules derived from the sera of animals immunized with an antigen, such as an inventive protein or an antigenic derivative thereof.
  • Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein, can be prepared by immunizing suitable animals with the expressed protein described above, which can be unmodified or modified, as known in the art, to enhance immunogenicity. Immunization methods include subcutaneous or intraperitoneal injection of the polypeptide.
  • Effective polyclonal antibody production is affected by many factors related both to the antigen and to the host species. For example, small molecules tend to be less immunogenic than other and may require the use of carriers and/or adjuvant. In addition, host animal response may vary with site of inoculation. Both inadequate or excessive doses of antigen may result in low titer antisera. In general, however, small doses (high ng to low ⁇ g levels) of antigen administered at multiple intradermal sites appears to be most reliable. Host animals may include but are not limited to rabbits, mice, chickens and rats, to name but a few. An effective immunization protocol for rabbits can be found in Vaitukaitis, J. et ⁇ l, J. Clin. Endocrinol.
  • the protein immunogen may be modified or administered in an adjuvant in order to increase the protein's antigenicity.
  • Methods of increasing the antigenicity of a protein include, but are not limited to coupling the antigen with a heterologous protein (such as globulin ⁇ -galactosidase) or through the inclusion of an adjuvant during immunization.
  • Adjuvants include Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dimtrophenol, and potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacteriumparvum.
  • mineral gels such as aluminum hydroxide
  • surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dimtrophenol, and potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacteriumparvum.
  • Booster injections can be given at regular intervals, with at least one usually being required for optimal antibody production.
  • the antiserum may be harvested when the antibody titer begins to fall. Titer may be determined semi-quantitatively, for example, by double immunodiffusion in agar against known concentrations of the antigen. See, for example, Ouchterlony et al, Chap. 19 in: Handbook of Experimental Immunology, Wier, ed, Blackwell (1973). Plateau concentration of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12 ⁇ M).
  • the antiserum may be purified by affinity chromatography using the immobilized immunogen carried on a solid support. Such methods of affinity chromatography are well known in the art.
  • Affinity of the antisera for the antigen may be determined by preparing competitive binding curves, as described, for example, by Fisher, Chap. 42 in: Manual of Clinical Immunology, second edition, Rose and Friedman, eds., Amer. Soc. For Microbiology, Washington, D.C. (1980).
  • DNA molecules may be used directly. In this manner, a DNA encoding the protein immunogen is administered. Boosting and harvesting is done in a manner analogous to that detailed above. Yet another method of producing antibodies entails immunizing chickens and harvesting the antibodies from their eggs.
  • MAbs Monoclonal antibodies
  • MAbs Monoclonal antibodies
  • They may be obtained by any technique which provides for the production of antibody molecules by continuous cell lines in culture or in vivo.
  • MAbs may be produced by making hybridomas which are immortalized cells capable of secreting a specific monoclonal antibody.
  • Monoclonal antibodies to any of the proteins, peptides and epitopes thereof described herein can be prepared from murine hybridomas according to the classical method of Kohler, G. and Milstein, C, Nature 256:495-497 (1975) (and U.S. Patent No. 4,376,110) or modifications of the methods thereof, such as the human B-cell hybridoma technique (Kosbor et al , 1983, Immunology Today 4:72; Cole et al , 1983, Proc. Natl. Acad. Sci. USA 80: 2026-2030), and the EBV-hybridoma technique (Cole et al , 1985, MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96).
  • a mouse is repetitively inoculated with a few micrograms of the selected protein over a period of a few weeks. The mouse is then sacrificed, and the antibody producing cells of the spleen are isolated.
  • the spleen cells are fused, typically using polyethylene glycol, with mouse myeloma cells, such as SP2/0-Agl4 myeloma cells.
  • mouse myeloma cells such as SP2/0-Agl4 myeloma cells.
  • HAT media selective media comprising aminopterin (HAT media).
  • the successfully fused cells are diluted, and aliquots are plated to microliter plates where growth is continued.
  • Antibody-producing clones are identified by detection of antibody in the supernatant fluid of the wells by immunoassay procedures. These include ELISA, as originally described by Engvall, Meth. Enzymol. 70:419 (1980), western blot analysis, radioimmunoassay (Lutz et al , Exp. Cell Res. 175:109-124 (1988)) and modified methods thereof.
  • Selected positive clones can be expanded and their monoclonal antibody product harvested for use. Detailed procedures for monoclonal antibody production are described in Davis, L. et al BASIC METHODS IN MOLECULAR BIOLOGY, Elsevier, New York. Section 21-2 (1989).
  • the hybridoma clones may be cultivated w vitro or in vivo, for instance as ascites. Production of high titers of mAbs in vivo makes this the presently preferred method of production.
  • hybridoma culture in hollow fiber bioreactors provides a continuous high yield source of monoclonal antibodies.
  • the antibody class and subclass may be determined using procedures known in the art (Campbell, A.M. , Monoclonal Antibody Technology: Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1984)).
  • MAbs may be of any immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. Methods of purifying monoclonal antibodies are well known in the art.
  • Fragments or derivatives of antibodies include any portion of the antibody which is capable of binding the target antigen, or a specific portion thereof.
  • Antibody derivatives include poly-specific (e.g., bi-specific) antibodies, which contain binding sites specific for two or more different epitopes. These epitopes may be from the same or different inventive molecules or one or more epitope may be from a molecule not specifically disclosed here.
  • Antibody fragments specifically include F(ab Fab, Fab' and Fv fragments. These can be generated from any class of antibody, but typically are made from IgG or IgM. They may be made by conventional recombinant DNA techniques or, using the classical method, by proteolytic digestion with papain or pepsin. See CURRENT PROTOCOLS IN IMMUNOLOGY, chapter 2, Coligan et ⁇ /. , eds., (John Wiley & Sons 1991-92).
  • F(ab') 2 fragments are typically about 110 kDa (IgG) or about 150 kDa (IgM) and contain two antigen-binding regions, joined at the hinge by disulfide bond(s). Virtually all, if not all, of the Fc is absent in these fragments.
  • Fab' fragments are typically about 55 kDa (IgG) or about 75 kDa (IgM) and can be formed, for example, by reducing the disulfide bond(s) of an F(ab') 2 fragment. The resulting free sulfhydryl group(s) may be used to conveniently conjugate Fab' fragments to other molecules, such as detection reagents (e.g. , enzymes).
  • Fab fragments are monovalent and usually are about 50 kDa (from any source).
  • Fab fragments include the light (L) and heavy (H) chain, variable (V L and V H , respectively) and constant (C L C H , respectively) regions of the antigen-binding portion of the antibody.
  • the H and L portions are linked by an intramolecular disulfide bridge.
  • Fv fragments are typically about 25 kDa (regardless of source) and contain the variable regions of both the light and heavy chains (V L and V H , respectively).
  • V L and V H chains are held together only by non-covalent interacts and, thus, they readily dissociate. They do, however, have the advantage of small size and they retain the same binding properties of the larger Fab fragments. Accordingly, methods have been developed to crosslink the V L and V H chains, using, for example, glutaraldehyde (or other chemical crosslinkers), intermolecular disulfide bonds (by incorporation of cysteines) and peptide linkers.
  • SCFv single chain
  • Single chain antibodies are formed by linking the heavy and light chain fragments of the Fv region via an amino acid bridge, resulting in a single chain FV (SCFv).
  • a recombinant vector would be provided which comprises the appropriate regulatory elements driving expression of a cassette region.
  • the cassette region would contain a DNA encoding a peptide linker, with convenient sites at both the 5' and 3' ends of the linker for generating fusion proteins.
  • the DNA encoding a variable region(s) of interest may be cloned in the vector to form fusion proteins with the linker, thus generating an scFv.
  • DNAs encoding two Fvs may be ligated to the DNA encoding the linker, and the resulting tripartite fusion may be ligated directly into a conventional expression vector.
  • the scFv DNAs generated any of these methods may be expressed in prokaryotic or eukaryotic cells, depending on the vector chosen.
  • Antibody fragments which recognize specific epitopes may be generated by known techniques.
  • such fragments include but are not limited to: the F(ab' ⁇ fragments which can be produced by pepsin digestion of the antibody molecule and the Fab fragments which can be generated by reducing the disulfide bridges of the F(ab ⁇ fragments.
  • Fab expression libraries may be constructed (Huse et al., 1989, Science, 246: 1275-1281) to allow rapid and easy identification of monoclonal Fab fragments with the desired specificity.
  • chimeric antibodies also include "chimeric antibodies” (Morrison et al , Proc. Natl. Acad. Sci. , 81:6851-6855 (1984); Neuberger et al , Nature, 312:604-608 (1984); Takeda et al , Nature, 314:452-454 (1985)). These chimeras are made by splicing the DNA encoding a mouse antibody molecule of appropriate specificity with, for instance, DNA encoding a human antibody molecule of appropriate specificity.
  • a chimeric antibody is a molecule in which different portions are derived from different animal species, such as those having a variable region derived from a murine mAb and a human immunoglobulin constant region. These are also known sometimes as "humanized” antibodies and they offer the added advantage of at least partial shielding from the human immune system. They are, therefore, particularly useful in therapeutic in vivo applications.
  • the present invention further provides the above-described antibodies in detectably labeled form.
  • Antibodies can be detectably labelled through the use of radio isotopes, affinity labels (such as biotin, avidin, etc.), enzymatic labels (such as horseradish peroxidase, alkaline phosphatase, etc.) fluorescent labels (such as FITC or rhodamine, etc.), paramagnetic atoms, etc. Procedures for accomplishing such labeling are well-known in the art, for example see (Sternberger et al , J. Histochem. Cytochem. 18:315 (1970); Bayer et al, Meth. Enzym. 62:308 (1979); Engval et al, Immunol. 109:129 (1972); Goding, J. Immunol. Meth. 13:215 (1976)).
  • the labeled antibodies of the present invention can be used for vitro, in vivo, and in situ diagnostic assays.
  • the foregoing antibodies also may be immobilized on a solid support.
  • solid supports include plastics such as polycarbonate, complex carbohydrates such as agarose and sepharose, acrylic resins and such as polyacrylamide and latex beads. Techniques for coupling antibodies to such solid supports are well known in the art (Weiret al, "Handbook of Experimental Immunology” 4th Ed., Blackwell Scientific Publications, Oxford, England, Chapter 10 (1986); Jacoby et al, Meth. Enzym. 34 Academic Press, N.Y. (1974)).
  • the immobilized antibodies of the present invention can be used for in vitro, in vivo, and in situ assays as well as for immunoaffimty purification of the proteins of the present invention.
  • the proteins, antibodies and polynucleotides of the present invention can be formulated according to known methods to prepare pharmaceutically useful compositions, whereby these materials, or their functional derivatives, are combined in admixture with a pharmaceutically acceptable carrier vehicle.
  • a pharmaceutically acceptable carrier vehicle e.g., a pharmaceutically acceptable carrier vehicle.
  • suitable vehicles and their formulation, inclusive of other human proteins, e.g., human serum albumin are described, for example, in Remington's Pharmaceutical Sciences (16th ed., Osol, A., Ed., Mack, Easton PA (1980)).
  • a pharmaceutically acceptable composition suitable for effective administration such compositions will contain an effective amount of one or more of the agents of the present invention, together with a suitable amount of carrier vehicle.
  • compositions for use in accordance with the present invention may be formulated in conventional manner using one or more physiologically acceptable carriers or excipients.
  • the compounds and their physiologically acceptable salts and solvate may be formulated for administration by inhalation or insufflation (either through the mouth or the nose) or oral, buccal, parenteral or rectal administration.
  • the pharmaceutical compositions may take the form of, for example, tablets or capsules prepared by conventional means with pharmaceutically acceptable excipients such as binding agents (e.g. , pregelatinised maize starch, polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e.g. , lactose, microcrystalline cellulose or calcium hydrogen phosphate); lubricants (e.g. , magnesium stearate, talc or silica); disintegrants (e.g. , potato starch or sodium starch glycolate); or wetting agents (e.g. , sodium lauryl sulphate).
  • binding agents e.g. , pregelatinised maize starch, polyvinylpyrrolidone or hydroxypropyl methylcellulose
  • fillers e.g. , lactose, microcrystalline cellulose or calcium hydrogen phosphate
  • lubricants e.g. , magnesium stearate, talc or silica
  • disintegrants
  • Liquid preparations for oral administration may take the form of, for example, solutions, syrups or suspensions, or they maybe presented as a dry product for constitution with water or other suitable vehicle before use.
  • Such liquid preparations may be prepared by conventional means with pharmaceutically acceptable additives such as suspending agents (e.g. , sorbitol syrup, cellulose derivatives or hydrogenated edible fats); emulsifying agents (e.g. , lecithin or acacia); non-aqueous vehicles (e.g. , almond oil, oily esters, ethyl alcohol or fractionated vegetable oils); and preservatives (e.g. , methyl or propyl- p-hydroxybenzoates or sorbic acid).
  • the preparations may also contain buffer salts, flavoring, coloring and sweetening agents as appropriate.
  • Preparations for oral administration may be suitably formulated to give controlled release of the active compound.
  • the composition may take the form of tablets or lozenges formulated in conventional manner.
  • the compounds for use according to the present invention are conveniently delivered in the form of an aerosol spray presentation from pressurized packs or a nebuliser, with the use of a suitable propellant, e.g. , dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas.
  • a suitable propellant e.g. , dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas.
  • a suitable propellant e.g. , dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas.
  • a suitable propellant e.g. , dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane
  • the compounds may be formulated for parenteral administration by injection, e.g. , by bolus injection or continuous infusion.
  • Formulations for injection may be presented in unit dosage form, e.g. , in ampules or in multi-dose containers, with an added preservative.
  • the compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents.
  • the active ingredient may be in powder form for constitution with a suitable vehicle, e.g. , sterile pyrogen-free water, before use.
  • the compounds may also be formulated in rectal compositions such as suppositories or retention enemas, e.g. , containing conventional suppository bases such as cocoa butter or other glycerides.
  • the compounds may also be formulated as a depot preparation. Such long acting formulations may be administered by implantation (for example subcutaneously or intramuscularly) or by intramuscular injection.
  • the compounds may be formulated with suitable polymeric or hydrophobic materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt.
  • compositions may, if desired, be presented in a pack or dispenser device which may contain one or more unit dosage forms containing the active ingredient.
  • the pack may for example comprise metal or plastic foil, such as a blister pack.
  • the pack or dispenser device may be accompanied by instructions for administration.
  • the present invention further provides recombinant DNA constructs comprising one or more of the nucleotide sequences of the present invention.
  • the recombinant constructs of the present invention comprise a vector, such as a plasmid or viral vector, into which a DNA or DNA fragment, typically bearing an open reading frame, is inserted, in either orientation.
  • the gene products encoded by the subject DNAs may be produced by recombinant DNA technology using techniques well known in the art. See, for example, the techniques described in Sambrook et al., 1989, supra, and Ausubel et al., 1989, supra.
  • the DNA sequences may be chemically synthesized using, for example, synthesizers. See, for example, the techniques described in OLIGONUCLEOTIDE SYNTHESIS, 1984, Gait, ed., IRL Press, Oxford, which is incorporated by reference herein in its entirety. They may be assembled from fragments and short oligonucleotide linkers, or from a series of oligonucleotides. The are preferably made by RT-PCR methods. The resulting synthetic gene is capable of being expressed in a recombinant vector.
  • the recombinant constructs will be expression vectors, which are capable of expressing the RNA and/or protein products of the encoded DNA(s).
  • the vector may further comprise regulatory sequences, including for example, a promoter, operably linked to the open reading frame (ORF).
  • the vector may further comprise a selectable marker sequence.
  • Specific initiation signals may also be required for efficient translation of inserted target gene coding sequences. These signals include the ATG initiation codon and adjacent sequences. In cases where a target DNA includes its own initiation codon and adjacent sequences is inserted into the appropriate expression vector, no additional translation control signals may be needed. However, in cases where only a portion of an ORF is used, exogenous translational control signals, including, perhaps, the ATG initiation codon, must be provided. Furthermore, the initiation codon must be in phase with the reading frame of the desired coding sequence to ensure translation of the entire target. These exogenous translational control signals and initiation codons can be of a variety of origins, both natural and synthetic.
  • the efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer elements, transcription terminators, etc. (see Bittner et al, Methods in Enzymol. 153:516-544 (1987)).
  • Some appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et al, in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New York (1989), the disclosure of which is hereby incorporated by reference.
  • codon context and codon pairing of the sequence may be optimized for the particular expression organism, as explained by Hatfieldet /., U.S. Patent No. 5,082,767.
  • the present invention further provides host cells containing at least one of the DNAs of the present invention.
  • the host cell can be virtually any cell for which expression vectors are available. It may be, for example, a higher eukaryotic host cell, such as a mammalian cell, a lower eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell.
  • Introduction of the recombinant construct into the host cell can be effected by calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation (Davis et al, Basic Methods in Molecular Biology (1986)).
  • yeast e.g. Saccharomyces, Pichia transformed with recombinant yeast expression vectors containing the target DNA
  • insect cell systems infected with recombinant virus expression vectors (e.g. , baculovirus) containing the target DNA sequences
  • plant cell systems infected with recombinant virus expression vectors e.g. , cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV
  • recombinant plasmid expression vectors e.g. Ti plasmid
  • mammalian cell systems e.g.
  • COS COS, CHO, BHK, 293, 3T3 harboring recombinant expression constructs containing promoters derived from the genome of mammalian cells (e.g. , metallothionein promoter) or from mammalian viruses (e.g. , the adenovirus late promoter; the vaccinia virus 7.5K promoter).
  • promoters derived from the genome of mammalian cells e.g. , metallothionein promoter
  • mammalian viruses e.g. , the adenovirus late promoter; the vaccinia virus 7.5K promoter.
  • the resulting product may differ.
  • proteins expressed in most bacterial cultures e.g. , E. coli
  • polypeptides or proteins expressed in yeast will have a glycosylation pattern different from that expressed in mammalian cells.
  • recombinant expression vectors will include origins of replication and selectable markers permitting selection of the host cell, e.g. , the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly -expressed gene to direct transcription of a downstream structural sequence.
  • promoters can be derived from operons encoding glycolytic enzymes such as 3 -phosphogly cerate kinase (PGK), ⁇ -factor, acid phosphatase, or heat shock proteins, among others.
  • the heterologous structural sequence is assembled in appropriate phase with translation initiation and termination sequence, and in one aspect of the invention, a leader sequence capable of directing secretion of translated protein into the periplasmic space or extracellular medium.
  • the heterologous sequence can encode a fusion protein including an N-terminal or C-terminal identification peptide imparting desired characteristics, e.g. , stabilization or simplified purification of expressed recombinant product.
  • Useful expression vectors for bacterial use are constructed by inserting a structural DNA sequence encoding a desired protein together with suitable translation initiation and termination signals in operable reading phase with a functional promoter.
  • the vector will comprise one or more phenotypic selectable markers and an origin of replication to ensure maintenance of the vector and, if desirable, to provide amplification within the host.
  • Suitable prokaryotic hosts for transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and various species within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may, also be employed as a matter of choice.
  • Bacterial vectors may be, for example, bacteriophage-, plasmid- or cosmid-based. These vectors can comprise a selectable marker and bacterial origin of replication derived from commercially available plasmids typically containing elements of the well known cloning vector pBR322 (ATCC 37017).
  • Such commercial vectors include, for example, GEM 1 (Promega Biotec, Madison, WI, USA), pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNHl ⁇ a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pKK232-8, pDR540, and pRIT5 (Pharmacia).
  • Bacterial promoters include lac, T3, T7, lambda P R or P L , tip, and ara.
  • the selected promoter is derepressed/ induced by appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an additional period.
  • appropriate means e.g., temperature shift or chemical induction
  • Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained for further purification.
  • a number of expression vectors may be advantageously selected depending upon the use intended for the protein being expressed. For example, when a large quantity of such a protein is to be produced, for the generation of antibodies or to screen peptide libraries, for example, vectors which direct the expression of high levels of fusion protein products that are readily purified may be desirable.
  • vectors include, but are not limited, to the E. coli expression vector pUR278 (Ruther et al., 1983, EMBO J. 2:1791), in which the coding sequence may be ligated into the vector in frame with the lac Z coding region so that a fiision protein is produced; pIN vectors (Inouye et al 1985, Nucleic Acids Res.
  • pGEX vectors may be used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST).
  • GST glutathione S-transferase
  • fusion proteins are soluble and easily can be purified from lysed cells by adsorption to glutathione-agarose beads followed by elution in the presence of free glutathione.
  • the pGEX vectors are designed to include thrombin or factor Xa protease cleavage sites so that the cloned target gene protein can be released from the GST moiety.
  • full length cDNA sequences are appended with in-frame BamHl sites at the amino terminus and EcoRI sites at the carboxyl terminus using standard PCR methodologies (Innis et al., 1990, supra) and ligated into the pGEX-2TK vector (Pharmacia, Uppsala, Sweden).
  • the resulting cDNA construct contains a kinase recognition site at the amino terminus for radioactive labeling and glutathione S-transferase sequences at the carboxyl terminus for affinity purification (Nilsson, et al. 1985, EMBO J. 4: 1075; Zabeau and Stanley, 1982, EMBO J. 1: 1217.
  • mammalian cell culture systems can also be employed to express recombinant protein.
  • mammalian expression systems include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981), and other cell lines capable of expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa and BHK cell lines.
  • Mammalian expression vectors will comprise an origin of replication, a suitable promoter and enhancer, and also any necessary ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking nontranscribed sequences.
  • DNA sequences derived from the SV40 viral genome for example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide the required nontranscribed genetic elements.
  • Mammalian promoters include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I.
  • Exemplary mammalian vectors include pWLneo, pSV2cat, pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, and pSVL (Pharmacia).
  • Selectable markers include CAT (chloramphenicol transferase).
  • a number of viral-based expression systems may be utilized.
  • the coding sequence of interest may be ligated to an adenovirus transcription/translation control complex, e.g. , the late promoter and tripartite leader sequence.
  • This chimeric gene may then be inserted in the adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region of the viral genome (e.g. , region El or E3) will result in a recombinant virus that is viable and capable of expressing a target protein in infected hosts.
  • a non-essential region of the viral genome e.g. , region El or E3
  • cDNA sequences encoding the full-length open reading frames are ligated into pCMV ⁇ replacing the ⁇ -galactosidase gene such that cDNA expression is driven by the CMV promoter (Alam, 1990, Anal. Biochem. 188: 245-254; MacGregor et al , 1989, Nucl Acids Res. 17: 2365; Norton et al 1985, Mol. Cell Biol. 5: 281).
  • a host cell strain may be chosen which modulates the expression of the inserted sequences, or modifies and processes the gene product in the specific fashion desired. Such modifications (e.g. , glycosylation) and processing (e.g. , cleavage) of protein products may be important for the function of the protein. Different host cells have characteristic and specific mechanisms for the post-translational processing and modification of proteins.
  • Appropriate cell lines or host systems can be chosen to ensure the correct modification and processing of the foreign protein expressed.
  • eukaryotic host cells which possess the cellular machinery for proper processing of the primary transcript, glycosylation, and phosphorylation of the gene product may be used.
  • mammalian host cells include but are not limited to CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, etc.
  • host cells can be transformed with DNA controlled by appropriate expression control elements (e.g. , promoter, enhancer, sequences, transcription terminators, polyadenylation sites, etc.), and a selectable marker.
  • appropriate expression control elements e.g. , promoter, enhancer, sequences, transcription terminators, polyadenylation sites, etc.
  • engineered cells may be allowed to grow for 1-2 days in an enriched media, and then are switched to a selective media.
  • the selectable marker in the recombinant plasmid confers resistance to the selection and allows cells to stably integrate the plasmid into their chromosomes and grow to form foci which in turn can be cloned and expanded into cell lines.
  • This method may advantageously be used to engineer cell lines which express the target protein.
  • Such engineered cell lines may be particularly useful in screening and evaluation of compounds that affect the endogenous activity of the protein.
  • a number of selection systems may be used, including but not limited to the herpes simplex virus thymidine kinase (Wigler, et al , Cell 11:223 (1977)), hypoxanthine-guanine phosphoribosyltransferase(Szybalskaet ⁇ /., Proc. Natl. Acad. Sci. USA 48:2026 (1962)), and adenine phosphoribosyltransferase(Lowy, et al , Cell 22:817 (1980)) genes can be employed in tk " , hgprt " or aprf cells, respectively.
  • antimetabolite resistance can be used as the basis of selection for dhfr, which confers resistance to methotrexate (Wigler, et al. , Proc. Natl. Acad, Sci. USA 77:3567 (1980)); O'Hare, et al , 1981, Proc. Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance to mycophenolic acid (Mulligan et al , Proc. Natl. Acad. Sci. USA 78:2072 (1981)); neo, which confers resistance to the aminoglycoside G-418 (Colberre-Garapin, et al. , 1981, J. Mol. Biol 150: 1); and hydro, which confers resistance to hygromycin (Santerre, et al. , 1984, Gene 30: 147) genes.
  • fusion protein system allows for the ready purificationof non-denatured fusion proteins expressed in human cell lines (Janknecht, et al. , Proc. Natl Acad. Sci. USA 88: 8972-8976 (1991)).
  • the gene of interest is subcloned into a vaccinia-based plasmid such that the gene's open reading frame is translationally fused to an amino-terminal tag consisting of six histidine residues. Extracts from cells infected with recombinant vaccinia virus are loaded onto N? + nitriloacetic acid-agarose columns and histidine-tagged proteins are selectively eluted with imidazole-containing buffers.
  • Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a vector to express foreign genes.
  • the virus grows in Spodoptera frugiperda cells.
  • the target coding sequence may be cloned individually into non-essential regions (for example the polyhedrin gene) of the virus and placed under control of an AcNPV promoter (for example the polyhedrin promoter).
  • Successful insertion of a target gene coding sequence will result in inactivation of the polyhedrin gene and production of non-occluded recombinant virus (i.e., virus lacking the proteinaceous coat coded for by the polyhedrin gene).
  • Recombinant proteins produced may be isolated by host cell lysis. This may be followed by one or more salting-out, aqueous ion exchange or size exclusion chromatography steps. Finally, high performance liquid chromatography (HPLC) can be employed for final purification steps.
  • Microbial cells employed in expression of proteins can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents, like lysozyme and chelators.
  • inclusion bodies are formed in bacterial systems, they may be extracted from cell pellets using, for example, detergents, reducing agents, salts, urea, guanidinium chloride and extremes of pH (e.g. ⁇ 4 or > 10). If denaturation occurs, protein refolding steps (e.g. , dialysis) can be used, as necessary, in completing configuration of the mature protein. If disulfide bridges are present in the native protein, they may be reoxidized using known methods.
  • the recombinant bacterial cells for example E. coli
  • suitable media for example LB
  • IPTG e.g. , lac operator-promoter
  • a higher temperature e.g. , ⁇ cl 857
  • the cells are collected by centrifugation and washed to remove residual media.
  • the bacterial cells are then lysed, for example, by disruption in a cell homogenizer and centrifuged to separate the cell membranes from the soluble cell components.
  • this centrifugation can be performed under conditions whereby the dense inclusion bodies are selectively enriched by incorporation of sugars such as sucrose into the buffer and centrifugation at a selective speed.
  • the inclusion bodies can then be washed in any of several solutions to remove some of the contaminating host proteins, then solubilized in solutions containing high concentrations of urea (e.g. 8M) or chaotropic agents such as guanidinium hydrochloride in the presence of reducing agents such as ⁇ -mercaptoethanolor DTT (dithiothreitol).
  • the protein may be advantageous to incubate the protein for several hours under conditions suitable for the protein to undergo a refolding process into a conformation which more closely resembles that of the native protein.
  • conditions generally include low protein concentrations less than 500 ⁇ g/ml), low levels of reducing agent, concentrations of urea less than 2 M and often the presence of reagents such as a mixmre of reduced and oxidized glutathione which facilitate the interchange of disulphide bonds within the protein molecule.
  • the refolding process can be monitored, for example, by SDS-PAGE or with antibodies which are specific for the native molecule.
  • the protein can then be purified further and separated from the refolding mixture by chromatography on any of several supports including ion exchange resins, gel permeation resins or on a variety of affinity columns.
  • the target protein When used as a component in assay systems such as those described, below, the target protein may be labeled, either directly or indirectly, to facilitate detection of the present res- like molecules either in vitro or in vivo.
  • suitable labeling systems including but not limited to radioisotopes such as 125 I; enzyme labeling systems that generate a detectable colorimetric signal or light when exposed to substrate; and fluorescent labels.
  • fusion proteins that can facilitate labeling, immobilization and/or detection.
  • These fusion proteins may, for example, add amino acids which facilitate further chemical modification. They also may add a functional moiety, such as an enzyme, which directly facilitates detection.
  • the invention further contemplates animal models for studying the function of the present molecules and for overproducing the protein products.
  • the disclosed DNA sequences may be used in conjunction with techniques for producing transgenic animals that are well known to those of skill in the art.
  • target gene sequences may for example be introduced into, and overexpressed in, the genome of the animal of interest, or, if endogenous target gene sequences are present, they may either be overexpressed or, alternatively, be disrupted in order to underexpress or inactivate target gene expression, such as described for the disruption of apoE in mice (Plum et al , Cell 71 : 343-353 (1992)).
  • the coding portion of the target gene sequence may be ligated to a regulatory sequence which is capable of driving gene expression in the animal and cell type of interest.
  • a regulatory sequence which is capable of driving gene expression in the animal and cell type of interest.
  • Such regulatory regions will be well known to those of skill in the art, and may be utilized in the absence of undue experimentation.
  • an endogenous target gene sequence such a sequence may be isolated and engineered such that when reintroduced into the genome of the animal of interest, the endogenous target gene alleles will be inactivated.
  • the engineered target gene sequence is introduced via gene targeting such that the endogenous target sequence is disrupted upon integration of the engineered target gene sequence into the animal ' s genome .
  • Animals of any species including, but not limited to, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, goats, and non-human primates, e.g. , baboons, monkeys, and chimpanzees may be used to generate cardiovascular disease animal models. Goats, cows and sheep are particularly preferred for producing protein in vivo.
  • Any technique known in the art may be used to introduce a target gene transgene into animals to produce the founder lines of transgenic animals.
  • Such techniques include, but are not limited to pronuclear microinjection (Hoppe et al , U.S. Pat. No. 4,873,191 (1989)); retrovirus mediated gene transfer into germ lines (Van der Putten et al. , Proc. Natl. Acad. Sci., USA 82:6148-6152 (1985)); gene targeting in embryonic stem cells (Thompson et al , Cell 56:313-321 (1989)); electroporation of embryos (Lo, Mol. Cell. Biol.
  • the present invention provides for transgenic animals that carry the transgene in all their cells, as well as animals which carry the transgene in some, but not all their cells, i.e. , mosaic animals.
  • the transgene may be integrated as a single transgene or in concatamers, e.g. , head-to-head tandems or head-to-tail tandems.
  • the transgene may also be selectively introduced into and activated in a particular cell type by following, for example, the teaching of Lasko et al. (Lasko et al , Proc. Natl Acad. Sci. USA 89:3232-6236 (1992)).
  • regulatory sequences required for such a cell-type specific activation will depend upon the particular cell type of interest, and will be apparent to those of skill in the art.
  • gene targeting is preferred.
  • vectors containing some nucleotide sequences homologous to the endogenous target gene of interest are designed for the purpose of integrating, via homologous recombination with chromosomal sequences, into and disrupting the function of the nucleotide sequence of the endogenous target gene.
  • the transgene may also be selectively introduced into a particular cell type, thus inactivating the endogenous gene of interest in only that cell type, by following, for example, the teaching of Gu et al. Science 265: 103-106 (1994)).
  • the regulatory sequences required for such a cell-type specific inactivation will depend upon the particular cell type of interest, and will be apparent to those of skill in the art.
  • the expression of the recombinant target gene and protein may be assayed utilizing standard techniques. Initial screening may be accomplished by Southern blot analysis or PCR techniques to analyze animal tissues to assay whether integration of the transgene has taken place. The level of mRNA expression of the transgene in the tissues of the transgenic animals may also be assessed using techniques which include but are not limited to Northern blot analysis of tissue samples obtained from the animal, in situ hybridization analysis, and RT-PCR. Samples of target gene-expressing tissue, may also be evaluated immunocytochemically using antibodies specific for the target gene transgene gene product of interest.
  • transgenic animals that express target gene mRNA or target gene transgene peptide should then be further evaluated to identify those animals which display characteristic increased susceptibility to carcinogenesis. Additionally, specific cell types within the transgenic animals may be analyzed and assayed in vitro for cellular phenotypes characteristic of mutant phenotype.
  • target gene transgenic founder animals may be bred, inbred, outbred, or crossbred to produce colonies of the particular animal.
  • breeding strategies include but are not limited to: outbreeding of founder animals with more than one integration site in order to establish separate lines; inbreeding of separate lines in order to produce compound target gene transgenics that express the target gene transgene of interest at higher levels because of the effects of additive expression of each target gene transgene; crossing of heterozygous transgenic animals to produce animals homozygous for a given integration site in order both to augment expression and eliminate the possible need for screening of animals by DNA analysis; crossing of separate homozygous lines to produce compound heterozygous or homozygous lines; breeding animals to different inbred genetic backgrounds so as to examine effects of modifying alleles on expression of the target gene transgene and the possible development of carcinogenesis.
  • One such approach is to cross the target gene transgenic founder animals with a wild type strain to produce an Fl generation that exhibits increased susceptibility to carcinogenesis.
  • the Fl generation may then be inbred in order to develop a homozygous line, if it is found that homozygous target gene transgenic animals are viable.
  • a genomic fragment is cleaved with a restriction endonuclease and a heterologous cassette containing a neomycin-resistancegene is inserted at the cleavage site.
  • a suitable cassette is the GTI-II neo cassette described by Lufkin et al , Cell 66:1105 (1991).
  • the modified genomic fragment is cloned into a suitable targeting vector that is introduced into murine embryonic stem cells by electroporation. Cells that have undergone homologous recombination (and hence disruption of the gene) are selected by resistance to G418, and used to generate chimeric mice using well known methods. See Lufkin et al, supra. Traditional breeding methods then can be used to generate mice that are homozygous for the disrupted gene.
  • mice that are homozygous for the mutation then can be studied to provide insights into the role of the protein in, for example, carcinogenesis. These mice also can be used as models for developing new treatments for cancers. If this mutation is lethal in homozygous mice (for example during embryogenesis) heterozygous mice, which express only half the amount of the protein can also be studied.
  • control of cellular proliferation can be restored by gene therapy methods.
  • overexpression of the protein can be counteracted by concurrent expression of an antisense molecule that binds to and inhibits expression of the mRNA encoding the protein.
  • overexpression can be inhibited in an analogous manner using a ribozyme that cleaves the mRNA.
  • concomitant expression of the non-mutated molecule via introduction of an exogenous gene may be used.
  • Each of these methods requires a system for introducing a vector into the cells containing the mutated gene.
  • the vector encodes either an antisense or ribozyme transcript of the inventive protein.
  • the construction of a suitable vector can be achieved by any of the methods well-known in the art for the insertion of exogenous DNA into a vector. See, e.g. , Sambrook et al, Molecular Cloning (Cold Spring Harbor Press 2d ed. 1989), which is incorporated herein by reference.
  • the prior art teaches various methods of introducing exogenous genes into cells in vivo. See Rosenberg et al.
  • the routes of delivery include systemic admimstration and admimstration in situ.
  • Well-known techniques include systemic administration with cationic liposomes, and admimstration in situ with viral vectors.
  • Any one of the gene delivery methodologies described in the prior art is suitable for the introduction of a recombinant vector containing an inventive gene according to the invention into a MTX-resistant, transport-deficient cancer cell.
  • a listing of present-day vectors suitable for the purpose of this invention is set forth in Hodgson, Bio /Technology 13: 222 (1995), which is incorporated by reference.
  • liposome-mediated gene transfer is a suitable method for the introduction of a recombinant vector containing an inventive gene according to the invention into a MTX-resistant, transport-deficient cancer cell.
  • a cationic liposome such as DC-Chol/DOPE liposome
  • DC-Chol/DOPE liposome has been widely documented as an appropriate vehicle to deliver DNA to a wide range of tissues through intravenous injection of DNA/cationic liposome complexes. See Caplen et al , Nature Med. 1:39-46 (1995) and Zhu et al, Science 261:209- 211 (1993), which are herein incorporated by reference.
  • Liposomes transfer genes to the target cells by fusing with the plasma membrane.
  • liposome-DNA complex has no inherent mechanism to deliver the DNA to the nucleus. As such, the most of the lipid and DNA gets shunted to cytoplasmic waste systems and destroyed.
  • liposomes as a gene therapy vector is that liposomes contain no proteins, which thus minimizes the potential of host immune responses.
  • viral vector-mediated gene transfer is also a suitable method for the introduction of the vector into a target cell.
  • Appropriate viral vectors include adenovirus vectors and adeno-associated virus vectors, retrovirus vectors and herpesvirus vectors.
  • Adenoviruses are linear, double stranded DNA viruses complexed with core proteins and surrounded by capsid proteins.
  • the common serotypes 2 and 5 which are not associated with any human malignancies, are typically the base vectors.
  • the virus becomes a replication deficient vector capable of transferring the exogenous DNA to differentiated, non-proliferating cells.
  • the adenovirus fibre interacts with specific receptors on the cell surface, and the adenovirus surface proteins interact with the cell surface integrins.
  • the virus penton-cell integrin interaction provides the signal that brings the exogenous gene-containing virus into a cytoplasmic endosome.
  • adenovirus breaks out of the endosome and moves to the nucleus, the viral capsid falls apart, and the exogenous DNA enters the cell nucleus where it functions, in an epichromosomal fashion, to express the exogenous gene.
  • adenoviral vectors for gene therapy can be found in Berkner, Biotechniques 6:616-629 (1988) and Trapnell, Advanced Drug Delivery Rev. 12: 185-199 (1993), which are herein incorporated by reference.
  • Adenovirus-derived vectors particularly non-replicative adenovirus vectors, are characterized by their ability to accommodate exogenous DNA of 7.5 kB, relative stability, wide host range, low pathogenicity in man, and high titers (10 4 to 10 5 plaque forming units per cell). See Stratford- Perricaudet et al , PNAS 89:2581 (1992).
  • Adeno-associated virus (AAV) vectors also can be used for the present invention.
  • AAV is a linear single-stranded DNA parvovirus that is endogenous to many mammalian species.
  • AAV has a broad host range despite the limitation that AAV is a defective parvovirus which is dependent totally on either adenovirus or herpesvirus for its reproduction in vivo.
  • AAV as a vector for the introduction into target cells of exogenous DNA is well-known in the art. See, e.g. , Lebkowski et al , Mole. & Cell. Biol 8:3988 (1988), which is incorporated herein by reference.
  • the capsid gene of AAV is replaced by a desired DNA fragment, and transcomplementation of the deleted capsid function is used to create a recombinant virus stock. Upon infection the recombinant virus uncoats in the nucleus and integrates into the host genome.
  • retroviral vector-mediated gene transfer Another suitable virus-based gene delivery mechanism is retroviral vector-mediated gene transfer.
  • retroviral vectors are well-known in the art. See Breakfield et al. , Mole. Neuro. Biol 1:339 (1987) and Shih et al , in Vaccines 85: 177 (Cold Spring Harbor Press 1985).
  • a variety of retroviral vectors and retroviral vector-producing cell lines can be used for the present invention.
  • retroviral vectors include Moloney Murine Leukemia Virus, spleen necrosis virus, and vectors derived from retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma Virus, avian leukosis virus, human immunodeficiency virus, myeloproliferative sarcoma virus, and mammary tumor virus. These vectors include replication-competent and replication-defective retroviral vectors. In addition, amphotropic and xenotropic retroviral vectors can be used. In carrying out the invention, retroviral vectors can be introduced to a tumor directly or in the form of free retroviral vector producing-cell lines.
  • Suitable producer cells include fibroblasts, neurons, glial cells, keratinocytes, hepatocytes, connective tissue cells, ependymal cells, chromaffin cells. See Wolff et al, PNAS 84:3344 (1989).
  • Retroviral vectors generally are constructed such that the majority of its structural genes are deleted or replaced by exogenous DNA of interest, and such that the likelihood is reduced that viral proteins will be expressed. See Bender et al, J. Virol. 61: 1639 (1987) and Armento et al, J. Virol 61:1647 (1987), which are herein incorporated by reference.
  • a retroviral vector employed in the present invention must integrate into the genome of the host cell genome, an event which occurs only in mitotically active cells. The necessity for host cell replication effectively limits retroviral gene expression to tumor cells, which are highly replicative, and to a few normal tissues.
  • the normal tissue cells theoretically most likely to be transduced by a retroviral vector therefore, are the endothelial cells that line the blood vessels that supply blood to the tumor.
  • a retroviral vector would integrate into white blood cells both in the tumor or in the blood circulating through the tumor.
  • retroviral vector to normal tissues, however, is limited.
  • the local administration to a tumor of a retroviral vector or retroviral vector producing cells will restrict vector propagation to the local region of the tumor, minimizing transduction, integration, expression and subsequent cytotoxic effect on surrounding cells that are mitotically active.
  • replicatively deficient and replicatively competent retroviral vectors can be used in the invention, subject to their respective advantages and disadvantages.
  • the direct injection of cell lines that produce replication-deficient vectors may not deliver the vector to a large enough area to completely eradicate the tumor, since the vector will be released only form the original producer cells and their progeny, and diffusion is limited.
  • Similar constraints apply to the application of replication deficient vectors to tumors that grow slowly, such as human breast cancers which typically have doubling times of 30 days versus the 24 hours common among human gliomas.
  • the much shortened survival-time of the producer cells probably no more than 7-14 days in the absence of immunosuppression, limits to only a portion of their replicative cycle the exposure of the tumor cells to the retroviral vector.
  • replication-defective retroviruses for treating mmors requires producer cells and is limited because each replication-defective retrovirus particle can enter only a single cell and cannot productively infect others thereafter. Because these replication- defective retroviruses cannot spread to other tumor cells, they would be unable to completely penetrate a deep, multilayered tumor in vivo. See Markert et al, Neurosurg. 77: 590 (1992).
  • the injection of replication-competent retroviral vector particles or a cell line that produces a replication-competent retroviral vector virus may prove to be a more effective therapeutic because a replication competent retroviral vector will establish a productive infection that will transduce cells as long as it persists.
  • replicatively competent retroviral vectors may follow the tumor as it metastasizes, carried along and propagated by transduced tumor cells.
  • the risks for complications are greater, with replicatively competent vectors, however.
  • Such vectors may pose a greater risk then replicatively deficient vectors of transducing normal tissues, for instance.
  • the risks of undesired vector propagation for each type of cancer and affected body area can be weighed against the advantages in the situation of replicatively competent verses replicatively deficient retroviral vector to determine an optimum treatment.
  • amphotropic and xenotropic retroviral vectors may be used in the invention.
  • Amphotropic viruses have a very broad host range that includes most or all mammalian cells, as is well known to the art.
  • Xenotropic viruses can infect all mammalian cell cells except mouse cells.
  • amphotropic and xenotropic retroviruses from many species, including cows, sheep, pigs, dogs, cats, rats, and mice, inter alia can be used to provide retroviral vectors in accordance with the invention, provided the vectors can transfer genes into proliferating human cells in vivo.
  • Retroviral vector- containing cells have been implanted into brain tumors growing in human patients. See Oldfield et al, Hum. Gene Ther. 4: 39 (1993). These retroviral vectors carried the HSV-1 thymidine kinase (HSV-tk) gene into the surrounding brain tumor cells, which conferred sensitivity of the mmor cells to the antiviral drug ganciclovir.
  • HSV-1 thymidine kinase HSV-1 thymidine kinase
  • herpesvirus vector- mediated gene transfer Yet another suitable virus-based gene delivery mechanism is herpesvirus vector- mediated gene transfer. While much less is known about the use of herpesvirus vectors, replication-competent HSV-1 viral vectors have been described in the context of antitumor therapy. See Martuza et al, Science 252: 854 (1991), which is incorporated herein by reference. DIAGNOSTIC METHODS
  • the present invention also contemplates, for certain molecules described below, methods for diagnosis of human disease.
  • patients can be screened for the occurrence of cancers, or likelihood of occurrence of cancers, associated with mutations in the encoded protein.
  • DNA from tumor tissue obtained from patients suffering from cancer can be isolated and the gene encoding the protein can be sequenced.
  • mutations in the gene that are associated with a malignant cellular phenotype can be identified.
  • correlation of the nature of the observed mutations with subsequent observed clinical outcomes allows development of prognostic model for the predicted outcome in a particular patient.
  • PCR primers can be selected that flank known mutation sites, and the PCR products can be sequenced to detect the occurrence of the mutation.
  • the 3 ' residue of one PCR primer can be selected to be a match only for the residue found in the unmutated gene. If the gene is mutated, there will be a mismatch at the 3' end of the primer, and primer extension cannot occur, and no PCR product will be obtained.
  • primer mixtures can be used where the 3' residue of one primer is any nucleotide other than the nonmutated residue.
  • antibodies can be generated that selectively bind either mutated or non- mutated protein.
  • the antibodies then can be used to screen tissue samples for occurrence of mutations in a manner analogous to the DNA-based methods described supra.
  • the diagnostic methods described above can be used not only for diagnosis and for prognosis of existing disease, but may also be used to predict the likelihood of the future occurrence of disease.
  • clinically healthy patients can be screened for mutations in the inventive molecule that correlate with later disease onset. Such mutations may be observed in the heterozygous state in healthy individuals. In such cases a single mutation event can effectively disable proper functioning of the gene and induce a transformed or malignant phenotype.
  • This screening also may be carried out prenatally or neonatally.
  • DNA molecules according to the invention also are well suited for use in so-called "gene chip" diagnostic applications. Such applications have been developed by, inter alia, Synteni and Affymetrix.
  • all or part of the DNA molecules of the invention can be used either as a probe to screen a polynucleotide array on a "gene chip,” or they may be immobilized on the chip itself and used to identify other polynucleotides via hybridization to the surface of the chip.
  • gene chips have particular application for diagnosis of disease, or in forensic analysis to detect the presence or absence of an analyte. Suitable chip technology is described for example, in Wodicka et al , Nature Biotechnology, 15: 1359 (1997) which is hereby incorporated by reference in its entirety, and references cited therein.
  • inventive protein molecules will interact with another class of cellular proteins. This is particularly true of those molecule containing leucine zipper motifs.
  • Any method suitable for detecting protein-protein interactions can be employed for identifying interacting targets.
  • traditional methods which can be employed are co- immunoprecipitation, crosslinking and co-purification through gradients or chromatographic columns. Utilizing procedures such as these allows for the identification of GAP gene products.
  • a GAP protein can be used, in conjunction with standard techniques, to identify its corresponding pathway gene. For example, at least a portion of the amino acid sequence of the pathway gene product can be ascertained using techniques well known to those of skill in the art, such as via the Edman degradation technique (see, e.g.. Creighton, 1983, PROTEINS: STRUCTURES AND MOLECULAR PRINCIPLES, W.H. Freeman & Co. , N. Y.
  • the amino acid sequence obtained can be used as a guide for the generation of oligonucleotide mixtures that can be used to screen for pathway gene sequences. Screening can be accomplished, for example, by standard hybridization or PCR techniques. Techniques for the generation of oligonucleotide mixtures and for screening are well-known. (See e.g. , Ausubel, supra, and PCR PROTOCOLS: A GUIDE TO METHODS AND APPLICATIONS, 1990, Innis et al. , eds. Academic Press, Inc. , New York). Additionally, methods can be employed which result in the simultaneous identification of interacting target genes.
  • plasmids are constructed that encode two hybrid proteins: one consists of the DNA-binding domain of a transcription activator protein fused to a known protein, in this case an inventive protein, and the other contains the activator protein's activation domain fused to an unknown protein (a putative GAP, for instance) that is encoded by a cDNA which has been recombined into this plasmid as part of a cDNA library.
  • the plasmids are transformed into a strain of the yeast Saccharomyces cerevisiae that contains a reporter gene (e.g., lacZ) whose regulatory region contains the transcription activator's binding sites.
  • the two-hybrid system or related methodology can be used to screen activation domain libraries for proteins that interact with a known "bait" gene product.
  • gene products known to be involved in TH cell subpopulation-related disorders and/or differentiation, maintenance, and/or effector function of the subpopulations can be used as the bait gene products.
  • Total genomic or cDNA sequences are fused to the DNA encoding on activation domain.
  • This library and a plasmid encoding a hybrid of the bait gene product fused to the DNA-binding domain are cotransformed into a yeast reporter strain, and the resulting transformants are screened for those that express the reporter gene.
  • the bait gene can be cloned into a vector such that it is translationally fused to the DNA encoding the DNA- binding domain of the GAL4 protein. These colonies are purified and the library plasmids responsible for reporter gene expression are isolated. DNA sequencing is then used to identify the proteins encoded by the library plasmids.
  • cDNA library plates and clones originated from five cDNA libraries that were constructed by directional cloning. These are available through the Resource Center (http://www.rzpd.de) of the German Genome Project.
  • the hfbr2 human fetal brain; RZPD number DKFZp564
  • hfkd2 human fetal kidney; DKFZp566
  • Smart kit Chip (Clontech), except that PCR was carried out with primers that contained uracil residues to permit directional cloning without restriction digestion and ligation, and were complementary with the pAMPl (LifeTechnologies) cloning sites for directional cloning.
  • the htes3 (human testes; DKFZp434), hutel (human uterus; DKFZp586) and hmcfl (human mammary carcinoma; DKFZp727) libraries are conventional (Gubler, U., Hoffman, B.J., (1983), A simple and very efficient method for generating cDNA libraries. Gene 25, 263-269), size-selected cDNA libraries. They are cloned into pSPORTl (LifeTechnologies) via a Notl site which is introduced during reverse transcription downstream of the oligo dT primer and a Sail site that is introduced by the ligation of a adapters.
  • the human mammary carcinoma library was constructed fgrom MCF7 cells.
  • the cDNA sequences of this application were first identified among the sequences comprising various libraries. Technology has advanced considerably since the first cDNA libraries were made. Many small variations in both chemicals and machinery have been instituted over time, and these have improved both the efficiency and safety of the process. Although the cDNAs could be obtained using an older procedure, the procedure presented in this application is exemplary of one currently being used by persons skilled in the art. For the purpose of providing an exemplary method, the mRNA isolation and cDNA library construction described here is for the MCF-7 library (DKFZp727) from which the clones named DKFZphmcfl xxyyxx were obtained.
  • the human cell line MCF-7 was grown in DMEM supplemented with 10% fetal calf serum until confluency. 3 X 10 8 cells were harvested with a cell scraper in PBS. Cells were lysed in buffer containing 0.5 % NP-40 to leave the nuclei intact. The debris was pelleted by centrifugation at 15 000 x g for 10 minutes at 4 degrees Celsius. Proteins in the supernatant were degraded in presence of SDS and Proteinase K (30 minutes at 56 degrees Celsius). Precipitation of proteins was done in a Phenol/Chloroform extraction, RNA was precipitated from the aqueous phase with Na-acetate and Ethanol. Polyadenylated messages were isolated using Qiagen Oligotex (QIAGEN, Hilden Germany).
  • First strand cDNA synthesis was accomplished using an oligo (dT) primer which also contained an Notl restriction site.
  • Second strand synthesis was performed using a combination of DNA polymerase I, E. coli ligase and RNase H, followed by the addition of a Sail adaptor to the blunt ended cDNA.
  • the Sail adapted, double-stranded cDNA was then digested with Notl restriction enzyme, and fractionated by size on an agarose gel. DNA of the appropriate size was cut from the gel and cast into a second gel in a 90° angle. After electrophoresis in the second dimension, cDNA of the appropriate size was cut from the gel.
  • the agarose block was broken down with help of gelase.
  • the cDNA was purified with help of two phenol extractions and an ethanol precipitation.
  • the cDNA was ligated into Sall/Notl pre-digested pSportl vector (LifeTechnologies) and transformed into DH10B bacteria.
  • the libraries were arrayed into 384-well microtiter plates and spotted on high density nylon membranes for hybridization analysis. Filters and clones are available through the Resource Center. Whole plates were distributed to the sequencing partners of the consortium for systematic sequencing.
  • EST-sequence was blasted against the cDNA consortiums own database and after that against public databases and (with BLASTn and BLASTx against EMBL/EMBLNEW and assembled ESTs, please refer to EXAMPLE III: Bioinformatics analysis of full length cDNAs, for description and parameter settings). ESTs which were identical to known genes in more than 100 bp, with less than 2 mismatches, were excluded from further analysis.
  • ORFs Open reading frames
  • MIPS Munich Information Center for Protein Sequences
  • a script developed by MIPS computed the GC -content of the rl -sequence, which should be >40%. Writing similar scripts is within the ordinary skill of one in bioinformatics.
  • a very good ORF had at least one BLASTx match to other proteins.
  • a "good ORF” should extend to the 3' end and be longer than ⁇ 40 codons. If the ORF started in the rl sequence, in front of the potential start codon, there should not exist too many competing start codons in frame with the ORF start codon and the start should match the Kozak consensus ATG. If the EST sequence was to short to decide according to the potential ORF, and there were only a few or no start codons in the sequence the GC content of the Sequence should be greater than 40%. The rl sequences needed not contain an polyA-tail at the 3' end. In addition, the results of the blasting against the assembled human ESTs could help in questionable cases to decide whether to stop or to continue. A hit against these ESTs was an indication to go further.
  • Walking primers were generally designed using software (e.g. Haas, S., Vingron, M., Poustka, A., Wiemann, S. (1998) Primer design in large-scale sequencing. Nucleic Acids Res. 26, 3006-3012, Schwager, C, Wiemann, S., Ansorge, W. (1995) GeneSkipper: integrated software environment for DNA sequence assembly and alignment. HUGO Genome Digest 2, 8-9) that permitted complete automation of this usually time consuming process and helped in the parallel processing of large numbers of clones.
  • software e.g. Haas, S., Vingron, M., Poustka, A., Wiemann, S. (1998) Primer design in large-scale sequencing. Nucleic Acids Res. 26, 3006-3012, Schwager, C, Wiemann, S., Ansorge, W. (1995) GeneSkipper: integrated software environment for DNA sequence assembly and alignment. HUGO Genome Digest 2, 8-9) that permitted complete automation of this usually time consuming process and helped in the parallel processing of large numbers
  • An HSP consists of two sequence fragments of arbitrary but equal lengths whose alignment is locally maximal and for which the alignment BLAST approach is to look threshold or cut off score set by the user.
  • BLAST looks for HSPs between a query sequence and a database sequence, to evaluate the statistical significance of any matches found, and to report only those matches which satisfy the user-selected threshold of significance.
  • the parameter E establishes the statistically significant threshold for reporting database sequence matches.
  • E is interpreted as the upper bound of the expected frequency of chance occurrence of an HSP (or set of HSPs) within the context of the entire database search. Any database sequence whose match satisfies E is reported in the program output.
  • the cDNA-sequences were blasted against EMBL-STS to determine STS-sequence- match to the cDNA, thus providing a mapping information to the new cDNA.
  • the potential protein-sequences were generated automatically by a script searching for the longest open reading frame (ORF) in each of the three forward frames with a minimum length of 90 codons.
  • ORF open reading frame
  • Plasmids of cDNA-GFP fusions were transfected into mammalian tissue culture cells and allowed to express the proteins for up to 48 hours. Live cells were imaged at 24 hours and 48 hours after transfection and the localisations recorded. The chart, below, depicts the apparent final cellular localisations of 107 cDNA-GFP fusions.
  • Each cDNA in turn was subjected to bioinformatic analysis. Where possible, the potential subcellular localisations of the expressed proteins were determined. This information was then compared to the actual localisations determined from expression of the GFP-fusion proteins in mammalian cells.
  • DKFZphfbr 2 _16cl6 , 3 encodes a novel 586 amino acid 1 protein with .similarity to the human actin binding protein MAYVEN and Drosophila Kelch.
  • AVEN is a novel actin binding protein predominantly expressed in brain.
  • Drosophila kelch is involved in the maintenance of ring canal organization during oogenesis.
  • the amino half of the protein including the BTB domain mediates dimerization, while the amino half might allow cross-linking of ring canal actin filaments, thus organising the inner rim cytoskeleton.
  • the kelch repeat domain is necessary for ring canal localisation and believed to mediate an additional interaction, possibly with actin.
  • the new protein shares the features of both proteins and therefore should be involved in the organisation of cyto skeleton binding to membrane proteins .
  • the new protein can find application in modulating/blocking of cyto skeleton-membrane protein interaction.
  • Drosophila kelch is an oligomeric ring canal actin organizer.
  • KIAA0132 Human mRNA for KIAA0132 gene, complete eds. Homo sapiens (human)
  • DKFZphfbr2_16f21 encodes a novel 208 ammo acid protein with strong similarity to human zinc finger protein 216.
  • the novel protein shows strong similarity to the human zinc finger protein 216, but has no Zn finger.
  • PROSITE Contains no Zinc finger; No informative BLAST results; no predictive prosite, pfam or SCOP motife
  • the new protein can find application in studying the expression profile of bram-specific genes .
  • Entry AF062072_1 from database TREMBL gene: "ZNF216”; product: “zinc finger protein 216”; Homo sapiens zinc finger protein 216 (ZNF216) gene, complete eds.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Medicinal Chemistry (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Toxicology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
EP00966368A 1999-08-18 2000-08-18 Humane dns sequenzen Withdrawn EP1248798A2 (de)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US15650399P 1999-08-18 1999-08-18
US14949999P 1999-08-18 1999-08-18
US149499P 1999-08-18
US156503P 1999-09-28
PCT/IB2000/001496 WO2001012659A2 (en) 1999-08-18 2000-08-18 Human dna sequences

Publications (1)

Publication Number Publication Date
EP1248798A2 true EP1248798A2 (de) 2002-10-16

Family

ID=26846790

Family Applications (1)

Application Number Title Priority Date Filing Date
EP00966368A Withdrawn EP1248798A2 (de) 1999-08-18 2000-08-18 Humane dns sequenzen

Country Status (3)

Country Link
EP (1) EP1248798A2 (de)
AU (1) AU7680300A (de)
WO (1) WO2001012659A2 (de)

Families Citing this family (54)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7771719B1 (en) 2000-01-11 2010-08-10 Genentech, Inc. Pharmaceutical compositions, kits, and therapeutic uses of antagonist antibodies to IL-17E
US20040058355A1 (en) 1998-09-30 2004-03-25 Millennium Pharmaceuticals, Inc. Novel 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, H1983, M1983, 38555 or 593 molecules and uses therefor
AU5446900A (en) 1999-05-28 2000-12-18 Immunex Corporation Novel murine and human kinases
US7001752B1 (en) 1999-05-28 2006-02-21 Immunex Corporation Murine and human kinases
KR100543857B1 (ko) * 1999-09-01 2006-01-23 제넨테크, 인크. 혈관신생 및 심혈관형성의 촉진 또는 억제
WO2001021651A2 (en) * 1999-09-24 2001-03-29 Lexicon Genetics Incorporated Novel human protease inhibitor-like proteins and polynucleotides encoding the same
CA2391805A1 (en) * 1999-10-06 2001-04-12 The Regents Of The University Of California Differentially expressed genes associated with her-2/neu overexpression
US6770477B1 (en) 1999-10-06 2004-08-03 The Regents Of The University Of California Differentially expressed genes associated with HER-2/neu overexpression
DK1897945T3 (da) * 1999-12-23 2012-05-07 Genentech Inc IL-17 homologe polypeptider og terapeutiske anvendelser deraf.
WO2001059114A2 (en) * 2000-02-09 2001-08-16 The Government Of The United States Of America, As Represented By The Secretary Of Health And Human Services Tumor suppressor gene, p47ing3
EP1259620A2 (de) * 2000-02-29 2002-11-27 Millennium Pharmaceuticals, Inc. 2504, 15977, und 14760 : mitglieder der protein kinase familie und deren verwendungen
AR028256A1 (es) * 2000-03-16 2003-04-30 Amgen Inc Moleculas similares a receptores il-17 y usos de las mismas
CA2407656A1 (en) * 2000-04-27 2001-11-08 Kyowa Hakko Kogyo Co. Ltd. Myocardial cell proliferation-associated genes
US20010049106A1 (en) * 2000-04-27 2001-12-06 Leonard Buckbinder ADAMTS polypeptides, nucleic acids encoding them, and uses thereof
WO2001085942A2 (en) * 2000-05-05 2001-11-15 Incyte Genomics, Inc. Cytoskeleton-associated proteins
US6919185B2 (en) 2000-05-31 2005-07-19 Bayer Aktiengesellschaft Regulation of human transketolase-like enzyme
US20030096969A1 (en) 2000-06-02 2003-05-22 Genentech, Inc. Secreted and transmembrane polypeptides and nucleic acids encoding the same
EP1254912A1 (de) * 2001-04-30 2002-11-06 Boehringer Ingelheim International GmbH CYK-4 Polypeptide, dafür kodierende DNS Moleküle und deren Verwendung in Screeningverfahren
EP1297131A2 (de) * 2000-06-19 2003-04-02 Boehringer Ingelheim International GmbH Cyk-4 polypeptide, dafür kodierende dns-moleküle und deren verwendung zum screeningsverfahren
US20030105288A1 (en) * 2000-07-25 2003-06-05 Genentech, Inc. Secreted and transmembrane polypeptides and nucleic acids encoding the same
US6534309B1 (en) * 2000-08-03 2003-03-18 Cytokinetics, Inc. Motor proteins and methods for their use
CA2423104A1 (en) * 2000-09-20 2002-03-28 Pe Corporation (Ny) Isolated human transporter proteins, nucleic acid molecules encoding human transporter proteins, and uses thereof
WO2002024921A2 (en) * 2000-09-25 2002-03-28 Millennium Pharmaceuticals, Inc. 3700, a novel human protein kinase and uses therefor
TW499433B (en) 2000-12-27 2002-08-21 Ind Tech Res Inst Novel human Α1-chain collagen
US20030104385A1 (en) * 2001-08-02 2003-06-05 Evans Glen A. Nucleic acids and encoded polypeptides associated with bipolar disorder
AU2002328200B2 (en) * 2001-09-27 2008-01-03 Bionomics Limited DNA sequences for human angiogenesis genes
NZ543295A (en) * 2001-09-27 2008-04-30 Bionomics Ltd DNA sequences for human angiogenesis genes
DE60234713D1 (de) * 2001-12-18 2010-01-21 Endocube Sas Neue mit dem zelltod assoziierte proteine aus der thap familie und par4 verwandte signalwege, die bei der kontrolle von apoptosis involviert sind
US7858297B2 (en) 2001-12-18 2010-12-28 Centre National De La Recherche Scientifique Cnrs Chemokine-binding protein and methods of use
JP2004097199A (ja) * 2002-07-15 2004-04-02 Keio Gijuku 腫瘍抗原のスクリーニング方法
US20060257387A1 (en) * 2002-08-07 2006-11-16 National Institute Of Advanced Industrial Science And Technology Sugar chain synthase gene
JPWO2004018669A1 (ja) * 2002-08-21 2005-12-08 株式会社プロテイン・エクスプレス 塩誘導性キナーゼ2及びその用途
AU2003291625B2 (en) * 2002-09-16 2009-10-08 Genentech, Inc. Compositions and methods for the treatment of immune related diseases
ATE335816T1 (de) 2002-09-18 2006-09-15 Sirs Lab Gmbh Verfahren zum nachweis prokaryontischer dna
WO2004035789A1 (en) * 2002-10-18 2004-04-29 Lg Life Sciences Ltd. Gene families associated with cancers
US7423018B2 (en) * 2002-12-13 2008-09-09 University Of Massachusetts Kinesin-like proteins and methods of use
US7632640B2 (en) * 2003-12-08 2009-12-15 The Clinic For Special Children Association of TSPYL polymorphisms with SIDDT syndrome
DK1723233T3 (da) * 2004-03-05 2012-07-23 Sirs Lab Gmbh Fremgangsmåde til berigelse og/eller adskillelse prokaryotisk DNA ved hjælp af et protein, som specifikt binder DNA indeholdende ikke-methylerede CPG-motiver
EP1732382A4 (de) * 2004-03-17 2009-01-07 Univ Virginia Sfec, ein spermien flagellares energieträger-protein
EA012799B1 (ru) 2004-08-16 2009-12-30 Кварк Фармасьютикалс, Инк. Терапевтические применения ингибиторов rtp801
WO2006029176A2 (en) * 2004-09-08 2006-03-16 Ludwig Institute For Cancer Research Cancer-testis antigens
US7332591B2 (en) * 2004-12-21 2008-02-19 The University Of Iowa Research Foundation Bardet-Biedl susceptibility gene and uses thereof
US7825099B2 (en) 2006-01-20 2010-11-02 Quark Pharmaceuticals, Inc. Treatment or prevention of oto-pathologies by inhibition of pro-apoptotic genes
NL2000439C2 (nl) 2006-01-20 2009-03-16 Quark Biotech Therapeutische toepassingen van inhibitoren van RTP801.
WO2008054534A2 (en) 2006-05-11 2008-05-08 Quark Pharmaceuticals, Inc. Screening systems utilizing rtp801
WO2007141796A2 (en) 2006-06-09 2007-12-13 Quark Pharmaceuticals, Inc. Therapeutic uses of inhibitors of rtp801l
US7872119B2 (en) 2007-02-26 2011-01-18 Quark Pharmaceuticals, Inc. Inhibitors of RTP801 and their use in disease treatment
US8614311B2 (en) 2007-12-12 2013-12-24 Quark Pharmaceuticals, Inc. RTP801L siRNA compounds and methods of use thereof
KR101508086B1 (ko) 2008-05-05 2015-04-07 노비뮨 에스 에이 항-il-17a/il-17f 교차-반응성 항체 및 그의 사용 방법
WO2010108926A1 (en) * 2009-03-24 2010-09-30 Inserm (Institut National De La Sante Et De La Recherche Medicale) Method for diagnosing or predicting a non syndromic autosomal recessive optic atrophy, or a risk of a non syndromic autosomal recessive optic atrophy.
RU2605318C2 (ru) 2009-05-05 2016-12-20 Новиммун С.А. Анти-il-17f антитела и способы их применения
CN103421742B (zh) * 2013-05-31 2015-01-28 华中农业大学 抗伏马菌素b1的单克隆重组抗体h2
CN103421743B (zh) * 2013-05-31 2015-01-28 华中农业大学 抗伏马菌素b1的单克隆重组抗体h7
DE102020210725A1 (de) * 2019-10-31 2021-05-06 Gelita Ag Ernährungsphysiologisch optimiertes Kollagenpeptid

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5976837A (en) * 1997-03-14 1999-11-02 Genetics Institute, Inc. Secreted proteins and polynucleotides encoding them
WO2000009552A1 (en) * 1998-08-14 2000-02-24 Genetics Institute, Inc. Secreted proteins and polynucleotides encoding them

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO0112659A3 *

Also Published As

Publication number Publication date
WO2001012659A2 (en) 2001-02-22
AU7680300A (en) 2001-03-13
WO2001012659A3 (en) 2002-06-20

Similar Documents

Publication Publication Date Title
EP1248798A2 (de) Humane dns sequenzen
AU2023214237A1 (en) Modified polynucleotides for the production of biologics and proteins associated with human disease
KR102630357B1 (ko) 단백질 발현이 어려운 다중-부위 ssi 세포
AU2018223041A1 (en) Polynucleotides encoding rodent antibodies with human idiotypes and animals comprising same
EP0973896A2 (de) Sekretierte exprimierte sequenzkennzeichen (sests)
AU2016376191A1 (en) Materials and methods for treatment of amyotrophic lateral sclerosis and/or frontal temporal lobular degeneration
AU2016364667A1 (en) Materials and methods for treatment of Alpha-1 antitrypsin deficiency
EP0973898A2 (de) Sekretierte exprimierte etikettiersequenzen (sests)
JP2003088388A (ja) 新規な全長cDNA
EP1165784A2 (de) Nukleinsäuren mit offener leserahmen, die polypeptiden kodieren; "orfx"
JP2003135075A (ja) 新規な全長cDNA
KR20190129857A (ko) 아데노-관련 바이러스 생산을 위한 포유동물 세포
WO1995014772A1 (fr) Signature genique
KR20180093902A (ko) 태아와 임신 여성간에 상이하게 메틸화된 디엔에이 영역을 이용한 태아 염색체 이수성의 검출
WO1998045437A2 (en) SECRETED EXPRESSED SEQUENCE TAGS (sESTs)
WO1999055858A2 (de) Menschliche nukleinsäuresequenzen aus pankreastumorgewebe
KR20220054401A (ko) 감염의 숙주 rna 바이오마커의 신속한 조기-검출 및 인간의 covid-19 코로나바이러스 감염의 조기 식별을 위한 시스템, 방법 및 조성물
AU2016202635B2 (en) Method for assessing embryotoxicity
JP2003156489A (ja) 痛みに関連する分子の同定及び使用
KR102046839B1 (ko) 대장암의 시험관내 진단 또는 예후 예측 방법
KR20190104400A (ko) 다중 중쇄 면역글로불린 유전자좌를 갖는 트랜스제닉 설치류 기원의 인간 항체
CN115151558A (zh) 哺乳动物序列中的靶向整合增强基因表达
AU2017336160A1 (en) Screening methods using olfactory receptors and novel compounds identified using the same
KR20220025806A (ko) 핵산의 무작위 구성 표적화 통합
US20030207286A1 (en) Nucleic acid sequences showing enhanced expression in benign neuroblastoma compared with acritical human neuroblastoma

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20020313

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DERANGEWAND

17Q First examination report despatched

Effective date: 20031217

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20040302