DE19817946A1

DE19817946A1 - New nucleic acid sequences expressed in normal uterine tissues, and derived polypeptides, for treatment of uterine cancer and identification of therapeutic agents

Info

Publication number: DE19817946A1
Application number: DE19817946A
Authority: DE
Inventors: Andre Rosenthal; Thomas Specht; Bernd Hinzmann; Armin Schmitt; Christian Pilarsky; Edgar Dahl
Original assignee: METAGEN GESELLSCHAFT fur GENOMFORSCHUNG MBH
Current assignee: Metagen Pharmaceuticals GmbH
Priority date: 1998-04-17
Filing date: 1998-04-17
Publication date: 1999-10-21
Also published as: JP2002512017A; WO1999054353A3; WO1999054353A2; EP1071777A2

Abstract

62 nucleic acid (cDNA) sequences (A), fully defined in the specification, that are highly expressed in normal uterine tissue, are new. Independent claims are also included for the following: (a) a subset (A') of 20 (A) that encode at least part of a gene product, the allelic variants of (A') and the complements of (A') or their allelic variants; (b) bacterial or phage artificial chromosomes and cosmid clones containing functional genes, or the chromosome region containing them, corresponding to (A') for use as gene transfer vehicles; (c) nucleic acid sequences containing part of (A) sufficiently long to hybridize with (A); (d) expression cassette (EC) comprising (A), or their fragments, plus control and regulatory sequences; (e) DNA fragments containing a gene isolated using (A); (f) host cells containing (A) as a heterologous component; (g) recombinant production of polypeptides (B), or their fragments, by culturing cells of (f); (h) antibody (Ab) directed against (B), or its fragments, encoded by (A); (j) 55 partial polypeptide sequences, all reproduced, and sequences having at least 80% homology with them; (k) pharmaceutical composition containing at least one of these partial sequences; and (l) genomic genes, their promoters, enhancers, silencers, exon or intron structures, or splice variants obtained using (A).

Description

Die Erfindung betrifft menschliche Nukleinsäuresequenzen aus Uterusnormalgewebe, die für Genprodukte oder Teile davon kodieren, deren funktionale Gene, die mindestens ein biologisch aktives Polypeptid kodieren und deren Verwendung.The invention relates to human nucleic acid sequences from uterine normal tissue, encoding gene products or parts thereof whose functional genes are at least encode a biologically active polypeptide and its use.

Die Erfindung betrifft weiterhin die über die Sequenzen erhältlichen Polypeptide und deren Verwendung.The invention further relates to the polypeptides obtainable via the sequences and their use.

Eine der Hauptkrebstodesursachen bei Frauen ist der Uterustumor, für dessen Bekämpfung neue Therapien notwendig sind. Bisher verwendete Therapien, wie z. B. Chemotherapie, Hormontherapie oder chirurgische Entfernung des Tumorgewebes, führen häufig nicht zu einer vollständigen Heilung.One of the main cancer death causes in women is the uterine tumor, for which Fighting new therapies are necessary. Previously used therapies such. B. Chemotherapy, hormone therapy or surgical removal of tumor tissue, often do not lead to a complete cure.

Das Phänomen Krebs geht häufig einher mit der Über- oder Unterexpression gewisser Gene in den entarteten Zellen, wobei noch unklar ist, ob diese veränderten Expressionsraten Ursache oder Folge der malignen Transformation sind. Die Identifikation solcher Gene wäre ein wesentlicher Schritt für die Entwicklung neuer Therapien gegen Krebs. Der spontanen Entstehung von Krebs geht häufig eine Vielzahl von Mutationen voraus. Diese können verschiedenste Auswirkungen auf das Expressionsmuster in dem betroffenen Gewebe haben, wie z. B. Unter- oder Überexpression, aber auch Expression verkürzter Gene. Mehrere solcher Veränderungen durch solche Mutationskaskaden können schließlich zu bösartigen Entartungen führen. Die Komplexität solcher Zusammenhänge erschwert die experimentelle Herangehensweise sehr.The phenomenon of cancer is often associated with the over- or under-expression of certain Genes in the degenerate cells, although it is still unclear whether these changed Expression rates are cause or consequence of malignant transformation. The Identification of such genes would be an essential step in the development of new ones Therapies for cancer. The spontaneous development of cancer is often a variety of mutations ahead. These can have a variety of effects on the Have expression patterns in the affected tissue, such. B. sub- or Overexpression, but also expression of truncated genes. Several such Changes through such mutation cascades can eventually turn into malignant ones Cause degeneration. The complexity of such relationships makes it difficult experimental approach very.

Für die Suche nach Kandidatengenen, d. h. Genen, die im Vergleich zum Tumorgewebe im normalen Gewebe stärker exprimiert werden, wird eine Datenbank verwendet, die aus sogenannten ESTs besteht. ESTs (Expressed Sequence Tags) sind Sequenzen von cDNAs, d. h. revers transkribierten mRNAs, den Molekülen also, die die Expression von Genen widerspiegeln. Die EST-Sequenzen werden für normale und entartete Gewebe ermittelt. Solche Datenbanken werden von verschiedenen Betreibern z. T. kommerziell angeboten. Die ESTs der LifeSeq-Datenbank, die hier verwendet wird, sind in der Regel zwischen 150 und 350 Nukleotide lang. Sie repräsentieren ein für ein bestimmtes Gen unverkennbares Muster, obwohl dieses Gen normalerweise sehr viel länger ist (< 2000 Nukleotide). Durch Vergleich der Expressionsmuster von normalen und Tumorgewebe können ESTs identifiziert werden, die für die Tumorentstehung und -proliferation wichtig sind. Es besteht jedoch folgendes Problem: Da durch unterschiedliche Konstruktionen der cDNA-Bibliotheken die gefundenen EST-Sequenzen zu unterschiedlichen Regionen eines unbekannten Gens gehören können, ergäbe sich in einem solchen Fall ein völlig falsches Verhältnis des Vorkommens dieser ESTs in dem jeweiligen Gewebe. Dieses würde erst bemerkt werden, wenn das vollständige Gen bekannt ist und somit die ESTs dem gleichen Gen zugeordnet werden können.For the search for candidate genes, d. H. Genes compared to tumor tissue are expressed more strongly in normal tissue, a database is used consists of so-called ESTs. ESTs (Expressed Sequence Tags) are sequences of cDNAs, d. H. reverse transcribed mRNAs, the molecules that express the expression of Reflect genes. The EST sequences are for normal and degenerate tissues determined. Such databases are used by various operators such. T. commercial offered. The ESTs of the LifeSeq database used here are usually between 150 and 350 nucleotides long. They represent one for a particular gene unmistakable pattern, although this gene is usually much longer (<2000 Nucleotides). By comparing the expression patterns of normal and tumor tissue ESTs can be identified that are important for tumorigenesis and proliferation are. However, there is the following problem: Because of different constructions the cDNA libraries the EST sequences found to different regions belong to an unknown gene, would result in such a case completely Incorrect ratio of the occurrence of these ESTs in the respective tissue. This would not be noticed until the complete gene is known and thus the ESTs can be assigned to the same gene.

Es wurde nun gefunden, daß diese Fehlermöglichkeit verringert werden kann, wenn zuvor sämtliche ESTs aus dem jeweiligen Gewebstyp assembliert werden, bevor die Expressionsmuster miteinander verglichen werden. Es wurden also überlappende ESTs ein und desselben Gens zu längeren Sequenzen zusammengefaßt (s. Fig. 1, Fig. 2a und Fig. 3). Durch diese Verlängerung und damit Abdeckung eines wesentlich größeren Genbereichs in jeder der jeweiligen Banken sollte der oben beschriebene Fehler weitgehenst vermieden werden. Da es hierzu keine bestehenden Softwareprodukte gab, wurden Programme für das Assemblieren von genomischen Abschnitten verwendet, die abgewandelt eingesetzt und durch eigene Programme ergänzt wurden. Ein Flowchart der Assemblierungsprozedur ist in Fig. 2b1-2b4 dargestellt. It has now been found that this possibility of error can be reduced if all the ESTs from the respective tissue type are previously assembled before the expression patterns are compared with one another. There were thus overlapping ESTs of the same gene in longer sequences summarized (s. Fig. 1, Fig. 2a and Fig. 3). Due to this extension and thus coverage of a much larger gene range in each of the respective banks, the error described above should be avoided as far as possible. Because there were no existing software products, programs were used to assemble genomic sections, which were modified and supplemented by their own programs. A flowchart of the assembly procedure is shown in Fig. 2b1-2b4.

Es konnten nun die Nukleinsäure-Sequenzen Seq. ID No. 1 bis Seq. ID No. 62 gefunden werden, die als Kandidatengene beim Uterustumor eine Rolle spielen.The nucleic acid sequences Seq. ID No. 1 to Seq. ID No. 62 found which play a role as candidate genes in the uterine tumor.

Von besonderem Interesse sind die Nukleinsäure-Sequenzen Seq. ID No. 1-20.Of particular interest are the nucleic acid sequences Seq. ID No. 1-20.

Die Erfindung betrifft somit Nukleinsäure-Sequenzen, die ein Genprodukt oder ein Teil davon kodieren, umfassend
The invention thus relates to nucleic acid sequences which encode a gene product or a part thereof

a) a nucleic acid sequence selected from the group of Nucleic Acid Sequences Seq ID. 1-20.
b) an allelic variation of the nucleic acids mentioned under a) sequences or
c) a nucleic acid sequence which is complementary to that given under a) or b) said nucleic acid sequences.

Die Erfindung betrifft weiterhin eine Nukleinsäure-Sequenz gemäß einer der Sequenzen Seq ID No. 1-20 oder eine komplementäre oder allelische Variante davon und die Nukleinsäure-Sequenzen davon, die eine 90%ige bis 95%ige Homologie zu einer humanen Nukleinsäure-Sequenz aufweisen.The invention further relates to a nucleic acid sequence according to one of the sequences Seq ID no. 1-20 or a complementary or allelic variant thereof and the Nucleic acid sequences thereof having a 90% to 95% homology to one have human nucleic acid sequence.

Die Erfindung betrifft auch die Nukleinsäure-Sequenzen Seq. ID No. 1 bis Seq. ID No. 62, die im Uterusnormalgewebe erhöht exprimiert sind.The invention also relates to the nucleic acid sequences Seq. ID No. 1 to Seq. ID No. 62, which are expressed elevated in uterine normal tissue.

Die Erfindung betrifft ferner Nukleinsäure-Sequenzen, umfassend einen Teil der oben genannten Nukleinsäure-Sequenzen, in solch einer ausreichenden Größe, daß sie mit den Sequenzen Seq. ID No. 1-20 hybridisieren.The invention further relates to nucleic acid sequences comprising part of the above said nucleic acid sequences, of sufficient size to be compatible with the sequences Seq. ID No. Hybridize 1-20.

Die erfindungsgemäßen Nukleinsäure-Sequenzen weisen im allgemeinen eine Länge von mindestens 50 bis 4500 bp, vorzugsweise eine Länge von mindestens 150 bis 4000 bp, insbesondere eine Länge von 450 bis 3500 bp auf.The nucleic acid sequences according to the invention generally have a length of at least 50 to 4500 bp, preferably a length of at least 150 to 4000 bp, in particular a length of 450 to 3500 bp.

Mit den erfindungsgemäßen Teilsequenzen Seq. ID No. 1-20 können gemäß gängiger Verfahrenspraxis auch Expressionskassetten konstruiert werden, wobei auf der Kassette mindestens eine der erfindungsgemäßen Nukleinsäure-Sequenzen zusammen mit mindestens einer dem Fachmann allgemein bekannten Kontroll- oder regulatorischen Sequenz, wie z. B. einem geeigneten Promotor, kombiniert wird. Die erfindungsgemäßen Sequenzen können in sense oder antisense Orientierung eingefügt sein.With the subsequences Seq. ID No. 1-20 can according to more common Procedural practice also expression cassettes are constructed, based on the Cassette at least one of the nucleic acid sequences according to the invention together with at least one well-known to those skilled control or regulatory sequence, such. A suitable promoter. The Sequences according to the invention can be inserted in sense or antisense orientation his.

In der Literatur sind ist eine große Anzahl von Expressionskassetten bzw. Vektoren und Promotoren bekannt, die verwendet werden können. In the literature is a large number of expression cassettes or vectors and Promoters known that can be used.

Unter Expressionskassetten bzw. Vektoren sind zu verstehen: 1. bakterielle, wie z. B., phagescript, pBs, ϕX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a, pNH46a (Stratagene), pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia), 2. eukaryontische, wie z. B. pWLneo, pSV2cat, pOG44, pXT1, pSG (Stratagene), pSVK3, pBPV, pMSG, pSVL (Pharmacia).Under expression cassettes or vectors are to be understood: 1. bacterial, such as. B., phagescript, pBs, φX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a, pNH46a (Stratagene), pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia), 2. eukaryotic, such. PWLneo, pSV2cat, pOG44, pXT1, pSG (Stratagene), pSVK3, pBPV, pMSG, pSVL (Pharmacia).

Unter Kontroll- oder regulatorischer Sequenz sind geeignete Promotoren zu verstehen. Hierbei sind zwei bevorzugte Vektoren der pKK232-8 und der PCM7 Vektor. Im einzelnen sind folgende Promotoren gemeint: lacI, IacZ, T3, T7, gpt, lambda PR, trc, CMV, HSV Thymidin-Kinase, SV40, LTRs aus Retrovirus und Maus Metallothionein-I.By control or regulatory sequence are meant suitable promoters. Two preferred vectors are pKK232-8 and PCM7 vector. in the the following promoters are meant: lacI, lacZ, T3, T7, gpt, lambda PR, trc, CMV, HSV thymidine kinase, SV40, LTRs from retrovirus and mouse metallothionein-I.

Die auf der Expressionskassette befindlichen DNA-Sequenzen können ein Fusionsprotein kodieren, das ein bekanntes Protein und ein biologisch aktives Polypeptid-Fragment umfaßt.The DNA sequences present on the expression cassette can be Encode a fusion protein that is a known protein and a biologically active protein Polypeptide fragment.

Die Expressionskassetten sind ebenfalls Gegenstand der vorliegenden Erfindung.The expression cassettes are also the subject of the present invention.

Die erfindungsgemäßen Nukleinsäure-Fragmente können zur Herstellung von Vollängen-Genen verwendet werden. Die erhältlichen Gene sind ebenfalls Gegenstand der vorliegenden Erfindung.The nucleic acid fragments according to the invention can be used for the production of Full-length genes are used. The available genes are also subject matter of the present invention.

Die Erfindung betrifft auch die Verwendung der erfindungsgemäßen Nukleinsäure- Sequenzen, sowie die aus der Verwendung erhältlichen Gen-Fragmente.The invention also relates to the use of the nucleic acid Sequences, as well as available from use gene fragments.

Die erfindungsgemäßen Nukleinsäure-Sequenzen können mit geeigneten Vektoren in Wirtszellen gebracht werden, in denen als heterologer Teil die auf den Nukleinsäure- Fragmenten enthaltene genetischen Information befindet, die exprimiert wird.The nucleic acid sequences according to the invention can be labeled with suitable vectors in Host cells in which, as a heterologous part, the nucleic acid Fragments contained genetic information that is expressed.

Die die Nukleinsäure-Fragmente enthaltenden Wirtszellen sind ebenfalls Gegenstand der vorliegenden Erfindung.The host cells containing the nucleic acid fragments are also subject matter of the present invention.

Geeignete Wirtszellen sind z. B. prokaryontische Zellsysteme wie E. coli oder eukaryontische Zellsysteme wie tierische oder humane Zellen oder Hefen.Suitable host cells are z. B. prokaryotic cell systems such as E. coli or eukaryotic cell systems such as animal or human cells or yeasts.

Die erfindungsgemäßen Nukleinsäure-Sequenzen können in sense oder antisense Form verwendet werden.The nucleic acid sequences of the invention may be in sense or antisense Form to be used.

Die Herstellung der Polypeptide oder deren Fragment erfolgt durch Kultivierung der Wirtszellen gemäß gängiger Kultivierungsmethoden und anschließender Isolierung und Aufreinigung der Peptide bzw. Fragmente, ebenfalls mittels gängiger Verfahren. Die Erfindung betrifft ferner Nukleinsäure-Sequenzen, die mindestens eine Teilsequenz eines biologisch aktiven Polypeptids kodieren.The production of the polypeptides or their fragment is carried out by cultivating the Host cells according to common cultivation methods and subsequent isolation and Purification of the peptides or fragments, also by conventional methods. The invention further relates to nucleic acid sequences which comprise at least one partial sequence a biologically active polypeptide.

Ferner betrifft die vorliegende Erfindung Polypeptid-Teilsequenzen, sogenannte ORF (open-reading-frame)-Peptide, gemäß den Sequenzprotokollen ORF ID No. 63-117.Furthermore, the present invention relates to polypeptide partial sequences, so-called ORF (open-reading-frame) peptides, according to the sequence protocols ORF ID no. 63-117.

Die Erfindung betrifft ferner die Polypeptid-Sequenzen, die mindestens eine 80%ige Homologie, insbesondere eine 90%ige Homologie zu den erfindungsgemäßen Polypeptid-Teilsequenzen der ORF. ID No. 63-117 aufweisen. The invention further relates to the polypeptide sequences comprising at least an 80% Homology, in particular a 90% homology to the invention Polypeptide partial sequences of the ORF. ID No. 63-117.

Die Erfindung betrifft auch Antikörper, die gegen ein Polypeptid oder Fragment davon gerichtete sind, welche von den erfindungsgemäßen Nukleinsäuren der Sequenzen Seq. ID No. 1 bis Seq. ID No. 62 kodiert werden.The invention also relates to antibodies directed against a polypeptide or fragment thereof directed, which of the nucleic acids according to the invention of the sequences Seq. ID No. 1 to Seq. ID No. 62 are coded.

Unter Antikörper sind insbesondere monoklonale Antikörper und Antikörper aus Phage- Displays zu verstehen.Antibodies include in particular monoclonal antibodies and antibodies from phage To understand displays.

Die erfindungsgemäßen Polypeptide der Sequenzen ORF ID No. 63-117 können auch als Tool zum Auffinden von Wirkstoffen gegen den Uterustumor verwendet werden, was ebenfalls Gegenstand der vorliegenden Erfindung ist.The polypeptides according to the invention of the sequences ORF ID no. 63-117 can too used as a tool to find agents against the uterine tumor, what is also the subject of the present invention.

Ebenfalls Gegenstand der vorliegenden Erfindung ist die Verwendung der Nukleinsäure-Sequenzen gemäß den Sequenzen Seq. ID No. 1 bis Seq. ID No. 62 zur Expression von Polypeptiden, die als Tools zum Auffinden von Wirkstoffen gegen den Uterustumor verwendet werden können.Likewise provided by the present invention is the use of Nucleic acid sequences according to the sequences Seq. ID No. 1 to Seq. ID No. 62 to Expression of Polypeptides Used as Tools to Find Agents Against Uterine tumor can be used.

Die Erfindung betrifft auch die Verwendung der gefundenen Polypeptid-Teilsequenzen ORF. ID No. 63-117 als Arzneimittel in der Gentherapie zur Behandlung gegen den Uterustumor, bzw. zur Herstellung eines Arzneimittels zur Behandlung gegen den Uterustumor.The invention also relates to the use of the polypeptide partial sequences found ORF. ID No. 63-117 as a drug in gene therapy for treatment against Uterus tumor, or for the manufacture of a medicament for treatment against the Uterine tumor.

Die Erfindung betrifft auch Arzneimittel, die mindestens eine Polypeptid-Teilsequenz ORF. ID No. 63-117 enthalten.The invention also relates to pharmaceutical compositions containing at least one polypeptide partial sequence ORF. ID No. 63-117 included.

Die gefundenen erfindungsgemäßen Nukleinsäure-Sequenzen können auch genomische oder mRNA-Sequenzen sein.The nucleic acid sequences of the invention found can also be genomic or mRNA sequences.

Die Erfindung betrifft auch genomische Gene, ihre Exon- und Intronstruktur und deren Spleißvarianten, erhältlich aus den cDNAs der Sequenzen Seq. ID No. 1 bis Seq. ID No. 62, sowie deren Verwendung zusammen mit geeigneten regulativen Elementen, wie geeigneten Promotoren und/oder Enhancern.The invention also relates to genomic genes, their exon and intron structure and their Splice variants obtainable from the cDNAs of sequences Seq. ID No. 1 to Seq. ID No. 62, and their use together with appropriate regulatory elements, such as suitable promoters and / or enhancers.

Mit den erfindungsgemäßen Nukleinsäuren (cDNA-Sequenzen) Seq. ID No. 1-62 werden genomische BAC-, PAC- und Cosmid-Bibliotheken gescreent und über komplementäre Basenpaarung (Hybridisierung) spezifisch humane Klone isoliert. Die so isolierten BAC-, PAC- und Cosmid-Klone werden mit Hilfe der Fluoreszenz-in-situ- Hybridisation auf Metaphasenchromosomen hybridisiert und entsprechende Chromosomenabschnitte identifiziert, auf denen die entsprechenden genomischen Gene liegen. BAC-, PAC- und Cosmid-Klone werden sequenziert, um die entsprechenden genomischen Gene in ihrer vollständigen Struktur (Promotoren, Enhancer, Silencer, Exons und Introns) aufzuklären. BAC-, PAC- und Cosmid-Klone können als eigenständige Moleküle für den Gentransfer eingesetzt werden (s. Fig. 5).With the nucleic acids according to the invention (cDNA sequences) Seq. ID No. 1-62, genomic BAC, PAC and cosmid libraries are screened and specific human clones are isolated via complementary base pairing (hybridization). The thus isolated BAC, PAC and cosmid clones are hybridized to metaphase chromosomes by fluorescence in situ hybridization and corresponding chromosome segments are identified on which the corresponding genomic genes are located. BAC, PAC and cosmid clones are sequenced to elucidate the corresponding genomic genes in their complete structure (promoters, enhancers, silencers, exons and introns). BAC, PAC and cosmid clones can be used as independent molecules for gene transfer (see Fig. 5).

Die Erfindung betrifft auch BAC-, PAC- und Cosmid-Klone, enthaltend funktionelle Gene und ihre chromosomale Lokalisation, entsprechend den Sequenzen Seq. ID. No. 1 bis Seq. ID No. 62, zur Verwendung als Vehikel zum Gentransfer. The invention also relates to BAC, PAC and cosmid clones containing functional genes and their chromosomal localization, according to sequences Seq. ID. No. 1 to Seq. ID No. 62, for use as a vehicle for gene transfer.

Meanings of technical terms and abbreviations

Nukleinsäuren = Unter Nukleinsäuren sind in der vorliegenden Erfindung zu verstehen: mRNA, partielle cDNA, vollängen cDNA und genomische Gene (Chromosomen).
ORF = Open Reading Frame, eine definierte Abfolge von Aminosäuren, die von der cDNA-Sequenz abgeleitet werden kann.
Contig = eine Menge von DNA-Sequenzen, die aufgrund sehr großer Ähnlichkeiten zu einer Sequenz zusammengefaßt werden können (Consensus)
Singleton = ein Contig, der nur eine Sequenz enthält.Nucleic acids = By nucleic acids are meant in the present invention: mRNA, partial cDNA, full length cDNA and genomic genes (chromosomes).
ORF = Open Reading Frame, a defined sequence of amino acids that can be derived from the cDNA sequence.
Contig = a set of DNA sequences that can be grouped together because of their very similarities (consensus)
Singleton = a contig that contains only one sequence.

Explanation of the alignment parameters

minimal initial match = minimaler anfänglicher Identitätsbereich
maximum pads per read = maximale Anzahl von Insertionen
maximum percent mismatch = maximale Abweichung in %.minimal initial match = minimum initial identity range
maximum pads per read = maximum number of insertions
maximum percent mismatch = maximum deviation in%.

Explanation of the pictures

Fig. 1 zeigt die systematische Gen-Suche in der Incyte LifeSeq Datenbank, FIG. 1 shows the systematic gene search in the Incyte LifeSeq database

Fig. 2a zeigt das Prinzip der EST-Assemblierung, Fig. 2a shows the principle of EST assembly,

Fig. 2b1-2b4 zeigt das gesamte Prinzip der EST-Assemblierung, Fig. 2b1-2b4 shows the entire principle of the EST assembly,

Fig. 3 zeigt die in silico Subtraktion der Genexpression in verschiedenen Geweben, Fig. 3 shows in silico subtraction of gene expression in various tissues,

Fig. 4a zeigt die Bestimmung der gewebsspezifischen Expression über elektronischen Northern, FIG. 4a shows the determination of tissue-specific expression via electronic Northern,

Fig. 4b zeigt den elektronischen Northern, FIG. 4b shows the electronic Northern,

Fig. 5 zeigt die Isolierung von genomischen BAC- und PAC-Klonen. Figure 5 shows the isolation of genomic BAC and PAC clones.

Die nachfolgenden Beispiele erläutern die Herstellung der erfindungsgemäßen Nukleinsäure-Sequenzen, ohne die Erfindung auf diese Beispiele und Nukleinsäure- Sequenzen zu beschränken.The following examples illustrate the preparation of the invention Nucleic acid sequences, without the invention being limited to these examples and nucleic acid Restrict sequences.

example 1 Search for tumor-related candidate genes

Zuerst wurden sämtliche ESTs des entsprechenden Gewebes aus der LifeSeq- Datenbank (vom Oktober 1997) extrahiert. Diese wurden dann mittels des Programms GAP4 des Staden-Pakets mit den Parametern 0% mismatch, 8 pads per read und einem minimalen match von 20 assembliert. Die nicht in die GAP4-Datenbank aufgenommenen Sequenzen (Fails) wurden erst bei 1% mismatch und dann nochmals bei 2% mismatch mit der Datenbank assembliert. Aus den Contigs der Datenbank, die aus mehr als einer Sequenz bestanden, wurden Consensussequenzen errechnet. Die Singletons der Datenbank, die nur aus einer Sequenz bestanden, wurden mit den nicht in die GAP4-Datenbank aufgenommenen Sequenzen bei 2% mismatch erneut assembliert. Wiederum wurden für die Contigs die Consensussequenzen ermittelt. Alle übrigen ESTs wurden bei 4% mismatch erneut assembliert. Die Consensussequenzen wurden abermals extrahiert und mit den vorherigen Consensussequenzen sowie den Singletons und den nicht in die Datenbank aufgenommenen Sequenzen abschließend bei 4% mismatch assembliert. Die Consensussequenzen wurden gebildet und mit den Singletons und Fails als Ausgangsbasis für die Gewebsvergleiche verwendet. Durch diese Prozedur konnte sichergestellt werden, daß unter den verwendeten Parametern sämtliche Sequenzen von einander unabhängige Genbereiche darstellten.First, all ESTs of the corresponding tissue from the LifeSeq Database (of October 1997) extracted. These were then using the program GAP4 of the Staden package with the parameters 0% mismatch, 8 pads per read and a minimum match of 20 assembled. Not in the GAP4 database Recorded sequences (Fails) were first mismatched at 1% and then again assembled at 2% mismatch with the database. From the contigs of the database, the consisted of more than one sequence, consensus sequences were calculated. The Singletons of the database, which consisted of only one sequence, were not recaptured sequences in the GAP4 database at 2% mismatch assembled. Again, the consensus sequences were determined for the contigs. All remaining ESTs were reassembled at 4% mismatch. The consensus sequences were extracted again and with the previous consensus sequences and the Singletons and non-database sequences assembled at 4% mismatch. The consensus sequences were formed and with the Singletons and Fails used as a starting point for the tissue comparisons. By this procedure could be ensured that among the parameters used all sequences represented independent gene regions.

Fig. 2b1-2b4 veranschaulicht die Verlängerung der Uterusnormalgewebe ESTs. Fig. 2b1-2b4 illustrates the extension of the uterine normal tissues ESTs.

Die so assemblierten Sequenzen der jeweiligen Gewebe wurden anschließend mittels des gleichen Programms miteinander verglichen (Fig. 3). Hierzu wurden erst alle Sequenzen des ersten Gewebes in die Datenbank eingegeben. (Daher war es wichtig, daß diese voneinander unabhängig waren.)The sequences of the respective tissues thus assembled were then compared with one another by means of the same program ( FIG. 3). For this purpose, first all sequences of the first tissue were entered into the database. (Therefore, it was important that they were independent of each other.)

Dann wurden alle Sequenzen des zweiten Gewebes mit allen des ersten verglichen. Das Ergebnis waren Sequenzen, die für das erste bzw. das zweite Gewebe spezifisch waren, sowie welche, die in beiden vorkamen. Bei Letzteren wurde das Verhältnis der Häufigkeit des Vorkommens in den jeweiligen Geweben ausgewertet. Sämtliche, die Auswertung der assemblierten Sequenzen betreffenden Programme, wurden selbst entwickelt.Then all sequences of the second tissue were compared to all of the first. The result was sequences specific to the first and second tissues, respectively were, as well as those that occurred in both. In the latter case, the ratio of Frequency of occurrence in the respective tissues evaluated. All that Evaluation of the assembled sequences related programs were themselves developed.

Alle Sequenzen, die mehr als viermal in jeweils einem der verglichenen Gewebe vorkamen, sowie alle, die mindestens fünfmal so häufig in einem der beiden Gewebe vorkamen wurden weiter untersucht. Diese Sequenzen wurden einem elektronischen Northern (s. Beispiel 2.1) unterzogen, wodurch die Verteilung in sämtlichen Tumor- und Normal-Geweben untersucht wurde (s. Fig. 4a und Fig. 4b). Die relevanten Kandidaten wurden dann mit Hilfe sämtlicher Incyte ESTs und allen ESTs öffentlicher Datenbanken verlängert (s. Beispiel 3). Anschließend wurden die Sequenzen und ihre Übersetzung in mögliche Proteine mit allen Nukleotid- und Proteindatenbanken verglichen, sowie auf mögliche, für Proteine kodierende Regionen untersucht. All sequences that occurred more than four times in each of the compared tissues as well as those that occurred at least five times as frequently in one of the two tissues were further investigated. These sequences were an electronic Northern (s. Example 2.1) is subjected, whereby the distribution in all tumor and normal tissues was examined (s. Fig. 4a and Fig. 4b). The relevant candidates were then extended using all Incyte ESTs and all ESTs of public databases (see Example 3). Subsequently, the sequences and their translation into possible proteins were compared with all nucleotide and protein databases, as well as examined for possible coding regions for proteins.

Example 2 Algorithm for identification and extension of partial cDNA sequences with altered expression pattern

Im folgenden soll ein Algorithmus zur Auffindung über- oder unterexprimierter Gene erläutert werden. Die einzelnen Schritte sind der besseren Übersicht halber auch in einem Flußdiagramm zusammengefaßt (s. Fig. 4b).In the following, an algorithm for finding overexpressed or underexpressed genes will be explained. The individual steps are also summarized in a flow chart for the sake of clarity (see Fig. 4b).

2.1 Electronic Northern blot

Zu einer partiellen DNA-Sequenz S, z. B. einem einzelnen EST oder einem Contig von ESTs, werden mittels eines Standardprogramms zur Homolgiesuche, z. B. BLAST (Altschul, S. F., Gish W., Miller, W., Myers, E. W. und Lipman, D. J. (1990) J. Mol. Biol., 215, 403-410), BLAST2 (Altschul, S. F., Madden, T. L., Schäffer, A. A., Zhang, J., Zhang, Z., Miller, W. und Lipman, D. J. (1997) Nucleic Acids Research 25 3389- 3402) oder FASTA (Pearson, W. R. und Lipman, D. J. (1988) Proc. Natl. Acad. Sci. USA 85, 2444-2448), die homologen Sequenzen in verschiedenen nach Geweben geordneten (privaten oder öffentlichen) EST-Bibliotheken bestimmt. Die dadurch ermittelten (relativen oder absoluten) Gewebe-spezifischen Vorkommenshäufigkeiten dieser Partial-Sequenz S werden als elektronischer Northern-Blot bezeichnet.To a partial DNA sequence S, z. A single EST or a contig of ESTs are created using a standard homolgine search program, eg. B. BLAST (Altschul, S.F., Gish W., Miller, W., Myers, E.W., and Lipman, D.J. (1990) J. Mol. Biol. 215, 403-410), BLAST2 (Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W. and Lipman, D.J. (1997) Nucleic Acids Research 25 3389- 3402) or FASTA (Pearson, W.R. and Lipman, D.J. (1988) Proc. Natl. Acad. Sci. USA 85, 2444-2448), the homologous sequences in different tissues ordered (private or public) EST libraries. The result determined (relative or absolute) tissue-specific occurrence frequencies This partial sequence S are referred to as electronic Northern Blot.

2.1.1

Analog der unter 2.1 beschriebenen Verfahrensweise wurde die Sequenz Seq. ID No. 27 gefunden, die 6,7× stärker im normalen Uterusmyometrium als im Myomgewebe vorkommt und das humane gas 1 Gen darstellt.Analogously to the procedure described under 2.1, the sequence Seq. ID No. 27 found to be 6.7 × stronger in the normal uterine myometrium than in the myoma tissue occurs and represents the human gas 1 gene.

Das Ergebnis ist wie folgt:The result is as follows:

Elektronischer Northern für SEQ. ID. NO: 27 Electronic Northern for SEQ. ID. NO: 27

In analoger Verfahrensweise wurden auch folgende Northerns gefunden:In an analogous procedure, the following Northerns were also found:

Elektronischer Northern für SEQ. ID. NO: 1 Electronic Northern for SEQ. ID. NO: 1

Elektronischer Northern für SEQ. ID. NO: 2 Electronic Northern for SEQ. ID. NO: 2

Elektronischer Northern für SEQ. ID. NO: 3 Electronic Northern for SEQ. ID. NO: 3

Elektronischer Northern für SEQ. ID. NO: 4 Electronic Northern for SEQ. ID. NO: 4

Elektronischer Northern für SEQ. ID. NO: 5 Electronic Northern for SEQ. ID. NO: 5

Elektronischer Northern für SEQ. ID. NO: 6 Electronic Northern for SEQ. ID. NO: 6

Elektronischer Northern für SEQ. ID. NO: 7 Electronic Northern for SEQ. ID. NO: 7

Elektronischer Northern für SEQ. ID. NO: 8 Electronic Northern for SEQ. ID. NO: 8

Elektronischer Northern für SEQ. ID. NO: 9 Electronic Northern for SEQ. ID. NO: 9

Elektronischer Northern für SEQ. ID. NO: 10 Electronic Northern for SEQ. ID. NO: 10

Elektronischer Northern für SEQ. ID. NO: 11 Electronic Northern for SEQ. ID. NO: 11

Elektronischer Northern für SEQ. ID. NO: 12 Electronic Northern for SEQ. ID. NO: 12

Elektronischer Northern für SEQ. ID. NO: 13 Electronic Northern for SEQ. ID. NO: 13

Elektronischer Northern für SEQ. ID. NO: 14 Electronic Northern for SEQ. ID. NO: 14

Elektronischer Northern für SEQ. ID. NO: 15 Electronic Northern for SEQ. ID. NO: 15

Elektronischer Northern für SEQ. ID. NO: 16 Electronic Northern for SEQ. ID. NO: 16

Elektronischer Northern für SEQ. ID. NO: 17 Electronic Northern for SEQ. ID. NO: 17

Elektronischer Northern für SEQ. ID. NO: 18 Electronic Northern for SEQ. ID. NO: 18

Elektronischer Northern für SEQ. ID. NO: 19 Electronic Northern for SEQ. ID. NO: 19

Elektronischer Northern für SEQ. ID. NO: 20 Electronic Northern for SEQ. ID. NO: 20

Elektronischer Northern für SEQ. ID. NO: 21 Electronic Northern for SEQ. ID. NO: 21

Elektronischer Northern für SEQ. ID. NO: 22 Electronic Northern for SEQ. ID. NO: 22

Elektronischer Northern für SEQ. ID. NO: 23 Electronic Northern for SEQ. ID. NO: 23

Elektronischer Northern für SEQ. ID. NO: 24 Electronic Northern for SEQ. ID. NO: 24

Elektronischer Northern für SEQ. ID. NO: 25 Electronic Northern for SEQ. ID. NO: 25

Elektronischer Northern für SEQ. ID. NO: 26 Electronic Northern for SEQ. ID. NO: 26

Elektronischer Northern für SEQ. ID. NO: 28 Electronic Northern for SEQ. ID. NO: 28

Elektronischer Northern für SEQ. ID. NO: 29 Electronic Northern for SEQ. ID. NO: 29

Elektronischer Northern für SEQ. ID. NO: 30 Electronic Northern for SEQ. ID. NO: 30

Elektronischer Northern für SEQ. ID. NO: 31 Electronic Northern for SEQ. ID. NO: 31

Elektronischer Northern für SEQ. ID. NO: 32 Electronic Northern for SEQ. ID. NO: 32

Elektronischer Northern für SEQ. ID. NO: 33 Electronic Northern for SEQ. ID. NO: 33

Elektronischer Northern für SEQ. ID. NO: 34 Electronic Northern for SEQ. ID. NO: 34

Elektronischer Northern für SEQ. ID. NO: 35 Electronic Northern for SEQ. ID. NO: 35

Elektronischer Northern für SEQ. ID. NO: 36 Electronic Northern for SEQ. ID. NO: 36

Elektronischer Northern für SEQ. ID. NO: 37 Electronic Northern for SEQ. ID. NO: 37

Elektronischer Northern für SEQ. ID. NO: 38 Electronic Northern for SEQ. ID. NO: 38

Elektronischer Northern für SEQ. ID. NO: 39 Electronic Northern for SEQ. ID. NO: 39

Elektronischer Northern für SEQ. ID. NO: 40 Electronic Northern for SEQ. ID. NO: 40

Elektronischer Ncrthern für SEQ. ID. NO: 41 Electronic device for SEQ. ID. NO: 41

Elektronischer Northern für SEQ. ID. NO: 42 Electronic Northern for SEQ. ID. NO: 42

Elektronischer Northern für SEQ. ID. NO: 43 Electronic Northern for SEQ. ID. NO: 43

Elektronischer Northern für SEQ. ID. NO: 44 Electronic Northern for SEQ. ID. NO: 44

Elektronischer Northern für SEQ. ID. NO: 45 Electronic Northern for SEQ. ID. NO: 45

Elektronischer Northern für SEQ. ID. NO: 46 Electronic Northern for SEQ. ID. NO: 46

Elektronischer Northern für SEQ. ID. NO: 47 Electronic Northern for SEQ. ID. NO: 47

Elektronischer Northern für SEQ. ID. NO: 48 Electronic Northern for SEQ. ID. NO: 48

Elektronischer Northern für SEQ. ID. NO: 49 Electronic Northern for SEQ. ID. NO: 49

Elektronischer Northern für SEQ. ID. NO: 50 Electronic Northern for SEQ. ID. NO: 50

Elektronischer Northern für SEQ. ID. NO: 51 Electronic Northern for SEQ. ID. NO: 51

Elektronischer Northern für SEQ. ID. NO: 52 Electronic Northern for SEQ. ID. NO: 52

Elektronischer Northern für SEQ. ID. NO: 53 Electronic Northern for SEQ. ID. NO: 53

Elektronischer Northern für SEQ. ID. NO: 54 Electronic Northern for SEQ. ID. NO: 54

Elektronischer Northern für SEQ. ID. NO: 55 Electronic Northern for SEQ. ID. NO: 55

Elektronischer Northern für SEQ. ID. NO: 56 Electronic Northern for SEQ. ID. NO: 56

Elektronischer Northern für SEQ. ID. NO: 57 Electronic Northern for SEQ. ID. NO: 57

Elektronischer Northern für SEQ. ID. NO: 58 Electronic Northern for SEQ. ID. NO: 58

Elektronischer Northern für SEQ. ID. NO: 59 Electronic Northern for SEQ. ID. NO: 59

Elektronischer Northern für SEQ. ID. NO: 60 Electronic Northern for SEQ. ID. NO: 60

QUERY: utn_1572013_13.fasta.ext QUERY: utn_1572013_13.fasta.ext

Elektronischer Northern für SEQ. ID. NO: 61 Electronic Northern for SEQ. ID. NO: 61

QUERY: uen_2932156_37.fasta.ext QUERY: uen_2932156_37.fasta.ext

Elektronischer Northern für SEQ. ID. NO: 62 Electronic Northern for SEQ. ID. NO: 62

QUERY: uen_3751007_1.fasta.ext QUERY: uen_3751007_1.fasta.ext

2.2 Fisher test

Um zu entscheiden, ob eine Partial-Sequenz S eines Gens in einer Bibliothek für Normal-Gewebe signifikant häufiger oder seltener vorkommt als in einer Bibliothek für entartetes Gewebe, wird Fishers Exakter Test, ein statistisches Standardverfahren (Hays, W. L., (1991) Statistics, Harcourt Brace College Publishers, Fort Worth), durchgeführt.To decide if a partial sequence S of a gene in a library for Normal tissue is significantly more common or less common than in a library for degenerated tissue, becomes Fisher's Exact Test, a standard statistical procedure (Hays, W.L., (1991) Statistics, Harcourt Brace College Publishers, Fort Worth), carried out.

Die Null-Hypothese lautet: die beiden Bibliotheken können bezüglich der Häufigkeit zu S homologer Sequenzen nicht unterschieden werden. Falls die Null-Hypothese mit hinreichend hoher Sicherheit abgelehnt werden kann, wird das zu S gehörende Gen als interessanter Kandidat für ein Krebs-Gen akzeptiert, und es wird im nächsten Schritt versucht, eine Verlängerung seiner Sequenz zu erreichen.The null hypothesis is: the two libraries may increase in frequency S homologous sequences can not be distinguished. If the null hypothesis with can be denied sufficiently high security, the gene belonging to S as interesting candidate for a cancer gene is accepted, and it will be in the next step trying to achieve an extension of his sequence.

Example 3 Automatic extension of the partial sequence

Die automatische Verlängerung der Partial-Sequenz S vollzieht sich in drei Schritten:
The automatic extension of the partial sequence S takes place in three steps:

1. Determination of all to S homologous sequences from the total amount of the Available sequences using BLAST
2. Assembly of these sequences by means of the standard program GAP4 (Bonfield,
J.K., Smith, K.F., and Staden R. (1995), Nucleic Acids Research 23, 4992-4999) (Contig formation).
3. Calculation of a consensus sequence 0 from the assembled sequences

Die Konsens-Sequenz 0 wird im allgemeinen länger sein als die Ausgangssequenz S. Ihr elektronischer Northern-Blot wird demzufolge von dem für S abweichen. Ein erneuter Fisher-Test entscheidet, ob die Alternativ-Hypothese der Abweichung von einer gleichmäßigen Expression in beiden Bibliotheken aufrechterhalten werden kann. Ist dies der Fall, wird versucht, 0 in gleicher Weise wie S zu verlängern. Diese Iteration wird mit der jeweils erhaltenen Konsensus-Sequenzen C_i (i: Index der Iteration) fortgesetzt bis die Alternativ-Hypothese verworfen wird (if H₀ Exit; Abbruchkriterium I) oder bis keine automatische Verlängerung mehr möglich ist (while C_i < C_i-1; Abbruchkriterium II).The consensus sequence 0 will generally be longer than the starting sequence S. As a result, your electronic Northern blot will differ from that for S. A renewed Fisher test will determine whether the alternative hypothesis of deviation from uniform expression can be maintained in both libraries. If this is the case, an attempt is made to extend 0 in the same way as S. This iteration is continued with the respectively obtained consensus sequences C _i (i: index of the iteration) until the alternative hypothesis is rejected (if H ₀ exit, termination criterion I) or until no automatic extension is possible (while C _i <C _i-1 , termination criterion II).

Im Fall des Abbruchkriteriums II bekommt man mit der nach der letzten Iteration vorliegenden Konsens-Sequenz eine komplette oder annähernd komplette Sequenz eines Gens, das mit hoher statistischer Sicherheit mit Krebs in Zusammenhang gebracht werden kann.In the case of the termination criterion II you get with the after the last iteration present consensus sequence a complete or nearly complete sequence a gene that is associated with cancer with high statistical certainty can be brought.

Analog der oben beschriebenen Beispiele konnten die in der Tabelle I beschriebenen Nukleinsäure-Sequenzen aus Uterusnormalgewebe gefunden werden. Analogous to the examples described above, those described in Table I. Nucleic acid sequences can be found from uterine normal tissue.

Ferner konnten zu den einzelnen Nukleinsäure-Sequenzen die Peptidsequenzen (ORF's) bestimmt werden, die in der Tabelle II aufgelistet sind, wobei wenigen Nukleinsäure-Sequenzen kein Peptid zugeordnet werden kann und einigen Nukleinsäure-Sequenzen mehr als ein Peptid zugeordnet werden kann. Wie bereits oben erwähnt, sind sowohl die ermittelten Nukleinsäure-Sequenzen, als auch die den Nukleinsäure-Sequenzen zugeordneten Peptid-Sequenzen Gegenstand der vorliegenden Erfindung. Furthermore, the peptide sequences for the individual nucleic acid sequences (ORF's) listed in Table II, with few Nucleic acid sequences no peptide can be assigned and some Nucleic acid sequences can be assigned more than one peptide. As already mentioned above, both the detected nucleic acid sequences, as well as the Nucleic acid sequences associated with peptide sequences subject of present invention.

DNA-Sequenz DNA sequence Peptid-Sequenzen Peptide sequences Seq. ID. NOSeq. ID. NO (ORF's Seq. ID. NO)(ORF's Seq. ID NO) 11 6363 64@64 @ 65@65 @ 22 6666 67@67 @ 68@68 @ 33 6969 70@70 @ 71@71 @ 44 7272 73@73 @ 74@74 @ 55 7575 76@76 @ 77@77 @ 66 7878 79@79 @ 80@80 @ 77 8181 82@82 @ 88th 8383 84@84 @ 85@85 @ 99 8686 8787 8888 1010 8989 90@90 @ 91@91 @ 1111 9292 93@93 @ 94@94 @ 1212 9595 96@96 @ 1313 9797 9898 99@99 @ 1414 100100 101@101 @ 102@102 @ 1515 103103 104@104 @ 105@105 @ 1616 106106 107@107 @ 1717 108108 109@109 @ 1818 110110 111@111 @ 1919 112112 113@113 @ 114@114 @ 2020 115115 116@116 @ 117117

Die erfinderischen Nukleinsäure-Sequenzen Seq. ID No. 1-62 der ermittelten Kandidatengene und die ermittelten Aminosäure-Sequenzen Seq. ID No. 63-117 werden in dem nachfolgenden Sequenzprotokoll beschrieben.The inventive nucleic acid sequences Seq. ID No. 1-62 of the determined Candidate genes and the determined amino acid sequences Seq. ID No. 63-117 are described in the sequence listing below.

Sequenzprotokoll sequence Listing

Claims

A nucleic acid sequence encoding a gene product or a portion thereof

a) a nucleic acid sequence selected from the group Seq ID. No. 1-20.
b) an allelic variation of the nucleic acid sequences mentioned under a) or
c) a nucleic acid sequence which is complementary to the nucleic acid sequences mentioned under a) or b).

2. A nucleic acid sequence according to one of the sequences Seq ID. No. 1-20, or one complementary or allelic variant thereof.

3. Nucleic acid sequence Seq. ID. No. 1-62, characterized in that they are in Uterine normal tissue are expressed increased.

4. BAC, PAC and cosmid clones containing functional genes and their chromosomal Localization, according to the sequences Seq. ID. No. 1-62, for use as Vehicle for gene transfer.

5. A nucleic acid sequence according to claims 1 to 4, characterized characterized in that they have a 90% homology to a human nucleic acid Sequence has.

6. A nucleic acid sequence according to claims 1 to 4, characterized characterized in that it has a 95% homology to a human nucleic acid Sequence has.

A nucleic acid sequence comprising a part of claims 1 to 6 said nucleic acid sequences, of sufficient size to hybridize with the sequences according to claims 1 to 6.

8. A nucleic acid sequence according to claims 1 to 7, characterized characterized in that the size of the fragment has a length of at least 50 to 4500 bp.

9. A nucleic acid sequence according to claims 1 to 7, characterized characterized in that the size of the fragment has a length of at least 50 to 4000 bp.

10. A nucleic acid sequence according to any one of claims 1 to 9, the at least a partial sequence of a biologically active polypeptide encoded.

11. An expression cassette comprising a nucleic acid fragment or a A sequence according to any one of claims 1 to 9, together with at least one Control or regulatory sequence.

12. An expression cassette comprising a nucleic acid fragment or a A sequence according to claim 11 wherein the control or regulatory sequence is a suitable promoter.

13. An expression cassette according to any one of claims 11 and 12, characterized in that the DNA sequences present on the cassette are Encode a fusion protein that is a known protein and a biologically active protein Polypeptide fragment.

14. Use of the nucleic acid sequences according to claims 1 to 10 for Production of full-length genes.

15. A DNA fragment comprising a gene resulting from the use according to claim 14 is available.

16. Host cell containing as heterologous part of its expressible genetic Information a nucleic acid fragment according to any one of claims 1 to 10.

17. Host cell according to claim 16, characterized in that it is a prokaryotic or eukaryotic cell system.

18. Host cell according to one of claims 16 or 17, characterized in that the prokaryotic cell system E. coli and the eukaryotic cell system animal, human or yeast cell system.

19. A method for producing a polypeptide or a fragment thereby characterized in that the host cells are cultured according to claims 16 to 18 become.

20. An antibody directed against a polypeptide or fragment which from the nucleic acids of sequences Seq. ID. No. 1-62 is encoded according to Claim 19 is available.

21. An antibody according to claim 20, characterized in that it is monoclonal is.

22. An antibody according to claim 20, characterized in that it contains a phage Display antibody is.

23. Polypeptide partial sequences according to the sequences Seq. ID. No. ORF 63-117.

24. Polypeptide partial sequences according to claim 22, having at least 80% homology to these sequences.

25. A resulting from a phage display polypeptide which to the Polypeptide partial sequences according to claim 24 can bind.

26. Polypeptide partial sequences according to claim 22, having at least 90% homology to these sequences.

27. Use of the polypeptide partial sequences according to the sequences Seq. ID. No. 63-117, as tools for finding agents against the uterine tumor.

28. Use of the nucleic acid sequences according to the sequences Seq. ID. No. 1 to Seq. ID. No. 62 for the expression of polypeptides used as tools for finding of drugs against the uterine tumor can be used.

29. Use of Nucleic Acid Sequences Seq. ID. No. 1-20 in sense or antisense Shape.

30. Use of Polypeptide Partial Sequences Seq. ID. No. 63-117 as a drug in gene therapy for the treatment of the uterine tumor.

31. Use of Polypeptide Partial Sequences Seq. ID. No. 63-117, for production a drug for the treatment of the uterine tumor.

32. A pharmaceutical composition containing at least one polypeptide partial sequence Seq. ID. No. 63- 117th

33. A nucleic acid sequence according to claims 1 to 10, characterized characterized in that it is a genomic sequence.

34. A nucleic acid sequence according to claims 1 to 10, characterized characterized in that it is an mRNA sequence.

35. Genomic Genes, Their Promoters, Enhancers, Silencers, Exon Structure, Intron structure and its splice variants, obtainable from the CDNAs of the sequences Seq. ID. No. 1 to Seq. ID. No. 62nd

36. Use of the genomic genes according to claim 33, together with appropriate regulatory elements.

37. Use according to claim 34, characterized in that the regulative Element is a suitable promoter and / or enhancer.

38. A nucleic acid sequence according to claims 1 to 7, characterized characterized in that the size of the fragment has a length of at least 300 to 3500 bp.