FR2892730A1

FR2892730A1 - Detecting the presence/risk of cancer development in a mammal, comprises detecting the presence/absence or (relative) quantity e.g. of nucleic acids and/or polypeptides coded by the nucleic acids, which indicates the presence/risk

Info

Publication number: FR2892730A1
Application number: FR0511080A
Authority: FR
Inventors: Alexander Krause; Philippe Leissner; Malick Paye; Bruno Mougin; Fabien Schweighoffer; Laurent Bracco
Original assignee: Biomerieux SA; ExonHit Therapeutics SA
Current assignee: Biomerieux SA; ExonHit Therapeutics SA
Priority date: 2005-10-28
Filing date: 2005-10-28
Publication date: 2007-05-04
Also published as: CN101365802A

Abstract

Detecting the presence/risk of cancer development in a mammal, comprises detecting nucleic acids (I) comprising one of (SEQ ID No. 23, 52, 53, 148 and 225) or a distinctive fragment having at least 15, preferably 16, 17, 18, 19, 20, 25 or 30 consecutive bases, nucleic acids (II) having a complementary sequence of (I), functional analogs (III) of (I) or (II) from an another species and/or polypeptides coded by (I)-(III), in a sample, where the presence/absence or (relative) quantity of the target molecules indicates the presence/risk of cancer development. Detecting the presence/risk of cancer development in a mammal, comprises detection of the nucleic acids (I) comprising one of 5 fully defined 30-102 amino acid sequences (SEQ ID No. 23, 52, 53, 148 and 225) (PANEL 11) given in the table 1 of the specification or a distinctive fragment of those having at least 15, preferably 16, 17, 18, 19, 20, 25 or 30 consecutive bases, the nucleic acids (II) having a complementary sequence of (I), the functional analogs (III) of (I) or (II) from an another species and/or the polypeptides coded by (I)-(III), in a biological sample of the mammal, where the presence, absence or (relative) quantity of the target molecules in the sample being an indication of the presence or risk of cancer development in mammal. Independent claims are included for: (1) an in vitro or ex vivo detection of the presence/risk of cancer development in a mammal, comprising the determination of the presence/absence of an alteration in a gene or RNA e.g. receptors, adapters, enzymes, factors involved in gene expression regulation, chemokines, cytokines or interleukines, i.e. the gene or RNA involved in the stimulation of toll-like receptors, secretion of cytokines or activation of T lymphocytes, in a biological sample of the mammal, preferably in a blood (derivative) sample, where the presence of such alteration indicates presence/risk of cancer development in mammal; (2) the use of a specific nucleic probe of target nucleic acids, comprising of 15-400 bases, for the in vitro detection of human cancer; (3) the use of a nucleic starter allowing the amplification of total or part of a target nucleic acid, being simple-strand, with a length range of 5-50 bases, for in vitro detection of human cancer; (4) a product comprising a support on which at least two probes are immobilized of distinct nucleic acids comprising a complementary and/or specific sequence from at least two distinct target nucleic acids comprising at least two sequences comprising one of 437 fully defined 30-500 amino acid sequences (SEQ ID No. 1-437) given in the specification or at least a gene or RNA indicated in the table 5 or 6, of the specification; and (5) a product comprising a support on which at least a set of two distinct nucleic acid probes comprising a complementary sequence and/or specific nucleic acids of the panels 1-11, are immobilized.

Description

Procédé de détection du cancerMethod of detecting cancer

La présente demande concerne des méthodes et compositions utilisables pour la détection du cancer chez les mammifères, en particulier les humains. Elle décrit notamment des marqueurs sériques des cancers et leurs utilisations dans des méthodes de diagnostic. Elle concerne également des outils et/ou kits utilisables pour la mise en oeuvre de ces méthodes (réactifs, sondes, amorces, anticorps, puces, cellules, etc.), leur préparation et leurs utilisations. L'invention est utilisable pour détecter la présence ou l'évolution d'un cancer chez les mammifères, notamment du cancer du sein, y compris en phase précoce. The present application relates to methods and compositions useful for the detection of cancer in mammals, particularly humans. In particular, it describes serum markers of cancers and their uses in diagnostic methods. It also relates to tools and / or kits that can be used to implement these methods (reagents, probes, primers, antibodies, chips, cells, etc.), their preparation and their uses. The invention is useful for detecting the presence or the evolution of a cancer in mammals, in particular breast cancer, including in the early phase.

Chez la femme, le cancer du sein est la première cause de mortalité par cancer dans les pays industrialisés. L'âge est le facteur de risque le plus important. Ainsi, le risque augmente de 0,5% par année d'âge dans les pays occidentaux. D'autres facteurs de risques sont connus tels que le nombre des grossesses et l'âge de la première grossesse, l'allaitement, l'âge de la puberté et de la ménopause, les traitements oestrogéniques après la survenue de la ménopause, le stress et la nutrition. In women, breast cancer is the leading cause of cancer death in industrialized countries. Age is the most important risk factor. Thus, the risk increases by 0.5% per year of age in Western countries. Other risk factors are known such as the number of pregnancies and the age of first pregnancy, breastfeeding, the age of puberty and menopause, estrogenic treatments after the onset of menopause, stress and nutrition.

Le diagnostic du cancer du sein est généralement effectué par mammographie. Toutefois, il est estimé que la taille minimale d'une tumeur repérable par mammographie est de 1 cm, ce qui présente un passé évolutif de 8 ans en moyenne lors du diagnostic. Les petites tumeurs sont beaucoup moins malignes que ce que l'on peut extrapoler de leurs tailles : l'agressivité des grosses tumeurs ne vient pas seulement de leurs tailles, mais aussi de leur agressivité inhérente , qui augmente avec Page d'une tumeur (Bucchi et al., Br J Cancer 2005, p. 156-161 ; Norden T, Eur J Cancer 1997, p. 624-628). The diagnosis of breast cancer is usually done by mammography. However, it is estimated that the minimum size of a mammogram-detectable tumor is 1 cm, which has an evolutionary history of 8 years on average during diagnosis. Small tumors are much less malignant than can be extrapolated from their sizes: the aggressiveness of large tumors does not only come from their size, but also from their inherent aggression, which increases with a tumor (Bucchi et al., Br J Cancer 2005, 156-161; Norden T, Eur J Cancer 1997, pp. 624-628).

Le bénéfice d'une mammographie a été démontré sur des centaines de milliers de patientes ces 30 dernières années. Toutefois, des biais existent dans ces tests, tels que une dépendance de la sensibilité à Page, à la prise d'hormones, au nombre de mammographies effectuées, à l'expérience du médecin et autres (voir Fletcher and Elmore, NEJM 2003, p. 1672f or Baines CJ, Breast J 2005, S7-10). The benefit of mammography has been demonstrated in hundreds of thousands of patients over the last 30 years. However, biases exist in these tests, such as dependence on Page sensitivity, hormone intake, number of mammograms performed, physician experience, and others (see Fletcher and Elmore, NEJM 2003, p. 1672f or Baines CJ, Breast J 2005, S7-10).

L'analyse de l'expression d'un panel de gènes cibles est également pertinente dans la lutte contre le cancer du sein, et on peut citer notamment l'analyse d'un panel de 176 gènes qui sont exprimés de manière différentielle entre des patientes exprimant le récepteur ER et des patientes n'exprimant pas le récepteur ER (Bertucci et al, Human Molecular Genetics, 2000 ; 9 : 2981-2991). On peut citer également l'analyse d'un panel de 37 gènes qui permet de diagnostiquer un cancer du sein d'une façon précoce (Sharma et al, , Breast cancer Research 2005, 7 : R634-R644). Toutefois, les patientes incluses dans cette étude avaient toutes des soupçons d'une maladie mammaire ( suspect initial mammogram ), et ce panel de gènes pourrait être mal adapté pour un test de routine de diagnostic du cancer du sein préalablement à toute mammographie. The analysis of the expression of a panel of target genes is also relevant in the fight against breast cancer, and we can notably cite the analysis of a panel of 176 genes that are expressed differentially between patients. expressing the ER receptor and patients not expressing the ER receptor (Bertucci et al, Human Molecular Genetics, 2000; 9: 2981-2991). One can also cite the analysis of a panel of 37 genes which makes it possible to diagnose breast cancer in an early manner (Sharma et al., Breast Cancer Research 2005, 7: R634-R644). However, the patients included in this study all had suspicions of breast disease (suspected initial mammogram), and this array of genes might be poorly suited for a routine breast cancer diagnostic test prior to any mammogram.

Il existe donc un besoin important de disposer d'outils et de tests diagnostiques capables de détecter le cancer de manière fiable, simple et à un stade précoce. There is therefore a great need for diagnostic tools and tests that can detect cancer reliably, easily and at an early stage.

La présente demande fournit un ensemble de marqueurs biologiques qui peuvent être utilisés, seuls ou en combinaison(s), de préférence en combinaison(s), pour détecter, caractériser ou suivre, de manière fiable, la présence ou l'évolution d'un cancer chez un mammifère, de préférence le cancer du sein. L'invention est particulièrement avantageuse dans la mesure où elle peut être mise en oeuvre à partir de sang total, sans nécessiter de biopsie tissulaire ni d'étapes de séparations. The present application provides a set of biological markers that can be used, alone or in combination, preferably in combination, to reliably detect, characterize or track the presence or evolution of a cancer in a mammal, preferably breast cancer. The invention is particularly advantageous insofar as it can be carried out from whole blood, without requiring tissue biopsy or separation steps.

Plus particulièrement, la présente demande résulte de l'identification de marqueurs génétiques sériques caractéristiques de patientes humaines présentant un cancer du sein. Ces marqueurs correspondent par exemple à des variations d'épissage ou de niveaux d'expression de gènes, et peuvent permettre, seuls ou, avantageusement, en combinaison, de détecter chez des patients la présence ou le stade d'évolution d'un cancer. Ils permettent avantageusement de détecter la présence d'un cancer du sein, dès les phases précoces de celui-ci (c'est-à-dire notamment à un stade où une mammographie serait inefficace), de manière fiable et simple, à partir d'un échantillon de sang total. More particularly, the present application results from the identification of serum genetic markers characteristic of human patients with breast cancer. These markers correspond for example to variations in splicing or gene expression levels, and may allow, alone or, advantageously, in combination, to detect in patients the presence or stage of evolution of a cancer. They advantageously make it possible to detect the presence of a breast cancer, in the early phases of it (that is to say, in particular at a stage where a mammogram would be ineffective), in a reliable and simple manner, from a sample of whole blood.

Un objet de la présente demande concerne une méthode pour détecter (in vitro ou ex vivo) la présence ou le risque de développer un cancer chez un mammifère, comprenant la détermination de la présence (ou de l'absence ou de la quantité (relative)), dans un échantillon biologique du mammifère, d'une, de préférence plusieurs molécules cibles choisies parmi: a) les acides nucléiques comprenant une séquence choisie parmi SEQ ID NOs: 1- 318 ou un fragment distinctif de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, b) les acides nucléiques ayant une séquence complémentaire d'une séquence selon a), c) les analogues fonctionnels d'acides nucléiques selon a) ou b), ou d) les polypeptides codés par les acides nucléiques selon a) à c), la présence (ou l'absence ou la quantité (relative)) de telles molécules cibles dans l'échantillon étant une indication de la présence ou du risque de développer un cancer chez ce mammifère. An object of the present application relates to a method for detecting (in vitro or ex vivo) the presence or risk of developing cancer in a mammal, including determining the presence (or absence or amount of (relative) ), in a biological sample of the mammal, one, preferably several target molecules selected from: a) nucleic acids comprising a sequence selected from SEQ ID NOs: 1- 318 or a distinctive fragment thereof having at least 15 preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, b) the nucleic acids having a sequence complementary to a sequence according to a), c) the functional analogues of nucleic acids according to a) or b), or d) the polypeptides encoded by the nucleic acids according to a) to c), the presence (or absence or (relative) amount) of such target molecules in the sample being indicative of the presence or of the risk of developing cancer at this mammal.

Dans une variante particulière de mise en oeuvre, la méthode comprend la détermination combinée de la présence ou absence ou quantité (relative) d'au moins 5, 10, 15, 20, 30, 40, 50, 60, 70 ou plus des molécules cibles telles que définies ci-dessus. La détermination "combinée" désigne le fait qu'un profil d'hybridation (ou une signature) impliquant plusieurs marqueurs est déterminé. La détermination combinée est typiquement réalisée de manière simultanée, c'est-à-dire par mesure globale d'un profil d'expression. Néanmoins, la détermination combinée peut également être effectuée par mesures en parallèle ou séquentielle de plusieurs marqueurs, conduisant à l'identification d'un profil. L'invention permet en effet d'établir et de déterminer un profil d'hybridation (ou une signature) sur un ensemble de marqueurs, afin d'évaluer la présence ou le risque de développer un cancer chez un mammifère. Le profil d'hybridation est typiquement réalisé en utilisant une combinaison de plusieurs marqueurs choisis par les cibles indiquées ci-dessus, par exemple contenant l'ensemble de ces cibles. In a particular variant embodiment, the method comprises the combined determination of the presence or absence or (relative) amount of at least 5, 10, 15, 20, 30, 40, 50, 60, 70 or more of the molecules. targets as defined above. The "combined" determination refers to the fact that a hybridization profile (or signature) involving multiple markers is determined. The combined determination is typically performed simultaneously, i.e., by overall measurement of an expression profile. Nevertheless, the combined determination can also be performed by parallel or sequential measurements of several markers, leading to the identification of a profile. The invention makes it possible to establish and determine a hybridization profile (or a signature) on a set of markers, in order to evaluate the presence or risk of developing cancer in a mammal. The hybridization profile is typically performed using a combination of several markers selected by the targets indicated above, for example containing all of these targets.

Dans un mode de mise en oeuvre particulier, la méthode de l'invention comprend la détermination de la présence (ou de l'absence ou de la quantité (relative)), dans un échantillon biologique du mammifère, d'au moins 10 molécules cibles distinctes choisies parmi celles définies ci-dessus. In a particular embodiment, the method of the invention comprises determining the presence (or the absence or the (relative) amount) of at least 10 target molecules in a biological sample of the mammal. distinct from those defined above.

Dans des modes préférés de mise en oeuvre, la méthode de l'invention comprend la détermination combinée de la présence (ou de l'absence ou de la quantité (relative)), dans un échantillon biologique du mammifère, de sous-ensembles particuliers de molécules cibles choisies parmi celles définies ci-dessus. De tels sous-ensembles, décrits dans les exemples, sont particulièrement adaptés à la détection, notamment en phase précoce, de la présence d'un cancer du sein chez des patientes à partir d'un échantillon de sang total. In preferred embodiments, the method of the invention comprises the combined determination of the presence (or absence or (relative) amount), in a biological sample of the mammal, of particular subsets of target molecules selected from those defined above. Such subsets, described in the examples, are particularly suitable for the detection, particularly in the early phase, of the presence of breast cancer in patients from a whole blood sample.

Ainsi, dans un mode de mise en oeuvre particulier, la méthode de l'invention comprend la détermination combinée de la présence (ou de l'absence ou de la quantité (relative)), dans un échantillon biologique du mammifère, des acides nucléiques comprenant les séquences représentées dans SEQ ID NOs: 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228, 240 et 312 ou un fragment distinctif de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, ou ayant une séquence complémentaire de celles-ci et/ou des analogues fonctionnels de ceux-ci provenant d'autres espèces, et/ou des polypeptides codés par ces acides nucléiques. Les exemples fournis dans la présente demande montrent en effet que ce panel de marqueurs permet de détecter de manière prédictive la présence, le risque de développer ou le stade d'évolution d'un cancer. Dans un mode particulier, la méthode comprend en outre la détection d'une ou plusieurs des autres molécules cibles telles que définies précédemment. Thus, in a particular embodiment, the method of the invention comprises the combined determination of the presence (or the absence or the (relative) amount), in a biological sample of the mammal, of nucleic acids comprising the sequences shown in SEQ ID NOs: 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228, 240, and 312, or a distinctive fragment thereof. having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, or having a sequence complementary thereto and / or functional analogues thereof from other species and / or polypeptides encoded by these nucleic acids. The examples provided in the present application indeed show that this panel of markers makes it possible to predictively detect the presence, the risk of developing or the stage of evolution of a cancer. In a particular embodiment, the method further comprises the detection of one or more of the other target molecules as defined above.

Dans un autre mode de mise en oeuvre particulier, la méthode de l'invention comprend la détermination combinée de la présence (ou de l'absence ou de la quantité (relative)), dans un échantillon biologique du mammifère, des acides nucléiques comprenant les séquences représentées dans SEQ ID NOs: 18, 19, 23, 26, 27, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 161, 188, 225, 228, 240, 280 et 312 ou un fragment distinctif de celles-ci ayant au 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, ou ayant une séquence complémentaire de celles-ci et/ou des analogues fonctionnels de ceux-ci provenant d'autres espèces, et/ou des polypeptides codés par ces acides nucléiques. Les exemples fournis dans la présente demande montrent en effet que ce panel de marqueurs permet de détecter de manière prédictive la présence, le risque de développer ou le stade d'évolution d'un cancer. Dans un mode particulier, la méthode comprend en outre la détection d'une ou plusieurs des autres molécules cibles telles que définies précédemment. In another particular embodiment, the method of the invention comprises the combined determination of the presence (or the absence or the (relative) amount), in a biological sample of the mammal, of the nucleic acids comprising the Sequences represented in SEQ ID NOs: 18, 19, 23, 26, 27, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 161, 188, 225, 228, 240, 280 and 312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, or having a sequence complementary thereto and / or functional analogues thereof. from other species, and / or polypeptides encoded by these nucleic acids. The examples provided in the present application indeed show that this panel of markers makes it possible to predictively detect the presence, the risk of developing or the stage of evolution of a cancer. In a particular embodiment, the method further comprises the detection of one or more of the other target molecules as defined above.

Dans un autre mode de mise en oeuvre particulier, la méthode de l'invention comprend la détermination simultanée de la présence (ou de l'absence ou de la quantité (relative)), dans un échantillon biologique du mammifère, des acides nucléiques comprenant les séquences représentées dans SEQ ID NOs: 13, 16-19, 23, 26-28, 47, 51-55, 58, 69, 80, 81, 89, 116, 121, 125, 145, 148, 158, 160, 161, 164, 189, 190, 225, 229, 240, 248, 280, 281, 284, 299, 300, 310 et 312 ou un fragment distinctif de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, ou ayant une séquence complémentaire de celles-ci et/ou des analogues fonctionnels de ceux-ci provenant d'autres espèces, et/ou des polypeptides codés par ces acides nucléiques. Les exemples fournis dans la présente demande montrent en effet que ce panel de marqueurs permet de détecter de manière prédictive la présence, le risque de développer ou le stade d'évolution d'un cancer. In another particular embodiment, the method of the invention comprises the simultaneous determination of the presence (or the absence or the (relative) amount), in a biological sample of the mammal, of the nucleic acids comprising the sequences represented in SEQ ID NOs: 13, 16-19, 23, 26-28, 47, 51-55, 58, 69, 80, 81, 89, 116, 121, 125, 145, 148, 158, 160, 161 , 164, 189, 190, 225, 229, 240, 248, 280, 281, 284, 299, 300, 310 and 312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18 , 19, 20, 25 or 30 consecutive bases, or having a sequence complementary thereto and / or functional analogues thereof from other species, and / or polypeptides encoded by these nucleic acids. The examples provided in the present application indeed show that this panel of markers makes it possible to predictively detect the presence, the risk of developing or the stage of evolution of a cancer.

Dans un mode particulier, la méthode comprend en outre la détection d'une ou plusieurs des autres molécules cibles telles que définies précédemment. In a particular embodiment, the method further comprises the detection of one or more of the other target molecules as defined above.

Dans un autre mode de mise en oeuvre particulier, la méthode de l'invention comprend la détermination simultanée de la présence (ou de l'absence ou de la quantité (relative)), dans un échantillon biologique du mammifère, des acides nucléiques comprenant les séquences représentées dans SEQ ID NOs: 7, 13, 14, 16-19, 23-28, 47, 51-55, 58, 69, 80, 81, 89, 116, 121, 125, 137, 139, 145, 148, 158, 160, 161, 164, 189, 190, 225, 228, 229, 240, 245, 248, 252, 280, 281, 284, 290, 298-300, 310 et 312 ou un fragment distinctif de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, ou ayant une séquence complémentaire de celles-ci et/ou des analogues fonctionnels de ceux-ci provenant d'autres espèces, et/ou des polypeptides codés par ces acides nucléiques. Les exemples fournis dans la présente demande montrent en effet que ce panel de marqueurs permet de détecter de manière prédictive la présence, le risque de développer ou le stade d'évolution d'un cancer. Dans un mode particulier, la méthode comprend en outre la détection d'une ou plusieurs des autres molécules cibles telles que définies précédemment. In another particular embodiment, the method of the invention comprises the simultaneous determination of the presence (or the absence or the (relative) amount), in a biological sample of the mammal, of the nucleic acids comprising the sequences represented in SEQ ID NOs: 7, 13, 14, 16-19, 23-28, 47, 51-55, 58, 69, 80, 81, 89, 116, 121, 125, 137, 139, 145, 148 , 158, 160, 161, 164, 189, 190, 225, 228, 229, 240, 245, 248, 252, 280, 281, 284, 290, 298-300, 310 and 312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, or having a sequence complementary thereto and / or functional analogues thereof from other species, and / or polypeptides encoded by these nucleic acids. The examples provided in the present application indeed show that this panel of markers makes it possible to predictively detect the presence, the risk of developing or the stage of evolution of a cancer. In a particular embodiment, the method further comprises the detection of one or more of the other target molecules as defined above.

Dans un autre mode de mise en oeuvre particulier, la méthode de l'invention comprend la détermination simultanée de la présence (ou de l'absence ou de la quantité (relative)), dans un échantillon biologique du mammifère, des acides nucléiques comprenant les séquences représentées dans SEQ ID NOs: 5, 7, 13, 14, 16-20, 23-28, 47, 51-55, 58, 64, 69, 80, 81, 88-90, 116, 121, 125, 137, 139, 145, 148, 158, 160, 161, 164, 188-191, 208, 222, 225, 228, 229, 236, 240, 242, 245, 248, 252, 280, 281, 284, 290, 298-300 et 309-312 ou un fragment distinctif de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, ou ayant une séquence complémentaire de celles-ci et/ou des analogues fonctionnels de ceux-ci provenant d'autres espèces, et/ou des polypeptides codés par ces acides nucléiques. Les exemples fournis dans la présente demande montrent en effet que ce panel de marqueurs permet de détecter de manière prédictive la présence, le risque de développer ou le stade d'évolution d'un cancer. Dans un mode particulier, la méthode comprend en outre la détection d'une ou plusieurs des autres molécules cibles telles que définies précédemment. In another particular embodiment, the method of the invention comprises the simultaneous determination of the presence (or the absence or the (relative) amount), in a biological sample of the mammal, of the nucleic acids comprising the sequences represented in SEQ ID NOs: 5, 7, 13, 14, 16-20, 23-28, 47, 51-55, 58, 64, 69, 80, 81, 88-90, 116, 121, 125, 137 , 139, 145, 148, 158, 160, 161, 164, 188-191, 208, 222, 225, 228, 229, 236, 240, 242, 245, 248, 252, 280, 281, 284, 290, 298 -300 and 309-312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, or having a sequence complementary thereto and or functional analogues thereof from other species, and / or polypeptides encoded by these nucleic acids. The examples provided in the present application indeed show that this panel of markers makes it possible to predictively detect the presence, the risk of developing or the stage of evolution of a cancer. In a particular embodiment, the method further comprises the detection of one or more of the other target molecules as defined above.

Dans un mode spécifique de mise en oeuvre, la méthode de l'invention comprend la détermination de la présence (ou de l'absence ou de la quantité (relative)), dans un échantillon biologique du mammifère, des acides nucléiques comprenant respectivement les séquences représentées dans SEQ ID NOs: 1-318 ou un fragment distinctif de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, ou des acides nucléiques ayant une séquence complémentaire de celles-ci. In a specific mode of implementation, the method of the invention comprises the determination of the presence (or absence or (relative) amount), in a biological sample of the mammal, of the nucleic acids comprising respectively the sequences represented in SEQ ID NOs: 1-318 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, or nucleic acids having a complementary sequence of these.

Molécule Cible La présente invention repose sur la mise en évidence et la caractérisation d'événements biologiques sériques caractéristiques de la présence d'un cancer du sein chez un patient humain. Ces événements constituent des biomarqueurs, dont la détection chez un patient permet, de préférence en combinaison, de déterminer, même à un stade précoce, le risque de développer un tel cancer ou la présence d'un tel cancer. The present invention is based on the demonstration and characterization of serum biological events characteristic of the presence of breast cancer in a human patient. These events constitute biomarkers, the detection of which in a patient makes it possible, preferably in combination, to determine, even at an early stage, the risk of developing such a cancer or the presence of such a cancer.

Les événements biologiques identifiés correspondent typiquement à des modifications dans la régulation de l'expression de gènes. Il peut s'agir d'une inhibition partielle ou totale de l'expression de gènes ou de certaines formes de gènes, d'une augmentation de l'expression de gènes ou de certaines formes de gènes, de l'apparition ou de la disparition de formes d'épissages de gènes, etc. The identified biological events typically correspond to changes in the regulation of gene expression. It can be a partial or total inhibition of the expression of genes or certain forms of genes, an increase in the expression of genes or certain forms of genes, the appearance or disappearance forms of gene splicing, etc.

L'invention repose donc sur la détection, dans un échantillon, d'une ou plusieurs molécules cibles représentatives des événements biologiques ainsi identifiés. Comme indiqué précédemment, ces molécules cibles peuvent être choisies parmi : The invention is therefore based on the detection, in a sample, of one or more target molecules representative of the biological events thus identified. As indicated above, these target molecules can be chosen from:

a) les acides nucléiques comprenant une séquence choisie parmi SEQ ID NOs: 1- 318 ou un fragment distinctif de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, b) les acides nucléiques ayant une séquence complémentaire d'une séquence selon a), c) les analogues fonctionnels d'acides nucléiques selon a) ou b), ou d) les polypeptides codés par les acides nucléiques selon a) à c). a) nucleic acids comprising a sequence selected from SEQ ID NOs: 1-31 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, b) the nucleic acids having a sequence complementary to a sequence according to a), c) the functional analogues of nucleic acids according to a) or b), or d) the polypeptides encoded by the nucleic acids according to a) to c).

La molécule cible peut être la séquence complète du gène ou de l'ARN ou de la protéine correspondant aux séquences SEQ ID NOs: 1-318, ou un fragment distinctif de celle-ci, c'est-à-dire un fragment dont la séquence est spécifique dudit gène ou ARN, ou de la dite protéine, et/ou comporte un domaine de variabilité (épissage, délétion, polymorphisme, etc.) représentatif de l'événement biologique à détecter. La liste complète des marqueurs, ainsi que les gènes correspondants sont indiqués dans le tableau 1. The target molecule may be the complete sequence of the gene or of the RNA or of the protein corresponding to the sequences SEQ ID NOs: 1-318, or a distinctive fragment thereof, that is to say a fragment of which the sequence is specific for said gene or RNA, or said protein, and / or comprises a domain of variability (splicing, deletion, polymorphism, etc.) representative of the biological event to be detected. The complete list of markers and the corresponding genes are shown in Table 1.

Le terme analogue fonctionnel désigne un analogue provenant d'une autre espèce de mammifère. En effet, les séquences SEQ ID NOs: 1- 318 ont été identifiées à partir de sujets humains, et ces séquences constituent des marqueurs efficaces et adaptés à la détection du cancer chez les patients humains. Néanmoins, pour une application des méthodes de l'invention à d'autres espèces de mammifères, il est généralement préférable d'utiliser des analogues fonctionnels de ces séquences, caractérisés dans l'espèce considérée. Ces analogues peuvent être identifiés par toute technique connue de l'homme du métier, notamment en considération des séquences fournies dans la demande et des noms des gènes correspondants. The term functional analogue refers to an analogue from another mammalian species. Indeed, the sequences SEQ ID NOs: 1- 318 have been identified from human subjects, and these sequences constitute effective and adapted markers for the detection of cancer in human patients. Nevertheless, for an application of the methods of the invention to other mammalian species, it is generally preferable to use functional analogues of these sequences, characterized in the species under consideration. These analogs can be identified by any technique known to those skilled in the art, in particular in view of the sequences provided in the application and the names of the corresponding genes.

Dans un mode de réalisation particulier, la méthode comprend la détermination de la présence d'au moins un acide nucléique selon a) à c). In a particular embodiment, the method comprises determining the presence of at least one nucleic acid according to a) to c).

Dans un mode de réalisation tout particulier, la méthode est utilisée pour détecter un cancer chez un sujet humain et comprend la détermination de la présence d'au moins un acide nucléique selon a) ou b). Plus préférentiellement encore, la méthode comprend la détection combinée de la présence ou absence d'un panel de marqueurs cibles, tels que définis dans la présente demande. In a very particular embodiment, the method is used to detect cancer in a human subject and includes determining the presence of at least one nucleic acid according to a) or b). More preferably still, the method comprises the combined detection of the presence or absence of a panel of target markers, as defined in the present application.

Un mode de réalisation particulier de l'invention réside dans une méthode pour détecter la présence ou le risque de développer un cancer du sein chez un mammifère, comprenant la détermination combinée de la présence ou de la quantité, dans un échantillon de sang du mammifère, des molécules cibles suivantes: a) les acides nucléiques comprenant les séquences représentées dans SEQ ID NOs: 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228, 240 et 312 ou un fragment distinctif de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, et/ou b) les acides nucléiques ayant une séquence complémentaire de séquences selon a), 20 et/ou c) les analogues fonctionnels des acides nucléiques selon a) ou b) provenant d'une autre espèce, et/ou d) les polypeptides codés par les acides nucléiques selon a) à c), la présence de telles molécules cibles dans l'échantillon étant une indication de la présence 25 ou du risque de développer un cancer du sein chez ce mammifère. A particular embodiment of the invention is a method for detecting the presence or risk of developing breast cancer in a mammal, comprising the combined determination of the presence or amount, in a mammalian blood sample, the following target molecules: a) nucleic acids comprising the sequences shown in SEQ ID NOs: 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228 , 240 and 312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, and / or b) nucleic acids having a sequence complementary to sequences according to a), and / or c) the functional analogues of the nucleic acids according to a) or b) from another species, and / or d) the polypeptides encoded by the nucleic acids according to a) to c), the presence of such target molecules in the sample being an indication of the presence or risk of developing breast cancer in this mammal.

Un autre objet spécifique de l'invention réside dans une méthode pour détecter la présence ou le risque de développer un cancer du sein chez un mammifère, comprenant la détection, dans un échantillon de sang du mammifère, des molécules cibles suivantes: 30 a) les acides nucléiques comprenant les séquences représentées dans SEQ ID NOs: 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228, 240 et 312 ou un fragment distinctif de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, et/ou b) les acides nucléiques ayant une séquence complémentaire de séquences selon a), et/ou c) les analogues fonctionnels des acides nucléiques selon a) ou b) provenant d'une autre espèce, et/ou d) les polypeptides codés par les acides nucléiques selon a) à c), la présence, l'absence ou la quantité (relative) de ces molécules cibles dans l'échantillon étant une indication de la présence ou du risque de développer un cancer du sein chez ce 10 mammifère. Another specific object of the invention is a method for detecting the presence or risk of developing breast cancer in a mammal, comprising detecting, in a mammalian blood sample, the following target molecules: nucleic acids comprising the sequences shown in SEQ ID NOs: 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228, 240 and 312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, and / or b) the nucleic acids having a sequence complementary to sequences according to a), and / or c) the functional analogues of the nucleic acids according to a) or b) from another species, and / or d) the polypeptides encoded by the nucleic acids according to a) to c), the presence, absence or quantity ( relative) of these target molecules in the sample being an indication of the presence or risk of developing breast cancer in this mammal.

Détection d'un acide nucléiqueDetection of a nucleic acid

Différentes techniques permettant la détection d'une espèce d'acide nucléique dans un 15 échantillon sont utilisables dans la présente invention, comme par exemple le Northern Blot, l'hybridation sélective, l'utilisation de supports revêtus d'oligonucléotides sondes, l'amplification d'acide nucléique comme par exemple par RT-PCR, PCR quantitative ou ligation-PCR, etc. Ces méthodes peuvent comprendre l'utilisation d'une sonde nucléique (par exemple un oligonucléotide) capable de détecter sélectivement ou spécifiquement 20 l'acide nucléique cible dans l'échantillon. L'amplification peut être réalisée selon différentes méthodes connues en soi de l'homme du métier, telles que la PCR, la LCR, l'amplification médiée par transcription (TMA), l'amplification par déplacement de brin (SDA), NASBA, l'emploi d'oligonucléotides spécifiques d'allèles (ASO), l'amplification spécifique d'allèle, le Southern blot, l'analyse conformationnelle SSCA, l'hybridation in 25 situ (e.g., FISH), la migration sur gel, l'analyse d'hétéroduplexes, etc. Various techniques for detecting a species of nucleic acid in a sample are useful in the present invention, such as, for example, Northern blotting, selective hybridization, the use of probed oligonucleotide-coated media, amplification nucleic acid as for example by RT-PCR, quantitative PCR or ligation-PCR, etc. Such methods may include the use of a nucleic acid probe (eg an oligonucleotide) capable of selectively or specifically detecting the target nucleic acid in the sample. The amplification may be carried out according to various methods known per se to those skilled in the art, such as PCR, CSF, transcription-mediated amplification (TMA), strand displacement amplification (SDA), NASBA, the use of allele-specific oligonucleotides (ASO), allele-specific amplification, Southern blotting, SSCA conformational analysis, in situ hybridization (eg, FISH), gel migration, analysis of heteroduplexes, etc.

Selon un mode préféré de mise en oeuvre, la méthode comprend la détection de la présence ou de l'absence ou de la quantité (relative) d'un acide nucléique selon a) à c) par hybridation sélective ou amplification sélective. L'hybridation sélective est typiquement réalisée en utilisant des sondes nucléiques, de préférence immobilisées sur un support, tel qu'un support solide ou semi-solide présentant 30 au moins une surface, plane ou non, permettant l'immobilisation de sondes nucléiques. De tels supports sont par exemple une lame, bille, membrane, filtre, colonne, plaque, etc. Ils peuvent être réalisés en tout matériau compatible, comme notamment du verre, silice, plastique, fibre, métal, polymère, etc. Les sondes nucléiques peuvent être tout acide nucléique (ADN, ARN, PNA, etc.), de préférence simple-brin, comprenant une séquence spécifique d'une molécule cible telle que définie en a) à c) ci-dessus. Les sondes comprennent typiquement de 5 à 400 bases, de préférence de 8 à 200, plus préférentiellement moins de 100, et encore plus préférentiellement moins de 75, 60, 50, 40 ou même 30 bases. Les sondes peuvent être des oligonucléotides synthétiques, produits sur la base des séquences de molécules cibles de l'invention selon des techniques de synthèse classique. De tels oligonucléotides comportent typiquement de 10 à 50 bases, de préférence de 20 à 40, par exemple 25 bases environ. Dans un mode particulièrement avantageux, on utilise plusieurs oligonucléotides (ou sondes) différents pour détecter la même molécule cible. Il peut s'agir d'oligonucléotides spécifiques de régions différentes de la même molécule cible, ou centrés différemment sur un même région. On peut également utiliser des couples de sondes, dont un membre est parfaitement apparié à la molécule cible, et un autre présente un mésappariement, permettant ainsi d'estimer le bruit de fond. Dans les exemples qui suivent, on a utilisé 6 à 11 couples d'oligonucléotides de 25 bases pour chaque molécule cible. According to a preferred embodiment, the method comprises the detection of the presence or absence or the (relative) amount of a nucleic acid according to a) to c) by selective hybridization or selective amplification. Selective hybridization is typically performed using nucleic probes, preferably immobilized on a support, such as a solid or semi-solid support having at least one surface, flat or otherwise, for immobilizing nucleic probes. Such supports are for example a blade, ball, membrane, filter, column, plate, etc. They can be made of any compatible material, such as glass, silica, plastic, fiber, metal, polymer, etc. The nucleic probes may be any nucleic acid (DNA, RNA, PNA, etc.), preferably single-stranded, comprising a sequence specific for a target molecule as defined in a) to c) above. The probes typically comprise from 5 to 400 bases, preferably from 8 to 200, more preferably less than 100, and even more preferentially less than 75, 60, 50, 40 or even 30 bases. The probes may be synthetic oligonucleotides, produced on the basis of the target molecule sequences of the invention according to conventional synthesis techniques. Such oligonucleotides typically have from 10 to 50 bases, preferably from 20 to 40, for example about 25 bases. In a particularly advantageous mode, several different oligonucleotides (or probes) are used to detect the same target molecule. They may be oligonucleotides specific for different regions of the same target molecule, or centered differently on the same region. It is also possible to use pairs of probes, one of which is perfectly matched to the target molecule, and another has a mismatch, thus making it possible to estimate the background noise. In the following examples, 6 to 11 pairs of 25 base oligonucleotides were used for each target molecule.

Les sondes peuvent être synthétisées préalablement puis déposées sur le support, ou synthétisées directement in situ, sur le support, selon des méthodes connues en soi de l'homme du métier. Les sondes peuvent également être fabriquées par des techniques génétiques, par exemple par amplification, recombinaison, ligation, etc. The probes may be synthesized beforehand and then deposited on the support, or synthesized directly in situ, on the support, according to methods known per se to those skilled in the art. The probes can also be manufactured by genetic techniques, for example by amplification, recombination, ligation, etc.

Les sondes ainsi définies constituent un autre objet de la présente demande, ainsi que leurs utilisations (essentiellement in vitro) pour la détection d'un cancer chez un sujet. De manière particulièrement préférée, on utilise un ensemble de sondes nucléiques comprenant tout ou un fragment de 15 bases consécutives au moins, de préférence de 17, 19, 20, 22 ou 25 bases consécutives au moins, de chacune des séquences SEQ ID NO : 1-318, ou d'un brin complémentaire de celles-ci, avantageusement immobilisées sur un support. The probes thus defined constitute another object of the present application, as well as their uses (essentially in vitro) for the detection of cancer in a subject. Particularly preferably, a set of nucleic probes comprising all or a fragment of at least 15 consecutive bases, preferably at least 17, 19, 20, 22 or 25 consecutive bases, of each of the sequences SEQ ID NO: 1 is used. -318, or a complementary strand thereof, advantageously immobilized on a support.

L'hybridation peut être réalisée dans des conditions classiques, connues de l'homme du métier et ajustables par celui-ci (Sambrook, Fritsch, Maniatis (1989) Molecular Cloning, Cold Spring Harbor Laboratory Press). En particulier, l'hybridation peut être réalisée dans des conditions de stringence élevée, moyenne ou faible, selon le niveau de sensibilité recherché, la quantité de matériel disponible, etc. Par exemple, des conditions appropriées d'hybridation incluent une température comprise entre 55 et 63 C pendant 2 à 18 heures sur des supports de basse densité D'autres conditions d'hybridation peuvent être nécessaires pour des supports de haute densité, comme une température d'hybridation entre 45 et 55 C. Après l'hybridation, différents lavages peuvent être réalisés pour éliminer les molécules non-hybridées, typiquement dans des tampons SSC comprenant du SDS, tels que un tampon comprenant 0,1 à 10 X SSC et 0,5-0,01% SDS. D'autres tampons de lavage contenant du SSPE, du MES, du NaCl ou du EDTA peuvent également être utilisés. Hybridization can be carried out under standard conditions known to those skilled in the art and adjustable by it (Sambrook, Fritsch, Maniatis (1989) Molecular Cloning, Cold Spring Harbor Laboratory Press). In particular, the hybridization can be carried out under conditions of high, medium or low stringency, depending on the desired level of sensitivity, the quantity of available material, etc. For example, suitable hybridization conditions include a temperature of between 55 and 63 ° C for 2 to 18 hours on low density media. Other hybridization conditions may be necessary for high density media, such as hybridization between 45 and 55 C. After hybridization, different washes can be performed to remove unhybridized molecules, typically in SSC buffers comprising SDS, such as a buffer comprising 0.1 to 10 X SSC and 0, 5-0.01% SDS. Other wash buffers containing SSPE, MES, NaCl or EDTA may also be used.

Dans un mode de mise en oeuvre typique, les acides nucléiques (ou les puces ou supports) sont pré-hybridés dans un tampon d'hybridation (Rapid Hybrid Buffer, Amersham) contenant typiquement 100 g/ml d'ADN de sperme de saumon à 65 C pendant 30 min. Les acides nucléiques de l'échantillon sont ensuite mis en contact avec les sondes (typiquement appliqués sur le support ou la puce) à 65 C pendant 2 à 18 heures. De préférence, les acides nucléique de l'échantillon sont marqués au préalable, par tout marquage connu (radioactif, enzymatique, fluorescent, luminescent, etc.). Les supports sont ensuite lavés dans un tampon 5X SSC, 0,1% SDS à 65 C pendant 30 min, puis dans un tampon 0.2X SSC, 0,1% SDS. Le profil d'hybridation est analysé selon des techniques classiques, comme par exemple en mesurant le marquage sur le support au moyen d'un instrument adapté (par exemple Instantlmager, Packard Instruments). Les conditions de l'hybridation peuvent naturellement être ajustées par l'homme du métier, par exemple en modifiant la température d'hybridation et/ou la concentration saline du tampon ainsi que par ajout de substances auxiliaires comme le formamide ou de l'ADN simple brin. In a typical embodiment, the nucleic acids (or the chips or supports) are prehybridized in hybridization buffer (Rapid Hybrid Buffer, Amersham) typically containing 100 g / ml of salmon sperm DNA. 65 C for 30 min. The nucleic acids of the sample are then placed in contact with the probes (typically applied on the support or the chip) at 65 ° C. for 2 to 18 hours. Preferably, the nucleic acids of the sample are marked beforehand with any known labeling (radioactive, enzymatic, fluorescent, luminescent, etc.). The supports are then washed in a 5 × SSC buffer, 0.1% SDS at 65 ° C. for 30 min, then in a 0.2 × SSC buffer, 0.1% SDS. The hybridization profile is analyzed according to conventional techniques, for example by measuring the marking on the support by means of a suitable instrument (for example Instantlmager, Packard Instruments). Hybridization conditions can naturally be adjusted by those skilled in the art, for example by modifying the hybridization temperature and / or the saline concentration of the buffer, as well as by adding auxiliary substances such as formamide or simple DNA. strand.

Un objet particulier de l'invention réside ainsi dans une méthode pour détecter la présence ou le risque de développer un cancer du sein chez un mammifère, comprenant la mise en contact, dans des conditions permettant une hybridation entre séquences complémentaires, des acides nucléiques issus d'un échantillon de sang du mammifère et d'un ensemble de sondes spécifique des molécules cibles suivantes: a) les acides nucléiques comprenant les séquences représentées dans SEQ ID NOs: 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228, 240 et 312 ou un fragment distinctif de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, et/ou b) les acides nucléiques ayant une séquence complémentaire de séquences selon a), et/ou c) les analogues fonctionnels des acides nucléiques selon a) ou b) provenant d'une autre espèce, pour obtenir un profil d'hybridation, le profil d'hybridation étant caractéristique de la présence ou du risque de développer un cancer du sein chez ce mammifère. A particular object of the invention thus lies in a method for detecting the presence or risk of developing breast cancer in a mammal, comprising bringing into contact, under conditions allowing hybridization between complementary sequences, nucleic acids derived from a sample of mammalian blood and a set of probes specific for the following target molecules: a) the nucleic acids comprising the sequences shown in SEQ ID NOs: 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228, 240 and 312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 bases consecutive, and / or b) the nucleic acids having a sequence complementary to sequences according to a), and / or c) the functional analogues of the nucleic acids according to a) or b) from another species, to obtain a profile of hybridization, the hybridization profile being characteristic of the presence or risk of developing breast cancer in this mammal.

Dans des modes particuliers de mise en oeuvre, les procédés de l'invention utilisent en outre d'autres molécules cibles, notamment les sous-ensembles de molécules cibles mentionnés dans la présente demande. In particular embodiments, the methods of the invention also use other target molecules, in particular the subsets of target molecules mentioned in the present application.

Ainsi, un autre objet particulier de l'invention réside dans une méthode pour détecter la présence ou le risque de développer un cancer chez un mammifère, comprenant la mise en contact, dans des conditions permettant une hybridation entre séquences complémentaires, des acides nucléiques issus d'un échantillon de sang du mammifère et d'un ensemble de sondes spécifique d'au moins deux molécules distinctes choisies parmi les cibles suivantes: a) les acides nucléiques comprenant les séquences représentées dans SEQ ID NOs: 1-318 ou un fragment distinctif de celles- ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, et/ou b) les acides nucléiques ayant une séquence complémentaire de séquences selon a), et/ou c) les analogues fonctionnels des acides nucléiques selon a) ou b) provenant d'une autre espèce, pour obtenir un profil d'hybridation, le profil d'hybridation étant caractéristique de la présence ou du risque de développer un cancer chez ce mammifère. Thus, another particular object of the invention resides in a method for detecting the presence or the risk of developing a cancer in a mammal, comprising contacting, under conditions allowing hybridization between complementary sequences, nucleic acids derived from a mammalian blood sample and a set of probes specific for at least two distinct molecules selected from the following targets: a) nucleic acids comprising the sequences shown in SEQ ID NOs: 1-318 or a distinctive fragment of these having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, and / or b) the nucleic acids having a sequence complementary to sequences according to a), and / or c ) the functional analogues of the nucleic acids according to a) or b) from another species, to obtain a hybridization profile, the hybridization profile being characteristic of the presence or risk of deviation elopper cancer in this mammal.

L'amplification sélective est de préférence réalisée en utilisant une amorce ou une paire d'amorces permettant l'amplification de tout ou partie d'un des acides nucléiques cibles dans l'échantillon, lorsque celui-ci y est présent. L'amorce peut être spécifique d'une séquence cible selon SEQ ID NO : 1-318, ou d'une région flanquant la séquence cible dans un acide nucléique de l'échantillon. L'amorce comprend typiquement un acide nucléique simple-brin, d'une longueur comprise avantageusement entre 5 et50 bases, de préférence entre 5 et 30. Une telle amorce constitue un autre objet de la présente demande, ainsi que son utilisation (essentiellement in vitro) pour la détection d'un cancer chez un sujet. The selective amplification is preferably performed using a primer or pair of primers for amplifying all or part of one of the target nucleic acids in the sample, when present therein. The primer may be specific for a target sequence according to SEQ ID NO: 1-318, or a region flanking the target sequence in a nucleic acid of the sample. The primer typically comprises a single-stranded nucleic acid, advantageously between 5 and 50 bases long, preferably between 5 and 30. Such a primer constitutes another object of the present application, as well as its use (essentially in vitro ) for the detection of cancer in a subject.

A cet égard, un autre objet de l'invention réside dans l'utilisation d'une amorce nucléotidique ou d'un ensemble d'amorces nucléotidiques permettant l'amplification de tout ou partie d'un ou, de préférence plusieurs gènes ou ARN comprenant une séquence cible selon SEQ ID NO : 1-318, pour la détection d'un cancer chez un mammifère, de préférence du cancer du sein, tout particulièrement chez un être humain. In this respect, another subject of the invention lies in the use of a nucleotide primer or a set of nucleotide primers allowing the amplification of all or part of one or, preferably, several genes or RNAs comprising a target sequence according to SEQ ID NO: 1-318, for the detection of cancer in a mammal, preferably breast cancer, particularly in a human being.

Détection d'un polypeptideDetection of a polypeptide

Dans un autre mode de réalisation, la méthode comprend la détermination de la présence d'un polypeptide selon d). La mise en évidence d'un polypeptide dans un échantillon peut être réalisée par toute technique connue en soi, comme notamment au moyen d'un ligand spécifique, par exemple un anticorps ou un fragment ou dérivé d'anticorps. De préférence, le ligand est un anticorps spécifique du polypeptide, ou un fragment d'un tel anticorps (par exemple un Fab, Fab', CDR, etc.), ou un dérivé d'un tel anticorps (par exemple un anticorps simple-chaîne, ScFv). Le ligand est typiquement immobilisé sur un support, tel qu'une lame, bille, colonne, plaque, etc. La présence du polypeptide cible dans l'échantillon peut être détectée par mise en évidence d'un complexe entre la cible et le ligand, par exemple en utilisant un ligand marqué, en utilisant un deuxième ligand de révélation marqué, etc. Des techniques immunologiques utilisables et bien connues sont les techniques ELISA, RIA, etc. In another embodiment, the method comprises determining the presence of a polypeptide according to d). The detection of a polypeptide in a sample can be carried out by any technique known per se, such as in particular by means of a specific ligand, for example an antibody or an antibody fragment or derivative. Preferably, the ligand is an antibody specific for the polypeptide, or a fragment of such an antibody (for example an Fab, Fab ', CDR, etc.), or a derivative of such an antibody (for example a single antibody). chain, ScFv). The ligand is typically immobilized on a support, such as a slide, ball, column, plate, etc. The presence of the target polypeptide in the sample may be detected by demonstrating a complex between the target and the ligand, for example using a labeled ligand, using a second labeled revealing ligand, etc. Immunological techniques that can be used and are well known are ELISA, RIA, etc.

Des anticorps spécifiques des polypeptides cibles peuvent être produits par des techniques conventionnelles, notamment par immunisation d'un animal non-humain avec un immunogène comprenant le polypeptide (ou un fragment immunogène de celui-ci), et récupération des anticorps (polyclonaux) ou des cellules productrices (pour produire des monoclonaux). Des techniques de production d'anticorps poly- ou monoclonaux, de fragments ScFv, d'anticorps humains ou humanisés sont décrites par exemple dans Harlow et al., Antibodies: A laboratory Manual, CSH Press, 1988 ; Ward et al., Nature 341 (1989) 544 ; Bird et al., Science 242 (1988) 423 ; WO94/02602 ; US5,223,409 ; US5,877,293 ; WO93/01288. L'immunogène peut être fabriqué par synthèse, ou par expression, dans un hôte approprié, d'un acide nucléique cible tel que défini ci-avant. Un tel anticorps, monoclonal ou polyclonal, ainsi que ses dérivés ayant la même spécificité antigénique, constituent également un objet de la présente demande, de même que leur utilisation pour détecter un cancer. Antibodies specific for the target polypeptides may be produced by conventional techniques, including immunizing a non-human animal with an immunogen comprising the polypeptide (or an immunogenic fragment thereof), and recovering antibodies (polyclonal) or producing cells (to produce monoclonals). Techniques for producing poly- or monoclonal antibodies, ScFv fragments, human or humanized antibodies are described for example in Harlow et al., Antibodies: A laboratory Manual, CSH Press, 1988; Ward et al., Nature 341 (1989) 544; Bird et al., Science 242 (1988) 423; WO94 / 02602; US5,223,409; US5,877,293; WO93 / 01288. The immunogen may be synthetically manufactured, or by expression, in a suitable host, of a target nucleic acid as defined above. Such an antibody, monoclonal or polyclonal, as well as its derivatives having the same antigenic specificity, are also an object of the present application, as well as their use for detecting cancer.

Mise en oeuvre du procédé La méthode de l'invention est applicable à tout échantillon biologique du mammifère testé, en particulier tout échantillon comportant des acides nucléiques ou des polypeptides. On peut citer avantageusement un échantillon de sang, plasma, plaquette, salive, urine, selles, etc., plus généralement tout tissu, organe ou, avantageusement, fluide biologique comportant des acides nucléiques ou des polypeptides. Implementation of the method The method of the invention is applicable to any biological sample of the mammal under test, in particular any sample comprising nucleic acids or polypeptides. A sample of blood, plasma, platelet, saliva, urine, stool, etc., may be advantageously mentioned, more generally any tissue, organ or, advantageously, biological fluid comprising nucleic acids or polypeptides.

Dans un mode de mise en oeuvre préféré et particulièrement avantageux, l'échantillon est un échantillon de sang ou plasma. L'invention découle en effet de l'identification de marqueurs sanguins du cancer, et permet donc une détection de ces pathologies sans biopsie tissulaire, mais uniquement à partir de prélèvements sanguins. In a preferred and particularly advantageous embodiment, the sample is a blood or plasma sample. The invention results indeed from the identification of blood markers of cancer, and thus allows detection of these pathologies without tissue biopsy, but only from blood samples.

L'échantillon peut être obtenu par toute technique connue en soi, par exemple par prélèvement, par des techniques non invasives, à partir de collections ou banques d'échantillons, etc. L'échantillon peut par ailleurs être pré-traité pour faciliter l'accessibilité des molécules cibles, par exemple par lyse (mécanique, chimique, enzymatique, etc.), purification, centrifugation, séparation, etc. Avantageusement, on utilise le système PaxGene (Feezor et al., Physiol Genomics 2004, : pp. 247-254). L'échantillon peut également être marqué, pour faciliter la détermination de la présence des molécules cibles (marquage fluorescent, radioactif, luminescent, chimique, enzymatique, etc.). The sample can be obtained by any technique known per se, for example by sampling, by non-invasive techniques, from collections or sample banks, etc. The sample may also be pre-treated to facilitate the accessibility of the target molecules, for example by lysis (mechanical, chemical, enzymatic, etc.), purification, centrifugation, separation, etc. Advantageously, the PaxGene system is used (Feezor et al., Physiol Genomics 2004, pp. 247-254). The sample can also be labeled, to facilitate the determination of the presence of the target molecules (fluorescent, radioactive, luminescent, chemical, enzymatic labeling, etc.).

Dans un mode de mise en oeuvre préféré, l'échantillon biologique est un échantillon de sang total, c'est-à-dire n'ayant pas subi d'étape de séparation, qui peut être éventuellement dilué. In a preferred embodiment, the biological sample is a whole blood sample, that is to say having not undergone a separation step, which may optionally be diluted.

L'invention est applicable à tout mammifère, de préférence les humains. La méthode de l'invention est particulièrement utile pour la détection du cancer du sein, notamment de la présence, du risque de développement ou du stade de développement du cancer du sein chez l'être humain. Ainsi, les données fournies dans les exemples montrent que l'invention permet de détecter la présence d'un cancer du sein avec une sensibilité supérieure à 92% et une spécificité supérieure à 86%. The invention is applicable to any mammal, preferably humans. The method of the invention is particularly useful for the detection of breast cancer, including the presence, risk of development or stage of development of breast cancer in humans. Thus, the data provided in the examples show that the invention makes it possible to detect the presence of breast cancer with a sensitivity greater than 92% and a specificity greater than 86%.

Un objet particulier de la présente demande concerne une méthode pour détecter la présence, l'évolution ou le risque de développer un cancer du sein chez un sujet humain, comprenant la détermination combinée de la présence (ou de l'absence ou de la quantité (relative)), dans un échantillon biologique du sujet humain, de molécules cibles choisies parmi: a) les acides nucléiques comprenant une séquence choisie parmi SEQ ID NO: 1-318 ou un fragment de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, b) les acides nucléiques ayant une séquence complémentaire d'une séquence selon a), et c) les polypeptides codés par les acides nucléiques selon a) ou b). A particular object of the present application relates to a method for detecting the presence, evolution or risk of developing breast cancer in a human subject, comprising the combined determination of the presence (or absence or quantity thereof ( relative)), in a biological sample of the human subject, target molecules selected from: a) nucleic acids comprising a sequence selected from SEQ ID NO: 1-318 or a fragment thereof having at least 15, preferably at least minus 16, 17, 18, 19, 20, 25 or 30 consecutive bases, b) the nucleic acids having a sequence complementary to a sequence according to a), and c) the polypeptides encoded by the nucleic acids according to a) or b) .

De préférence, la méthode comprend la détermination combinée de la présence, absence ou quantité de 5, 10, 20, 30, 40, 50 ou 60 molécules cibles telles que définies ci-dessus. Preferably, the method comprises the combined determination of the presence, absence or quantity of 5, 10, 20, 30, 40, 50 or 60 target molecules as defined above.

Un autre objet particulier de la présente demande concerne une méthode pour détecter la présence, l'évolution ou le risque de développer un cancer du sein chez un sujet humain, comprenant la mise en contact d'un échantillon biologique du sujet contenant des acides Another particular object of the present application relates to a method for detecting the presence, evolution or risk of developing breast cancer in a human subject, comprising contacting a biological sample of the acid-containing subject.

16 nucléiques avec un produit comprenant un support sur lequel sont immobilisés des acides nucléiques comprenant une séquence complémentaire et/ou spécifique d'une ou, de préférence, plusieurs molécules cibles choisies parmi (i) les acides nucléiques comprenant une séquence choisie parmi SEQ ID NO: 1-318 ou un fragment de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives et (ii) les acides nucléiques ayant une séquence complémentaire d'une séquence selon (i), et la détermination du profil d'hybridation, le profil indiquant la présence, le stade ou le risque de développer un cancer du sein chez ledit sujet humain. De préférence, le produit comprend des acides nucléiques distincts comprenant une séquence complémentaire et/ou spécifique d'au moins 5, 10, 20, 30, 40, 50, 60 ou plus molécules cibles différentes telles que mentionnées ci-dessus. 16 nucleic acids with a product comprising a support on which are immobilized nucleic acids comprising a sequence complementary to and / or specific to one or, preferably, several target molecules chosen from (i) nucleic acids comprising a sequence chosen from SEQ ID NO : 1-318 or a fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases and (ii) nucleic acids having a sequence complementary to a sequence according to (i), and determining the hybridization profile, the profile indicating the presence, the stage or the risk of developing breast cancer in said human subject. Preferably, the product comprises distinct nucleic acids comprising a complementary and / or specific sequence of at least 5, 10, 20, 30, 40, 50, 60 or more different target molecules as mentioned above.

Un autre objet de la présente demande concerne un produit comprenant un support sur lequel sont immobilisés des acides nucléiques comprenant une séquence complémentaire et/ou spécifique d'une ou, de préférence, plusieurs molécules cibles choisies parmi (i) les acides nucléiques comprenant une séquence choisie parmi SEQ ID NO: 1-318 ou un fragment de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives et (ii) les acides nucléiques ayant une séquence complémentaire d'une séquence selon (i). De préférence, le produit comprend des acides nucléiques distincts comprenant une séquence complémentaire et/ou spécifique d'au moins 5, 10, 20, 30, 40, 50, 60 ou plus molécules cibles différentes telles que mentionnées ci-dessus. De préférence, le produit comprend un support sur lequel sont immobilisées au moins 18 sondes d'acides nucléiques distinctes comprenant une séquence complémentaire et/ou spécifique de 18 acides nucléiques cibles distincts choisis parmi SEQ ID NO: 1-318, de préférence de 18 acides nucléiques cibles distincts tels que définis ci-dessus. Un autre objet de la présente demande concerne un produit comprenant un support sur lequel est immobilisé au moins un, de préférence plusieurs, acides nucléiques comprenant une séquence choisie parmi SEQ ID NO: 1-318, ou un analogue fonctionnel de celles-ci. Another subject of the present application relates to a product comprising a support on which are immobilized nucleic acids comprising a complementary and / or specific sequence of one or, preferably, several target molecules selected from (i) nucleic acids comprising a sequence selected from SEQ ID NO: 1-318 or a fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases and (ii) the nucleic acids having a sequence complementary to a sequence according to (i). Preferably, the product comprises distinct nucleic acids comprising a complementary and / or specific sequence of at least 5, 10, 20, 30, 40, 50, 60 or more different target molecules as mentioned above. Preferably, the product comprises a support on which are immobilized at least 18 distinct nucleic acid probes comprising a complementary and / or specific sequence of 18 distinct target nucleic acids chosen from SEQ ID NO: 1-318, preferably from 18 acids. distinct target nuclei as defined above. Another object of the present application relates to a product comprising a support on which is immobilized at least one, preferably several, nucleic acids comprising a sequence chosen from SEQ ID NO: 1-318, or a functional analogue thereof.

De préférence, le produit comprend au moins 5, 10, 20, 30, 40, 50, 60 ou plus acides nucléiques différents choisis parmi les acides nucléiques mentionnés ci-dessus. Dans un mode particulier de mise en oeuvre, le produit comprend chacun des acides nucléiques de séquence SEQ ID NO: 1-318. Preferably, the product comprises at least 5, 10, 20, 30, 40, 50, 60 or more different nucleic acids selected from the nucleic acids mentioned above. In a particular embodiment, the product each comprises nucleic acids of sequence SEQ ID NO: 1-318.

Un autre objet de la présente demande concerne un produit comprenant un support sur lequel est immobilisé au moins un ligand d'un polypeptide codé par un acide nucléique cible tel que défini ci-dessus, c'est-à-dire un acide nucléique comprenant une séquence choisie parmi SEQ ID NO: 1-318, un fragment distinctif de celles-ci ayant au moins 15, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30 bases consécutives, un acide nucléique ayant une séquence complémentaire de celles-ci ou un analogue fonctionnel de celles-ci. Another subject of the present application relates to a product comprising a support on which at least one ligand of a polypeptide encoded by a target nucleic acid as defined above, that is to say a nucleic acid comprising a sequence selected from SEQ ID NO: 1-318, a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, a nucleic acid having a complementary sequence thereof or a functional analogue thereof.

De préférence, le produit comprend au moins 5, 10, 20, 30, 40, 50, 60 ou plus ligands de polypeptides différents choisis parmi les polypeptides mentionnés ci-dessus. Preferably, the product comprises at least 5, 10, 20, 30, 40, 50, 60 or more different polypeptide ligands selected from the polypeptides mentioned above.

Le support peut être tout support solide ou semi-solide présentant au moins une surface, plane ou non, permettant l'immobilisation d'acides nucléiques ou de polypeptides. De tels supports sont par exemple une lame, bille, membrane, filtre, colonne, plaque, etc. Ils peuvent être réalisés en tout matériau compatible, comme notamment du verre, silice, plastique, fibre, métal, polymère, polystyrène, téflon, etc. Les réactifs peuvent être immobilisés sur la surface du support par des techniques connues, ou, dans le cas des acides nucléiques, synthétisés directement in situ sur le support. Des techniques d'immobilisation incluent l'adsorption passive (Inouye et al., J. Clin. Microbiol. 28 (1990) 1469), la liaison covalente. Des techniques sont décrites par exemple dans WO90/03382, WO99/46403. Les réactifs immobilisés sur le support peuvent être ordonnés selon un schéma pré-établi, pour faciliter la détection et l'identification des complexes formés, et selon une densité variable et adaptable. The support can be any solid or semi-solid support having at least one surface, flat or not, allowing the immobilization of nucleic acids or polypeptides. Such supports are for example a blade, ball, membrane, filter, column, plate, etc. They can be made of any compatible material, such as glass, silica, plastic, fiber, metal, polymer, polystyrene, teflon, etc. The reagents can be immobilized on the surface of the support by known techniques, or, in the case of nucleic acids, synthesized directly in situ on the support. Immobilization techniques include passive adsorption (Inouye et al., J. Clin Microbiol 28 (1990) 1469), the covalent bond. Techniques are described for example in WO90 / 03382, WO99 / 46403. The reagents immobilized on the support can be ordered according to a pre-established scheme, to facilitate the detection and identification of the complexes formed, and in a variable and adaptable density.

Dans un mode de mise en oeuvre, le produit de l'invention comprend un pluralité d'oligonucléotides synthétiques, d'une longueur comprise entre 5 et 100 bases, spécifiques d'un ou plusieurs acides nucléiques cibles définis en a) à c). In one embodiment, the product of the invention comprises a plurality of synthetic oligonucleotides, of a length of between 5 and 100 bases, specific for one or more target nucleic acids defined in a) to c).

Les produits de l'invention comprennent typiquement des molécules contrôle, permettant d'étalonner et/ou normaliser les résultats. The products of the invention typically comprise control molecules for calibrating and / or normalizing the results.

Un autre objet de la présente demande concerne un kit comprenant un compartiment ou conteneur comprenant au moins un, de préférence plusieurs, acides nucléiques comprenant une séquence complémentaire et/ou spécifique d'un acide nucléique cible tel que défini ci- dessus et/ou un, de préférence plusieurs ligands d'un polypeptide cible tel que défini précédemment. De préférence, le produit comprend au moins 5, 10, 20, 30, 40, 50, 60 ou plus acides nucléiques et/ou ligands différents choisis parmi les acides nucléiques et ligands mentionnés ci-dessus. Dans un mode particulier de mise en oeuvre, le produit comprend chacun des acides nucléiques de séquence SEQ ID NO: 1-318 ou un ligand pour chacun des polypeptides cibles tels que définis ci-dessus. Le kit peut comprendre par ailleurs des réactifs pour une réaction d'hybridation ou immunologique, ainsi que, le cas échéant, des contrôles et/ou instructions. Another object of the present application relates to a kit comprising a compartment or container comprising at least one, preferably several, nucleic acids comprising a complementary and / or specific sequence of a target nucleic acid as defined above and / or a , preferably several ligands of a target polypeptide as defined above. Preferably, the product comprises at least 5, 10, 20, 30, 40, 50, 60 or more nucleic acids and / or different ligands selected from the nucleic acids and ligands mentioned above. In a particular embodiment, the product comprises each nucleic acid of sequence SEQ ID NO: 1-318 or a ligand for each of the target polypeptides as defined above. The kit may furthermore comprise reagents for a hybridization or immunological reaction, as well as, if appropriate, controls and / or instructions.

Un autre objet de l'invention concerne l'utilisation d'un produit ou kit tel que défini ci-dessus pour la détection d'un cancer chez un sujet mammifère, de préférence un sujet 10 humain, en particulier du cancer du sein. Another subject of the invention relates to the use of a product or kit as defined above for the detection of cancer in a mammalian subject, preferably a human subject, in particular breast cancer.

Un autre objet de l'invention concerne un acide nucléique de séquence choisie parmi SEQ ID NO : 1-318, ou un fragment distinctif de celles-ci comprenant au moins 15 bases consécutives, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30, ou un acide nucléique 15 ayant une séquence complémentaire de celles-ci, ou un analogue fonctionnel de celles-ci. L'invention concerne également un vecteur de clonage ou d'expression comportant ces acides nucléiques, ainsi que toute cellule recombinante comprenant un tel vecteur ou acide nucléique. Another subject of the invention relates to a nucleic acid with a sequence chosen from SEQ ID NO: 1-318, or a distinctive fragment thereof comprising at least 15 consecutive bases, preferably at least 16, 17, 18, 19, 20, 25 or 30, or a nucleic acid having a sequence complementary thereto, or a functional analogue thereof. The invention also relates to a cloning vector or expression comprising these nucleic acids, as well as any recombinant cell comprising such a vector or nucleic acid.

20 Un autre objet de l'invention concerne l'utilisation d'un acide nucléique comprenant une séquence choisie parmi SEQ ID NO: 1-318, ou un fragment distinctif de celles-ci comprenant au moins 15 bases consécutives, de préférence au moins 16, 17, 18, 19, 20, 25 ou 30, ou un acide nucléique ayant une séquence complémentaire de celles-ci, ou un analogue fonctionnel de celles-ci, pour la détection (essentiellement in vitro) d'un cancer 25 chez un sujet mammifère. Another object of the invention relates to the use of a nucleic acid comprising a sequence selected from SEQ ID NO: 1-318, or a distinctive fragment thereof comprising at least 15 consecutive bases, preferably at least 16 , 17, 18, 19, 20, 25 or 30, or a nucleic acid having a sequence complementary thereto, or a functional analogue thereof, for the detection (essentially in vitro) of a cancer in a mammal subject.

Selon un exemple particulier de mise en oeuvre de l'invention, on prélève un échantillon de sang d'un mammifère à tester. L'échantillon de sang est éventuellement traité de manière à rendre les acides nucléiques plus accessibles, et ceux-ci sont marqués. Les acides 30 nucléiques sont ensuite appliqués sur un produit tel que défini ci-avant et le profil d'hybridation est déterminé, permettant de diagnostiquer la présence ou non d'un cancer chez le sujet. La méthode de l'invention est simple, pratiquée ex vivo, et permet la détection précoce d'un cancer à partir d'un échantillon de sang. According to a particular embodiment of the invention, a blood sample is taken from a mammal to be tested. The blood sample is optionally processed to make the nucleic acids more accessible, and these are labeled. The nucleic acids are then applied to a product as defined above and the hybridization profile is determined to diagnose the presence or absence of cancer in the subject. The method of the invention is simple, practiced ex vivo, and allows the early detection of cancer from a blood sample.

Il est entendu que toute technique équivalente peut être utilisée dans le cadre de la présente 5 demande pour déterminer la présence d'une molécule cible. It is understood that any equivalent technique may be used in the context of this application to determine the presence of a target molecule.

D'autres aspects et avantages de l'invention apparaîtront à la lecture des exemples qui suivent, qui doivent être considérés comme illustratifs et non limitatifs. Other aspects and advantages of the invention will appear on reading the examples which follow, which should be considered as illustrative and not limiting.

10 LEGENDE DES FIGURES Figure 1 : Amplification par PCR des cDNAs DATAS issus du profilage n 1 (PBMN) à l'aide de 10 couples d'amorces semi-dégénérées. Le profilage DATAS a été effectué dans les deux directions (cancers stade I & II contre contrôles et contrôles contre cancers stade I 15 & II). Un contrôle positif à l'aide de cDNA issu de cellules HepG2 est inclus. Figure 2 : Amplification par PCR des cDNAs DATAS issus du profilage n 2 (PBMO) à l'aide de 10 couples d'amorces semi-dégénérées. Le profilage DATAS a été effectué dans les deux directions (cancers stade III & IV contre contrôles et contrôles contre cancers stade III & IV). Un contrôle positif à l'aide de cDNA issu de cellules HepG2 est inclus 20 Figure 3 : Amplification par PCR des cDNAs DATAS issus du profilage n 3 (PBMP) à l'aide de 10 couples d'amorces semi-dégénérées. Le profilage DATAS a été effectué dans les deux directions (cancers stade I à IV contre contrôles et contrôles contre cancers stade I à IV). Un contrôle positif à l'aide de cDNA issu de cellules HepG2 est inclus Figure 4 : Configuration spécifique de sondes pour mesurer l'expression de variants 25 générés par épissage alternatif. La sonde A , commune aux deux isoformes mesure l'expression des deux variants. La sonde B, spécifique de la séquence additionnelle de la forme longue, ainsi que les sondes C et D, spécifiques des séquences de jonction autour de cette séquence additionnelle mesurent l'expression de l'isoforme longue. La sonde E, spécifique de la jonction issue de l'absence de la séquence additionnelle, mesure 30 l'expression de l'isoforme courte. FIGURE 1 Figure 1: PCR amplification of DATAS cDNAs derived from profiling # 1 (PBMN) using 10 pairs of semi-degenerate primers. DATAS profiling was performed in both directions (stage I & II cancers versus controls and controls against stage I 15 & II cancers). A positive control using cDNA from HepG2 cells is included. Figure 2: PCR amplification of DATAS cDNAs from profiling n 2 (PBMO) using 10 pairs of semi-degenerate primers. DATAS profiling was performed in both directions (stage III & IV cancers versus controls and controls against stage III & IV cancers). A positive control using cDNA from HepG2 cells is included. Figure 3: PCR amplification of DATAS cDNAs from profiling n 3 (PBMP) using 10 pairs of semi-degenerate primers. DATAS profiling was performed in both directions (stage I to IV cancers versus controls and controls against stage I to IV cancers). A positive control using cDNA from HepG2 cells is included. Figure 4: Specific configuration of probes to measure the expression of variants generated by alternative splicing. Probe A, common to both isoforms, measures the expression of both variants. Probe B, specific for the additional sequence of the long form, as well as probes C and D, specific for the junction sequences around this additional sequence measure the expression of the long isoform. E probe, specific for the junction resulting from the absence of the additional sequence, measures the expression of the short isoform.

Figure 5: Définition des séquences "cibles". 15 nucléotides de part et d'autre de chaque jonction et jusqu'à 500 nucléotides pour les séquences additionnelle et commune en amont sont captures. Figure 6: Séquences cibles et données associées. Description des colonnes : idn : n d'identification de l'événement d'épissage ; Description : type d'événement d'épissage ; forme longue : n d'accession de la forme longue ; forme courte : n d'accession de la forme courte ; cible A à E : Séquences des cible A à E ; long.: taille des séquences de la colonne précédente. Figure 7: Clustering hiérarchique sur 37 contrôles et 55 patientes en utilisant 100 oligonucléotides. Figure 5: Definition of the "target" sequences. 15 nucleotides on either side of each junction and up to 500 nucleotides for the additional and common upstream sequences are captured. Figure 6: Target sequences and associated data. Description of the columns: idn: n identification of the splicing event; Description: type of splicing event; long form: n of accession of the long form; short form: n of accession of the short form; target A to E: Sequences of targets A to E; long .: size of the sequences of the previous column. Figure 7: Hierarchical clustering on 37 controls and 55 patients using 100 oligonucleotides.

EXEMPLESEXAMPLES

1. Caractéristiques des échantillons biologiques 15 Les exemples présentés ci après ont été réalisés initialement à partir de 92 échantillons sanguins (5 ml de sang total, prélevé dans deux tubes PaxGene). Ces échantillons regroupaient 37 échantillons sanguins provenant de patientes contrôles saines (S, obtenus de l'Etablissement Français du Sang) et 55 échantillons de patientes atteintes d'un cancer 20 du sein en phase 1/II (Cl/II). 2. Extraction des ARN totaux de l'échantillon sanguin 1. Characteristics of the biological samples The examples presented below were initially made from 92 blood samples (5 ml of whole blood taken from two PaxGene tubes). These samples included 37 blood samples from healthy control patients (S, obtained from the French Blood Establishment) and 55 samples from patients with stage 1 / II breast cancer (Cl / II). 2. Extraction of total RNA from the blood sample

Les prélèvements sanguins ont été collectés directement dans des tubes PAXGene' Blood 25 RNA (PreAnalytix, Hombrechtikon, CH). Après l'étape de prélèvement de l'échantillon sanguin et afin d'obtenir une lyse totale des cellules, les tubes ont été laissés à température ambiante pendant 4 h puis conservés à -20 C jusqu'à l'extraction du matériel biologique. Plus précisément, dans ce protocole, les ARN totaux ont été extraits à l'aide des kits PAXGene Blood RNA (PreAnalytix) en respectant les recommandations du fabriquant. 30 Brièvement, les tubes ont été centrifugés (15 min, 3000 g) afin d'obtenir un culot d'acides nucléiques. Ce culot a été lavé et repris dans un tampon contenant de la protéinase K nécessaire à la digestion des protéines (10 min à 55 C). Une nouvelle centrifugation (5 min, 19 000g) a été effectuée pour éliminer les débris cellulaires et de l'éthanol a été ajouté afin d'optimiser les conditions de fixation des acides nucléiques. Les ARN totaux ont été spécifiquement fixés sur les colonnes PAXgene RNA spin column et, avant l'élution de ceux-ci, une digestion de l'ADN contaminant a été effectuée à l'aide du RNAse free DNAse set (Qiagen, Hilden, Allemagne). La qualité des ARN totaux a été analysée par le bioanalyzer AGILENT 2100 (Agilent Technologies, Waldbronn, Allemagne). 3. Expériences de profilage DATAS Blood samples were collected directly into PAXGene 'Blood 25 RNA tubes (PreAnalytix, Hombrechtikon, CH). After the blood sample collection step and in order to obtain a total lysis of the cells, the tubes were left at ambient temperature for 4 h and then stored at -20 ° C. until the biological material was extracted. Specifically, in this protocol, the total RNAs were extracted using PAXGene Blood RNA kits (PreAnalytix) according to the manufacturer's recommendations. Briefly, the tubes were centrifuged (15 min, 3000 g) to obtain a nucleic acid pellet. This pellet was washed and taken up in a buffer containing proteinase K necessary for protein digestion (10 min at 55 ° C.). Further centrifugation (5 min, 19000g) was performed to remove cell debris and ethanol was added to optimize the nucleic acid binding conditions. The total RNAs were specifically fixed on the PAXgene RNA spin column and, before elution thereof, digestion of the contaminating DNA was performed using the RNAse free DNAse set (Qiagen, Hilden, Germany). ). The quality of the total RNAs was analyzed by the AGILENT 2100 bioanalyzer (Agilent Technologies, Waldbronn, Germany). 3. DATAS Profiling Experiments

Trois séries de profilage DATAS ont été effectuées entre les groupes suivants : Three series of DATAS profiling were performed between the following groups:

DATAS n 1 : Cancers du sein précoces (Stade I et II) contre groupe Contrôle (étude 15 PBMN) DATAS n 2 : Cancers du sein tardifs (Stade III et IV) contre groupe Contrôle (étude PBMO) DATAS n 3 : Cancers du sein (Stade I, II, III et IV) contre groupe Contrôle (étude PMNP) DATAS n 1: Early breast cancers (Stage I and II) versus control group (PBMN study) DATAS n 2: Late breast cancers (Stage III and IV) versus control group (PBMO study) DATAS n 3: Breast cancer (Stage I, II, III and IV) vs Control group (PMNP study)

20 DATAS est un technologie de profilage de l'expression génétique entre deux échantillons qui permet de caractériser les différences qualitatives au niveau des ARNs messagers, telles que celles générées par épissage alternatif. Cette technologie propriétaire est décrite dans le brevet US 6,251,590. DATAS is a technology for profiling gene expression between two samples that characterizes the qualitative differences in messenger RNAs, such as those generated by alternative splicing. This proprietary technology is described in US Patent 6,251,590.

25 Les ARNs totaux correspondant aux deux situations, l'une normale (mN) et l'autre pathologique (mP), sont isolés à partir des échantillons sanguins à l'aide du système PAXgene (PreAnalytix) décrit ci-dessus. Ces ARNs (50 g par groupe) sont convertis en ADN complémentaires (cN) et (cP) à l'aide de reverse transcriptase (RT) (Invitrogen) et d'un oligonucleotide oligodT25 biotinylé (Invitrogen). Les échantillons ayant contribué 30 aux 50 g par groupe sont indiqués dans les tableaux A à F. The total RNAs corresponding to the two situations, one normal (mN) and the other pathological (mP), are isolated from the blood samples using the PAXgene system (PreAnalytix) described above. These RNAs (50 g per group) are converted to complementary (cN) and (cP) DNAs using reverse transcriptase (RT) (Invitrogen) and a biotinylated oligodT25 oligonucleotide (Invitrogen). Samples contributing 30 to 50 g per group are shown in Tables A to F.

Des hybrides mN/cP et cN/mP sont ensuite réalisés en phase liquide. Après précipitation à l'éthanol de mN et cP et de cN et mP, les précipitats sont repris dans 30 l de solution d'hybridation contenant 80% formamide et 0.1% SDS pour une incubation à 40 C sur la nuit. Les hétéroduplexes sont ensuite capturés à l'aide de billes magnétiques Streptavidine (Dynal). Un aimant permet de maintenir les billes dans le tube pendant les opérations de rinçage. Les billes / hétéroduplexes sont alors reprises dans 50 gl de tampon RNAseH et incubées avec la RNaseH (Invitrogen) 30 minutes à 37 C. Le surnageant est récupéré après une nouvelle application de l'aimant. L'ADN résiduel est éliminé par action de la DNAseI (Ambion). Après inactivation de l'enzyme, les fragments d'ARNs sont précipités à l'éthanol et repris dans de l'eau traitée au DEPC supplémentée de RNAse out (Ambion). Les fragments d'ARNs sont ensuite reverse transcrits (kit Reverse transcription TaqMan, Applied Biosystem) à l'aide d'oligonucléotides hexamères aléatoires. Les ADNs complémentaires obtenus sont alors amplifiés par PCR à l'aide d'amorces semidégénérées. 10 couples d'amorces sont généralement utilisés. Les amplicons obtenus peuvent être visualisés par électrophorèse sur gel d'agarose (figure 1, 2 et 3). On observe des populations amplifiées hétérogènes de nature et d'intensité qui varient selon les amorces et les échantillons utilisés. Ces populations amplifiées sont clonés dans un vecteur de clonage TOPO TA (Invitrogen) pour être transformés dans une souche de bactéries E. Coli compétentes (Invitrogen). Les colonies sont repiquées dans des plaques 96 puits et cultivées sur la nuit dans un milieu 2XTY supplémenté d'ampicilline. Un stock glycérolé (50%) est alors effectué. Généralement, on repique une plaque 96 puits par couple d'amorce pour une des deux directions. Cela fournit ainsi 96x10x2 = 1920 colonies à caractériser. 1728 clones ont été séquencés dans le cadre du profilage DATAS n 1, 1920 ont été clonés et séquencés dans le cadre du profilage DATAS n 2 et 1920 ont été clones et séquencés dans le cadre du profilage DATAS n 3. Hybrids mN / cP and cN / mP are then made in the liquid phase. After ethanol precipitation of mN and cP and cN and mP, the precipitates are taken up in 30 l of hybridization solution containing 80% formamide and 0.1% SDS for incubation at 40 C overnight. The heteroduplexes are then captured using magnetic beads Streptavidin (Dynal). A magnet keeps the balls in the tube during rinsing operations. The beads / heteroduplexes are then taken up in 50 μl of RNAseH buffer and incubated with RNaseH (Invitrogen) for 30 minutes at 37 ° C. The supernatant is recovered after a new application of the magnet. Residual DNA is removed by the action of DNAseI (Ambion). After inactivation of the enzyme, the fragments of RNAs are precipitated with ethanol and taken up in DEPC treated water supplemented with RNAse out (Ambion). The RNA fragments are then reverse transcribed (TaqMan Reverse transcription kit, Applied Biosystem) using random hexamer oligonucleotides. The complementary DNAs obtained are then amplified by PCR using semidegenerated primers. 10 pairs of primers are generally used. The amplicons obtained can be visualized by agarose gel electrophoresis (FIGS. 1, 2 and 3). There are heterogeneous amplified populations of nature and intensity that vary according to the primers and the samples used. These amplified populations are cloned into a TOPO TA cloning vector (Invitrogen) to be transformed into a strain of competent E. coli bacteria (Invitrogen). The colonies are subcultured into 96-well plates and grown overnight in 2XTY supplemented with ampicillin. A glycerol stock (50%) is then made. Generally, a 96-well plate is passed through a pair of primer for one of the two directions. This provides 96x10x2 = 1920 colonies to characterize. 1728 clones were sequenced in the framework of profiling DATAS n 1, 1920 were cloned and sequenced as part of profiling DATAS n 2 and 1920 were cloned and sequenced as part of profiling DATAS n 3.

4. Analyse Bioinformatique Le tableau G récapitule le nombre de clones caractérisés dans les trois banques de profilage. Un clone qualifié de singleton signifie que la séquence de ce clone n'a été identifiée qu'une seule fois dans la banque et qu'aucun autre clone ne recouvre cette30 séquence. Un cluster correspond à un groupe de clones dont les séquences se chevauchent. Les trois profilages DATAS ont ainsi générés 1741 clones non redondants. 4. Bioinformatic Analysis Table G summarizes the number of clones characterized in the three profiling banks. A so-called singleton clone means that the sequence of this clone has been identified only once in the library and that no other clone overlaps this sequence. A cluster is a group of clones whose sequences overlap. The three DATAS profilings generated 1741 non-redundant clones.

Tableau G : Caractérisation des clones obtenus au sein des trois banques DATAS Banques DATAS Cancers I Cancers III & Tous cancers Consolidation &II vs. IV vs. vs. contrôle sur les trois contrôle contrôle banques Nombre de clones 1728 1920 1920 5568 séquencés Nombre de clusters 267 347 303 886 Nombre de 239 362 440 855 singletons Nombre de clones non 506 709 743 1741(*) redondants au sein de la banque Une analyse bioinformatique a été effectuée sur les 1741 clones afin d'identifier les gènes auxquels ils sont associés et les événements d'épissage connus dans les banques publiques d'acides nucléiques correspondants. Table G: Characterization of clones obtained in the three DATAS banks Banks DATAS Cancers I Cancers III & All cancers Consolidation & II vs. IV vs. vs. control over the three control control banks Number of clones 1728 1920 1920 5568 sequenced Number of clusters 267 347 303 886 Number of 239 362 440 855 singletons Number of clones no 506 709 743 1741 (*) redundant within the bank A bioinformatics analysis has 1741 clones were performed to identify the genes they are associated with and the known splicing events in the corresponding public nucleic acid libraries.

Ainsi, les 1741 clones DATAS ont pu être associés à 1170 gènes différents, certains fragments DATAS non chevauchant étant associés à différentes régions du même gène. Thus, the 1741 DATAS clones could be associated with 1170 different genes, some non-overlapping DATAS fragments being associated with different regions of the same gene.

5. Sélection et dessin des événements cibles pour le dessin du microarray La détection et la quantification de l'expression de variants d'épissage par microarray nécessite l'utilisation d'une configuration de sondes particulière. Tout couple ARN messager de référence / variant d'épissage peut être modélisé en tant que Isoforme longue / Isoforme courte (Figure 4). Ainsi, un variant d'épissage associé à un saut d'exon sera l'isoforme courte par rapport au variant de référence. Un variant d'épissage comportant un nouvel exon ou une rétention d'intron sera l'isoforme longue par rapport au variant de référence. Les autres événements d'épissage alternatifs (utilisations de sites cryptiques 5' ou 3' d'épissage) peuvent également être modélisés de la même manière. 5. Selection and Drawing of Target Events for Microarray Drawing The detection and quantification of the expression of microarray splice variants requires the use of a particular probe configuration. Any reference messenger RNA / splice variant pair can be modeled as long isoform / short isoform (Figure 4). Thus, a splice variant associated with an exon skip will be the short isoform relative to the reference variant. A splice variant with a new exon or intron retention will be the long isoform relative to the reference variant. Other alternative splicing events (uses of 5 'or 3' splicing sites) can also be modeled in the same way.

Le jeu de sondes nécessaire à la mesure de l'expression de variants d'épissage est 25 également indiqué sur la figure 4. Ce jeu est composé de deux sondes exoniques traditionnelles A et B et de trois sondes de jonction de type exon-exon ou exon-intron C, D et E. La sonde A mesure l'expression des deux isoformes, les sondes B, C et D l'expression de l'isoforme longue et la sonde E l'expression de la forme courte. The set of probes necessary for measuring the expression of splice variants is also indicated in FIG. 4. This set consists of two traditional exon probes A and B and three exon-exon junction probes or Exon-intron C, D and E. Probe A measures the expression of both isoforms, probes B, C and D express the long isoform and probe E express the short form.

Afin de concevoir toutes ces sondes, il est nécessaire d'identifier les événements d'épissage correspondant aux fragments DATAS, puis les régions cibles à partir desquelles seront dessinées les sondes. Les séquences cibles correspondant aux sondes de jonction C, D et E sont définies par une longueur de 30 nucléotides, 15 nucléotides de part et d'autre de la jonction. Il est ainsi possible de couvrir toute jonction par des sondes de 25 nucléotides de type : 10/15, 11/14, 12/13, 13/12, 14/11 et 15/10 (le signe / représentant la zone de jonction). Les contraintes sont moins fortes sur le sondes exoniques A et B. La séquence additionnelle, spécifique de la forme longue (séquence 2 de la figure 4) et la séquence commune en amont du site d'épissage (séquence 1 de la figure 4) définissent les séquences cibles pour ces sondes avec toutefois un plafond de 500 nucléotides pour les séquences pouvant dépasser cette taille. La définition des séquences cibles est résumée figure 5. In order to design all these probes, it is necessary to identify the splicing events corresponding to the DATAS fragments, then the target regions from which the probes will be drawn. The target sequences corresponding to the junction probes C, D and E are defined by a length of 30 nucleotides, 15 nucleotides on either side of the junction. It is thus possible to cover any junction with probes of 25 nucleotides of the type: 10/15, 11/14, 12/13, 13/12, 14/11 and 15/10 (the sign / representing the junction zone) . The constraints are less strong on the exonic probes A and B. The additional sequence, specific for the long form (sequence 2 of FIG. 4) and the common sequence upstream of the splice site (sequence 1 of FIG. the target sequences for these probes with a ceiling of 500 nucleotides for sequences that can exceed this size. The definition of the target sequences is summarized in Figure 5.

Ainsi, les 1170 gènes correspondant aux 1741 clones DATAS ont été utilisés pour identifier, pour chacun d'entre eux les cDNAs et ESTs représentés dans les banques publiques de données de séquences, présentant potentiellement des différences qualitatives de séquences, source d'événements d'épissages. Les événements retenus sont localisés à moins de 100 nucléotides par rapport aux extrémités 5' ou 3' des fragments DATAS. Thus, the 1170 genes corresponding to the 1741 DATAS clones were used to identify, for each of them, the cDNAs and ESTs represented in the public databases of sequence data, potentially having qualitative differences in sequences, a source of events. splicing. The events retained are located within 100 nucleotides with respect to the 5 'or 3' ends of the DATAS fragments.

2108 événements ont ainsi été retenus et répertoriés dans un classeur Excel dont une partie décrivant les informations extraites est représentée sur la figure 6. Ces informations comprennent, pour un événement donné : un numéro d'identification de l'événement, la nature de l'événement, les numéros d'accès des formes longues et courtes, les séquences cibles A à E et la taille de ces séquences cibles. 2108 events have been selected and listed in an Excel workbook, part of which describes the extracted information is shown in Figure 6. This information includes, for a given event: an identification number of the event, the nature of the event, the access numbers of the long and short forms, the target sequences A to E and the size of these target sequences.

Un contrôle qualité strict a été appliqué pour la définition de ces séquences. Les ambiguités sur certains nucléotides ont été corrigées en les substituant par les nucléotides correspondant sur l'ARN de référence RefSeq ou à partir de l'ADN génomique. Toutes les séquences cibles ont été réalignées afin de confirmer leur appartenance aux formes courtes ou longues appropriées. A strict quality control has been applied for the definition of these sequences. Ambiguities on certain nucleotides were corrected by substituting them with the corresponding nucleotides on RefSeq reference RNA or from genomic DNA. All target sequences were realigned to confirm their membership in the appropriate short or long forms.

6. Dessin des sondes et de la puce Afin de pouvoir mesurer l'expression des 2108 événements décrits précédemment, une puce à ADN à façon a été conçue. Sur cette puce, chaque événement était caractérisé par ses cinq séquences cibles A-E. Pour chaque séquence A et B, 11 couples de sondes de 25 nucléotides ont été conçus, alors que les séquences cibles du type C, D, ou E ont été détectées avec 6 couples de sondes de 25 nucléotides. 6. Drawing Probes and Chip In order to measure the expression of the 2108 events described above, a custom microarray was designed. On this chip, each event was characterized by its five A-E target sequences. For each sequence A and B, 11 pairs of 25 nucleotide probes were designed, whereas the C, D, or E target sequences were detected with 6 pairs of 25 nucleotide probes.

Par couple de sondes, on entend une première sonde qui s'hybridait parfaitement (on parle alors de sondes PM ou perfect match) avec un des ADNc issus d'un transcrit cible, et une deuxième sonde, identique à la première sonde à l'exception d'un mésappariement (on parle alors de sonde MM ou mismatched) au centre de la sonde. Chaque sonde MM servait à estimer le bruit de fond correspondant à une hybridation entre deux fragments nucléotidiques de séquence non complémentaire. (Affyrnetrix technical note "Statistical Algorithms Reference Guide" ; Lipshutz, et al (1999) Nat. Genet. 1 Suppl., 20-24). Si le design d'au moins 6 sondes pour les séquences A et B ou d'au moins 4 sondes pour les séquences C, D, et E était impossible, ces séquences ne figuraient pas sur la puce à façon. Une telle situation peut résulter de séquences de basse complexité, contenant des structures répétitives ou des structures hairpin consécutives ou non-consécutives. Une taille de séquences A-E inférieure à 30 nucléotides menait également à l'exclusion de ces sondes. Uniquement les séquences de bonne qualité, orientées en direction 5' -> 3', servaient pour les design des sondes effectuées selon les consignes d'Affymetrix. By pair of probes is meant a first probe that hybridises perfectly (this is called PM probes or perfect match) with one of the cDNAs from a target transcript, and a second probe, identical to the first probe at the same time. except for a mismatch (this is called a MM or mismatched probe) in the center of the probe. Each MM probe was used to estimate the background noise corresponding to a hybridization between two nucleotide fragments of non-complementary sequence. (Affyrnetrix technical note "Statistical Algorithms Reference Guide", Lipshutz, et al (1999) Nat Genet 1 Suppl., 20-24). If the design of at least 6 probes for sequences A and B or at least 4 probes for sequences C, D, and E was impossible, these sequences were not included on the chip. Such a situation can result from sequences of low complexity, containing repetitive structures or consecutive or non-consecutive hairpin structures. An A-E sequence size of less than 30 nucleotides also led to the exclusion of these probes. Only the good quality sequences, oriented in the 5 '-> 3' direction, were used for the design of the probes carried out according to the instructions of Affymetrix.

7. Synthèse d'ARNc, obtention et marquage des ADNc et fragmentation Afin d'analyser l'expression des transcrits cibles selon l'invention, les 7. Synthesis of cRNA, obtaining and labeling cDNAs and fragmentation In order to analyze the expression of the target transcripts according to the invention, the

ADN complémentaires (ADNc) des ARNm contenus dans les ARN totaux tels que purifiés ci dessus, ont été obtenus à partir de 400 ng d'ARN totaux par l'utilisation d'une enzyme de Klenow 3'-5'-exonuclease, de 100 unités de l'enzyme de transcription reverse SuperScript II (Invitrogen), 10 unités de l'inhibiteur de la RNAse H Superase-IN (Ambion, Huntigdon, UK) et 200 pmol d'amorce random contenant le promoteur T7 (RP-T7-primer, Eurogenetec, Seraing, Belgique). La totalité des ADNc ainsi obtenue a ensuite été engagée dans une transcription in-vitro, qui a été effectuée avec un kit MEGAscript T7 (Ambion) pendant 16h à 37 C. L'ARNc résultant a été ultérieurement purifié sur colonne avec un RNeasy Mini kit (Invitrogen), et la qualité de l'ARNc obtenu a été analysée par le bioanalyzer AGILENT 2100. Les ARNc purifiés ont ensuite été quantifiés par spectrophotométrie, et la solution d'ARNc a été ajustée à une concentration de 1,24 g/gl d'ARNc. Vingt-six microgrammes d'ARNc ont ensuite été dispatchés sur deux tubes Eppendorf, et 3 g d'amorces random sont ajoutés à chaque tube. La transcription reverse a été effectuée avec 800 unités de SuperScriptII (Invitrogen) et 10 unités d'un inhibiteur de la RNAse H, en présence d'enzyme de Klenow, pendant 1h à 37 C. L'ADNc à double-brin résultant de cette approche a ensuite été purifié à l'aide du QIAquick PCR Purification Kit (Qiagen) et quantifié par spectrophotométrie. Seize microgrammes d'ARNc, repartis sur trois tubes Eppendorf, ont ensuite été fragmentés avec 0.6 unités de DNAse I par tube pendant 10 minutes à 37 C. L'efficacité de la fragmentation a été verifiée à l'aide du bioanalyzer 2100 (Agilent). Les ADNc fragmentés ont ensuite été marqués avec de la biotine en utilisant 330 unités du terminale transferase (Roche Molecular Biochemicals, Meylan, France) et 1 l de DNA Labeling Reagent (DLR-la, 5 mM) [Affymetrix] par microgramme de ADNc pendant 60 min à 37 C. Complementary DNAs (cDNA) of the mRNAs contained in the total RNAs as purified above, were obtained from 400 ng of total RNA by the use of a Klenow enzyme 3'-5'-exonuclease, 100 units of the SuperScript II reverse transcription enzyme (Invitrogen), 10 units of the RNAse H Superase-IN inhibitor (Ambion, Huntigdon, UK) and 200 pmol of random primer containing the T7 promoter (RP-T7- primer, Eurogenetec, Seraing, Belgium). All the cDNAs thus obtained were then engaged in in vitro transcription, which was carried out with a MEGAscript T7 kit (Ambion) for 16 h at 37 C. The resulting cRNA was subsequently column purified with an RNeasy Mini kit. (Invitrogen), and the quality of the obtained cRNA was analyzed by the AGILENT 2100 bioanalyzer. The purified cRNAs were then quantified by spectrophotometry, and the cRNA solution was adjusted to a concentration of 1.24 g / gl cRNA. Twenty-six micrograms of cRNA were then dispatched on two Eppendorf tubes, and 3 g of random primers were added to each tube. The reverse transcription was carried out with 800 units of SuperScriptII (Invitrogen) and 10 units of an RNAse H inhibitor, in the presence of Klenow enzyme, for 1 h at 37 C. The double-stranded cDNA resulting from this approach was then purified using the QIAquick PCR Purification Kit (Qiagen) and quantified spectrophotometrically. Sixteen micrograms of cRNA, divided into three Eppendorf tubes, were then fragmented with 0.6 units of DNAse I per tube for 10 minutes at 37 C. The fragmentation efficiency was verified using the bioanalyzer 2100 (Agilent). . The fragmented cDNAs were then labeled with biotin using 330 units of the terminal transferase (Roche Molecular Biochemicals, Meylan, France) and 1 l of DNA Labeling Reagent (DLR-1α, 5 mM) [Affymetrix] per microgram of cDNA during 60 min at 37 C.

La totalité d'ADNc fragmenté et marqué a finalement été hybridée sur la puce à ADN à façon (intitulée A520138F , cf exemple 6) selon un protocole d'hybridation standard 25 adapté à des puces 11 m. All of the fragmented and labeled cDNA was finally hybridized to the custom DNA chip (labeled A520138F, cf Example 6) according to a standard hybridization protocol adapted to 11 m chips.

8. Mise en évidence de profils d'expression pour le diagnostic du cancer du sein à partir d'échantillon sanguins 8. Demonstration of expression profiles for the diagnosis of breast cancer from blood samples

30 8.1. Mise en évidence d'un profil d'expression des transcrits permettant de discriminer les patientes Contrôles (S) des patientes atteintes d'un cancer en stade I/II L'expression d'environ 2000 variants d'ARN, représentant environ 800 gènes, a été analysée et comparée entre des patientes S et C I/II. Pour cela, 16 g d'ADNc fragmentés issus de chaque échantillon ont été ajoutés à un tampon d'hybridation (Affymetrix) et 200 l de cette solution ont été mis en contact pendant 16 h à 50 C sur des puces d'expression. 30 8.1. Demonstration of a Transcript Expression Profile to Discriminate Patients Controls (S) of Patients with Stage I / II Cancer The Expression of Approximately 2000 RNA Variants, Representing About 800 Genes, was analyzed and compared between patients S and CI / II. For this, 16 g of fragmented cDNA from each sample were added to hybridization buffer (Affymetrix) and 200 l of this solution were contacted for 16 h at 50 ° C on expression chips.

Afin d'enregistrer les meilleures performances d'hybridation et de lavage, des ARN qualifiés de contrôle biotinylés (bioB, bioC, bioD et cre) et des oligonucléotides (oligo B2) ont également été inclus dans le tampon d'hybridation. Après l'étape d'hybridation, les ADNc biotinylés et hybridés sur la puce, ont été révélés par l'utilisation d'une solution de streptavidine-phycoerythrine et le signal a été amplifié par l'utilisation d'anticorps anti- streptavidine. L'hybridation a été réalisée dans une étuve d'hybridation GeneChip Hybridisation oven (Affymetrix), et le protocole Euk GE-WS2 du protocole d'Affymetrix a été suivi. Les étapes de lavage et de révélation ont été réalisées sur une station Fluidics Station 450 (Affymetrix). Chaque puce a ensuite été analysée sur un scanner Affymetrix G3000 GeneArray Scanner à une résolution de 1,5 microns afin de repérer les zones hybridées sur la puce. Ce scanner permet la détection du signal émis par les molécules fluorescentes après excitation par un laser argon en utilisant la technique du microscope à épifluorescence. On obtient ainsi pour chaque position, un signal proportionnel à la quantité d'ADNc fixés. Le signal a ensuite été analysé par le logiciel GeneChip Operating Software (GCOS 1.2, Affymetrix). In order to record the best hybridization and washing performances, biotinylated control RNAs (bioB, bioC, bioD and cre) and oligonucleotides (oligo B2) were also included in the hybridization buffer. After the hybridization step, the biotinylated and hybridized cDNAs on the chip were revealed by the use of streptavidin-phycoerythrin solution and the signal was amplified by the use of anti-streptavidin antibodies. Hybridization was performed in a GeneChip Hybridization Furnace (Affymetrix), and the Euk GE-WS2 Protocol of Affymetrix was followed. The washing and revelation steps were performed on a Fluidics Station 450 station (Affymetrix). Each chip was then analyzed on an Affymetrix G3000 GeneArray Scanner scanner at a resolution of 1.5 microns to identify hybridized areas on the chip. This scanner allows the detection of the signal emitted by the fluorescent molecules after excitation by an argon laser using the epifluorescence microscope technique. For each position, a signal is obtained which is proportional to the quantity of cDNAs fixed. The signal was then analyzed by GeneChip Operating Software (GCOS 1.2, Affymetrix).

Afin de prévenir les variations obtenues par l'utilisation de différentes puces, il a été réalisé une approche de normalisation utilisant l'outil Bioconductor , qui permet d'harmoniser la distribution moyenne des données brutes obtenues pour chaque puce. Les résultats obtenus sur une puce peuvent alors être comparés aux résultats obtenus sur une autre puce. Le logiciel GCOS 1.2 permettait aussi d'inclure un algorithme statistique pour considérer si un transcrit était exprimé ou non. In order to prevent the variations obtained by the use of different chips, a standardization approach using the Bioconductor tool, which allows to harmonize the average distribution of the raw data obtained for each chip, was realized. The results obtained on one chip can then be compared to the results obtained on another chip. The GCOS 1.2 software also included the inclusion of a statistical algorithm to consider whether a transcript was expressed or not.

A partir des 6.242 groupes de sondes, représentant environ 2.000 transcrits, de la puce, les inventeurs ont sélectionné les transcrits pertinents qui étaient corrélés au développement d'un cancer du sein. Les transcrits qui ont un niveau d'expression trop faible sur la majorité des puces ainsi que les transcrits qui ne présentent pas de variation importante entre les différentes puces ont été exclus (Li et al, 2001, Bioinformatics, 17 : 1131-1142). From the 6,242 groups of probes, representing about 2,000 transcripts, of the chip, the inventors selected the relevant transcripts that were correlated with the development of breast cancer. Transcripts that have too low a level of expression on the majority of the chips as well as transcripts that do not show significant variation between the different chips have been excluded (Li et al, 2001, Bioinformatics, 17: 1131-1142).

La recherche d'un panel de transcrits discriminant les groupes de patientes EFS et CUII a été réalisée par une technique de Data Mining, nommée random forest algorithm (http://ligarto.org/rdiaz/Papers/jornadas.bioinfo.randomForest.pdf). Outre l'analyse des données avec l'algorithme random forest, qui représente une analyse de type multivariée, une analyse dite univariée servait également à identifier des transcrits différentiellement exprimés entre les patientes EFS et CUII. Cette analyse, nommée SAM (pour Significance Analysis of Microarrays ) est principalement basée sur une version modifiée du test t de Student qui permet de prévenir le biais introduit par les gènes avec une variabilité faible. Cette approche permet de contrôler le taux de gènes faux positifs dans une analyse univariée. The search for a panel of transcripts discriminating groups of patients EFS and CUII was carried out by a technique of Data Mining, named random forest algorithm (http://ligarto.org/rdiaz/Papers/jornadas.bioinfo.randomForest.pdf ). In addition to analyzing the data with the random forest algorithm, which represents a multivariate analysis, a so-called univariate analysis was also used to identify differentially expressed transcripts between the EFS and CUII patients. This analysis, named SAM (for Significance Analysis of Microarrays), is mainly based on a modified version of the Student's t-test, which makes it possible to prevent bias introduced by genes with low variability. This approach controls the level of false positive genes in a univariate analysis.

L'ensemble des analyses mentionnées ci-dessus a permis de mettre en évidence un premier panel de transcrits, comprenant 318 transcrits pertinents selon l'invention (cf tableau 1, SEQ ID Nos: 1-318). L'augmentation ou la diminution d'expression de chacun de ces transcrits, observée chez les patientes S par rapport aux patientes C I/II est présentée dans le tableau 1. All of the analyzes mentioned above made it possible to demonstrate a first panel of transcripts, comprising 318 relevant transcripts according to the invention (see Table 1, SEQ ID Nos: 1-318). The increase or decrease in expression of each of these transcripts observed in patients S versus patients C I / II is presented in Table 1.

Les inventeurs ont étudié l'expression simultanée de 318 transcrits pour obtenir un profil d'expression. En utilisant la méthode de random forest, 90 % des patients étaient correctement classifiés. Plus précisément, 32 des 37 contrôles et 51 des 55 patientes ont été correctement classifiés, ce que correspond à une sensibilité de 92,7 % et une spécificité de 86,4 %. Outre l'analyse sur les 92 échantillons initiaux, une analyse supplémentaire a été effectuée afin de valider la pertinence de la signature identifié ci-dessus : une cohorte indépendante de cinq contrôles saines et 16 cancer du sein phase UII a été analysée en aveugle. L'analyse d'une cohorte indépendante est un des meilleurs moyens pour tester la valeur predictive d'une signature de gènes ou de transcrits (cf. The SMRS working group, Nat Biotech 2005, 7 : p.833-838). Basé sur les séquences SEQ ID Nos: 1-318, l'algorithme random forest a classé correctement cinq contrôles sur cinq et 13 patientes sur seize (86% de classification). The inventors studied the simultaneous expression of 318 transcripts to obtain an expression profile. Using the random forest method, 90% of patients were correctly classified. More specifically, 32 of the 37 controls and 51 of 55 patients were correctly classified, which corresponds to a sensitivity of 92.7% and a specificity of 86.4%. In addition to the analysis of the initial 92 samples, an additional analysis was performed to validate the relevance of the signature identified above: an independent cohort of five healthy controls and 16 phase UII breast cancer was blinded. The analysis of an independent cohort is one of the best ways to test the predictive value of a signature of genes or transcripts (see The SMRS working group, Nat Biotech 2005, 7: p.833-838). Based on the sequences SEQ ID Nos: 1-318, the random forest algorithm correctly classified five controls out of five and 13 patients out of sixteen (86% classification).

8.2. Identification d'un sous-ensemble de 100 marqueurs Les inventeurs ont également étudié l'expression simultanée de 100 transcrits de séquence nucléotidique choisie parmi les séquences présentées dans le tableau 2 pour obtenir un profil d'expression. En utilisant la méthode de random forest, 89 % des patients étaient correctement classifiés. Plus précisement, 31 des 37 contrôles et 51 des 55 patientes ont été correctement classifiés, ce qui correspond à une sensibilité de 92,7 % et une spécificité de 83,7 %. Ces résultats ont été confirmés avec une autre technique d'analyse, le clustering hierarchique. Dans cette analyse non-supervisée, une Contrôle se positionne parmi les patientes (à gauche de la ligne pointillée rouge, cf. figure 7), et 10 patientes figurent parmi les contrôles saines (à droite de la ligne pointillée rouge). La figure 7 représente l'analyse de clustering hiérarchique d'échantillons de sang obtenu à partir de 55 patients atteintes d'un cancer en stade précoce (C 1/II, appelé également D) et 37 patients Contrôles (donneuses saines) en utilisant l'expression de 100 gènes identifiés par l'analyse algorithmique. La fonction de clustering hiérarchique du logiciel Spotfire organise les patients C 1/II et contrôles en colonnes, et les gènes en lignes de manière à obtenir en position adjacente les patients ou les gènes présentant des profils d'expression comparables. Le coefficient de corrélation de Pearson a été utilisé comme indice de similarité pour les gènes et les patients. Les résultats correspondent au niveau de fluorescence Affymetrix normalisé par l'outil bioconductor . Afin de tenir compte des différences constitutives d'expression entre les gènes, les niveaux d'expression de chaque gène ont été normalisés en calculant une variable centrée réduite. Le blanc représente les faibles niveaux d'expression, le gris les niveaux intermédiaires et le noir les niveaux forts. La hauteur des branches du dendrogramme indique l'indice de similarité entre les profils d'expression. Outre l'analyse sur les 92 échantillons initiaux, une analyse supplémentaire a été effectué afin de valider la pertinence de la signature identifié ci dessus : une cohorte indépendante de cinq contrôles saines et 16 cancer du sein phase I/I1 a été analysée en aveugle. Basé sur les top 100, l'algorithme random forest a classé correctement cinq contrôles sur cinq et 14 patientes sur seize (90% de classification). 8.2. Identification of a subset of 100 markers The inventors have also studied the simultaneous expression of 100 nucleotide sequence transcripts chosen from the sequences presented in Table 2 to obtain an expression profile. Using the random forest method, 89% of patients were correctly classified. More specifically, 31 of 37 controls and 51 of 55 patients were correctly classified, which corresponds to a sensitivity of 92.7% and a specificity of 83.7%. These results were confirmed with another analysis technique, hierarchical clustering. In this unsupervised analysis, a Control is positioned among the patients (to the left of the red dotted line, see Figure 7), and 10 patients are among the healthy controls (to the right of the red dotted line). Figure 7 shows the hierarchical clustering analysis of blood samples obtained from 55 patients with early stage cancer (C 1 / II, also called D) and 37 patients with controls (healthy donors) using expression of 100 genes identified by algorithmic analysis. The hierarchical clustering function of the Spotfire software organizes C 1 / II patients and controls in columns, and the genes in rows so as to obtain adjacent patients or genes with comparable expression profiles. The Pearson correlation coefficient was used as a similarity index for genes and patients. The results correspond to the Affymetrix fluorescence level normalized by the bioconductor tool. In order to account for the constitutive differences in expression between genes, the expression levels of each gene were normalized by calculating a reduced centered variable. White represents low levels of expression, gray intermediate levels and black high levels. The height of the branches of the dendrogram indicates the index of similarity between the expression profiles. In addition to the analysis of the initial 92 samples, an additional analysis was performed to validate the relevance of the signature identified above: an independent cohort of five healthy controls and 16 phase I / I1 breast cancer was blinded. Based on the top 100, the random forest algorithm correctly ranked five in five controls and 14 out of sixteen patients (90% classification).

Parmi cette combinaison de 100 gènes marqueurs, les inventeurs ont mis en évidence que des panels plus restreints permettaient également de discriminer les patients contrôles des patientes atteintes d'un cancer du sein, comme décrit dans les exemples suivants. 8.3. Identification d'un panel prédictif de 66 marqueurs Among this combination of 100 marker genes, the inventors have shown that smaller panels also make it possible to discriminate the control patients of patients suffering from breast cancer, as described in the following examples. 8.3. Identification of a predictive panel of 66 markers

Les inventeurs ont mis en évidence une combinaison de 66 marqueurs, basée sur les séquences SEQ ID Nos : 5, 7, 13, 14, 16-20, 23-28, 47, 51-55, 58, 64, 69, 80, 81, 88-90, 116, 121, 125, 137, 139, 145, 148, 158, 160, 161, 164, 188-191, 208, 222, 225, 228, 229, 236, 240, 242, 245, 248, 252, 280, 281, 284, 290, 298-300 et 309-312 (voir tableau 3). Cette combinaison permet de classifier correctement plus de 80 % des échantillons. 8.4. Identification d'un panel prédictif de 53 marqueurs Les inventeurs ont également mis en évidence une combinaison de 53 marqueurs, basée sur les séquences SEQ ID Nos : 7, 13, 14, 16-19, 23-28, 47, 51-55, 58, 69, 80, 81, 89, 116, 121, 125, 137, 139, 145, 148, 158, 160, 161, 164, 189, 190, 225, 228, 229, 240, 245, 248, 252, 280, 281, 284, 290, 298-300, 310 et 312. Cette combinaison permet également de classifier correctement plus de 80 % des échantillons. 8.5. Identification d'un panel prédictif de 42 marqueurs Les inventeurs ont également mis en évidence une combinaison de 42 marqueurs, basée sur les séquences SEQ ID Nos : 13, 16-19, 23, 26-28, 47, 51-55, 58, 69, 80, 81, 89, 116, 121, 125, 145, 148, 158, 160, 161, 164, 189, 190, 225, 229, 240, 248, 280, 281, 284, 299, 300, 310 et 312. Cette combinaison permet également de classifier correctement plus de 80 % des échantillons. 8.6. Identification d'un panel prédictif de 22 marqueurs Les inventeurs ont mis en évidence une combinaison de 22 marqueurs, basée sur les séquences SEQ ID Nos : 18, 19, 23, 26, 27, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 161, 188, 225, 228, 240, 280 et 312. Cette combinaison permet de classifier correctement 76 % des échantillons.30 8.7. Identification d'un panel prédictif de 18 marqueurs The inventors have demonstrated a combination of 66 markers, based on the sequences SEQ ID Nos: 5, 7, 13, 14, 16-20, 23-28, 47, 51-55, 58, 64, 69, 80, 81, 88-90, 116, 121, 125, 137, 139, 145, 148, 158, 160, 161, 164, 188-191, 208, 222, 225, 228, 229, 236, 240, 242, 245, 248, 252, 280, 281, 284, 290, 298-300 and 309-312 (see Table 3). This combination correctly classifies more than 80% of the samples. 8.4. Identification of a predictive panel of 53 markers The inventors have also demonstrated a combination of 53 markers, based on the sequences SEQ ID Nos: 7, 13, 14, 16-19, 23-28, 47, 51-55, 58, 69, 80, 81, 89, 116, 121, 125, 137, 139, 145, 148, 158, 160, 161, 164, 189, 190, 225, 228, 229, 240, 245, 248, 252, 280, 281, 284, 290, 298-300, 310 and 312. This combination also makes it possible to correctly classify more than 80% of the samples. 8.5. Identification of a Predictive Panel of 42 Markers The inventors have also demonstrated a combination of 42 markers, based on the sequences SEQ ID Nos: 13, 16-19, 23, 26-28, 47, 51-55, 58, 69, 80, 81, 89, 116, 121, 125, 145, 148, 158, 160, 161, 164, 189, 190, 225, 229, 240, 248, 280, 281, 284, 299, 300, 310 and 312. This combination also makes it possible to correctly classify more than 80% of the samples. 8.6. Identification of a panel predictive of 22 markers The inventors have demonstrated a combination of 22 markers, based on the sequences SEQ ID Nos: 18, 19, 23, 26, 27, 51, 52, 53, 54, 55, 69 , 80, 125, 145, 148, 161, 188, 225, 228, 240, 280 and 312. This combination correctly classifies 76% of the samples.30 8.7. Identification of a predictive panel of 18 markers

Les inventeurs ont aussi mis en évidence une combinaison de 18 marqueurs, basée sur les séquences cibles SEQ ID NOs: 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228, 240 et 312 présentées dans le tableau 3. Cette combinaison permet de classifier correctement 76 % des échantillons. The inventors have also demonstrated a combination of 18 markers, based on the target sequences SEQ ID NOs: 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228, 240 and 312 shown in Table 3. This combination correctly classifies 76% of the samples.

Les sondes cibles utilisées pour détecter cette combinaison de 18 gènes sont données dans le tableau 5. 76 % des patientes étaient correctement classifiées. The target probes used to detect this combination of 18 genes are given in Table 5. 76% of the patients were correctly classified.

Ceci confirme que l'analyse de l'expression de ces 18 gènes est un bon outil pour discriminer les patientes ayant un fort risque de rechute des patientes ayant un faible risque de rechute. L'utilisation de panel de gènes restreint est particulièrement adaptée pour obtenir un outil de détection et de pronostic. En effet, l'analyse de l'expression d'une dizaine de gènes ne nécessite pas la fabrication à façon de puce à ADN, et peut être mise en oeuvre directement par des techniques de PCR ou de NASBA, ou puce de basse densité, ce qui présente un atout économique important et une mise en oeuvre simplifiée. 5 Tableau A : Echantillons sélectionnés pour le groupe cancer du sein précoce pour le DATAS n 1 (PBMN) Stade Quantité Clinique utilisée I (ug) 10.0 Echantillon D104 D105 I 5.0 D106 I 2.0 D111 I 2.0 D114 I 3.0 D118 I 5.0 D123 I 6.0 D117 II 2.0 D121 II 5.0 D130 II 10.0 Tableau B : Echantillons sélectionnés pour le groupe contrôle pour le DATAS n 1 (PBMN) D1 Echantillon Stade Quantité Clinique utilisée DFS>5 (ug) 1.66 D3 DFS>5 1.66 D9 DFS > 5 1.66 Dl0 DFS>5 1.66 D17 DFS > 5 1.66 D28 DFS > 5 1.66 D36 DFS>5 1.66 D63 DFS>5 1.66 D75 DFS>5 1.66 D76 DFS>5 1.66 D11 DFS < 5 1.66 D14 DFS<5 1.66 D15 DFS < 5 1.66 D16 DFS<5 1.66 D27 DFS<5 1.66 D31 DFS < 5 1.66 D41 DFS<5 1.66 D66 DFS<5 1.66 D102 DFS< 5 1.66 D103 DFS<5 1.66 D67 Benin 1.66 D69 Benin 1.66 D70 Benin 1.66 10 D71 Benin 1.66 D72 Benin 1.66 D73 Benin 1.66 D74 Benin 1.66 D79 Benin 1. 66 D115 Benin 1.66 D125 Benin 1.66 Tableau C : Echantillons sélectionnés pour le groupe cancer du sein tardif pour le DATAS n 2 (PBMO) Echantillon Stade Quantité D55 Clinique utilisée (ug) S III _ 3.33 D59 S III 3.33 D90 SIII 3.33 D134 S III 3.33 D168 S III 3.33 D178 S III 3.33 D126 S III 3.33 D132 S III 3.33 D91 S IV 3.33 D92 S IV 3.33 D101 S IV 3.33 D138 S IV 3.33 D161 S IV 3.33 D170 S IV 3.33 D99 S IV 3.33 Tableau D : Echantillons sélectionnés pour le groupe contrôle pour le DATAS n 2 (PBMO) Echantillon Stade Quantité Dl Clinique utilisée DFS > 5 (ug) 1.66 D3 DFS>5 1.66 D9 DFS>5 1.66 D10 DFS>5 1.66 D17 DFS>5 1.66 D28 DFS>5 1.66 D36 DFS>5 1.66 D63 DFS>5 1.66 D75 DFS>5 1.66 D76 DFS > 5 1.66 D11 DFS<5 1.66 D14 DFS<5 1.66 D15 DFS<5 1.66 D16 DFS<5 1.66 D27 DFS<5 1.66 D31 DFS<5 1.66 D41 DFS<5 1.66 D66 DFS<5 1.66 D102 DFS<5 1.66 D103 DFS<5 1.66 D67 Benin 1.66 D69 Benin 1.66 D70 Benin 1.66 D71 Benin 1.66 D72 Benin 1.66 D73 Benin 1.66 D74 Benin 1.66 D79 Benin 1.66 D115 Benin 1.66 D125 Benin 1.66 Tableau E : Echantillons sélectionnés pour le groupe cancer du sein tous stades pour le 5 DATAS n 3 (PBMP). Echantillon Stade Quantité D91 Clinique utilisée SIV (ug) 1,3 D92 SIV 1,3 D101 SIV 1,3 D138 SIV 1,3 D161 SIV 1,3 D170 SIV 1,3 D99 SIV 1,3 D195 SIV 1,3 D197 SIV 1,3 D205 S IV 1,3 D55 S I I I 1,3 D59 S III 1,3 D90 S I I I 1,3 D134 S III 1,3 D168 S III 1,3 D178 S III 1,3 D126 S III 1,3 D132 S III 1,3 D135 S III 1,3 D185 S III 1,3 D108 S II 1,3 D109 S II 1,3 D148 S II 1,3 D156 S II 1,3 D160 S II 1,3 D162 S II 1,3 D163 S II 1,3 D166 S II 1,3 D167 S II 1,3 D172 S II 1,3 D112 S I 1,3 D120 S I 1,3 D122 S I 1,3 D127 S I 1,3 D131 S I 1,3 D145 S I 1,3 D147 S I 1,3 D153 S I 1,3 D173 S I 1,3 D176 S I 1,3 Tableau F : Echantillons sélectionnés pour le groupe contrôle pour le DATAS n 3 (PBMP) Echantillon Stade Quantité Dl Clinique utilisée DFS>5 (ug) 1.66 D3 DFS>5 1.66 D9 DFS>5 1.66 D10 DFS>5 1.66 D17 DFS > 5 1.66 D28 DFS>5 1.66 D36 DFS>5 1.66 D63 DFS>5 1.66 D75 DFS>5 1.66 D76 DFS>5 1.66 D11 DFS<5 1.66 D14 DFS<5 1.66 D15 DFS<5 1.66 D16 DFS<5 1.66 D27 DFS<5 1.66 D31 DFS<5 1.66 D41 DFS<5 1.66 D66 DFS<5 1.66 D102 DFS<5 1.66 D103 DFS<5 1.66 D67 Benin 1.66 D69 Benin 1.66 D70 Benin 1.66 Benin 1.66 D71 Benin 1.66 D72 Benin 1.66 D73 Benin 1.66 D74 Benin 1.66 D79 Benin 1.66 D115 Benin 1.66 D125 Tableau 1ù Liste de 318 transcrits exprimés différentiellement lors du développement d'un cancer du sein. SEQ Description de la N Genbank N C UII vs. Séquence Cible ID N séquence (reference) Genbank Sains (variant) 1 lysozyme NM 000239.1 BE720647.1 0,8 ATTTATCCTGCAGTGctttgctgcaa gata 2 cDNA AL832453.2 BU634341.1 0,8 AAATAAAATAT CA G G G ATAT G C DKFZ TCCCCCTTGAGACTGAAGGAA DKFZp451G151 CTGAAGATTTTAAACCTTAGTA (Leucine-rich repeat AGAACCACATTTCATCCCTATC kinase 2) AGAGAACTTTCTTGAGGCTTGT CCTAAAGTGGAGAGTTTCAGT GCCA 3 cDNA AL832453.2 BU634341.1 0,8 GAATGAATTTTCTTGctgctatgcctt DKFZp451 G 151 tCt (Leucine-rich repeat kinase 2) 4 cDNA AL832878.1 AI223156.1 0,8 TGCCAAGGAAGACCCCCTCCT DKFZp667I093 GACCCCTGTTCCGGCTTCAGA (Guanine nucleotide AAACCCGTTTAGGGAGAAGAA binding protein, gamma 2) G 1 1 1 1 1 CTGTGCCATCCTTTAA cDNA AL832878.1 AI223156.1 0,7 TGCAGATTTGATGGCctactgtgaa DKFZp667I093 gcaca (Guanine nucleotide binding protein, gamma 2) 6 chemokine (C-C NM 001838.2 BI910219.1 1 GATGAGGTCACGGACGATTAC motif) receptor 7 0,8 ATCGGAGACAACACCACAGTG GACTACACTTTGTTCGAGTCTT TGTGCTCCAAGAAGGACGTGC G GAACTTTAAAG CCTG GTTCCT CCCTATCATGTACTCCATCATT TGTTTCGTGGGCCTACTGGGC AATGGGCTGGTCGTGTTGACC TATATCTATTTCAAGAGGCTCA AGACCATGACCGATACCTACCT GCTCAACCTGGCGGTGGCAGA CATCCTCTTCCTCCTGACCCTT CCCTTCTGGGCCTACAGCGCG GCCAAGTCCTGGGTCTTCGGT GTCCACTTTTGCAAGCTCATCT TTGCCATCTACAAGATGAGCTT CTTCAGTG G CATG CTCCTACTT CTTTG CATCAG CATTGACCG CT ACGTGGCCATCGTCCAGGCTG TCTCAGCTCACCGCCACCGTG CCCGCGTCCTTCTCATCAGCA AGCTGTCCTGTGTGGGCATCT GGATACTAGCCACAGTGCTCT CCATCCC 7 Homo sapiens BC009917.1 BCO28225.1 1,4 ATGAAGAAAAACAAAgtgcacaga hypothetical protein gacccg DKFZp76I A052 8 cDNA clone BCO38965.1 N70893. 1 1,2 AGGACAGCCCTGGGCagagatga IMAGE:3920936 ggcaggg (High density lipoprotein binding protein (vigilin)) 9 cDNA clone BC040042.1 BI001496.1 1,3 TAATGCCAAGACAAAgccacggga IMAGE:5207605 ggagca (Immunoglobulin heavy constant gamma 2 (G2m marker)) 10 cDNA clone BC040042.1 BF841656.1 1,3 CACAGGTGTACACCCTGCCCC IMAGE:5207605 CATCCCGGGAGGAGATGACCA (Immunoglobulin AGAACCAGGTCAGCCTGACCT heavy constant GCCTGGTCAAAGGCTTCTACC gamma 2 (G2m CCAGCGACATCGCCGTGGAGT marker)) GGGAGAGCAATGGGCAGCCG GAGAACAACTACAAGACCACA CCTCCCATGCTGGACTCCGAC GGCTCCTTCTTCCTCTACAGCA AGCTCACCGTGGACAAGAGCA GGTGGCAGCAGGGGAACGTCT TCTCATGCTCCGTGATGCATGA GGCTCTGCACAACCACTACAC GCAGAAGAGCCTCTCCCTGTC TCCGGGTAAATGAGTGCCACG GCCGGCAAGCCCCCGCTCCCC AGGCTCTCGGGGTCGCGTGAG GATGCTTGGCACGTACCCCGT GTACATACTTCCCAGGCACCC AG CATG GAAATAAAG CACCCA GCGCT 11 Microtubule BC048206.1 AW015234. 1,3 TCTCTCTTTCCAATCTTACGCC associated 1 ATG G C CATCAGTTCATTTCAG C monoxygenase, CTTCCAGTGCTACACCCACTTC calponin and d L IM TTG G CTGACACACTTCTG CTCT domain containing 2 AAG GTGACTG GTTTTCTTG CCA ATTTTCAAAGAGTGGTACTAAC CCCCAACCCGCTTTCCGCACC CCGTCCTCTCCGCCAGCAGTA CTGGTTGCACTAACTGTGAGT GTCTTGCATACTGATGGACTCA TTTGGTGGCATGGTTGGCTAA CAGCATGGCGGGGGGTGTTCA GCTTGAGACCCATGCCTGTGT TCATTTCCCATGGAGCTGGCA GCCTGGTCTACCCCAAGTGCA TGCCCCGCCTCTCCTCTCTCC CTTGGGTCTGCCTGCGTGCAT GCTTCTCCAGTTGCGTCTGCG AAGCTACCTACTTTCTTGGGAG GGTCGACCTTGATCATGAAACA ATACCATGAGGGGGCCTCTGT CACCTTTGAAAAGAACACTTTT TGAGCAGCCTCAAAAAGCTCAT ACATAC 12 Microtubule BC048206.1 AF052170.1 1,4 TGGGAGGGTCGACCTTGATCA associated TGAAACAATACCATGAGGGGG monoxygenase, CCTCTGTCACCTTTGAAAAGAA calponin and LIM domain containing 2 CAC I i i i 1 GAGCAGCCTCAAAA AG CTCATACATAC CAG CG CCTT CTTAAATTGGCTCTAATGTAAA GATTGTTAATGTCATTTATCAAA ACCATAG GTGATTATTTG GAG G GATTTAAAAAACTTAATTACTCT CAGGCCTCATCCCAAGCTTGA CACATGCTCTGTAGGTTGAACA CATAATCACAAATATTCTAGCA AATGCTGCCTTGGTTGCAGCC TGCACTGTAGACCCAAGGGTT TTG CTG TG G CTCTTCTTAT CTC CCTTGGCTCATAAAGCCCCAG ATGATGCCAGAGCTTCAATTAG AGCCATCATCATCCCAGGCAG GGATATCTTTGAGAAATGACTC AGTTCAGCCCCAGGCCCCTGT GACTCTGCTTAAAGCACACATT TCTGCTGACTCTTGTACCTGGG GCAGCAGGATAATCACCAACA C 13 cellular homolog of NM 001997.2 NM 001997 0,5 GTCGCCCAGATCAAGgctcatgtag the fox sequence in .2 cctca the Finkel-Biskis- Reilly murine sarcoma virus 14 cellular homolog of NM 001997.2 W17004.1 0,6 GGCCGCATGCTTGGAaggtaaagt the fox sequence in ccatgg the Finkel-Biskis-Reilly murine sarcoma virus 15 cellular homolog of NM 001997.2 AU098396.1 0,6 GTCGCCCAGAGCAAGgctcatgta the fox sequence in gcctca the Finkel-Biskis-Reilly murine sarcoma virus 16 cellular homolog of NM 001997.2 AA063591.1 0,7 TGACCGGCCAGGAAAcggtcgccc the fox sequence in agatca the Finkel-Biskis-Reilly murine sarcoma virus 17 cellular homolog of NM 001997.2 AA094898.1 0,6 CAGGCCGCATGCTTGaggtaaagt the fox sequence in ccatgg the Finkel-Biskis-Reilly murine sarcoma virus 18 cellular homolog of NM 001997.2 AA187006.1 0,7 AGGCCGCATGCTTGGaggtaaagt the fox sequence in ccatgq the Finkel-Biskis-Reilly murine sarcoma virus 19 cellular homolog of NM 001997.2 AA225636.1 0,6 GGCCAGGAAACGGTCgcccagat the fox sequence in caaggta the Finkel-Biskis-Reilly murine sarcoma virus 20 cellular homolog of NM 001997.2 BM820687. 0,5 GGTCGCCCAGATCAAgctcatgtag the fox sequence in 1 cctca the Finkel-Biskis- Reilly murine sarcoma virus 21 cellular homolog of NM 001997.2 AA491544.1 0,7 GCCGCATGCTTGGAGgtaatgtcc the fox sequence in atggtt the Finkel-Biskis- Reilly murine sarcoma virus 22 cellular homolog of NM 001997.2 AV743892.1 0,5 GTCGCCCAGATCAAGgctctgtagc the fox sequence in ctcac the Finkel-Biskis- Reilly murine sarcoma virus 23 cellular homolog of NM 001997.2 BU603086.1 0,6 GAAGTAGCAGGCCGCatgcttgga the fox sequence in ggtaaa the Finkel-Biskis- Reilly murine sarcoma virus 24 cellular homolog of NM 001997.2 BF218408.1 0,6 AGGCCGATGCTTGGAggtaaagtc the fox sequence in catggt the Finkel-Biskis- Reilly murine sarcoma virus 25 cellular homolog of NM 001997.2 AI499403.1 0,7 CCCGCATGCTTGGAGgtaaagtcc the fox sequence in atggtt the Finkel-Biskis- Reilly murine sarcoma virus 26 cellular homolog of NM 001997.2 AW795076. 0,6 CGGTCGCCCAGATCAaggctcatg the fox sequence in 1 tagcct the Finkel-Biskis- Reilly murine sarcoma virus 27 cellular homolog of NM 001997.2 D52122.1 0,6 GGCCGCATGCTTGGggtaaagtcc the fox sequence in atggtt the Finkel-Biskis-Reilly murine ne sarcoma virus 28 cellular homolog of NM 001997.2 BE535673.1 0,6 GCCGCATGCTTGGAGgtaacagtc the fox sequence in catggt the Finkel-Biskis- Reilly murine sarcoma virus 29 re (c osphate ! NM 002355.2 CA430891.1 0,8 CGACACACCCTAGCGgacaatttta receptor ceptor (cation accct dependent) 30 mitogen-activated NM 002419.2 AK090614.1 1,2 TTCCATTCCATGCAGgaaggctgg protein kinase kinase aagcgc kinase 11 31 proteoglycan 1, NM 002727.1 BQ051861.1 0,8 AATCCTCAGTTCAAGgttatcctacg secretory granule caga 32 proteoglycan 1, NM 002727.1 BQ051861.1 0,8 GACTGACC 1 1 1 1 1 CCaaagacgag secretory granule aatcca 33 protein kinase C NM 002743.1 BU631834.1 1,5 CTCGCAGAAACCCAAccgctccac substrate 80K-H caccgt 34 SEC14-like 1 (S. CD366399.1 1,6 GTAGGTAGGTTCGTAgtagggttcg cerevisiae) NM 003003.1 taggt 35 SEC14-like 1 (S. NM 003003.1 AK130317.1 1,3 TAGGGCTAGTAGGTAGGGCTA cerevisiae) GTAGGTAGGGCTAGTAGGTAG GGCTAGTAGGTAGGGCTAGTA GGTAGGGCTAGTAGGTAGGGC TAGTAGGTAGGGTTCGTAGGT AGGGTTCGTAGGTAGGGTTCG TAG GTAG G GTTAGTAG CG CGT CTGTGCTGCTTCCACCTGGTG CTTCCTGTTCCCAAATCACAAG GGCCTGAAGGTGGTCCCTGCT TTCTCTTTCTCTTTCTCTGTGTC TCAGATG G CGATTTTG CTGACA GCTGCCAAGAAAATGCTTCACT CAACAGTCCTCATGTGCCCAG AGATGTTTATAGAACTGTTTGA ATTGCAGCCATCCCCTGCCCC CTCCCAGGCTGAAGATCTGTT C 1 1 1 1 1 AAGTTGATTCGGGAGT GGCATTCTTTTATACCCAAAGA CTGTAGTGCATCTTGAAGAGCT CAAAGCACATGACCGCACAAA TGCTTACAGGGTTTCCTCCCGA GTAATCCAATCTCACTCCCCTT GTAAGG 36 SEC14-like 1 (S. NM 003003.1 AK130317.1 TAGGGTTCGTAGGTAGGGCTA I cerevisiae) GTAGGTAGGGTTAGTAGGTAG GGCTAGTAGGTAGGGCTAGTA GGTAGGGTTAGTAGGTAGGGT TCGTAGGTAGGGCTGGTAGGT AGGGTTAGTAGGTAGGGCTAG TAGGTAGGGCTAGTAGGTAGG G CTAG TAG G TAG G GTTAG TAG GTAGGGCTAGTAGGTAGGGCT AGTAGGTAGGGTTAGTAGGTA 1,4 GGGTTCG SEC14-like 1 (S. NM 003003.1 AK130317.1 AGGTAGGGTTCGTAGgtagggcta 37 cerevisiae) 1,5 gtaggt cysteine-rich acidic NM 003118.1 BG325726.1 GTGAAGAAGATCCATGAGAAT secreted protein 1,4 GAGAAGCGCCTGGAG 38 (osteonectin) cysteine-rich NM 003118.1 BG325726.1 9aC999 protein secreted secreted protein 1,3 tacctc 39 (osteonectin) eted protein acidic NM 003118.1 , AA325849.1 CATGGAGCATTGCACCACCCG secreted secreted protein 40 Î (osteonectin) I 1 1 1 1 CGAGACCTGTGACCTG 1,3 GACAAT 41 cysteine-rich acidic NM 003118.1 AA325849.1 GAGAATGAGAAGCGCCTGGAG secreted protein GCAGGAGACCACCCCGTGGAG (osteonectin) CTGCTGGCCCGGGACTTCGAG AAGAACTATAACATGTACATCT TCCCTGTACACTGGCAGTTCG 1,3 GCCAGCTGGACCA 42 cysteine-rich acidic NM 003118.1 AA325849.1 1,3 GCACCCCATTGACGGgtacctctcc secreted protein cacac (osteonectin) 43 nuclear factor NM 003204.1 BM973053. 1,5 CGGGTCAGTGTACAGgaagaggc (erythroid-derived 2)- 1 aggcact like 1 44 nuclear factor NM 003204.1 BM973053. 1,5 TGCTGTGAGGCAGAGgaatgatgg (erythroid-derived 2)- 1 agaatc like 1 45 synuclein, alpha (non NM 000345.2 NM007308 1,2 TACGAACCTGAAGCCTAAGAAA A4 component of .1 TATCTTTGCTCCCAGTTTCTTG amyloid precursor) AGATCTGCTGACAGATGTTCCA TCCTGTACAAGTGCTCAGTTCC AATGTGCCCAGTCATGACATTT CTCAAAGTTTTTACAGTGTATC TCGAAGTCTTCCATCAGCAGTG ATTGAAGTATCTGTACCTGCCC CCACTCAGCATTTCGGTGCTTC CCTTTCACTGAAGTGAATACAT GGTAGCAGGGTCTTTGTGTGC TGTGGA I I I I GTGGCTTCAATC TACGATGTTAAAACAAATTAAA AACACCTAAGTGACTACCACTT ATTTCTAAATCCTCACTAIliii TTGTTGCTGTTGTTCAGAAGTT GTTAGTGATTTGCTATCATATA TTATAAGAIIIii AGGTGTCTTT TAATGATACTGTCTAAGAATAA TGACGTATTGTGAAATTTGTTA ATATATATAATACTTAAAAATAT GTGAGCATGAAACTATGCACCT ATAAATACTAAATA 46 synuclein, alpha (non NM 000345.2 NM_007308 1,4 CCACAGGAAGGAATTCTGGAA A4 component of .1 GATATGCCTGTGGATCCTGAC amyloid precursor) AATGAGGCTTAT 47 synuclein, alpha (non NM 000345.2 NM_007308 1,5 GACCAGTTGGGCAAGaatgaaga A4 component of .1 aggagcc amyloid precursor) 48 synuclein, alpha (non NM 000345.2 L36674.1 1,4 TAAAGGAATTCATTAGCCATGG A4 component of ATGTATTCATGAAAGGACTTTC amyloid precursor) AAAGGCCAAGGAGGGAGTTGT GGCTGCTGCTGAGAAAACCAA ACAGGGTGTGGCAGAAGCAGC AGGAAAGACAAAAGAGG 49 synuclein, alpha (non NM 000345.2 L36674.1 1,3 GTGTTCTCTATGTAGgctccaaaac A4 component of caagg amyloid precursor) 50 translationally NM 003295.1 AA223997.1 0,5 AAAACCTTTTATGACAG G G G CT controlled tumor GCAGAACAAATCAAGCACATC protein 1 CTTGCTAATTTCAAAAACTACC AG 51 translationally- NM 003295.1 CA848049.1 0,7 AGAtcgcggacgggttgtgcctggaggTG controlled tumor G protein 1 52 translationally- NM 003295.1 CA848049.1 0,5 TCCGACATCTACAAGatccgggag controlled tumor atcgcg protein 1 53 translationa11 NM 003295.1 BCO22436.1 0,6 CCGACATCTACAAGATCCGGG controiled tumor AGATCGCGGACGGGTTGTGCC protein 1 TG 54 translationally- NM 003295.1 BC040008.1 0,5 GCGCCGCTCCGGCTGCACCG controlled tumor CG CTCG CTCCGAGTTTCAG G C protein 1 TCGTGCTAAGCTAGCGCCGTC GTCGTCTCCCTTCAGTCGCCAT CATGATTATCTACC 55 TYRO protein NM 003332.1 BF092099.1 0,8 GAGGGGCTGCGGAGGcagcgac tyrosine kinase ccggaaac binding protein 56 triosephosphate NM 000365.3 BI226906.1 1,3 CATGCTCTGGCAGAGgatggctga isomerase 1 agtcca 57 transgelin 2 NM 003564.1 AA428309.1 1,3 ATTAACACCACTGACtgtg ctcacc acaca 58 uroporphyrinogen NM 000374.2 BQ008745.1 1,5 GTGTGCCGCTGATTGtggaccctg decarboxylase atgaca 59 cold shock domain NM 003651.3 BC009744.1 1,3 GAGGAGGAAGGGAGCGGCAG protein A CAGTGAAGGATTTGACCCCCC TG CCACTGATAG G CAGTTCTCT GGGGCCCGGAATCAGCTGCG CCGCCCCCAGTATCGCCCTCA GTACCGGCAGCGGCGGTTCCC GCCTTACCACGTGGGACAGAC CTTTGACCGTCGCTCACGGGT CTTACCCCAT 60 cold shock domain NM 003651.3 BC009744.1 1,5 CCCAACAGAATACAGgctggtgag protein A attgga 61 solute carrier family NM 003982.2 AW798524. 0,8 TCTCTGCTCTTCAATggtatcatggc 7 (cationic amino 1 attg acid transporter, y+ system), member 7 62 annexin A2 NM 004039.1 AV709641.1 0,9 GACGCTTCTGAGCTaaaagcttcca tgaag 63 annexin A2 NM 004039.1 BF244428.1 1,6 AGCTTGGAGGGTGATgtgtggatg aggtca 64 beta-2-microglobulin NM 004048.2 AV734235.1 0,7 ATTTG GATTG GATGAATTCCAA ATTCTGCTTGCTTGCIiit1AAT ATTGATATGCTTATACACTTAC ACTTTAT G CACAAAATG TAG G G TTATAATAATGTTAACATGGAG ATGATCTTCTTTATAATTCTACT TTGAGTGCTGTCTCCATGTTTG ATGTATCTGAG CAG GTTG CTCC ACAG GTAG CTCTAG GAG G G CT GGCAAC 65 beta-2-microglobulin NM 004048.2 BF912731.1 0,8 AGATAGTTAAGTGGGatcgagaca tgtaag 66 calreticulin NM 004343. 2 CA306742.1 1,4 GACTCCAAG CCTGAGg cag cag a gaaacaa 67 CD74 antigen NM004355.1 BQ029721.1 1,5 ACACCCAGACCCCAGgaagagcc aatgttt 68 HLA-B associated NM 004640.3 BM980603. 1,4 GCCCCAGGAGGAGAGcagtttaaa transcript 1 1 gatttt 69 SEC24 related gene NM 004922.1 AW449995. 1,8 GCAGCTGTCTGGAATgCagaggC family, member C (S. 1 agctgga cerevisiae) 70 actinin, alpha 4 NM 004924.3 CB051741.1 0,7 CCATCGGGGCAGAAGaacttcatc acagct 71 hemoglobin, alpha NM 000517.3 H78334.1 1,4 CGGTCAACTTTCAAGctccttaagc 1/2 cactg 72 hemoglobin, alpha NM 000517.3 H55830.1 1,4 TCAAGGCCGCCTGGGgatgttcctg 1/2 tCCtt 73 cyclin-dependent NM 004935.1 BI669825.1 0,8 GTGGCTCTGAAACGGggtgtgCCg kinase 5 agttcc 74 histone deacetylase 1 NM 004964.2 BF204295.1 1,7 CAGAGGAGAAGAAAGggtcaagg aggaggt 75 WD repeat domain 1 NM 005112.3 BCO30541.1 1,4 CTAAGGAACATCGACgaccacagc cgcttt 76 ubiquitin specific NM 005153.1 W19112. 1 0,7 ATCGGCTCTTTGCAGtggtctaccat protease 10 cacg 77 v-fos FBJ murine NM 005252.2 BG541010.1 0,8 CCTGTCAACGCGCAGgaCttCtgCa osteosarcoma viral cggac oncogenehomolog 78 v-fos FBJ murine NM 005252.2 BG541010.1 1,5 GGCAAGGTGGAACAGgagacaga osteosarcoma viral ccaacta oncogenehomolog 79 v-fos FBJ murine NM 005252.2 BI325046.1 1,5 GGCAAGGTGGAACAGgagacagg osteosarcoma viral acaacta oncogenehomolog 80 hemoglobin, alpha NM 000558.3 R91899.1 1,7 ACCAAGACCTACTTCccggtcaact 1/2 tcaag 81 hemoglobin, alpha NM 000558.3 H58664.1 1,7 GGAGGCCCTGGAGAGctcctaagc 1/2 cactgc 82 actin protein NM 005718.2 AI470163.1 0,7 ATTGCTGTGAAACAGgggtatgata 2/3 complex, subunit tcagc 4, 20kDa 83 B-cell receptor- NM 005745.5 AI962313. 1 1,3 GCAGTTGCCACCTTCCaCgCCtga associated protein 31 gcgtgg 84 RNA binding motif NM 005778.1 CA488450.1 0,8 GAGCAGACAAGTTTGactctgaac protein 5 aggaag 85 CD164 antigen, NM 006016.3 AF299342.1 0,8 CTGTGACTCCAACCTCACAACC sialomucin TGTGCGAAAGTCTACCTTTGAT GCAGCCAGTTTCATTGGAGGA ATTGTCCTGGTCTTGGGTGTG CAGGCTGTAATTTTCTTTCTTTA TAAATTCTGCAAATCTAAAGAA CGAAATTACCACACTCTGTAAA CAGACCCATTGAATTAATAAGG ACTGGTGATTCATTTGTGTAAC TCACTGAAGCCAAAATACTATC TTTTAAGATGTCCCACATGGAA GACGCTATTCCAGGATCTTTAA ATTTCCATGGATGCATATAGGA TGTTTGGGAGCATCATCCGTG AAGAAAAAATCAATTAAATCATT GTGTTCAACAGGAATATTTAAA ATATTCTGCATGAATCCTGTGG CTGTCTTATTTTAAATAGCTGC TGCTGTGGGATTATA 1 1 1 1 1 1 1 1 CCTTAACATGCCAAATATAACT TTCTGAAAGTGATGGAAAATGT TGTCTTGTGCAGACAACATCAT GGCTCTTGGCAGTTTA 86 CDI64 antigen, NM 006016.3 AF299343.1 0,8 CAGCCAATTCTACAGctaaaccca sialomucin cagttc 87 ubiquitous alpha NM 006082.1 BG981396.1 1,3 CACCCTGAGCAGCTCaccaccca tubulin caccacc 88 talin 1 NM 006289.2 AI393487.1 1,4 CCCAGAGTATTAACGCTCCAA GAGTATTATTAACGCTGCTGTA CCTCGATCTGAATCTGCCGGG GCCCCAGCCCACTCCACCCTG CCAGCAGCTTCCAGCCAGTCC CCACAGCCTCATCAGCTCTCTT CACCGIIIIIIGATACTATCTT CCCCCACCCCCAGCTACCCAT AGGGGCTGCAGAGTTATAAGC CCCAAACAGGTCATGCTCCAAT AAAAATGATTCTACCTACAA 89 talin 1 NM 006289.2 AI393487.1 1,4 TGCTTCGGAAGGAACgagagctgg aagagg 90 talin 1 NM 006289.2 AI417760.1 1,4 AGGAAGAAATGCTTCggaaggaa cgagagc 91 acidic (leucine-rich) NM 006305.2 AW577170. 1,4 AGGAGGATGAGGATGgaggatga nuclear 1 agaaggt phosphoprotein 32 family, member A 92 translocase of inner NM 006327.1 AU142330.1 0,7 GATGTTGCATGACAGgggcactttg mitochondrial ggcta membrane 23 homolog (yeast) 93 microspherule NM 006337.3 CD369365.1 1,5 AACTCTGTGGTGGAGacaggaag protein 1 ctggggc 94 APG7 autophagy 7 NM 006395.1 BO000091.1 0,7 CTTGTGCCTCACCAGatccgggga like (S. cerevisiae) tttCtt 95 acidic (leucine-rich) NM 006401.1 Y07570.1 1,5 GAAGTCAGTGAGGAGgaagaaga nuclear atttgga phosphoprotein 32 family, member B 96 acidic (leucine-rich) NM 006401.1 Y07570.1 1,4 AG GATGAG GATGAAG ag g agg aa nuclear gaaggtq phosphoprotein 32 family, member B 97 acidic (leucine-rich) NM 006401.1 BF195216.1 1,6 GAAGTCAGTGAGGAGaggaggaa nuclear gaaggtg phosphoprotein 32 family, member B 98 butyrophilin, NM 007049.2 BQ929466.1 1,3 CATTCTTACATGCTGaggaccgga subfamily 2, member gaagtg Al 99 EAP30 subunit of NM 007241.2 BF525899.1 1,2 CTGCAGCTGGCAGAGaagaatgg ELL complex ctacgtg 100 coatomer protein NM 007263.2 BM798704. 1,5 CCACGAGAGTCGGAGgaaggagC complex, subunit 1 epsilon tgaagag 101 soluble galactoside- NM 009587.1 BG390210.1 1,2 TACATCAGCTTCCAGacccagaca binding lectin 9 gtcatc (galectin 9) 102 soluble galactoside- NM 009587.1 BG698264.1 0,8 CTCCAGTGGAACCAGgtttgctgtg binding lectin 9 aactt (galectin 9) 103 CDKNIA interacting NM 012127.1 AK027287.1 1,3 TAGAAGCTGGTGGAGgtgaggtcc zinc finger protein 1 agagat 104 CDKNIA interacting NM 012127.1 AK027287.1 1,2 CAGGCACATTCACAGccgcatctgc zinc finger protein 1 cacaa 105 F-box protein 7 NM 012179.2 BE905968.1 1 1,3 AATTTTGAAGCTGAGTCAATTC AAGATAATG CG CATATG G CAG AG G G CACAG GTTTCTATCCCT CAGAACCCATGCTCTGTAGTG AATCGGTGGAAGGGCAAGTGC CACAZTCATTAGAGACCTTGTA TCAATCAGCTGACTGTTCTGAT GCCAATGATGCCTTGATAGTGT TGATACATCTTCTCATGTTGGA GTCA 106 WW domain binding NM 012478.2 BG820375.1 1,2 CCACCTCCCTACTACCCaccggaa protein 2 gataag 107 px19-likeprotein NM 013237.2 BM504662. 1,3 CACGCCCGGCTGATGggaatttggt 1 cttgc 108 insulin-like growth NM 000876.1 BM787853. 1,4 GGAAACAGTGATAAGTAAGCT factor 2 receptor 1 GACCACTTGCTGTAGGAGAAG TTCCAACGTGTCC 109 insulin-like growth BM787853. 1,3 GACGCATCTCAAAACAGAGGG factor 2 receptor NM 000876.1 1 CTGCATTCGAAGAAACCCTTGC TGCTTTAGTCCCGATAGGGTAT TTGACCCCGATATATTTTAGCA IIIIAATTCTCTCCCCCTATTTA TTGACTTTGACAATTACTCAGG TTTGAGAAAAAGGAAAAAAAAA CAGCCACCGTTTCTTCCTGCCA GCAGGGGTGTGATGTACCAGT TTGTCCATCTTGAGATGGTGAG GCTGTCAGTGTATGGGGCAGC TTCCGGCGGGATGTTGAACTG GTCATTAATGTGTCCCCTGAGT TG GAG CTCATTCTGTCTCTTTT CTCIIIIGCTTTCTGTTTCTTAA GGGCACACACACGTGCGTGCG AGCACACACACACATACGTGC ACAGGGTCCCCGAGTGCCTAG G I I I I GGAGAGTTTGCCTGTTC TATGCCTTTAGTCAGGAATGGC TGCACC I Iii I GCATGATATCT TCAAGCCTGGGCGTACAGAGC ACATTTGTCAGTA 1 1 1 1 1 GCCG insulin-like growth AAGGAGGTCAGGCCCCACTCC 110 factor 2 receptor NM 000876.1 BF222741.1 1,3 j TTCCTGATTGTTTACAGTCATT GGAATAAGGCATGGCTCAGAT CGGCCACAGGGCGGTACCTTG TGCCCAGGGTTTTGCCCCAAG TCCTCATTTAAAAGCATAAGGC CGGACGCATCTCAAAACAGAG GGCTGCATTCGAAGAAACCCT TG CTG CTTTAGTC C C GATAG G GTATTTGACCCCGATATATTTT AGCATTTTAATTCTCTCCCCCT ATTTATTGACTTTGACAATTACT CAGGTTTGAGAAAAAGGAAAAA AAAACAGCCACCGTTTCTTCCT GCCAGCAGGGGTGTGATGTAC CAGTTTGTCCATCTTGAGATGG TGAGGCTGTCAGTGTATGGGG CAGCTTCCGGCGGGATGTTGA ACTGGTCATTAATGTGTCCCCT GAGTTGGAGCTCATTCTGTCTC TTTTCTCTTTT G CTTTCTGTTTC TTAAGGGCACACACACGTGCG TGCGAGCACACACACACATAC GTGC 111 insulin-like growth NM 000876.1 BF222741.1 GTGGCTGATGGAAGAgatccagct factor 2 receptor 1,3 gcctcc 112 adducin 1 NM 014190.2 CA396829.1 1,3 AGAAGGGCTCTGAAGagaatctgg (alpha)/adducin 1 acgagg (alpha) eukaryotic translation BM142283. GTTAACCTCACCCTACAGATGA initiation factor 2-AGATAATAGAGCAAGAAAAAGA AATTGCAGAACTAAAGAAGCAG CTAAACCTCCTTTCTCAAGACA AAGGGGTGAGGGATGACGGAA AGGATGGGGGCGTGGGATGAA 113 alpha kinase 1 NM 014413. 2 1 1,3 AGTGGAC 114 KIAA0040 NM 014656.1 I CB050264.1 1,7 ATGGTTCCCAAGTGTgtgtaagtgt gtgta 115 Homocysteine- NM 014685.1 BG828243.1 0,8 TG CATCAG G G G CTTTTG TTC CA inducible, CCACCAAGTGCACAAGAGATA endoplasmic CCTGTGGTCTCTGCACCTGCT reticulum stress- CCAGCCCCTATTCACAACCAGT inducible, ubiquitin- TTCCAGCTGAAAACCAGCCTG like domain member CCAATCAGAATGCTGCTCCTCA 1 AGTG GTTG TTAATC CTG GAG C CAATCAAAATTTGCGGATGAAT GCACAAGGTGGCCCTATTGTG GAAGAAGATGATGAAATAAATC GAGATTGGTTGGATTGGACCT ATTCAGCAGCTACATTTTCTGT TTTTCTCAGTATCCTCTACTTCT ACTCCTCCCTGAGCAGATTCCT CATGGTCATGGGGGCCACCGT TGTTATGTACCT 116 Homocysteine- NM 014685.1 BG828243.1 0,7 GGAAAACATCTCAAGgcctgaagct inducible, gccca endoplasmic reticulum stress- inducible, ubiquitin- like domain member 117 Homocysteine- NM 014685.1 BG828243.1 0,8 GTACTACATGCAATAtttagcagcc inducible, actgc endoplasmic reticulum stress- inducible, ubiquitin- like domain member 1 118 lysosomal-associated NM 014713.2 AL039105.1 CTGATTCCATTCTTCTGTTACC protein transmembrane 4 alpha 0,8 GAC11111GACTTCGTCCTCAG TTGCCTGGTTGCTATTAGTTCT CTCACCTATTTGCCAAGAATCA AAGAATATCTGGATCAACTA 119 DAZ associated NM 014764.1 AU118651.1 0,8 AGCAGTACCTCCCTAAAGCATT protein 2 TTGAGGTAGGGGAGGTATCCA TTCATAAAATGAATGTGGG 120 ring finger protein 10 NM 014868.3 BU626650.1 1,5 CGAGAGCGCAGGATTGAGATA GAGGAGAACA 121 ring finger protein 10 NM 014868.3 BU626650.1 1,5 AGAAACAGGGCAAGTacccagaa gtccaca CAGAAGTCCACATTCCCCTCG AGAATCTACAGCAGTTTCCTGC CTTCAATTCTTATACCTGCTCC TCTGATTCTGCTTTGGGTCCCA CCAGCACCGAGGGCCATGGG GCCCTCTCCATTTCTCCTCTCA 122 ring finger protein 10 NM 014868.3 BF815780.1 1,3 GCAGAAGTC _ 123 ring finger protein 10 NM 014868. 3 BF815780.1 1,4 CAGGTTCCCATGCAGactttctgctg accc 124 ribosomal protein L4 BM846228. 0,7 TCAGTGAATTAG CAGgtcatcag ac NM 000968.2 1 tagtg 125 ribosomal protein L4 NM 000968.2 CB141160.1 0,7 GTCATCAGACTAGTGCTGAGT CTTGGGGTACTGGCAGAGCTG TGGCTCGAATTCCCAGAGTTC GAGGTGGTGGGACTCACCGCT CTGGCCAGGGTGCTTTTGGAA ACatgtgtcgtg 126 ribosomal protein L4 NM 000968.2 CD686462.1 0,6 GCGTGTGCTCGCCCACTGATA TCGGTGTACTCCGAAAAGGGG GAGTCATCTGGCAAAAATGTCA CTTTG C CTG CT G TATTCAAG G C TCCTATTCGACCAGATATTGTG AACTTTGTTCACACCAACTTGC GCAAAAACAACAGACAGCCCT ATGCTG 127 ribosomal protein L4 NM 000968.2 CD686462.1 0,7 TGGTCATGTCTAAAGgtcatcgtatt gagg 128 ribosomal protein L5 NM 000969.2 GATATCATTTGTCAG attg cttatg cc BE879402.1 0,8 Çgt 129 BAT2 domain NM 015172.1 AB029019.2 1,4 CACTTCCACCTTCAAccttagctcca containing 1 gttt 130 adenosine deaminase, NM 015841.1 AA096321.1 1,3 TCACACAGGACAGAGgaggcaga RNA-specific gcttctg 131 endoplasmic NM 015913.2 CA418006.1 0,8 TTCCCAAATTTCTACagcctctttcct reticulum thioredoxin Ctt superfamily member, 18 kDa 132 hypothetical protein NM 016127.3 AA576624.1 0,8 GGAGTCAAACACTGGatgcagaa MGC8721 attttgg 133 tetratricopeptide NM 017775.2 BE164618.1 1,5 TTTTGATGCACAGAGctaccaccac repeat domain 19 agtgc 134 hypothetical protein NM 017841.1 BI553945.1 0,8 TCI111IGCTAAAGAACATCTG FLJ20487 CAGCACATGACAGAAAAGCAG CTGAACCTCTATGACCGCCTG ATTAACGAGCCTAGTAATGACT GGGATATTT 135 hypothetical protein NM 017865.2 AK001070.1 1,6 ACCAGAGACCTTCCCACCTCC FLJ20531 AGGAGAGGAAGAGGGTGAGG AAGAAGAGGACAATGATGAGG ATGAAGAGGAGATGCTCAGTG ATGCCAGCTTATGGACCTACA G 136 ubiquitin B NM 018955.2 BE708511.1 1,4 CCCTGACCGGCAAGAcatcactctg gaggt 137 ubiquitin B NM 018955.2 AA206538.1 1,4 CAG GTCAAAATG CAGatcttcgtga aaacc 138 ubiquitin B NM 018955.2 AA206538.1 1,3 CAG G TCAAAAT G CAG a tcttcg tg a agacc 139 ubiquitin B NM 018955.2 AA340917.1 1,4 AGATCTTCGTGAAGACCCTGA CCGGCAAGACCATCACTCTGG AGGTGGAGCCCAGTGACACCA TCGAAAATGTGAAGGCCAAGA TCCAAGATAAAGAAGGCATCC CCCCCGACCAGCAGAGGCTCA TCTTTGCAGGCAAGCAGCTGG AAGATGGCCGCACTCTTTCTGA CTACAACATCCAGAAAGAGTC GACCCTGCACCTGGTCCTGCG CCTGAGGGGTGGCTGTTAATT CTTCAGTCATGGCATTCGCAGT G CCCAGTGATG G CATTACTCT G CACTATAG C CATTTG CCCCAA CTTAAGTTTAGAAATTACAAGT TTCAGTAATAGCTGAACCTGTT CAAAATGTTAATAAAGG 140 ubiquitin B NM 018955.2 BU661443.1 1,4 TCCCTGTGGGTGGACGTGGTT GGTGATTGGCAGGATCCT 141 ubiquitin B NM 018955.2 BU661443.1 1,5 GGTTGGCTTTGTTGGgtgagcttgtt tgtg 142 homoiogog, su sub family NM 018981.1 AF490904.1 1,3 CTTAGTGGCATGTTGgatggtcttgt om taat C, member 10 143 major NM 019111.2 AA505585.1 AGAGAGAAGATCACTGAAGAA histocompatibility ACTTCTGCTTTAATGACTTTAC complex, class II, DR AAAG CTG G CAATATTACAATC C alpha TTGACCTCAGTGAAAGCAGTCA 0,8 TCTTCAGCGTTTTCCAGCCCTA TAGCCACCCCAAGTGTGGTTAT GCCTCCTCGATTGCTCCGTACT CTAAC 144 major NM 019111.2 AA505585.1 0,8 ATCTAGCTGGCTTTCcctgtctattg class II, 9 complex, I, DR CCtt alpha 145 major NM 019111.2 CD686254.1 TCAGAGACAGTCTTCCTGCCC histocompatibility complex, class II, DR alpha 0,7 AGGGAAGACCACCTTTTCCGC AAGTTCCACTATCTCCCCTTCC TGCCCTCAACTGAGGACGTTTA CGACTGCAGGGTGGAGCACTG GGGCTTGGATGAGCCTCTTCT CAAGCACTGGGagtttgatgctccaag CCCtCtC 146 KIAA1191 protein NM 020444.2 ', BE313670.1 1,7 GCAGCAGGATCACAGgtggagga aggagaq 147 KIAA1191 protein NM 020444.2 BI254429.1 1,5 GCAGCAGGATCACAGaacagacc caggaaa 148 mesoderm induction NM 020948.1 AY124186.1 0,7 TTTCTCACACAGGCAtactccaaat early response 1 gCttC 149 Actinin, alpha 1 NM 001102.2 BG036045.1 ! 0,7 AACCACTTTGACCGGggagaagc agaattt 150 membrane-spanning NM_152851 GGAACTCTCTCTCTGATGCTGA 4-domains, subfamily NM 022349.2 .1 0,8 TTTGCACTCTGCTGGAATTCTG A, member 6A CCTAGCTGTGCTCACTGCTGT GCTGCGGTGGAAACAGGCTTA C myelin protein zero- TGCCCTGGCATTCTGGCAGAG AATCCTCACCAGTTCTCACCAA CCTTCCCCCCAGGCAAGGGCA GCTGCCAGCATGGTGCTCTGC CAGGACAGGTTTCCCTGAAGG AAGCTGCTCACACTGAGATGA GCCTCTCAGGGCAGGACCTCT TCCCAAGCCCTGCACACCCAC 151 like 1 NM 024569.2 AI693779.1 0,9 CCCTGCAGCCCTTTTGGCTC 152 mouse GCTTCAGTCCGCCGAgagcagtac mouse limbimb-bg ud a a ond NM 030915.1 W45195.1 1,1 cgtgtg heart Bene heterogeneous CTGACCCAGATAAAACAAAAAG TGGATTCTCTCCTGGAAAACCT 153 nuclear GGAAAAAATGGAAAAGGAACA ribonu cleoprotein C NM 031314.1 CF128443.1 1 1,2 GAGCAAACAAGCAG ib (C1/C2) polymerase (RNA) II NM 002694 GGTGGGGGTGGAAGCagccgca (DNA directed) polypeptide C C, , 154 33kDa NM 032940.1 .2 1,3 ggagcaag caspase 4, apoptosis- NM_001225 155 related cysteine NM 033306.1 .2 1,2 TGTTCCCTATGGCAGaaggcaacc protease acagaa 156 major NM 033554.2 BU621846.1 0,8 GTCGCTGAGAGCCTCTTCCTG histocompatibility CCCAGAACAGATTACAGCTTCC complex, class II, DP ACAAGTTCCATTACCTGACCTT alpha 1 TGTGCCCTCAGCAGAGGACTT CTATGACTGCAGGGTGGAGCA CTGGGGCTTGGACCAGCCGCT CCTCAAGCACTGGG 157 major NM 033554.2 BU621846.1 0,7 ACCCTCATCTGCCACATTGACA histocompatibility AGTTCTTCCCACCAGTGCTCAA II, CGTCACGTGG complex, class DP alpha 1 158 major NM 033554.2 BU621846.1 0,7 AAGGAGCCTGTGGAGctgggcca histocompatibility gcccaac complex, class II, DP alpha 1 159 FK506 binding NM 054014.1 NM 000801 GCTCCCTGTTCTTGGatctgccatg protein 1A, 12kDa .2 0,8 gaggq 160 chloride intracellular ! NM 001288.3 BG491600.1 CCAGGGACGGCCACTTCCTGG channel 1 TCCCCGACGCAACCATGGCTG AAGAACAACCGCAGGTCGAAT 0,8 TGTTCGTGAAG 161 chloride intracellular NM 001288.3 AV683308.1 GGGCAGCTCCCATTCctgctgtatg channel 1 0,7 gcact 162 vacuolar protein NM 080631.1 BX648347.1 ACCGGAGGGAAGAAGgtgtgctca sorting 41 (yeast) 1,4 gtgaag 163 zinc finger protein 1 BCO53361.1 1,5 jTGGAAAGGAGAAGGAATAAGA 384 NM 133476.2 CGGCAGGAGGAAGAGAGAGA 1 GAGG 164 chromosome 19 open NM 138774.2 AI375989.1 1,4 AC CTCATCTCG G CCAg tg ctg a cct reading frame 22 ggagg 165 nuclear receptor NM 000176.1 1 X03348.1 0, 8 TTCCTAAGGACGGTCTGAAGA 1 subfamily 3, group C, GCCAAGAGCTATTTGATGAAAT member 1 TAGAATGACCTACATCAAAGAG (glucoc ucocorticoid CTAGGAAAAGCCATTGTCAAGA receptor) GGGAAGGAAACTCCAGCCAGA ACTGGCAGCGGTTTTATCAACT GACAAAACTCTTG 166 Nucleosome NM 139207.1 BU620919. 1 0,8 AGTGATATGGTTCAG9aacacgat assembly protein 1- gaacct like 1 167 1 NM 139207.1 AU117948.1 0,8 ATGAATATTTTACAAATGAAGT Nucleosome GCTGACAAAGACATACAGGAT assembly protein 1- GAGGTCAGAA like 1 168 Nucleosome NM 139207.1 AU117948.1 0,8 GCTGGCCAGCCTATGagttttgtctt assembly protein 1-agaa like 1 169 Nucleosome NM 139207.1 AK122670.1 0,8 CCTGCCTAGGGTAGTTAAAAG assemblyprotein 1- ACGAGTGAATGCTCTCAAAAAC like 1 CTGCAAGTTAAATGTGCACAGA TAGAAGCCAAATTCTATGAGGA AGTTCACGATCTTGAAAGGAAG TATGCTGTTCTCTATCAG 170 TAF15 RNA NM 139215.1 BF812650.1 1,3 GCTATGGTGGGGACAGAGGAG polymerase II, TATA GCGGCTATGGAGGAGACCGAG box binding protein GAGGTGGCTATGGAGGAGATC (TBP)-associated GAGGTGGCTATGGAGGAGACC factor, 68kDa GAGGTGGAGGCTATGGTGGAG ACCGAGGAGGCTATGGAGGAG ATCGAGGAGGTTACGGAGGAG ATCGAGGAGGTTATGGAGGAG ATCGAGGAGGCTATGGAGGAG ACAGAAGCCGGGGGGGCTATG GAGGAGACCGTGGTGGTGGCA GTGGCTACGGTGGAGACCGAA GTGGAGGCTATGGAGGAGACA GGAGTGGTGGCGGCTATGGAG GAGACCGAGGTGGGGGCTAC GGAGGAGACCGAGGTGGCTAT GGAGG 171 TAF15 RNA NM 139215. 1 BF812650.1 1,3 GGGACAGAGGCGGCGgctatggt polymerase II, TATA ggggaca box binding protein (TBP)-associated factor, 68kDa GACCACAGCAATGACCAGCCC TCATTAGGGCCCTGGATGATTT TTG G T CTAATAAC G CATG CTAG TGTTGATGIIIIIIGGTCAGAG GGTATGAACAGGAAGAATTAAA TG CAG CAG G CTTTATTTTAAAT GCCGATTCACATTACTCTGTTC AAGCTGCGTTGAGATGTTAAAC TGGCTTACTATAGACTTCGTAA AAATGGCTCCAGAAAAGTAACA AACTGAAATCTTTGAGATCACA CAGGTTGGAAATATGTACATAA CTG CACAAG GTGTCAATTCTG C TCTACAGTGCAGTTTTAGTCAG iiiIAGTTGCATAGGTTTCCAT TGTATTTATAGTCTGTTTATGCT AAATCTGGCCAAAGATGAACAT TGTCCACCACTAAAATGCCTCT GCCACTTTGAATTCTGTGCTAA 172 DEAD (Asp-Glu- NM 001356.2 BE000596.1 0,8 I 1 I GTGGCCAGAATGCGGTG Ala-Asp) box polypeptide 3, X- linked ATCAAAACGCTCCATC I I I I I A CAGTGGCATAGGAAGACGGCA AAAATTTCCTAAAGTGCA 173 copine II NM 152727.4 AK094867.1 1,4 ACCCCTTCTG CTCAGgtgtgg atgg tattg 174 heat shock 70kDa NM 153201.1 BU731317.1 0,8 ACTCGTATCCCCAAGattcagaag protein 8 CttCtC 175 hypothetical protein NM 153233.1 BCO38360.1 1,4 ACTGGCCAGGACCTGgaagcaga FLJ36445 cacctct 176 dynein, cytoplasmic, NM 001378.1 U39575.1 1,4 GGAAGACAAAGAAGGagagattca intermediate agcagg polypeptide 2/dynein, cytoplasmic, intermediate 1 polypeptide 2 177 tropomyosin 3 NM 153649.1 BM006741. 1,3 TGATGAGAGTGAGAGgcagagac 1 ccgtgct 178 tropomyosin 3 NM 153649.1 BM006741. 1,3 TGATGAGAGTGAGAGag g atg ctg 1 gaccag 179 EST BE881352.1 - BE881352.1 1,3 TCAATATAAAACCCCcacctaccac acatt 180 EST AW368637.1 - AW368637. 1,4 CTTAAACTCCAGCACcatcatagcc 1 accat 181 eukaryotic translation NM 001402.4 AU146228.1 1,5 CACCAATGGAAGCAGtggacaag elongation factor 1 aaggctg alpha 1 182 eukaryotic translation NM 001402. 4 BU580573.1 0,8 CATCAAAGCAGTGGACAAGAA elongation factor 1 GGCTGCTGGAGCTGGCAAGGT alpha 1 CACCAAGTCTGCCCAGAAAGC TCAGAAGGCTAAATGAATATTA TCCCTAATACCTGCCACCCCAC TCTTAATCAGTGGTGGAAGAAC GGTCTCAGAACTGTTTGTTTCA ATTGGCCATTTAAGTTTAGTAG TAAAAGACTGGTTAATGATAAC AATGCATCGTAAAACCTTCAGA AGGAAAGGAGAATGTTTTGTG GACCACTTTGGTTTTC 1 I 1 1 G CGTGTG G CAGTTTTAAGTTAT TAG 11111 AAAATCAGTACTTTT TAATGGAAACAACTTGACCAAA AATTTGTCACAGAA I I I I GAGA CCCATTAAAAAAGTTAAATGAG 183 eukaryotic translation NM 001402.4 AA595862.1 0,8 TGCGGTGGGTGTCATCAAAGC elongation factor 1 AGTGGACAAGAAGGCTGCTGG alpha 1 AGCTGGCAAGGTCACCAAGTC TGCCCAGAAAGCTCAGAAGGC TAAATGAATATTATCCCTAATAC CTGCCACCCCACTCTTAATCAG TGGTGGAAGAACGGTCTCAGA ACTGTTTGTTTCAATTGGCCAT TTAAGTTTAGTAGTAAAAGACT GGTTAATGATAACAATGCATCG TAAAACCTTCAGAAGG 184 eukaryotic translation NM 001404.3 AA206367.1 0,7 CTGAGTCCAGATTGGCAGGTG elongation factor 1 GACTACGAGTCATACACATGG gamma CGGAAACTGGATCCTGGCAGC GAGGAGACCCAGACGCTGGTT 1 eukaryotic translation NM 001404.3 AA206367.1 0,6 CAGCATGTGGGCAAAGCCTTC 185 elongation factor 1 AATCAGGGCAAGATCTTCAAGT gamma GAACATCTCTTGCCATCACCTA G 186 eukaryotic translation NM 001404.3 BQ375267.1 0,8 GTTCTAGAGCCTTCTTTCCGCC elongation factor 1 AGGCCTTCCCAATACCAACCG gamma CTGGTTCCTCACCTGCATTAAC CAGCCCCAGTTCCGGGCTGTC TTGGGCGAAGTGAAACTGTGT GAGAAGATGGCCCAGTTTGAT Gctaaaaagtttgcagag 187 eukaryotic translation NM 001404.3 BU783548.1 0,6 ATTTAAGCGCAAGTACTCCAAT elongation factor 1 GAGGACACACTCTCTGTGGCA gamma CTGCCATATTTCTGGGAGCACT TTGATAAGGACGGCTGGTCCC TGTGGTACTCAGAGTATCGCTT CCCTGAAGAACTCACTCAGAC CTTCATGAGCTGCAATCTCATC ACTG 188 eukaryotic translation NM 001404.3 BE502067.1 0,6 TGGACAAGCTGAGGAAGAATG elongation factor 1 gamma CCTTCGCCAGTGTCATCCTTTT TGGAACCAACAATAGCAGCTC CATTTCTGGAGTCTGGGTCTTC ! CGAGGCCAG 189 eukaryotic translation NM 001404.3 BE502067.1 0,6 CAGGTGGACTACGAGTCATAC elongation factor 1 ACATGGCGGAAACTGGATCCT gamma GGCAGCGAGGAGACCCAGAC GCTGGTTCGAGAGTACTTTTCC TGGGAGGGGGCCTTCCAGCAT GTGGGCAAAGCCTTCAA 190 eukaryotic translation j GAGCTTGCCTTTCCGctgagtcca elongation factor 1 NM 001404.3 BE502067.1 0,6 gattgg gamma 191 eukaryotic translation NM 001404.3 Î AAGGAGGAGAAAAAGGCGGCT elongation factor 1 GCCCCTGCTCCTGAGGAGGAG gamma ATGGATGAATGTGAGCAGGCG CTGGCTGCTGAGCCCAAGGCC AAGGACCCCTTCGCTCACCTG BG533219.1 0,8 CCCAAGAG 192 eukaryotic translation NM 001404.3 BG615194.1 0,7 TTCCGCCAGGCCTTTCCCAATA elongation factor 1 CCAACCGCTGGTTCCTCACCT gamma GCATTAACCAGCCCCAGTTCC GGGCTGTCTTGGGCGAAGTGA AACTGTGTGAGAAGA 193 eukaryotic translation NM 001404.3 BG702200.1 0,8 GTTCACGGGAAGAGAAGCAGA elongation factor 1 AGCCCCAGGCTGAGCGGA gamma 194 O-linked N- AW002377. ; AAAGAATATCTAGCCctctgttcaac acetylglucosamine NM 181673.1 1 0,8 acca (GIcNAc) transferase 195 tyled N- AW002377. 0,8 CACGAAAAGTAGCCGctctggttga acetylglucosamine NM 181673.1 1 agctt ace (GIcNAc) transferase 196 WD ana AB028960.1 AK023778.1 1,3 CCAATTTCTTTGGCAgcaacgctca tetratricopeptide gtata repeats 1 197 actin, gamma 1 NM 001614.2 CD687776.1 1,2 CATGGAGAAGATCTGggcgcacc actggca 198 arachidonate 5- NM 001629.2 BX366320. 1 1,3 AGAGAGAACGCAGAGgccacgga lipoxygenase- agccctq activating protein 199 Homo sapiens AK026373.1 AV725084.1 1,2 , ATTTCAACAGCTGAGgaaggtgtctt cDNA: FLJ22720 fis, gctg clone HS114320 Homo sapiens AK026373.1 BQ276346.1 i TTTGGCAGGAAGGTGTCTTGC 200 cDNA: FLJ22720 fis, 0,8 TGCAGGTAACTAATGAAGAAGT clone HSI14320 GGTCAACCACAGAGTCTTCAA GAAATAAGAAATTCTGTACCAT CTGAAAGTAGTTCTTGTTGGTG CCTTCATTTAAAAAGCACTCTT TAAAATAAAAGGGAAATGTTTT CTGATAAAA 201 DKFZp586K2322 AL080113.1 AV751235.1 0,8 TTTATTTTCAAATGCAGTGTAG (NM_006386 or AGCTAGATTAAAAGCAACTCTT NM 030881) TG CCACCTACTCTG C CCTTTTG GCAAAGTTACCTTGAACAAAGA ATCTTAAG G GTTTATTAAGAAC TCTTTAIIIICTTCATACCCTGT TCTCTGCAGTGCTTTCTAACAG CTTCTGGGTGCAGA I I I I CTTC G G CATCCTTTTG CACTCAG CTT ATTACAG GTAG GTAGTG CTTAA GAAAAGTCATGGAGGACTAAA GCCTAAGTCCTTTTCACTTTTC CTCCATCTGAAGGTAGGTGAG TTCATCCTCTTCATAGTAATGC TGIIIIACCAAGACTTTATAGC AGATG GACCCAGAAAGAATTTT CTGCTATTGTGTTCACTACAAC AGGATAGGGACATCAGACAGC CCCAGAAACCCCTTCCAGATCT GATATGGGACTATTAA I I I I I AT GCTGTTAATTGGTATTCATTCA CAATGCAGTTGAAGGGGGAAG GCTCCACTGCATTCTTTG 202 calmodulin 2 ATTGCAAAACGGGTGtattatccag (phosphorylase NM 001743.3 BF701704.1 0,8 •tact kinase, delta) 203 calpain, small subunit Î 1/calpain, small CCTTTGAGGCAGCAGatgaaagtg subunit 1 NM 001749.1 BE907701. 1 1,4 ggaaca 204 Cysteinyl-tRNA AAACAGGAACAAGAAgcagcaaa synthetase NM 001751.3 AK125503.1 1,5 gctggcc 205 CD97 antigen NM_078481 ATACCGTCTGTGAAGatgtggacg NM 001784.2 .1 1,3 agtgca 206 CD97 antigen NM 001784.2 BIO28545.1 1,2 CAGGCTGGAAGCCCAGACACG GAATCCCGGATAACCAAAAGG ACACTGTCTGTGAAG 207 Homo sapiens BCO34141.1 AV688287.1 1,4 CAAGGGACCAAGGTGgagcagttg immunoglobulin aaatct kappa constant, mRNA 208 ferritin, heavy NM 002032.1 BG248923.1 0,6 TGTCTCTGGGGATCCCTAGTAT polypeptide 1 AACACATGCA 209 Homo sapiens BCO53635.1 AI806846.1 1,5 CCGGCTGGTCAAAGTgtgggtgct chromosome 4 open Î ggcagc reading frame 9 210 Homo sapiens T-cell K02885.1 U85050.1 0,8 GGTTCGGGGACCAGGttaaccgtt receptor active beta- gtagag chais V-D-- L2-C-beta- 1 (TC (TCRB) mRNA 211 Homo sapiens T-cell K02885.1 AF043180.1 0,8 GAGGACCTGAACAAGGTGTTC receptor active beta- CCACCCGAGGTCGCTGTGTTT drain -- GAGCCATCAGAAGCAGAGATC 1.2-C-bet beta-1 (TC (TCRB) TCCCACACCCAAAAGGCCACA mRNA CTGGTGTGCCTGGCCACAGGT ATCTTCCCTGACCACGTGGAG CTGAGCTGGTGGGTGAATGGG AAGGAGGTGCACAGTGGGGTC AGCACGGACCCGCAGCCCCTC AAGGAGCAGCCCGCCCTCAAT GACTCCAGATACTGCCTGAGC AGCCGCCTGAGGGTCTCGGCC ACCTTCTGGCAGAACCCCCGC AACCACTTCCGCTGTCAAGTCC AGTTCTACGGGCTCTCGGAGA ATGACGAGTGGACCCAGGATA GGGCCAAACCCGTCACCCAGA TCGTCAGCGCCGAG 212 Homo sapiens T-cell K02885.1 AF317590.1 1,2 TCTGTGCCAGCAGCCttggacaga receptor active beta- tttatg chairs V-D-J-beta- 1.2-C-beta-1 (TCRB) mRNA 213 Homo sapiens T-cell K02885.1 ATATCTCTGCAGCGTcgggggtca receptor active beta- AF327908.1 0,7 atctqg chairs - 1.2-C-bet beta-1 (TC RB) mRNA 214 Homo sapiens T-cell CTGCAGTGCTAGAGAtccggggtc receptor active beta- K02885.1 AF327022.1 0,7 ccatca chairs V-D-J-beta- 1.2-C-beta-1 (TCRB) mRNA 215 Human T-cell CCAGATCGTCAGCGCcgaggcctg receptor active beta- M12886.1 BCO36926.1 0,8 gggtag chain mRNA 216 Human myosin-IXb CCTCTGGCCTCACAGtgtcccagg mRNA U42391.1 BF948234.1 1,3 agcaca 217 glutathione TGGAACTTCACCAAGttcctcatc9 a peop NM 002085.1 BI254515.1 0,8 caag (Phospholi1ipi d hydroperoxidase) 218 Poly(A) binding protein, cytoplasmic TTCCAACTGTTTAAAattgatcaggg 1 NM 002568.1 AA033548.1 0,9 acca 219 CAGTATGGTGAGGAGgtgtgagag prohibitin NM 002634.2 BI261762.1 1,4 ggtcca 220 BMO 14639. TCAGAGTGGAAGCAGgtgagaatg prohibitin NM 002634.2 1 1,6 gagggg 221 BMO 14639. CTTTGGGCGAGGAGAgtgtgagag prohibitin NM 002634.2 1 1,4 ggtcca 222 GCGAGGAGAGTGCGTgtgtgaga prohibitin NM 002634.2 BE536369.1 1,4 gggtcca 223 TTCCAGAAGCCAGAGAGACCA AGTGTTATGTAAGAAGTAGTGT spermidine/spermine CGGCTGTGTAGAACCACTGAC NI-acetyltransferase NM 002970.1 CD108391.1 0,8 TACACAGGCCGAAGTTACTGA GAACTTGGACAGAAAAAATAGC CAGCAAGTGTTCAAACTACTGA GG TTAGATATG CTG CACTTAAGAATACTAG G G C AGGTT 224 eted protein secreted secreted protein NM 003118.1 BG424815.1 1,4 CAGGAGACACCCCG (osteonectin) 225 or NM 003295.1 BG284235.1 0,6 TTCTTTATTGGTGAAAACATGA ontr ied tum- ATC CAGATG G CATG GTTG CTCT controlled t ATTG GACTACCGTGAG GATG G protein 1 TGTGACCCCATATATGATTTTC TTTAAGGATGGTTTA 226 translationally- NM 003295.1 BG284235.1 0,7 GAAATGGAAAAATGTtaacaaatgt controlled tumor ggcaa protein 1 227 translationally- NM 003295.1 CD641954.1 0,7 TTATTTTGGATCTATCACCTGT controlled tumor CATCATAACTG G CTTCTG CTTG protein 1 TCATCCACACAACACCAGGACT TAAGACAAATGGGACTGATGTC ATCTTGAGCTCTTCATTTATTTT This confirms that the analysis of the expression of these 18 genes is a good tool to discriminate patients with a high risk of relapse of patients with a low risk of relapse. The use of restricted gene panel is particularly suitable for obtaining a tool for detection and prognosis. Indeed, the analysis of the expression of a dozen genes does not require custom manufacture of a DNA chip, and can be implemented directly by PCR or NASBA techniques, or low density chip, which presents an important economic asset and a simplified implementation. Table A: Selected samples for early breast cancer group for DATAS # 1 (PBMN) Stage Quantity Clinic Used I (ug) 10.0 Sample D104 D105 I 5.0 D106 I 2.0 D111 I 2.0 D114 I 3.0 D118 I 5.0 D123 I 6.0 D117 II 2.0 D121 II 5.0 D130 II 10.0 Table B: Samples selected for the control group for DATAS No. 1 (PBMN) D1 Sample Stage Quantity Clinical used DFS> 5 (μg) 1.66 D3 DFS> 5 1.66 D9 DFS> 5 1.66 D10 DFS> 5 1.66 D17 DFS> 5 1.66 D28 DFS> 5 1.66 D36 DFS> 5 1.66 D63 DFS> 5 1.66 D75 DFS> 5 1.66 D76 DFS> 5 1.66 D11 DFS <5 1.66 D14 DFS <5 1.66 D15 DFS <5 1.66 D16 DFS <5 1.66 D27 DFS <5 1.66 D31 DFS <5 1.66 D41 DFS <5 1.66 D66 DFS <5 1.66 D102 DFS <5 1.66 D103 DFS <5 1.66 D67 Benin 1.66 D69 Benin 1.66 D70 Benin 1.66 10 D71 Benin 1.66 D72 Benin 1.66 D73 Benin 1.66 D74 Benin 1.66 D79 Benin 1. 66 D115 Benin 1.66 D125 Benin 1.66 Table C: Samples selected for the late breast cancer group for the DATAS n 2 (PBMO) Sample Stage Quantity D55 Clinical used (μg) S III _ 3.33 D59 S III 3.33 D90 SIII 3.33 D134 S III 3.33 D168 S III 3.33 D178 S III 3.33 D126 S III 3.33 D132 S III 3.33 D91 S IV 3.33 D83 S IV 3.33 D101 S IV 3.33 D138 S IV 3.33 D161 S IV 3.33 D170 S IV 3.33 D99 S IV 3.33 Table D: Samples selected for the control group for DATAS n 2 (PBMO) Sample Stage Quantity Dl Clinic used DFS> 5 (ug) 1.66 D3 DFS> 5 1.66 D9 DFS> 5 1.66 D10 DFS> 5 1.66 D17 DFS> 5 1.66 D28 DFS> 5 1.66 D36 DFS> 5 1.66 D63 DFS> 5 1.66 D75 DFS> 5 1.66 D76 DFS> 5 1.66 D11 DFS <5 1.66 D14 DFS <5 1.66 D15 DFS <5 1.66 D16 DFS <5 1.66 D27 DFS <5 1.66 D31 DFS <5 1.66 D41 DFS <5 1.66 D66 DFS <5 1.66 D102 DFS <5 1.66 D103 DFS <5 1.66 D67 Benin 1.66 D69 Benin 1.66 D70 Benin 1.66 D71 Benin 1.66 D72 Benin 1.66 D73 Benin 1.66 D74 Benin 1.66 D79 Benin 1.66 D115 Benin 1.66 D125 Benin 1.66 Table E: Selected samples for the all-stage breast cancer group for 5 DATAS n 3 (PBMP). Sample Stage Quantity D91 Clinic used SIV (ug) 1.3 D92 SIV 1.3 D101 SIV 1.3 D138 SIV 1.3 D161 SIV 1.3 D170 SIV 1.3 D99 SIV 1.3 D195 SIV 1.3 D197 SIV 1,3 D205 S IV 1,3 D55 SIII 1,3 D59 S III 1,3 D90 SIII 1,3 D134 S III 1,3 D168 S III 1,3 D178 S III 1,3 D126 S III 1,3 D132 S III 1.3 D135 S III 1.3 D185 S III 1.3 D108 S II 1.3 D109 S II 1.3 D148 S II 1.3 D156 S II 1.3 D160 S II 1.3 D162 S II 1,3 D163 S II 1,3 D166 S II 1,3 D167 S II 1,3 D172 S II 1,3 D112 IF 1,3 D120 IF 1,3 D122 IF 1,3 D127 IF 1,3 D131 IF 1 , 3 D145 IF 1.3 D147 IF 1.3 D153 IF 1.3 D173 IF 1.3 D176 IF 1.3 Table F: Samples selected for the control group for DATAS # 3 (PBMP) Sample Stage Quantity Dl Clinic used DFS> 5 (ug) 1.66 D3 DFS> 5 1.66 D9 DFS> 5 1.66 D10 DFS> 5 1.66 D17 DFS> 5 1.66 D28 DFS> 5 1.66 D36 DFS> 5 1.66 D63 DFS> 5 1.66 D75 DFS> 5 1.66 D76 DFS> 5 1.66 D11 DFS <5 1.66 D14 DFS <5 1.66 D15 DFS <5 1.66 D16 DFS <5 1.66 D27 DFS <5 1.66 D31 DFS <5 1.66 D41 DFS <5 1.66 D66 DFS <5 1.66 D102 DFS <5 1.66 D103 DFS <5 1. 66 D67 Benin 1. 66 D69 Benin 1. 66 D70 Benin 1. 66 Benin 1. 66 D71 Benin 1. 66 D72 Benin 1. 66 D73 Benin 1. 66 D74 Benin 1. 66 D79 Benin 1. 66 D115 Benin 1. 66 D125 Table 1 List of 318 transcripts differentially expressed in the development of breast cancer. SEQ Description of the N Genbank N C UII vs. Sequence Target ID N sequence (reference) Genbank Healthy (variant) 1 lysozyme NM 000239. BE720647. 1 0.8 ATTTATCCTGCAGTGctttgctgcaa gata 2 cDNA AL832453. 2 BU634341. 1 0.8 AAATAAAATAT CA G G ATAT G G DKFZ TCCCCCTTGAGACTGAAGGAA DKFZp451G151 CTGAAGATTTTAAACCTTAGTA (Leucine-rich repeat AGAACCACATTTCATCCCTATC kinase 2) AGAGAACTTTCTTGAGGCTTGT CCTAAAGTGGAGAGTTTCAGT GCCA 3 cDNA AL832453. 2 BU634341. 1 0.8 GAATGAATTTTCTTGctgctatgcctt DKFZp451 G 151 tCt (leucine-rich repeat kinase 2) 4 cDNA AL832878. 1 AI223156. 1 0.8 TGCCAAGGAAGACCCCCTCCT DKFZp667I093 GACCCCTGTTCCGGCTTCAGA (Guanine nucleotide AAACCCGTTTAGGGAGAAGAA binding protein, gamma 2) G 1 1 1 1 1 CTGTGCCATCCTTTAA cDNA AL832878. 1 AI223156. 1 0.7 TGCAGATTTGATGGCctactgtgaa DKFZp667I093 gcaca (guanine nucleotide binding protein, gamma 2) 6 chemokine (C-C NM 001838. 2 BI910219. January 1 GATGAGGTCACGGACGATTAC motif) receptor 7 G 0.8 ATCGGAGACAACACCACAGTG GACTACACTTTGTTCGAGTCTT TGTGCTCCAAGAAGGACGTGC GAACTTTAAAG CCTG GTTCCT CCCTATCATGTACTCCATCATT TGTTTCGTGGGCCTACTGGGC AATGGGCTGGTCGTGTTGACC TATATCTATTTCAAGAGGCTCA AGACCATGACCGATACCTACCT GCTCAACCTGGCGGTGGCAGA CATCCTCTTCCTCCTGACCCTT CCCTTCTGGGCCTACAGCGCG GCCAAGTCCTGGGTCTTCGGT GTCCACTTTTGCAAGCTCATCT TTGCCATCTACAAGATGAGCTT CTTCAGTG G CATG CT CTCCTACTT CTTTG CATCAG CATTGACCG ACGTGGCCATCGTCCAGGCTG TCTCAGCTCACCGCCACCGTG CCCGCGTCCTTCTCATCAGCA AGCTGTCCTGTGTGGGCATCT GGATACTAGCCACAGTGCTCT CCATCCC 7 BC009917 Homo sapiens. 1 BCO28225. 1 1.4 ATGAAGAAAAACAAAgtgcacaga hypothetical protein gacccg DKFZp76I A052 8 cDNA clone BCO38965. 1 N70893. 1 1.2 AGGACAGCCCTGGGCagagatga IMAGE: 3920936 ggcaggg (High density lipoprotein binding protein (vigilin)) 9 cDNA clone BC040042. 1 BI001496. 1 1,3 TAATGCCAAGACAAAgccacggga IMAGE: 5207605 ggagca (Immunoglobulin heavy constant gamma 2 (G2m marker)) 10 cDNA clone BC040042. 1 BF841656. 1 1.3 CACAGGTGTACACCCTGCCCC IMAGE: 5207605 CATCCCGGGAGGAGATGACCA (Immunoglobulin heavy constant AGAACCAGGTCAGCCTGACCT GCCTGGTCAAAGGCTTCTACC gamma 2 (G2M CCAGCGACATCGCCGTGGAGT marker)) GGGAGAGCAATGGGCAGCCG GAGAACAACTACAAGACCACA CCTCCCATGCTGGACTCCGAC GGCTCCTTCTTCCTCTACAGCA AGCTCACCGTGGACAAGAGCA GGTGGCAGCAGGGGAACGTCT TCTCATGCTCCGTGATGCATGA GGCTCTGCACAACCACTACAC GCAGAAGAGCCTCTCCCTGTC TCCGGGTAAATGAGTGCCACG GCCGGCAAGCCCCCGCTCCCC AGGCTCTCGGGGTCGCGTGAG GATGCTTGGCACGTACCCCGT GTACATACTTCCCAGGCACCC AG CATG GAAATAAAG CACCCA GCGCT 11 BC048206 Microtubule. 1 AW015234. 1.3 TCTCTCTTTCCAATCTTACGCC associated ATG GC 1 C CATCAGTTCATTTCAG monoxygenase, calponin and CTTCCAGTGCTACACCCACTTC d L IM TTG G CTGACACACTTCTG CTCT domain containing 2 GTGACTG GTTTTCTTG AAG CCA ATTTTCAAAGAGTGGTACTAAC CCCCAACCCGCTTTCCGCACC CCGTCCTCTCCGCCAGCAGTA CTGGTTGCACTAACTGTGAGT GTCTTGCATACTGATGGACTCA TTTGGTGGCATGGTTGGCTAA CAGCATGGCGGGGGGTGTTCA GCTTGAGACCCATGCCTGTGT TCATTTCCCATGGAGCTGGCA GCCTGGTCTACCCCAAGTGCA TGCCCCGCCTCTCCTCTCTCC CTTGGGTCTGCCTGCGTGCAT GCTTCTCCAGTTGCGTCTGCG AAGCTACCTACTTTCTTGGGAG GGTCGACCTTGATCATGAAACA ATACCATGAGGGGGCCTCTGT CACCTTTGAAAAGAACACTTTT TGAGCAGCCTCAAAAAGCTCAT ACATAC 12 Microtubule BC048206. 1 AF052170. 1 1.4 TGGGAGGGTCGACCTTGATCA TGAAACAATACCATGAGGGGG associated monoxygenase, calponin and LIM CCTCTGTCACCTTTGAAAAGAA domain containing 2 iii 1 I CAC AG GAGCAGCCTCAAAA CTCATACATAC CAG CG CCTT CTTAAATTGGCTCTAATGTAAA GATTGTTAATGTCATTTATCAAA ACCATAG GTGATTATTTG GAG G GATTTAAAAAACTTAATTACTCT CAGGCCTCATCCCAAGCTTGA CACATGCTCTGTAGGTTGAACA CATAATCACAAATATTCTAGCA AATGCTGCCTTGGTTGCAGCC TGCACTGTAGACCCAAGGGTT TTG CTG CTC TG G CTCTTCTTAT CCTTGGCTCATAAAGCCCCAG ATGATGCCAGAGCTTCAATTAG AGCCATCATCATCCCAGGCAG GGATATCTTTGAGAAATGACTC AGTTCAGCCCCAGGCCCCTGT GACTCTGCTTAAAGCACACATT TCTGCTGACTCTTGTACCTGGG GCAGCAGGATAATCACCAACA C 13 cellular homolog of NM 001997. 2 NM 001997 0.5 GTCGCCCAGATCAAGgctcatgtag the fox sequence in. 2 cctca the Finkel-Biskis-Reilly murine sarcoma virus 14 cellular homolog of NM 001997. 2 W17004. 1 0.6 GGCCGCATGCTTGGAaggtaaagt the fox sequence in the Finkel-Biskis-Reilly murine sarcoma virus 15 cellular homolog of NM 001997. 2 AU098396. 1 0.6 GTCGCCCAGAGCAAGgctcatgta the fox sequence in gcctca the Finkel-Biskis-Reilly murine sarcoma virus 16 cellular homolog of NM 001997. 2 AA063591. 1 0.7 TGACCGGCCAGGAAAcggtcgccc the fox sequence in agatca the Finkel-Biskis-Reilly murine sarcoma virus 17 cellular homolog of NM 001997. 2 AA094898. 1 0.6 CAGGCCGCATGCTTGaggtaaagt the fox sequence in ccatgg the Finkel-Biskis-Reilly murine sarcoma virus 18 cellular homolog of NM 001997. 2 AA187006. 1 0.7 AGGCCGCATGCTTGGaggtaaagt the fox sequence in ccatgq the Finkel-Biskis-Reilly murine sarcoma virus 19 cellular homolog of NM 001997. 2 AA225636. 1 0.6 GGCCAGGAAACGGTCgcccagat the fox sequence in the Finkel-Biskis-Reilly murine sarcoma virus 20 cellular homolog of NM 001997. 2 BM820687. 0.5 GGTCGCCCAGATCAAgctcatgtag the fox sequence in 1 cctca the Finkel-Biskis- mouse Reilly sarcoma virus 21 cellular homolog of NM 001997. 2 AA491544. 1 0.7 GCCGCATGCTTGGAGgtaatgtcc the fox sequence in atggtt Finkel-Biskis-Reilly murine sarcoma virus 22 cellular homolog of NM 001997. 2 AV743892. 1 0.5 GTCGCCCAGATCAAGgctctgtagc the fox sequence in ctcac the Finkel-Biskis- Reilly murine sarcoma virus 23 cellular homolog of NM 001997. 2 BU603086. 1 0.6 GAAGTAGCAGGCCGCatgcttgga the fox sequence in ggtaaa the Finkel-Biskis- Murine Reilly sarcoma virus 24 cellular homolog of NM 001997. 2 BF218408. 1 0.6 AGGCCGATGCTTGGAggtaaagtc the fox sequence in the Finkel-Biskis-Reilly murine sarcoma virus 25 cellular homolog of NM 001997. 2 AI499403. 1 0.7 CCCGCATGCTTGGAGgtaaagtcc the fox sequence in atggtt Finkel-Biskis-murine Reilly sarcoma virus 26 cellular homolog of NM 001997. 2 AW795076. 0.6 CGGTCGCCCAGATCAaggctcatg the fox sequence in 1 tagcct the Finkel-Biskis- Reilly murine sarcoma virus 27 cellular homolog of NM 001997. 2 D52122. 1 0.6 GGCCGCATGCTTGGggtaaagtcc the fox sequence in atggtt the Finkel-Biskis-Reilly murine sarcoma virus 28 cellular homolog of NM 001997. BE535673. 1 0.6 GCCGCATGCTTGGAGGtaacagtc the fox sequence in the Finkel-Biskis- Reis murine sarcoma virus 29 re (cosphate! NM 002355. 2 CA430891. 1 0.8 CGACACACCCTAGCGgacaatttta receptor ceptor (cation accct dependent) mitogen-activated NM 002419. 2 AK090614. 1 1,2 TTCCATTCCATGCAGgaaggctgg protein kinase kinase kinase aagcgc kinase 11 31 proteoglycan 1, NM 002727. 1 BQ051861. 1 0.8 AATCCTCAGTTCAAGgttatcctacg secretory granule caga 32 proteoglycan 1, NM 002727. 1 BQ051861. 1 0.8 GACTGACC 1 1 1 1 1 CCaaagacgag secretory granule aatcca 33 protein kinase C NM 002743. 1 BU631834. 1 1.5 CTCGCAGAAACCCAAccgctccac substrate 80K-H caccgt 34 SEC14-like 1 (S. CD366399. 1 1.6 GTAGGTAGGTTCGTAgtagggttcg cerevisiae) NM 003003. 1 taggt 35 SEC14-like 1 (S. NM 003003. 1 AK130317. 1 1.3 TAGGGCTAGTAGGTAGGGCTA cerevisiae) GTAGGTAGGGCTAGTAGGTAG GGCTAGTAGGTAGGGCTAGTA GGTAGGGCTAGTAGGTAGGGC TAGTAGGTAGGGTTCGTAGGT AGGGTTCGTAGGTAGGGTTCG TAG GTAG G GTTAGTAG CG CGT CTGTGCTGCTTCCACCTGGTG CTTCCTGTTCCCAAATCACAAG GGCCTGAAGGTGGTCCCTGCT TTCTCTTTCTCTTTCTCTGTGTC TCAGATG G CGATTTTG CTGACA GCTGCCAAGAAAATGCTTCACT CAACAGTCCTCATGTGCCCAG AGATGTTTATAGAACTGTTTGA ATTGCAGCCATCCCCTGCCCC CTCCCAGGCTGAAGATCTGTT C 1 1 1 1 1 AAGTTGATTCGGGAGT GGCATTCTTTTATACCCAAAGA CTGTAGTGCATCTTGAAGAGCT CAAAGCACATGACCGCACAAA TGCTTACAGGGTTTCCTCCCGA GTAATCCAATCTCACTCCCCTT GTAAGG 36 SEC14-like 1 ( S. NM 003003. 1 AK130317. 1 TAGGGTTCGTAGGTAGGGCTA I cerevisiae) GTAGGTAGGGTTAGTAGGTAG GGCTAGTAGGTAGGGCTAGTA GGTAGGGTTAGTAGGTAGGGT TCGTAGGTAGGGCTGGTAGGT AGGGTTAGTAGGTAGGGCTAG TAGGTAGGGCTAGTAGGTAGG G CTAG TAG G TAG G TAG GTTAG GTAGGGCTAGTAGGTAGGGCT AGTAGGTAGGGTTAGTAGGTA 1.4 GGGTTCG SEC14-like 1 (S. NM 003003. 1 AK130317. 1 AGGTAGGGTTCGTAGgtagggcta 37 cerevisiae) 1.5 gtaggt cysteine-rich acidic NM 003118. 1 BG325726. 1 GTGAAGAAGATCCATGAGAAT secreted protein 1,4 GAGAAGCGCCTGGAG 38 (osteonectin) cysteine-rich NM 003118. 1 BG325726. 1 9aC999 protein secreted secreted protein 1,3 tacctc 39 (osteonectin) and acid protein NM 003118. 1, AA325849. 1 CATAGGAGCATTGCACCACCCG secreted secreted protein 40 Î (osteonectin) I 1 1 1 1 CGAGACCTGTGACCTG 1,3 GACAAT 41 cysteine-rich acidic NM 003118. 1 AA325849. 1 GAGAATGAGAAGCGCCTGGAG secreted protein GCAGGAGACCACCCCGTGGAG (osteonectin) CTGCTGGCCCGGGACTTCGAGAAGAACTATAACATGTACATCT TCCCTGTACACTGGCAGTTCG1,3 GCCAGCTGGACCA 42 cysteine-rich acidic NM 003118. 1 AA325849. 1 ng GCACCCCATTGACGGgtacctctcc secreted protein cacac (osteonectin) 43 nuclear factor NM 003204. 1 BM973053. 1.5 CGGGTCAGTGTACAGgaagaggc (erythroid-derived 2) - 1 aggcact like 1 44 nuclear factor NM 003204. 1 BM973053. 1.5 TGCTGTGAGGCAGAGgaatgatgg (erythroid-derived 2) - 1 agaatc like 1 synuclein, alpha (non-NM 000345. 2 NM007308 1.2 TACGAACCTGAAGCCTAAGAAA A4 component of. 1 TATCTTTGCTCCCAGTTTCTTG amyloid precursor) AGATCTGCTGACAGATGTTCCA TCCTGTACAAGTGCTCAGTTCC AATGTGCCCAGTCATGACATTT CTCAAAGTTTTTACAGTGTATC TCGAAGTCTTCCATCAGCAGTG ATTGAAGTATCTGTACCTGCCC CCACTCAGCATTTCGGTGCTTC CCTTTCACTGAAGTGAATACAT GGTAGCAGGGTCTTTGTGTGC TGTGGA I I I I GTGGCTTCAATC TACGATGTTAAAACAAATTAAA AACACCTAAGTGACTACCACTT ATTTCTAAATCCTCACTAIliii TTGTTGCTGTTGTTCAGAAGTT GTTAGTGATTTGCTATCATATA TTATAAGAIIIii AGGTGTCTTT TAATGATACTGTCTAAGAATAA TGACGTATTGTGAAATTTGTTA ATATATATAATACTTAAAAATAT GTGAGCATGAAACTATGCACCT ATAAATACTAAATA 46 synuclein, alpha (non NM 000345. 2 NM_007308 1.4 CCACAGGAAGGAATTCTGGAA A4 component of. 1 GATATGCCTGTGGATCCTGAC amyloid precursor) AATGAGGCTTAT synuclein, alpha (non-NM 000345. 2 NM_007308 1.5 GACCAGTTGGGCAAGaatgaaga A4 component of. 1 aggagcc amyloid precursor) synuclein, alpha (non-NM 000345. 2 L36674. 1,. 2 L36674. 1 1,3 GTGTTCTCTATGTAGgctccaaaac A4 component of caagg amyloid precursor) 50 translationally NM 003295. 1 AA223997. 1 0.5 AAAACCTTTTATGACAG GG G CT controlled tumor GCAGAACAAATCAAGCACATC protein 1 CTTGCTAATTTCAAAAACTACC AG 51 translationally- NM 003295. 1 CA848049. 1 0.7 AGAtcgcggacgggttgtgcctggaggTG controlled tumor G protein 1 52 translationally- NM 003295. 1 CA848049. 1 0.5 TCCGACATCTACAAGatccgggag controlled tumor atcgcg protein 1 53 translational NM 003295. 1 BCO22436. 1 0.6 CCGACATCTACAAGATCCGGG controiled tumor AGATCGCGGACGGGTTGTGCC protein 1 TG 54 translationally- NM 003295. 1 BC040008. 1 0.5 GCGCCGCTCCGGCTGCACCG controlled tumor CG CTCG CTCCGAGTTTCAG G C protein 1 TCGTGCTAAGCTAGCGCCGTC GTCGTCTCCCTTCAGTCGCCAT CATGATTATCTACC 55 TYRO protein NM 003332. 1 BF092099. 1 0.8 GAGGGGCTGCGGAGGcagcgac tyrosine kinase ccggaaac binding protein 56 triosephosphate NM 000365. 3 BI226906. 1 1,3 CATGCTCTGGCAGAGgatggctga isomerase 1 agtcca 57 transgelin 2 NM 003564. 1 AA428309. 1 1,3 ATTAACACCACTGACtgtg ctcacc acaca 58 uroporphyrinogen NM 000374. 2 BQ008745. 1 1.5 GTGTGCCGCTGATTGtggaccctg decarboxylase atgaca 59 cold shock domain NM 003651. 3 BC009744. 1 1,3 GAGGAGGAAGGGAGCGGCAG protein A CAGTGAAGGATTTGACCCCCC TG CCACTGATAG G CAGTTCTCT GGGGCCCGGAATCAGCTGCG CCGCCCCCAGTATCGCCCTCA GTACCGGCAGCGGCGGTTCCC GCCTTACCACGTGGGACAGAC CTTTGACCGTCGCTCACGGGT CTTACCCCAT 60 cold shock domain NM 003651. 3 BC009744. 1 1.5 CCCAACAGAATACAGgctggtgag protein A attgga 61 solute carrier family NM 003982. 2 AW798524. 0.8 TCTCTGCTCTTCAATggtatcatgggg 7 (cationic amino acid absorbing agent, y + system), member 7 62 annexin A2 NM 004039. 1 AV709641. 1 0.9 GACGCTTCTGAGCTaaaagcttcca tgaag 63 annexin A2 NM 004039. 1 BF244428. 1 1.6 AGCTTGGAGGGTGATgtgtggatg aggtca 64 beta-2-microglobulin NM 004048. 2 AV734235. 1 0.7 ATTTG GATTG GATGAATTCCAA ATTCTGCTTGCTTGCIiit1AAT ATTGATATGCTTATACACTTAC ACTTTAT G CACAAAATG TAG G G TTATAATAATGTTAACATGGAG ATGATCTTCTTTATAATTCTACT TTGAGTGCTGTCTCCATGTTTG ATGTATCTGAG CAG GTTG CTCC ACAG GTAG CTCTAG GAGG G CT GGCAAC 65 beta-2-microglobulin NM 004048. 2 BF912731. 1 0.8 AGATAGTTAAGTGGGatcgagaca tgtaag 66 calreticulin NM 004343. 2 CA306742. 1 1.4 GACTCCAAG CCTGAGg cag cag a gaaacaa 67 CD74 antigen NM004355. 1 BQ029721. 1 1.5 ACACCCAGACCCCAGgaagagcc aatgttt 68 HLA-B associated NM 004640. 3 BM980603. 1,4 GCCCCAGGAGGAGAGcagtttaaa transcript 1 1 gatttt 69 SEC24 related gene NM 004922. 1 AW449995. 1.8 GCAGCTGTCTGGAATgCagaggC family, member C (S. 1 agctgga cerevisiae) 70 actinin, alpha 4 NM 004924. 3 CB051741. 1 0.7 CCATCGGGGCAGAAGaacttcatc acagct 71 hemoglobin, alpha NM 000517. 3 H78334. 1, 1.4 CGGTCAACTTTCAAGctccttaagc 1/2 cactg 72 hemoglobin, alpha NM 000517. 3 H55830. 1 1.4 TCAAGGCCGCCTGGGgatgttcctg 1/2 tCCtt 73 cyclin-dependent NM 004935. 1 BI669825. 1 0.8 GTGGCTCTGAAACGGggtgtgCCg kinase agttcc 74 histone deacetylase 1 NM 004964. 2 BF204295. 1 1.7 CAGAGGAGAAGAAAGggtcaagg aggaggt 75 WD repeat domain 1 NM 005112. 3 BCO30541. 1 1.4 CTAAGGAACATCGACgaccacagc cgcttt 76 ubiquitin specific NM 005153. 1 W19112. 1 0.7 ATCGGCTCTTTGCAGtggtctaccat protease 10 cacg 77 v-fos murine FBJ NM 005252. 2 BG541010. 1 0.8 CCTGTCAACGCGCAGgaCttCtgCa osteosarcoma viral cggac oncogenehomolog 78 v-fos murine FBJ NM 005252. 2 BG541010. 1 1.5 GGCAAGGTGGAACAGgagacaga osteosarcoma viral ccaacta oncogenehomolog 79 v-fos murine FBJ NM 005252. 2 BI325046. 1 1.5 GGCAAGGTGGAACAGgagacagg osteosarcoma viral acaacta oncogenehomolog 80 hemoglobin, alpha NM 000558. 3 R91899. 1, 1.7 ACCAAGACCTACTTCcgggtcaact 1/2 tcaag 81 hemoglobin, alpha NM 000558. 3 H58664. 1 1.7 GGAGGCCCTGGAGAGctcctaagc 1/2 cactgc 82 actin protein NM 005718. 2 AI470163. 1 0.7 ATTGCTGTGAAACAGgggtatgata 2/3 complex, subunit tcagc 4, 20kDa 83 B-cell receptor NM 005745. AI962313. 1 1,3 GCAGTTGCCACCTTCCaCgCCtga associated protein 31gcgtgg 84 RNA binding pattern NM 005778. 1 CA488450. 1 0.8 GAGCAGACAAGTTTGactctgaac protein 5 aggaag 85 CD164 antigen, NM 006016. 3 AF299342. 1 0.8 CTGTGACTCCAACCTCACAACC Sialomucin TGTGCGAAAGTCTACCTTTGAT GCAGCCAGTTTCATTGGAGGA ATTGTCCTGGTCTTGGGTGTG CAGGCTGTAATTTTCTTTCTTTA TAAATTCTGCAAATCTAAAGAA CGAAATTACCACACTCTGTAAA CAGACCCATTGAATTAATAAGG ACTGGTGATTCATTTGTGTAAC TCACTGAAGCCAAAATACTATC TTTTAAGATGTCCCACATGGAA GACGCTATTCCAGGATCTTTAA ATTTCCATGGATGCATATAGGA TGTTTGGGAGCATCATCCGTG AAGAAAAAATCAATTAAATCATT GTGTTCAACAGGAATATTTAAA ATATTCTGCATGAATCCTGTGG CTGTCTTATTTTAAATAGCTGC TGCTGTGGGATTATA 1 1 1 1 1 1 1 1 CCTTAACATGCCAAATATAACT TTCTGAAAGTGATGGAAAATGT TGTCTTGTGCAGACAACATCAT GGCTCTTGGCAGTTTA 86 CDI64 antigen, NM 006 016. 3 AF299343. 1 0.8 CAGCCAATTCTACAGctaaaccca sialomucin cagttc 87 ubiquitous alpha NM 006082. 1 BG981396. 1 1.3 CACCCTGAGCAGCTCaccaccca tubulin caccacc 88 talin 1 NM 006289. 2 AI393487. 1 1.4 CCCAGAGTATTAACGCTCCAA GAGTATTATTAACGCTGCTGTA CCTCGATCTGAATCTGCCGGG GCCCCAGCCCACTCCACCCTG CCAGCAGCTTCCAGCCAGTCC CCACAGCCTCATCAGCTCTCTT CACCGIIIIIIGATACTATCTT CCCCCACCCCCAGCTACCCAT AGGGGCTGCAGAGTTATAAGC CCCAAACAGGTCATGCTCCAAT AAAAATGATTCTACCTACAA 89 talin 1 NM 006289. 2 AI393487. 1 1.4 TGCTTCGGAAGGAACgagagctgg aagagg 90 talin 1 NM 006289. 2 AI417760. 1 1.4 AGGAAGAAATGCTTCggaaggaa acidic (leucine-rich) NM 006305. 2 AW577170. 1.4 AGGAGGATGAGGATGgaggatga nuclear 1 agaaggt phosphoprotein 32 family, member A 92 translocase of inner NM 006327. 1 AU142330. 1 0.7 GATGTTGCATGACAGGGACTACT mitochondrial ggcta membrane 23 homolog (yeast) 93 microspherule NM 006337. 3 CD369365. 1 1.5 AACTCTGTGGTGGAGacaggaag protein 1 ctggggc 94 APG7 autophagy 7 NM 006395. 1 BO000091. 1 0.7 CTTGTGCCTCACCAGatccgggga like (S. cerevisiae) tttCtt 95 acidic (leucine-rich) NM 006401. Y07570. 1 1.5 GAAGTCAGTGAGGAGgaagaaga nuclear atttgga phosphoprotein 32 family, member B 96 acidic (leucine-rich) NM 006401. Y07570. 1 1.4 AG GATGAG GATGAAG ag g agg aa nuclear gaaggtq phosphoprotein 32 family, member B 97 acidic (leucine-rich) NM 006401. 1 BF195216. 1 GAAGTCAGTGAGGAGaggaggaa nuclear gaaggtg phosphoprotein 32 family, member B 98 butyrophilin, NM 007049. 2 BQ929466. 1 1,3 CATTCTTACATGCTGaggaccgga subfamily 2, member gaagtg Al 99 EAP30 subunit of NM 007241. 2 BF525899. 1 1.2 CTGCAGCTGGCAGAGaagaatgg ELL complex ctacgtg 100 coatomer protein NM 007263. BM798704. 1.5 CCACGAGAGTCGGAGgaaggagC complex, subunit 1 epsilon tgaagag 101 soluble galactoside-NM 009587. 1 BG390210. 1 1.2 TACATCAGCTTCCAGaccagal binding lectin 9 gtcatc (galectin 9) 102 soluble galactoside-NM 009587. 1 BG698264. 1 0.8 CTCCAGTGGAACCAGgtttgctgtg binding lectin 9 aactt (galectin 9) 103 CDKNIA interacting NM 012127. AK027287. 1 1,3 TAGAAGCTGGTGGAGgtgaggtcc zinc finger protein 1 Agagat 104 CDKNIA interacting NM 012127. AK027287. 1 1,2 CAGGCACATTCACAGccgcatctgc zinc finger protein 1 cocoa 105 F-box protein 7 NM 012179. BE905968. 1 1 1,3 AATTTTGAAGCTGAGTCAATTC AAGATAATG CG CATATG G CAG AG G G CACAG GTTTCTATCCCT CAGAACCCATGCTCTGTAGTG AATCGGTGGAAGGGCAAGTGC CACAZTCATTAGAGACCTTGTA TCAATCAGCTGACTGTTCTGAT GCCAATGATGCCTTGATAGTGT TGATACATCTTCTCATGTTGGA GTCA 106 WW domain binding NM 012478. 2 BG820375. 1 1,2 CCACCTCCCTACTACCCaccggaa protein 2 gataag 107 px19-likeprotein NM 013237. 2 BM504662. 1,3 CACGCCCGGCTGATGggaatttggt 1 cttgc 108 insulin-like growth NM 000876. BM787853. 1,4 GGAAACAGTGATAAGTAAGCT factor 2 receptor 1 GACCACTTGCTGTAGGAGAAG TTCCAACGTGTCC 109 insulin-like growth BM787853. 1,3 GACGCATCTCAAAACAGAGGG factor 2 receptor NM 000876. January 1 CTGCATTCGAAGAAACCCTTGC TGCTTTAGTCCCGATAGGGTAT TTGACCCCGATATATTTTAGCA IIIIAATTCTCTCCCCCTATTTA TTGACTTTGACAATTACTCAGG TTTGAGAAAAAGGAAAAAAAAA CAGCCACCGTTTCTTCCTGCCA GCAGGGGTGTGATGTACCAGT TTGTCCATCTTGAGATGGTGAG GCTGTCAGTGTATGGGGCAGC TTCCGGCGGGATGTTGAACTG GTCATTAATGTGTCCCCTGAGT TG GAG CTCATTCTGTCTCTTTT CTCIIIIGCTTTCTGTTTCTTAA GGGCACACACACGTGCGTGCG AGCACACACACACATACGTGC ACAGGGTCCCCGAGTGCCTAG G I I I I GGAGAGTTTGCCTGTTC TATGCCTTTAGTCAGGAATGGC TGCACC I Iii I GCATGATATCT TCAAGCCTGGGCGTACAGAGC ACATTTGTCAGTA 1 1 1 1 1 GCCG insulin-like growth AAGGAGGTCAGGCCCCACTCC 110 factor 2 receptor NM 000876. 1 BF222741. 1 1.3 j TTCCTGATTGTTTACAGTCATT GGAATAAGGCATGGCTCAGAT CGGCCACAGGGCGGTACCTTG TGCCCAGGGTTTTGCCCCAAG TCCTCATTTAAAAGCATAAGGC CGGACGCATCTCAAAACAGAG GGCTGCATTCGAAGAAACCCT TG CTG CTTTAGTC C C G GATAG GTATTTGACCCCGATATATTTT AGCATTTTAATTCTCTCCCCCT ATTTATTGACTTTGACAATTACT CAGGTTTGAGAAAAAGGAAAAA AAAACAGCCACCGTTTCTTCCT GCCAGCAGGGGTGTGATGTAC CAGTTTGTCCATCTTGAGATGG TGAGGCTGTCAGTGTATGGGG CAGCTTCCGGCGGGATGTTGA ACTGGTCATTAATGTGTCCCCT GAGTTGGAGCTCATTCTGTCTC TTTTCTCTTTT G CTTTCTGTTTC TTAAGGGCACACACACGTGCG TGCGAGCACACACACACATAC GTGC 111 insulin-like growth NM 000876. 1 BF222741. 1 GTGGCTGATGGAAGAgatccagct factor 2 receptor 1.3 gcctcc 112 adducin 1 NM 014190. 2 CA396829. 1 1,3 AGAAGGGCTCTGAAGagaatctgg (alpha) / adducin 1 acgagg (alpha) eukaryotic translation BM142283. GTTAACCTCACCCTACAGATGA initiation factor 2-AGATAATAGAGCAAGAAAAAGA AATTGCAGAACTAAAGAAGCAG CTAAACCTCCTTTCTCAAGACA AAGGGGTGAGGGATGACGGAA AGGATGGGGGCGTGGGATGAA 113 alpha kinase 1 NM 014413. 2 1 1.3 AGTGGAC 114 KIAA0040 NM 014656. 1 I CB050264. 1 1,7 ATGGTTCCCAAGTGTgtgtaagtgt gtgta 115 Homocysteine-NM 014685. 1 BG828243. 1 0.8 G G G TG CATCAG CTTTTG TTC CA inducible, CCACCAAGTGCACAAGAGATA endoplasmic reticulum stress-CCTGTGGTCTCTGCACCTGCT CCAGCCCCTATTCACAACCAGT inducible, ubiquitin-like domain member TTCCAGCTGAAAACCAGCCTG CCAATCAGAATGCTGCTCCTCA 1 AGTG GTTG TTAATC CTG GAG C CAATCAAAATTTGCGGATGAAT GCACAAGGTGGCCCTATTGTG GAAGAAGATGATGAAATAAATC GAGATTGGTTGGATTGGACCT ATTCAGCAGCTACATTTTCTGT TTTTCTCAGTATCCTCTACTTCT ACTCCTCCCTGAGCAGATTCCT CATGGTCATGGGGGCCACCGT TGTTATGTACCT 116 NM 014685 homocysteine. 1 BG828243. 1 0.7 GGAAAACATCTCAAGgcctgaagct inducible, gccca endoplasmic stress-inducible reticulum, ubiquitin-like domain member 117 Homocysteine-NM 014685. 1 BG828243. 1 0.8 GTACTACATGCAATAtttagcagcc inducible, actgc endoplasmic reticulum stress-inducible, ubiquitin-like domain member 118 lysosomal-associated NM 014713. 2 AL039105. 1 CTGATTCCATTCTTCTGTTACC protein transmembrane 4 alpha 0.8 GAC11111GACTTCGTCCTCAG TTGCCTGGTTGCTATTAGTTCT CTCACCTATTTGCCAAGAATCA AAGAATATCTGGATCAACTA 119 DAZ associated NM 014764. 1 AU118651. 1 0.8 AGCAGTACCTCCCTAAAGCATT protein 2 TTGAGGTAGGGGGAGGTATCCA TTCATAAAATGAATGTGGG 120 ring finger protein 10 NM 014868. BU626650. 1 1.5 CGAGAGCGCAGGATTGAGATA GAGGAGAACA 121 ring finger protein 10 NM 014868. BU626650. 1 1.5 AGAAACAGGGCAAGTacccagaa gtccaca CAGAAGTCCACATTCCCCTCG AGAATCTACAGCAGTTTCCTGC CTTCAATTCTTATACCTGCTCC TCTGATTCTGCTTTGGGTCCCA CCAGCACCGAGGGCCATGGG GCCCTCTCCATTTCTCCTCTCA 122 ring finger protein 10 NM 014868. 3 BF815780. 1 1,3 GCAGAAGTC 123 ring finger protein 10 NM 014868. 3 BF815780. 1 1,4 CAGGTTCCCATGCAGactttctgctg accc 124 ribosomal protein L4 BM846228. 0.7 TCAGTGAATTAG CAGgtcatcag ac NM 000968. 2 1 tagtg 125 ribosomal protein L4 NM 000968. 2 CB141160. 1 0.7 GTCATCAGACTAGTGCTGAGT CTTGGGGTACTGGCAGAGCTG TGGCTCGAATTCCCAGAGTTC GAGGTGGTGGGACTCACCGCT CTGGCCAGGGTGCTTTTGGAA ACatgtgtcgtg 126 ribosomal protein L4 NM 000968. 2 CD686462. 1 0.6 GCGTGTGCTCGCCCACTGATA TCGGTGTACTCCGAAAAGGGG GAGTCATCTGGCAAAAATGTCA CTTTG C CTG CT G TATTCAAG G C TCCTATTCGACCAGATATTGTG AACTTTGTTCACACCAACTTGC GCAAAAACAACAGACAGCCCT ATGCTG 127 ribosomal protein L4 NM 000968. 2 CD686462. 1 0.7 TGGTCATGTCTAAAGgtcatcgtatt gagg 128 ribosomal protein L5 NM 000969. 2 GATATCATTTGTCAG attg cttatg cc BE879402. 1 0.8 Cgt 129 BAT2 domain NM 015172. 1 AB029019. 2 1,4 CACTTCCACCTTCAAccttagctcca containing 1 gttt 130 adenosine deaminase, NM 015841. 1 AA096321. 1 1,3 TCACACAGGACAGAGgaggcaga RNA-specific gcttctg 131 endoplasmic NM 015913. 2 CA418006. 1 0.8 TTCCCAAATTTCTACagcctctttcct reticulum thioredoxin Ctt superfamily member, 18 kDa 132 hypothetical protein NM 016127. 3 AA576624. 1 0.8 GGAGTCAAACACTGGatgcagaa MGC8721 attttgg 133 tetratricopeptide NM 017775. BE164618. 1 1.5 TTTTGATGCACAGAGctaccaccac repeat domain 19 agtgc 134 hypothetical protein NM 017841. 1 BI553945. 1 0.8 TCI111IGCTAAAGAACATCTG FLJ20487 CAGCACATGACAGAAAAGCAG CTGAACCTCTATGACCGCCTG ATTAACGAGCCTAGTAATGACT GGGATATTT 135 hypothetical protein NM 017865. 2 AK001070. 1 1.6 ACCAGAGACCTTCCCACCTCC FLJ20531 AGGAGAGGAAGAGGGTGAGG AAGAAGAGGACAATGATGAGG ATGAAGAGGAGATGCTCAGTG ATGCCAGCTTATGGACCTACA G 136 ubiquitin B NM 018955. BE708511. 1 1,4 CCCTGACCGGCAAGAcatcactctg gaggt 137 ubiquitin B NM 018955. 2 AA206538. 1 1.4 CAG GTCAAAATG CAGatcttcgtga aaacc 138 ubiquitin B NM 018955. 2 AA206538. 1 1,3 CAG G TCAAAAT G CAG has been agglutinated to agglutine ubiquitin B NM 018955. 2 AA340917. 1 1.4 AGATCTTCGTGAAGACCCTGA CCGGCAAGACCATCACTCTGG AGGTGGAGCCCAGTGACACCA TCGAAAATGTGAAGGCCAAGA TCCAAGATAAAGAAGGCATCC CCCCCGACCAGCAGAGGCTCA TCTTTGCAGGCAAGCAGCTGG AAGATGGCCGCACTCTTTCTGA CTACAACATCCAGAAAGAGTC GACCCTGCACCTGGTCCTGCG CCTGAGGGGTGGCTGTTAATT CTTCAGTCATGGCATTCGCAGT CCCAGTGATG G G G CATTACTCT CACTATAG C CATTTG CCCCAA CTTAAGTTTAGAAATTACAAGT TTCAGTAATAGCTGAACCTGTT CAAAATGTTAATAAAGG ubiquitin B 140 018 955 NM. 2 BU661443. 1 1.4 TCCCTGTGGGTGGACGTGGTT GGTGATTGGCAGGATCCT 141 ubiquitin B NM 018955. 2 BU661443. 1 1,5 GGTTGGCTTTGTTGGgtgagcttgttgtg 142 homoiogog, su sub family NM 018981. 1 AF490904. 1 1,3 CTTAGTGGCATGTTGgatggtcttgt om taat C, member 10,143 major NM 019111. 2 AA505585. 1 AGAGAGAAGATCACTGAAGAA histocompatibility ACTTCTGCTTTAATGACTTTAC complex, class II, DR AAAG CTG G CAATATTACAATC C alpha TTGACCTCAGTGAAAGCAGTCA 0.8 TCTTCAGCGTTTTCCAGCCCTA TAGCCACCCCAAGTGTGGTTAT GCCTCCTCGATTGCTCCGTACT CTAAC 144 major NM 019111. 2 AA505585. 1 0.8 ATCTAGCTGGCTTTCcctgtctattg class II, 9 complex, I, DR CCtt alpha 145 major NM 019111. 2 CD686254. 1 TCAGAGACAGTCTTCCTGCCC histocompatibility complex, class II, DR alpha 0.7 AGGGAAGACCACCTTTTCCGC AAGTTCCACTATCTCCCCTTCC TGCCCTCAACTGAGGACGTTTA CGACTGCAGGGTGGAGCACTG GGGCTTGGATGAGCCTCTTCT CAAGCACTGGGagtgtgatgctccaag CCCtCtC 146 KIAA1191 protein NM 020444. 2 ', BE313670. 1 1.7 GCAGCAGGATCACAGgtggagga aggagaq 147 KIAA1191 protein NM 020444. 2 BI254429. 1 1.5 GCAGCAGGATCACAGaacagacc caggaaa 148 mesoderm induction NM 020948. 1 AY124186. 1 0.7 TTTCTCACACAGGCAtactccaaat early response 1 gCtC 149 Actinin, alpha 1 NM 001102. 2 BG036045. 1! 0.7 AACCACTTTGACCGGggagaagc agaattt 150 membrane-spanning NM_152851 GGAACTCTCTCTCTGATGCTGA 4-domains, subfamily NM 022349. 2. 1 0.8 TTTGCACTCTGCTGGAATTCTG A, member 6A CCTAGCTGTGCTCACTGCTGT GCTGCGGTGGAAACAGGCTTA C myelin protein Zero- TGCCCTGGCATTCTGGCAGAG AATCCTCACCAGTTCTCACCAA CCTTCCCCCCAGGCAAGGGCA GCTGCCAGCATGGTGCTCTGC CAGGACAGGTTTCCCTGAAGG AAGCTGCTCACACTGAGATGA GCCTCTCAGGGCAGGACCTCT TCCCAAGCCCTGCACACCCAC like 1 151 024 569 NM. 2 AI693779. 1 0,9 CCCTGCAGCCCTTTTGGCTC 152 mouse GCTTCAGTCCGCCGAgagcagtac mouse limbimb-bg ud to NM 030915. 1 W45195. 1 1.1 cgtgtg heart Bene heterogeneous CTGACCCAGATAAAACAAAAAG TGGATTCTCTCCTGGAAAACCT 153 nuclear GGAAAAAATGGAAAAGGAACA ribonu cleoprotein C NM 031314. 1 CF128443. GAGCAAACAAGCAG ib (C1 / C2) polymerase (RNA) II NMG 002694 GGTGGGGGTGGAAGCagccgca (DNA directed) polypeptide C C15433kDa NM 032940. 1. 2 1,3 ggagcaag caspase 4, apoptosis-NM_001225 155 related cysteine NM 033306. 1. 2 1,2 TGTTCCCTATGGCAGaaggcaacc protease acagaa 156 major NM 033554. 2 BU621846. 1 0.8 GTCGCTGAGAGCCTCTTCCTG histocompatibility CCCAGAACAGATTACAGCTTCC complex, class II, DP ACAAGTTCCATTACCTGACCTT alpha 1 TGTGCCCTCAGCAGAGGACTT CTATGACTGCAGGGTGGAGCA CTGGGGCTTGGACCAGCCGCT CCTCAAGCACTGGG 157 major NM 033554. 2 BU621846. 1 0.7 ACCCTCATCTGCCACATTGACA histocompatibility AGTTCTTCCCACCAGTGCTCAA II, CGTCACGTGG complex, class DP alpha 1 158 major NM 033554. 2 BU621846. 1 0.7 AAGGAGCCTGTGGAGctgggcca histocompatibility gcccaac complex, class II, DP alpha 1 159 FK506 binding NM 054014. 1 NM 000801 GCTCCCTGTTCTTGGatctgccatg protein 1A, 12kDa. 2 0.8 gaggq 160 intracellular chloride! NM 001288. 3 BG491600. 1 CCAGGGACGGCCACTTCCTGG channel 1 TCCCCGACGCAACCATGGCTG AAGAACAACCGCAGGTCGAAT 0.8 TGTTCGTGAAG 161 intracellular chloride NM 001288. 3 AV683308. 1 GGGCAGCTCCCATTCctgctgtatg channel 1 0.7 gcact 162 vacuolar protein NM 080631. 1 BX648347. 1 ACCGGAGGGAAGAAGgtgtgctca sorting 41 (yeast) 1,4 gtgaag 163 zinc finger protein 1 BCO53361. 1 1.5 μGGAAAGGAGAAGGAATAAGA 384 NM 133476. 2 CGGCAGGAGGAAGAGAGAGAGA 1 GAGG 164 chromosome 19 open NM 138774. 2 AI375989. 1 1,4 AC CTCATCTCG G CCAg tg ctg cct reading frame 22 ggagg 165 nuclear receptor NM 000176. 1 1 X03348. TTCCTAAGGACGGTCTGAAGA 1 subfamily 3, group C, GCCAAGAGCTATTTGATGAAAT member 1 TAGAATGACCTACATCAAAGAG (glucoc ucocorticoid CTAGGAAAAGCCATTGTCAAGA receptor) GGGAAGGAAACTCCAGCCAGA ACTGGCAGCGGTTTTATCAACT GACAAAACTCTTG 166 Nucleosome NM 139207. 1 BU620919. 1 0.8 AGTGATATGGTTCAG9aacacgat assembly protein 1- gaacct like 1 167 1 NM 139207. 1 AU117948. 1 0.8 ATGAATATTTTACAAATGAAGT Nucleosome GCTGACAAAGACATACAGGAT assembly protein 1- GAGGTCAGAA like 1 168 Nucleosome NM 139207. 1 AU117948. 1 0.8 GCTGGCCAGCCTATGagttttgtctt assembly protein 1-agaa like 1 169 Nucleosome NM 139207. 1 AK122670. 1 0.8 CCTGCCTAGGGTAGTTAAAAG assemblyprotein 1- ACGAGTGAATGCTCTCAAAAAC like 1 CTGCAAGTTAAATGTGCACAGA TAGAAGCCAAATTCTATGAGGA AGTTCACGATCTTGAAAGGAAG TATGCTGTTCTCTATCAG 170 TAF15 RNA NM 139215. 1 BF812650. 1 1.3 GCTATGGTGGGGACAGAGGAG polymerase II, TATA box binding protein GCGGCTATGGAGGAGACCGAG GAGGTGGCTATGGAGGAGATC (TBP) -Associated GAGGTGGCTATGGAGGAGACC factor, 68kDa GAGGTGGAGGCTATGGTGGAG ACCGAGGAGGCTATGGAGGAG ATCGAGGAGGTTACGGAGGAG ATCGAGGAGGTTATGGAGGAG ATCGAGGAGGCTATGGAGGAG ACAGAAGCCGGGGGGGCTATG GAGGAGACCGTGGTGGTGGCA GTGGCTACGGTGGAGACCGAA GTGGAGGCTATGGAGGAGACA GGAGTGGTGGCGGCTATGGAG GAGACCGAGGTGGGGGCTAC GGAGGAGACCGAGGTGGCTAT GGAGG 171 TAF15 RNA NM 139215. 1 BF812650. 1 1.3 GGGACAGAGGCGGCGgctatggt polymerase II, TATA box binding protein ggggaca (TBP) -Associated factor, 68kDa GACCACAGCAATGACCAGCCC TCATTAGGGCCCTGGATGATTT TTG GT CTAATAAC G CATG CGAT TGTTGATGIIIIIIGGTCAGAG GGTATGAACAGGAAGAATTAAA TG CAG CAG G CTTTATTTTAAAT GCCGATTCACATTACTCTGTTC AAGCTGCGTTGAGATGTTAAAC TGGCTTACTATAGACTTCGTAA AAATGGCTCCAGAAAAGTAACA AACTGAAATCTTTGAGATCACA CAGGTTGGAAATATGTACATAA CTG CACAAG GTGTCAATTCTG C TCTACAGTGCAGTTTTAGTCAG iiiIAGTTGCATAGGTTTCCAT TGTATTTATAGTCTGTTTATGCT AAATCTGGCCAAAGATGAACAT TGTCCACCACTAAAATGCCTCT GCCACTTTGAATTCTGTGCTAA 172 DEAD (Asp-Glu- NM 001356. BE000596. 1 GTGGCCAGAATGCGGTG Ala-Asp) box polypeptide 3, X-linked ATCAAAACGCTCCATC I CAGTGGCATAGGAAGACGGCA AAAATTTCCTAAAGTGCA 173 girlfriend II NM 152727. AK094867. 1 1,4 ACCCCTTCTG CTCAGgtgtgg atgg tattg 174 heat shock 70kDa NM 153201. 1 BU731317. 1 0.8 ACTCGTATCCCCAAGattcagaag protein 8 CttCtC 175 hypothetical protein NM 153233. 1 BCO38360. 1 1.4 ACTGGCCAGGACCTGgaagcaga FLJ36445 cacctct 176 dynein, cytoplasmic, NM 001378. 1 U39575. 1 1.4 GGAAGACAAAGAAGGagagattca intermediate agcagg polypeptide 2 / dynein, cytoplasmic, intermediate 1 polypeptide 2177 tropomyosin 3 NM 153649. 1 BM006741. 1,3 TGATGAGAGTGAGAGgcagagac 1 ccgtgct 178 tropomyosin 3 NM 153649. 1 BM006741. 1,3 TGATGAGAGTGAGAGagaggatgctg 1gaccag 179 EST BE881352. 1 - BE881352. 1 1,3 TCAATATAAAACCCCcacctaccac acatt 180 EST AW368637. 1 - AW368637. 1.4 CTTAAACTCCAGCACcatcatagcc 1 accat 181 eukaryotic translation NM 001402. 4 AU146228. 1 1.5 CACCAATGGAAGCAGtggacaag elongation factor 1 aaggctg alpha 1 182 eukaryotic translation NM 001402. BU580573. 1 0.8 CATCAAAGCAGTGGACAAGAA elongation factor 1 alpha 1 GGCTGCTGGAGCTGGCAAGGT CACCAAGTCTGCCCAGAAAGC TCAGAAGGCTAAATGAATATTA TCCCTAATACCTGCCACCCCAC TCTTAATCAGTGGTGGAAGAAC GGTCTCAGAACTGTTTGTTTCA ATTGGCCATTTAAGTTTAGTAG TAAAAGACTGGTTAATGATAAC AATGCATCGTAAAACCTTCAGA AGGAAAGGAGAATGTTTTGTG GACCACTTTGGTTTTC 1 I 1 1 G G CGTGTG CAGTTTTAAGTTAT TAG 11111 AAAATCAGTACTTTT TAATGGAAACAACTTGACCAAA AATTTGTCACAGAA I I I I GAGA CCCATTAAAAAAGTTAAATGAG 183 NM 001402 eukaryotic translation. AA595862. 1 0.8 TGCGGTGGGTGTCATCAAAGC elongation factor 1 alpha 1 AGTGGACAAGAAGGCTGCTGG AGCTGGCAAGGTCACCAAGTC TGCCCAGAAAGCTCAGAAGGC TAAATGAATATTATCCCTAATAC CTGCCACCCCACTCTTAATCAG TGGTGGAAGAACGGTCTCAGA ACTGTTTGTTTCAATTGGCCAT TTAAGTTTAGTAGTAAAAGACT GGTTAATGATAACAATGCATCG TAAAACCTTCAGAAGG 184 NM 001404 eukaryotic translation. 3 AA206367. 1 0.7 CTGAGTCCAGATTGGCAGGTG elongation factor 1 GACTACGAGTCATACACATGG gamma CGGAAACTGGATCCTGGCAGC GAGGAGACCCAGACGCTGGTT 1 eukaryotic translation NM 001404. 3 AA206367. 1 0.6 CAGCATGTGGGCAAAGCCTTC 185 elongation factor 1 AATCAGGGCAAGATCTTCAAGT gamma GAACATCTCTTGCCATCACCTA G 186 eukaryotic translation NM 001404. 3 BQ375267. 1 0.8 GTTCTAGAGCCTTCTTTCCGCC elongation factor 1 AGGCCTTCCCAATACCAACCG gamma CTGGTTCCTCACCTGCATTAAC CAGCCCCAGTTCCGGGCTGTC TTGGGCGAAGTGAAACTGTGT GAGAAGATGGCCCAGTTTGAT Gctaaaaagtttgcagag 187 eukaryotic translation NM 001404. 3 BU783548. 1 0.6 ATTTAAGCGCAAGTACTCCAAT elongation factor 1 GAGGACACACTCTCTGTGGCA gamma CTGCCATATTTCTGGGAGCACT TTGATAAGGACGGCTGGTCCC TGTGGTACTCAGAGTATCGCTT CCCTGAAGAACTCACTCAGAC CTTCATGAGCTGCAATCTCATC ACTG 188 eukaryotic translation NM 001404. 3 BE502067. 1 0.6 TGGACAAGCTGAGGAAGAATG elongation factor 1 gamma CCTTCGCCAGTGTCATCCTTTT TGGAACCAACAATAGCAGCTC CATTTCTGGAGTCTGGGTCTTC! CGAGGCCAG 189 eukaryotic translation NM 001404. 3 BE502067. 1 0.6 CAGGTGGACTACGAGTCATAC elongation factor 1 ACATGGCGGAAACTGGATCCT gamma GGCAGCGAGGAGACCCAGAC GCTGGTTCGAGAGTACTTTTCC TGGGAGGGGGCCTTCCAGCAT GTGGGCAAAGCCTTCAA 190 eukaryotic translation j GAGCTTGCCTTTCCGctgagtcca elongation factor 1 NM 001404. 3 BE502067. 1 0.6 gattgg gamma 191 eukaryotic translation NM 001404. 3 Î AAGGAGGAGAAAAAGGCGGCT elongation factor 1 GCCCCTGCTCCTGAGGAGGAG gamma ATGGATGAATGTGAGCAGGCG CTGGCTGCTGAGCCCAAGGCC AAGGACCCCTTCGCTCACCTG BG533219. 1 0.8 CCCAAGAG 192 eukaryotic translation NM 001404. 3 BG615194. 1 0.7 TTCCGCCAGGCCTTTCCCAATA elongation factor 1 CCAACCGCTGGTTCCTCACCT gamma GCATTAACCAGCCCCAGTTCC GGGCTGTCTTGGGCGAAGTGA AACTGTGTGAGAAGA 193 eukaryotic translation NM 001404. 3 BG702200. 1 0.8 GTTCACGGGAAGAGAAGCAGA elongation factor 1 AGCCCCAGGCTGAGCGGA gamma 194 O-linked N-AW002377. ; AAAGAATATCTAGCCctctgttcaac acetylglucosamine NM 181673. 1 1 0.8 Acca (GIcNAc) transferase 195 tyled N-AW002377. 0.8 CACGAAAAGTAGCCGctctggtga acetylglucosamine NM 181673. 1 agct ace (GIcNAc) transferase 196 WD ana AB028960. AK023778. 1 1,3 CCAATTTCTTTGGCAgcaacgctca tetratricopeptide gtata repeats 1,197 actin, gamma 1 NM 001614. 2 CD687776. 1 1,2 CATGGAGAAGATCTGggcgcacc actggca 198 arachidonate 5- NM 001629. 2 BX366320. 1 1,3 AGAGAGAACGCAGAGgccacgga lipoxygenase-agccctq activating protein 199 Homo sapiens AK026373. 1 AV725084. 1 1,2, ATTTCAACAGCTGAGgaaggtgtctt cDNA: FLJ22720 fis, gctg clone HS114320 Homo sapiens AK026373. 1 BQ276346. 1 TTTGGCAGGAAGGTGTCTTGC 200 cDNA: FLJ22720 fis, 0.8 TGCAGGTAACTAATGAAGAAGT clone HSI14320 GGTCAACCACAGAGTCTTCAA GAAATAAGAAATTCTGTACCAT CTGAAAGTAGTTCTTGTTGGTG CCTTCATTTAAAAAGCACTCTT TAAAATAAAAGGGAAATGTTTT CTGATAAAA 201 DKFZp586K2322 AL080113. 1 AV751235. 1 0.8 TTTATTTTCAAATGCAGTGTAG (NM_006386 or NM 030881 AGCTAGATTAAAAGCAACTCTT) TG CCACCTACTCTG C CCTTTTG GCAAAGTTACCTTGAACAAAGA ATCTTAAG G GTTTATTAAGAAC TCTTTAIIIICTTCATACCCTGT TCTCTGCAGTGCTTTCTAACAG CTTCTGGGTGCAGA IIII CTTC GG CTT CATCCTTTTG CACTCAG ATTACAG GTAG GTAGTG CTTAA GAAAAGTCATGGAGGACTAAA GCCTAAGTCCTTTTCACTTTTC CTCCATCTGAAGGTAGGTGAG TTCATCCTCTTCATAGTAATGC TGIIIIACCAAGACTTTATAGC AGATG GACCCAGAAAGAATTTT CTGCTATTGTGTTCACTACAAC AGGATAGGGACATCAGACAGC CCCAGAAACCCCTTCCAGATCT GATATGGGACTATTAA IIIII AT GCTGTTAATTGGTATTCATTCA CAATGCAGTTGAAGGGGGAAG GCTCCACTGCATTCTTTG 202 calmodulin 2 ATTGCAAAACGGGTGtattatccag (phosphorylase NM 001743. 3 BF701704. 1 0.8 • tact kinase, delta) 203 calpain, small subunit Î 1 / calpain, small CCTTTGAGGCAGCAGatgaaagtg subunit 1 NM 001749. BE907701. 1 1.4 ggaaca 204 Cysteinyl-tRNA AAACAGGAACAAGAAgcagcaaa synthetase NM 001751. 3 AK125503. 1 1.5 gctggcc 205 CD97 antigen NM_078481 ATACCGTCTGTGAAGatgtggacg NM 001784. 2. 1 1,3 agglucos 206 CD97 antigen NM 001784. 2 BIO28545. 1 1,2 CAGGCTGGAAGCCCAGACACG GAATCCCGGATAACCAAAAGG ACACTGTCTGTGAAG 207 Homo sapiens BCO34141. 1 AV688287. 1 1.4 CAAGGGACCAAGGTGgagcagttg immunoglobulin constant kappa aaatct, mRNA 208 ferritin, heavy NM 002032. 1 BG248923. 1 0.6 TGTCTCTGGGGATCCCTAGTAT polypeptide 1 AACACATGCA 209 Homo sapiens BCO53635. 1 AI806846. 1 1,5 CCGGCTGGTCAAAGTgtgggtgct chromosome 4 open Î ggcagc reading frame 9 210 Homo sapiens T-cell K02885. 1 U85050. 1 0.8 GGTTCGGGGACCAGGttaaccgtt receptor active betagtagag chais V-D-L2-C-beta- 1 (TC (TCRB) mRNA 211 Homo sapiens T-cell K02885. 1 AF043180. 1 0.8 GAGGACCTGAACAAGGTGTTC receptor active beta- CCACCCGAGGTCGCTGTGTTT drain - GAGCCATCAGAAGCAGAGATC 1. 2-C-beta-1 bet (TC (TCRB) TCCCACACCCAAAAGGCCACA mRNA CTGGTGTGCCTGGCCACAGGT ATCTTCCCTGACCACGTGGAG CTGAGCTGGTGGGTGAATGGG AAGGAGGTGCACAGTGGGGTC AGCACGGACCCGCAGCCCCTC AAGGAGCAGCCCGCCCTCAAT GACTCCAGATACTGCCTGAGC AGCCGCCTGAGGGTCTCGGCC ACCTTCTGGCAGAACCCCCGC AACCACTTCCGCTGTCAAGTCC AGTTCTACGGGCTCTCGGAGA ATGACGAGTGGACCCAGGATA GGGCCAAACCCGTCACCCAGA TCGTCAGCGCCGAG 212 Homo sapiens T-cell K02885. 1 AF317590. 1 1,2 TCTGTGCCAGCAGCCTAGACAGAGAGAGACAGAGAGAGOAGA BACTIVE ACTIVE BACTERATORS V-D-J-BETA-1. 2-C-beta-1 (TCRB) mRNA 213 Homo sapiens T-cell K02885. ATATCTCTGCAGCGTcgggggtca receptor active beta- AF327908. 1 0.7 atctqg flesh - 1. 2-C-bet beta-1 (TC RB) mRNA 214 Homo sapiens T-cell CTGCAGTGCTAGAGAtccggggtc receptor active beta- K02885. 1 AF327022. 1 0.7 ccatca flesh V-D-J-beta- 1. 2-C-beta-1 (TCRB) mRNA 215 Human T-cell CCAGATCGTCAGCGCcgaggcctg receptor active beta- M12886. 1 BCO36926. 1 0.8 gggtag chain mRNA 216 Human myosin-IXb CCTCTGGCCTCACAGtgtcccagg mRNA U42391. 1 BF948234. 1 1,3 agcaca 217 glutathione TGGAACTTCACCAAGttcctcatc9 a peop NM 002085. 1 BI254515. 1 0.8 caag (Phospholipid hydroperoxidase) 218 Poly (A) binding protein, cytoplasmic TTCCAACTGTTTAAAattgatcaggg 1 NM 002568. 1 AA033548. 1 0,9 acca 219 CAGTATGGTGAGGAGgtgtgagag prohibitin NM 002634. 2 BI261762. 1 1.4 ggtcca 220 BMO 14639. TCAGAGTGGAAGCAGgtgagaatg prohibitin NM 002634. 2 1 1.6 gagggg 221 BMO 14639. CTTTGGGCGAGGAGAgtgtgagag prohibitin NM 002634. 2 1 1.4 ggtcca 222 GCGAGGAGAGTGCGTgtgtgaga prohibitin NM 002634. BE536369. 1 1.4 gggtcca 223 TTCCAGAAGCCAGAGAGACCA AGTGTTATGTAAGAAGTAGTGT spermidine / spermine CGGCTGTGTAGAACCACTGAC NI-acetyltransferase NM 002970. 1 CD108391. 1 0.8 TACACAGGCCGAAGTTACTGA GAACTTGGACAGAAAAAATAGC CAGCAAGTGTTCAAACTACTGA GG TTAGATATG CTG CACTTAAGAATACTAG GGC AGGTT 224 secret protein secreted protein NM 003118. 1 BG424815. 1 1.4 CAGGAGACACCCCG (osteonectin) 225 or NM 003295. 1 BG284235. 1 0.6 TTCTTTATTGGTGAAAACATGA ontr ied tumor ATC CAGATG G CATG GTTG CTCT controlled ATTG GACTACCGTGAG GATG G protein 1 TGTGACCCCATATATGATTTTC TTTAAGGATGGTTTA 226 translationally- NM 003295. 1 BG284235. 1 0.7 GAAATGGAAAAATGTtaacaaatgt controlled tumor ggcaa protein 1 227 translationally- NM 003295. 1 CD641954. 1 0.7 TTATTTTGGATCTATCACCTGT controlled tumor CATCATAACTG G CTTCTG CTTG protein 1 TCATCCACACAACACCAGGACT TAAGACAAATGGGACTGATGTC ATCTTGAGCTCTTCATTTATTTT

GACTGTGATTTATTTGGAGTGG AGGCATTG 1 1 1 1 1 AAGAAAAAC ATGTCATGTAGGTTGTCTAAAA ATAAAATGCATTTAAAC 228 translationally- NM 003295.1 AV749932.1 0,6 CGTCGTCTCCCTTCAGTCGCC controlled tumor ATCATGATTATCTACC contro protein 1 229 translationally- NM 003295.1 AV749932.1 0,6 GGGACCTCATCAGCCacgatgag controlled tumor atgttct protein 1 230 vasodil ator- NM 003370.1 BQ340296.1 1,5 TTTATTTCCTACCAGcagg agg a g stimulaulated ccagag phosphoprotein 231 cold shock domain 1,2 CCCAAGGTACCGTAGcaggggac protein A NM 003651.3 BG739968.1 ctcctcg 232 cold shock domain 1,5 AATAACCCACGGAAATATCTGC protein A NM 003651.3 BE935120.1 .GCAGTGTAGGAGATGGAGAAA CTGTAGAGTTTGATGTGGTTGA AGGAGAGAA 233 cold shock domain 1,5 GTATTTGTACATCAGactgccatca protein A NM 003651.3 BE935120.1 agaag 234 chromosome 22 open 1,5 GAAGCAGAAGAGGTGTGAAAG reading frame 19 NM 003678.2 AK122673.1 AAGGTGCTGCTGGGAGGGGA GTCTGACAACCCAGC 235 Homo sapiens 0,8 ACGACATCGCAGAAGGTGGCT ve mb assoo CGGAAATTCTGGTGGAAGAAC membrane protein 8 GTGAAGATGATTGTCCTTATCT (endobrevin) NM 003761.2 BG623073.1 GCGTGATTG11111ATCATCAT CCTCTTCATTGTGCTCTTTGCC ACTGGTGCCTTCTCTTAAGTAA CAGGGAACCTCTCCCACCTGC CCTTC I I I I CAGGGACAACCCT CCATAAATGTGTGCCAAGAGG GTCTCCTTTCCTGTCTTCCTCT ACAGAGAATGCTGCTCGGTCC TCCTACCCCTCTTCCCGAGGC CTGCTGCCACGTTGTATGCCC CAGAAGGTACCTTGGTCCCCC GGAAGGAGAGAA 236 Homo sapiens NM 003761.2 BG623073.1 0,7 GATCTGGAAGCCACAtctgagcact vesicle-associated tcaag membrane protein 8 (endobrevin) 237 CASPB and FADD- NM 003879.3 BI871546.1 1,3 AGTACAAGCAGTCTGgtggatgga like apoptosis atggaa regulator 238 beta-2-microglobulin NM 004048.2 BM 831738. 0,8 AG GTTTACTCACGTCATCCAG C 1 AGAGAATGGAAAGTCAAATTTC CTGAATTGCTATGTGTCTGGGT TTCATCCATCCGACATTGAAGT TGACTTACTGAAGAATGGAGA GAGAATTGAAAAAGTGGAGCA TTCAGACTTGTCTTTCAGCAAG GACTGGTCTTTCTATCTCTTGT ACTACACTGAATTCACCCCCAC TGAAAAAGATGAGTATGCCTGC CGTGTGAACCATGTGACTTTGT CACAGCCCAAGATAGTTAAGT GGG 239 beta-2-microglobulin NM 004048.2 BM831738. 0,7 TG GAG GCTATCCAGCgtactccaa 1 agattc 240 Guanine nucleotide NM 004125.2 BC015206.1 0,6 GTGGAGAGGATCAAGgtctctcag binding protein, gcagct gamma 10 241 BCL2-associated NM 004323.2 BI826041.1 1,2 TGACTGTCACCCACAgcaatgaga athanogene agcacg 242 c-src tyrosine kinase NM 004383.1 BG953215.1 1,1 CCTCAAGTTCTCGCTagatgtctgc gaggc 243 HLA-B associated NM 004640.3 CB148695.1 1,4 GCCCCAGGAGGAGAGgtgagctg transcript 1 aagatgg 244 family with sequence NM 004699.1 AA024425.1 0,8 CACGGGGGAAGAGTGgaccactct similarity 50, tcaact member A 245 family with sequence NM 004699.1 AA584911.1 1,6 TCAAGAGTGAGTGTTtgcggagtca si gaggc similarity 5 50, member A 246 GNAS complex locus NM 000516.3 BU784018.1 1,5 CTACTCCTGAG GATG gtgtgtatg g CttCC 247 GNAS complex locus NM 000516.3 BG911454.1 1,4 GACGCCAGGGTTTGGgtgctggag aatctg 248 hemoglobin, alpha NM 000517.3 AA343446.1 1,7 TGCTTCTCCCCGCAGgatgttcctgt 1/2 CCtt 249 WD repeat domain 1 NM 005112.3 BX648190.1 1,2 AAGTTCACAATTG G Cg a cca cag c cgcttt 250 retinoblastoma NM 005610.1 BF475362.1 0,8 TTTATTCATGGTGGTCATACTG binding protein 4 CCAAGATATCTGATTTCTCCTG GAATCCCAATGAACCTTGGGT GATTTGTTCTGTATCAGAAGAC AATATCATG 251 small EDRK-rich NM 005770.2 BG435668.1 0,9 CCGTCGCCATGACCCgcggtaacc factor 2 agcgtg 252 putative translation NM 005801.2 BG655736.1 0,7 CCTTTGTGCTTGCAGaagtttgcctg initiation factor caat 253 membrane NM 005898.2 AA437165.1 0,8 AACAGCTTCAAACAGt99 tt99 cact ch t, tacc chromosome 11, surface marker 1 254 capping protein (actin NM 006136.1 BG702980.1 0,7 filament) muscle Z-line, alpha 2 AGCAAAAAA 1 1 1 1 1 Ggaatggtcgtt ggag 255 microspherule NM 006337.3 CB110581.1 1,4 AACTCTGTGGTGGAGgtgagctgg protein 1 ggagga 256 Transforming, acidic NM 006342.1 BM313828. 1,3 ACAGTGGAGCAGAAGgtgggtgcg coiled-coil containing 1 qgaagc protein 3 257 acidic (leucine-rich) NM 006401.1 AI446778.1 1,2 AAGACGAGGACGATGAGGATG nuclear GTGAAGAAGAGGAGTTTGATG phosphoprotein 32 AAGAAGATGATGAAGATGAAG family, member B ATGTAGAAGGGGATGAGGACG ACGATGAAGTCAGTGAGGAG 258 acidic (leucine-rich) NM 006401.1 AI446778.1 1,3 AGGAGGAGGACGAAGaaggaga nuclea r agatgagg phosphoprotein 32 phosp family, member B 259 granulysin/granulysin NM 006433.2 BI838502.1 0,8 GATAAGCCCACCCAGagaagtgttt ccaat 260 lysosomal associated BQ006415.1 1,4 aqatgctccaqaaggtgagtgtggctgcag multispanning membrane protein 5 NM 006762.1 261 lysosomal associated BQ006415.1 1,5 AAGATGCTCCAGAAGgtgagtgtgg multispanning 99 membrane protein 5 NM 006762.1 ctgca 262 transforming growth AI610679.1 1,4 CGGGCTACNANATGCGCTTGG factor, beta 1 GGGGAGCCAGGACGGAGGAA (Camurati- GAGGAGAGAGAAAGAGA Engelmann disease) NM000660.1 263 myosin IF NM 012335.2 BF823263.1 1,3 GTGGACAATGGGAAGctgctggaa gggcct 264 ribosomal protein BM826692. 0,8 AAGCCTACAAGAAAGtttgcctatct L13a NM 012423.2 1 gggg 265 glutathione SN59567.1 1,4 CATGCTGTTCCTTCCTCGCCAC transferase pi NM 000852.2 CCTCTGCTTC 266 chromosome 11 open BE041814.1 0,8 TCGCGGGGCAAAATGgagctcga reading frame 10 NM 014206.1 ggccatg 267 integrin, alpha X AA251543.1 1,4 TGCGACCGCCTACAGgtgacctcc (p150)), , (antigen aaagct (p 15 alpha pha polypeptide) NM 000887.3 268 ring finger protein 10 NM 014868.3 BM471027. 1,3 ACTTTCTGCTGACCCCTCTGTC 1 ACCCACTGCCAGTCAGGGCAG TCCCTCATTCTGCGTTGGGAGT CTGGAAGAAGACTCTCCCTTC 269 ribosomal protein L4 NM 000968.2 CB164625.1 0,7 GGTGCTTTTGGAAACatgtgtcgtg gaggc 270 KIAA0690 NM 015179.2 BG506709.1 TCTCTGGGCAAGCAGaaagcaaa 1,6 aggtgat 271 KIAA0690 NM 015179.2 BG506709.1 1,6 GAATACAAGGCCAAGaaagcaaa aggtgat 272 bridging integrator 2 NM 016187.1 AK093638.1 0,9 AAGGCCATCGTATGGaataatgatc tCCtt 273 ribosomal protein NM 000990.2 BG824148.1 0,8 CAACTTCGACAAATAccacccagg L27a ctactt 274 ribosomal protein NM 000990.2 BI906450.1 0,8 gccacccaggctactttgggaaagt L27a 275 ribosomal protein NM 000990.2 CB119057.1 0,7 GCCACGGCCGCATAGgcaagcac L27a cggaagc 276 ankyrin repeat NM 017664.1 BCO39715.1 0,7 GGAGCATTCCATATAGAAACTG domain 10 CTGAAACTGCCACAGGTGCTT CTCCGAAAACCTTACAGTTGTG GCATTGAATGTTCAGTATCGCT TCCTTTCTGCACACG 277 ribosomal protein NM 000992.2 BQ335217.1 0,7 CCAAGTTCCTGAGGAacatgcgctt L29 tgcca 278 ribosomal protein NM 001017.2 CA843486.1 0,8 TTCCGTCTGATTCTAATAGAGA S13 GCCGGATTCACCGTTTGGCTC GATATTATAAGACCAAGCGAGT CCTCCCTCCCAATTGGAAATA 279 CNDP dipeptidase 2 NM 018235.1 AW502844. 1,5 CACAGCAGCATCAAGgtggagtgc (metallopeptidase 1 agcaac M20 family) 280 ribosomal protein NM 001021.2 BE731466.1 0,5 AATTATGTTCCTGAGgtctcagcctt S17 ggat 281 ribosomal protein NM 001021.2 AV752729.1 0,6 GCCGCGTTCCACCAAAACCGT S17 GAAGAAAGCGGCCCGGGTCAT CATAGAAAAGTACTACACGCG CCTGGGCAACGACTTCCACAC GAACAAGCGCGTGTGCGAGGA GATCGCCATTATCCCCAGCAAA AAGCTCCG 282 sulfatase 2 NM 018837.1 AB033073.2 1, 5 GCGAGAGTGTGTCGAgtgagtgtg cgtctg 283 ubiquitin B NM 018955.2 BG286180.1 1, 6 G AAG G C G GAAAAGAG g tca a a atg cagatc 284 ubiquitin B NM 018955.2 BG286180.1 1,4 GGTATCCGCTAACAGgtcaaaatg cagatc 285 glycogen synthase NM 019884.1 BQ927367.1 1,3 ACTTCAGTGCTGGTGgtgagggCa kinase 3 alpha tagcct 286 KIAA1185 protein NM 020710.1 BQ352354.1 1,1 GTGTTAGGAGCGGAGataccttca cttgct 287 peptidylprolyl NM 021130.1 AA340318.1 0,8 CGCGTCTCCTTTGAGctgtttgcag isomerase A acaag (cyclophilin A) 288 peptidylprolyl NM 021130.1 AA357116.1 0,8 TCCCAAAGACAGCAGaaaattttcgt isomerase A gctc (cyclophilin A) 289 peptidylprolyl NM 021130.1 BE293161. 1 0,7 CGCGTCTCCTTTGAGgtacggggc isomerase A ctggat (cyclophilin A) 290 hypothetical protein NM 024841.1 J BQ919225.1 ' ATTGTTGAACTGGATgcggctgttg FLJ14213 1,4 aagag 291 hypothetical protein NM 024841.1 AK055042.1 0,8 GATCAACGTTTTCAAAG G G G G FLJ14213 TGGCTTGCAAAGCAACGAGCT CTATGCCCT 292 ring finger protein 34 NM 025126.2 NM_194271 0,9 GTCGCGGCCATGAAGgtggggga .1 gtggtac 293 t-complex 1 NM 030752.1 AA160249.1 0,7 TTCTTC I I I I CCTAGgggtctcggg aacag 294 RAB34, member NM 031934.3 W32152.1 1,6 AAGAAG GAT CTGAGTg tg a g tg tg c RAS oncogene cagtg family 295 zinc finger, CCHC NM 032226.1 BG530408.1 0,8 ACTGGTCCATCAGTGACAAAG domain containing 7 ACATTGAG 296 guanylate binding NM 052942.2 AA164465.1 1,5 AAAAAAAGAAGAAAGaggcacaag protein 5 tgaaag 297 ubiquitin-conjugating NM 058167.2 BG766070.1 1,4 TGCGCGGAACCCGAGatgagcag enzyme E2, J2 caccagc 298 chloride intracellular NM 001288.3 AAl26087.1 0,8 CCTGAGTCCAACACAGCTGGG channel 1 CTGGACATATTTGCCAAAIIII CTGCCTACATCAAGAATTCAAA CCCAGCACTCAATGACA 299 chloride intracellular NM 001288.3 AAl26087.1 0,7 CCCAGGTACCCCAAGctggcagct channel 1 ctgaac 300 chloride intracellular NM 001288.3 AAl26087.1 0,8 GCTGTGCCCTCCCAGgtaccccaa channel 1 gctggc 301 peptidylprolyl NM 130906.1 BU195819.1 1,4 CCCAAAACATGTGAGgctggagta isomerase caatgg (cyclophilin)-like 3 302 cAMP responsive NM 134442.2 BI831922.1 0,8 GCCACATTAGCCCAGgtatctatgc element binding cagca protein 1 303 heat shock 70kDa NM 153201.1 CD655642.1 0,8 GCCTACCTTGGGAAGactgttacca protein 8 atgct 304 E74 like factor 1 (ets NM 172373.2 AA251399.1 0,8 GCCAAGAAGCTTGAGAGAAGA domain transcription AAAATTTCAGAAAAATTGTCTC factor) AATTTGACTAGAATATCAATGA ACCAGGAAAACTGAAGCACCT TCCCTAAAGAAAACTTGGGTAT ACAATTACTCCACAGACAGAGC TGAGGG 1 1 1 1 1 1 ACCCAAATCA GTCACTGGATTTTGCTGCCTGA TACGTGAATCTTCTTGGAATTT TTCTCATGTGGATCTAAGGGGA ATG CTTTATTATG G CTG CTGTT GTCCAACAGAACGACCTAGTAT TTGAATTTGCTAGTAACGTCAT G 305 eukaryotic o lion NM 001402.4 BX454657.1 0,8 GGCTTCACTGCTCAG t attatcct elongation on factor r 1 g g alpha 1 gaac 306 eukaryotic translation NM 001402.4 BQ365319.1 0,8 AGCTTCTCAGACTATccacctttggg elongation factor 1 taag alpha 1 307 arachidonate 5- 0,9 AAGTGGAGCACGAAAGCAGGA lipoxygenase-CCCAGAATGGGAGGAGCTTCC activating protein NM 001629.2 BF892107.1 AGAGGACCGGAACACTTGCCT TTGAGCGGGTCTACACTGCCA A 308 ribosomal protein NM 021104.1 EXH-001 0,7 AAGAAAGATGAGGCAGAGGTC L41 CAAGTAAACCGCTAGCTTGTTG 309 ribosomal protein NM 021104.1 EXH-002 0,6 GAAGCGAATGCGCAGgctgaagc L41 gcaaaag 310 translationally NM 003295.1 EXH-003 0,5 AATGCATATTTAAACTAAATTGA controlled tumor TCCTGTAGTGTTCCTGGAGAA protein 1 GCTAGAGCCTGATTGTAGGCT ACTACTCATCAATTAACTTCTA CAGTGGAGACTACTTCTGGGA CTGGAATATAAAAA 311 G1 to S phase NM 002094.1 EXH004 1,5 ATTACCGTTTATTCCATATCTG transition 1/G1 to S GATAATTTGCCGAACTTCAATA phase transition 1 GATCAGTTGATGGACCAATCA GGCTGCCAATTGTGG 312 cyclin T2 NM 001241.2 EXH-005 0,5 ttgtgtgagctattcaaactcttcaacccctga 313 cyclin T2 NM 001241.2 EXH-006 0,7 AGCCAGTTGTCATTTttacaggattg tgtq 314 zinc finger protein NM 133476.2 EXH-007 1,5 AGTTCAGGAGCCCTGGAAAGG 384 AGAAGGAATAAGACGGCAGGA GGAAGAGA 315 TBC1 domain NM 020773.1 EXH-008 CTACCTAATTGATTGcccggggcc family, member 14 0,7 ctgatt 316 F-box protein 7 NM 012179.2 EXH-009 GGAAGCGCGGGTGGTCGGCT GGGGTCCGGCTCCTGGAGAAC ATGGCCCGGCCTCCCGGGGG 1,3 CT 317 F-box protein 7 NM 012179.2 EXH-010 CTGGTCCCCTCCTCGttctaatacc 1,4 cgatt 318 maltase- GTAATTCAACTGCCAAGTGGTG glucoamylase (alpha- GAAGAGGGAAATAGAAGAACT glucosidase) NM 004668.1 EXH-011 ATACAACAATCCACAGAATCCA 1,4 GAGAG Les variants SEQ ID Nos: 308-318 correspondent à de nouvelles ESTs Tableau 2 ù Liste de 100 transcrits exprimés différentiellement lors du développement d'un cancer du sein SEQ Description de la N Genbank N Genbank C 1/II Séquence Cible ID N séquence (reference) (variant) vs. Sains cDNA DKFZp451 G 151 (Leucinerich repeat GAATGAATTTTCTTGctgctatgcct 3 kinase 2) AL832453.2 BU634341.1 0,8 ttct cDNA DKFZp667I093 AL832878.1 Al223156.1 0,7 TGCAGATTTGATGGCctactgtga (Guanine nucleotide agcaca binding protein, gamma 2) 7 Homo sapiens B0009917.1 ; BCO28225.1 1,4 ATGAAGAAAAACAAAgtgcacag hypothetical protein agacccg DKFZp761AO52 12 Microtubule associated BC048206.1 AF052170.1 1,4 TGGGAGGGTCGACCTTGATC monoxygenase, ATGAAACAATACCATGAGGGG calponin and LIM GCCTCTGTCACCTTTGAAAAG domain containing 2 AACACTTTTTGAGCAGCCTCA AAAAGCTCATACATACCAGCG CCTTCTTAAATTGGCTCTAATG TAAAGATTGTTAATGTCATTTA TCAAAACCATAGGTGATTATTT GGAGGGATTTAAAAAACTTAA TTACTCTCAGGCCTCATCCCA AGCTTGACACATGCTCTGTAG 1GTTGAACACATAATCACAAAT ATTCTAGCAAATGCTGCCTTG GTTGCAGCCTGCACTGTAGAC CCAAGGGTTTTGCTGTGGCTC TTCTTATCTCCCTTGGCTCATA AAGCCCCAGATGATGCCAGA GCTTCAATTAGAGCCATCATC AT C C CAG G CAG G GATATCTTT GAGAAATGACTCAGTTCAGCC CCAGGCCCCTGTGACTCTGCT TAAAGCACACATTTCTGCTGA CTCTTGTACCTGGGGCAGCA GGATAATCACCAACAC 14 cellular homolog of NM 001997.2 W17004.1 0,6 GGCCGCATGCTTGGAaggtaaa the fox sequence in the gtccatgg Finkel-Biskis-Reilly murine sarcoma virus 16 cellular homolog of NM 001997.2 AA063591.1 0,7 TGACCGGCCAGGAAAcggtcgcc the fox sequence in the cagatca Finkel-Biskis-Reilly murine sarcoma virus 17 cellular homolog of NM 001997.2 AA094898.1 0,6 CAGGCCGCATGCTTGaggtaaa the fox sequence in the gtccatgg Finkel-Biskis-Reilly murine sarcoma virus 18 cellular homolog of NM 001997.2 AA187006.1 0,7 AGGCCGCATGCTTGGaggtaaa the fox sequence in the gtccatgg Finkel-Biskis-Reilly murine sarcoma virus 19 cellular homolog of NM 001997.2 AA225636.1 0,6 GGCCAGGAAACGGTCgcccaga the fox sequence in the tcaaggta Finkel-Biskis-Reilly murine sarcoma virus 23 cellular homolog of NM 001997.2 BU603086.1 0,6 g gg the fox sequence in the GAAGTAGCAGGCCGCat cttFinkel-Biskis-Reill aggtaaa y murine sarcoma virus 24 cellular homolog of NM 001997.2 BF218408.1 0,6 AGGCCGATGCTTGGA taaa t the fox sequence in the gg g Finkel-Biskis-Reilly ccatggt murine sarcoma virus 25 cellular homolog of NM 001997.2 AI499403.1 0,7 CCCGCATGCTTGGAGgtaaagtc the fox sequence in the catggtt Finkel-Biskis-Reilly murine sarcoma virus 26 cellular homolog of NM 001997.2 AW795076.1 0,6 CGGTCGCCCAGATCAaggctcat the fox sequence in the gtagcct Finkel-Biskis-Reilly murine sarcoma virus 35 SEC14-like 1 (S. NM 003003.1 AK130317.1 1,3 TAG G G CTAGTAG G TAG G G CT cerevisiae) AGTAGGTAGGGCTAGTAGGTA G G G CTAG TAG G TAG G G CTAG TAGGTAGGGCTAGTAGGTAG G G CTAGTAG GTAG G GTTCGTA GGTAGGGTTCGTAGGTAGGG TTCGTAGGTAGGGTTAGTAGC GCGTCTGTGCTGCTTCCACCT GGTGCTTCCTGTTCCCAAATC ACAAGGGCCTGAAGGTGGTC CCTGCTTTCTCTTTCTCTTTCT CTGTGTCTCAGATGGCGATTT TGCTGACAGCTGCCAAGAAAA TGCTTCACTCAACAGTCCTCA TGTGCCCAGAGATGTTTATAG AACTGTTTGAATTGCAGCCAT CCCCTGCCCCCTCCCAGGCT GAAGATCTGTTCIIIiIAAGTT GATTCG G GAGTG G CATTCTTT TATACCCAAAGACTGTAGTGC AT CTTGAAGAG CTCAAAG CAC ATGACCGCACAAATGCTTACA GGGTTTCCTCCCGAGTAATCC AATCTCACTCCCCTTGTAAGG 43 nuclear factor NM 003204.1 BM973053.1 1,5 CGGGTCAGTGTACAGgaagagg (erythroid-derived 2)- caggcact like 1 44 nuclear factor NM 003204.1 BM973053.1 1,5 TGCTGTGAGGCAGAGgaatgatg (erythroid-derived 2)- gagaatc like 1 46 synuclein, alpha (non NM 000345.2 NM 007308.1 1,4 _ CCACAGGAAGGAATTCTGGAA A4 component of GATATGCCTGTGGATCCTGAC amyloid precursor) AATGAGGCTTAT 47 synuclein, alpha (non NM 000345.2 NM 007308.1 1,5 GACCAGTTGGGCAAGaatgaag A4 component of aaggagcc amyloid precursor) 1 translationally- NM 003295.1 CA848049.1 0,7 AGAtcgcggacgggttgtgcctggaggT controlled tumor GG protein 1 52 translationally- NM 003295.1 CA848049.1 0,5 TCCGACATCTACAAGatccggga controlled tumor gatcgcg protein 1 53 translationally- NM 003295.1 BCO22436.1 0,6 CCGACATCTACAAGATCCGGG controlled tumor AGATCGCGGACGGGTTGTGC protein 1 CTG 54 translationally- NM 003295.1 BC040008.1 0,5 GCGCCGCTCCGGCTGCACCG controlled tumor CGCTCGCTCCGAGTTTCAGG protein 1 CTCGTGCTAAGCTAGCGCCGT CGTCGTCTCCCTTCAGTCGCC ATCATGATTATCTACC 55 TYRO protein tyrosine NM 003332.1 BF092099.1 0,8 GAGGGGCTGCGGAGGcagcga kinase binding protein cccggaaac 60 cold shock domain NM 003651.3 B0009744.1 1,5 CCCAACAGAATACAGgctggtga protein A gattgga 64 beta-2-microglobulin NM 004048.2 AV734235.1 0,7 ATTTG GATTG GATGAATTCCA AATT CT G CTT G CTT G CTTTTTA ATATTGATATGCTTATACACTT ACACTTTATGCACAAAATGTA GGGTTATAATAATGTTAACAT G GACATGATCTTCTTTATAATT CTACTTTGAGTGCTGTCTCCA TGTTTGATGTATCTGAGCAGG TTGCTCCACAGGTAGCTCTAG GAGGGCTGGCAAC 69 SEC24 NM 004922.1 GCAGCTGTCTGGAATgcagagg family, member C C ( (S. AW449995.1 1,8 cagctgga cerevisiae) 71 hemoglobin, alpha 1/2 NM 000517.3 H78334.1 C G G T CAA CTTT CAA G et c ctta a g 1,4 ccactg 72 hemoglobin, alpha 1/2 NM 000517.3 H55830.1 TCAAGGCCGCCTGGGgatgttcct 1,4 .gtCCtt 80 hemoglobin, alpha 1/2 NM 000558.3 R91899.1 ACCAAGACCTACTTCccggtcaac 1,7 ttcaag 81 hemoglobin, alpha 1/2 NM 000558.3 H58664.1 GGAGGCCCTGGAGAGctcctaag 1,7 CCaCtgC 85 CD164 antigen, NM 006016. GACTGTGATTTATTTGGAGTGG AGGCATTG 1 1 1 1 1 AAGAAAAAC ATGTCATGTAGGTTGTCTAAAA ATAAAATGCATTTAAAC 228 translationally- NM 003295.1 AV749932.1 0.6 CGTCGTCTCCCTTCAGTCGCC controlled tumor ATCATGATTATCTACC contro protein 1 229 translationally- NM 003295.1 AV749932.1 0.6 GGGACCTCATCAGCCacgatgag controlled tumor atgttct protein 1 230 vasodil ator- NM TTTATTTCCTACCAGcagg agg ag stimulaulated ccagag phosphoprotein 231 cold shock domain 1,2 CCCAAGGTACCGTAGcaggggac protein A NM 003651.3 BG739968.1 ctcctcg 232 cold shock domain 1.5 AATAACCCACGGAAATATCTGC protein A NM 003651.3 BE935120.1 .GCAGTGTAGGAGATGGAGAAA CTGTAGAGTTTGATGTGGTTGA AGGAGAGAA 233 cold shock domain 1.5 GTATTTGTACATCAGactgccatca protein A NM 003651.3 BE935120.1 agaag 234 chromosome 22 open 1.5 GAAGCAGAAGAGGTGTGAAAG reading frame 19 NM 003678.2 AK122673.1 AAGGTGCTGCTGGGAGGGGA GTCTGACAACCCAGC 235 Homo sapiens 0.8 ACGACATCGCAGAAGGTGGCT ve mb assoo CGGAAATTCTGGTGGAAGAAC membrane protein 8 GTGAAGATGATTGTCCTTATCT (endo brevin) NM 003761.2 BG623073.1 GCGTGATTG11111ATCATCAT CCTCTTCATTGTGCTCTTTGCC ACTGGTGCCTTCTCTTAAGTAA CAGGGAACCTCTCCCACCTGC CCTTC IIII CAGGGACAACCCT CCATAAATGTGTGCCAAGAGG GTCTCCTTTCCTGTCTTCCTCT ACAGAGAATGCTGCTCGGTCC TCCTACCCCTCTTCCCGAGGC CTGCTGCCACGTTGTATGCCC CAGAAGGTACCTTGGTCCCCC GGAAGGAGAGAA 236 Homo sapiens NM 003761.2 BG623073.1 0.7 GATCTGGAAGCCACAtctgagcact vesicle-associated membrane protein 8 tcaag (endobrevin) 237 CASPB and FADD- NM 003879.3 BI871546.1 1.3 AGTACAAGCAGTCTGgtggatgga like apoptosis regulator 238 atggaa beta-2-microglobulin BM NM 004048.2 831738. 0.8 AG GTTTACTCACGTCATCCAG C 1 AGAGAATGGAAAGTCAAATTTC CTGAATTGCTATGTGTCTGGGT TTCATCCATCCGACATTGAAGT TGACTTACTGAAGAATGGAGA GAGAATTGAAAAAGTGGAGCA TTCAGACTTGTCTTTCAGCAAG GACTGGTCTTTCTATCTCTTGT ACTACACTGAATTCACCCCCAC TGAAAAAGATGAGTATGCCTGC CGTGTGAACCATGTGACTTTGT CACAGCCCAAGATAGTTAAGT GGG 239 beta-2-microglobulin NM 004048.2 BM831738 . 0.7 TG GAG GCTATCCAGCgtactccaa 1 agatam 240 Guanine nucleotide NM004125.2 BC015206.1 0.6 GTGGAGAGGATCAAGgtctctcag binding protein, gcagct gamma 10 241 BCL2-associated NM 004323.2 BI826041.1 1,2 TGACTGTCACCCACAgcaatgaga athanogene agcacg 242 c-src tyrosine kinase NM 004383.1 BG953215 .1 1,1 CCTCAAGTTCTCGCTagatgtctgc gaggc 243 HLA-B associated NM 004640.3 CB148695.1 1,4 GCCCCAGGAGGAGAGgtgagctg transcript 1 aagatgg 244 family with sequence NM 004699.1 AA024425.1 0.8 CACGGGGGAAGAGTGgaccactct similarity 50, tcaact member A 245 family with sequence NM 004699.1 AA584911 .1 1.6 TCAAGAGTGAGTGTTtgcggagtca si gaggc similarity 5 50, member A 246 GNAS complex locus NM 000516.3 BU784018.1 1.5 CTACTCCTGAG GATG gtgtgtatg g CttCC 247 GNAS complex locus NM 000516.3 BG911454.1 1,4 GACGCCAGGGTTTGGgtgctggag aatctg 248 hemoglobin, alpha NM 000517.3 AA343446.1 1.7 TGCTTCTCCCCGCAGgatgttcctgt 1/2 CCtt 249 WD repeat domain 1 NM 005112.3 BX648190.1 1.2 AAGTTCACAATTG G Cg a cca cag c ccctt 250 retinoblastoma NM 0 05610.1 BF475362.1 0.8 TTTATTCATGGTGGTCATACTG binding protein 4 CCAAGATATCTGATTTCTCCTG GAATCCCAATGAACCTTGGGT GATTTGTTCTGTATCAGAAGAC AATATCATG 251 small EDRK-rich NM 005770.2 BG435668.1 0.9 CCGTCGCCATGACCCgcggtaacc factor 2 agcgtg 252 NM 005801.2 putative translation initiation factor BG655736.1 0.7 CCTTTGTGCTTGCAGaagtttgcctg caat membrane 253 NM 005898.2 AA437165.1 0.8 AACAGCTTCAAACAGt99 tt99 cactus, tacc chromosome 11, surface marker 1 254 capping protein (actin NM 006136.1 BG702980.1 0.7 filament) Z-line muscle, alpha 2 AGCAAAAAA 1 1 1 1 1 Ggaatggtcgtt ggag 255 Microspherule NM 003333.3 CB110581.1 1.4 AACTCTGTGGTGGAGgtgagctgg protein 1 ggagga 256 Transforming, acidic NM 006342.1 BM313828. 1.3 ACAGTGGAGCAGAAGgtgggtgcg coiled-coil protein Containing 1 qgaagc 3257 acidic (leucine-rich) NM 006401.1 AI446778.1 1.2 AAGACGAGGACGATGAGGATG GTGAAGAAGAGGAGTTTGATG nuclear phosphoprotein 32 AAGAAGATGATGAAGATGAAG family, member B 258 ATGTAGAAGGGGATGAGGACG ACGATGAAGTCAGTGAGGAG acidic (leucine-rich) NM 006401.1 AI446778. 1 1,3 AGGAGGAGGACGAAGaaggaga nuclea ragatagagg phosphoprotein 32 phosp family, member B 259 granulysin / granulysin NM 006433.2 BI838502.1 0.8 GATAAGCCCACCCAGagaagtgttt ccaat 260 lysosomal associated BQ006415.1 1,4 aqatgctccaqaaggtgagtgtggctgcag multispanning membrane protein 5 NM 006762.1 261 lysosomal associated BQ006415. 1 1.5 AAGATGCTCCAGAAGgtgagtgtgg multispanning 99 protein membrane 5 NM 006762.1 ctgca 262 transforming growth AI610679.1 1,4 CGGGCTACNANATGCGCTTGG factor, beta 1 GGGGAGCCAGGACGGAGGAA (Camurati-GAGGAGAGAGAAAGAGA Engelmann disease) NM000660.1 263 myosin IF NM 012335.2 BF823263.1 1,3 GTGGACAATGGGAAGctgctggaa gggcct 264 ribosomal protein BM826692. 0.8 AAGCCTACAAGAAAGTTtgcctatct L13a NM 012423.2 1 gggg 265 glutathione SN59567.1 1.4 CATGCTGTTCCTTCCTCGCCAC transferase pi NM 000852.2 CCTCTGCTTC 266 chromosome 11 open BE041814.1 0.8 TCGCGGGGCAAAATGgagctcga reading frame 10 NM 014206.1 ggccatg 267 integrin, alpha X AA251543.1 1, 4 TGCGACCGCCTACAGgtgacctcc (p150)), (aaagct antigen (p 15 pha alpha polypeptide) 268 NM 000887.3 ring finger protein 10 NM 014868.3 BM471027. 1.3 ACTTTCTGCTGACCCCTCTGTC 1 ACCCACTGCCAGTCAGGGCAG TCCCTCATTCTGCGTTGGGAGT CTGGAAGAAGACTCTCCCTTC 269 ribosomal protein L4 NM 000968.2 CB164625.1 0.7 GGTGCTTTTGGAAACatgtgtcgtg gaggc 270 KIAA0690 NM 015179.2 BG506709.1 TCTCTGGGCAAGCAGaaagcaaa 1.6 aggtgat 271 KIAA0690 NM 015179.2 BG506709.1 1.6 GAATACAAGGCCAAGaaagcaaa aggtgat 272 bridging integrator 2 NM 016187.1 AK093638.1 0.9 AAGGCCATCGTATGGaataatgatc tCCtt 273 ribosomal protein NM 000990.2 BG824148.1 0.8 CAACTTCGACAAATAccacccagg L27a ctactt 274 ribosomal protein NM 000990.2 BI906450.1 0.8 gccacccaggctactttgggaaagt L27a 2 75 ribosomal protein NM 000990.2 CB119057.1 0.7 GCCACGGCCGCATAGgcaagcac L27a cggaagc 276 ankyrin repeat NM 017664.1 BCO39715.1 0.7 GGAGCATTCCATATAGAAACTG domain 10 CTGAAACTGCCACAGGTGCTT CTCCGAAAACCTTACAGTTGTG GCATTGAATGTTCAGTATCGCT TCCTTTCTGCACACG 277 ribosomal protein NM 000992.2 BQ335217.1 0.7 CCAAGTTCCTGAGGAacatgcgctt L29 ribosomal protein tgcca 278 NM CACCPTCTGATTCTAATAGAGA S13 GCCGGATTCACCGTTTGGCTC GATATTATAAGACCAAGCGAGT CCTCCCTCCCAATTGGAAATA 279 CNDP Dipeptidase 2 NM 018235.1 AW502844. 1.5 CACAGCAGCATCAAGgtggagtgc (1 agcaac metallopeptidase M20 family) 280 ribosomal protein NM 001021.2 BE731466.1 0.5 AATTATGTTCCTGAGgtctcagcctt GGAT 281 ribosomal protein S17 NM 001021.2 AV752729.1 0.6 GCCGCGTTCCACCAAAACCGT S17 GAAGAAAGCGGCCCGGGTCAT CATAGAAAAGTACTACACGCG CCTGGGCAACGACTTCCACAC GAACAAGCGCGTGTGCGAGGA GATCGCCATTATCCCCAGCAAA AAGCTCCG 282 sulfatase 2 NM 018837.1 AB033073 .2 1, 5 GCGAGAGTGTGTCGAgtgagtgtg cgtctg 283 ubiquitin B NM 018955.2 BG286180.1 1, 6 G AAG GCG GAAAAGAG gtca aa atg cagatc 284 ubiquitin B NM 018955.2 BG286180.1 1,4 GGTATCCGCTAACAGgtcaaaatg cagatc 285 glycogen synthase NM 019884.1 BQ927367.1 1, 3 ACTTCAGTGCTGGTGgtgagggCa kinase 3 alpha tagcct 286 KIAA1185 protein NM 020710.1 BQ352354.1 1.1 GTGTTAGGAGCGGAGataccttca cttgct peptidyl 287 NM 021130.1 AA340318.1 0.8 CGCGTCTCCTTTGAGctgtttgcag acaag isomerase A (cyclophilin A) 288 NM 021130.1 peptidyl prolyl isomerase A AA357116.1 0.8 TCCCAAAGACAGCAGaaaattttcgt gctc (cyclophilin A) 289 peptidylprolyl NM 02113 0.1 BE293161. 1 0.7 CGCGTCTCCTTTGAGgtacggggc isomerase A ctggat (cyclophilin A) 290 hypothetical protein NM 024841.1 J BQ919225.1 'ATTGTTGAACTGGATgcggctgttg FLJ14213 1.4 aagag 291 hypothetical protein NM 024841.1 AK055042.1 0.8 GATCAACGTTTTCAAAG GGGG FLJ14213 TGGCTTGCAAAGCAACGAGCT CTATGCCCT 292 ring finger protein 34 NM 025126.2 NM_194271 0,9 GTCGCGGCCATGAAGgtggggga .1 gtggtac 293 t-complex 1 NM 030752.1 AA160249.1 0.7 TTCTTC IIII CCTAGgggtctcggg aacag 294 RAB34, member NM 031934.3 W32152.1 1.6 AAGAAG GAT CTGAGTg tg ag tg tg c RAS oncogene cagtg family 295 zinc finger, CCHC NM 032226.1 BG530408.1 0.8 ACTGGTCCATCAGTGACAAAG domain containing 7 ACATTGAG 296 guanylate binding NM 052942.2 AA164465.1 1.5 AAAAAAAGAAGAAAGaggagacagainte protein 5 tgaaag 297 ubiquitin-conjugating NM 058167.2 BG766070.1 1,4 TGCGCGGAACCCGAGatgagcag enzyme E2, J2 0.8 CCTGAGTCCAACACAGCTGGG channel 1 CTGGACATATTTGCCAAAIIII CTGCCTACATCAAGAATTCAAA CCCAGCACTCAATGACA 299 chl caccagc 298 chloride intracellular NM 001288.3 AAl26087.1 intracellular orine intracellular NM 1 021 031 0171 ccagggtagctcgggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggg responsive NM 134442.2 BI831922.1 0.8 GCCACATTAGCCCAGgtatctatgc element binding cagca protein 1 303 heat shock 70kDa NM 153201.1 CD655642.1 0.8 GCCTACCTTGGGAAGactgttacca protein 8 atgst 304 E74 like factor 1 (and NM 172373.2 AA251399.1 0.8 GCCAAGAAGCTTGAGAGAAGA domain transcription AAAATTTCAGAAAAATTGTCTC factor) AATTTGACTAGAATATCAATGA ACCAGGAAAACTGAAGCACCT TCCCTAAAGAAAACTTGGGTAT ACAATTACTCCACAGACAGAGC TGAGGG 1 1 1 1 1 1 ACCCAAATCA GTCACTGGATTTTGCTGCCTGA TACGTGAATCTTCTTGGAATTT TTCTCATGTGGATCTAAGGGGA CTTTATTATG G ATG GTC CTGTT GTCCAACAGAACGACCTAGTAT TTGAATTTGCTAGTAACGTCAT G 305 o eukaryotic lion NM 001402.4 BX454657.1 0.8 GGCTTCACTGCTCAG t attatcct elongation factor r is 1 gg alpha 1 GAAC 0.8 AGCTTCTCAGACTATccacctttggg elongation factor 1 taag alpha 1 307 arachidonate 5- 0,9 AAGTGGAGCACGAAAGCAGGA lipoxygenase-CCCAGAATGGGAGGAGCTTCC activating protein NMAGGACCGGAACACTTGCCT TTGAGCGGGTCTACACTGCCA A 308 ribosomal protein NM 021104.1 EXH-001 0.7 AAGAAAGATGAGGCAGAGGTC L41 ribosomal protein CAAGTAAACCGCTAGCTTGTTG 309 NM 021104.1 EXH-002 0.6 GAAGCGAATGCGCAGgctgaagc L41 gcaaaag 310 translationally NM 003295.1 EXH-003 0.5 AATGCATATTTAAACTAAATTGA controlled tumor protein 1 TCCTGTAGTGTTCCTGGAGAA GCTAGAGCCTGATTGTAGGCT ACTACTCATCAATTAACTTCTA CAGTGGAGACTACTTCTGGGA CTGGAATATAAAAA 311 G1 to S phase of NM 002094.1 EXH004 1.5 ATTACCGTTTATTCCATATCTG transition 1 / G1 to S GATAATTTGCCGAACTTCAATA transition phase 1 GATCAGTTGATGGACCAATCA GGCTGCCAATTGTGG 312 cyclin T2 NM 001241.2 EXH-005 0.5 ttgtgtgagctattcaaactcttcaacccctga 313 cyclin T2 NM 001241.2 EXH-006 0.7 AGCCAGTTGTCATTTttacaggattg tgtq 314 zinc finger protein NM 133476. 2 EXH-007 1,5 AGTTCAGGAGCCCTGGAAAGG 384 AGAAGGAATAAGACGGCAGGA GGAAGAGA 315 TBC1 domain NM 020773.1 EXH-008 CTACCTAATTGATTGcccggggcc family, member 14 0.7 ctgatt 316 F-box protein 7 NM 012179.2 EXH-009 GGAAGCGCGGGTGGTCGGCT GGGGTCCGGCTCCTGGAGAAC ATGGCCCGGCCTCCCGGGGG 1,3 CT 317 F-box EXH-010 CTGGTCCCCTCCTCGTTctaatacc 1,4 cgatt 318 maltase- GTAATTCAACTGCCAAGTGGTG glucoamylase (alpha-GAAGAGGGAAATAGAAGAACT glucosidase) NM 004668.1 EXH-011 ATACAACAATCCACAGAATCCA 1,4 GAGAG SEQ ID NO: 308-318 variants correspond to new ESTs Table 2 where List of 100 transcripts expressed differentially during the development of breast cancer SEQ Description of the N Genbank N Genbank C 1 / II Sequence Target ID N sequence (reference) (variant) vs.. Healthy cDNA DKFZp451G151 (Leucinerich repeat GAATGAATTTTCTTGctgctatgcct 3 kinase 2) AL832453.2 BU634341.1 0.8 ttct cDNA DKFZp667I093 AL832878.1 Al223156.1 0.7 TGCAGATTTGATGGCctactgtga (Guanine nucleotide agcaca binding protein, gamma 2) 7 Homo sapiens B0009917. 1; BCO28225.1 1.4 ATGAAGAAAAACAAAgtgcacag hypothetical protein agacccg DKFZp761AO52 12 Microtubule associated BC048206.1 AF052170.1 1.4 TGGGAGGGTCGACCTTGATC monoxygenase, calponin and LIM ATGAAACAATACCATGAGGGG GCCTCTGTCACCTTTGAAAAG domain containing 2 AACACTTTTTGAGCAGCCTCA AAAAGCTCATACATACCAGCG CCTTCTTAAATTGGCTCTAATG TAAAGATTGTTAATGTCATTTA TCAAAACCATAGGTGATTATTT GGAGGGATTTAAAAAACTTAA TTACTCTCAGGCCTCATCCCA AGCTTGACACATGCTCTGTAG 1GTTGAACACATAATCACAAAT ATTCTAGCAAATGCTGCCTTG GTTGCAGCCTGCACTGTAGAC CCAAGGGTTTTGCTGTGGCTC TTCTTATCTCCCTTGGCTCATA AAGCCCCAGATGATGCCAGA GCTTCAATTAGAGCCATCATC AT CC CAG G CAG G GATATCTTT GAGAAATGACTCAGTTCAGCC CCAGGCCCCTGTGACTCTGCT TAAAGCACACATTTCTGCTGA CTCTTGTACCTGGGGCAGCA GGATAATCACCAACAC 14 cellular homolog of NM 001997.2 W17004.1 0.6 GGCCGCATGCTTGGAaggtaaa the fox sequence in the gtccatgg Finkel-Biskis-Reilly murine sarcoma virus 16 cellular homolog of NM 001997.2 AA063591.1 0.7 TGACCGGCCAGGAAAcggtcgcc the fox sequence in the cagatca Finkel-Biskis-Reilly murine sarc oma virus 17 cellular homolog of NM 001997.2 AA094898.1 0.6 CAGGCCGCATGCTTGaggtaaa the fox sequence in the gtccatgg Finkel-Biskis-Reilly murine sarcoma virus 18 cellular homolog of NM 001997.2 AA187006.1 0.7 AGGCCGCATGCTTGGaggtaaa the fox sequence in the gtccatgg Finkel- Biskis-Reilly murine sarcoma virus 19 cellular homolog of NM 001997.2 AA225636.1 0.6 GGCCAGGAAACGGTCgcccaga the fox sequence in the tcaaggta Finkel-Biskis-Reilly murine sarcoma virus 23 cellular homolog of NM 001997.2 BU603086.1 0.6 g gg the fox sequence in the GAAGTAGCAGGCCGCat cttFinkel-Biskis-Reill aggtaaa y murine sarcoma virus 24 cellular homolog of NM 001997.2 BF218408.1 0,6 AGGCCGATGCTTGGA taaa t the fox sequence in the gg Finkel-Biskis-Reilly ccatggt murine sarcoma virus 25 cellular homolog of NM 001997.2 AI499403.1 0.7 CCCGCATGCTTGGAGgtaaagtc the fox sequence in the catggtt Finkel-Biskis-Reilly murine sarcoma virus 26 cellular homolog of NM 001997.2 AW795076.1 0.6 CGGTCGCCCAGATCAaggctcat the fox sequence in t he gtagcct Finkel-Biskis-Reilly murine sarcoma virus 35 SEC14-like 1 (S. NM 003003.1 AK130317.1 1.3 TAG TAG GG GG G CT CTAGTAG cerevisiae) AGTAGGTAGGGCTAGTAGGTA CTAG GGG TAG G TAG GG GG CTAG TAGGTAGGGCTAGTAGGTAG CTAGTAG GTAG G GTTCGTA GGTAGGGTTCGTAGGTAGGG TTCGTAGGTAGGGTTAGTAGC GCGTCTGTGCTGCTTCCACCT GGTGCTTCCTGTTCCCAAATC ACAAGGGCCTGAAGGTGGTC CCTGCTTTCTCTTTCTCTTTCT CTGTGTCTCAGATGGCGATTT TGCTGACAGCTGCCAAGAAAA TGCTTCACTCAACAGTCCTCA TGTGCCCAGAGATGTTTATAG AACTGTTTGAATTGCAGCCAT CCCCTGCCCCCTCCCAGGCT GAAGATCTGTTCIIIiIAAGTT GATTCG G G GAGTG CATTCTTT TATACCCAAAGACTGTAGTGC CTFGAAGAG CTCAAAG CAC ATGACCGCACAAATGCTTACA GGGTTTCCTCCCGAGTAATCC AATCTCACTCCCCTTGTAAGG 43 nuclear factor NM 003204.1 BM973053.1 1.5 CGGGTCAGTGTACAGgaagagg (erythroid-derived 2) - caggcact like 1 44 nuclear factor NM 003204.1 BM973053.1 1.5 TGCTGTGAGGCAGAGgaatgatg (erythroid-derived 2) - gagaatc like Synuclein, alpha (non-NM000345.2 NM 0031308.1 1,4-CCACAGGAAGGAATTCTGGAA A4 component of GATATGCCTGTGGATCCTGAC amyloid precursor) AATGAGGCTTAT synuclein, alpha (non-NM 000345.2 NM 007308. 1 1.5 GACCAGTTGGGCAAGaatgaag A4 component of aaggagcc amyloid precursor) 1 translationally- NM 003295.1 CA848049.1 0.7 AGAtcgcggacgggttgtgcctggaggT controlled tumor GG protein 1 52 translationally- NM 003295.1 CA848049.1 0.5 TCCGACATCTACAAGatccgga controlled tumor gatcgcg protein 1 53 translationally- NM 003295.1 BCO22436.1 0.6 CCGACATCTACAAGATCCGGG AGATCGCGGACGGGTTGTGC controlled tumor protein 1 CTG 54 translationally- NM 003295.1 BC040008.1 0.5 GCGCCGCTCCGGCTGCACCG CGCTCGCTCCGAGTTTCAGG controlled tumor protein 1 CTCGTGCTAAGCTAGCGCCGT CGTCGTCTCCCTTCAGTCGCC ATCATGATTATCTACC 55 TYRO protein tyrosine NM 003332.1 BF092099.1 0.8 GAGGGGCTGCGGAGGcagcga kinase binding protein CCCAACAGAATACAGgctggtga protein A gattgga 64 beta-2-microglobulin NM 004048.2 AV734235.1 0.7 ATTTG GATTG GATGAATTCCA AATT CT G CTT G CTT G CTTTTTA ATATTGATATGCTTATACACTT ACACTTTATGCACAAAATGTA GGGTTATAATAATGTTAACAT G GACATGATCTTCTTTATAATT CTACTTTGAGTGCTGTCTCCA TGT TTGATGTATCTGAGCAGG TTGCTCCACAGGTAGCTCTAGGAGGGCTGGCAAC 69 SEC24 NM 004922.1 GCAGCTGTCTGGAATgcagagg family, member C C ((S. AW449995.1 1.8 cagctgga cerevisiae) 71 hemoglobin, alpha 1/2 NM 000517.3 H78334.1 CAOT CGGT CTTT CAA G and c ct ag 1.4 ccactg 72 hemoglobin, alpha 1/2 NM 000517.3 H55830.1 TCAAGGCCGCCTGGGgatgttcct 1.4 .gtCCTT 80 hemoglobin, alpha 1/2 NM 000558.3 R91899.1 ACCAAGACCTACTTCcggggca 1.7 ttcaag 81 hemoglobin, alpha 1/2 NM 000558.3 H58664.1 GGAGGCCCTGGAGAGctcctaag 1.7 CCaCtgC 85 CD164 antigen, NM 006016.

3 AF299342.I CTGTGACTCCAACCTCACAAC sialomucin CTGTGCGAAAGTCTACCTTTG ATG CAG CCAGTTTCATTG GAG GAATTGTCCTGGTCTTGGGTG T G CA G G CT G TAATTTT CTTT CT TTATAAATTCTGCAAATCTAAA GAACGAAATTACCACACTCTG TAAACAGACCCATTGAATTAAT AAGGACTGGTGATTCATTTGT GTAACTCACTGAAGCCAAAAT ACTATCTTTTAAGATGTCCCAC ATGGAAGACGCTATTCCAGGA T CTTTAAATTT C C AT G G AT G CA TATAGGATGTTTGGGAGCATC 0,8 ATCCGTGAAGAAAAAATCAAT TAAATCATTGTGTTCAACAGG AATATTTAAAATATTCTGCATG AATCCTGTGGCTGTCTTAIIII AAATAGCTGCTGCTGTGGGAT TATA 1 1 1 1 1 ITCCTTAACATG CCAAATATAACTTTCTGAAAGT GATGGAAAATGTTGTCTTGTG CAGACAACATCATGGCTCTTG GCAGTTTA 86 CD164 antigen, NM 006016.3 AF299343.1 0,8 CAGCCAATTCTACAGctaaaccc sialomucin acagttc CCCAGAGTATTAACGCTCCAA GAGTATTATTAACGCTGCTGT ACCTCGATCTGAATCTGCCGG GGCCCCAGCCCACTCCACCC TGCCAGCAGCTTCCAGCCAGT CCCCACAGCCTCATCAGCTCT CTTCACCGIIIIIIGATACTAT CTTCCCCCACCCCCAGCTACC CATAGGGGCTGCAGAGTTATA AGCCCCAAACAGGTCATGCTC CAATAAAAATGATTCTACCTAC 88 talin 1 NM 006289.2 AI393487.1 1,4 AA 89 talin 1 NM 006289.2 AI393487.1 TGCTTCGGAAGGAACgagagctg 1,4 gaagagg 90 talin 1 NM 006289.2 AI417760.1 AGGAAGAAATGCTTCggaagga 1,4 acgagaqc 95 acidic (leucine-rich) NM 006401.1 Y07570.1 GAAGTCAGTGAGGAG aa aa nuclea r g g g hosphoprotein 32 1,5 aatttgga phosp family, member B 102 soluble galactoside- CTCCAGTGGAACCAGgtttgctgtg binding lectin 9 NM 009587.1 BG698264.1 0,8 aactt (galectin 9) 116 Homocysteine- NM 014685.1 BG828243.3 AF299342.I CTGTGACTCCAACCTCACAAC Sialomucin CTGTGCGAAAGTCTACCTTTG ATG CAG GAG CCAGTTTCATTG GAATTGTCCTGGTCTTGGGTG TG CT CA GG G CT TAATTTT CTTT TTATAAATTCTGCAAATCTAAA GAACGAAATTACCACACTCTG TAAACAGACCCATTGAATTAAT AAGGACTGGTGATTCATTTGT GTAACTCACTGAAGCCAAAAT ACTATCTTTTAAGATGTCCCAC ATGGAAGACGCTATTCCAGGA T CTTTAAATTT CC GG AT AT G CA TATAGGATGTTTGGGAGCATC 0.8 ATCCGTGAAGAAAAAATCAAT TAAATCATTGTGTTCAACAGG AATATTTAAAATATTCTGCATG AATCCTGTGGCTGTCTTAIIII AAATAGCTGCTGCTGTGGGAT TATA 1 1 1 1 1 ITCCTTAACATG CCAAATATAACTTTCTGAAAGT GATGGAAAATGTTGTCTTGTG CAGACAACATCATGGCTCTTG GCAGTTTA 86 CD164 antigen, NM 006016.3 AF299343.1 0.8 CAGCCAATTCTACAGctaaaccc Sialomucin acagttc CCCAGAGTATTAACGCTCCAA GAGTATTATTAACGCTGCTGT ACCTCGATCTGAATCTGCCGG GGCCCCAGCCCACTCCACCC TGCCAGCAGCTTCCAGCCAGT CCCCACAGCCTCATCAGCTCT CTTCACCGIIIIIIGATACTAT CTTCCCCCACCCCCAGCTACC CATAGGGGCTGCAGAGTTATA AGCCCCAAACAGGTCATGCTC CAATAAAAATGATTCTACCTAC 88 talin 1 NM 006289.2 AI393487.1 1.4 AA 89 talin 1 NM 006289.2 AI393487. 1 TGCTTCGGAAGGAACg agagctg 1,4 gaagagg 90 talin 1 AGGAAGAAATGCTTCggaagga 1,4 acgagaqc acidic (leucine-rich) NMAGGAGAGGAGGAG aa aa nuclea rggg hosphoprotein 32 1,5 aatttgga phosp family, member B 102 soluble galactoside- CTCCAGTGGAACCAGgtttgctgtg binding lectin 9 BG698264.1 0.8 aactt (galectin 9) Homocysteine-NM 014685.1 BG828243.

1 GGAAAACATCTCAAGgcctgaag inducible, endoplasmic 0,7 ctqccca reticulum stress- inducible, ubiquitin- like domain member 1 120 ring finger protein 10 NM 014868.3 BU626650.1 CGAGAGCGCAGGATTGAGAT 1,5 AGAGGAGAACA 121 ring finger protein 10 NM 014868.3 BU626650.1 AGAAACAGGGCAAGTacccaga 1,5 agtccaca 125 ribosomal protein L4 NM 000968.2 CB141160.1 GTCATCAGACTAGTGCTGAGT CTTGGGGTACTGGCAGAGCT GTGGCTCGAATTCCCAGAGTT CGAGGTGGTGGGACTCACCG CTCTGGCCAGGGTGCTTTTGG 0,7 AAACatgtgtcgtg 126 ribosomal protein L4 NM 000968.2 CD686462.1 0,6 GCGTGTGCTCGCCCACTGATA TCGGTGTACTCCGAAAAGGG GGAGTCATCTGGCAAAAATGT CACTTTGCCTGCTGTATTCAA GGCTCCTATTCGACCAGATAT TGTGAACTTTGTTCACACCAA CTTGCGCAAAAACAACAGACA GCCCTATGCTG 137 ubiquitin B NM 018955.2 AA206538.1 1,4 CAG GTCAAAATG CAG atcttcgtg aaaacc 138 ubiquitin B NM 018955.2 AA206538.1 1,3 CAG GTCAAAATG CAG atcttcg tg aagacc 139 ubiquitin B NM 018955.2 AA340917.1 1,4 AGATCTTCGTGAAGACCCTGA CCGGCAAGACCATCACTCTG GAGGTGGAGCCCAGTGACAC CATCGAAAATGTGAAGGCCAA GATCCAAGATAAAGAAGGCAT CCCCCCCGACCAGCAGAGGC T CAT CTTTG CAG G CAAG CAG C TG GAAGATG G CCG CACTCTTT CTGACTACAACATCCAGAAAG AGTCGACCCTGCACCTGGTC CTGCGCCTGAGGGGTGGCTG TTAATTCTTCAGTCATGGCATT CGCAGTGCCCAGTGATGGCA TTACTCTGCACTATAGCCATTT GCCCCAACTTAAGTTTAGAAA TTACAAGTTTCAGTAATAGCT GAACCTGTTCAAAATGTTAATA AAGG 140 ubiquitin B NM 018955.2 BU661443.1 1,4 TCCCTGTGGGTGGACGTGGT TGGTGATTGGCAGGATCCT 141 ubiquitin B NM 018955.2 BU661443.1 1,5 GGTTGGCTTTGTTGGgtgagcttg tttgtg major TCAGAGACAGTCTTCCTGCCC AGGGAAGACCACCTTTTCCGC AAGTTCCACTATCTCCCCTTC CTGCCCTCAACTGAGGACGTT TACGACTGCAGGGTGGAGCA CTGGGGCTTGGATGAGCCTC 145 c , DR NM 019111.2 CD686254.1 0,7 TTCTCAAGCACTGGGagtttgatg amplex, c1 class ass II, I ctccaagccctctc omp alpha 147 KIAA1191 protein NM 020444.2 BI254429. 1 1,5 GCAGCAGGATCACAGaacagac ccaggaaa mesoderm induction TTTCTCACACAGGCAtactccaaa 148 earlyresponse 1 NM 020948.1 AY124186.1 0,7 tgcttc myelin protein zero TGCCCTGGCATTCTGGCAGA GAATCCTCACCAGTTCTCACC AACCTTCCCCCCAGGCAAGG GCAGCTGCCAGCATGGTGCT CTGCCAGGACAGGTTTCCCTG AAGGAAGCTGCTCACACTGAG ATGAGCCTCTCAGGGCAGGA CCTCTTCCCAAGCCCTGCACA CCCACCCCTGCAGCCCTTTTG 151 like 1 NM 024569.2 AI693779.1 0,9 GCTC likely ortholog of GCTTCAGTCCGCCGAgagcagta moule limb-bud and 152 heart !eue ' NM 030915.1 W45195.1 1,1 ccgtgtg 155 caspase 4, apoptosis- NM 033306.1 NM 001225.2 1,2 TGTTCCCTATGGCAGaaggcaac related cysteine cacagaa protease 158 major NM 033554.2 BU621846.1 0,7 AAGGAGCCTGTGGAGctgggcca histocompatibility gcccaac complex, clans II, DP alpha 1 159 FK506 binding protein NM 054014.1 NM 000801.2 0,8 GCTCCCTGTTCTTGGatctgccat 1A, 12kDa ggaggg 160 chloride intracellular NM 001288.3 BG491600.1 0,8 CCAGGGACGGCCACTTCCTG channel 1 GTCCCCGACGCAACCATGGC TGAAGAACAACCGCAGGTCG AATTGTTCGTGAAG 161 chloride intracellular NM 001288.3 AV683308.1 0,7 GGGCAGCTCCCATTCctgctgtat channel 1 ggcact 164 chromosome 19 open NM 138774.2 AI375989.1 1,4 ACCTCATCTCGGCCAgtgctgacc reading frame 22 tggagg 177 tropomyosin 3 NM 153649.1 BM006741.1 1,3 TGATGAGAGTGAGAGgcagaga cccgtgct 184 eukaryotic translation NM 001404.3 AA206367.1 0,7 CTGAGTCCAGATTGGCAGGT elongation factor 1 GGACTACGAGTCATACACATG gamma GCGGAAACTGGATCCTGGCA GCGAGGAGACCCAGACGCTG GTT 185 eukaryotic translation NM 001404.3 AA206367.1 0,6 CAGCATGTGGGCAAAGCCTTC elongation factor 1 AATCAGGGCAAGATCTTCAAG gamma TGAACATCTCTTGCCATCACC TAG 187 eukaryotic translation NM 001404.3 BU783548.1 0,6 ATTTAAGCGCAAGTACTCCAA elongation factor l TGAGGACACACTCTCTGTGGC gamma ACT G C CATATTT CTG G GAG CA CTTTGATAAG GACG G CTG GTC CCTGTGGTACTCAGAGTATCG CTTCCCTGAAGAACTCACTCA GACCTTCATGAGCTGCAATCT CATCACTG 188 eukaryotic translation NM 001404.3 BE502067.1 0,6 TGGACAAGCTGAGGAAGAAT elongation factor 1 GCCTTCGCCAGTGTCATCCTT gamma TTTGGAACCAACAATAGCAGC TCCATTTCTGGAGTCTGGGTC TTCCGAGGCCAG 189 eukaryotic translation NM 001404.3 BE502067.1 0,6 CAGGTGGACTACGAGTCATAC elongation factor 1 ACATGGCGGAAACTGGATCCT gamma GGCAGCGAGGAGACCCAGAC GCTGGTTCGAGAGTACTTTTC CTGGGAGGGGGCCTTCCAGC ATGTGGGCAAAGCCTTCAA 190 eukaryotic translation NM 001404.3 BE502067.1 0,6 GAGCTTGCCTTTCCGctgagtcca elongation factor 1 gattgg gamma 191 eukaryotic translation Ij 0,8 AAGGAGGAGAAAAAGGCGGC elongation factor 1 NM 001404.3 BG533219.1 TGCCCCTGCTCCTGAGGAGG gamma AGATGGATGAATGTGAGCAG GCGCTGGCTGCTGAGCCCAA GGCCAAGGACCCCTTCGCTC ACCTGCCCAAGAG 203 calpain, small subunit NM 001749.1 BE907701.1 1,4 _ 1/calpain, small CCTTTGAGGCAGCAGatgaaagt subunit 1 gggaaca 204 Cysteinyl-tRNA NM 001751.3 AK125503.1 1,5 AAACAGGAACAAGAAgcagcaaa synthetase gctggcc 222 prohibitin NM 002634.2 BE536369.1 1,4 GCGAGGAGAGTGCGTgtgtgaga gggtcca 225 translationally- NM 003295.1 BG284235.1 0,6 TTCTTTATTGGTGAAAACATGA controlled tumor ATCCAGATG G CATG GTTG CTC protein 1 TATTGGACTACCGTGAGGATG GTGTGACCCCATATATGATTT TCTTTAAGGATGGTTTA translationally-TTATTTTGGATCTATCACCTGT controlled tumor CATCATAACTG G CTTCTG CTT GTCATCCACACAACACCAGGA CTTAAGACAAATGGGACTGAT GTCATCTTGAGCTCTTCATTTA TTTTGACTGTGATTTATTTG GA GTGGAGGCATTG I I 1 1 1 AAGA AAAACATGTCATGTAGGTTGT CTAAAAATAAAATGCATTTAAA 227 protein 1 _ NM 003295.1 CD641954.1 0,7 C 228 translationally- NM 003295.1 AV749932.1 0,6 CGTCGTCTCCCTTCAGTCGCC controlled tumor ATCATGATTATCTACC protein 1 _ 229 translationally- NM 003295.1 AV749932.1 0,6 GGGACCTCATCAGCCacgatgag controlled tumor atgttct protein 1 232 cold shock domain NM 003651.3 BE935120.1 1,5 AATAACCCACGGAAATATCTG protein A CGCAGTGTAGGAGATGGAGA AACTGTAGAGTTTGATGTGGT TGAAGGAGAGAA 236 Homo sapiens vesicle- NM 003761.2 BG623073.1 0,7 GATCTGGAAGCCACAtctgagca associated membrane cttcaag protein 8 (endobrevin) 237 CASP8 and FADD- NM 003879.3 BI871546.1 1,3 AGTACAAGCAGTCTGgtggatgg like apoptosis aatggaa regulator 239 beta-2-microglobulin NM 004048.2 BM831738.1 0,7 TG GAG G CTATCCAG Cg ta ctcca aagattc 240 Guanine nucleotide NM 004125.2 BC015206.1 0,6 GTGGAGAGGATCAAGgtctctcag binding protein, gcagct gamma 10 242 c-src tyrosine kinase NM 004383.1 BG953215.1 1,1 CCTCAAGTTCTCGCTagatgtctg cgaggc 248 hemoglobin, alpha 1/2 NM 000517.3 AA343446.1 1,7 TGCTTCTCCCCGCAGgatgttcct gtcctt 253 membrane component, NM 005898.2 AA437165.1 0,8 AACAGCTTCAAACAGt99tt9gÇa chromosome e 1 111, , cttacc surface marker 1 254 capping protein (actin NM 006136.1 BG702980.1 0,7 filament) muscle z- fine, alpha 2 AGCAAAAAATTTTTGgaatggtcgt tggag 258 acidic (leucine-rich) NM 006401.1 AI446778.1 1,3 AGGAGGAGGACGAAGaaggag nuclear phosphoprotein 32 aagatgagg family, member B 261 lysosomal associated NM 006762.1 BQ006415.1 1,5 AAGATGCTCCAGAAGgtgagtgtg multispanning gctgca membrane protein 5 266 chromosome 11 open NM 014206.1 GGAAAACATCTCAAGgcctgaag inducible, endoplasmic 0.7ctqccca stress-inducible reticulum, ubiquitin-like domain member 1 120 ring finger protein 10 NM 014868.3 BU626650.1 CGAGAGCGCAGGATTGAGAT 1.5 AGAGGAGAACA 121 ring finger protein 10 NM 014868.3 BU626650.1 AGAAACAGGGCAAGTaccaccia 1.5 agtccaca 125 ribosomal protein L4 NM 000968.2 CB141160.1 GTCATCAGACTAGTGCTGAGT CTTGGGGTACTGGCAGAGCT GTGGCTCGAATTCCCAGAGTT CGAGGTGGTGGGACTCACCG CTCTGGCCAGGGTGCTTTTGG 0.7 AAACatgtgtcgtg 126 ribosomal protein L4 NM 000968.2 CD686462.1 0.6 GCGTGTGCTCGCCCACTGATA TCGGTGTACTCCGAAAAGGG GGAGTCATCTGGCAAAAATGT CACTTTGCCTGCTGTATTCAA GGCTCCTATTCGACCAGATAT TGTGAACTTTGTTCACACCAA CTTGCGCAAAAACAACAGACA GCCCTATGCTG ubiquitin B 137 NM 018955.2 1.4 AA206538.1 CAG GTCAAAATG CAG atcttcgtg aaaacc 138 ubiquitin B NM 018955.2 AA206538.1 1,3 CAG GTCAAAATG CAG atcttcg tg aagacc 139 ubiquitin B NM 018955.2 AA340917.1 1.4 AGATCTTCGTGAAGACCCTGA CCGGCAAGACCATCACTCTG GAGGTGGAGCCCAGTGACAC CATCGAAAATGTGAAGGCCAA GATCCAAGATAAAGAAGGCAT CCCCC CCGACCAGCAGAGGC CTTTG T CAT CAG G C CAAG CAG TG GAAGATG G CGC CACTCTTT CTGACTACAACATCCAGAAAG AGTCGACCCTGCACCTGGTC CTGCGCCTGAGGGGTGGCTG TTAATTCTTCAGTCATGGCATT CGCAGTGCCCAGTGATGGCA TTACTCTGCACTATAGCCATTT GCCCCAACTTAAGTTTAGAAA TTACAAGTTTCAGTAATAGCT GAACCTGTTCAAAATGTTAATA AAGG ubiquitin B 140 NM 018955.2 BU661443.1 1.4 TCCCTGTGGGTGGACGTGGT TGGTGATTGGCAGGATCCT ubiquitin B 141 NM 018955.2 BU661443.1 1.5 GGTTGGCTTTGTTGGgtgagcttg tttgtg Major TCAGAGACAGTCTTCCTGCCC AGGGAAGACCACCTTTTCCGC AAGTTCCACTATCTCCCCTTC CTGCCCTCAACTGAGGACGTT TACGACTGCAGGGTGGAGCA CTGGGGCTTGGATGAGCCTC 145 c, DR NM 019111.2 CD686254.1 0.7 TTCTCAAGCACTGGGagtttgatg Amplex, c1 class ass II, I ctccaagccctctc omp 147 KIAA1191 protein alpha NM 020444.2 BI254429. 1 1.5 GCAGCAGGATCACAGaacagac ccaggaaa mesoderm induction TTTCTCACACAGGCAtactccaaa 148 earlyresponse 1 NM 020948.1 AY124186.1 0.7 tgcttc myelin protein zero TGCCCTGGCATTCTGGCAGA GAATCCTCACCAGTTCTCACC AACCTTCCCCCCAGGCAAGG GCAGCTGCCAGCATGGTGCT CTGCCAGGACAGGTTTCCCTG AAGGAAGCTGCTCACACTGAG ATGAGCCTCTCAGGGCAGGA CCTCTTCCCAAGCCCTGCACA CCCACCCCTGCAGCCCTTTTG like 1 151 NM 024569.2 AI693779.1 0.9 GCTC Likely ortholog of GCTTCAGTCCGCCGAgagcagta mold limb-bud and 152 heart! eue 'NM 030915.1 W45195.1 1,1 ccgtgtg 155 caspase 4, apoptosis-NM 033306.1 NM 001225.2 1,2 TGTTCCCTATGGCAGaaggcaac related cysteine cacagaa protease 158 major NM 033554.2 BU621846.1 0.7 AAGGAGCCTGTGGAGctgggcca histocompatibility gcccaac complex , clans II, DP alpha 1 159 FK506 binding protein NM 04014.1 NM 000801.2 0.8 GCTCCCTGTTCTTGGatctgccat 1A, 12kDa ggaggg 160 intracellular chloride NM 001288.3 BG491600.1 0.8 CCAGGGACGGCCACTTCCTG channel 1 GTCCCCGACGCAACCATGGC TGAAGAACAACCGCAGGTCG AATTGTTCGTGAAG 161 intracellular chloride NM 001288.3 AV683308.1 0.7 GGGCAGCTCCCATTCctgctgtat channel 1 ggcact 164 chromosome 19 open NM 138774.2 AI375989.1 1,4 ACCTCATCTCGGCCAgtgctgacc reading frame 22 tggagg 177 tropomyosin 3 NM 153649.1 BM006741.1 1,3 TGATGAGAGTGAGAGgcagaga cccgtgct 184 eukaryotic translation NM 001404.3 AA206367.1 0.7 CTGAGTCCAGATTGGCAGGT elongation factor 1 gamma GGACTACGAGTCATACACATG GCGGAAACTGGATCCTGGCA GCGAGGAGACCCAGACGCTG eukaryotic translation GTT 185 NM 001404.3 AA206367.1 0.6 CAGCATGTGGGCAAAGCCTTC elongation factor 1 gamma AATCAGGGCAAGATCTTCAAG TGAACATCTCTTGCCATCACC TAG 187 NM 001404.3 eukaryotic translation elongation factor BU783548.1 0.6 ATTTAAGCGCAAGTACTCCAA the TGAGGACACACTCTCTGTGGC gamma ACT GC CATATTT GAG CTG G CA CTTTGATAAG GACG G CTG GTC CCTGTGGTACTCAGAGTATCG CTTCCCTGAAGAACTCACTCA GACCTTCATGAGCTGCAATCT CATCACTG 188 eukaryotic translation NM 001404.3 BE502067.1 0.6 TGGACAAGCTGAGGAAGAAT elongation factor 1 gamma GCCTTCGCCAGTGTCATCCTT TTTGGAACCAACAATAGCAGC TCCATTTCTGGAGTCTGGGTC TTCCGAGGCCAG 18 9 eukaryotic translation NM 001404.3 BE502067.1 0.6 CAGGTGGACTACGAGTCATAC elongation factor 1 gamma ACATGGCGGAAACTGGATCCT GGCAGCGAGGAGACCCAGAC GCTGGTTCGAGAGTACTTTTC CTGGGAGGGGGCCTTCCAGC ATGTGGGCAAAGCCTTCAA 190 eukaryotic translation NM 001404.3 BE502067.1 0.6 GAGCTTGCCTTTCCGctgagtcca elongation factor 1 gamma 191 gattgg eukaryotic translation Ij 0.8 AAGGAGGAGAAAAAGGCGGC elongation factor 1 NM 001404.3 BG533219.1 TGCCCCTGCTCCTGAGGAGG gamma AGATGGATGAATGTGAGCAG GCGCTGGCTGCTGAGCCCAA GGCCAAGGACCCCTTCGCTC ACCTGCCCAAGAG 203 calpain, small subunit NM 001749.1 BE907701.1 1.4 _ 1 / calpain, small subunit 1 CCTTTGAGGCAGCAGatgaaagt gggaaca 204 NM 001751.3 cysteinyl-tRNA synthetase AK125503.1 1.5 AAACAGGAACAAGAAgcagcaaa gctggcc 222 prohibitin NM 002634.2 BE536369.1 1,4 GCGAGGAGAGTGCGTgtgtgaga gggtcca 225 translationally- NM 003295.1 BG284235.1 0.6 TTCTTTATTGGTGAAAACATGA controlled tumor ATCCAGATG G CATG GTTG CTC protein 1 TATTGGACTACCGTGAGGATG GTGTGACCCCATATATGATTT TCTTTAAGGATGGTTTA translationally- TTATTTTGGATCTATCACCTGT controlled tumor CATCATAACTG G CTTCTG CTT GTCATCCACACAACACCAGGA CTTAAGACAAATGGGACTGAT GTCATCTTGAGCTCTTCATTTA TTTTGACTGTGATTTATTTG GA GTGGAGGCATTG II 1 1 1 AAGA AAAACATGTCATGTAGGTTGT CTAAAAATAAAATGCATTTAAA 227 protein 1 _ NM 003295.1 CD641954.1 0.7 C 228 translationally- NM 003295.1 AV749932.1 0.6 CGTCGTCTCCCTTCAGTCGCC ATCATGATTATCTACC controlled tumor protein 1 _1609 translationally 0 NMG_CATCAGCCacgatgag controlled tumor atgttct protein 1 232 cold shock domain NM 003651.3 BE935120.1 1.5 AATAACCCACGGAAATATCTG protein A CGCAGTGTAGGAGATGGAGA AACTGTAGAGTTTGATGTGGT TGAAGGAGAGAA 236 Homo sapiens vesicle- NM 003761.2 BG623073.1 0.7 GATCTGGAAGCCACAtctgagca associated membrane cttcaag protein 8 (endobrevin) 237 CASP8 and FADD-NM 003879.3 BI871546.1 1,3 AGTACAAGCAGTCTGgtggatgg like apoptosis aatggaa regulator 239 beta-2-microglobulin NM 004048.2 BM831738.1 0.7 TG GAG G CTATCCAG nucleotide NM 004125.2 BC015206. 1 0.6 GTGGAGAGGATCAAGgtctctctctctcag binding protein, gamma gcagct 10 242 c-src tyrosine kinase NMG BG953215.1 1.1 CCTCAAGTTCTCGCTagatgtctg cgaggc 248 hemoglobin, alpha 1/2 NM 000517.3 AA343446.1 1.7 TGCTTCTCCCCGCAGgatgttcct gtcctt 253 membrane component, NM 005898.2 AA437165.1 0.8 AACAGCTTCAAACAGt99tt9gÇa chromosome e 1 111,, cttacc surface marker 1 254 capping protein (actin NM 006136.1 BG702980.1 0.7 filament) z-fine muscle, alpha 2 AGCAAAAAATTTTTGgaatggtcgt tggag 258 acidic (leucine-rich) NM 006401.1 AI446778.1 1,3 AGGAGGAGGACGAAGaaggag nuclear phosphoprotein 32 aagatgagg family, member B 261 lysosomal associated NM 006762.1 BQ006415.1 1.5 AAGATGCTCCAGAAGgtgagtgtg multispanning gctgca membrane protein 5 266 chromosome 11 open NM 014206.

1 BE041814.1 0,8 TCGCGGGGCAAAATGgagctcga reading frame 10 ggccatg 269 ribosomal protein L4 NM 000968.2 CB164625.1 0,7 G GTGCTTTTG GAAACatgtgtcgtg gaggc 276 ankyrin repeat domain NM 017664.1 BCO39715.1 0, 7 GGAGCATTCCATATAGAAACT 10 GCTGAAACTGCCACAGGTGCT TCTCCGAAAACCTTACAGTTG TGGCATTGAATGTTCAGTATC GCTTCCTTTCTGCACACG 280 ribosomal protein S17 NM 001021.2 BE731466.1 0,5 AATTATGTTCCTGAGgtctcagcct tggat 281 ribosomal protein S17 NM 001021.2 AV752729.1 0,6 GCCGCGTTCCACCAAAACCGT GAAGAAAGCGGCCCGGGTCA TCATAGAAAAGTACTACACGC GCCTGGGCAACGACTTCCAC ACGAACAAGCGCGTGTGCGA GGAGATCGCCATTATCCCCAG CAAAAAGCTCCG 282 sulfatase 2 NM 018837.1 AB033073.2 1,5 GCGAGAGTGTGTCGAgtgagtgt gcgtctg 283 ubiquitin B NM 018955.2 BG286180.1 1,6 GAAGGCGGAAAAGAGgtcaaaat gcagatc 284 ubiquitin B NM 018955.2 BG286180.1 1,4 GGTATCCGCTAACAGgtcaaaat gcagatc 290 hypothetical protein NM 024841.1 BQ919225.1 1,4 ATTGTTGAACTGGATgcggctgttg FLJ14213 aagag 298 chloride intracellular NM 001288.3 AAl26087.1 0,8 CCTGAGTCCAACACAGCTGG channel 1 G CTG G ACATATTTG C CAAATT TTCTGCCTACATCAAGAATTC AAACCCAGCACTCAATGACA 299 chloride intracellular NM 001288.3 AAl26087.1 0,7 CCCAGGTACCCCAAGctggcagc channel 1 tctgaac 300 chloride intracellular NM 001288.3 AAl26087.1 0,8 GCTGTGCCCTCCCAGgtacccca channel 1 agctggc 307 arachidonate 5- NM 001629.2 BF892107.1 0,9 AAGTGGAGCACGAAAGCAGG 1ipoxygenase ACCCAGAATGGGAGGAGCTT activating protein CCAGAGGACCGGAACACTTG CCTTTGAGCGGGTCTACACTG CCAA 309 ribosomal protein L41 NM 021104.1 EXH-002 0,6 GAAGCGAATGCGCAGgctgaag cgcaaaaq 312 cyclin T2 NM 001241.2 EXH-005 0,5 ttgtgtgagctattcaaactcttcaacccctga 314 zinc finger protein 384 NM 133476.2 EXH-007 1,5 AGTTCAGGAGCCCTGGAAAG GAGAAGGAATAAGACGGCAG GAGGAAGAGA Tableau 3 ù Panel de 66 gènes exprimés différentiellement dans les cancers du sein des patients R et NR SEQ Description de la N Genbank N Genbank C I/11 Séquence Cible ID N séquence (reference) (variant) vs. Sains cDNA DKFZp667I093 AL832878.1 Al223156.1 0,7 TGCAGATTTGATGGCctactgtga (Guanine nucleotide agcaca binding protein, gamma 2) 7 Homo sapiens BO009917.1 BCO28225.1 1,4 ATGAAGAAAAACAAAgtgcacag hypothetical protein agacccg DKFZp761A052 13 cellular homolog of NM 001997.2 NM 001997.2 0,5 GTCGCCCAGATCAAGgctcatgta the fox sequence in the gcctca Finkel-Biskis-Reilly murine sarcoma virus 14 cellular homolog of NM 001997.2 W17004.1 0,6 GGCCGCATGCTTGGAaggtaaa the fox sequence in the gtccatgg Finkel-Biskis-Reilly murine sarcoma virus 16 cellular homolog of NM 001997.2 AA063591.1 0,7 TGACCGGCCAGGAAAcggtcgcc the fox sequence in the cagatca Finkel-Biskis-Reilly murine sarcoma virus 17 cellular homolog of NM 001997.2 AA094898.1 0,6 CAGGCCGCATGCTTGaggtaaa the fox sequence in the gtccatgg Finkel-Biskis-Reilly murine sarcoma virus 18 cellular homolog of NM 001997.2 AA187006.1 0,7 AGGCCGCATGCTTGGaggtaaa the fox sequence in the gtccatgg Finkel-Biskis-Reilly murine sarcoma virus 19 cellular homolog of NM 001997.2 AA225636.1 0,6 GGCCAGGAAACGGTCgcccaga the fox sequence in the tcaaggta Finkel-Biskis-Reilly murine sarcoma virus 20 cellular homolog of NM 001997.2 I 0,5 GGTCGCCCAGATCAAgctcatgta the fox sequence in the BM820687.1 gcctca Finkel-Biskis-Reilly murine sarcoma virus 23 cellular homolog of NM 001997.2 BU603086.1 0,6 GAAGTAGCAGGCCGCatgcttgg the fox sequence in the 9g Finkel-Biskis-Reilly aggtaaa murine sarcoma virus 24 cellular homolog of NM 001997.2 BF218408. 1 0,6 AGGCCGATGCTTGGAggtaaagt the fox sequence in the ccatggt Finkel-Biskis-Reilly murine sarcoma virus 25 cellular homolog of NM 001997.2 AI499403.1 0,7 CCCGCATGCTTGGAG9taaa9tc the fox sequence in the catggtt Finkel-Biskis-Reilly murine sarcoma virus 26 cellular homolog of NM 001997.2 AW795076.1 0,6 CGGTCGCCCAGATCAaggctcat the fox sequence in the gtagcct Finkel-Biskis-Reilly murine sarcoma virus 27 cellular homolog of NM 001997.2 D52122.1 0,6 GGCCGCATGCTTGGggtaaagtc the fox sequence in the Finkel-Biskis-Reilly catggtt murine sarcoma virus 28 cellular homolog of NM 001997.2 BE535673.1 0,6 GCCGCATGCTTGGAGgtaacagt the fox sequence in the ccatggt Finkel-Biskis-Reilly murine sarcoma virus 47 synuclein, alpha (non NM 000345.2 NM 007308.1 1,5 GACCAGTTGGGCAAGaatgaag A4 component of aaggagcc amyloid precursor) 51 translationally- NM 003295.1 CA848049.1 0, 7 AGAtcgcggacgggttgtgcctggaggT controlled tumor GG protein 1 52 translationally- NM 003295.1 CA848049.1 0,5 TCCGACATCTACAAGatccggga controlled tumor gatcgcg protein 1 53 translationally- NM 003295.1 BCO22436.1 0,6 CCGACATCTACAAGATCCGGG controlled tumor AGATCGCGGACGGGTTGTGC protein 1 CTG 54 translationally- NM 003295.1 BC040008.1 0,5 GCGCCGCTCCGGCTGCACCG controlled tumor CGCTCGCTCCGAGTTTCAGG protein 1 CTCGTGCTAAGCTAGCGCCGT CGTCGTCTCCCTTCAGTCGCC ATCATGATTATCTACC 55 TYRO protein tyrosine NM 003332.1 BF092099.1 0,8 GAGGGGCTGCGGAGGcagcga kinase binding protein cccggaaac 58 uroporphyrinogen NM 000374.2 BQ008745.1 1,5 GTGTGCCGCTGATTGtggaccct decarboxylase gatgaca 64 beta-2-microglobulin NM 004048.2 AV734235.1 0,7 ATTTG GATTG GATGAATTC CA AATTCTGCTTGCTTGC I I I I 1A ATATTGATATGCTTATACACTT ACACTTTATGCACAAAATGTA GGGTTATAATAATGTTAACAT GGACATGATCTTCTTTATAATT CTACTTTGAGTGCTGTCTCCA TGTTTGATGTATCTGAGCAGG TTGCTCCACAGGTAGCTCTAG GAGGGCTGGCAAC 69 SEC24 related gene NM 004922.1 AW449995.1 1,8 GCAGCTGTCTGGAATgcagagg family, member C (S. cagctgga cerevisiae) 80 hemoglobin, alpha 1/2 NM 000558.3 R91899.1 1,7 ACCAAGACCTACTTCccggtcaac ttcaag 81 hemoglobin, alpha 1/2 NM 000558.3 H58664.1 1,7 GGAGGCCCTGGAGAGctcctaag ccactgc 88 talin 1 NM 006289.2 AI393487.1 1,4 CCCAGAGTATTAACGCTCCAA GAGTATTATTAACGCTGCTGT ACCTCGATCTGAATCTGCCGG GGCCCCAGCCCACTCCACCC TGCCAGCAGCTTCCAGCCAGT CCCCACAGCCTCATCAGCTCT CTTCACCGIIIIIIGATACTAT CTTCCCCCACCCCCAGCTACC CATAGGGGCTGCAGAGTTATA AGCCCCAAACAGGTCATGCTC CAATAAAAATGATTCTACCTAC AA 89 talin 1 NM 006289.2 A1393487.1 1,4 TGCTTCGGAAGGAACgagagctg gaagagg 90 talin 1 NM 006289.2 A1417760.1 1,4 AGGAAGAAATGCTTCggaagga acgagagc 116 Homocysteine- NM 014685.1 BG828243.1 0,7 GGAAAACATCTCAAGgcctgaag inducible, endoplasmic ctgccCa ulum stress- inucible, ubiquitin- ind like domain member 1 121 ring finger protein 10 NM 014868.3 BU626650.1 1,5 AGAAACAG G G CAAGTacccag a agtccaca 125 ribosomal protein L4 NM 000968.2 CB141160.1 0,7 GTCATCAGACTAGTGCTGAGT CTTGGGGTACTGGCAGAGCT GTGGCTCGAATTCCCAGAGTT CGAGGTGGTGGGACTCACCG CTCTGGCCAGGGTGCTTTTGG AAACatgtgtcgtg 137 ubiquitin B NM 018955.2 AA206538.1 1,4 CAG G T CAAAATG CAG at cttcg tg aaaacc 139 ubiquitin B NM 018955.2 AA340917.1 1,4 AGATCTTCGTGAAGACCCTGA CCGGCAAGACCATCACTCTG GAGGTGGAGCCCAGTGACAC CATCGAAAATGTGAAGGCCAA GATCCAAGATAAAGAAGGCAT CCCCCCCGACCAGCAGAGGC TCATCTTTGCAGGCAAGCAGC TGGAAGATGGCCGCACTCTTT CTGACTACAACATCCAGAAAG AGTCGACCCTGCACCTGGTC CTGCGCCTGAGGGGTGGCTG TTAATTCTTCAGTCATGGCATT CGCAGTGCCCAGTGATGGCA TTACTCTGCACTATAGCCATTT G C C C CAA CTTAAGTTTAG AAA TTACAAGTTTCAGTAATAGCT GAACCTGTTCAAAATGTTAATA AAGG 145 major NM 019111.2 CD686254.1 0,7 TCAGAGACAGTCTTCCTGCCC omplexplex class II, c I, DR o, ciasslI alpha AGGGAAGACCACCTTTTCCGC AAGTTCCACTATCTCCCCTTC CTGCCCTCAACTGAGGACGTT TACGACTGCAGGGTGGAGCA CTGGGGCTTGGATGAGCCTC TTCTCAAGCACTGGGagtttgatg ctccaagccctctc 148 mesoderm induction NM 020948.1 BE041814.1 0.8 TCGCGGGGCAAAATGgagctcga reading frame 10 ggccatg 269 ribosomal protein L4 NM 000968.2 CB164625.1 0.7 G GTGCTTTTG GAAACatgtgtcgtg gaggc 276 ankyrin repeat domain NM 017664.1 BCO39715.1 0, 7 GGAGCATTCCATATAGAAACT 10 GCTGAAACTGCCACAGGTGCT TCTCCGAAAACCTTACAGTTG TGGCATTGAATGTTCAGTATC GCTTCCTTTCTGCACACG 280 ribosomal protein S17 NM 001021.2 BE731466.1 0.5 AATTATGTTCCTGAGgtctcagcct tggat 281 ribosomal protein S17 NM 001021.2 AV752729.1 0.6 GCCGCGTTCCACCAAAACCGT GAAGAAAGCGGCCCGGGTCA TCATAGAAAAGTACTACACGC GCCTGGGCAACGACTTCCAC ACGAACAAGCGCGTGTGCGA GGAGATCGCCATTATCCCCAG CAAAAAGCTCCG 282 sulfatase 2 NM 018837.1 AB033073.2 1.5 GCGAGAGTGTGTCGAgtgagtgt gcgtctg ubiquitin B 283 NM 018955.2 BG286180.1 1.6 GAAGGCGGAAAAGAGgtcaaaat gcagatc 284 ubiquitin B NM 018955.2 BG286180.1 1,4 GGTATCCGCTAACAGgtcaaaat gcagatc 290 hypothetical protein NM 024841.1 BQ919225.1 1,4 ATTGTTGAACTGGATgcggctgttg FLJ14213 aagag 298 intracellular chloride NM 001288.3 AAl26087.1 0.8 CCTGAGTCCAACACAGCTGG channel 1 G CTG G ACATATTTG C CAAATT TTCTGCCTACATCAAGAATTC AAACCCAGCACTCAATGACA 299 NM 001288.3 AAl26087.1 chloride intracellular channel 1 0.7 CCCAGGTACCCCAAGctggcagc tctgaac 300 NM 001288.3 AAl26087.1 chloride intracellular channel 1 0.8 GCTGTGCCCTCCCAGgtacccca agctggc 307 arachidonate 5- NM 001629.2 BF892107.1 0.9 AAGTGGAGCACGAAAGCAGG 1ipoxygenase ACCCAGAATGGGAGGAGCTT activating protein CCAGAGGACCGGAACACTTG CCTTTGAGCGGGTCTACACTG CCAA 309 ribosomal protein L41 NM 021104.1 EXH-002 0.6 312 GAAGCGAATGCGCAGgctgaag cgcaaaaq cyclin T2 NM 001241.2 EXH-005 0.5 ttgtgtgagctattcaaactcttcaacccctga 314 zinc finger protein 384 NM 133476.2 EXH-007 1.5 Table AGTTCAGGAGCCCTGGAAAG GAGAAGGAATAAGACGGCAG GAGGAAGAGA 3-panel of 66 differentially expressed genes in breast cancers of R and NR patients SEQ Description of N Genbank N Genbank CI / 11 Sequence Target ID N sequence (reference) (variant) vs. Healthy TGDNA DKFZp667I093 AL832878.1 Al223156.1 0.7 TGCAGATTTGATGGCctactgtga (Guanine nucleotide agcaca binding protein, gamma 2) 7 Homo sapiens BO009917.1 BCO28225.1 1,4 ATGAAGAAAAACAAAgtgcacag hypothetical protein agacccg DKFZp761A052 13 cellular homolog of NM 001997.2 NM 001997.2 0, 5 GTCGCCCAGATCAAGgctcatgta the fox sequence in the gcctca Finkel-Biskis-Reilly murine sarcoma virus 14 cellular homolog of NM 001997.2 W17004.1 0.6 GGCCGCATGCTTGGAaggtaaa the fox sequence in the gtccatgg Finkel-Biskis-Reilly murine sarcoma virus 16 cellular homolog of NM 001997.2 AA063591 .1 0.7 TGACCGGCCAGGAAAcggtcgcc the fox sequence in the cagatca Finkel-Biskis-Reilly murine sarcoma virus 17 cellular homolog of NM 001997.2 AA094898.1 0.6 CAGGCCGCATGCTTGaggtaaa the fox sequence in the gtccatgg Finkel-Biskis-Reilly murine sarcoma virus 18 cellular homolog of NM 001997.2 AA187006.1 0.7 AGGCCGCATGCTTGGaggtaaa the fox sequence in the gtccatgg Finkel-Biskis-Reilly murine sarcoma virus 19 cellular homolog of NM 001997.2 AA225636.1 0.6 GGCCAGGAAACGGTCgcccaga the fox sequence in the tcaaggta Finkel-Biskis-Reilly murine sarcoma virus 20 cellular homolog of NM 001997.2 I 0.5 GGTCGCCCAGATCAAgctcatgta the fox sequence in the BM820687.1 gcctca Finkel-Biskis-Reilly murine sarcoma virus 23 cellular homolog of NM 001997.2 BU603086.1 0.6 GAAGTAGCAGGCCGCatgcttgg the fox sequence in the 9g Finkel-Biskis-Reilly murine aggtaaa sarcoma virus 24 cellular homolog of NM 001997.2 BF218408. 1 0.6 AGGCCGATGCTTGGAggtaaagt the fox sequence in the ccatggt Finkel-Biskis-Reilly murine sarcoma virus 25 cellular homolog of NM 001997.2 AI499403.1 0.7 CCCGCATGCTTGGAG9taaa9tc the fox sequence in the catggtt Finkel-Biskis-Reilly murine sarcoma virus 26 cellular homolog of NM 001997.2 AW795076.1 0.6 CGGTCGCCCAGATCAaggctcat the fox sequence in the gtagcct Finkel-Biskis-Reilly murine sarcoma virus 27 cellular homolog of NM 001997.2 D52122.1 0.6 GGCCGCATGCTTGGggtaaagtc the fox sequence in the Finkel-Biskis-Reilly catggtt murine sarcoma virus 28 cellular homolog of NM 001997.2 BE535673.1 0.6 GCCGCATGCTTGGAGgtaacagt the fox sequence in the ccatggt Finkel-Biskis-Reilly murine sarcoma virus 47 synuclein, alpha (not NM 000345.2 NM 007308.1 1.5 GACCAGTTGGGCAAGaatgaag A4 component of aaggagcc amyloid precursor) 51 translationally - NM 003295.1 CA848049.1 0, 7 AGAtcgcggacgggttgtgcctggaggT controlled tumor GG protein 1 52 translationally- NM 003295.1 CA848049.1 0.5 TCCGACATCTACAAGatccggga controll ed tumor gatcgcg protein 1 53 translationally- NM 003295.1 BCO22436.1 0.6 CCGACATCTACAAGATCCGGG AGATCGCGGACGGGTTGTGC controlled tumor protein 1 CTG 54 translationally- NM 003295.1 BC040008.1 0.5 GCGCCGCTCCGGCTGCACCG controlled tumor protein 1 CGCTCGCTCCGAGTTTCAGG CTCGTGCTAAGCTAGCGCCGT CGTCGTCTCCCTTCAGTCGCC ATCATGATTATCTACC 55 TYRO protein tyrosine NM 003332.1 BF092099 .1 0.8 GAGGGGCTGCGGAGGcagcga kinase binding protein cccggaaac 58 NM 000374.2 uroporphyrinogen decarboxylase BQ008745.1 1.5 GTGTGCCGCTGATTGtggaccct gatgaca 64 beta-2-microglobulin NM 004048.2 AV734235.1 0.7 ATTTG GATTG GATGAATTC CA AATTCTGCTTGCTTGC IIII 1A ATATTGATATGCTTATACACTT ACACTTTATGCACAAAATGTA GGGTTATAATAATGTTAACAT GGACATGATCTTCTTTATAATT CTACTTTGAGTGCTGTCTCCA TGTTTGATGTATCTGAGCAGG TTGCTCCACAGGTAGCTCTAG GAGGGCTGGCAAC 69 SEC24 related gene NM 004922.1 AW449995.1 1.8 GCAGCTGTCTGGAATgcagagg family, member C (S. cagctgga cerevisiae) 80 hemoglobin, alpha 1/2 NM 000558.3 R91899.1 1.7 ACCAAGACCTACTTCccggtcaac ttcaag 81 hemoglobin, alpha 1/2 NM 000558.3 H58664.1 1.7 GGAGGCCCTGGAGAGctcctaag ccactgc 88 talin 1 NM 006289.2 AI393487.1 1.4 CCCAGAGTATTAACGCTCCAA GAGTATTATTAACGCTGCTGT ACCTCGATCTGAATCTGCCGG GGCCCCAGCCCACTCCACCC TGCCAGCAGCTTCCAGCCAGT CCCCACAGCCTCATCAGCTCT CTTCACCGIIIIIIGATACTAT CTTCCCCCACCCCCAGCTACC CATAGGGGCTGCAGAGTTATA AGCCCCAAACAGGTCATGCTC CAATAAAAATGATTCTACCTAC AA 89 talin 1 NM 006289.2 A1393487.1 1.4 TGCTTCGGAAGGAACgagagctg gaagagg 90 talin 1 NM 006289.2 A1417760.1 1.4 AGGAAGAAATGCTTCggaagga acgagagc 116 homocysteine NM 014685.1 BG828243.1 0.7 GGAAAACATCTCAAGgcctgaag inducible, endoplasmic ctgccCa ulum stress-inucible, ubiquitin-ind like domain member 1 121 ring finger protein 10 NM 014868.3 BU626650.1 1.5 AGAAACAG GG CAAGTaccagal agtccaca 125 ribosomal protein L4 NM 000968.2 CB141160.1 0.7 GTCATCAGACTAGTGCTGAGT CTTGGGGTACTGGCAGAGCT GTGGCTCGAATTCCCAGAGTT CGAGGTGGTGGGACTCACCG CTCT GGCCAGGGTGCTTTTGG AAACatgtgtcgtg ubiquitin B SL 137 018955.2 AA206538.1 1.4 GT CAAAATG CAG CAG at cttcg tg YYYYCC ubiquitin B SL 139 018955.2 AA340917.1 1.4 AGATCTTCGTGAAGACCCTGA CCGGCAAGACCATCACTCTG GAGGTGGAGCCCAGTGACAC CATCGAAAATGTGAAGGCCAA GATCCAAGATAAAGAAGGCAT CCCCCCCGACCAGCAGAGGC TCATCTTTGCAGGCAAGCAGC TGGAAGATGGCCGCACTCTTT CTGACTACAACATCCAGAAAG AGTCGACCCTGCACCTGGTC CTGCGCCTGAGGGGTGGCTG TTAATTCTTCAGTCATGGCATT CGCAGTGCCCAGTGATGGCA TTACTCTGCACTATAGCCATTT GCCC CAA AAA CTTAAGTTTAG TTACAAGTTTCAGTAATAGCT GAACCTGTTCAAAATGTTAATA AAGG 145 Major NM 019111.2 CD686254.1 0.7 TCAGAGACAGTCTTCCTGCCC omplexplex class II, c I, DR o, ciasslI alpha AGGGAAGACCACCTTTTCCGC AAGTTCCACTATCTCCCCTTC CTGCCCTCAACTGAGGACGTT TACGACTGCAGGGTGGAGCA CTGGGGCTTGGATGAGCCTC TTCTCAAGCACTGGGagtttgatg ctccaagccctctc 148 NM 020 948 mesoderm induction.

1 AY124186.1 0,7 TTTCTCACACAGGCAtactccaaa early response 1 tgcttc 158 major NM 033554.2 BU621846.1 0,7 AAGGAGCCTGTGGAGctgggcca histocability gcccaac II, complexex, cl class DP alpha 1 160 chloride intracellular NM 001288.3 BG491600.1 0,8 CCAGGGACGGCCACTTCCTG channel 1 GTCCCCGACGCAACCATGGC TGAAGAACAACCGCAGGTCG AATTGTTCGTGAAG 161 chloride intracellular NM 001288.1 AY124186.1 0.7 TTTCTCACACAGGCAtactccaaa early response 1 tgcttc 158 major NM 033554.2 BU621846.1 0.7 AAGGAGCCTGTGGAGctgggcca histocability gcccaac II, complexex, cl class DP alpha 1 160 intracellular chloride NM 001288.3 BG491600.1 0.8 CCAGGGACGGCCACTTCCTG channel 1 GTCCCCGACGCAACCATGGC TGAAGAACAACCGCAGGTCG AATTGTTCGTGAAG 161 intracellular chloride NM 001288.

3 AV683308.1 0,7 GGGCAGCTCCCATTCctgctgtat channel 1 ggcact 164 chromosome 19 open NM 138774.2 AI375989.1 1,4 ACCTCATCTCGGCCAgtgctgacc reading frame 22 tggagg 188 eukaryotic translation NM 001404.3 BE502067.1 0,6 TGGACAAGCTGAGGAAGAAT elongation factor 1 GCCTTCGCCAGTGTCATCCTT gamma TTTGGAACCAACAATAGCAGC TCCATTTCTGGAGTCTGGGTC TTCCGAGGCCAG 189 eukaryotic translation NM 001404.3 BE502067.1 0,6 CAGGTGGACTACGAGTCATAC elongation factor 1 ACATGGCGGAAACTGGATCCT gamma GGCAGCGAGGAGACCCAGAC GCTGGTTCGAGAGTACTTTTC CTGGGAGGGGGCCTTCCAGC ATGTGGGCAAAGCCTTCAA 190 eukaryotic translation NM 001404.3 BE502067.1 0,6 GAGCTTGCCTTTCCGctgagtcca elongation factor 1 gattgg gamma 191 eukaryotic translation NM 001404.3 BG533219.1 0,8 AAGGAGGAGAAAAAGGCGGC elongation factor 1 TGCCCCTGCTCCTGAGGAGG gamma AGATGGATGAATGTGAGCAG GCGCTGGCTGCTGAGCCCAA GGCCAAGGACCCCTTCGCTC ACCTGCCCAAGAG 208 ferritin, heavy NM 002032.1 BG248923.1 0,6 TGTCTCTGGGGATCCCTAGTA polypeptide 1 TAACACATGCA 222 prohibitin NM 002634.2 BE536369.1 1,4 GCGAGGAGAGTGCGTgtgtgaga gggtcca 225 translationally- NM 003295.1 BG284235.1 0,6 TTCTTTATTGGTGAAAACATGA controlled tumor AT C CAGATG G CATG G TTG CT C controlled GTGTGACCCCATATATGATTT protein 1 TCTTTAAGGATGGTTTA 228 translationally- NM 003295.1 AV749932.1 0,6 CGTCGTCTCCCTTCAGTCGCC controlled tumor ATCATGATTATCTACC protein 1 229 translationally- NM 003295.1 AV749932.1 0, 6 GGGACCTCATCAGCCacgatgag controlled tumor atgttct protein 1 236 Homo sapiens vesicle- NM 003761.2 BG623073.1 0,7 GATCTGGAAGCCACAtctgagca associated membrane cttcaag protein 8 (endobrevin) 240 Guanine nucleotide NM 004125.2 BC015206.1 0,6 GTGGAGAGGATCAAGgtctctcag binding protein, gcagct gamma 10 242 c-src tyrosine kinase NM 004383.1 BG953215.1 1,1 CCTCAAGTTCTCGCTagatgtctg cgaggc 245 family with sequence NM 004699.1 AA584911.1 1,6 TCAAGAGTGAGTGTTtgcggagtc similarity 50, member agacgc A 248 hemoglobin, alpha 1/2 NM 000517.3 AA343446.1 1,7 TGCTTCTCCCCGCAGgatgttcct gtcctt 252 putative translation NM 005801.2 BG655736.1 0,7 CCTTTGTGCTTGCAGaagtttgcctinitiation factor gcaat 280 ribosomal protein S17 NM 001021.2 BE731466.1 0,5 AATTATGTTCCTGAGgtctcagcct tggat 281 ribosomal protein S17 NM 001021.2 AV752729.1 0,6 GCCGCGTTCCACCAAAACCGT GAAGAAAGCGGCCCGGGTCA TCATAGAAAAGTACTACACGC GCCTGGGCAACGACTTCCAC ACGAACAAGCGCGTGTGCGA GGAGATCGCCATTATCCCCAG CAAAAAGCTCCG 284 ubiquitin B NM 018955.2 BG286180.1 1,4 GGTATCCGCTAACAGgtcaaaat gcagatc 290 hypothetical protein NM 024841.1 BQ919225.1 1,4 ATTGTTGAACTGGATgcggctgttg FLJ14213 aagag 298 chloride intracellular NM 001288.3 AAl26087.1 0,8 CCTGAGTCCAACACAGCTGG channel 1 G CTG GACATATTTG C CAAATT TTCTGCCTACATCAAGAATTC AAACCCAGCACTCAATGACA 299 chloride intracellular NM 001288.3 AAl26087.1 0,7 CCCAGGTACCCCAAGCtggCagC channel 1 tctgaac 300 chloride intracellular NM 001288.3 AAI26087.1 0,8 GCTGTGCCCTCCCAGgtacccca channel 1 agctggc 309 ribosomal protein L41 NM 021104.1 EXH-002 0,6 GAAGCGAATGCGCAGgctgaag cgcaaaag 310 translationally NM 003295.1 EXH-003 0,5 AATGCATATTTAAACTAAATTG controlled tumor ATCCTGTAGTGTTCCTGGAGA protein 1 AGCTAGAGCCTGATTGTAGGC TACTACTCATCAATTAACTTCT ACAGTGGAGACTACTTCTGGG ACTGGAATATAAAAA 311 G1 to S phase ATTACCGTTTATTCCATATCTG transition 1/Gl to S GATAATTTGCCGAACTTCAAT phase transition 1 AGATCAGTTGATGGACCAATC NM 002094.1 EXH-004 1,5 AGGCTGCCAATTGTGG 312 cyclin T2 NM 001241.2 EXH-005 0,5 ttgtgtgagctattcaaactcttcaacccctga3 AV683308.1 0.7 GGGCAGCTCCCATTCctgctgtat channel 1 ggcact 164 NM 138774.2 chromosome 19 open reading frame ACCTCATCTCGGCCAgtgctgacc AI375989.1 1.4 22 188 tggagg eukaryotic translation NM 001404.3 BE502067.1 0.6 TGGACAAGCTGAGGAAGAAT elongation factor 1 gamma GCCTTCGCCAGTGTCATCCTT TTTGGAACCAACAATAGCAGC TCCATTTCTGGAGTCTGGGTC eukaryotic TTCCGAGGCCAG 189 translation NM 001404.3 BE502067.1 0.6 CAGGTGGACTACGAGTCATAC elongation factor 1 gamma ACATGGCGGAAACTGGATCCT GGCAGCGAGGAGACCCAGAC GCTGGTTCGAGAGTACTTTTC CTGGGAGGGGGCCTTCCAGC ATGTGGGCAAAGCCTTCAA 190 eukaryotic translation NM 001404.3 BE502067.1 0.6 GAGCTTGCCTTTCCGctgagtcca elongation factor 1 gamma 191 gattgg NM 001404.3 eukaryotic translation elongation factor BG533219.1 0.8 AAGGAGGAGAAAAAGGCGGC 1 TGCCCCTGCTCCTGAGGAGG gamma AGATGGATGAATGTGAGCAG GCGCTGGCTGCTGAGCCCAA GGCCAAGGACCCCTTCGCTC ACCTGCCCAAGAG 208 ferritin, heavy NM 002032.1 BG248923.1 0.6 TGTCTCTGGGGATCCCTAGTA polypeptide 1 TAACACATGCA 222 prohibitin NM 002634.2 BE536369.1 1.4 GCGAGG AGCTTGCGTgtgtgaga gggtcca 225 translationally- NM 003295.1 BG284235.1 0.6 TTCTTTATTGGTGAAAACATGA controlled tumor AT C CAGATG G CATG G TTG CT C controlled GTGTGACCCCATATATGATTT protein 1 TCTTTAAGGATGGTTTA 228 translationally- NM 003295.1 AV749932.1 0.6 CGTCGTCTCCCTTCAGTCGCC controlled tumor ATCATGATTATCTACC protein 1 229 translationally ## STR1 ## GGGACCTCATCAGCCacgatgag controlled tumor atgttct protein 1 236 Homo sapiens vesicle- NM 003761.2 BG623073.1 0.7 GATCTGGAAGCCACAtctgagca associated membrane cttcaag protein 8 (endobrevin) 240 Guanine nucleotide NM 004125.2 BC015206.1 0.6 GTGGAGAGGATCAAGgtctctcag binding protein , gcagct gamma, 242 c-src tyrosine kinase NMCT. CCTCAAGTTCTCGCTagatgtctg cgaggc 245 family with sequence NM 58/461 A.184911.1 1.6 TCAAGAGTGAGTGTTtgcggagtc similarity 50, member agacgc A 248 hemoglobin, alpha 1/2 NM 000517.3 AA343446. 1 1.7 TGCTTCTCCCCGCAGgatgttcct gtcctt 252 putative translation NM 005801.2 BG655736.1 0.7 CCTTTGT GCTTGCAGaagtttgcctinitiation factor gcaat 280 ribosomal protein S17 NM 001021.2 BE731466.1 0.5 AATTATGTTCCTGAGgtctcagcct tggat 281 ribosomal protein S17 NM 001021.2 AV752729.1 0.6 GCCGCGTTCCACCAAAACCGT GAAGAAAGCGGCCCGGGTCA TCATAGAAAAGTACTACACGC GCCTGGGCAACGACTTCCAC ACGAACAAGCGCGTGTGCGA GGAGATCGCCATTATCCCCAG CAAAAAGCTCCG ubiquitin B 284 NM 018955.2 BG286180.1 1.4 GGTATCCGCTAACAGgtcaaaat gcagatc 290 NM 024841.1 hypothetical protein FLJ14213 BQ919225.1 1.4 ATTGTTGAACTGGATgcggctgttg aagag 298 NM 001288.3 AAl26087.1 chloride intracellular channel 1 0.8 CCTGAGTCCAACACAGCTGG G CTG GACATATTTG C CAAATT TTCTGCCTACATCAAGAATTC AAACCCAGCACTCAATGACA 299 NM 001288.3 AAl26087.1 chloride intracellular channel 1 0.7 CCCAGGTACCCCAAGCtggCagC tctgaac 300 intracellular chloride NM 001288.3 AAI26087.1 0.8 GCTGTGCCCTCCCAGgtacccca channel 1 agctggc 309 ribosomal protein L41 NM 021104.1 EXH-002 0.6 GAAGCGAATGCGCAGgctgaag cgcaaaag 310 translational NM 003295.1 EXH-003 0.5 AATGCATATTTAAACTAAATTG controlled tumo r ATCCTGTAGTGTTCCTGGAGA protein 1 AGCTAGAGCCTGATTGTAGGC TACTACTCATCAATTAACTTCT ACAGTGGAGACTACTTCTGGG ACTGGAATATAAAAA 311 G1 to S-phase transition ATTACCGTTTATTCCATATCTG 1 / Gl to S-phase transition GATAATTTGCCGAACTTCAAT 1 AGATCAGTTGATGGACCAATC NM 002094.1 EXH-004 1.5 312 AGGCTGCCAATTGTGG cyclin T2 NM 001241.2 EXH-005 0.5 ttgtgtgagctattcaaactcttcaacccctga

Claims

A method for detecting the presence or risk of developing breast cancer in a mammal, comprising detecting, in a mammalian blood sample, the following target molecules: a) nucleic acids comprising the sequences shown in SEQ ID NOs 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228, 240 and 312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, and / or b) the nucleic acids having a sequence complementary to sequences according to a), and / or c) the functional analogues of the nucleic acids according to ) or b) from another species, and / or d) the polypeptides encoded by the nucleic acids according to a) to c), the presence, absence or (relative) amount of these target molecules in the sample being an indication of the presence or risk of developing breast cancer in this mammal.

2. Method according to claim 1, characterized in that the nucleic acids according to a) comprise the sequences represented in SEQ ID NOs: 18, 19, 23, 26, 27, 51, 52, 53, 54, 55, 69, 80 , 125, 145, 148, 161, 188, 225, 228, 240, 280 and 312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases.

3. Method according to claim 1, characterized in that the nucleic acids according to a) comprise the sequences represented in SEQ ID NOs: 13, 16-19, 23, 26-28, 47, 51-55, 58, 69, 80 , 81, 89, 116, 121, 125, 145, 148, 158, 160, 161, 164, 189, 190, 225, 229, 240, 248, 280, 281, 284, 299, 300, 310 and 312 or a a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases.

4. Method according to claim 1, characterized in that the nucleic acids according to a) comprise the sequences represented in SEQ ID NOs: 7, 13, 14, 16-19, 23-28, 47, 51-55, 58, 69. , 80, 81, 89, 116, 121, 125, 137, 139, 145, 148, 158, 160, 161, 164, 189, 190, 225, 228, 229, 240, 245, 248, 252, 280, 281 , 284, 290, 298-300, 310 and 312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases.

5. Method according to claim 1, characterized in that the nucleic acids according to a) comprise the sequences represented in SEQ ID NOs: 5, 7, 13, 14, 16-20, 23-28, 47, 51-55, 58 ,,,,,,,, , 240, 242, 245, 248, 252, 280, 281, 284, 290, 298-300 and 309-312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19 , 20, 25 or 30 consecutive bases.

The method according to any one of claims 1 to 5, characterized in that the nucleic acids according to a) further comprise the additional sequences shown in SEQ ID NOs: 1-318 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases.

7. A method according to any one of the preceding claims comprising detecting the presence or absence of a nucleic acid according to a) to c) by selective hybridization or selective amplification.

The method of claim 1 for detecting the presence or risk of developing breast cancer in a mammal, comprising contacting, under conditions allowing hybridization between complementary sequences, nucleic acids derived from a sample of mammalian blood and a set of probes specific for the following target molecules: a) nucleic acids comprising the sequences shown in SEQ ID NOs: 18, 19, 23, 26, 51, 52, 53, 54, 55, 69, 80, 125, 145, 148, 225, 228, 240 and 312 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25, or 30 consecutive bases, and / or b) nucleic acids having a sequence complementary to sequences according to a), and / orc) the functional analogues of the nucleic acids according to a) or b) from another species, to obtain a hybridization profile, the profile of hybridization being characteristic of the presence or risk to develop breast cancer in this mammal.

9. Method according to claim 8, characterized in that the probes are immobilized on a support.

The method according to any one of claims 1 to 6, comprising detecting the presence or absence of a polypeptide according to d) using a specific antibody or a fragment or derivative thereof. this.

11. Method according to any one of the preceding claims, characterized in that the blood sample is a whole blood sample.

12. Use of a nucleic acid probe specific for a target nucleic acid as defined in one of claims 1 to 6, said probe comprising from 15 to 400 bases, for the in vitro detection of breast cancer in a patient. human subject. 20

13. Use of a nucleic primer allowing the amplification of all or part of a target nucleic acid as defined in one of claims 1 to 6, said primer being single-strand, of a length of between 5 and 50 bases, for the in vitro detection of breast cancer in a human subject. 25

14. A product comprising a support on which at least 18 distinct nucleic acid probes comprising a complementary and / or specific sequence of 18 distinct target nucleic acids selected from SEQ ID NO: 1-318, preferably 18 nucleic acid targets, are immobilized. as defined in claim 1.

15. Use of a product according to claim 14 for the in vitro or ex vivo detection of the presence or risk of developing breast cancer in a mammal. 15

A method for detecting the presence or risk of developing cancer in a mammal, comprising detecting, in a sample of the mammal, one or more target molecules selected from: a) nucleic acids comprising the sequences shown in SEQ ID NOs: 1-318 or a distinctive fragment thereof having at least 15, preferably at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, and / or b) the nucleic acids having a complementary sequence sequences according to a), and / or c) the functional analogues of the nucleic acids according to a) or b) from another species, and / or d) the polypeptides encoded by the nucleic acids according to a) to c), the the presence, absence or (relative) amount of these target molecules in the sample being an indication of the presence or risk of developing cancer in that mammal.

17. A product comprising a support on which at least two distinct nucleic acid probes comprising a complementary and / or specific sequence of two distinct target nucleic acids selected from SEQ ID NO: 1-318 are immobilized.

18. A method for detecting the presence or risk of developing cancer in a mammal, comprising contacting, under conditions allowing hybridization between complementary sequences, nucleic acids from a mammalian blood sample and a probe set specific for at least two distinct molecules selected from the following targets: a) nucleic acids comprising the sequences shown in SEQ ID NOs: 1-318 or a distinctive fragment thereof having at least 15, preferably at least at least 16, 17, 18, 19, 20, 25 or 30 consecutive bases, and / or b) the nucleic acids having a sequence complementary to sequences according to a), and / or c) the functional analogues of the nucleic acids according to a) or b) from another species, to obtain a hybridization profile, the hybridization profile being characteristic of the presence or risk of developing cancer in this mammal.