US20190376148A1 - Diagnostic for lung disorders using class prediction - Google Patents

Diagnostic for lung disorders using class prediction Download PDF

Info

Publication number
US20190376148A1
US20190376148A1 US16/510,584 US201916510584A US2019376148A1 US 20190376148 A1 US20190376148 A1 US 20190376148A1 US 201916510584 A US201916510584 A US 201916510584A US 2019376148 A1 US2019376148 A1 US 2019376148A1
Authority
US
United States
Prior art keywords
genes
group
expression
unigene
gene
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US16/510,584
Inventor
Jerome S. Brody
Avrum Spira
Jennifer E. Beane-Ebel
Marc E. Lenburg
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Boston University
Original Assignee
Boston University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Boston University filed Critical Boston University
Priority to US16/510,584 priority Critical patent/US20190376148A1/en
Publication of US20190376148A1 publication Critical patent/US20190376148A1/en
Assigned to TRUSTEES OF BOSTON UNIVERSITY reassignment TRUSTEES OF BOSTON UNIVERSITY ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SPIRA, AVRUM, BEANE-EBEL, JENNIFER E., BRODY, JEROME S., LENBURG, MARC E.
Abandoned legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/16Primer sets for multiplex assays

Definitions

  • the present invention is directed to diagnostic and prognostic methods by using analysis of gene group expression patterns in a subject. More specifically, the invention is directed to diagnostic and prognostic methods for detecting lung diseases, particularly lung cancer in subjects, preferably humans that have been exposed to air pollutants.
  • Lung disorders represent a serious health problem in the modern society. For example, lung cancer claims more than 150,000 lives every year in the United States, exceeding the combined mortality from breast, prostate and colorectal cancers. Cigarette smoking is the most predominant cause of lung cancer. Presently, 25% of the U.S. population smokes, but only 10% to 15% of heavy smokers develop lung cancer. There are also other disorders associated with smoking such as emphysema. There are also health questions arising from people exposed to smokers, for example, second hand smoke. Former smokers remain at risk for developing such disorders including cancer and now constitute a large reservoir of new lung cancer cases. In addition to cigarette smoke, exposure to other air pollutants such as asbestos, and smog, pose a serious lung disease risk to individuals who have been exposed to such pollutants.
  • other air pollutants such as asbestos, and smog
  • the methods that are currently employed to diagnose lung cancer include chest X-ray analysis, bronchoscopy or sputum cytological analysis, computer tomographic analysis of the chest, and positron electron tomographic (PET) analysis.
  • chest X-ray analysis e.g., chest X-ray analysis
  • bronchoscopy or sputum cytological analysis e.g., computer tomographic analysis of the chest
  • PET positron electron tomographic
  • the present invention provides compositions and methods for diagnosis and prognosis of lung diseases which provides a diagnostic test that is both very sensitive and specific.
  • Our preferred groups are the groups of 96 (Table 1), 84 (Table 2), 50 (Table 3), 36 (Table 4), 80 (Table 5), 535 (Table 6) and 20 (Table 7). In some instances, we have found that one can enhance the accuracy of the diagnosis by adding certain additional genes to any of these specific groups. When one uses these groups, the genes in the group are compared to a control or a control group.
  • the control groups can be non-smokers, smokers, or former smokers.
  • comparing can be performed in the biological sample from a smoker against a control group of smokers who do not have lung cancer.
  • the transcripts or expression products against the control for increased expression or decreased expression, which depends upon the particular gene and is set forth in the tables—not all the genes surveyed will show an increase or decrease. However, at least 50% of the genes surveyed must provide the described pattern. Greater reliability if obtained as the percent approaches 100%.
  • the invention provides a group of genes the expression of which is altered in individuals who are at risk of developing lung diseases, such as lung cancer, because of the exposure to air pollutants.
  • the invention also provides groups of genes the expression of which is consistently altered as a group in individuals who are at risk of developing lung diseases because of the exposure to air pollutants.
  • the present invention provides gene groups the expression pattern or profile of which can be used in methods to diagnose lung diseases, such as lung cancer and even the type of lung cancer, in more than 60%, preferably more than 65%, still more preferably at least about 70%, still more preferably about 75%, or still more preferably about 80%-95% accuracy from a sample taken from airways of an individual screened for a lung disease, such as lung cancer.
  • the invention provides a method of diagnosing a lung disease such as lung cancer using a combination of bronchoscopy and the analysis of gene expression pattern of the gene groups as described in the present invention.
  • the invention provides gene groups that can be used in diagnosis and prognosis of lung diseases.
  • the invention provides groups of genes the expression profile of which provides a diagnostic and or prognostic test to determine lung disease in an individual exposed to air pollutants.
  • the invention provides groups of genes the expression profile of which can distinguish individuals with lung cancer from individuals without lung cancer.
  • the invention provides an early asymptomatic screening system for lung cancer by using the analysis of the disclosed gene expression profiles. Such screening can be performed, for example, in similar age groups as colonoscopy for screening colon cancer. Because early detection in lung cancer is crucial for efficient treatment, the gene expression analysis system of the present invention provides a vastly improved method to detect tumor cells that cannot yet be discovered by any other means currently available.
  • the probes that can be used to measure expression of the gene groups of the invention can be nucleic acid probes capable of hybridizing to the individual gene/transcript sequences identified in the present invention, or antibodies targeting the proteins encoded by the individual gene group gene products of the invention.
  • the probes are preferably immobilized on a surface, such as a gene or protein chip so as to allow diagnosis and prognosis of lung diseases in an individual.
  • the invention provides a group of genes that can be used as individual predictors of lung disease. These genes were identified using probabilities with a t-test analysis and show differential expression in smokers as opposed to non-smokers.
  • the group of genes comprise ranging from 1 to 96, and all combinations in between, for example 5, 10, 15, 20, 25, 30, for example at least 36, at least about, 40, 45, 50, 60, 70, 80, 90, or 96 gene transcripts, selected from the group consisting of genes identified by the following GenBank sequence identification numbers (the identification numbers for each gene are separated by “;” while the alternative GenBank ID numbers are separated by “///”): NM_003335; NM_000918; NM_006430.1; NM_001416.1; NM_004090; NM_006406.1; NM_003001.2; NM_001319; NM_006545.1; NM_021145.1; NM_002437.1; NM_006286; NM_001003698 //
  • the gene/transcript analysis comprises a group of about 10-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80, 80-90, 90-100, 100-120, 120-140, 140-150, 150-160, 160-170, 170-180, 180-190, 190-200, 200-210, 210-220, 220-230, 230-240, 240-250, 250-260, 260-270, 270-280, 280-290, 290-300, 300-310, 310-320, 320-330, 330-340, 340-350, 350-360, 360-370, 370-380, 380-390, 390-400, 400-410, 410-420, 420-430, 430-440, 440-450, 450-460, 460-470, 470-480, 480-490, 490-500, 500-510, 510-520, 520-530, and up to about 535 genes selected from the group consisting of genes or transcripts
  • the genes are selected from the group consisting of genes or transcripts as shown in Table 5.
  • the genes are selected from the genes or transcripts as shown in Table 7.
  • the transcript analysis gene group comprises a group of individual genes the change of expression of which is predictive of a lung disease either alone or as a group, the gene transcripts selected from the group consisting of NM_007062.1; NM_001281.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; NM_002268 /// NM_032771; NM_007048 /// NM_194441; NM_006694; U85430.1; NM_004691; AB014576.1; BF218804; BE467941; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1
  • the gene group comprises a probe set capable of specifically hybridizing to at least all of the 36 gene products.
  • Gene product can be mRNA which can be recognized by an oligonucleotide or modified oligonucleotide probe, or protein, in which case the probe can be, for example an antibody specific to that protein or an antigenic epitope of the protein.
  • the invention provides a gene group, wherein the expression pattern of the group of genes provides diagnostic for a lung disease.
  • the gene group comprises gene transcripts encoded by a gene group consisting of at least for example 5, 10, 15, 20, 25, 30, preferably at least 36, still more preferably 40, still more preferably 45, and still more preferably 46, 47, 48, 49, or all 50 of the genes selected from the group consisting of and identified by their GenBank identification numbers: NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U 93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC
  • the invention provides a group of about 30-180, preferably, a group of about 36-150 genes, still more preferably a group of about 36-100, and still more preferably a group of about 36-50 genes, the expression profile of which is diagnostic of lung cancer in individuals who smoke.
  • the invention provides a group of genes the expression of which is decreased in an individual having lung cancer.
  • the group of genes comprises at least 5-10, 10-15, 15-20, 20-25 genes selected from the group consisting of NM_000918; NM_006430.1; NM_001416.1; NM_004090; NM_006406.1; NM_003001.2; NM_006545.1; NM_002437.1; NM_006286; NM_001123 /// NM_006721; NM_024824; NM_004935.1; NM_001696; NM_005494 /// NM_058246; NM_006368; NM_002268 /// NM_032771; NM_006694; NM_004691; NM_012394; NM_021800; NM_016049; NM_138387; NM_024531; and NM_018509.
  • One or more other genes can be added
  • the group of genes comprises genes selected from the group consisting of NM_014182.1; NM_001281.1; NM_024006.1; AF135421.1; L76200.1; NM_000346.1; BC008710.1; BC000423.2; BC008710.1; NM_007062; BC075839.1 /// BC073760.1; BC072436.1 /// BC004560.2; BC001016.2; BC005023.1; BC000360.2; BC007455.2; BC023528.2 /// BC047680.1; BC064957.1; BC008710.1; BC066329.1; BC023976.2; BC008591.2 /// BC050440.1 /// BC048096.1; and BC028912.1.
  • the group of genes comprises genes selected from the group consisting of NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC005969.1; NM_002271.1; AL566172; and AB014576.1.
  • the invention provides a group of genes the expression of which is increased in an individual having lung cancer.
  • the group of genes comprises genes selected from the group consisting of NM_003335; NM_001319; NM_021145.1; NM_001003698 /// NM_001003699 ///; NM_002955; NM_002853.1; NM_019067.1; NM_024917.1; NM_020979.1; NM_005597.1; NM_007031.1; NM_009590.1; NM_020217.1; NM_025026.1; NM_014709.1; NM_014896.1; AF010144; NM_005374.1; NM_006534 /// NM_181659; NM_014033; NM_016138; NM_007048 /// NM_194441; NM_000051 /// NM_138292 /// NM_13
  • the group of genes comprises genes selected from the group consisting of NM_030757.1; R83000; AK021571.1; NM_17932.1; U85430.1; AI683552; BC002642.1; AW024467; NM_030972.1; BC021135.1; AL161952.1; AK026565.1; AK023783.1; BF218804; AK023843.1; BC001602.1; BC034707.1; BC064619.1; AY280502.1; BC059387.1; BC061522.1; U50532.1; BC006547.2; BC008797.2; BC000807.1; AL080112.1; BC033718.1 /// BC046176.1 ///; BC038443.1; Hs.288575 (UNIGENE ID); AF020591.1; BC002503.2; BC009185.2; Hs.528304 (UNIGENE ID); U50532.1; BC013923.2; BC031091; Hs.249591
  • the group of genes comprises genes selected from the group consisting of BF218804; AK022494.1; AA114843; BE467941; NM_003541.1; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AU147182; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_019023.1; NM_021971.1; NM_014128.1; AK025651.1; AA133341; and AF198444.1.
  • the invention provides a method for diagnosing a lung disease comprising obtaining a nucleic acid sample from lung, airways or mouth of an individual exposed to an air pollutant, analyzing the gene transcript levels of one or more gene groups provided by the present invention in the sample, and comparing the expression pattern of the gene group in the sample to an expression pattern of the same gene group in an individual, who is exposed to similar air pollutant but not having lung disease, such as lung cancer or emphysema, wherein the difference in the expression pattern is indicative of the test individual having or being at high risk of developing a lung disease.
  • the decreased expression of one or more of the genes is indicative of the person having a lung disease or being at high risk of developing a lung disease, preferably lung cancer, in the near future and needing frequent follow ups to allow early treatment of the disease.
  • the lung disease is lung cancer.
  • the air pollutant is cigarette smoke.
  • the diagnosis can separate the individuals, such as smokers, who are at lesser risk of developing lung diseases, such as lung cancer by analyzing the expression pattern of the gene groups of the invention provides a method of excluding individuals from invasive and frequent follow ups.
  • the invention provides methods for prognosis, diagnosis and therapy designs for lung diseases comprising obtaining an airway sample from an individual who smokes and analyzing expression profile of the gene groups of the present invention, wherein an expression pattern of the gene group that deviates from that in a healthy age, race, and gender matched smoker, is indicative of an increased risk of developing a lung disease.
  • Tables 1-4 indicate the expression pattern differences as either being down or up as compared to a control, which is an individual exposed to similar airway pollutant but not affected with a lung disease.
  • the invention also provides methods for prognosis, diagnosis and therapy designs for lung diseases comprising obtaining an airway sample from a non-smoker individual and analyzing expression profile of the gene groups of the present invention, wherein an expression pattern of the gene group that deviates from that in a healthy age, race, and gender matched smoker, is indicative of an increased risk of developing a lung disease.
  • the analysis is performed from a biological sample obtained from bronchial airways.
  • the analysis is performed from a biological sample obtained from buccal mucosa.
  • the analysis is performed using nucleic acids, preferably RNA, in the biological sample.
  • the analysis is performed analyzing the amount of proteins encoded by the genes of the gene groups of the invention present in the sample.
  • the analysis is performed using DNA by analyzing the gene expression regulatory regions of the groups of genes of the present invention using nucleic acid polymorphisms, such as single nucleic acid polymorphisms or SNPs, wherein polymorphisms known to be associated with increased or decreased expression are used to indicate increased or decreased gene expression in the individual. For example, methylation patterns of the regulatory regions of these genes can be analyzed.
  • the present invention provides a minimally invasive sample procurement method for obtaining airway epithelial cell RNA that can be analyzed by expression profiling of the groups of genes, for example, by array-based gene expression profiling.
  • expression profiling of the groups of genes for example, by array-based gene expression profiling.
  • array-based gene expression profiling can be used to diagnose individuals who are already affected with a lung disease, such as lung cancer, or who are at high risk of developing lung disease, such as lung cancer, as a consequence of being exposed to air pollutants.
  • These methods can also be used to identify further patterns of gene expression that are diagnostic of lung disorders/diseases, for example, cancer or emphysema, and to identify subjects at risk for developing lung disorders.
  • the invention further provides a gene group microarray consisting of one or more of the gene groups provided by the invention, specifically intended for the diagnosis or prediction of lung disorders or determining susceptibility of an individual to lung disorders.
  • the invention relates to a method of diagnosing a disease or disorder of the lung comprising obtaining a sample, nucleic acid or protein sample, from an individual to be diagnosed; and determining the expression of group of identified genes in said sample, wherein changed expression of such gene compared to the expression pattern of the same gene in a healthy individual with similar life style and environment is indicative of the individual having a disease of the lung.
  • the invention relates to a method of diagnosing a disease or disorder of the lung comprising obtaining at least two samples, nucleic acid or protein samples, in at least one time interval from an individual to be diagnosed; and determining the expression of the group of identified genes in said sample, wherein changed expression of at least about for example 5, 10, 15, 20, 25, 30, preferably at least about 36, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, or 180 of such genes in the sample taken later in time compared to the sample taken earlier in time is diagnostic of a lung disease.
  • the disease of the lung is selected from the group consisting of asthma, chronic bronchitis, emphysema, primary pulmonary hypertension, acute respiratory distress syndrome, hypersensitivity pneumonitis, eosinophilic pneumonia, persistent fungal infection, pulmonary fibrosis, systemic sclerosis, idiopathic pulmonary hemosiderosis, pulmonary alveolar proteinosis, and lung cancer, such as adenocarcinoma, squamous cell carcinoma, small cell carcinoma, large cell carcinoma, and benign neoplasm of the lung (e.g., bronchial adenomas and hamartomas).
  • asthma chronic bronchitis
  • emphysema primary pulmonary hypertension
  • acute respiratory distress syndrome hypersensitivity pneumonitis
  • eosinophilic pneumonia persistent fungal infection
  • pulmonary fibrosis systemic sclerosis
  • idiopathic pulmonary hemosiderosis pulmonary alveolar proteinosis
  • lung cancer such as a
  • the nucleic acid sample is RNA.
  • the nucleic acid sample is obtained from an airway epithelial cell.
  • the airway epithelial cell is obtained from a bronchoscopy or buccal mucosal scraping.
  • individual to be diagnosed is an individual who has been exposed to tobacco smoke, an individual who has smoked, or an individual who currently smokes.
  • the invention also provides an array, for example, a microarray for diagnosis of a disease of the lung having immobilized thereon a plurality of oligonucleotides which hybridize specifically to genes of the gene groups which are differentially expressed in airways exposed to air pollutants, such as cigarette smoke, and have or are at high risk of developing lung disease, as compared to those individuals who are exposed to similar air pollutants and airways which are not exposed to such pollutants.
  • the oligonucleotides hybridize specifically to one allelic form of one or more genes which are differentially expressed for a disease of the lung.
  • the differentially expressed genes are selected from the group consisting of the genes shown in tables 1-4; preferably the group of genes comprises genes selected from the Table 3.
  • the group of genes comprises the group of at least 20 genes selected from Table 3 and additional 5-10 genes selected from Tables 1 and 2.
  • at least about 10 genes are selected from Table 4.
  • Some aspects of the present invention also provide prognostic and diagnostic methods to assess lung disease risk caused by airway pollutants.
  • the methods according to the present invention use a novel minimally invasive sample procurement method and gene expression-based tools for the diagnosis and prognosis of diseases of the lung, particularly diagnosis and prognosis of lung cancer.
  • Some aspects of the present invention are based on the finding that that there are different patterns of gene expression between smokers and non-smokers.
  • the genes involved can be grouped into clusters of related genes that are reacting to the irritants or pollutants.
  • Epithelial cell gene expression profiles obtained from relatively accessible sites can thus provide important prognostic, diagnostic, and therapeutic information which can be applied to diagnose and treat lung disorders.
  • Some aspects of the present invention are based on characterization of “airway transcriptomes” or a signature gene expression profiles of the airways and identification of changes in this transcriptome that are associated with epithelial exposure to pollutants, such as direct or indirect exposure to cigarette smoke, asbestos, and smog.
  • airway transcriptome gene expression profiles provide information on lung tissue function upon cessation from smoking, predisposition to lung cancer in non-smokers and smokers, and predisposition to other lung diseases.
  • the airway transcriptome expression pattern can be obtained from a non-smoker, wherein deviations in the normal expression pattern are indicative of increased risk of lung diseases.
  • the airway transcriptome expression pattern can also be obtained from a non-smoking subject exposed to air pollutants, wherein deviation in the expression pattern associated with normal response to the air pollutants is indicative of increased risk of developing lung disease.
  • the invention provides an “airway transcriptome” the expression pattern of which is useful in prognostic, diagnostic and therapeutic applications as described herein.
  • airway transcriptome the expression pattern of which is useful in prognostic, diagnostic and therapeutic applications as described herein.
  • the expression patterns of the airway transcriptome are useful in prognosis of lung disease, diagnosis of lung disease and a periodical screening of the same individual to see if that individual has been exposed to risky airway pollutants such as cigarette smoke that change his/her expression pattern.
  • the invention provides distinct airway “expression clusters”, i.e., sub-transcriptomes, comprised of related genes among the 85 genes that can be quickly screened for diagnosis, prognosis or treatment purposes.
  • the invention provides an airway sub-transcriptome comprising mucin genes of the airway transcriptome. Examples of mucin genes include muc 5 subtypes A, B, and C.
  • the invention provides a sub-transcriptome comprising cell adhesion molecules of the airway transcriptome, such as carcinoembryonic antigen-related adhesion molecule 6 and claudin 10 encoding genes.
  • the invention provides a sub-transcriptome comprising detoxification related genes of the airway transcriptome.
  • cytochrome P450 subfamily I dioxin-inducible
  • NADPH dehydrogenase encoding genes
  • upregulation of transcripts of cytochrome P450 subfamily I (dioxin-inducible) encoding genes upregulation of transcripts of cytochrome P450 subfamily I (dioxin-inducible) encoding genes.
  • the invention provides a sub-transcriptome comprising immune system regulation associated genes of the airway transcriptome.
  • immunoregulatory genes include small inducible cytokine subfamily D encoding genes.
  • the invention provides a sub-transcriptome comprising metallothionein genes of the airway transcriptome.
  • metallothionein genes include MTX G, X, and L encodinggenes.
  • the subtranscriptome comprises protooncogenes and oncogenes such as RAB1-1A and CEACAM6.
  • the subtranscriptome includes tumor suppressor genes such as SLIT1, and SLIT2.
  • the invention provides a lung cancer “diagnostic airway transcriptome” comprising 208 genes selected from the group consisting of 208238_x_at-probeset; 216384_x_at-probeset; 217679_x_at-probeset; 216859_x_at-probeset; 211200_s_at-probeset; PDPK1; ADAM28; ACACB; ASMTL; ACVR2B; ADATI; ALMSI; ANK3; ANK3-; DARS; AFURS1; ATP8B1; ABCCI; BTF3; BRD4; CELSR2; CALM31 CAPZB; CAPZBI CFLAR; CTSS; CD24; CBX3; C21orf106; C6orf111; C6orf62; CHC1; DCLRE1C; EML2; EMS1; EPHB6-; EEF2; FGFR3; FLJ20288; FVT1; GGTLA4;
  • the invention provides methods of diagnosing lung cancer in an individual comprising comping a biological sample from the airways of the individual and analyzing the expression of at least 10 genes, preferably at least 50 genes, still more preferably at least 100 genes, still more preferably at least 150 genes, still more preferably at least 200 genes selected from genes of the diagnostic airway transcriptome, wherein deviation in the expression of at least one, preferably at least 5, 10, 20, 50, 100, 150, 200 genes as compared to a control group is indicative of lung cancer in the individual.
  • Deviation is preferably decrease of the transcription of at least one gene selected from the group consisting of of 208238_x_at -probeset; 216384_x_at-probeset; 217679_x_at-probeset; 216859_x_at-probeset; 211200_s_at-probeset; PDPK1; ADAM28; ACACB; ASMTL; ACVR2B; ADATI; ALMS1; ANK3; ANK3; DARS; AFURS1; ATP8B1; ABCCI; BTF3; BRD4; CELSR2; CALM31CAPZB; CAPZB1CFLAR; CTSS; CD24; CBX3; C21orf106; C6orf111; C6orf62; CHC1; DCLREIC; EML2; EMSI; EPHB6; EEF2; FGFR3; FLJ20288; FVT1; GGTLA4; GRP; GLUL; HDGF; Homo sapiens
  • Deviation is preferably increase of the expression of at least one gene selected from the group consisting of 217414_x_at-probeset; 217232_x_at-probeset; ATF3; ASXL2; ARF4L; APGSL; ATP6VOB; BAGI; BTG2; COMT; CTSZ; CGI-128; C14orf87; CLDN3; CYR61; CKAP1; DAF; DAF; DSIPI, DKFZP564G2022; DNAJB9; DDOST; DUSP1; DUSP6; DKC1; EGRI; EIF4EL3; EXT2; GMPPB; GSN; GUKI; HSPA8; Homo sapiens PR02275 mRNA, complete eds; Homo sapiens transcribed sequence with strong similarity to protein ref:NP_006442.2, polyadenylate binding protein-interacting protein 1; HAX1; DKFZP434K046;
  • the genes are referred to using their HUGO names or alternatively the probeset number on Affymetrix (Affymetrix, Inc. (U.S.), Santa Clara, Calif.) probesets.
  • the invention provides methods of prognosis and diagnosis of lung diseases comprising obtaining a biological sample from a subject's airways, analyzing the level of expression of at least one gene of the airway transcriptome, comparing the level of expression of the at least one gene of at least one of the airway transcriptome to the level of expression in a control, wherein deviation in the level of expression in the sample from the control is indicative of increased risk of lung disease.
  • the analysis is performed using expression of at least two genes of the airway transcriptome, more preferably at least three genes, still more preferably at least four to 10 genes, still more preferably at least 10-20 genes, still more preferably at least 20-30, still more preferably at least 30-40, still more preferably at least 40-50, still more preferably at least 50-60, still more preferably at least 60-70, still more preferably at least 70-85 genes is analyzed.
  • the expression level of the genes of one or more of the sub-transcriptomes is analyzed.
  • gene expression of one or more genes belonging to at least two different sub-transcriptome sets is analyzed.
  • gene expression of at least one gene from at least three sub-transcriptome sets is analyzed.
  • gene expression of at least one gene from at least four sub-transcriptome sets is analyzed.
  • gene expression of at least one gene from at least five sub-transcriptome sets is analyzed.
  • the expression analysis according to the methods of the present invention can be performed using nucleic acids, particularly RNA, DNA or protein analysis.
  • the cell samples are preferably obtained from bronchial airways using, for example, endoscopic cytobrush in connection with a fiberoptic bronchoscopy.
  • the cells are obtained from the individual's mouth buccal cells, using, for example, a scraping of the buccal mucosa.
  • the invention provides a prognostic and/or diagnostic immunohistochemical approach, such as a dip-stick analysis, to determine risk of developing lung disease.
  • Antibodies against at least one, preferably more proteins encoded by the genes of the airway transcriptome are either commercially available or can be produced using methods well know to one skilled in the art.
  • the invention further provides an airway transcriptone expression pattern of genes that correlate with time since cigarette discontinuance in former smokers, i.e., the expression of these genes in a healthy smoker returns to normal, or healthy non-smoker levels, after about two years from quitting smoking.
  • genes include: MAGF, GCLC, UTGIAI0, SLIT2, PECI, SLIT1, and TNFSF13. If the transcription of these genes has not returned to the level of a healthy non-smoker, as measured using the methods of the present invention, within a time period of about 1-5 years, preferably about 1.5-2.5 years, the individual with a remaining abnormal expression is at increased risk of developing a lung disease.
  • the invention further provides an airway transcriptome expression pattern of genes the expression of which remains abnormal after cessation from smoking.
  • genes include: CX3CL1, RNAHP, MT1X, MT1L, TU3A, HLF, CYFIP2, PLA2G10, HN1, GMDS, PLEKHB2, CEACAM6, ME1, and DPYSL3.
  • the invention provides methods for prognosis, diagnosis and therapy designs for lung diseases comprising obtaining an airway sample from an individual who smokes and analyzing expression of at least one, preferably at least two, more preferably at least three, still more preferably at least four, still more preferably at least five, still more preferably at least six, seven, eight, and still more preferably at least nine genes of the normal airway transcriptome, wherein an expression pattern of the gene or genes that deviates from that in a healthy age, race, and gender matched smoker, is indicative of an increased risk of developing a lung disease.
  • the invention also provides methods for prognosis, diagnosis and therapy designs for lung diseases comprising obtaining an airway sample from a non-smoker individual and analyzing expression of at least one, preferably at least two, more preferably at least three, still more preferably at least four, still more preferably at least five, still more preferably at least six, seven, eight, and still more preferably at least nine genes of the normal airway transcriptome, wherein an expression pattern of the gene or genes that deviates from that in a healthy age, race, and gender matched non-smoker, is indicative of an increased risk of developing a lung disease.
  • Non-smoking individual whose expression pattern begins to resemble that of a smoker and at increased risk of developing a lung disease.
  • the analysis is performed from a biological sample obtained from bronchial airways. In one embodiment, the analysis is performed from a biological sample obtained from buccal mucosa. In one embodiment, the analysis is performed using nucleic acids, preferably RNA, in the biological sample. In one embodiment, the analysis is performed analyzing the amount of proteins encoded by the genes of the airway transcriptome present in the sample.
  • the analysis is performed using DNA by analyzing the gene expression regulatory regions of the airway transcriptome genes using nucleic acid polymorphisms, such as single nucleic acid polymorphisms or SNPs, wherein polymorphisms known to be associated with increased or decreased expression are used to indicate increased or decreased gene expression in the individual.
  • nucleic acid polymorphisms such as single nucleic acid polymorphisms or SNPs
  • the present invention provides a method for determining whether a subject has or is at risk of developing a lung disorder, comprising (a) obtaining a biological sample comprising epithelial cells from a part of an airway of the subject separate from a lung of the subject; (b) assaying nucleic acid molecules derived from the biological sample to identify a level of gene expression in the biological sample; (c) processing the level of gene expression against a control to determine a deviation in the level of expression; and (d) based on the deviation in (c), determining that the subject has the lung cancer or is at risk of developing the lung disorder.
  • the present invention provides a minimally invasive sample procurement method for obtaining airway epithelial cell RNA that can be analyzed by expression profiling, for example, by array-based gene expression profiling. These methods can be used to determine if airway epithelial cell gene expression profiles are affected by cigarette smoke and if these profiles differ in smokers with and without lung cancer. These methods can also be used to identify patterns of gene expression that are diagnostic of lung disorders/diseases, for example, cancer or emphysema, and to identify subjects at risk for developing lung disorders. All or a subset of the genes identified according to the methods described herein can be used to design an array, for example, a microarray, specifically intended for the diagnosis or prediction of lung disorders or susceptibility to lung disorders. The efficacy of such custom-designed arrays can be further tested, for example, in a large clinical trial of smokers.
  • the invention relates to a method of diagnosing a disease or disorder of the lung comprising obtaining a sample, nucleic acid or protein sample, from an individual to be diagnosed; and determining the expression of one or more of the 85 identified genes in said sample, wherein changed expression of such gene compared to the expression pattern of the same gene in a healthy individual with similar life style and environment is indicative of the individual having a disease of the lung.
  • the invention relates to a method of diagnosing a disease or disorder of the lung comprising obtaining at least two samples, nucleic acid or protein samples, in at least one time interval from an individual to be diagnosed; and determining the expression of one or more of the 85 identified genes in said samples, wherein changed expression of such gene or genes in the sample taken later in time compared to the sample taken earlier in time is diagnostic of a lung disease.
  • the disease of the lung is selected from the group consisting of asthma, chronic bronchitis, emphysema, primary pulmonary hypertension, acute respiratory distress syndrome, hypersensitivity pneumonitis, eosinophilic pneumonia, persistent fungal infection, pulmonary fibrosis, systemic sclerosis, ideopathic pulmonary hemosiderosis, pulmonary alveolar proteinosis, and lung cancer, such as adenocarcinoma, squamous cell carcinoma, small cell carcinoma, large cell carcinoma, and benign neoplasms of the lung (e.g., bronchial adenomas and hamartomas).
  • the nucleic acid sample is RNA.
  • the nucleic acid sample is obtained from an airway epithelial cell.
  • the airway epithelial cell is obtained from a bronchoscopy or buccal mucosal scraping.
  • individual to be diagnosed is an individual who has been exposed to tobacco smoke, an individual who has smoked, or an individual who smokes.
  • the genes are selected from the group consisting of the genes shown in FIGS. 20A-20F ; 21 A- 22 B; and FIG. 24 .
  • the expression of two or more, five or more, ten or more, fifteen or more, twenty or more, fifty or more or one hundred or more informative genes is determined.
  • the expression is determined using a microarry having one or more oligonucleotides (probes) for said one or more genes immobilized thereon.
  • the invention further relates to a method of obtaining a nucleic acid sample for use in expression analysis for a disease of the lung comprising obtaining an airway epithelial cell sample from an individual; and rendering the nucleic acid molecules in said cell sample available for hybridization.
  • the invention also relates to a method of treating a disease of the lung comprising administering to an individual in need thereof an effective amount of an agent which increases the expression of a gene whose expression is decreased in said individual as compared with a normal individual.
  • the invention further relates to a method of treating a disease of the lung comprising administering to an individual in need thereof an effective amount of an agent, which changes the expression of a gene to that expression level seen in a healthy individual having the similar life style and environment, and a pharmaceutically acceptable carrier.
  • the invention also relates to a method of treating a disease of the lung comprising administering to an individual in need thereof an effective amount of an agent which increases the activity of an expression product of such gene whose activity is decreased in said individual as compared with a normal individual.
  • the invention also relates to a method of treating a disease of the lung comprising administering to an individual in need thereof an effective amount of an agent which decreases the activity of an expression product of such gene whose activity is increased in said individual as compared with a normal individual.
  • the invention also provides an array, for example, a microarray for diagnosis of a disease of the lung having immobilized thereon a plurality of oligonucleotides which hybridize specifically to one or more genes which are differentially expressed in airways exposed to air pollutants, such as cigarette smoke, and airways which are not exposed to such pollutants.
  • the oligonucleotides hybridize specifically to one allelic form of one or more genes which are differentially expressed for a disease of the lung.
  • the differentially expressed genes are selected from the group consisting of the genes shown in FIGS. 20A-20F, 21A-21B and FIG. 24 .
  • prognostic and diagnostic methods of the present invention are based on the finding that deviation from the normal expression pattern in the airway transcriptome is indicative of abnormal response of the airway cells and thus predisposes the subject to diseases of the lung. Therefore, all the comparisons as provided in the methods are performed against a normal airway transcriptome of a “normal” or “healthy” individual exposed to the pollutant, as provided by this invention. Examples of these normal expression patterns of the genes belonging to the airway transcriptome of the present invention are provided in FIG. 24 .
  • the invention provides a prognostic method for lung diseases comprising detecting gene expression changes in the cell adhesion regulating genes of the airway transcriptome, wherein decrease in the expression compared with a “normal” smoker expression pattern is indicative of an increased risk of developing a lung disease.
  • cell adhesion regulation related genes include carcinoembryonic antigen-related adhesion molecule 6 and claudin 10 encoding genes.
  • carcinoembryonic antigen-related adhesion molecule 6 and claudin 10 encoding genes include carcinoembryonic antigen-related adhesion molecule 6 and claudin 10 encoding genes.
  • an about at least 2-20 fold, preferably about at least 3 fold, still more preferably at least about 4 fold, still more preferably about at least 5 fold decrease in expression of carcinoembryonic antigen-related adhesion molecule 6 encoding gene is indicative of an increased risk of developing a lung disease.
  • an about 2-20, preferably at least about, 3 fold, still more preferably at least about 4 fold, still more preferably at least about 5 fold decrease in the transcript level of claudin 10 encoding gene is indicative of an increased risk of developing a lung disease.
  • the invention provides a prognostic method for lung diseases comprising detecting gene expression changes in the detoxification related genes of the airway transcriptome, wherein decrease in the expression compared with a “normal” smoker expression pattern is indicative of an increased risk of developing a lung disease.
  • these genes include cytochrome P450 subfamily I (dioxin-inducible) encoding genes, NADPH dehydrogenase encoding genes.
  • cytochrome P450 subfamily I dioxin-inducible
  • the invention provides a prognostic method for lung diseases comprising detecting gene expression changes in the immune system regulation associated genes of the airway transcriptome, wherein increase in the expression compared with a “normal” smoker expression pattern is indicative of an increased risk of developing a lung disease.
  • immunoregulatory genes include small inducible cytokine subfamily D encoding genes. For example, about 1-10 fold difference in the expression of cytokine subfamily D encoding genes is indicative of increased risk of developing lung disease.
  • the difference in expression is least about 2 fold preferably about at least 3 fold, still more preferably at least about 4 fold, still more preferably about at least 5 fold decrease decrease in the expression of small inducible cytokine subfamily D encoding genes is indicative of an increased risk of developing a lung disease.
  • the invention provides a prognostic method for lung diseases comprising detecting gene expression changes in the metalothionein regulation associated genes of the airway transcriptome, wherein decrease in the expression compared with a “normal” smoker is indicative of an increased risk of developing a lung disease.
  • metalothionein regulation associated genes include MTX G, X, and L encoding genes. At least about 1.5-10 fold difference in the expression of these genes in indicative of increased risk of developing lung disease. For example, at least about 1.5 fold., still more preferably at least about 2 fold, still more preferably at least about 2.5 fold, still more preferably at least about 3 fold, still more preferably at least about 4 fold, still more preferably about at least. 5 fold increase in the expression of metalothionein regulation associated genes include MTX G, X, and L encoding genes indicative of an increased risk of developing a lung disease.
  • FIG. 1 shows Table 1, which sets forth a listing a group of 96 genes, their expression profile in lung cancer as compared to an individual not having lung cancer but being exposed to similar environmental stress, i.e. air pollutant, in this example, cigarette smoke. These genes were identified using Student's t-test.
  • FIG. 2 shows Table 2, listing a group of 84 genes, their expression profile in lung cancer as compared to an individual not having lung cancer but being exposed to similar environmental stress, i.e. air pollutant, in this example, cigarette smoke. These genes were identified using Student's t-test.
  • FIG. 3 shows Table 3, listing a group of 50 genes, and their expression profile in lung cancer as compared using a class-prediction model to an individual not having lung cancer but being exposed to similar environmental stress, i.e. air pollutant, in this example, cigarette smoke.
  • FIG. 4 shows Table 4, listing a group of 36 genes, their expression profile in lung cancer as compared to an individual not having lung cancer but being exposed to similar environmental stress, i.e. air pollutant, in this example, cigarette smoke.
  • This group of genes is a combination of predictive genes identified using both Student's t-test and class-prediction model.
  • FIG. 5 shows an example of the results using class prediction model as obtained in Example 1.
  • Training set included 74 samples, and the test set 24 samples.
  • the mean age for the training set was 55 years, and the mean pack years smoked by the training set was 38.
  • the mean age for the test set was 56 years, and the mean pack years smoked by the test set was 41.
  • FIG. 6 shows an example of the 50 gene class prediction model obtained in Example 1.
  • Each square represents expression of one transcript.
  • the transcript can be identified by the probe identifier on the y-axis according to the Affymetrix Human Genome Gene chip U133 probe numbers (see Appendix).
  • the individual samples are identified on the x-axis.
  • the samples are shown in this figure as individuals with lung cancer (“cancer”) and individuals without lung cancer (“no cancer”).
  • the gene expression is shown as higher in darker squares and lower in lighter squares.
  • FIG. 7 shows a comparison of sample-quality metrics.
  • the graph plots the Affymetrix MAS 5.0 percent present (y-axis) versus the z-score derived filter (x-axis).
  • the two metrics have a correlation (R2) of 0.82.
  • FIG. 8 shows distribution of accuracies for real vs. random 1000 runs. Histogram comparing test set class prediction accuracies of 1000 “sample randomized” classifiers generated by randomly assigning samples into training and test sets with true class labels (unshaded) versus 1000 “sample and class randomized” classifiers where the training set class labels were randomized following sample assignment to the training or test set (shaded).
  • FIG. 9 shows classification accuracy as a function of the average prediction strength over the 1000 runs of the algorithm with different training/test sets.
  • FIG. 10A shows the number of times each of the 80-predictive probe sets from the actual biomarker was present in the predictive lists of 80 probe sets derived from 1000 runs of the algorithm.
  • FIG. 10B shows the Number of times a probe set was present in the predictive lists of 80 probe sets derived from 1000 random runs of the algorithm described in Supplemental Table 7.
  • FIG. 11 shows Boxplot of the Prediction Strength values of the test set sample predictions made by the Weighted Voting algorithm across the 1000 runs with different training and test sets.
  • the black boxplots (first two boxes from the left) are derived from the actual training and test set data with correct sample labels, the grey boxplots (last two boxes on the right) are derived from the test set predictions based on training sets with randomized sample labels.
  • FIG. 12 shows homogeneity of gene expression in large airway samples from smokers with lung cancer of varying cell types.
  • Principal Component Analysis was performed on the gene-expression measurements for the 80 genes in our predictor and all of the airway epithelium samples from patients with lung cancer. Gene expression measurements were Z(0,1) normalized prior to PCA.
  • the graph shows the sample loadings for the first two principal components which together account for 58% of the variation among samples from smokers with cancer. There is no apparent separation of the samples with regard to lung tumor subtype.
  • Four genes IL8, FOS, TPD52, and RAB1A were found to be up-regulated in cancer group on both microarray and RT-PCR platforms; three genes (DCLRE1C, BACH2, and DUOX1) were found to be down-regulated in cancer group on both platforms.
  • FIG. 15 shows hierarchical clustering of class-predictor genes.
  • Z-score-normalized gene-expression measurements of the eighty class-predictor genes in the 52 test-set samples are shown in a false-color scale and organized from top to bottom by hierarchical clustering.
  • the Affymetrix U133A probeset ID and HUGO symbol are given to the right of each gene.
  • the test-set samples are organized from left to right first by whether the patient had a clinical diagnosis of cancer. Within these two groups, the samples are organized by the accuracy of the class-predictor diagnosis (samples classified incorrectly are on the right shown in dark green). 43/52 (83%) test samples are classified correctly.
  • the sample ID is given at the top of each column
  • the prediction strength of each of the diagnoses made by the class-prediction algorithm is indicated in a false-color scale immediately below the prediction accuracy.
  • Prediction strength is a measure of the level of diagnostic confidence and varies on a continuous scale from 0 to 1 where 1 indicates a high degree of confidence.
  • FIG. 16 shows a Comparison of Receiver Operating Characteristic (ROC) curves.
  • Sensitivity y-axis
  • 1-Specificity x-axis
  • the solid black line represents the ROC curve for the airway gene expression classifier.
  • the dotted black line represents the average ROC curve for 1000 classifiers derived by randomizing the training set class labels (“class randomized”).
  • FIG. 17 shows the Principal Component Analysis (PCA) of biomarker gene expression in lung tissue samples.
  • the 80 biomarker probesets were mapped to 64 probesets in the Bhattacharjee et al. HGU95Av2 microarray dataset of lung cancer and normal lung tissue.
  • the PCA is a representation of the overall variation in expression of the 64 biomarker probesets.
  • the normal lung samples (NL) are represented in green, the adenocarcinomas (AD) in red, the small cells (SC) in blue, and the squamous (SQ) lung cancer samples in yellow.
  • FIGS. 18A-18C show data obtained in this study.
  • FIG. 18A shows bronchoscopy results for the 129 patients in the study. Only 32 of the 60 patients that had a final diagnosis of cancer had bronchoscopies that were diagnostic of lung cancer. The remaining 97 samples had bronchoscopies that were negative for lung cancer including 5 that had a definitive alternate benign diagnosis. This resulted in 92 patients with non-diagnostic bronchoscopy that required further tests and/or clinical follow-up.
  • FIG. 18B shows biomarker prediction results. 36 of the 92 patients with non-diagnostic bronchoscopies exhibited a gene expression profile that was positive for lung cancer.
  • FIG. 18C shows combined test results.
  • a positive test result from either bronchoscopy or gene expression is considered indicative of lung cancer a sensitivity of 95% (57 of 60 cancer patients) with only a 16% false positive rate (11 of 69 non-cancer patients) is achieved.
  • the shading of each contingency table is reflective of the overall fraction of each sample type in each quadrant.
  • FIGS. 19A-19B show a comparison of bronchoscopy and biomarker prediction by A) cancer stage or B) cancer subtype. Each square symbolizes one patient sample. The upper half represents the biomarker prediction accuracy and the lower half represents the bronchoscopy accuracy. Not all cancer samples are represented in this figure.
  • FIG. 19A includes only Non Small Cell cancer samples that could be staged using the TMN system (48 of the 60 total cancer samples).
  • FIG. 19B includes samples that could be histologically classified as Adenocarcinoma, Squamous Cell Carcinoma and Small Cell Carcinoma (45 of the 60 total cancer samples).
  • FIGS. 20A-20F show a list of genes which are differentially expressed in smokers and non-smokers. T-test statistical results are shown.
  • FIGS. 21A-21G show a list of genes which are differentially expressed insmokers and smokers with lung cancer. T-test statistical results are shown.
  • FIG. 22 is a schematic diagram showing an example of loss of heterozygosity analysis.
  • FIG. 23 is a graph showing fractional allelic loss in smokers and non-smokers.
  • FIG. 24A and FIG. 24B shows clustering of current and never smoker samples.
  • Hierarchical clustering of current (n-34) and never (n 23) smokers according to the expression of the 97 probesets representing the 85 genes differentially expressed between current and never smokers. While current and never smokers separate into 2 groups, three current smokers appear to cluster with never smokers (rectangle). Expression of a number of redox-related and xenobiotic genes in these subjects was not increased (brackets) and therefore resembled that of never smokers despite substantial smoke exposure.
  • FIGS. 25A-25B show a multidimensional scaling plot of current, never, and former smoker samples.
  • Multidimensional scaling plot of current (lighter grey boxes), never (medium grey boxes, mainly clustered on the left hand side of the graph) and former smokers (darkest grey boxes) in 97 dimensional space according to the expression of the 97 probesets reflecting the 85 differentially expressed genes between current and never smokers.
  • FIG. 25A illustrates that current and never smokers separate into their 2 classes according to the expression of these genes.
  • FIG. 25B shows that when former smokers are plotted according to the expression of these genes, a majority of former smokers appear to group more closely to never smokers. There are, however, a number of former smokers who group more closely to current smokers (black circle).
  • the only clinical variable that differed between the 2 groups of former smokers was length of smoking cessation (p ⁇ 0.05), with formers smokers who quit within 2 years clustering with current smokers.
  • the MDS plots are reduced dimension representations of the data and the axes on the figure have no units.
  • FIG. 26 shows genes expression of which is irreversibly altered by cigarette smoke.
  • Samples are grouped according to smoking status and length of smoking cessation (samples are not being clustered and thus there is no dendogram on the sample axis).
  • FIGS. 27A-27C show Scatterplots of spatial ( FIG. 27A ) and temporal ( FIG. 27B ) replicate samples (2 fold, 10 fold and 30 fold lines of change shown; axes are log scaled). Histogram of fold changes computed between all replicates and between unrelated samples ( FIG. 27C ).
  • FIG. 29 shows variability in gene expression in the normal airway transcriptome. This histogram shows the number of genes in the normal airway transcriptome ( ⁇ 7100 genes whose median detection p value ⁇ 0.05) according to their coefficient of variation (standard deviation/mean*100) across the 23 healthy never smokers. Approximately 90% of the genes have a coefficient of variation below 50%
  • FIG. 30 shows hierarchical clustering of all 18 former smokers according to the expression of the top 97 probesets that were differentially expressed between current and never smokers.
  • the only clinical variable that statistically differed (p ⁇ 0.05) between the 2 molecular subclasses of former smokers was length of smoking cessation.
  • Patient ID denoted with “F”
  • time since patient quit smoking in years
  • FIGS. 31A-31E show real time QRT-PCR and microarray data for select genes that were found to be differentially expressed between current and never smokers on microarray analysis. Fold change is relative to one of the never smokers.
  • NQO1 NAD(P)H dehydrogenase, quinone 1, FIG. 31A
  • ALDH3A1 aldehyde dehydrogenase 3 family, member A1, FIG. 31B
  • CYPIB1 cytochrome P450, subfamily I (dioxin-inducible), polypeptide 1 FIG. 31C
  • CEACAMS carcinoembryonic antigen-related cell adhesion molecule 5 FIG.
  • FIG. 32 shows a table of genes present in bronchial epithelial cells that should be expressed in bronchial epithelial cells.
  • FIG. 33 shows genes absent in bronchial epithelial cells that should not be expressed in bronchial airway epithelial cells.
  • FIG. 35 shows analysis of replicates. Pearson correlation coefficients were computed between replicate samples, between samples from the same group (never or current smoker), and between samples from two different groups (never versus current smoker). The mean R squared values from the analyses are reported.
  • FIG. 36A-36C show multiple linear regression results performed on the top 10 percent most variable genes (calculated using the coefficient of variation) in the normal airway transcriptome.
  • FIGS. 37A-37B show genes correlated with pack-years among current smokers (p ⁇ 0.0001). Pearson correlation for gene expression and pack-years smoking. R-values and p-values for 51 genes that were tightly correlated with pack-years among current smokers are reported. The 5 genes shown in bold are the genes whose expression is most significantly correlated to pack-years as assessed by a permutation analysis.
  • FIG. 38A-38B show summary of analysis of genes irreversibly altered by cigarette smoke. At-test was performed between former and never smoker across all 9968 genes, and 44 genes were found to have a p value threshold below 0.00098. These 44 genes are listed in the table according to their p value on t-test between current and never smokers, as the intersection of these 2 t-tests (former vs. never and current vs. never) correspond to irreversibly altered genes. Fifteen genes (shown in bold) were found to be irreversible altered by cigarette smoking given that they are in common with the list of 97 probesets significantly differentially expressed between current and never smokers. In addition to the 15 genes, 12 more genes had at-test p value between current and never smokers of less than 0.001, and only 7 of the 44 genes had p values between current and never smokers of greater than 0.05.
  • FIGS. 39A-39B show ANCOVA and 2 way ANOVA.
  • An ANCOVA was performed to test the effect of smoking status (never or current) on gene expression while controlling for the effect of age (the covariate).
  • a two-way ANOVA was performed to test the effect of smoking status (never or current) on gene expression while controlling for the fixed effects of race (encoded as three racial groups: Caucasian, African American, and other) or gender and the interaction terms of status:race or status:gender.
  • the table lists the genes found to be significantly different between never and current smokers controlling for the effects of age, race, and gender. Many of the genes listed are labeled “common” because they are also found in the set of 97 probesets found to be significantly different between never and current smokers based on at-test analysis.
  • FIG. 40 shows a multidimensional scaling plot of all smokers with and without cancer plotted in 208 dimensional space according to the expression of the 208 genes that distinguish the 2 classes on t-test.
  • the present invention is directed to gene/transcript groups and methods of using the expression profile of these gene/transcript groups in diagnosis and prognosis of lung diseases.
  • Table 6 One example of the gene transcript groups useful in the diagnostic/prognostic tests of the invention is set forth in Table 6. We have found that taking any group that has at least 20 of the Table 6 genes provides a much greater diagnostic capability than chance alone.
  • Our preferred groups are the groups of 96 (Table 1), 84 (Table 2), 50 (Table 3), 36 (Table 4), 80 (Table 5), 535 (Table 6) and 20 (Table 7). In some instances, we have found that one can enhance the accuracy of the diagnosis by adding additional genes to any of these specific groups.
  • any one or more genes and or transcripts from Table 7 may be added as a lung cancer marker for a gene expression analysis.
  • the genes in the group are compared to a control or a control group.
  • the control groups can be non-smokers, smokers, or former smokers.
  • comparing can be performed in the biological sample from a smoker against a control group of smokers who do not have lung cancer.
  • At least 50% of the genes surveyed must provide the described pattern. Greater reliability if obtained as the percent approaches 100%.
  • the presently described gene expression profile can also be used to screen for individuals who are susceptible for lung cancer. For example, a smoker, who is over a certain age, for example over 40 years old, or a smoker who has smoked, for example, a certain number of years, may wish to be screened for lung cancer.
  • the gene expression analysis as described herein can provide an accurate very early diagnosis for lung cancer. This is particularly useful in diagnosis of lung cancer, because the earlier the cancer is detected, the better the survival rate is.
  • the group of 80 genes as presented in Table 5 are part of the most frequently chosen genes across 1000 statistical test runs (see Examples below for more details regarding the statistical testing).
  • the 535 genes of Table 6 in our data show up more than 67 times out of 1000.
  • All the 80 genes in Table 5 form a subset of the 535 genes.
  • Table 7 shows the top 20 genes which are subset of the 535 list. The direction of change in expression is shown using signal to noise ratio.
  • a negative number in Tables 5, 6, and 7 means that expression of this gene or transcript is up in lung cancer samples. Positive number in Table 5, 6, and 7, indicates that the expression of this gene or transcript is down in lung cancer.
  • any combination of the genes and/or transcripts of Table 6 can be used.
  • Table 7 provides 20 of the most frequently variably expressed genes in lung cancer when compared to samples without cancer. Accordingly, in one embodiment, any combination of about 3-5, 5-10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or all 20 genes and/or transcripts of Table 7, or any sub-combination thereof are used.
  • the invention provides a gene group the expression profile of which is useful in diagnosing lung diseases and which comprises probes that hybridize ranging from 1 to 96 and all combinations in between for example 5, 10, 15, 20, 25, 30, 35, at least about 36, at least to 40, at lest to 50, at least to 60, to at least 70, to at least 80, to at least 90, or all of the following 96 gene sequences: NM_003335; NM_000918; NM_006430.1; NM_001416.1; NM_004090; NM_006406.1; NM_003001.2; NM_001319; NM_006545.1; NM_021145.1; NM_002437.1; NM_006286; NM_001003698 /// NM_001003699 /// NM_002955; NM_001123 /// NM_006721; NM_024824; NM_004935.1; NM_002853.1; NM_019067.1
  • the invention provides a gene group the expression profile of which is useful in diagnosing lung diseases and comprises probes that hybridize to at least, for example, 5, 10, 15, 20, 25, 30, 35, at least about 36, at least to 40, at least to 50, at least to 60, to at least 70, to at least 80, to all of the following 84 gene sequences: NM_030757.1; R83000; AK021571.1; NM_014182.1; NM_17932.1; U85430.1; AI683552; BC002642.1; AW024467; NM_030972.1; BC021135.1; AL161952.1; AK026565.1; AK023783.1; BF218804; NM_001281.1; NM_024006.1; AK023843.1; BC001602.1; BC034707.1; BC064619.1; AY280502.1; BC059387.1; AF135421.1; BC061522.1; L76200.1; U50532.1; BC006547.2
  • the invention provides a gene group the expression profile of which is useful in diagnosing lung diseases and comprises probes that hybridize to at least, for example 5, 10, 15, 20, 25, 30, preferably at least about 36, still more preferably at least to 40, still more preferably at lest to 45, still more preferably all of the following 50 gene sequences, although it can include any and all members, for example, 20, 21, 22, up to and including 36: NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC005969.1; NM_00
  • the invention provides a gene group the expression profile of which is useful in diagnosing lung diseases and comprises probes that hybridize to at least for example 5, 10, 15, 20, preferably at least about 25, still more preferably at least to 30, still more preferably all of the following 36 gene sequences: NM_007062.1; NM_001281.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; NM_002268 /// NM_032771; NM_007048 /// NM_194441; NM_006694; U85430.1; NM_004691; AB014576.1; BF218804; BE467941; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AL080112.1; AW971983;
  • a probe set useful in the methods of the present invention is selected from the nucleic acid probes of between 10-15, 15-20, 20-180, preferably between 30-180, still more preferably between 36-96, still more preferably between 36-84, still more preferably between 36-50 probes, included in the Affymetrix Inc.
  • the analysis can be performed as follows.
  • samples of non-smokers and smokers, non-asbestos exposed individuals and asbestos-exposed individuals, non-smog exposed individuals and smog-exposed individuals, smokers without a lung disease and smokers with lung disease to obtain the differentially expressed gene groups between individuals with no lung disease and individuals with lung disease.
  • the obtained expression analysis such as microarray or microbead raw data consists of signal strength and detection p-value.
  • One normalizes or scales the data, and filters the poor quality chips/bead sets based on images of the expression data, control probes, and histograms.
  • One filters contaminated specimens which contain non-epithelial cells.
  • one filters the genes of importance using detection p-value.
  • This results in identification of transcripts present in normal airways (normal airway transcriptome). Variability and multiple regression analysis can be used. This also results in identification of effects of smoking on airway epithelial cell transcription.
  • T-test and Pearson correlation analysis One can also identify a group or a set of transcripts that are differentially expressed in samples with lung disease, such as lung cancer and samples without cancer. This analysis was performed using class prediction models.
  • a weighted voting method For analysis of the data, one can use, for example, a weighted voting method.
  • Committees of variable sizes of the top ranked genes are used to evaluate test samples, but genes with more significant p-values can be more heavily weighed.
  • the accuracy can be validated using cross-valid
  • Table 1 shows 96 genes that were identified as a group distinguishing smokers with cancer from smokers without cancer. The difference in expression is indicated at the column on the right as either “down”, which indicates that the expression of that particular transcript was lower in smokers with cancer than in smokers without cancer, and “up”, which indicates that the expression of that particular transcript was higher in smokers with cancer than smokers without cancer.
  • the exemplary probes shown in the column “Affymetrix Id in the Human Genome U133 chip” can be used. Sequences for the Affymetrix probes are provided in the Appendix.
  • Table 2 shows one preferred 84 gene group that was identified as a group distinguishing smokers with cancer from smokers without cancer. The difference in expression is indicated at the column on the right as either “down”, which indicates that the expression of that particular transcript was lower in smokers with cancer than in smokers without cancer, and “up”, which indicates that the expression of that particular transcript was higher in smokers with cancer than smokers without cancer.
  • the exemplary probes shown in the column “Affymetrix Id in the Human Genome U133 chip” can be used in the expression analysis.
  • BF218804 AFURS1 ATPase family homolog down 212297_at up-regulated in senescence cells NM_001281.1 CKAP1 cytoskeleton associated up 201804_x_at protein 1 NM_024006.1 IMAGE3455200 hypothetical protein up 217949_s_at IMAGE3455200 AK023843.1 PGF placental growth factor, down 215179_x_at vascular endothelial growth factor-related protein BC001602.1 CFLAR CASP8 and FADD-like down 211316_x_at apoptosis regulator BC034707.1 — Homo sapiens down 217653_x_at transcribed sequence with weak similarity to protein ref: NP_060312.1 ( H.
  • Hs.528304 ADAM28 a disintegrin and down 208268_at (UNIGENE ID) metalloproteinase domain 28 U50532.1 CG005 hypothetical protein down 221899_at from BCRA2 region BC013923.2 SOX2 SRY (sex determining down 213721_at region Y)-box 2 BC031091 ODAG ocular development- down 214718_at associated gene NM_007062 PWP1 nuclear phosphoprotein up 201608_s_at similar to S.
  • BC008591.2 /// KIAA0100 KIAA0100 gene up 201729_s_at BC050440.1 /// product BC048096.1 AF365931.1 ZNF264 zinc finger protein 264 down 205917_at AF257099.1 PTMA prothymosin, alpha down 200772_x_at (gene sequence 28) BC028912.1 DNAJB9 DnaJ (Hsp40) homolog, up 202842_s_at subfamily B, member 9
  • Table 3 shows one preferred 50 gene group that was identified as a group distinguishing smokers with cancer from smokers without cancer. The difference in expression is indicated at the column on the right as either “down”, which indicates that the expression of that particular transcript was lower in smokers with cancer than in smokers without cancer, and “up”, which indicates that the expression of that particular transcript was higher in smokers with cancer than smokers without cancer.
  • This gene group was identified using the GenePattern server from the Broad Institute, which includes the Weighted Voting algorithm. The default settings, i.e., the signal to noise ratio and no gene filtering, were used.
  • the exemplary probes shown in the column “ Affymetrix Id in the Human Genome U133 chip” can be used in the expression analysis.
  • Table 4 shows one preferred 36 gene group that was identified as a group distinguishing smokers with cancer from smokers without cancer. The difference in expression is indicated at the column on the right as either “down”, which indicates that the expression of that particular transcript was lower in smokers with cancer than in smokers without cancer, and “up”, which indicates that the expression of that particular transcript was higher in smokers with cancer than smokers without cancer.
  • the exemplary probes shown in the column “Affymetrix Id in the Human Genome U133 chip” can be used in the expression analysis.
  • the gene group of the present invention comprises at least, for example, 5, 10, 15, 20, 25, 30, more preferably at least 36, still more preferably at least about 40, still more preferably at least about 50, still more preferably at least about 60, still more preferably at least about 70, still more preferably at least about 80, still more preferably at least about 86, still more preferably at least about 90, still more preferably at least about 96 of the genes as shown in Tables 1-4.
  • the gene group comprises 36-180 genes selected from the group consisting of the genes listed in Tables 1-4.
  • the invention provides group of genes the expression of which is lower in individuals with cancer.
  • the invention provides of a group of genes useful in diagnosing lung diseases, wherein the expression of the group of genes is lower in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-30, still more preferably at least about 30-40, still more preferably at least about 40-50, still more preferably at least about 50-60, still more preferably at least about 60-70, still more preferably about 72 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 1): NM_003335; NM_001319; NM_021145.1; NM_001003698 /// NM_001003699 ///; NM_002955; NM_002853.1; NM_019067.1; NM_024917.1; NM
  • the invention provides of a group of genes useful in diagnosing lung diseases wherein the expression of the group of genes is lower in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-30, still more preferably at least about 30-40, still more preferably at least about 40-50, still more preferably at least about 50-60, still more preferably about 63 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 2): NM_030757.1; R83000; AK021571.1; NM_17932.1; U85430.1; AI683552; BC002642.1; AW024467; NM_030972.1; BC021135.1; AL161952.1; AK026565.1; AK023783.1; BF218804; AK0
  • the invention provides of a group of genes useful in diagnosing lung diseases wherein the expression of the group of genes is lower in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-25, still more preferably about 25 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 3): BF218804; AK022494.1; AA114843; BE467941; NM_003541.1; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AU147182; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_01
  • the invention provides of a group of genes useful in diagnosing lung diseases wherein the expression of the group of genes is higher in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least to 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-25, still more preferably about 25 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 1): NM_000918; NM_006430.1; NM_001416.1; NM_004090; NM_006406.1; NM_003001.2; NM_006545.1; NM_002437.1; NM_006286; NM_001123 /// NM_006721; NM_024824; NM_004935.1; NM_001696; NM_005494 /// NM_058246;
  • the invention provides of a group of genes useful in diagnosing lung diseases wherein the expression of the group of genes is higher in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least to 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-23, still more preferably about 23 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 2): NM_014182.1; NM_001281.1; NM_024006.1; AF135421.1; L76200.1; NM_000346.1; BC008710.1; BC000423.2; BC008710.1; NM_007062; BC075839.1 /// BC073760.1; BC072436.1 /// BC004560.2; BC001016.2; BC005023.1; BC000360.2; BC007455.2; BC023528.
  • the invention provides of a group of genes useful in diagnosing lung diseases wherein the expression of the group of genes is higher in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least to 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-25, still more preferably about 25 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 3): NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1;
  • the invention provides a method of diagnosing lung disease comprising the steps of measuring the expression profile of a gene group in an individual suspected of being affected or being at high risk of a lung disease (i.e. test individual), and comparing the expression profile (i.e. control profile) to an expression profile of an individual without the lung disease who has also been exposed to similar air pollutant than the test individual (i.e.
  • Groups of about 36 genes as shown in table 4, about 50 genes as shown in table 3, about 84 genes as shown in table 2 and about 96 genes as shown in table 1 are preferred.
  • the different gene groups can also be combined, so that the test individual can be screened for all, three, two, or just one group as shown in tables 1-4.
  • the expression profile of a test individual exposed to cigarette smoke is compared to the expression profile of the 50 genes shown in table 3, using the Affymetrix inc probe set on a gene chip as shown in table 3, the expression profile that is similar to the one shown in FIG. 10 for the individuals with cancer, is indicative that the test individual has cancer.
  • the expression profile is more like the expression profile of the individuals who do not have cancer in FIG. 10 , the test individual likely is not affected with lung cancer.
  • the group of 50 genes was identified using the GenePattern server from the Broad Institute, which includes the Weighted Voting algorithm. The default settings, i.e., the signal to noise ratio and no gene filtering, were used. GenePattern is available through the World Wide Wed at location broad.mit.edu/cancer/software/genepattern. This program allows analysis of data in groups rather than as individual genes. Thus, in one preferred embodiment, the expression of substantially all 50 genes of Table 3, are analyzed together.
  • the expression pattern of all the genes in the Table 3 is analyzed.
  • 1, 2, 3, 4, 5, 6, 7, 8, 9, 10-15, 15-20, 20-30, or more of the individual predictor genes identified using the t-test analysis are analyzed. Any combination of, for example, 5-10 or more of the group predictor genes and 5-10, or more of the individual genes can also be used.
  • expression profile refers to the amount of the gene product of each of the analyzed individual genes in the sample.
  • the “expression profile” is like a signature expression map, like the one shown for each individual in FIG. 10 , on the Y-axis.
  • lung disease refers to disorders including, but not limited to, asthma, chronic bronchitis, emphysema, bronchietasis, primary pulmonary hypertension and acute respiratory distress syndrome.
  • the methods described herein may also be used to diagnose or treat lung disorders that involve the immune system including, hypersensitivity pneumonitis, eosinophilic pneumonias, and persistent fungal infections, pulmonary fibrosis, systemic sclerosis, idiopathic pulmonary hemosiderosis, pulmonary alveolar proteinosis, cancers of the lung such as adenocarcinoma, squamous cell carcinoma, small cell and large cell carcinomas, and benign neoplasm of the lung including bronchial adenomas and hamartomas.
  • the lung disease is lung cancer.
  • the biological samples useful according to the present invention include, but are not limited to tissue samples, cell samples, and excretion samples, such as sputum or saliva, of the airways.
  • tissue samples such as sputum or saliva
  • the samples useful for the analysis methods according to the present invention can be taken from the mouth, the bronchial airways, and the lungs.
  • air pollutants refers to any air impurities or environmental airway stress inducing agents, such as cigarette smoke, cigar smoke, smog, asbestos, and other air pollutants that have suspected or proven association to lung diseases.
  • field defect means that the transcription pattern of epithelial cells lining the entire airway including the mouth buccal mucosa, airways, and lung tissue changes in response to airway pollutants. Therefore, the present invention provides methods to identify epithelial cell gene expression patterns that are associated with diseases and disorders of the lung.
  • altered expression refers to either increased or decreased expression in an individual exposed to air pollutant, such as a smoker, with cancer when compared to an expression pattern of the lung cells from an individual exposed to similar air pollutant, such as smoker, who does not have cancer.
  • Tables 1 and 2 show the preferred expression pattern changes of the invention.
  • the terms “up” and “down” in the tables refer to the amount of expression in a smoker with cancer to the amount of expression in a smoker without cancer Similar expression pattern changes are likely associated with development of cancer in individuals who have been exposed to other airway pollutants.
  • the group of genes the expression of which is analyzed in diagnosis and/or prognosis of lung cancer are selected from the group of 80 genes as shown in Table 5. Any combination of genes can be selected from the 80 genes. In one embodiment, the combination of 20 genes shown in Table 7 is selected. In one embodiment, a combination of genes from Table 6 is selected.
  • the analysis of the gene expression of one or more genes and/or transcripts of the groups or their subgroups of the present invention can be performed using any gene expression method known to one skilled in the art. Such methods include, but are not limited to expression analysis using nucleic acid chips (e.g. Affymetrix chips) and quantitative RT-PCR based methods using, for example real-time detection of the transcripts. Analysis of transcript levels according to the present invention can be made using total or messenger RNA or proteins encoded by the genes identified in the diagnostic gene groups of the present invention as a starting material.
  • nucleic acid chips e.g. Affymetrix chips
  • quantitative RT-PCR based methods using, for example real-time detection of the transcripts.
  • Analysis of transcript levels according to the present invention can be made using total or messenger RNA or proteins encoded by the genes identified in the diagnostic gene groups of the present invention as a starting material.
  • the analysis is an immunohistochemical analysis with an antibody directed against proteins comprising at least about 10-20, 20-30, preferably at least 36,_at least 36-50, 50, about 50-60, 60-70, 70-80, 80-90, 96, 100-180, 180-200, 200-250, 250-300, 300-350, 350-400, 400-450, 450-500, 500-535 proteins encoded by the genes and/or transcripts as shown in Tables 1-7.
  • the methods of analyzing transcript levels of the gene groups in an individual include Northern-blot hybridization, ribonuclease protection assay, and reverse transcriptase polymerase chain reaction (RT-PCR) based methods.
  • RT-PCR reverse transcriptase polymerase chain reaction
  • the different RT-PCR based techniques are the most suitable quantification method for diagnostic purposes of the present invention, because they are very sensitive and thus require only a small sample size which is desirable for a diagnostic test.
  • a number of quantitative RT-PCR based methods have been described and are useful in measuring the amount of transcripts according to the present invention.
  • RNA quantification using PCR and complementary DNA (cDNA) arrays (Shalon et al., Genome Research 6(7):639-45, 1996; Bernard et al., Nucleic Acids Research 24(8):1435-42, 1996), real competitive PCR using a MALDI-TOF Mass spectrometry based approach (Ding et al, PNAS, 100: 3059-64, 2003), solid-phase mini-sequencing technique, which is based upon a primer extension reaction (U.S. Pat. No. 6,013,431, Suomalainen et al. Mol. Biotechnol.
  • the samples are preferably obtained from bronchial airways using, for example, endoscopic cytobrush in connection with a fiber optic bronchoscopy.
  • the cells are obtained from the individual's mouth buccal cells, using, for example, a scraping of the buccal mucosa.
  • the invention provides a prognostic and/or diagnostic immunohistochemical approach, such as a dip-stick analysis, to determine risk of developing lung disease.
  • Antibodies against proteins, or antigenic epitopes thereof, that are encoded by the group of genes of the present invention, are either commercially available or can be produced using methods well know to one skilled in the art.
  • Some aspects of the present invention also provide prognostic, diagnostic, and therapeutic tools for the disorders of lung, particularly, lung cancer.
  • the invention is based on the identification of a “field defect” phenomenon and specific expression patterns related to airway epithelial cell exposure to pollutants, such as cigarette smoke.
  • the airway expression patterns of the present invention can be analyzed using nucleic acids and/or proteins from a biological sample of the airways.
  • lung cancer involves histopathological and molecular progression from normal to premalignant to cancer.
  • Gene expression arrays of lung tumors have been used to characterize expression profiles of lung cancers, and to show the progression of molecular changes from non-malignant lung tissue to lung cancer.
  • the present invention provides for the first time, a method of obtaining cells from other parts of the airways to identify the epithelial gene expression pattern in an individual.
  • the ability to determine which individuals have molecular changes in their airway epithelial cells and how these changes relate to a lung disorder, such as premalignant and malignant changes is a significant improvement for determining risk and for diagnosing a lung disorder such as cancer at a stage when treatment can be more effective, thus reducing the mortality and morbidity rates of lung cancer.
  • a lung disorder such as cancer at a stage when treatment can be more effective, thus reducing the mortality and morbidity rates of lung cancer.
  • Lung disorders which may be diagnosed or treated by methods described herein include, but are not limited to, asthma, chronic bronchitis, emphysema, bronchietasis, primary pulmonary hypertension and acute respiratory distress syndrome.
  • the methods described herein may also be used to diagnose or treat lung disorders that involve the immune system including, hypersensitivity pneumonitis, eosinophilic pneumonias, and persistent fungal infections, pulmonary fibrosis, systemic sclerosis, ideopathic pulmonary hemosiderosis, pulmonary alveolar proteinosis, cancers of the lung such as adenocarcinoma, squamous cell carcinoma, small cell and large cell carcinomas, and benign neoplasms of the lung including bronchial adenomas and hamartomas.
  • the biological samples useful according to the present invention include, but are not limited to tissue samples, cell samples, and excretion samples, such as sputum or saliva, of the airways.
  • tissue samples such as sputum or saliva
  • the samples useful for the analysis methods according to the present invention can be taken from the mouth, the bronchial airways, and the lungs.
  • the invention provides an “airway transcriptome” the expression pattern of which is useful in prognostic, diagnostic and therapeutic applications as described herein.
  • the airway transcriptome of the present invention comprises 85 genes the expression of which differs significantly between healthy smokers and healthy non-smokers.
  • the airway transcriptome according to the present invention comprises 85 genes, corresponding to 97 probesets, as a number of genes are represented by more than one probeset on the affymetrix array, identified from the about 7100 probesets the expression of which was statistically analyzed using epithelial cell RNA samples from smokers and non-smokers. Therefore, the invention also provides proteins that are encoded by the 85 genes.
  • the 85 identified airway transcriptome genes are listed on the following
  • HLF hepatic leukemia factor (OMIM#142385) 2.
  • CYFIP2 CYTOPLASMIC FMRP-INTERACTING PROTEIN 2 (OMIM#606323) 3.
  • MGLL monoglyceride lipase (GenBank gi: 47117287) 4.
  • HSPA2 HEAT-SHOCK 70-KD PROTEIN 2 (OMIM#140560) 5.
  • DKFZP586B2420 GeneCards TM database (Weitzman Institute of Science, Rehovot, Israel) at http://www6.unito.it/cgi-bin/cards/carddisp?DKFZPS86B2420 6.
  • SLIT1 SLIT, DROSOPHILA , HOMOLOG OF, 1 (OMIM#603742) 7.
  • SLIT2 SLIT, DROSOPHILA , HOMOLOG OF, 2 (OMIM#603746) 8.
  • C14orf132 hypothetical protein (GeneCards TM database Id No. GC14P094495 at http://bioinfo.enic.es/cgi-bin/db/genecards/carddisp?C14orf132 9.
  • TU3A DOWNREGULATED IN RENAL CELL CARCINOMA 1 (OMIM#608295) 10.
  • MMP10 MATRIX METTALLOPROTEIN 10 (OMIM#185260) 11.
  • CCND2 CYCLIN D2 CCND2 (OMIM#123833) 12.
  • MT1F METALLOTHIONEN 1F (OMIM#156352) 15. RNAHP Homo sapiens RNA helicase-related protein (Unigene/Hs. 8765) 16.
  • MT1X METALLOTHIONEIN 1X (OMIM#156359) 17.
  • MT1L METALLOTHIONEIN 1L (OMIM#156358) 18.
  • MT1G METALLOTHIONEIN 1G (OMIM#156353) 19. PEC1 GenBank ID No. AI541256 20. TNFSF13 TUMOR NECROSIS FACTOR LIGAND SUPERFAMILY, MEMBER 13 (OMIM#604472) 21. GMDS GDP-MANNOSE 4,6-DEHYDRATASE (OMIM#602884) 22. ZNF232 ZINC FINGER PROTEIN 2 (OMIM#194500) 23. GALNT12 UDP-N-ACETYL-ALPHA-D-GALACTOSAMINE:POLYPEPTIDE N- ACETYLGALACTOSAMINYLTRANSFERASE 13 (OMIM#608369) 24.
  • AP2B1 ADAPTOR-RELATED PROTEIN COMPLEX 2 BETA-1 SUBUNIT (OMIM#601925) 25. HN1 HUMANIN (OMIM#606120) 26. ABCC1 ATP-BINDING CASSETTE, SUBFAMILY C, MEMBER 1 (OMIM#158343) 27. RAB11A RAS FAMILY, MEMBER RAB11A (OMIM#605570) 28. MSMB MICROSEMINOPROTEIN, BETA (OMIM#157145) 29. MAFG V-MAF AVIAN MUSCULOAPONEUROTIC FIBROSARCOMA ONCOGENE FAMILY, PROTEIN G (OMIM#602020) 30.
  • PDG phosphoglycerate dehydrogenase (GeneID: 26227 at Entrez Gene, NCBI Database) 38.
  • HTATIP2 HIV-1 TAT-INTERACTING PROTEIN 2, 30-KD (OMIM#605628) 39.
  • ADH7 ALCOHOL DEHYDROGENASE 7 (OMIM#600086) 42.
  • GCLC GLUTAMATE-CYSTEINE LIGASE, CATALYTIC SUBUNIT (OMIM#606857) 43.
  • UPK1B UROPLAKIN 1B (OMIM#602380) 44.
  • PLEKHB2 pleckstrin homology domain containing, family B (evectins) member 2, GENEATLAS GENE DATABASE AT http://www.dsi.univ-paris5.fr/genatlas/fiche1.php?symbol PLEKHB2 45.
  • TCN1 TRANSCOBALAMIN 1 (OMIM#189905) 46.
  • TRIM16 TRIPARTITE MOTIF-CONTAINING PROTEIN 16 47.
  • UGT1A9 UDP-GLYCOSYLTRANSFERASE 1 FAMILY, POLYPEPTIDE A9 (OMIM#606434) 48.
  • TALDO1 TRANSALDOLASE 1 (OMIM#602063) 56. GPX2 GLUTATHIONE PEROXIDASE 2 (OMIM#138319) 57.
  • CLDN10 CLAUDIN 10 (GeneCards TM ID: GC13P093783) 63.
  • TXN thioredoxin (OMIM#187700) 64.
  • CEACAM6 CARCINOEMBRYONIC ANTIGEN-RELATED CELL ADHESION MOLECULE 6 (OMIM#163980) 73.
  • AP1G1 ADAPTOR-RELATED PROTEIN COMPLEX 1, GAMMA-1 SUBUNIT (OMIM#603533)
  • CA12 CARBONIC ANHYDRASE XII (OMIM#603263) 75.
  • FLJ20151 hypothetical protein GeneCards TM ID: GC15MO61330
  • BCL2L13 apoptosis facilitator (GeneID: 23786, Entrez) 77.
  • SRPUL Homo sapenes subhi-repeat protein (MutDB at http://mutdb.org/AnnoSNP/data/DD/S0/9U/AC.nt.html) 78.
  • FLJ13052 Homo sapiens NAD kinase (GenBank ID gi: 20070325) 79.
  • OASIS cAMP responsive element binding protein 3-like 1 (GeneBank ID gi: 21668501) 81. MUC5B MUCIN 5, SUBTYPE B, TRACHEOBRONCHIAL (OMIM#600770) 82. S100P S100 CALCIUM-BINDING PROTEIN P (OMIM#600614) 83. SDR1 dehydrogenease/reductase (SDR family) member 3 (GeneID: 9249, Entrez) 84. PLA2G10 PHOSPHOLIPASE A2, GROUP X (OMIM#603603) 85. DPYSL3 DIHYDROPYRIMIDINASE-LIKE 3 (OMIM#601168)
  • the invention further provides a lung cancer diagnostic airway transcriptome comprising at least 208 genes that are differentially expressed between smokers with lung cancer and smokers witout lung cancer.
  • the genes identified as being part of the diagnostic airway transcriptome are 208238_x_at-probeset; 216384x_at-probeset; 217679_x_at-probeset; 216859_x_at-probeset; 211200_s_at-probeset; PDPKI; ADAM28; ACACB; ASMTL; ACVR2B; ADAT1; ALMS1; ANK3; ANK3; OARS; AFURS1; ATP8B1; ABCC1; BTF3; BRD4; CELSR2; CALM31 CAPZB; CAPZB1 CFLAR; CTSS; CD24; CBX3; C21orf106; C6orf111; C6orf62; CHC1; DCLRE1C; EML2; EMS 1 EPHB6; EEF2;
  • Deviation in the expression compared to control group can be increased expression or decreased expression of one or more of the 208 genes.
  • probeset numbers as referred to herein and throughout the specification, refer to the Affymetrix probesets.
  • the methods to identify the airway transcriptomes can be used to identify airway transcriptomes in other animals than humans by performing the statistical comparisons as provided in the Examples below in any two animal groups, wherein one group is exposed to an airway pollutant and the other group is not exposed to such pollutant and performing the gene expression analysis of any large probeset, such as the probeset of 7119 genes used in some of the Examples. Therefore, the subject or individual as described herein and throughout the specification is not limited to human, but encompasses other mammals and animals, such as murine, bovine, swine, and other primates. This methodology can also be carried out with lung disorders to create new clusters of genes wherein change in their expression is related to specific disorders.
  • the invention provides a method of determining an increased risk of lung disease, such as lung cancer, in a smoker comprising taking an airway sample from the individual, analyzing the expression of at least one, preferably at least two, still more preferably at least 4, still more preferably at least 5, still more preferably at least 6, still more preferably at least 7, still more preferably at least 8, still more preferably at least 8, and still more preferably at least all 9 of the outlier genes including AKRICI; MSNIB; TM4SF1; UPKIB; FLJ20152; SEC14L3; HT021; GALNT6; and AKRIC2, wherein deviation of the expression of at least one, preferably at least two, still more preferably at least 4, still more preferably at least 5, still more preferably at least 6, still more preferably at least 7, still more preferably at least 8, still more preferably at least 8, and still more preferably at least all 9 as compared to a control group is indicative of the smoker being at increased risk of developing a lung disease, for example,
  • the methods disclosed herein can also be used to show exposure of a non-smoker to environmental pollutants by showing increased expression in a biological sample taken from the airways of the non-smoker of genes encoding proteins associated with detoxification, and genes encoding mucins and cell adhesion molecules or decreased expression of genes encoding certain proteins associated with immune regulation and metallothioneins. If such changes are observed, an entire group of individuals at work or home environment of the exposed individual may be analyzed and if any of them does not show the indicative increases and decreases in the expression of the airway transcriptome, they may be at greater risk of developing a lung disease and susceptible for intervention.
  • Screening can be used to single out high risk workers from the risky environment to transfer to a less risky environment.
  • the invention provides prognostic and diagnostic methods to screen for individuals at risk of developing diseases of the lung, such as lung cancer, comprising screening for changes in the gene expression pattern of the airway transcriptome.
  • the method comprises obtaining a cell sample from the airways of an individual and measuring the level of expression of 1-85 gene transcripts of the airway transcriptome as provided herein.
  • the level of at least two, still more preferably at least 3, 4, 5, 6, 7, 8, 9, 10 transcripts, and still more preferably, the level of at least 10-15, 15-20, 20-50, or more transcripts, and still more preferably all of the 97 trasncripts in the airway transcriptome are measured, wherein difference in the expression of at least one, preferably at least two, still more preferably at least three, and still more preferably at least 4, 5, 6, 7, 8, 9, 10, 10-15, 15-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80-85 genes present in the airway transcriptome compared to a normal airway transcriptome is indicative of increased risk of a lung disease.
  • the control being at least one, preferably a group of more than one individual exposed to the same pollutant and having a normal or healthy response to the exposure.
  • difference in at least one of the detoxification related genes, mucin genes, and/or cell adhesion related genes compared to the level of these genes expressed in a control is indicative of the individual being at an increased risk of developing diseases of the lung.
  • the differences in expression of at least one immune system regulation and/or metallothionein regulation related genes compared to the level of these genes expressed in a control group indicates that the individual is at risk of developing diseases of the lung.
  • the invention provides a prognostic method for lung diseases comprising detecting gene expression changes in at least on of the mucin genes of the airway transcriptome, wherein increase in the expression compared with control group is indicative of an increased risk of developing a lung disease.
  • mucin genes include muc 5 subtypes A, B, and. C.
  • the invention provides a tool for screening for changes in the airway transcriptome during long time intervals, such as weeks, months, or even years.
  • the airway transcriptome expression analysis is therefore performed at time intervals, preferably two or more time intervals, such as in connection with an annual physical examination, so that the changes in the airway transcriptome expression pattern can be tracked in individual basis.
  • the screening methods of the invention are useful in following up the response of the airways to a variety of pollutants that the subject is exposed to during extended periods. Such pollutants include direct or indirect exposure to cigarette smoke or other air pollutants.
  • the control as used herein is a healthy individual, whose responses to airway pollutants are in the normal range of a smoker as provided by, for example, the transcription patterns shown in FIG. 24 .
  • Analysis of transcript levels according to the present invention can be made using total or messenger RNA or proteins encoded by the genes identified in the airway trascriptome of the present invention as a starting material.
  • the analysis is an immunohistochemical analysis with an antibody directed against at least one, preferably at least two, still more preferably at least 4-10 proteins encoded by the genes of the airway transcriptome
  • the methods of analyzing transcript levels of one or more of the 85 transcripts in an individual include Northern-blot hybridization, ribonuclease protection assay, and reverse transcriptase polymerase chain reaction (RT-PCR) based methods.
  • RT-PCR reverse transcriptase polymerase chain reaction
  • the different RT-PCR based techniques are the most suitable quantification method for diagnostic purposes of the present invention, because they are very sensitive and thus require only a small sample size which is desirable for a diagnostic test.
  • a number of quantitative RT-PCR based methods have been described and are useful in measuring the amount of transcripts according to the present invention.
  • RNA quantification using PCR and complementary DNA (cDNA) arrays (Shalon et al., Genome Research 6(7):639-45, 1996; Bernard et al., Nucleic Acids Research 24(8):1435-42, 1996), solid-phase mini-sequencing technique, which is based upon a primer extension reaction (U.S. Pat. No. 6,013,431, Suomalainen et al. Mol. Biotechnol. June; 15(2):123-31, 2000), ion-pair high-performance liquid chromatography (Doris et al. J. Chromatogr. A May 8; 806(1):47-60, 1998), and 5′ nuclease assay or real-time RT-PCR (Holland et al. Proc Natl Acad Sci USA 88: 7276 7280, 1991).
  • cDNA complementary DNA
  • the present invention provides a method for detecting risk of developing lung cancer in a subject exposed to cigarette smoke comprising measuring the level of 1-97 proteins encoded by the airway transcriptome in a biological sample of the subject.
  • a biological sample of the subject Preferably at least one, still more preferably at least two, still more preferably at least three, and still more preferably at least 4-10, or more of the proteins encoded by the airway transcriptome in a biological sample of the subject are analyzed.
  • the method comprises binding an antibody against one or more of the proteins encoded by the airway transcriptome (the “protein”) to a solid support chosen from the group consisting of dip-stick and membrane; incubating the solid support in the presence of the sample to be analyzed under conditions where antibody-antigen complexes form; incubating the support with an anti-protein antibody conjugated to a detectable moeity which produces a signal; visually detecting said signal, wherein said signal is proportional to the amount of protein in said sample; and comparing the signal in said sample to a standard, wherein a difference in the amount of the protein in the sample compared to said standard of at feast one, preferably at least two, still more preferably at least 3-5, still more preferably at least 5-10, proteins is indicative of an increased risk of developing lung cancer.
  • the standard levels are measured to indicate expression levels in a normal airway exposed to cigarette smoke, as exemplified in the smoker transcript pattern shown, for example on FIG. 24 .
  • the assay reagents, pipettes/dropper, and test tubes may be provided in the form of a kit. Accordingly, the invention further provides a test kit for visual detection of one or more proteins encoded by the airway transcriptome, wherein detection of a level that differs from a pattern in a control individual is considered indicative of an increased risk of developing lung disease in the subject.
  • the test kit comprises one or more solutions containing a known concentration of one or more proteins encoded by the airway transcriptome (the “protein”) to serve as a standard; a solution of a anti-protein antibody bound to an enzyme; a chrotnogen which changes color or shade by the action of the enzyme; a solid support chosen from the group consisting of dip-stick and membrane carrying on the surface thereof an antibody to the protein.
  • the inventions disclosed herein contemplate either one dipstick capable of detecting all the diagnostically important gene products or alternatively, a series of dipsticks capable of detecting the amount proteins of a smaller sub-group of diagnostic proteins of the present invention.
  • Antibodies can be prepared by means well known in the art.
  • the term “antibodies” is meant to include monoclonal antibodies, polyclonal antibodies and antibodies prepared by recombinant nucleic acid techniques that are selectively reactive with a desired antigen.
  • Antibodies against the diagnostic proteins according to the present invention can be used in standard techniques such as Western blotting or immunohistochemistry to quantify the level of expression of the proteins of the diagnostic airway proteome. This is quantified according to the expression of the gene transcript, i.e. the increased expression of transcript corresponds to increased expression of the gene product, i.e. protein. Similarly decreased expression of the transcript corresponds to decreased expression of the gene product or protein.
  • Tables 5 and 6 describe a group of genes the expression of which is altered in lung cancer.
  • Immunohistochemical applications include assays, wherein increased presence of the protein can be assessed, for example, from a saliva or sputum sample.
  • the immunohistochemical assays according to the present invention can be performed using methods utilizing solid supports.
  • the solid support can be a any phase used in performing immunoassays, including dipsticks, membranes, absorptive pads, beads, microtiter wells, test tubes, and the like.
  • test devices which may be conveniently used by the testing personnel or the patient for self-testing, having minimal or no previous training
  • Such preferred test devices include dipsticks, membrane assay systems as described in U.S. Pat. No. 4,632,901. The preparation and use of such conventional test systems is well described in the patent, medical, and scientific literature.
  • the anti-protein antibody is bound to one end of the stick such that the end with the antibody can be dipped into the solutions as described below for the detection of the protein.
  • the samples can be applied onto the antibody-coated dipstick or membrane by pipette or dropper or the like.
  • the antibody against proteins encoded by the diagnostic airway transcriptome can be of any isotype, such as IgA, IgG or IgM, Fab fragments, or the like.
  • the antibody may be a monoclonal or polyclonal and produced by methods as generally described, for example, in Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory, 1988, incorporated herein by reference.
  • the antibody can be applied to the solid support by direct or indirect means. Indirect bonding allows maximum exposure of the protein binding sites to the assay solutions since the sites are not themselves used for binding to the support.
  • polyclonal antibodies are used since polyclonal antibodies can recognize different epitopes of the protein thereby enhancing the sensitivity of the assay.
  • the solid support is preferably non-specifically blocked after binding the protein antibodies to the solid support.
  • Non-specific blocking of surrounding areas can be with whole or derivatized bovine serum albumin, or albumin from other animals, whole animal serum, casein, non-fat milk, and the like.
  • the sample is applied onto the solid support with bound protein-specific antibody such that the protein will be bound to the solid support through said antibodies. Excess and unbound components of the sample are removed and the solid support is preferably washed so the antibody-antigen complexes are retained on the solid support.
  • the solid support may be washed with a washing solution which may contain a detergent such as Tween-20, Tween-80 or sodium dodecyl sulfate.
  • the second antibody which reacts with protein is applied.
  • the second antibody may be labeled, preferably with a visible label.
  • the labels may be soluble or particulate and may include dyed immunoglobulin binding substances, simple dyes or dye polymers, dyed latex beads, dye-containing liposomes, dyed cells or organisms, or metallic, organic, inorganic, or dye solids.
  • the labels may be bound to the protein antibodies by a variety of means that are well known in the art.
  • the labels may be enzymes that can be coupled to a signal producing system.
  • visible labels examples include alkaline phosphatase, beta-galactosidase, horseradish peroxidase, and biotin. Many enzyme-chromogen or enzyme-substrate-chromogen combinations are known and used for enzyme-linked assays. Dye labels also encompass radioactive labels and fluorescent dyes.
  • corresponding steps may be carried out with a known amount or amounts of the protein and such a step can be the standard for the assay.
  • a sample from a healthy individual exposed to a similar air pollutant such as cigarette smoke, can be used to create a standard for any and all of the diagnostic gene group encoded proteins.
  • the solid support is washed again to remove unbound labeled antibody and the labeled antibody is visualized and quantified.
  • the accumulation of label will generally be assessed visually. This visual detection may allow for detection of different colors, for example, red color, yellow color, brown color, or green color, depending on label used. Accumulated label may also be detected by optical detection devices such as reflectance analyzers, video image analyzers and the like.
  • the visible intensity of accumulated label could correlate with the concentration of protein in the sample.
  • the correlation between the visible intensity of accumulated label and the amount of the protein may be made by comparison of the visible intensity to a set of reference standards.
  • the standards have been assayed in the same way as the unknown sample, and more preferably alongside the sample, either on the same or on a different solid support.
  • the concentration of standards to be used can range from about 1 mg of protein per liter of solution, up to about 50 mg of protein per liter of solution.
  • concentrations of an airway gene group encoded proteins are used so that quantification of the unknown by comparison of intensity of color is more accurate.
  • the present invention provides a method for detecting risk of developing lung cancer in a subject exposed to cigarette smoke comprising measuring the transcription profile of the proteins encoded by one or more groups of genes of the invention in a biological sample of the subject.
  • a biological sample of the subject Preferably at least about 30, still more preferably at least about 36, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, or about 180 of the proteins encoded by the airway transcriptome in a biological sample of the subject are analyzed.
  • the method comprises binding an antibody against each protein encoded by the gene in the gene group (the “protein”) to a solid support chosen from the group consisting of dip-stick and membrane; incubating the solid support in the presence of the sample to be analyzed under conditions where antibody-antigen complexes form; incubating the support with an anti-protein antibody conjugated to a detectable moiety which produces a signal; visually detecting said signal, wherein said signal is proportional to the amount of protein in said sample; and comparing the signal in said sample to a standard, wherein a difference in the amount of the protein in the sample compared to said standard of the same group of proteins, is indicative of diagnosis of or an increased risk of developing lung cancer.
  • the standard levels are measured to indicate expression levels in an airway exposed to cigarette smoke where no cancer has been detected.
  • the assay reagents, pipettes/dropper, and test tubes may be provided in the form of a kit. Accordingly, the invention further provides a test kit for visual detection of the proteins encoded by the airway gene groups, wherein detection of a level that differs from a pattern in a control individual is considered indicative of an increased risk of developing lung disease in the subject.
  • the test kit comprises one or more solutions containing a known concentration of one or more proteins encoded by the airway transcriptome (the “protein”) to serve as a standard; a solution of a anti-protein antibody bound to an enzyme; a chromogen which changes color or shade by the action of the enzyme; a solid support chosen from the group consisting of dip-stick and membrane carrying on the surface thereof an antibody to the protein. Instructions including the up or down regulation of the each of the genes in the groups as provided by the Tables 1 and 2 are included with the kit.
  • the practice of the present invention may employ, unless otherwise indicated, conventional techniques and descriptions of organic chemistry, polymer technology, molecular biology (including recombinant techniques), cell biology, biochemistry, and immunology, which are within the skill of the art.
  • Such conventional techniques include polymer array synthesis, hybridization, ligation, and detection of hybridization using a label. Specific illustrations of suitable techniques can be had by reference to the example herein below. However, other equivalent conventional procedures can, of course, also be used.
  • Such conventional techniques and descriptions can be found in standard laboratory manuals such as Genome Analysis: A Laboratory Manual Series (Vols.
  • the methods of the present invention can employ solid substrates, including arrays in some preferred embodiments.
  • Methods and techniques applicable to polymer (including protein) array synthesis have been described in U.S. Ser. No. 09/536,841, WO 00/58516, U.S. Pat. Nos.
  • Patents that describe synthesis techniques in specific embodiments include U.S. Pat. Nos. 5,412,087, 6,147,205, 6,262,216, 6,310,189, 5,889,165, and 5,959,098. Nucleic acid arrays are described in many of the above patents, but the same techniques are applied to polypeptide and protein arrays.
  • Nucleic acid arrays that are useful in the present invention include, but are not limited to those that are commercially available from Affymetrix (Santa Clara, Calif.) under the brand name GeneChip7. Example arrays are shown on the website at affymetrix.com.
  • the present invention also contemplates sample preparation methods in certain preferred embodiments.
  • the nucleic acid sample may be amplified by a variety of mechanisms, some of which may employ PCR. See, e.g., PCR Technology: Principles and Applications for DNA Amplification (Ed. H. A. Erlich, Freeman Press, NY, N.Y., 1992); PCR Protocols: A Guide to Methods and Applications (Eds. Innis, et al., Academic Press, San Diego, Calif., 1990); Mattila et al., Nucleic Acids Res. 19, 4967 (1991); Eckert et al., PCR Methods and Applications 1, 17 (1991); PCR (Eds.
  • LCR ligase chain reaction
  • LCR ligase chain reaction
  • Landegren et al. Science 241, 1077 (1988) and Barringer et al. Gene 89:117 (1990)
  • transcription amplification Kwoh et al., Proc. Natl. Acad. Sci. USA 86, 1173 (1989) and WO88/10315
  • self-sustained sequence replication Guatelli et al., Proc. Nat. Acad. Sci. USA, 87, 1874 (1990) and WO90/06995
  • selective amplification of target polynucleotide sequences U.S. Pat. No.
  • the present invention also contemplates signal detection of hybridization between the sample and the probe in certain embodiments. See, for example, U.S. Pat. Nos. 5,143,854, 5,578,832; 5,631,734; 5,834,758; 5,936,324; 5,981,956; 6,025,601; 6,141,096; 6,185,030; 6,201,639; 6,218,803; and 6,225,625, in provisional U.S. Patent application 60/364,731 and in PCT Application PCT/US99/06097 (published as WO99/47964).
  • Computer software products of the invention typically include computer readable medium having computer-executable instructions for performing the logic steps of the method of the invention.
  • Suitable computer readable medium include floppy disk, CD-ROM/DVD/DVD-ROM, hard-disk drive, flash memory, ROM/RAM, magnetic tapes and etc.
  • the computer executable instructions may be written in a suitable computer language or combination of several languages. Basic computational biology methods are described in, e.g.
  • the present invention also makes use of various computer program products and software for a variety of purposes, such as probe design, management of data, analysis, and instrument operation. See, for example, U.S. Pat. Nos. 5,593,839, 5,795,716, 5,733,729, 5,974,164, 6,066,454, 6,090,555, 6,185,561, 6,188,783, 6,223,127, 6,229,911 and 6,308,170.
  • the present invention may have embodiments that include methods for providing gene expression profile information over networks such as the Internet as shown in, for example, U.S. patent applications Ser. Nos. 10/063,559, 60/349,546, 60/376,003, 60/394,574, 60/403,381.
  • fractional ranges are also included in the exemplified amounts that are described. Therefore, for example, a range of 1-3 includes fractions such as 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, etc. This applies particularly to the amount of increase or decrease of expression of any particular gene or transcript.
  • RNA/DNA epithelial nucleic acids
  • RNA and U133A Affymetrix array that represents transcripts from about 22,500 genes.
  • microarray data analysis was performed as follows. We first scanned the Affymetrix chips that had been hybridized with the study group samples. The obtained microarray raw data consisted of signal strength and detection p-value. We normalized or scaled the data, and filtered the poor quality chips based on images, control probes, and histograms according to standard Affymetrix instructions. We also filtered contaminated specimens which contained non-epithelial cells. Lastly, the genes of importance were filtered using detection p-value. This resulted in identification of transcripts present in normal airways (normal airway transcriptome), with variability and multiple regression analysis. This also resulted in identification of effects of smoking on airway epithelial cell transcription. For this, we used T-test and Pearson correlation analysis. We also identified a group or a set of transcripts that were differentially expressed in samples with lung cancer and samples without cancer. This analysis was performed using class prediction models.
  • Committees of variable sizes of the top ranked genes were used to evaluate test samples, but genes with more significant p-values were more heavily weighed.
  • FIG. 8 shows diagrams of the class prediction model analysis used in the Example 1.
  • the chip used was the Human Genome U133 Set.
  • Microarray Suite 5.0 software to analyze raw data from the chip (i.e. to convert the image file into numerical data). Both the chip and the software are proprietary materials from Affymetrix. Bronchoscopy was performed to obtain nucleic acid samples from 98 smoker individuals.
  • the predictor groups of genes were identified using the GenePattern server from the Broad Institute, which includes the Weighted Voting algorithm. The default settings, i.e., the signal to noise ratio and no gene filtering, were used. GenePattern is available at World Wide Web from broad.mitedu/cancer/software/genepattern. This program allows analysis of data in groups rather than as individual genes.
  • Table 1 shows the top 96 genes from our analysis with different expression patterns in smokers with cancer and smokers without cancer.
  • Table 2 shows the 84 genes that were also identified in our previous screens as individual predictors of lung cancer.
  • Table 4 shows a novel group of 36 genes the expression of which was different between the smokers with cancer and smokers without cancer.
  • Table 3 shows a group of 50 genes that we identified as most predictive of development of cancer in smokers. That is, that when the expression of these genes was analyzed and reflected the pattern (expression down or up) as shown in Table 3, we could identify the individuals who will develop cancer based on this combined expression profile of these genes.
  • the expression analysis of these 50 genes was predictive of a smoker developing lung cancer in over 70% of the samples. Accuracy of diagnosis of lung cancer in our sample was 80-85% on cross-validation and independent dataset (accuracy includes both the sensitivity and specificity).
  • the sensitivity percent of cancer cases correctly diagnosed was approximately 75% as compared to sensitivity of 40% using standard bronchoscopy technique. (Specificity is percent of non-cancer cases correctly diagnosed).
  • Subjects were classified as having lung cancer if their bronchoscopy studies (brushing, bronchoalveolar lavage or endobronchial biopsy) or a subsequent lung biopsy (transthoracic biopsy or surgical lung biopsy) yielded tumor cells on pathology/cytology. Subjects were classified with an alternative benign diagnosis if the bronchoscopy or subsequent lung biopsy yielded a non-lung cancer diagnosis or if their radiographic abnormality resolved on follow up chest imaging. The study was approved by the Institutional Review Boards of all 4 medical centers and all participants provided written informed consent.
  • Airway epithelial cell collection Following completion of the standard diagnostic bronchoscopy studies, bronchial airway epithelial cells were obtained from the “uninvolved” right mainstem bronchus with an endoscopic cytobrush (Cellebrity Endoscopic Cytobrush, Boston Scientific, Boston, Mass.). If a suspicious lesion (endobronchial or submucosal) was seen in the right mainstem bronchus, cells were then obtained from the uninvolved left mainstem bronchus. The brushes were immediately placed in TRIzol reagent (Invitrogen, Carlsbad, Calif.) after removal from the bronchoscope and kept at ⁇ 80° C. until RNA isolation was performed.
  • TRIzol reagent Invitrogen, Carlsbad, Calif.
  • Microarray data acquisition and preprocessing 6-8 ⁇ g of total RNA was processed, labeled, and hybridized to Affymetrix HG-U133A GeneChips containing approximately 22,215 human transcripts as described previously(17). We obtained sufficient quantity of high quality RNA for microarray studies from 152 of the 208 samples. The quantity of RNA obtained improved during the course of the study so that 90% of brushings yielded sufficient high quality RNA during the latter half of the study. Log-normalized probe-level data was obtained from CEL files using the Robust Multichip Average (RMA) algorithm(18). A z-score filter was employed to filter out arrays of poor quality (see supplement for details), leaving 129 samples with a final diagnosis available for analysis.
  • RMA Robust Multichip Average
  • Table 1a shows demographic features and characteristics of the two patient classes being studied. Statistical differences between the two patient classes and associated p values were calculated using T-tests, Chi-square tests and Fisher's exact tests where appropriate.
  • Gene selection was conducted through internal cross-validation within the training set using the weighted voting algorithm(19). The internal cross-validation was repeated 50 times, and the top 40 up-and top 40 down-regulated probesets in cancer most frequently chosen during internal cross-validation runs were selected as the final gene committee of 80 features (see sections, infra, for details regarding the algorithm and the number of genes selected for the committee).
  • the accuracy, sensitivity, and specificity of the biomarker were assessed on the independent test set of 52 samples. This was accomplished by using the weighted vote algorithm to predict the class of each test set sample based on the gene expression of the 80 probesets and the probe set weights derived from the 77 samples in the training set.
  • To assess the performance of our classifier we first created 1000 predictors from the training set where we randomized the training set class labels. We evaluated the performance of these “class-randomized” classifiers for predicting the sample class of the test set samples and compared these to our classifier using ROC analysis.
  • Quantitative PCR Validation Real time PCR (QRT-PCR) was used to confirm the differential expression of a select number of genes in our predictor. Primer sequences were designed with Primer Express software (Applied Biosystems, Foster City, Calif.). Forty cycles of amplification, data acquisition, and data analysis were carried out in an ABI Prism 7700 Sequence Detector (Applied Biosystems, Foster City, Calif.). All real time PCR experiments were carried out in triplicate on each sample (see sections infra).
  • the 80-gene lung cancer biomarker derived from airway epithelium gene expression was evaluated for its ability to distinguish between normal and cancerous lung tissue using an Affymetrix HGU95Av2 dataset published by Bhattacharjee et al(21) that we processed using RMA.
  • Affymetrix HGU95Av2 dataset published by Bhattacharjee et al(21) that we processed using RMA.
  • 64 HGU95Av2 probesets were identified that measure the expression of genes that corresponded to the 80 probesets in our airway classifier. This resulted in a partial airway epithelium signature that was then used to classify tumor and normal samples from the dataset.
  • PCA analysis of the lung tissue samples was performed using the expression of these 64 probesets.
  • RMA was performed in BioConductor.
  • the upstream gene filtering by ANCOVA, and the implementation of the weighted voted algorithm and internal cross validation used to generate the data were executed through an R script we wrote for this purpose.
  • the PAM algorithm was carried out using the ‘pamr’ library in R. All other statistical analyses including Student's T-Tests, Fisher's exact tests, ROC curves and PCA were performed using the R statistical package.
  • Study Population and Epithelial samples 129 subjects that had microarrays passing the quality control filter described above were included in the class prediction analysis (see Supplemental FIG. 1 ). Demographic data on these subjects, including 60 smokers with primary lung cancer and 69 smokers without lung cancer is presented in Table 1. Cell type and stage information for all cancer patients is shown in Supplemental Table 1. Bronchial brushings yielded 90% epithelial cells, as determined by cytokeratin staining, with the majority being ciliated cells with normal bronchial airway morphology. No dysplastic or cancer cells were seen on any representative brushings obtained from smokers with or without cancer.
  • Hierarchical clustering of the 80 genes selected for the diagnostic biomarker in the test set samples is shown in FIG. 15 .
  • Principal Component Analysis of all cancer samples according to the expression of these 80 genes did not reveal grouping by cell type ( FIG. 10 ).
  • the accuracy of this 80-gene classifier was similar when microarray data was preprocessed in MAS 5.0 and when the PAM class prediction algorithm was used (see Supplemental Table 3).
  • the performance of the classifier was not dependent on the particular composition of the training and test set on which it was derived and tested: 500 training and test sets (derived from the 129 samples) resulted in classifiers with similar accuracy as the classifier derived from our training set ( FIG. 11 ).
  • 500 training and test sets resulted in classifiers with similar accuracy as the classifier derived from our training set ( FIG. 11 ).
  • the classifier is better able to distinguish the two sample classes than 500 classifiers derived by randomly selecting genes (see FIG. 12 ).
  • Bronchoscopy was diagnostic (via endoscopic brushing, washings or biopsy of the affected region) in 32/60 (53%) of lung cancer patients and 5/69 non-cancer patients.
  • our class prediction model had an accuracy of 85% with 89% sensitivity and 83% specificity.
  • Combining bronchoscopy with our gene expression signature resulted in a 95% diagnostic sensitivity (57/60) across all cancer subjects.
  • a negative bronchoscopy and negative gene expression signature for lung cancer resulted in a 95% negative predictive value (NPV) for disease ( FIG. 18 ).
  • NPV negative predictive value
  • the positive predictive value of our gene expression profile for lung cancer was approximately 70% ( FIG. 18 ).
  • Stage and cell type subgroup analysis The diagnostic yield of our airway gene expression signature vs. bronchoscopy according to stage and cell type of the lung cancer samples is shown in FIG. 19 .
  • Lung cancer is the leading cause of death from cancer in the United States, in part because of the lack of sensitive and specific diagnostic tools that are useful in early-stage disease. With approximately 90 million former and current smokers in the U.S., physicians increasingly encounter smokers with clinical suspicion for lung cancer on the basis of an abnormal radiographic imaging study and/or respiratory symptoms. Flexible bronchoscopy represents a relatively noninvasive initial diagnostic test to employ in this setting. This study was undertaken in order to develop a gene expression-based diagnostic, that when combined with flexible bronchoscopy, would provide a sensitive and specific one-step procedure for the diagnosis of lung cancer.
  • profiling airway gene expression across smokers with and without lung cancer can also provide insight into the nature of the “field of injury” reported in smokers and potential pathways implicated in lung carcinogenesis.
  • Previous studies have demonstrated allelic loss and methylation of tumor suppressor genes in histologically normal bronchial epithelial cells from smokers with and without lung cancer(12; 13; 15). Whether these changes are random mutational effects or are directly related to lung cancer has been unclear.
  • the finding that our airway gene signature was capable of distinguishing lung cancer tissue from normal lung suggests that the airway biomarker is, at least in part, reflective of changes occurring in the cancerous tissue and may provide insights into lung cancer biology.
  • Rab proteins represent a family of at least 60 different Ras-like GTPases that have crucial roles in vesicle trafficking, signal transduction, and receptor recycling, and dysregulation of RAB gene expression has been implicated in tumorigenesis(24).
  • Shimada et al.(25) found a high prevalence of Rab1A-overexpression in head and neck squamous cell carcinomas and also in premalignant tongue lesions, suggesting that it may be an early marker of smoking-related respiratory tract carcinogenesis.
  • the classifier contained several pro-inflammatory genes, including Interleukin-8 (IL-8) and beta-defensin 1 that were up regulated in smokers with lung cancer.
  • IL-8 originally discovered as a chemotactic factor for leukocytes, has been shown to contribute to human cancer progression through its mitogenic and angiogenic properties(26; 27).
  • Beta defensins antimicrobial agents expressed in lung epithelial cells, have recently found to be elevated in the serum of patients with lung cancer as compared to healthy smokers or patients with pneumonia(28). Higher levels of these mediators of chronic inflammation in response to tobacco exposure may result in increased oxidative stress and contribute to tumor promotion and progression in the lung(29; 30)
  • BACH2 a transcription factor
  • DNA repair protein 1C a DNA repair enzyme 1C
  • BACH-2 a transcription factor
  • oxidative-stress(31) a DNA repair enzyme 1C
  • a subset of healthy smokers respond differently to tobacco smoke, failing to induce a set of detoxification enzymes in their normal airway epithelium, and that these individuals may be predisposed to its carcinogenic effects(17).
  • a component of the airway “field defect” may reflect whether a given smoker is appropriately increasing expression of protective genes in response to the toxin. This inappropriate response may reflect a genetic susceptibility to lung cancer or alternatively, epigenetic silencing or deletion of that gene by the carcinogen.
  • a negative bronchoscopy and negative gene expression signature for lung cancer results in a 95% negative predictive value (NPV), allowing these patients to be followed non-aggressively with repeat imaging studies.
  • the positive predictive value is ⁇ 70%, and these patients would likely require further invasive testing (i.e. transthoracic needle biopsy or open lung biopsy) to confirm the presumptive lung cancer diagnosis.
  • this represents a substantial reduction in the numbers of patients requiring further invasive diagnostic testing compared to using bronchoscopy alone.
  • 92/129 patients were bronchoscopy negative and would have required further diagnostic work up.
  • the negative predictive gene expression profile in 56 of these 92 negative bronchoscopy subjects would leave only 36 subjects who would require further evaluation (see FIG. 18 ).
  • NSCLC 48 NSCLC staging Squamous Cell 23 IA 2 Adenocarcinoma 11 IB 9 Large Cell 4 IIA 2 Not classified 10 IIB 0 Small Cell 11 IIIA 9 Unknown 1 IIIB 9 IV 17
  • Supplemental Table 1 above shows cell type and staging information for 60 lung cancer patients in the 129 primary sample set used to build and test the class prediction model. Staging information limited to the 48 non-small cell samples.
  • the demographic features of the samples in training and test shown are shown in Supplemental Table 2 below.
  • each patient's clinical pre-test probability for disease varied.
  • three independent pulmonary clinicians blinded to the final diagnoses, evaluated each patient's clinical history (including age, smoking status, cumulative tobacco exposure, co-morbidities, symptoms/signs and radiographic findings) and assigned a pre-bronchoscopy probability for lung cancer.
  • Each patient was classified into one of three risk groups: low ( ⁇ 10% probability of lung cancer), medium (10-50% probability of lung cancer) and high (>50% probability of lung cancer). The final risk assignment for each patient was decided by the majority opinion.
  • Bronchial airway epithelial cells were obtained from the subjects described above via flexible bronchoscopy. Following local anesthesia with 2% topical lidocaine to the oropharynx, flexible bronchoscopy was performed via the mouth or nose. Following completion of the standard diagnostic bronchoscopy studies (i.e. bronchoalveolar lavage, brushing and endo/transbronchial biopsy of the affected region), brushings were obtained via three endoscopic cytobrushes from the right mainstem bronchus. The cytobrush was rubbed over the surface of the airway several times and then retracted from the bronchoscope so that epithelial cells could be placed immediately in TRIzol solution and kept at ⁇ 80° C. until RNA isolation was performed.
  • Microarray data acquisition 6-8 ⁇ g of total RNA from bronchial epithelial cells were converted into double-stranded cDNA with SuperScript II reverse transcriptase (Invitrogen) using an oligo-dT primer containing a T7 RNA polymerase promoter (Genset, Boulder, Colo.).
  • the ENZO Bioarray RNA transcript labeling kit (Enzo Life Sciences, Inc, Farmingdale, N.Y.) was used for in vitro transcription of the purified double stranded cDNA.
  • the biotin-labeled cRNA was then purified using the RNeasy kit (Qiagen) and fragmented into fragments of approximately 200 base pairs by alkaline treatment. Each cRNA sample was then hybridized overnight onto the Affymetrix HG-U133A array followed by a washing and staining protocol. Confocal laser scanning (Agilent) was then performed to detect the streptavidin-labeled fluor.
  • RMA Robust Multichip Average
  • Supplemental Table 4 shows the Average Pearson Correlations between 7 pairs of replicate samples where probe-set gene expression values were determined using Microarray Suite 5.0 (Affy), logged data from Microarray Suite 5.0 (log 2 Affy), and RMA. RMA maximizes the correlation between replicate samples.
  • each probeset on the array was z-score normalized to have a mean of zero and a standard deviation of 1 across all 152 samples. These normalized gene-expression values were averaged across all probe-sets for each sample. The assumption explicit in this analysis is that poor-quality samples will have probeset intensities that consistently trend higher or lower across all samples and thus have an average z-score that differs from zero. This average z-score metric correlates with Affymetrix MAS 5.0 quality metrics such as percent present ( FIG. 7 ) and GAPDH 3′/5′ ratio. Microarrays that had an average z-score with a value greater than 0.129 ( ⁇ 15% of the 152 samples) were filtered out. The resulting sample set consisted of 60 smokers with cancer and 69 smokers without cancer.
  • Prospective validation test set CEL files for the additional 40 samples were added to the collection of airway epithelial CEL files described above, and the entire set was analyzed using RMA to derive expression values for the new samples. Microarrays that had an average z-score with a value greater than 0.129 (5 of the 40 samples) were filtered out. Class prediction of the 35 remaining prospective samples was conducted using the vote weights for the 80-predictive probesets derived from the training set of 77 samples using expression values computed in the section above.
  • Class Prediction Algorithm The 129-sample set (60 cancer samples, 69 no cancer samples) was used to develop a class-prediction algorithm capable of distinguishing between the two classes.
  • One potentially confounding difference between the two groups is a difference in cumulative tobacco-smoke exposure as measured by pack-years.
  • the pack-years each patient smoked was included as a covariate in the training set ANCOVA gene filter.
  • pre-bronchoscopy clinical risk for lung cancer there are differences in the pre-bronchoscopy clinical risk for lung cancer among the 129 patients.
  • the training set was constructed with roughly equal numbers of cancer and no cancer samples from a spectrum of lung cancer risk.
  • the weighted voting algorithm (Golub T R, et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 1999; 286(5439):531-537) was implemented as the class prediction method, with several modifications to the gene-selection methodology. Genes that varied between smokers with and without cancer in the training set samples after adjusting for tobacco-smoke exposure (p ⁇ 0.05) were identified using an ANCOVA with pack-years as the covariate. Further gene selection was performed using the signal to noise metric and internal cross-validation where the 40 most consistently up- and the 40 most consistently down-regulated probesets were identified. The internal cross validation involved leaving 30% of the training samples out of each round of cross-validation, and selecting genes based on the remaining 70% of the samples.
  • the final gene committee consisted of eighty probesets that were identified as being most frequently up-regulated or down-regulated across 50 rounds of internal cross-validation. The parameters of this gene-selection algorithm were chosen to maximize the average accuracy, sensitivity and specificity obtained from fifty runs. This algorithm was implemented in R and yields results that are comparable to the original implementation of the weighted-voted algorithm in GenePattern when a specific training, test, and gene set are given as input.
  • the algorithm was run using a training set of 77 samples to arrive at a final set of genes capable of distinguishing between smokers with and without lung cancer.
  • the accuracy, sensitivity and specificity of this classifier were tested against 52 samples that were not included in the training set.
  • the performance of this classifier in predicting the class of each test-set sample was assessed by comparing it to runs of the algorithm where either: 1) different training/tests sets were used; 2) the cancer status of the training set of 77 samples were randomized; or 3) the genes in the classifier were randomly chosen (see randomization section below for details).
  • Randomization The accuracy, sensitivity, specificity, and area under the ROC curve (using the signed prediction strength as a continuous cancer predictor) for the 80-probeset predictor (above) were compared to 1000 runs of the algorithm using three different types of randomization. First, the class labels of the training set of 77 samples were permuted and the algorithm, including gene selection, was re-run 1000 times (referred to in Supplemental Table 5 as Random 1).
  • the second randomization used the 80 genes in the original predictor but permuted the class labels of the training set samples over 1000 runs to randomize the gene weights used in the classification step of the algorithm (referred to in Supplemental Table 5 as Random 2).
  • the third randomization method involved randomly selecting 80 probesets for each of 1000 random classifiers (referred to in Supplemental Table 5 as Random 3).
  • the p-value for each metric and randomization method shown indicate the percentage of 1000 runs using that randomization method that exceeded or was equal to the performance of the actual classifier.
  • the actual classifier was compared to 1000 runs of the algorithm where different training/test sets were chosen but the correct sample labels were retained. Empirically derived p-values were also computed to compare the actual classifier to the 1000 runs of the algorithm (see Supplemental Table 6). These data indicate that the actual classifier was derived using a representative training and test set.
  • Supplemental Table 6 above shows a comparison of actual classifier to 1000 runs of the algorithm with different training/test sets.
  • Supplemental Table 7 above shows comparison of runs of the algorithm using different training/test sets to runs where the class labels of the training sets were randomized (1000 runs were conducted).
  • Characteristics of the 1000 additional runs of the algorithm The number of times a sample in the test set was classified correctly and its average prediction strength was computed across the 1000 runs of the algorithm.
  • the average prediction strength when a sample was classified correctly was 0.54 for cancers and 0.61 for no cancers, and the average prediction strength when a sample was misclassified was 0.31 for cancer and 0.37 for no cancers.
  • the slightly higher prediction strength for smokers without cancer is reflective of the fact that predictors have a slightly higher specificity on average.
  • Supplemental FIG. 3 shows that samples that are consistently classified correctly or classified incorrectly are classified with higher confidence higher average prediction strength).
  • the number of times the 80-predictive probesets used in the biomarker were selected in each of the 1000 runs was examined.
  • the majority of the 80-biomarker probesets were chosen frequently over the 1000 runs (37 probesets were present in over 800 runs, and 58 of the probesets were present in over half of the runs).
  • the cancer status of the training set samples are randomized over 1000 runs (Supplemental Table 7)
  • the most frequently selected probeset is chosen 66 times, and the average is 7.3 times. (See FIG. 10B ).
  • Supplemental Table 8 shows a comparison of accuracy, sensitivity and specificity for our 80 probeset classifier on the 52 sample test set using alternative microarray data preprocessing algorithms and class prediction algorithms.
  • Prediction Strength The Weighted voting algorithm predicts a sample's class by summing the votes each gene on the class prediction committee gives to one class versus the other. The level of confidence with which a prediction is made is captured by the Prediction Strength (PS) and is calculated as follows:
  • V winning refers to the total gene committee votes for the winning class and V losing refers to the total gene committee votes for the losing class. Since V winning is always greater than V losing , PS confidence varies from 0 (arbitrary) to 1 (complete confidence) for any given sample.
  • Cancer cell type To determine if the tumor cell subtype affects the expression of genes that distinguish airway epithelium from smokers with and without lung cancer, Principal Component Analysis (PCA) was performed on the gene-expression measurements for the 80 probesets in our predictor and all of the airway epithelium samples from patients with lung cancer ( FIG. 12 ). Gene expression measurements were Z(0,1) normalized prior to PCA. There is no apparent separation of the samples with regard to cancer subtype.
  • PCA Principal Component Analysis
  • Probesets were mapped between the HGU133A array and HGU95Av2 array using Chip Comparer at the Duke University's database available through the world wide web at tenero.duhs.duke.edu/genearray/perl/chip/chipcomparer.pl. 64 probesets on the HGU95Av2 array mapped to the 80-predictive probesets. The 64 probesets on the HGU95Av2 correspond to 48 out of the 80 predictive probesets (32/80 predictive probesets have no clear corresponding probe on the HGU95Av2 array).
  • the mean difference between normal and malignant samples was calculated based on the projected values for principal component 1 for the actual 64 probesets and for each of the 1000 random sets of 64 probesets.
  • the mean difference between normal and malignant from the 1000 random gene sets was used to generate a null distribution.
  • the weighted voted algorithm was re-run using two types of data randomization. First, the class labels of the training set of 20 samples were permuted and the algorithm, including gene selection, was re-run 1000 times (referred to in Supplemental Table 9 as Random 1).
  • the second randomization involved permuting the class labels of the training set of 20 samples and re-running the algorithm 1000 times keeping the list of 64-probsets constant (referred to in Supplemental Table 9 as Random 2).
  • the class labels were permuted such that half the samples were correctly labeled.
  • the p-value for each metric and randomization method shown indicate the percentage of 1000 runs using that randomization method that exceeded or were equal to the performance of the actual classifier.
  • RNA samples (1 ⁇ g of the RNA used in the microarray experiment) were treated with DNAfree (Ambion, Austin, Tex.), according to the manufacturer's protocol, to remove contaminating genomic DNA. Total RNA was reverse-transcribed using random hexamers (Applied Biosystems) and SuperScript II reverse transcriptase (Invitrogen).
  • the resulting first-strand cDNA was diluted with nuclease-free water (Ambion) to 5 ng/ ⁇ l.
  • PCR amplification mixtures 25 ⁇ l contained 10 ng template cDNA, 12.5 ⁇ l of 2 ⁇ SYBR Green PCR master mix (Applied Biosystems) and 300 nM forward and reverse primers. Forty cycles of amplification and data acquisition were carried out in an Applied Biosystems 7500 Real Time PCR System. Threshold determinations were automatically performed by Sequence Detection Software (version 1.2.3) (Applied Biosystems) for each reaction. All real-time PCR experiments were carried out in triplicate on each sample (6 samples total; 3 smokers with lung cancer and 3 smokers without lung cancer).
  • LOH a >20% change in normalized allele height ratio ( FIG. 22 ) (Liloglou T, et al., Cancer Res., 61:1624-1628 (2001); Liloglou T, et al., Int. J. Oneal., 16:5-14 (2000)). All instances of LOH were verified by repetition and the mean allele height ratio was used for data analysis. LOH was measured by comparing tumor DNA to nonmalignant lung DNA rather than to lymphocyte DNA, which was unavailable for this study. Thus, LOH represented allelic loss between two somatic sites in the same lung, rather than between tumor tissue and constitutional genomic DNA.
  • FAL fractional allelic loss
  • Chromosomes 1.0p, 9p, and 5q were the most frequent sites of LOH in nonsmokers' tumors while 9p and 5q were the most frequent sites in smokers. Increased FAL in nonsmokers was most pronounced at five chromosomal arms: 3p, 8p, 9p, 10p, and 18q with FAL ranging from 55 to 87%.
  • These microsatellites harbor several known or candidate tumor suppressor genes such as FHIT, DLCL (Daigo Y, et al., Cancer Res., 59:1966-1972 (1999)), RASSFI (Dammam R, el al., Nat.
  • Samples of epithelial cells obtained by brushing airway surfaces, were obtained from intra- and extra-pulmonary airways in 11 normal non-smokers (NS), 15 smokers without lung cancer (S), and 9 smokers with lung cancer (SC). 5-10 ug of RNA was extracted using standard trizol-based methods, quality of RNA was assayed in gels, and the RNA was processed using standard protocols developed by Affymetrix for the U133 human array. Expression profiles, predictive algorithms, and identification of critical genes are made using bioinformatic methods.
  • FIGS. 20A-20F show a list of genes which are differentially expressed in smokers and non-smokers.
  • FIGS. 21A-21B show a list of genes which are differentially expressed in smokers and smokers with lung cancer. T-test statistical results are shown.
  • Cigarette smoking is responsible for 90% of all lung cancers, the leading cause of cancer deaths in the US and the world (2, 3).
  • Smoking is also the major cause of chronic obstructive pulmonary disease (COPD), the fourth leading cause of death in the US (4).
  • COPD chronic obstructive pulmonary disease
  • COPD chronic obstructive pulmonary disease
  • Bronchial airway epithelial cells were obtained from brushings of the right mainstem bronchus taken during fiberoptic bronchoscopy using an endoscopic cytobrush (CELEBRITY Endoscopy Cytology Brush, Boston Scientific, Boston, Mass.).
  • the brushes were immediately placed in TRizol reagent (Invitrogen, Carlsbad, Calif.) after removal from the bronchoscope and kept at ⁇ 80° C. until RNA isolation was performed. Any other RNA protection protocol known to one skilled in the art can also be used.
  • TRizol Reagent Invitrogen
  • RNA was extracted from the brushes using TRizol Reagent (Invitrogen) as per the manufacturer protocol, with a yield of 8-15 ⁇ g of RNA per patient.
  • RNA isolation or purification can be used to isolate RNA from the samples. Integrity of the RNA was confirmed by running it on a RNA denaturing gel. Epithelial cell content of representative bronchial brushing samples was quantified by cytocentrifugation (ThermoShandon Cytospin, Pittsburgh, Pa.) of the cell pellet and staining with a cytokeratin antibody (Signet, Dedham Mass.). The study was approved by the Institutional Review Board of Boston University Medical Center and all participants provided written informed consent.
  • the ENZO Bioarray RNA transcript labeling kit (Affymetrix) was used for in vitro transcription of the purified double stranded cDNA.
  • the biotin-labeled cRNA was purified using the RNeasy kit (Qiagen) and fragmented into approximately 200 base pairs by alkaline treatment (200mM Tris-acetate, pH 8.2, 500 mM potassium acetate, 150 mM magnesium acetate). Each verified cRNA sample was then hybridized overnight onto the Affymetrix HG-U133A array and confocal laser scanning (Agilent) was then performed to detect the streptavidin-labeled fluor.
  • a single weighted mean expression level for each gene along with a p (detection) -value (which indicates whether the transcript was reliably detected) was derived using Microarray Suite 5.0 software (Affymetrix, SantaClara, Calif.).
  • the MAS 5.0 software also generated a detection p-value (p (detection) -value) for each gene which indicates whether the transcript was reliably detected.
  • p (detection) -value p (detection) -value
  • Microarray data normalization was accomplished in MAS 5.0, where the mean intensity for each array (top and bottom 2% of genes excluded) was corrected (by a scaling factor) to a set target intensity of 100.
  • the list of genes on this array is available at hap://www.affymetrix.com/analysis/download center.affx.
  • Arrays of poor quality were excluded based on several quality control measures. Each array's scanned image was required to be free of any significant artifacts and the bacterial genes spiked into the hybridization mix had to have a P (detection) -value below 0.05 (called present). If an array passed this criteria, it was evaluated based on three other quality measures: the 3′ to 5′ ratio of the intensity for Glyceraldehyde-3-phosphate dehydrogenase (GAPDH), the percent of genes detected as present, and the percent of “outlier” genes as determined by a computational algorithm we developed (see httn://pulm.bumc.bu.edu/aged/supplemental.html for further details, which are herein incorporated by reference).
  • GPDH Glyceraldehyde-3-phosphate dehydrogenase
  • Microarray Data Analysis Clinical information and array data as well as gene annotations are stored in an interactive MYSQL database coded in Pert available at http://pulm.bumc.bu.edu/aged/index.html. All statistical analyses below and within the database were performed using R software version 1.6.2 (available at http://r-project.org). The gene annotations used for each probe set were from the October 2003 NetAffx HG-U13 3A Annotation Files.
  • FIGS. 27A and 27B Two systematic approaches were implemented to assess the variability between replicates versus the variability between unrelated samples. Pearson correlation coefficients were computed between replicates as well as between unrelated samples within a group (never or current smoker) and between groups (never versus current smoker) using the filtered gene list (9968 genes).
  • FIG. 35 reports the mean R squared values for each of the four comparisons. The results demonstrate that the mean correlation among replicates is higher than between two unrelated samples, and that the within group correlations between unrelated samples are higher than the between group correlations between unrelated samples.
  • the second approach uses a different methodology, but yields similar results to those described in FIG. 35 .
  • a differential gene expression ratio was computed between replicate samples and between all possible combinations of two unrelated samples (Lenburg M, Liou L, Gerry N, Frampton G, Cohen H & Christman M. (2003) BMC Cancer 3 , 31).
  • a histogram of the log base 2 ratio values or fold changes is displayed in FIG. 27C .
  • the number of fold changes computed for the replicate samples is less than the number of fold changes computed for unrelated samples, therefore, the frequencies in the histogram are calculated as a percent of the total fold changes calculated.
  • the histogram clearly shows that there is less variability among the replicate samples. In the replicate samples there is a higher frequency of genes having a fold change close to or equal to one compared to unrelated samples.
  • the normal large airway transcriptome was defined by the genes whose median p (detection) -value was less than 0.05 across all 23 healthy never smokers (7119 genes expressed across majority of subjects), as well as a subset of these 7119 genes whose p (detection) -value was less than 0.05 in all 23 subjects (2382 genes expressed across all subjects).
  • the coefficient of variation for each gene in the transcriptome was calculated. as the standard deviation divided by the mean expression level multiplied by 100 for that gene across all nonsmoking individuals.
  • the GOMINER software (16) was used to functionally classify the genes expressed across all nonsmokers (2382 probesets) by the molecular function categories within Gene Ontology (GO). Multiple linear regressions were performed on the top ten percent most variable probesets (712 probesets, as measured by the coefficient of variation) in the normal airway transcriptome (7119 probesets) in order to study the effects of age, gender, and race on gene expression.
  • genes expressed at low levels are not necessarily accurately detected by microarray technology.
  • the probe sets which define the normal airway transcriptome therefore, will represent genes which are expressed at a measurable level in either the majority or all of the nonsmoking healthy subjects.
  • One of the limitations to this approach is that we will be excluding genes expressed at low levels in the normal airway transcriptome.
  • FIG. 29 shows that the majority of genes in the normal airway transcriptome have coefficients of variation below 50.
  • the coefficients of variation for the top ten percent most variable genes ranged from 50.78 to 273.04.
  • a general linear model was used to explore the relationship between gene expression and age (numerical variable), race (categorical variable with two groups Caucasian or Other), and gender (categorical variable).
  • the model included the three main effects plus the three possible two-way interactions.
  • Models having a p-value less than 0.01 (83 genes) were chosen for further analysis.
  • the following diagnostic plots were assessed: residuals versus the fitted values plot, normal Q-Q plot, and Cook's distance plot. Based on the graphs, 13 models were removed because the residuals were not normally distributed or had unequal variance.
  • the age:race interaction term is absent from the table because none of the models had p-values less than 0.01 for this term.
  • Pearson correlation coefficients were calculated (see supplementary information).
  • a permutation test was used to assess the significance of our p-value threshold for any given gene's comparison between two groups (p (t-test) -value) or between a clinical variable (p (t-test) -value) (see supporting information for details).
  • Partek 5.0 software http://www.partek.com.
  • Coefficients of variation (sd/mean*100) were computed across never, former, and current smoker subjects for each of the 9968 probesets.
  • the top 1000 most variable probesets (%CV>56.52) were selected and hierarchical clustering of these probesets and samples was performed using log transformed z-score normalized data using a Pearson correlation (uncentered) similarity metric and average linkage clustering using CLUSTER and TREEVIEW software programs obtained at http://rana.1bI.gov/EisenSoftware.htm.
  • the clustering dendogram of the samples is displayed in FIG. 28 .
  • the samples do not cluster according to their classification of never, former, or current smokers, and therefore, a supervised approach was needed (see below).
  • the dendogram does not reveal a clustering pattern that is related to technical variation in the processing of the samples.
  • Quantitative PCR Validation Real time PCR(QRT-PCR) was used to confirm the differential expression of a select number of genes. Primer sequences were designed with Primer Express software (Applied Biosystems, Foster City, Calif.). Forty cycles of amplification, data acquisition, and data analysis were carried out in an ABI Prism 7700 Sequence Detector (Applied Biosystems, Foster City, Calif.). All real time PCR experiments were carried out in triplicate on each sample.
  • RNA samples 500 ng of residual sample from array experiment
  • DNAfree Ambion
  • Total RNA was reverse transcribed using Superscript II (Gibco).
  • Five microliters of the reverse transcription reaction was added to 45 ⁇ l of SYBR Green PCR master mix (Applied Biosystems). Forty cycles of amplification, data acquisition, and data analysis were carried out in an ABI Prism 7700 Sequence Detector (PE Applied Biosystems). Threshold determinations were automatically performed by the instrument for each reaction.
  • the cycle at which a sample crosses the threshold (a PCR cycle where the fluorescence emission exceeds that of nontemplate controls) is called the threshold cycle, or CT.
  • CT The cycle at which a sample crosses the threshold (a PCR cycle where the fluorescence emission exceeds that of nontemplate controls) is called the threshold cycle, or CT.
  • a high CT value corresponds to a small amount of template DNA, and a low CT corresponds to a large amount of template present initially. All real time PCR experiments were carried out in triplicate on each sample (mean of the triplicate shown). Data from the QRT-PCR for 5 genes that changed in response to cigarette exposure along with the tnicroarray results for these genes is shown in FIGS. 31A-31E .
  • Additional Information including the raw image data from all microarray samples (.DAT files), expression levels for all genes in all samples (stored in a relational database), user-defined statistical and graphical analysis of data and clinical data on all subjects is available at http://pulm.bumc.bu.edu/aged/. Data from our microarray experiments has also been deposited in NCBI's Gene Expression Omnibus under accession GSE994.
  • Results and Discussion Study Population and replicate samples. Microarrays from 75 subjects passed the quality control filters described above and are included in this study. Demographic data on these subjects, including 23 never smokers, 34 current smokers, and 18 former smokers, is presented in FIG. 34 . Bronchial brushings yielded 90% epithelial cells, as determined by cytokeratin staining, with the majority being ciliated cells. Samples taken from the right and left main bronchi in the same individual were highly reproducible with an R 2 value of 0.92, as were samples from the same individual taken 3 months apart with an R 2 value of 0.85 (see FIGS. 27A-27C ).
  • the Normal Airway Transcriptome 7119 genes were expressed at measurable levels in the majority of never smokers and 2382 genes were expressed in all of the 23 healthy never smokers. There was relatively little variation in expression levels of the 7119 genes; 90% had a coefficient of variation (SD/mean) of ⁇ 50% (see FIG. 29 ). Only a small part of the variation between subjects could be explained by age, gender or race on multiple linear regression analysis (see FIGS. 36A-36C ).
  • Table 6 depicts the GOMINER molecular functions(16) of the 2382 genes expressed in large airway epithelial cells of all healthy never smokers. Genes associated with oxidant stress, ion and electron transport, chaperone activity, vesicular transport, ribosomal structure and binding functions are over-represented. Genes associated with transcriptional regulation, signal transduction, pores and channels are under-represented as well as immune, cytokine and chemokine genes. Upper airway epithelial cells, at least in normal subjects, appear to serve as an oxidant and detoxifying defense system for the lung, but serve few other complex functions in the basal state.
  • Table 6 GOMINER molecular functions of genes in airway epithelial cells. Major molecular functional categories and subcategories of 2382 genes expressed in all never smoker subjects. Over- or under-representation of categories is deteiniined using Fisher's Exact Test. The null hypothesis is that the number of genes in our flagged set belonging to a category divided by the total number of genes in the category is equal to the number of flagged genes NOT in the category divided by the total number of genes NOT in the category. Equivalency in these two proportions is consistent with a random distribution of genes into functional categories and indicates no enrichment or depletion of genes in the category being tested. Categories considered to be statistically (p (GO) ⁇ 0.05) over- or under-represented by GOMINER are shown.
  • Cells/arrays refers to the ratio of the number of genes expressed in epithelial cells divided by the number of genes on U133A array in each functional category. Actual numbers are in parentheses.
  • Cigarette Smoking on the Airway Transcriptome altered the airway epithelial cell expression of a large number of genes. Ninety-seven genes were found to he differentially expressed by t-test between current and never smokers at p ⁇ 1.06*10 ⁇ 5 . This (p (t-test) -value) threshold was selected based on a permutation analysis performed to address the multiple comparison problem inherent in ally microarray analysis (see supporting information for further details). We chose a very stringent multiple comparison correction and (p (t-test) -value) threshold in order to identify a subset of genes altered by cigarette smoking with only a small probability of having a false positive.
  • genes that were increased in smokers tended to be involved in regulation of oxidant stress and glutathione metabolism, xenobiotic metabolism, and secretion.
  • Expression of several putative oncogenes pirin, CA12, and CEACAM6 were also increased.
  • Genes that decreased in smokers tended to be involved in regulation of inflammation, although expression of several putative tumor suppressor genes (TU3A, SLIT1 and 2, GAS6) were decreased. Changes in the expression of select genes were confirmed by real time RT-PCR (see FIGS. 31A-31E ).
  • FIG. 24 shows two-dimensional hierarchical clustering of all the current and never smokers based on the 97 genes that are differentially expressed between the two groups (tree for genes not shown). There were three current smokers (patients 456, 4147 and 4164) whose expression of a subset of genes was similar to that of never smokers. These three smokers, who were similar clinically to other smokers, also segregated in the same fashion when clusters were based on the top 361 genes differentially expressed.
  • the general effect of smoking on large airway epithelial cells was to induce expression of xenobiotic metabolism and redox stress-related genes and to decrease expression of some genes associated with regulation of inflammation.
  • Several putative oncogenes were upregulated and tumor suppressor genes were downregulated although their roles, in smoking-induced lung cancer remain to be determined. Risk for developing lung cancer in smokers has been shown to increase with cumulative pack-years of exposure(22), and a number of putative oncogenes correlate positively with pack-years, while putative tumor suppressor genes correlate negatively.
  • a gene's actual (p (t-test) -value) is compared to all other permuted (p (t-test) -values) to determine if the any of the permuted (p (t-test) -values) is equal to or lower than the actual gene's (p (t-test) -value).
  • An adjusted (p (t-test) -value) is computed for each gene based on the permutation test.
  • the adjusted (p (t-test) -value) is the probability of observing at least as small a (p (t-test) -value) (in any gene) as the gene's actual (p (t-test) -value) in any random permutation.
  • a gene is considered significant if less than 50 out of 1000 permutations (0.05) yield a gene with a permuted (p (t-test) -value) equal to or lower than the actual gene's (p (t-test) -value).
  • the permuted (p (t-test) -value) threshold was found to be 1.06*10 ⁇ 5 .
  • Ninety-seven genes were considered differentially expressed between current and never smokers at this threshold.
  • One shortcoming of this methodology is that is impossible to compute all possible permutations of the group assignments for large sample sizes.
  • the mean (p (t-test) -value) was used as a cutoff and yielded a gene list of ninety-seven genes. In this case, the distribution of the data is such that the permuted P ⁇ t-test)-value threshold is slightly less strict than the equivalent Bonferroni cutoff.
  • the q-value of the 97 th gene was 0.005, which means that among all 97 t-tests that we designate as significant only 0.5% of them will be false positives.
  • a less strict (p (t-test) -value) cutoff of 4.06*10 ⁇ 4 q-value 0.01) yields 261 genes with approximately 3 false positive genes.
  • the q-values were calculated using the program Q-Value which can be downloaded from http://faculty.washington.edu/ ⁇ istorey/gvalue/. Larger lists of genes can be accessed through our database by selecting a less restrictive (p (t-test) -value) threshold (http://pulm.bumc.bu.edu/aged).
  • the threshold was found to be 3.19*10 6 genes falling below this threshold.
  • Supplementary Table 6 displays the top 51 genes with unadjusted (p (correlation) -value) below 0.0001.
  • the (p (correlation) -value) threshold found using the permutation based multiple comparison correction is more strict than the Bonferroni threshold of 2.4*10 ⁇ 5 because the correction is data dependent and pack-year values in our study are quite variable.
  • the current smokers in our study have an average number of pack-years of 22, but there are 3 “outlier” current smokers with extremely high pack-year histories (>70 pack-years). These smokers with extremely high pack years underpin the linear fit and result in better correlations even for random permutations, and thus lead to a stricter multiple comparison correction threshold.
  • FIG. 25A shows a multidimensional scaling plot of never and current smokers according to the expression of the 97 genes that distinguish current smokers from never smokers.
  • FIG. 25B shows that former smokers who discontinued smoking less than 2 years prior to this study tend to cluster with current smokers, whereas former smokers who discontinued smoking for more than 2 years group more closely with never smokers.
  • Hierarchical clustering of all 18 former smokers according to the expression of these same 97 genes also reveals 2 subgroups of former smokers, with the length of smoking cessation being the only clinical variable that was statistically different between the 2 subgroups (see FIG. 30 ).
  • Reversible genes were predominantly drug metabolizing and antioxidant genes.
  • the metallothionein genes that remained abnormal in former smokers are located at 16q13, suggesting that this may represent a fragile site for DNA injury in smokers.
  • the persistence of abnormal expression of select genes after smoking cessation may provide growth advantages to a subset of epithelial cells allowing for clonal expansion and perpetuation of these cells years after smoking had been discontinued. These permanent changes might explain the persistent risk of lung cancer in former smokers.
  • Hierarchical clustering of the genes and samples was performed using log transformed z-score normalized data using a Pearson correlation (uncentered) similarity metric and average linkage clustering using CLUSTER and TREEVIEW software programs obtained at http://rana.1bI.gov/EisenSoftware.htmMDS and PCA were performed using Partek 5.0 software obtained at www.partek.com.
  • At-test was performed between former and never smoker across all 9968 genes, and 44 genes were found to have a (p (t-test) -value) threshold below 0.00098. While the permuted (p (t-test) -value) threshold for this extension of our t-test should have been computed across all 9968 genes, the former smokers are the smallest group in our study and thus we chose a less restrictive (p (t-test) -value) threshold. Although a there was about a 100-fold increase in the amount of genes analyzed there was only about a 3-fold increase in the number of genes found to be significantly different between never and former smokers. Therefore, most genes that are significantly different between never and former smokers are also significantly different between current and never smokers.
  • Cigarette smoking a major cause of lung disease, induces xenobiotic and redox regulating genes as well as several oncogenes, and decreases expression of several tumor suppressor genes and genes that regulate airway inflammation.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Pathology (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Genetics & Genomics (AREA)
  • Hospice & Palliative Care (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Oncology (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

The present invention provides methods for diagnosis and prognosis of lung cancer using expression analysis of one or more groups of genes, and a combination of expression analysis with bronchoscopy. The methods of the invention provide far superior detection accuracy for lung cancer when compared to any other currently available method for lung cancer diagnostic or prognosis. The invention also provides methods of diagnosis and prognosis of other lung diseases, particularly in individuals who are exposed to air pollutants, such as cigarette or cigar smoke, smog, asbestos and the like air contaminants or pollutants.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • The present application is a continuation-in-part of U.S. application Ser. No. 15/888,831, filed on Feb. 5, 2018, which is a continuation of U.S. application Ser. No. 14/613,210, filed on Feb. 3, 2015, which is a continuation of U.S. application Ser. No. 13/524,749, filed on Jun. 15, 2012, which is a continuation of U.S. application Ser. No. 12/869,525, filed on Aug. 26, 2010, which is a continuation of U.S. application Ser. No. 11/918,588, filed Feb. 8, 2008, which is a national stage filing under 35 U.S.C. 371 of International Application PCT/US2006/014132, filed Apr. 14, 2006, which claims the benefit of priority under 35 U.S.C. 119(e) to U.S. provisional application Ser. No. 60/671,243, filed on Apr. 14, 2005, the contents of which are herein incorporated by reference in their entirety. International Application PCT/US2006/014132 was published under PCT Article 21(2) in English.
  • GOVERNMENT FUNDING
  • This invention was made with Government Support under Contract No. HL 071771 awarded by the National Institutes of Health. The Government has certain rights in the invention.
  • BACKGROUND OF THE INVENTION Field of the Invention
  • The present invention is directed to diagnostic and prognostic methods by using analysis of gene group expression patterns in a subject. More specifically, the invention is directed to diagnostic and prognostic methods for detecting lung diseases, particularly lung cancer in subjects, preferably humans that have been exposed to air pollutants.
  • Background
  • Lung disorders represent a serious health problem in the modern society. For example, lung cancer claims more than 150,000 lives every year in the United States, exceeding the combined mortality from breast, prostate and colorectal cancers. Cigarette smoking is the most predominant cause of lung cancer. Presently, 25% of the U.S. population smokes, but only 10% to 15% of heavy smokers develop lung cancer. There are also other disorders associated with smoking such as emphysema. There are also health questions arising from people exposed to smokers, for example, second hand smoke. Former smokers remain at risk for developing such disorders including cancer and now constitute a large reservoir of new lung cancer cases. In addition to cigarette smoke, exposure to other air pollutants such as asbestos, and smog, pose a serious lung disease risk to individuals who have been exposed to such pollutants.
  • Approximately 85% of all subjects with lung cancer die within three years of diagnosis. Unfortunately survival rates have not changed substantially of the past several decades. This is largely because there are no effective methods for identifying smokers who are at highest risk for developing lung cancer and no effective tools for early diagnosis.
  • The methods that are currently employed to diagnose lung cancer include chest X-ray analysis, bronchoscopy or sputum cytological analysis, computer tomographic analysis of the chest, and positron electron tomographic (PET) analysis. However, none of these methods provide a combination of both sensitivity and specificity needed for an optimal diagnostic test.
  • Classification of human lung cancer by gene expression profiling has been described in several recent publications (M. Garber, “Diversity of gene expression in adenocarcinoma of the lung,” PNAS, 98(24): 13784-13789 (2001); A. Bhattacharjee, “Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses,” PNAS, 98(24):13790-13795 (2001)), but no specific gene set is used as a classifier to diagnose lung cancer in bronchial epithelial tissue samples.
  • Moreover, while it appears that a subset of smokers are more susceptible to, for example, the carcinogenic effects of cigarette smoke and are more likely to develop lung cancer, the particular risk factors, and particularly genetic risk factors, for individuals have gone largely unidentified. Same applies to lung cancer associated with, for example, asbestos exposure.
  • Therefore, there exists a great need to develop sensitive diagnostic methods that can be used for early diagnosis and prognosis of lung diseases, particularly in individuals who are at risk of developing lung diseases, particularly individuals who are exposed to air pollutants such as cigarette/cigar smoke, asbestos and other toxic air pollutants.
  • SUMMARY OF THE INVENTION
  • The present invention provides compositions and methods for diagnosis and prognosis of lung diseases which provides a diagnostic test that is both very sensitive and specific.
  • We have found a group of gene transcripts that we can use individually and in groups or subsets for enhanced diagnosis for lung diseases, such as lung cancer, using gene expression analysis. We provide detailed guidance on the increase and/or decrease of expression of these genes for diagnosis and prognosis of lung diseases, such as lung cancer.
  • One example of the gene transcript groups useful in the diagnostic/prognostic tests of the invention are set forth in Table 6. We have found that taking groups of at least 20 of the Table 6 genes provides a much greater diagnostic capability than chance alone.
  • Preferably one would use more than 20 of these gene transcript, for example about 20-100 and any combination between, for example, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, and so on. Our preferred groups are the groups of 96 (Table 1), 84 (Table 2), 50 (Table 3), 36 (Table 4), 80 (Table 5), 535 (Table 6) and 20 (Table 7). In some instances, we have found that one can enhance the accuracy of the diagnosis by adding certain additional genes to any of these specific groups. When one uses these groups, the genes in the group are compared to a control or a control group. The control groups can be non-smokers, smokers, or former smokers. Preferably, one compares the gene transcripts or their expression product in the biological sample of an individual against a similar group, except that the members of the control groups do not have the lung disorder, such as emphysema or lung cancer. For example, comparing can be performed in the biological sample from a smoker against a control group of smokers who do not have lung cancer. When one compares the transcripts or expression products against the control for increased expression or decreased expression, which depends upon the particular gene and is set forth in the tables—not all the genes surveyed will show an increase or decrease. However, at least 50% of the genes surveyed must provide the described pattern. Greater reliability if obtained as the percent approaches 100%. Thus, in one embodiment, one wants at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% of the genes surveyed to show the altered pattern indicative of lung disease, such as lung cancer, as set forth in the tables, infra.
  • In one embodiment, the invention provides a group of genes the expression of which is altered in individuals who are at risk of developing lung diseases, such as lung cancer, because of the exposure to air pollutants. The invention also provides groups of genes the expression of which is consistently altered as a group in individuals who are at risk of developing lung diseases because of the exposure to air pollutants.
  • The present invention provides gene groups the expression pattern or profile of which can be used in methods to diagnose lung diseases, such as lung cancer and even the type of lung cancer, in more than 60%, preferably more than 65%, still more preferably at least about 70%, still more preferably about 75%, or still more preferably about 80%-95% accuracy from a sample taken from airways of an individual screened for a lung disease, such as lung cancer.
  • In one embodiment, the invention provides a method of diagnosing a lung disease such as lung cancer using a combination of bronchoscopy and the analysis of gene expression pattern of the gene groups as described in the present invention.
  • Accordingly, the invention provides gene groups that can be used in diagnosis and prognosis of lung diseases. Particularly, the invention provides groups of genes the expression profile of which provides a diagnostic and or prognostic test to determine lung disease in an individual exposed to air pollutants. For example, the invention provides groups of genes the expression profile of which can distinguish individuals with lung cancer from individuals without lung cancer.
  • In one embodiment, the invention provides an early asymptomatic screening system for lung cancer by using the analysis of the disclosed gene expression profiles. Such screening can be performed, for example, in similar age groups as colonoscopy for screening colon cancer. Because early detection in lung cancer is crucial for efficient treatment, the gene expression analysis system of the present invention provides a vastly improved method to detect tumor cells that cannot yet be discovered by any other means currently available.
  • The probes that can be used to measure expression of the gene groups of the invention can be nucleic acid probes capable of hybridizing to the individual gene/transcript sequences identified in the present invention, or antibodies targeting the proteins encoded by the individual gene group gene products of the invention. The probes are preferably immobilized on a surface, such as a gene or protein chip so as to allow diagnosis and prognosis of lung diseases in an individual.
  • In one embodiment, the invention provides a group of genes that can be used as individual predictors of lung disease. These genes were identified using probabilities with a t-test analysis and show differential expression in smokers as opposed to non-smokers. The group of genes comprise ranging from 1 to 96, and all combinations in between, for example 5, 10, 15, 20, 25, 30, for example at least 36, at least about, 40, 45, 50, 60, 70, 80, 90, or 96 gene transcripts, selected from the group consisting of genes identified by the following GenBank sequence identification numbers (the identification numbers for each gene are separated by “;” while the alternative GenBank ID numbers are separated by “///”): NM_003335; NM_000918; NM_006430.1; NM_001416.1; NM_004090; NM_006406.1; NM_003001.2; NM_001319; NM_006545.1; NM_021145.1; NM_002437.1; NM_006286; NM_001003698 /// NM_001003699 /// NM_002955; NM_001123 /// NM_006721; NM_024824; NM_004935.1; NM_002853.1; NM_019067.1; NM_024917.1; NM_020979.1; NM_005597.1; NM_007031.1; NM_009590.1; NM_020217.1; NM_025026.1; NM_014709.1; NM_014896.1; AF010144; NM_005374.1; NM_001696; NM_005494 /// NM_058246; NM_006534 /// NM_181659; NM_006368; NM_002268 /// NM_032771; NM_014033; NM_016138; NM_007048 /// NM_194441; NM_006694; NM_000051 /// NM_138292 /// NM_138293; NM_000410 /// NM_139002 /// NM_139003 /// NM_139004 /// NM_139005 /// NM_139006 /// NM_139007 /// NM_139008 /// NM_139009 /// NM_139010 /// NM_139011; NM_004691; NM_012070 /// NM_139321 /// NM_139322; NM_006095; AI632181; AW024467; NM_021814; NM_005547.1; NM_203458; NM_015547 /// NM_147161; AB007958.1; NM_207488; NM_005809 /// NM_181737 /// NM_181738; NM_016248 /// NM_144490; AK022213.1; NM_005708; NM_207102; AK023895; NM_144606 /// NM_144997; NM_018530; AK021474; U43604.1; AU147017; AF222691.1; NM_015116; NM_001005375 /// NM_001005785 /// NM_001005786 /// NM_004081 /// NM_020363 /// NM_020364 /// NM_020420; AC004692; NM_001014; NM_000585 /// NM_172174 /// NM_172175; NM_054020 /// NM_172095 /// NM_172096 /// NM_172097; BE466926; NM_018011; NM_024077; NM_012394; NM_019011 /// NM_207111 /// NM_207116; NM_017646; NM_021800; NM_016049; NM_014395; NM_014336; NM_018097; NM_019014; NM_024804; NM_018260; NM_018118; NM_014128; NM_024084; NM_005294; AF077053; NM_138387; NM_024531; NM_000693; NM_018509; NM_033128; NM_020706; AI523613; and NM_014884, the expression profile of which can be used to diagnose lung disease, for example lung cancer, in lung cell sample from a smoker, when the expression pattern is compared to the expression pattern of the same group of genes in a smoker who does not have or is not at risk of developing lung cancer.
  • In another embodiment, the gene/transcript analysis comprises a group of about 10-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80, 80-90, 90-100, 100-120, 120-140, 140-150, 150-160, 160-170, 170-180, 180-190, 190-200, 200-210, 210-220, 220-230, 230-240, 240-250, 250-260, 260-270, 270-280, 280-290, 290-300, 300-310, 310-320, 320-330, 330-340, 340-350, 350-360, 360-370, 370-380, 380-390, 390-400, 400-410, 410-420, 420-430, 430-440, 440-450, 450-460, 460-470, 470-480, 480-490, 490-500, 500-510, 510-520, 520-530, and up to about 535 genes selected from the group consisting of genes or transcripts as shown in the Table 6.
  • In one embodiment, the genes are selected from the group consisting of genes or transcripts as shown in Table 5.
  • In another embodiment, the genes are selected from the genes or transcripts as shown in Table 7.
  • In one embodiment, the transcript analysis gene group comprises a group of individual genes the change of expression of which is predictive of a lung disease either alone or as a group, the gene transcripts selected from the group consisting of NM_007062.1; NM_001281.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; NM_002268 /// NM_032771; NM_007048 /// NM_194441; NM_006694; U85430.1; NM_004691; AB014576.1; BF218804; BE467941; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_021971.1; NM_014128.1; AA133341; AF198444.1.
  • In one embodiment, the gene group comprises a probe set capable of specifically hybridizing to at least all of the 36 gene products. Gene product can be mRNA which can be recognized by an oligonucleotide or modified oligonucleotide probe, or protein, in which case the probe can be, for example an antibody specific to that protein or an antigenic epitope of the protein.
  • In yet another embodiment, the invention provides a gene group, wherein the expression pattern of the group of genes provides diagnostic for a lung disease. The gene group comprises gene transcripts encoded by a gene group consisting of at least for example 5, 10, 15, 20, 25, 30, preferably at least 36, still more preferably 40, still more preferably 45, and still more preferably 46, 47, 48, 49, or all 50 of the genes selected from the group consisting of and identified by their GenBank identification numbers: NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U 93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC005969.1; NM_002271.1; AL566172; AB014576.1; BF218804; AK022494.1; AA114843; BE467941; NM_003541.1; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AU147182; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_019023.1; NM_021971.1; NM_014128.1; AK025651.1; AA133341; and AF198444.1. In one preferred embodiment, one can use at least 20 of the 36 genes that overlap with the individual predictors and, for example, 5-9 of the non-overlapping genes and combinations thereof.
  • In another embodiment, the invention provides a group of about 30-180, preferably, a group of about 36-150 genes, still more preferably a group of about 36-100, and still more preferably a group of about 36-50 genes, the expression profile of which is diagnostic of lung cancer in individuals who smoke.
  • In one embodiment, the invention provides a group of genes the expression of which is decreased in an individual having lung cancer. In one embodiment, the group of genes comprises at least 5-10, 10-15, 15-20, 20-25 genes selected from the group consisting of NM_000918; NM_006430.1; NM_001416.1; NM_004090; NM_006406.1; NM_003001.2; NM_006545.1; NM_002437.1; NM_006286; NM_001123 /// NM_006721; NM_024824; NM_004935.1; NM_001696; NM_005494 /// NM_058246; NM_006368; NM_002268 /// NM_032771; NM_006694; NM_004691; NM_012394; NM_021800; NM_016049; NM_138387; NM_024531; and NM_018509. One or more other genes can be added to the analysis mixtures in addition to these genes.
  • In another embodiment, the group of genes comprises genes selected from the group consisting of NM_014182.1; NM_001281.1; NM_024006.1; AF135421.1; L76200.1; NM_000346.1; BC008710.1; BC000423.2; BC008710.1; NM_007062; BC075839.1 /// BC073760.1; BC072436.1 /// BC004560.2; BC001016.2; BC005023.1; BC000360.2; BC007455.2; BC023528.2 /// BC047680.1; BC064957.1; BC008710.1; BC066329.1; BC023976.2; BC008591.2 /// BC050440.1 /// BC048096.1; and BC028912.1.
  • In yet another embodiment, the group of genes comprises genes selected from the group consisting of NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC005969.1; NM_002271.1; AL566172; and AB014576.1.
  • In one embodiment, the invention provides a group of genes the expression of which is increased in an individual having lung cancer. In one embodiment, the group of genes comprises genes selected from the group consisting of NM_003335; NM_001319; NM_021145.1; NM_001003698 /// NM_001003699 ///; NM_002955; NM_002853.1; NM_019067.1; NM_024917.1; NM_020979.1; NM_005597.1; NM_007031.1; NM_009590.1; NM_020217.1; NM_025026.1; NM_014709.1; NM_014896.1; AF010144; NM_005374.1; NM_006534 /// NM_181659; NM_014033; NM_016138; NM_007048 /// NM_194441; NM_000051 /// NM_138292 /// NM_138293; NM_000410 /// NM_139002 /// NM_139003 /// NM_139004 /// NM_139005 /// NM_139006 /// NM_139007 /// NM_139008 /// NM_139009 /// NM_139010 /// NM_139011; NM_012070 /// NM_139321 /// NM_139322; NM_006095; AI632181; AW024467; NM_021814; NM_005547.1; NM_203458; NM_015547 /// NM_147161; AB007958.1; NM_207488; NM_005809 /// NM_181737 /// NM_181738; NM_016248 /// NM_144490; AK022213.1; NM_005708; NM_207102; AK023895; NM_144606 /// NM_144997; NM_018530; AK021474; U43604.1; AU147017; AF222691.1; NM_015116; NM_001005375 /// NM_001005785 /// NM_001005786 /// NM_004081 /// NM_020363 /// NM_020364 /// NM_020420; AC004692; NM_001014; NM_000585 /// NM_172174 /// NM_172175; NM_054020 /// NM_172095 /// NM_172096 /// NM_172097; BE466926; NM_018011; NM_024077; NM_019011 /// NM_207111 /// NM_207116; NM_017646; NM_014395; NM_014336; NM_018097; NM_019014; NM_024804; NM_018260; NM_018118; NM_014128; NM_024084; NM_005294; AF077053; NM_000693; NM_033128; NM_020706; AI523613; and NM_014884.
  • In one embodiment, the group of genes comprises genes selected from the group consisting of NM_030757.1; R83000; AK021571.1; NM_17932.1; U85430.1; AI683552; BC002642.1; AW024467; NM_030972.1; BC021135.1; AL161952.1; AK026565.1; AK023783.1; BF218804; AK023843.1; BC001602.1; BC034707.1; BC064619.1; AY280502.1; BC059387.1; BC061522.1; U50532.1; BC006547.2; BC008797.2; BC000807.1; AL080112.1; BC033718.1 /// BC046176.1 ///; BC038443.1; Hs.288575 (UNIGENE ID); AF020591.1; BC002503.2; BC009185.2; Hs.528304 (UNIGENE ID); U50532.1; BC013923.2; BC031091; Hs.249591 (Unigene ID); Hs.286261 (Unigene ID); AF348514.1; BC066337.1 /// BC058736.1 /// BC050555.1; Hs.216623 (Unigene ID); BC072400.1; BC041073.1; U43965.1; BC021258.2; BC016057.1; BC016713.1 /// BC014535.1 /// AF237771.1; BC000701.2; BC010067.2; Hs.156701 (Unigene ID); BC030619.2; U43965.1; Hs.438867 (Unigene ID); BC035025.2 /// BC050330.1; BC074852.2 /// BC074851.2; Hs.445885 (Unigene ID); AF365931.1; and AF257099.1.
  • In one embodiment, the group of genes comprises genes selected from the group consisting of BF218804; AK022494.1; AA114843; BE467941; NM_003541.1; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AU147182; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_019023.1; NM_021971.1; NM_014128.1; AK025651.1; AA133341; and AF198444.1.
  • In another embodiment, the invention provides a method for diagnosing a lung disease comprising obtaining a nucleic acid sample from lung, airways or mouth of an individual exposed to an air pollutant, analyzing the gene transcript levels of one or more gene groups provided by the present invention in the sample, and comparing the expression pattern of the gene group in the sample to an expression pattern of the same gene group in an individual, who is exposed to similar air pollutant but not having lung disease, such as lung cancer or emphysema, wherein the difference in the expression pattern is indicative of the test individual having or being at high risk of developing a lung disease. The decreased expression of one or more of the genes, preferably all of the genes including the genes listed on Tables 1-4 as “down” when compared to a control, and/or increased expression of one or more genes, preferably all of the genes listed on Tables 1-4 as “up” when compared to an individual exposed to similar air pollutants who does not have a lung disease, is indicative of the person having a lung disease or being at high risk of developing a lung disease, preferably lung cancer, in the near future and needing frequent follow ups to allow early treatment of the disease.
  • In one preferred embodiment, the lung disease is lung cancer. In one embodiment, the air pollutant is cigarette smoke.
  • Alternatively, the diagnosis can separate the individuals, such as smokers, who are at lesser risk of developing lung diseases, such as lung cancer by analyzing the expression pattern of the gene groups of the invention provides a method of excluding individuals from invasive and frequent follow ups.
  • Accordingly, the invention provides methods for prognosis, diagnosis and therapy designs for lung diseases comprising obtaining an airway sample from an individual who smokes and analyzing expression profile of the gene groups of the present invention, wherein an expression pattern of the gene group that deviates from that in a healthy age, race, and gender matched smoker, is indicative of an increased risk of developing a lung disease. Tables 1-4 indicate the expression pattern differences as either being down or up as compared to a control, which is an individual exposed to similar airway pollutant but not affected with a lung disease.
  • The invention also provides methods for prognosis, diagnosis and therapy designs for lung diseases comprising obtaining an airway sample from a non-smoker individual and analyzing expression profile of the gene groups of the present invention, wherein an expression pattern of the gene group that deviates from that in a healthy age, race, and gender matched smoker, is indicative of an increased risk of developing a lung disease.
  • In one embodiment, the analysis is performed from a biological sample obtained from bronchial airways.
  • In one embodiment, the analysis is performed from a biological sample obtained from buccal mucosa.
  • In one embodiment, the analysis is performed using nucleic acids, preferably RNA, in the biological sample.
  • In one embodiment, the analysis is performed analyzing the amount of proteins encoded by the genes of the gene groups of the invention present in the sample.
  • In one embodiment the analysis is performed using DNA by analyzing the gene expression regulatory regions of the groups of genes of the present invention using nucleic acid polymorphisms, such as single nucleic acid polymorphisms or SNPs, wherein polymorphisms known to be associated with increased or decreased expression are used to indicate increased or decreased gene expression in the individual. For example, methylation patterns of the regulatory regions of these genes can be analyzed.
  • In one embodiment, the present invention provides a minimally invasive sample procurement method for obtaining airway epithelial cell RNA that can be analyzed by expression profiling of the groups of genes, for example, by array-based gene expression profiling. These methods can be used to diagnose individuals who are already affected with a lung disease, such as lung cancer, or who are at high risk of developing lung disease, such as lung cancer, as a consequence of being exposed to air pollutants. These methods can also be used to identify further patterns of gene expression that are diagnostic of lung disorders/diseases, for example, cancer or emphysema, and to identify subjects at risk for developing lung disorders.
  • The invention further provides a gene group microarray consisting of one or more of the gene groups provided by the invention, specifically intended for the diagnosis or prediction of lung disorders or determining susceptibility of an individual to lung disorders.
  • In one embodiment, the invention relates to a method of diagnosing a disease or disorder of the lung comprising obtaining a sample, nucleic acid or protein sample, from an individual to be diagnosed; and determining the expression of group of identified genes in said sample, wherein changed expression of such gene compared to the expression pattern of the same gene in a healthy individual with similar life style and environment is indicative of the individual having a disease of the lung.
  • In one embodiment, the invention relates to a method of diagnosing a disease or disorder of the lung comprising obtaining at least two samples, nucleic acid or protein samples, in at least one time interval from an individual to be diagnosed; and determining the expression of the group of identified genes in said sample, wherein changed expression of at least about for example 5, 10, 15, 20, 25, 30, preferably at least about 36, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, or 180 of such genes in the sample taken later in time compared to the sample taken earlier in time is diagnostic of a lung disease.
  • In one embodiment, the disease of the lung is selected from the group consisting of asthma, chronic bronchitis, emphysema, primary pulmonary hypertension, acute respiratory distress syndrome, hypersensitivity pneumonitis, eosinophilic pneumonia, persistent fungal infection, pulmonary fibrosis, systemic sclerosis, idiopathic pulmonary hemosiderosis, pulmonary alveolar proteinosis, and lung cancer, such as adenocarcinoma, squamous cell carcinoma, small cell carcinoma, large cell carcinoma, and benign neoplasm of the lung (e.g., bronchial adenomas and hamartomas).
  • In a particular embodiment, the nucleic acid sample is RNA.
  • In a preferred embodiment, the nucleic acid sample is obtained from an airway epithelial cell. In one embodiment, the airway epithelial cell is obtained from a bronchoscopy or buccal mucosal scraping.
  • In one embodiment, individual to be diagnosed is an individual who has been exposed to tobacco smoke, an individual who has smoked, or an individual who currently smokes.
  • The invention also provides an array, for example, a microarray for diagnosis of a disease of the lung having immobilized thereon a plurality of oligonucleotides which hybridize specifically to genes of the gene groups which are differentially expressed in airways exposed to air pollutants, such as cigarette smoke, and have or are at high risk of developing lung disease, as compared to those individuals who are exposed to similar air pollutants and airways which are not exposed to such pollutants. In one embodiment, the oligonucleotides hybridize specifically to one allelic form of one or more genes which are differentially expressed for a disease of the lung. In a particular embodiment, the differentially expressed genes are selected from the group consisting of the genes shown in tables 1-4; preferably the group of genes comprises genes selected from the Table 3. In one preferred embodiment, the group of genes comprises the group of at least 20 genes selected from Table 3 and additional 5-10 genes selected from Tables 1 and 2. In one preferred embodiment, at least about 10 genes are selected from Table 4.
  • Some aspects of the present invention also provide prognostic and diagnostic methods to assess lung disease risk caused by airway pollutants. The methods according to the present invention use a novel minimally invasive sample procurement method and gene expression-based tools for the diagnosis and prognosis of diseases of the lung, particularly diagnosis and prognosis of lung cancer.
  • We have shown that exposure of airways to pollutants such as cigarette smoke, causes a so-called “field defect”, which refers to gene expression changes in all the epithelial cells lining the airways from mouth mucosal epithelial lining through the bronchial epithelial cell lining to the lungs. Because of this field defect, it is now possible to detect changes, for example, pre-malignant and malignant changes resulting in diseases of the lung using cell samples isolated from epithelial cells obtained not only from the lung biopsies but also from other, more accessible, parts of the airways including bronchial or mouth epithelial cell samples.
  • Some aspects of the present invention are based on the finding that that there are different patterns of gene expression between smokers and non-smokers. The genes involved can be grouped into clusters of related genes that are reacting to the irritants or pollutants. We have found unique sets of expressed genes or gene expression patterns associated with pre-malignancy in the lung and lung cancer in smokers and non-smokers. All of these expression patterns constitute expression signatures that indicate operability and pathways of cellular function that can be used to guide decisions regarding prognosis, diagnosis and possible therapy. Epithelial cell gene expression profiles obtained from relatively accessible sites can thus provide important prognostic, diagnostic, and therapeutic information which can be applied to diagnose and treat lung disorders.
  • We have found that cigarette smoking induces xenobiotic and redox regulating genes as well as several oncogenes, and decreases expression of several tumor suppressor genes and genes that regulate airway inflammation. We have identified a subset of smokers, who respond differently to cigarette smoke and appear thus to be predisposed, for example, to its carcinogenic effects, which permits us to screen for individuals at risks of developing lung diseases.
  • Some aspects of the present invention are based on characterization of “airway transcriptomes” or a signature gene expression profiles of the airways and identification of changes in this transcriptome that are associated with epithelial exposure to pollutants, such as direct or indirect exposure to cigarette smoke, asbestos, and smog. These airway transcriptome gene expression profiles provide information on lung tissue function upon cessation from smoking, predisposition to lung cancer in non-smokers and smokers, and predisposition to other lung diseases. The airway transcriptome expression pattern can be obtained from a non-smoker, wherein deviations in the normal expression pattern are indicative of increased risk of lung diseases. The airway transcriptome expression pattern can also be obtained from a non-smoking subject exposed to air pollutants, wherein deviation in the expression pattern associated with normal response to the air pollutants is indicative of increased risk of developing lung disease.
  • Accordingly, in one embodiment, the invention provides an “airway transcriptome” the expression pattern of which is useful in prognostic, diagnostic and therapeutic applications as described herein. We have discovered the expression of 85 genes, corresponding to 97 probesets on the Affymetrix U133A Genechip array, having expression patterns that differ significantly between healthy smokers and healthy non-smokers. Examples of these expression patterns are shown in FIG. 24. The expression patterns of the airway transcriptome are useful in prognosis of lung disease, diagnosis of lung disease and a periodical screening of the same individual to see if that individual has been exposed to risky airway pollutants such as cigarette smoke that change his/her expression pattern.
  • In one embodiment, the invention provides distinct airway “expression clusters”, i.e., sub-transcriptomes, comprised of related genes among the 85 genes that can be quickly screened for diagnosis, prognosis or treatment purposes. In one embodiment, the invention provides an airway sub-transcriptome comprising mucin genes of the airway transcriptome. Examples of mucin genes include muc 5 subtypes A, B, and C. In another embodiment, the invention provides a sub-transcriptome comprising cell adhesion molecules of the airway transcriptome, such as carcinoembryonic antigen-related adhesion molecule 6 and claudin 10 encoding genes. In another embodiment, the invention provides a sub-transcriptome comprising detoxification related genes of the airway transcriptome. Examples of these genes include cytochrome P450 subfamily I (dioxin-inducible) encoding genes, NADPH dehydrogenase encoding genes. For example, upregulation of transcripts of cytochrome P450 subfamily I (dioxin-inducible) encoding genes.
  • In yet another embodiment, the invention provides a sub-transcriptome comprising immune system regulation associated genes of the airway transcriptome. Examples of immunoregulatory genes include small inducible cytokine subfamily D encoding genes.
  • In another embodiment, the invention provides a sub-transcriptome comprising metallothionein genes of the airway transcriptome. Examples of metallothionein genes include MTX G, X, and L encodinggenes.
  • In another embodiment, the subtranscriptome comprises protooncogenes and oncogenes such as RAB1-1A and CEACAM6. In another embodiment, the subtranscriptome includes tumor suppressor genes such as SLIT1, and SLIT2.
  • In one embodiment, the invention provides a lung cancer “diagnostic airway transcriptome” comprising 208 genes selected from the group consisting of group consisting of 208238_x_at-probeset; 216384_x_at-probeset; 217679_x_at-probeset; 216859_x_at-probeset; 211200_s_at-probeset; PDPK1; ADAM28; ACACB; ASMTL; ACVR2B; ADATI; ALMSI; ANK3; ANK3-; DARS; AFURS1; ATP8B1; ABCCI; BTF3; BRD4; CELSR2; CALM31 CAPZB; CAPZBI CFLAR; CTSS; CD24; CBX3; C21orf106; C6orf111; C6orf62; CHC1; DCLRE1C; EML2; EMS1; EPHB6-; EEF2; FGFR3; FLJ20288; FVT1; GGTLA4; GRP; GLUL; HDGF; Homo sapiens cDNA FLJ11452 fis, clone HEMBA1001435; Homo sapiens cDNA FLJ12005 fis, clone HEMBB1001565; Homo sapiens cDNA FLJ13721 fis, clone PLACE2000450; Homo sapiens cDNA FLJ14090 fis, clone MAMMA1000264; Homo sapiens cDNA FLJ14253 fis, clone OVARC1001376; Homo sapiens fetal thymus prothymosin alpha mRNA, complete eds Homo sapiens fetal thymus prothymosin alpha mRNA; Homo sapiens transcribed sequence with strong similarity to protein ref:NP_004726.1 (H. sapiens) leucine rich repeat (in FLU) interacting protein 1; Homo sapiens transcribed sequence with weak similarity to protein ref:NP_060312.1 (H. sapiens) hypothetical protein FLJ20489; Homo sapiens transcribed sequence with weak similarity to protein ref:NP_060312.1 (H. sapiens) hypothetical protein FLJ20489; 222282_at-probeset corresponding to Homo sapiens transcribed sequences; 21.5032_at-probeset corresponding to Homo sapiens transcribed sequences; 8181 1_at-probeset corresponding to Homo sapiens transcribed sequences; DKFZp547K1 113; ET; FLJ10534; FLJ10743; FLJ13171; FLJ14639; FLJ14675; FLJ20195; FLJ20686; FLJ20700; CG005; CG005; MGC5384; IMP-2; INADAL; INHBC; KIAA0379; KIAA0676; KIAA0779; KIAAI 193; KTNI; KLF5; LRRFIP1; MKRN4; MANIC1; MVK; MUC20; MPZL1; MYO1A; MRLC2; NFATC3; ODAG; PARVA; PASK; PIK3C2B; PGF; PKP4; PRKX; PRKY; PTPRF; PTMA; PTMA; PHTF2; RAB14; ARHGEF6; RIPX; REC8L1; RIOK3; SEMA3F; SRRM21 MGC709071 SMT3H2; SLC28A3; SAT; SFRSI 11 SOX2; THOC2; TRIM51 USP7; USP9X; USH1C; AF020591; ZNF13 I; ZNF160; ZNF264; 217414_x_at-probeset;; 217232_x_at-probeset;; ATF3; ASXL2; ARF4L; APG5L; ATP6V0B; BAGI; BTG2; COMT; CTSZ; CGI-128; C14orf87; CLDN3; CYR61; CKAP1; DAF; DAF; DSIPI; DKFZP564G2022; DNAJB9; DDOST; DUSP1; DUSP6; DKC1; EGRI; EIF4EL3; EXT2; GMPPB; GSN; GUKI; HSPA8; Homo sapiens PRO2275 mRNA, complete eds; Homo sapiens transcribed sequence with strong similarity to protein ref:NP_006442.2, polyadenylate binding protein-interacting protein 1; HAXI; DKFZP434K046; IMAGE3455200; HYOUI; IDN 3; JUNB; KRT8; KIAA0IO0; KIAA0102; APH-IA; LSM4; MAGED2; MRPS7; MOCS2; MNDA; NDUFA8; NNT; NFIL3; PWPI; NR4A2; NUDT4; ORMDL2; PDAP2; PPIH; PBX3; P4HA2; PPP1R15A; PRGII P2RX4; SUi1; SUi1; SUi1; RAB5C; ARHB; RNASE4; RNH; RNPC4; SEC23B; SERPINAI; SH3GLB1; SLC35B1.; SOX9; SOX9; STCH; SDHC; TINF2; TCF8; E2-EPF; FOS; JUN; ZFP36; ZNF500; and ZDHHC4.
  • Accordingly, the invention provides methods of diagnosing lung cancer in an individual comprising trucing a biological sample from the airways of the individual and analyzing the expression of at least 10 genes, preferably at least 50 genes, still more preferably at least 100 genes, still more preferably at least 150 genes, still more preferably at least 200 genes selected from genes of the diagnostic airway transcriptome, wherein deviation in the expression of at least one, preferably at least 5, 10, 20, 50, 100, 150, 200 genes as compared to a control group is indicative of lung cancer in the individual.
  • Deviation is preferably decrease of the transcription of at least one gene selected from the group consisting of of 208238_x_at -probeset; 216384_x_at-probeset; 217679_x_at-probeset; 216859_x_at-probeset; 211200_s_at-probeset; PDPK1; ADAM28; ACACB; ASMTL; ACVR2B; ADATI; ALMS1; ANK3; ANK3; DARS; AFURS1; ATP8B1; ABCCI; BTF3; BRD4; CELSR2; CALM31CAPZB; CAPZB1CFLAR; CTSS; CD24; CBX3; C21orf106; C6orf111; C6orf62; CHC1; DCLREIC; EML2; EMSI; EPHB6; EEF2; FGFR3; FLJ20288; FVT1; GGTLA4; GRP; GLUL; HDGF; Homo sapiens cDNA FLJ11452 fis, clone HEMBA1001435; Homo sapiens cDNA FLJ12005 fis, clone HEMBB1001565; Homo sapiens cDNA FLJ13721 fis, clone PLACE2000450; Homo sapiens cDNA FLJ14090 fis, clone MAMMA1000264; Homo sapiens cDNA FLJ14253 fis, clone OVARC1001376; Homo sapiens fetal thymus prothymosin alpha mRNA, complete eds; Homo sapiens transcribed sequence with strong similarity to protein ref:NP_004726.1 (H. sapiens) leucine rich repeat (in FL1I) interacting protein 1; Homo sapiens transcribed sequence with weak similarity to protein ref:NP_060312.1 (H. sapiens) hypothetical protein FLJ20489; Homo sapiens transcribed sequence with weak similarity to protein ref:NP_060312.1 (H. sapiens) hypothetical protein FLJ20489; 222282_at-probeset corresponding to Homo sapiens transcribed sequences; 215032_at-probeset corresponding to Homo sapiens transcribed sequences; 81811_at-probeset corresponding to Homo sapiens transcribed sequences; DKFZp547K11 13; ET; FLJ10534; FLJ10743; FLJ13171; FLJ14639; FLJ14675; FLJ20195; FLJ20686; FLJ20700; CGOOS; CGOOS; MGC5384; IMP-2; INADL; INHBC; KIAA0379; KIAA0676; KIAA0779; KIAAI 193; KTN1; KLFS; LRRFIP1; MKRN4; MANIC1; MVK; MUC20; MPZLI; MYO1A; MRLC2; NFATC3; ODAG; PARVA; PASK; PIK3C2B; PGF; PKP4; PRKX; PRKY; PTPRF; PTMA; PTMA; PHTF2; RAB14; ARHGEF6; RIPX; REC8L1; RIOK3; SEMA3F; SRRM21 MGC709071 SMT3H2; SLC28A3; SAT; SFRS1 11 SOX2; THOC2; TRIM51 USP7; USP9X; USHIC; AF020591; ZNFI31; ZNF160; and ZNF264 genes.
  • Deviation is preferably increase of the expression of at least one gene selected from the group consisting of 217414_x_at-probeset; 217232_x_at-probeset; ATF3; ASXL2; ARF4L; APGSL; ATP6VOB; BAGI; BTG2; COMT; CTSZ; CGI-128; C14orf87; CLDN3; CYR61; CKAP1; DAF; DAF; DSIPI, DKFZP564G2022; DNAJB9; DDOST; DUSP1; DUSP6; DKC1; EGRI; EIF4EL3; EXT2; GMPPB; GSN; GUKI; HSPA8; Homo sapiens PR02275 mRNA, complete eds; Homo sapiens transcribed sequence with strong similarity to protein ref:NP_006442.2, polyadenylate binding protein-interacting protein 1; HAX1; DKFZP434K046; IMAGE3455200; HYOUI; IDN3; JUNB; KRT8; KIAAO1OO; KIAA0102; APH-IA; LSM4; MAGED2; MRPS7; MOCS2; MNDA; NDUFA8; NNT; NFIL3; PWPI; NR4A2; NUDT4; ORMDL2; PDAP2; PPIH; PBX3; P4HA2; PPPIRI5A; PRGII P2RX4; SUi1; SUi1; SUi1; RABSC; ARHB; RNASE4; RNH; RNPC4; SEC23B; SERPINAI; SH3GLB1; SLC35B1; SOX9; SOX9; STCH; SDRC; TINF2; TCF8; E2-EPF; FOS; JUN; ZFP36; ZNFS00; and ZDHHC4 genes.
  • The genes are referred to using their HUGO names or alternatively the probeset number on Affymetrix (Affymetrix, Inc. (U.S.), Santa Clara, Calif.) probesets.
  • In one embodiment, the invention provides methods of prognosis and diagnosis of lung diseases comprising obtaining a biological sample from a subject's airways, analyzing the level of expression of at least one gene of the airway transcriptome, comparing the level of expression of the at least one gene of at least one of the airway transcriptome to the level of expression in a control, wherein deviation in the level of expression in the sample from the control is indicative of increased risk of lung disease.
  • Preferably the analysis is performed using expression of at least two genes of the airway transcriptome, more preferably at least three genes, still more preferably at least four to 10 genes, still more preferably at least 10-20 genes, still more preferably at least 20-30, still more preferably at least 30-40, still more preferably at least 40-50, still more preferably at least 50-60, still more preferably at least 60-70, still more preferably at least 70-85 genes is analyzed.
  • In one preferred embodiment, the expression level of the genes of one or more of the sub-transcriptomes is analyzed. Preferably, gene expression of one or more genes belonging to at least two different sub-transcriptome sets is analyzed. Still more preferably, gene expression of at least one gene from at least three sub-transcriptome sets is analyzed. Still more preferably, gene expression of at least one gene from at least four sub-transcriptome sets is analyzed. Still more preferably, gene expression of at least one gene from at least five sub-transcriptome sets is analyzed.
  • The expression analysis according to the methods of the present invention can be performed using nucleic acids, particularly RNA, DNA or protein analysis.
  • The cell samples are preferably obtained from bronchial airways using, for example, endoscopic cytobrush in connection with a fiberoptic bronchoscopy. In one preferred embodiment, the cells are obtained from the individual's mouth buccal cells, using, for example, a scraping of the buccal mucosa. In one preferred embodiment, the invention provides a prognostic and/or diagnostic immunohistochemical approach, such as a dip-stick analysis, to determine risk of developing lung disease. Antibodies against at least one, preferably more proteins encoded by the genes of the airway transcriptome are either commercially available or can be produced using methods well know to one skilled in the art.
  • The invention further provides an airway transcriptone expression pattern of genes that correlate with time since cigarette discontinuance in former smokers, i.e., the expression of these genes in a healthy smoker returns to normal, or healthy non-smoker levels, after about two years from quitting smoking. These genes include: MAGF, GCLC, UTGIAI0, SLIT2, PECI, SLIT1, and TNFSF13. If the transcription of these genes has not returned to the level of a healthy non-smoker, as measured using the methods of the present invention, within a time period of about 1-5 years, preferably about 1.5-2.5 years, the individual with a remaining abnormal expression is at increased risk of developing a lung disease.
  • The invention further provides an airway transcriptome expression pattern of genes the expression of which remains abnormal after cessation from smoking. These genes include: CX3CL1, RNAHP, MT1X, MT1L, TU3A, HLF, CYFIP2, PLA2G10, HN1, GMDS, PLEKHB2, CEACAM6, ME1, and DPYSL3.
  • Accordingly, the invention provides methods for prognosis, diagnosis and therapy designs for lung diseases comprising obtaining an airway sample from an individual who smokes and analyzing expression of at least one, preferably at least two, more preferably at least three, still more preferably at least four, still more preferably at least five, still more preferably at least six, seven, eight, and still more preferably at least nine genes of the normal airway transcriptome, wherein an expression pattern of the gene or genes that deviates from that in a healthy age, race, and gender matched smoker, is indicative of an increased risk of developing a lung disease.
  • The invention also provides methods for prognosis, diagnosis and therapy designs for lung diseases comprising obtaining an airway sample from a non-smoker individual and analyzing expression of at least one, preferably at least two, more preferably at least three, still more preferably at least four, still more preferably at least five, still more preferably at least six, seven, eight, and still more preferably at least nine genes of the normal airway transcriptome, wherein an expression pattern of the gene or genes that deviates from that in a healthy age, race, and gender matched non-smoker, is indicative of an increased risk of developing a lung disease. Non-smoking individual whose expression pattern begins to resemble that of a smoker and at increased risk of developing a lung disease.
  • In one embodiment, the analysis is performed from a biological sample obtained from bronchial airways. In one embodiment, the analysis is performed from a biological sample obtained from buccal mucosa. In one embodiment, the analysis is performed using nucleic acids, preferably RNA, in the biological sample. In one embodiment, the analysis is performed analyzing the amount of proteins encoded by the genes of the airway transcriptome present in the sample.
  • In one embodiment the analysis is performed using DNA by analyzing the gene expression regulatory regions of the airway transcriptome genes using nucleic acid polymorphisms, such as single nucleic acid polymorphisms or SNPs, wherein polymorphisms known to be associated with increased or decreased expression are used to indicate increased or decreased gene expression in the individual.
  • In one embodiment, the present invention provides a method for determining whether a subject has or is at risk of developing a lung disorder, comprising (a) obtaining a biological sample comprising epithelial cells from a part of an airway of the subject separate from a lung of the subject; (b) assaying nucleic acid molecules derived from the biological sample to identify a level of gene expression in the biological sample; (c) processing the level of gene expression against a control to determine a deviation in the level of expression; and (d) based on the deviation in (c), determining that the subject has the lung cancer or is at risk of developing the lung disorder.
  • In one embodiment, the present invention provides a minimally invasive sample procurement method for obtaining airway epithelial cell RNA that can be analyzed by expression profiling, for example, by array-based gene expression profiling. These methods can be used to determine if airway epithelial cell gene expression profiles are affected by cigarette smoke and if these profiles differ in smokers with and without lung cancer. These methods can also be used to identify patterns of gene expression that are diagnostic of lung disorders/diseases, for example, cancer or emphysema, and to identify subjects at risk for developing lung disorders. All or a subset of the genes identified according to the methods described herein can be used to design an array, for example, a microarray, specifically intended for the diagnosis or prediction of lung disorders or susceptibility to lung disorders. The efficacy of such custom-designed arrays can be further tested, for example, in a large clinical trial of smokers.
  • In one embodiment, the invention relates to a method of diagnosing a disease or disorder of the lung comprising obtaining a sample, nucleic acid or protein sample, from an individual to be diagnosed; and determining the expression of one or more of the 85 identified genes in said sample, wherein changed expression of such gene compared to the expression pattern of the same gene in a healthy individual with similar life style and environment is indicative of the individual having a disease of the lung.
  • In one embodiment, the invention relates to a method of diagnosing a disease or disorder of the lung comprising obtaining at least two samples, nucleic acid or protein samples, in at least one time interval from an individual to be diagnosed; and determining the expression of one or more of the 85 identified genes in said samples, wherein changed expression of such gene or genes in the sample taken later in time compared to the sample taken earlier in time is diagnostic of a lung disease.
  • In one embodiment, the disease of the lung is selected from the group consisting of asthma, chronic bronchitis, emphysema, primary pulmonary hypertension, acute respiratory distress syndrome, hypersensitivity pneumonitis, eosinophilic pneumonia, persistent fungal infection, pulmonary fibrosis, systemic sclerosis, ideopathic pulmonary hemosiderosis, pulmonary alveolar proteinosis, and lung cancer, such as adenocarcinoma, squamous cell carcinoma, small cell carcinoma, large cell carcinoma, and benign neoplasms of the lung (e.g., bronchial adenomas and hamartomas). In a particular embodiment, the nucleic acid sample is RNA. In a preferred embodiment, the nucleic acid sample is obtained from an airway epithelial cell. In one embodiment, the airway epithelial cell is obtained from a bronchoscopy or buccal mucosal scraping. In one embodiment, individual to be diagnosed is an individual who has been exposed to tobacco smoke, an individual who has smoked, or an individual who smokes.
  • In a preferred embodiment of the method, the genes are selected from the group consisting of the genes shown in FIGS. 20A-20F; 21A-22B; and FIG. 24. Preferably the expression of two or more, five or more, ten or more, fifteen or more, twenty or more, fifty or more or one hundred or more informative genes is determined. In a preferred embodiment, the expression is determined using a microarry having one or more oligonucleotides (probes) for said one or more genes immobilized thereon.
  • The invention further relates to a method of obtaining a nucleic acid sample for use in expression analysis for a disease of the lung comprising obtaining an airway epithelial cell sample from an individual; and rendering the nucleic acid molecules in said cell sample available for hybridization. The invention also relates to a method of treating a disease of the lung comprising administering to an individual in need thereof an effective amount of an agent which increases the expression of a gene whose expression is decreased in said individual as compared with a normal individual.
  • The invention further relates to a method of treating a disease of the lung comprising administering to an individual in need thereof an effective amount of an agent, which changes the expression of a gene to that expression level seen in a healthy individual having the similar life style and environment, and a pharmaceutically acceptable carrier.
  • The invention also relates to a method of treating a disease of the lung comprising administering to an individual in need thereof an effective amount of an agent which increases the activity of an expression product of such gene whose activity is decreased in said individual as compared with a normal individual.
  • The invention also relates to a method of treating a disease of the lung comprising administering to an individual in need thereof an effective amount of an agent which decreases the activity of an expression product of such gene whose activity is increased in said individual as compared with a normal individual.
  • The invention also provides an array, for example, a microarray for diagnosis of a disease of the lung having immobilized thereon a plurality of oligonucleotides which hybridize specifically to one or more genes which are differentially expressed in airways exposed to air pollutants, such as cigarette smoke, and airways which are not exposed to such pollutants. In one embodiment, the oligonucleotides hybridize specifically to one allelic form of one or more genes which are differentially expressed for a disease of the lung. In a particular embodiment, the differentially expressed genes are selected from the group consisting of the genes shown in FIGS. 20A-20F, 21A-21B and FIG. 24.
  • In some aspects, prognostic and diagnostic methods of the present invention are based on the finding that deviation from the normal expression pattern in the airway transcriptome is indicative of abnormal response of the airway cells and thus predisposes the subject to diseases of the lung. Therefore, all the comparisons as provided in the methods are performed against a normal airway transcriptome of a “normal” or “healthy” individual exposed to the pollutant, as provided by this invention. Examples of these normal expression patterns of the genes belonging to the airway transcriptome of the present invention are provided in FIG. 24.
  • In one embodiment, the invention provides a prognostic method for lung diseases comprising detecting gene expression changes in the cell adhesion regulating genes of the airway transcriptome, wherein decrease in the expression compared with a “normal” smoker expression pattern is indicative of an increased risk of developing a lung disease. Examples of cell adhesion regulation related genes include carcinoembryonic antigen-related adhesion molecule 6 and claudin 10 encoding genes. For example, an about at least 2-20 fold, preferably about at least 3 fold, still more preferably at least about 4 fold, still more preferably about at least 5 fold decrease in expression of carcinoembryonic antigen-related adhesion molecule 6 encoding gene is indicative of an increased risk of developing a lung disease. Also, for example, an about 2-20, preferably at least about, 3 fold, still more preferably at least about 4 fold, still more preferably at least about 5 fold decrease in the transcript level of claudin 10 encoding gene is indicative of an increased risk of developing a lung disease.
  • In one embodiment, the invention provides a prognostic method for lung diseases comprising detecting gene expression changes in the detoxification related genes of the airway transcriptome, wherein decrease in the expression compared with a “normal” smoker expression pattern is indicative of an increased risk of developing a lung disease. Examples of these genes include cytochrome P450 subfamily I (dioxin-inducible) encoding genes, NADPH dehydrogenase encoding genes. For example, upregulation of transcripts of cytochrome P450 subfamily I (dioxin-inducible) encoding genes of about 2-50 fold, preferably at least about, 5 fold, still more preferably about 10 fold, still more preferably at least about 15 fold, still more preferably at least about 20 fold, still more preferably at least about 30 fold, and downregulation of transcription of NADPH dehydrogenase encoding genes of about 2-20, preferably about at least 3 fold, still more preferably at least about 4 fold, still more preferably about at least 5 fold decrease compared to expression in a “normal” smoker is indicative of an increased risk of developing a lung disease.
  • In one embodiment, the invention provides a prognostic method for lung diseases comprising detecting gene expression changes in the immune system regulation associated genes of the airway transcriptome, wherein increase in the expression compared with a “normal” smoker expression pattern is indicative of an increased risk of developing a lung disease. Examples of immunoregulatory genes include small inducible cytokine subfamily D encoding genes. For example, about 1-10 fold difference in the expression of cytokine subfamily D encoding genes is indicative of increased risk of developing lung disease. Preferably, the difference in expression is least about 2 fold preferably about at least 3 fold, still more preferably at least about 4 fold, still more preferably about at least 5 fold decrease decrease in the expression of small inducible cytokine subfamily D encoding genes is indicative of an increased risk of developing a lung disease.
  • In one embodiment, the invention provides a prognostic method for lung diseases comprising detecting gene expression changes in the metalothionein regulation associated genes of the airway transcriptome, wherein decrease in the expression compared with a “normal” smoker is indicative of an increased risk of developing a lung disease. Examples of metalothionein regulation associated genes include MTX G, X, and L encoding genes. At least about 1.5-10 fold difference in the expression of these genes in indicative of increased risk of developing lung disease. For example, at least about 1.5 fold., still more preferably at least about 2 fold, still more preferably at least about 2.5 fold, still more preferably at least about 3 fold, still more preferably at least about 4 fold, still more preferably about at least. 5 fold increase in the expression of metalothionein regulation associated genes include MTX G, X, and L encoding genes indicative of an increased risk of developing a lung disease.
  • BRIEF DESCRIPTION OF FIGURES
  • FIG. 1 shows Table 1, which sets forth a listing a group of 96 genes, their expression profile in lung cancer as compared to an individual not having lung cancer but being exposed to similar environmental stress, i.e. air pollutant, in this example, cigarette smoke. These genes were identified using Student's t-test.
  • FIG. 2 shows Table 2, listing a group of 84 genes, their expression profile in lung cancer as compared to an individual not having lung cancer but being exposed to similar environmental stress, i.e. air pollutant, in this example, cigarette smoke. These genes were identified using Student's t-test.
  • FIG. 3 shows Table 3, listing a group of 50 genes, and their expression profile in lung cancer as compared using a class-prediction model to an individual not having lung cancer but being exposed to similar environmental stress, i.e. air pollutant, in this example, cigarette smoke.
  • FIG. 4 shows Table 4, listing a group of 36 genes, their expression profile in lung cancer as compared to an individual not having lung cancer but being exposed to similar environmental stress, i.e. air pollutant, in this example, cigarette smoke. This group of genes is a combination of predictive genes identified using both Student's t-test and class-prediction model.
  • FIG. 5 shows an example of the results using class prediction model as obtained in Example 1. Training set included 74 samples, and the test set 24 samples. The mean age for the training set was 55 years, and the mean pack years smoked by the training set was 38. The mean age for the test set was 56 years, and the mean pack years smoked by the test set was 41.
  • FIG. 6 shows an example of the 50 gene class prediction model obtained in Example 1. Each square represents expression of one transcript. The transcript can be identified by the probe identifier on the y-axis according to the Affymetrix Human Genome Gene chip U133 probe numbers (see Appendix). The individual samples are identified on the x-axis. The samples are shown in this figure as individuals with lung cancer (“cancer”) and individuals without lung cancer (“no cancer”). The gene expression is shown as higher in darker squares and lower in lighter squares. One can clearly see the differences between the gene expression of these 50 genes in these two groups just by visually observing the pattern of lighter and darker squares.
  • FIG. 7 shows a comparison of sample-quality metrics. The graph plots the Affymetrix MAS 5.0 percent present (y-axis) versus the z-score derived filter (x-axis). The two metrics have a correlation (R2) of 0.82.
  • FIG. 8 shows distribution of accuracies for real vs. random 1000 runs. Histogram comparing test set class prediction accuracies of 1000 “sample randomized” classifiers generated by randomly assigning samples into training and test sets with true class labels (unshaded) versus 1000 “sample and class randomized” classifiers where the training set class labels were randomized following sample assignment to the training or test set (shaded).
  • FIG. 9 shows classification accuracy as a function of the average prediction strength over the 1000 runs of the algorithm with different training/test sets.
  • FIG. 10A shows the number of times each of the 80-predictive probe sets from the actual biomarker was present in the predictive lists of 80 probe sets derived from 1000 runs of the algorithm.
  • FIG. 10B shows the Number of times a probe set was present in the predictive lists of 80 probe sets derived from 1000 random runs of the algorithm described in Supplemental Table 7.
  • FIG. 11 shows Boxplot of the Prediction Strength values of the test set sample predictions made by the Weighted Voting algorithm across the 1000 runs with different training and test sets. The black boxplots (first two boxes from the left) are derived from the actual training and test set data with correct sample labels, the grey boxplots (last two boxes on the right) are derived from the test set predictions based on training sets with randomized sample labels.
  • FIG. 12 shows homogeneity of gene expression in large airway samples from smokers with lung cancer of varying cell types. Principal Component Analysis (PCA) was performed on the gene-expression measurements for the 80 genes in our predictor and all of the airway epithelium samples from patients with lung cancer. Gene expression measurements were Z(0,1) normalized prior to PCA. The graph shows the sample loadings for the first two principal components which together account for 58% of the variation among samples from smokers with cancer. There is no apparent separation of the samples with regard to lung tumor subtype.
  • FIG. 13 shows real time RT-PCR and microarray data for selected genes distinguishing smokers with and without cancer. Fold change for each gene is shown as the ratio of average expression level of cancer group (n=3) to the average expression of non-cancer group (n=3). Four genes (IL8, FOS, TPD52, and RAB1A) were found to be up-regulated in cancer group on both microarray and RT-PCR platforms; three genes (DCLRE1C, BACH2, and DUOX1) were found to be down-regulated in cancer group on both platforms.
  • FIG. 14 shows the class prediction methodology used. 129 samples (69 from patients without cancer; 60 from patients with lung cancer) were separated into a training (n=77) and a test set (n=52). The most frequently chosen 40 up- and 40 down-regulated genes from internal cross validation on the training set were selected for the final gene committee. The weighted voted algorithm using this committee of 80 genes was then used to predict the class of the test set samples.
  • FIG. 15 shows hierarchical clustering of class-predictor genes. Z-score-normalized gene-expression measurements of the eighty class-predictor genes in the 52 test-set samples are shown in a false-color scale and organized from top to bottom by hierarchical clustering. The Affymetrix U133A probeset ID and HUGO symbol are given to the right of each gene. The test-set samples are organized from left to right first by whether the patient had a clinical diagnosis of cancer. Within these two groups, the samples are organized by the accuracy of the class-predictor diagnosis (samples classified incorrectly are on the right shown in dark green). 43/52 (83%) test samples are classified correctly. The sample ID is given at the top of each column The prediction strength of each of the diagnoses made by the class-prediction algorithm is indicated in a false-color scale immediately below the prediction accuracy.
  • Prediction strength is a measure of the level of diagnostic confidence and varies on a continuous scale from 0 to 1 where 1 indicates a high degree of confidence.
  • FIG. 16 shows a Comparison of Receiver Operating Characteristic (ROC) curves. Sensitivity (y-axis) and 1-Specificity (x-axis) were calculated at various prediction strength thresholds where a prediction of no cancer was assigned a negative prediction strength value and a prediction of cancer was assigned a positive prediction strength value. The solid black line represents the ROC curve for the airway gene expression classifier. The dotted black line represents the average ROC curve for 1000 classifiers derived by randomizing the training set class labels (“class randomized”). The upper and lower lines of the gray shaded region represent the average ROC curves for the top and bottom half of random biomarkers (based on area under the curve). There is a significant difference between the area under the curve of the actual classifier and the random classifiers (p=0.004; empiric p-value based on permutation)
  • FIG. 17 shows the Principal Component Analysis (PCA) of biomarker gene expression in lung tissue samples. The 80 biomarker probesets were mapped to 64 probesets in the Bhattacharjee et al. HGU95Av2 microarray dataset of lung cancer and normal lung tissue. The PCA is a representation of the overall variation in expression of the 64 biomarker probesets. The normal lung samples (NL) are represented in green, the adenocarcinomas (AD) in red, the small cells (SC) in blue, and the squamous (SQ) lung cancer samples in yellow. The normal lung samples separate from the lung cancer samples along the first principal component (empirically derived p-value=0.023, see supplemental methods).
  • FIGS. 18A-18C show data obtained in this study. FIG. 18A shows bronchoscopy results for the 129 patients in the study. Only 32 of the 60 patients that had a final diagnosis of cancer had bronchoscopies that were diagnostic of lung cancer. The remaining 97 samples had bronchoscopies that were negative for lung cancer including 5 that had a definitive alternate benign diagnosis. This resulted in 92 patients with non-diagnostic bronchoscopy that required further tests and/or clinical follow-up. FIG. 18B shows biomarker prediction results. 36 of the 92 patients with non-diagnostic bronchoscopies exhibited a gene expression profile that was positive for lung cancer. This resulted in 25 of 28 cancer patients with non-diagnostic bronchoscopies being predicted to have cancer. FIG. 18C shows combined test results. In a combined test where a positive test result from either bronchoscopy or gene expression is considered indicative of lung cancer a sensitivity of 95% (57 of 60 cancer patients) with only a 16% false positive rate (11 of 69 non-cancer patients) is achieved. The shading of each contingency table is reflective of the overall fraction of each sample type in each quadrant.
  • FIGS. 19A-19B show a comparison of bronchoscopy and biomarker prediction by A) cancer stage or B) cancer subtype. Each square symbolizes one patient sample. The upper half represents the biomarker prediction accuracy and the lower half represents the bronchoscopy accuracy. Not all cancer samples are represented in this figure. FIG. 19A includes only Non Small Cell cancer samples that could be staged using the TMN system (48 of the 60 total cancer samples). FIG. 19B includes samples that could be histologically classified as Adenocarcinoma, Squamous Cell Carcinoma and Small Cell Carcinoma (45 of the 60 total cancer samples).
  • FIGS. 20A-20F show a list of genes which are differentially expressed in smokers and non-smokers. T-test statistical results are shown.
  • FIGS. 21A-21G show a list of genes which are differentially expressed insmokers and smokers with lung cancer. T-test statistical results are shown.
  • FIG. 22 is a schematic diagram showing an example of loss of heterozygosity analysis.
  • FIG. 23 is a graph showing fractional allelic loss in smokers and non-smokers.
  • FIG. 24A and FIG. 24B shows clustering of current and never smoker samples. Hierarchical clustering of current (n-34) and never (n=23) smokers according to the expression of the 97 probesets representing the 85 genes differentially expressed between current and never smokers. While current and never smokers separate into 2 groups, three current smokers appear to cluster with never smokers (rectangle). Expression of a number of redox-related and xenobiotic genes in these subjects was not increased (brackets) and therefore resembled that of never smokers despite substantial smoke exposure. There was also a subset of current smokers (circled individuals on x-axis) who did not upregulate expression of a number of predominantly redox/xenobiotic genes (circled expression analysis in the middle of the graph) to the same degree as other smokers. In addition, there is a never smoker, 167N (box), who is an outlier among never smokers and expresses a subset of genes at the level of current smokers. HUGO gene ID listed for all 85 genes. Functional classification of select genes is shown. Darker gray=high level of expression, lighter grey=low level of expression, black=mean level of expression.
  • FIGS. 25A-25B show a multidimensional scaling plot of current, never, and former smoker samples. Multidimensional scaling plot of current (lighter grey boxes), never (medium grey boxes, mainly clustered on the left hand side of the graph) and former smokers (darkest grey boxes) in 97 dimensional space according to the expression of the 97 probesets reflecting the 85 differentially expressed genes between current and never smokers. FIG. 25A illustrates that current and never smokers separate into their 2 classes according to the expression of these genes. FIG. 25B shows that when former smokers are plotted according to the expression of these genes, a majority of former smokers appear to group more closely to never smokers. There are, however, a number of former smokers who group more closely to current smokers (black circle). The only clinical variable that differed between the 2 groups of former smokers was length of smoking cessation (p<0.05), with formers smokers who quit within 2 years clustering with current smokers. The MDS plots are reduced dimension representations of the data and the axes on the figure have no units.
  • FIG. 26 shows genes expression of which is irreversibly altered by cigarette smoke. Hierarchical clustering plot of 15 of the 97 probesets representing the 85 genes from FIG. 24 that remain differentially expressed between former vs. never smokers (p<0.0001) as long as 30 years after cessation of smoking. Samples are grouped according to smoking status and length of smoking cessation (samples are not being clustered and thus there is no dendogram on the sample axis). Patient ID, status (C, For N) and length of time since smoking cessation are shown for each sample. Current=current smokers, former=former smokers and never=never smokers. HUGO gene ID listed for all 15 genes. Two genes (HLF and MT1X) appear twice in the analysis (i.e. two different probe sets corresponding to the same gene). Darker grey shades indicate higher level of expression, lighter colors indicate low level of expression, black=mean level of expression.
  • FIGS. 27A-27C show Scatterplots of spatial (FIG. 27A) and temporal (FIG. 27B) replicate samples (2 fold, 10 fold and 30 fold lines of change shown; axes are log scaled). Histogram of fold changes computed between all replicates and between unrelated samples (FIG. 27C).
  • FIG. 28 shows a dendogram of samples obtained from hierarchal clustering of the top 1000 most variable genes across all samples. Hierarchical clustering of all samples (n=75 subjects) across the 1000 most variable genes. Current (C), former (F) and never (N) smokers do not cluster into their 3 classes.
  • FIG. 29 shows variability in gene expression in the normal airway transcriptome. This histogram shows the number of genes in the normal airway transcriptome (˜7100 genes whose median detection p value<0.05) according to their coefficient of variation (standard deviation/mean*100) across the 23 healthy never smokers. Approximately 90% of the genes have a coefficient of variation below 50%
  • FIG. 30 shows hierarchical clustering of all 18 former smokers according to the expression of the top 97 probesets that were differentially expressed between current and never smokers. The only clinical variable that statistically differed (p<0.05) between the 2 molecular subclasses of former smokers was length of smoking cessation. Patient ID (denoted with “F”) and time since patient quit smoking (in years) are shown.
  • FIGS. 31A-31E show real time QRT-PCR and microarray data for select genes that were found to be differentially expressed between current and never smokers on microarray analysis. Fold change is relative to one of the never smokers. For NQO1 (NAD(P)H dehydrogenase, quinone 1, FIG. 31A), ALDH3A1 (aldehyde dehydrogenase 3 family, member A1, FIG. 31B), CYPIB1 (cytochrome P450, subfamily I (dioxin-inducible), polypeptide 1, FIG. 31C) and CEACAMS (carcinoembryonic antigen-related cell adhesion molecule 5, FIG. 3 ID), gene expression was measured on 3 never smokers (N) and 3 current smokers(S). For SLIT1 (slit homolog 1, FIG. 31E), a gene reversibly downregulated by cigarette smoke, gene expression was measured on a never smoker, 2 former smokers who quit smoking more than two years ago, 1 former smoker who quit smoking within the last two years and a current smoker. Pearson correlations between real-time PCR and microarray data for each gene are shown.
  • FIG. 32 shows a table of genes present in bronchial epithelial cells that should be expressed in bronchial epithelial cells.
  • FIG. 33 shows genes absent in bronchial epithelial cells that should not be expressed in bronchial airway epithelial cells.
  • FIG. 34 shows demographic features of all 75 patients whose microarrays were included in our study. Three clinical groups were evaluated: never smokers, former smokers and current smokers. For continuous variables, the mean (and the standard deviation) is shown. For gender, M=number of males, F=number of females. For race, W=Caucasian, B=African American, 0=other. Pack years of smoking calculated as number of packs of cigarettes per day multiplied by number of years of smoking. ANOVA, t-tests, and Chi-squared tests were used to evaluate differences between groups for continuous variables; chi-square tests were used to evaluate categorical variables. *=one value missing, ** indicates that the data was not normally distributed and therefore, the t-test p-value was computed using logged values.
  • FIG. 35 shows analysis of replicates. Pearson correlation coefficients were computed between replicate samples, between samples from the same group (never or current smoker), and between samples from two different groups (never versus current smoker). The mean R squared values from the analyses are reported.
  • FIG. 36A-36C show multiple linear regression results performed on the top 10 percent most variable genes (calculated using the coefficient of variation) in the normal airway transcriptome. A general linear model was used to explore the relationship between gene expression and age, race, gender, and the three possible two-way interaction terms. Seventy models having a p value of 0.01 are shown along with the p values for the significant regressors (p<=0.01).
  • FIGS. 37A-37B show genes correlated with pack-years among current smokers (p<0.0001). Pearson correlation for gene expression and pack-years smoking. R-values and p-values for 51 genes that were tightly correlated with pack-years among current smokers are reported. The 5 genes shown in bold are the genes whose expression is most significantly correlated to pack-years as assessed by a permutation analysis.
  • FIG. 38A-38B show summary of analysis of genes irreversibly altered by cigarette smoke. At-test was performed between former and never smoker across all 9968 genes, and 44 genes were found to have a p value threshold below 0.00098. These 44 genes are listed in the table according to their p value on t-test between current and never smokers, as the intersection of these 2 t-tests (former vs. never and current vs. never) correspond to irreversibly altered genes. Fifteen genes (shown in bold) were found to be irreversible altered by cigarette smoking given that they are in common with the list of 97 probesets significantly differentially expressed between current and never smokers. In addition to the 15 genes, 12 more genes had at-test p value between current and never smokers of less than 0.001, and only 7 of the 44 genes had p values between current and never smokers of greater than 0.05.
  • FIGS. 39A-39B show ANCOVA and 2 way ANOVA. An ANCOVA was performed to test the effect of smoking status (never or current) on gene expression while controlling for the effect of age (the covariate). A two-way ANOVA was performed to test the effect of smoking status (never or current) on gene expression while controlling for the fixed effects of race (encoded as three racial groups: Caucasian, African American, and other) or gender and the interaction terms of status:race or status:gender. The never versus current smoker t-test p value threshold (p value=1.06*10−5) was used to determine significant genes in the above analyses performed on the filtered set of 9968 genes. The table lists the genes found to be significantly different between never and current smokers controlling for the effects of age, race, and gender. Many of the genes listed are labeled “common” because they are also found in the set of 97 probesets found to be significantly different between never and current smokers based on at-test analysis.
  • FIG. 40 shows a multidimensional scaling plot of all smokers with and without cancer plotted in 208 dimensional space according to the expression of the 208 genes that distinguish the 2 classes on t-test.
  • FIG. 41 shows a hierarchical clustering plot of all current smokers according to the expression of 9 genes considered to be statistical outliers among at least 3 patients by Grubb's test. These 9 genes were selected from the 361 genes found to be differentially expressed between current and never smokers at p<0.001. Darker gray=high level of expression, lighter grey=low level of expression, black=mean level of expression.
  • DETAILED DESCRIPTION OF THE INVENTION
  • The present invention is directed to gene/transcript groups and methods of using the expression profile of these gene/transcript groups in diagnosis and prognosis of lung diseases.
  • We provide a method that significantly increases the diagnostic accuracy of lung diseases, such as lung cancer. When one combines the gene expression analysis of the present invention with bronchoscopy, the diagnosis of lung cancer is dramatically better by detecting the cancer in an earlier stage than any other available method to date, and by providing far fewer false negatives and/or false positives than any other available method.
  • We have found a group of gene transcripts that we can use individually and in groups or subsets for enhanced diagnosis for lung diseases, such as lung cancer, using gene expression analysis. We provide detailed guidance on the increase and/or decrease of expression of these genes for diagnosis and prognosis of lung diseases, such as lung cancer.
  • One example of the gene transcript groups useful in the diagnostic/prognostic tests of the invention is set forth in Table 6. We have found that taking any group that has at least 20 of the Table 6 genes provides a much greater diagnostic capability than chance alone.
  • Preferably one would use more than 20 of these gene transcript, for example about 20-100 and any combination between, for example, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, and so on. Our preferred groups are the groups of 96 (Table 1), 84 (Table 2), 50 (Table 3), 36 (Table 4), 80 (Table 5), 535 (Table 6) and 20 (Table 7). In some instances, we have found that one can enhance the accuracy of the diagnosis by adding additional genes to any of these specific groups.
  • Naturally, following the teachings of the present invention, one may also include one or more of the genes and/or transcripts presented in Tables 1-7 into a kit or a system for a multicancer screening kit. For example, any one or more genes and or transcripts from Table 7 may be added as a lung cancer marker for a gene expression analysis.
  • When one uses these groups, the genes in the group are compared to a control or a control group. The control groups can be non-smokers, smokers, or former smokers. Preferably, one compares the gene transcripts or their expression product in the biological sample of an individual against a similar group, except that the members of the control groups do not have the lung disorder, such as emphysema or lung cancer. For example, comparing can be performed in the biological sample from a smoker against a control group of smokers who do not have lung cancer. When one compares the transcripts or expression products against the control for increased expression or decreased expression, which depends upon the particular gene and is set forth in the tables—not all the genes surveyed will show an increase or decrease. However, at least 50% of the genes surveyed must provide the described pattern. Greater reliability if obtained as the percent approaches 100%. Thus, in one embodiment, one wants at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% of the genes surveyed to show the altered pattern indicative of lung disease, such as lung cancer, as set forth in the tables as shown below.
  • The presently described gene expression profile can also be used to screen for individuals who are susceptible for lung cancer. For example, a smoker, who is over a certain age, for example over 40 years old, or a smoker who has smoked, for example, a certain number of years, may wish to be screened for lung cancer. The gene expression analysis as described herein can provide an accurate very early diagnosis for lung cancer. This is particularly useful in diagnosis of lung cancer, because the earlier the cancer is detected, the better the survival rate is.
  • For example, when we analyzed the gene expression results, we found, that if one applies a less stringent threshold, the group of 80 genes as presented in Table 5 are part of the most frequently chosen genes across 1000 statistical test runs (see Examples below for more details regarding the statistical testing). Using random data, we have shown that no random gene shows up more than 67 times out of 1000. Using such a cutoff, the 535 genes of Table 6 in our data show up more than 67 times out of 1000. All the 80 genes in Table 5 form a subset of the 535 genes. Table 7 shows the top 20 genes which are subset of the 535 list. The direction of change in expression is shown using signal to noise ratio. A negative number in Tables 5, 6, and 7 means that expression of this gene or transcript is up in lung cancer samples. Positive number in Table 5, 6, and 7, indicates that the expression of this gene or transcript is down in lung cancer.
  • Accordingly, any combination of the genes and/or transcripts of Table 6 can be used. In one embodiment, any combination of at least 5-10, 10-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80, 80-90, 90-100, 100-120, 120-140, 140-150, 150-160, 160-170, 170-180, 180-190, 190-200, 200-210, 210-220, 220-230, 230-240, 240-250, 250-260, 260-270, 270-280, 280-290, 290-300, 300-310, 310-320, 320-330, 330-340, 340-350, 350-360, 360-370, 370-380, 380-390, 390-400, 400-410, 410-420, 420-430, 430-440, 440-450, 450-460, 460-470, 470-480, 480-490, 490-500, 500-510, 510-520, 520-530, and up to about 535 genes selected from the group consisting of genes or transcripts as shown in the Table 6.
  • Table 7 provides 20 of the most frequently variably expressed genes in lung cancer when compared to samples without cancer. Accordingly, in one embodiment, any combination of about 3-5, 5-10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or all 20 genes and/or transcripts of Table 7, or any sub-combination thereof are used.
  • In one embodiment, the invention provides a gene group the expression profile of which is useful in diagnosing lung diseases and which comprises probes that hybridize ranging from 1 to 96 and all combinations in between for example 5, 10, 15, 20, 25, 30, 35, at least about 36, at least to 40, at lest to 50, at least to 60, to at least 70, to at least 80, to at least 90, or all of the following 96 gene sequences: NM_003335; NM_000918; NM_006430.1; NM_001416.1; NM_004090; NM_006406.1; NM_003001.2; NM_001319; NM_006545.1; NM_021145.1; NM_002437.1; NM_006286; NM_001003698 /// NM_001003699 /// NM_002955; NM_001123 /// NM_006721; NM_024824; NM_004935.1; NM_002853.1; NM_019067.1; NM_024917.1; NM_020979.1; NM_005597.1; NM_007031.1; NM_009590.1; NM_020217.1; NM_025026.1; NM_014709.1; NM_014896.1; AF010144; NM_005374.1; NM_001696; NM_005494 /// NM_058246; NM_006534 /// NM_181659; NM_006368; NM_002268 /// NM_032771; NM_014033; NM_016138; NM_007048 /// NM_194441; NM_006694; NM_000051 /// NM_138292 /// NM_138293; NM_000410 /// NM_139002 /// NM_139003 /// NM_139004 /// NM_139005 /// NM_139006 /// NM_139007 /// NM_139008 /// NM_139009 /// NM_139010 /// NM_139011; NM_004691; NM_012070 /// NM_139321 /// NM_139322; NM_006095; AI632181; AW024467; NM_021814; NM_005547.1; NM_203458; NM_015547 /// NM_147161; AB007958.1; NM_207488; NM_005809 /// NM_181737 /// NM_181738; NM_016248 /// NM_144490; AK022213.1; NM_005708; NM_207102; AK023895; NM_144606 /// NM_144997; NM_018530; AK021474; U43604.1; AU147017; AF222691.1; NM_015116; NM_001005375 /// NM_001005785 /// NM_001005786 /// NM_004081 /// NM_020363 /// NM_020364 /// NM_020420; AC004692; NM_001014; NM_000585 /// NM_172174 /// NM_172175; NM_054020 /// NM_172095 /// NM_172096 /// NM_172097; BE466926; NM_018011; NM_024077; NM_012394; NM_019011 /// NM_207111 /// NM_207116; NM_017646; NM_021800; NM_016049; NM_014395; NM_014336; NM_018097; NM_019014; NM_024804; NM_018260; NM_018118; NM_014128; NM_024084; NM_005294; AF077053; NM_138387; NM_024531; NM_000693; NM_018509; NM_033128; NM_020706; AI523613; and NM_014884
  • In one embodiment, the invention provides a gene group the expression profile of which is useful in diagnosing lung diseases and comprises probes that hybridize to at least, for example, 5, 10, 15, 20, 25, 30, 35, at least about 36, at least to 40, at least to 50, at least to 60, to at least 70, to at least 80, to all of the following 84 gene sequences: NM_030757.1; R83000; AK021571.1; NM_014182.1; NM_17932.1; U85430.1; AI683552; BC002642.1; AW024467; NM_030972.1; BC021135.1; AL161952.1; AK026565.1; AK023783.1; BF218804; NM_001281.1; NM_024006.1; AK023843.1; BC001602.1; BC034707.1; BC064619.1; AY280502.1; BC059387.1; AF135421.1; BC061522.1; L76200.1; U50532.1; BC006547.2; BC008797.2; BC000807.1; AL080112.1; BC033718.1 /// BC046176.1 /// BC038443.1; NM_000346.1; BC008710.1; Hs.288575 (UNIGENE ID); AF020591.1; BC000423.2; BC002503.2; BC008710.1; BC009185.2; Hs.528304 (UNIGENE ID); U50532.1; BC013923.2; BC031091; NM_007062; Hs.249591 (Unigene ID); BC075839.1 /// BC073760.1; BC072436.1 /// BC004560.2; BC001016.2; Hs.286261 (Unigene ID); AF348514.1; BC005023.1; BC066337.1 /// BC058736.1 /// BC050555.1; Hs.216623 (Unigene ID); BC072400.1; BC041073.1; U43965.1; BC021258.2; BC016057.1; BC016713.1 /// BC014535.1 /// AF237771.1; BC000360.2; BC007455.2; BC000701.2; BC010067.2; BC023528.2 /// BC047680.1; BC064957.1; Hs.156701 (Unigene ID); BC030619.2; BC008710.1; U43965.1; BC066329.1; Hs.438867 (Unigene ID); BC035025.2 /// BC050330.1; BC023976.2; BC074852.2 /// BC074851.2; Hs.445885 (Unigene ID); BC008591.2 /// BC050440.1 ///; BC048096.1; AF365931.1; AF257099.1; and BC028912.1.
  • In one embodiment, the invention provides a gene group the expression profile of which is useful in diagnosing lung diseases and comprises probes that hybridize to at least, for example 5, 10, 15, 20, 25, 30, preferably at least about 36, still more preferably at least to 40, still more preferably at lest to 45, still more preferably all of the following 50 gene sequences, although it can include any and all members, for example, 20, 21, 22, up to and including 36: NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC005969.1; NM_002271.1; AL566172; AB014576.1; BF218804; AK022494.1; AA114843; BE467941; NM_003541.1; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AU147182; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_019023.1; NM_021971.1; NM_014128.1; AK025651.1; AA133341; and AF198444.1. In one preferred embodiment, one can use at least 20-30, 30-40, of the 50 genes that overlap with the individual predictor genes identified in the analysis using the t-test, and, for example, 5-9 of the non-overlapping genes, identified using the t-test analysis as individual predictor genes, and combinations thereof.
  • In one embodiment, the invention provides a gene group the expression profile of which is useful in diagnosing lung diseases and comprises probes that hybridize to at least for example 5, 10, 15, 20, preferably at least about 25, still more preferably at least to 30, still more preferably all of the following 36 gene sequences: NM_007062.1; NM_001281.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; NM_002268 /// NM_032771; NM_007048 /// NM_194441; NM_006694; U85430.1; NM_004691; AB014576.1; BF218804; BE467941; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_021971.1; NM_014128.1; AA133341; and AF198444.1. In one preferred embodiment, one can use at least 20 of the 36 genes that overlap with the individual predictors and, for example, 5-9 of the non-overlapping genes, and combinations thereof.
  • The expression of the gene groups in an individual sample can be analyzed using any probe specific to the nucleic acid sequences or protein product sequences encoded by the gene group members. For example, in one embodiment, a probe set useful in the methods of the present invention is selected from the nucleic acid probes of between 10-15, 15-20, 20-180, preferably between 30-180, still more preferably between 36-96, still more preferably between 36-84, still more preferably between 36-50 probes, included in the Affymetrix Inc. gene chip of the Human Genome U133 Set and identified as probe ID Nos: 208082_x_at, 214800_x_at, 215208_x_at, 218556_at, 207730_x_at, 210556_at, 217679_x_at, 202901_x_at, 213939_s_at, 208137_x_at, 214705_at, 215001_s_at, 218155_x_at, 215604_x_at, 212297_at, 201804_x_at, 217949_s_at, 215179_x_at, 211316_x_at, 217653_x_at, 266_s_at, 204718_at, 211916_s_at, 215032_at, 219920_s_at, 211996_s_at, 200075_s_at, 214753_at, 204102_s_at, 202419_at, 214715_x_at, 216859_x_at, 215529_x_at, 202936_s_at, 212130_x_at, 215204_at, 218735_s_at, 200078_s_at, 203455_s_at, 212227_x_at, 222282_at, 219678_x_at, 208268_at, 221899_at, 213721_at, 214718_at, 201608_s_at, 205684_s_at, 209008_x_at, 200825_s_at, 218160_at, 57739_at, 211921_x_at, 218074_at, 200914_x_at, 216384_x_at, 214594_x_at, 222122_s_at, 204060_s_at, 215314_at, 208238_x_at, 210705_s_at, 211184_s_at, 215418_at, 209393_s_at, 210101_x_at, 212052_s_at, 215011_at, 221932_s_at, 201239_s_at, 215553_x_at, 213351_s_at, 202021_x_at, 209442_x_at, 210131_x_at, 217713_x_at, 214707_x_at, 203272_s_at, 206279_at, 214912_at, 201729_s_at, 205917_at, 200772_x_at, 202842_s_at, 203588_s_at, 209703_x_at, 217313_at, 217588_at, 214153_at, 222155_s_at, 203704_s_at, 220934_s_at, 206929_s_at, 220459_at, 215645_at, 217336_at, 203301_s_at, 207283_at, 222168_at, 222272_x_at, 219290_x_at, 204119_s_at, 215387_x_at, 222358_x_at, 205010_at, 1316_at, 216187_x_at, 208678_at, 222310_at, 210434_x_at, 220242_x_at, 207287_at, 207953_at, 209015_s_at, 221759_at, 220856_x_at, 200654_at, 220071_x_at, 216745_x_at, 218976_at, 214833_at, 202004_x_at, 209653_at, 210858_x_at, 212041_at, 221294_at, 207020_at, 204461_x_at, 205367_at, 219203_at, 215067_x_at, 212517_at, 220215_at, 201923_at, 215609_at, 207984_s_at, 215373_x_at, 216110_x_at, 215600_x_at, 216922_x_at, 215892_at, 201530_x_at, 217371_s_at, 222231_s_at, 218265_at, 201537_s_at, 221616_s_at, 213106_at, 215336_at, 209770_at, 209061_at, 202573_at, 207064_s_at, 64371_at, 219977_at, 218617_at, 214902_x_at, 207436_x_at, 215659_at, 204216_s_at, 214763_at, 200877_at, 218425_at, 203246_s_at, 203466_at, 204247_s_at, 216012_at, 211328_x_at, 218336_at, 209746_s_at, 214722_at, 214599_at, 220113_x_at, 213212_x_at, 217671_at, 207365_x_at, 218067_s_at, 205238_at, 209432_s_at, and 213919_at. In one preferred embodiment, one can use at least, for example, 10-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80-90, 90-100, 110, 120, 130, 140, 150, 160, or 170 of the 180 genes that overlap with the individual predictors genes and, for example, 5-9 of the non-overlapping genes and combinations thereof
  • Sequences for the Affymetrix probes are provided in the Appendix to the specification, all the pages of which are herein incorporated by reference in their entirety.
  • One can analyze the expression data to identify expression patters associated with any lung disease that is caused by exposure to air pollutants, such as cigarette smoke, asbestos or any other lung disease. For example, the analysis can be performed as follows. One first scans a gene chip or mixture of beads comprising probes that are hybridized with a study group samples. For example, one can use samples of non-smokers and smokers, non-asbestos exposed individuals and asbestos-exposed individuals, non-smog exposed individuals and smog-exposed individuals, smokers without a lung disease and smokers with lung disease, to obtain the differentially expressed gene groups between individuals with no lung disease and individuals with lung disease. One must, of course select appropriate groups, wherein only one air pollutant can be selected as a variable. So, for example, one can compare non-smokers exposed to asbestos but not smog and non-smokers not exposed to asbestos or smog.
  • The obtained expression analysis, such as microarray or microbead raw data consists of signal strength and detection p-value. One normalizes or scales the data, and filters the poor quality chips/bead sets based on images of the expression data, control probes, and histograms. One also filters contaminated specimens which contain non-epithelial cells. Lastly, one filters the genes of importance using detection p-value. This results in identification of transcripts present in normal airways (normal airway transcriptome). Variability and multiple regression analysis can be used. This also results in identification of effects of smoking on airway epithelial cell transcription. For this analysis, one can use T-test and Pearson correlation analysis. One can also identify a group or a set of transcripts that are differentially expressed in samples with lung disease, such as lung cancer and samples without cancer. This analysis was performed using class prediction models.
  • For analysis of the data, one can use, for example, a weighted voting method. The weighted voting method ranks, and gives a weight “p” to all genes by the signal to noise ration of gene expression between two classes: P=mean(class 1)−mean(class 2)/sd(class 1)=sd(class 2). Committees of variable sizes of the top ranked genes are used to evaluate test samples, but genes with more significant p-values can be more heavily weighed. Each committee genes in test sample votes for one class or the other, based on how close that gene expression level is to the class 1 mean or the class 2 mean. V(gene A)=P(gene A), i.e. level of expression in test sample less the average of the mean expression values in the two classes. Votes for each class are tallied and the winning class is determined along with prediction strength as PS=Vwin−Vlose/Vwin+Vlose. Finally, the accuracy can be validated using cross-validation +/−independent samples.
  • Table 1 shows 96 genes that were identified as a group distinguishing smokers with cancer from smokers without cancer. The difference in expression is indicated at the column on the right as either “down”, which indicates that the expression of that particular transcript was lower in smokers with cancer than in smokers without cancer, and “up”, which indicates that the expression of that particular transcript was higher in smokers with cancer than smokers without cancer. In one embodiment, the exemplary probes shown in the column “Affymetrix Id in the Human Genome U133 chip” can be used. Sequences for the Affymetrix probes are provided in the Appendix.
  • TABLE 1
    96 Gene Group
    Direction in
    Affymetrix Id GenBank ID Gene Description Gene Name Cancer
    1316_at NM_003335 ubiquitin-activated UBE1L down
    enzyme E1-like
    200654_at NM_000918 procollagen-proline, 2- P4HB up
    oxoglutarate 4-
    dioxygenase (proline 4-
    hydroxylase), beta
    polypeptide (protein
    disulfide isomerase;
    thyroid hormone binding
    protein p55)
    200877_at NM_006430.1 chaperonin containing CCT4 up
    TCP1, subunit 4 (delta)
    201530_x_at NM_001416.1 eukaryotic translation EIF4A1 up
    factor 4A, isoform 1
    201537_s_at NM_004090 dual specificity DUSP3 up
    phosphatase 3 (vaccinia
    virus phosphatase VH1-
    related)
    201923_at NM_006406.1 peroxiredoxin 4 PRDX4 up
    202004_x_at NM_003001.2 succinate dehydrogenase SDHC up
    complex, subunit C,
    integral membrane
    protein, 15 kDa
    202573_at NM_001319 casein kinase 1, gamma 2 CSNK1G2 down
    203246_s_at NM_006545.1 tumor suppressor TUSC4 up
    candidate 4
    203301_s_at NM_021145.1 cyclin D binding myb- DMTF1 down
    like transcription factor 1
    203466_at NM_002437.1 MpV17 transgene, MPV17 up
    murine homolog,
    glomerusclerosis
    203588_s_at NM_006286 transcription factor Dp-2 TFDP2 up
    (E2F dimerization
    partner 2)
    203704_s_at NM_001003698 /// ras responsive element RREB1 down
    NM_001003699 /// binding protein 1
    NM_002955
    204119_s_at NM_001123 /// adenosine kinase ADK up
    NM_006721
    204216_s_at NM_024824 nuclear protein UKp68 FLJ11806 up
    204247_s_at NM_004935.1 cyclin-dependent kinase 5 CDK5 up
    204461_x_at NM_002853.1 RAD1 homolog RAD1 down
    205010_at NM_019067.1 hypothetical protein FLJ10613 down
    FLJ10613
    205238_at NM_024917.1 chromosome X open CXorf34 down
    reading frame 34
    205367_at NM_020979.1 adaptor protein with APS down
    pleckstrin homology and
    src homology 2 domains
    206929_s_at NM_005597.1 nuclear factor I/c NFIC down
    (CCAAT-binding
    transcription factor)
    207020_at NM_007031.1 heat shock transcription HSF2BP down
    factor 2 binding protein
    207064_s_at NM_009590.1 amine oxidase, copper AOC2 down
    containing 2 (retina-
    specific)
    207283_at NM_020217.1 hypothetical protein DKFZp547I014 down
    DKFZp547I014
    207287_at NM_025026.1 hypothetical protein FLJ14107 down
    FLJ14107
    207365_x_at NM_014709.1 ubiquitin specific USP34 down
    protease 34
    207436_x_at NM_014896.1 KIAA0894 protein KIAA0894 down
    207953_at AF010144 down
    207984_s_at NM_005374.1 membrane protein, MPP2 down
    palmitoylated 2
    (MAGUK p55
    subfamily member2
    208678_at NM_001696 ATPase, H+ ATP6V1E1 up
    transporting, lysosomal
    31 kDa, V1 subunit E,
    isoform 1
    209015_s_at NM_005494 /// DnaJ (Hsp40) homolog, DNAJB6 up
    NM_058246 subfamily B, member 6
    209061_at NM_006534 /// nuclear receptor NCOA3 down
    NM_181659 coactivator 3
    209432_s_at NM_006368 cAMP responsive CREB3 up
    element binding protein 3
    209653_at NM_002268 /// karyopherin alpha 4 KPNA4 up
    NM_032771 (importin alpha 3)
    209703_x_at NM_014033 DKFZP586A0522 DKFZP586A0522 down
    protein
    209746_s_at NM_016138 coenzyme Q7 homolog, COQ7 down
    ubiquinone
    209770_at NM_007048 /// butyrophilin, subfamily BTN3A1 down
    NM_194441 3, member A1
    210434_x_at NM_006694 jumping translocation JTB up
    breakpoint
    210858_x_at NM_000051 /// ataxia telangiectasia ATM down
    NM_138292 /// mutated (includes
    NM_138293 complementation groups
    A, C, and D
    211328_x_at NM_000410 /// hemochromatosis HFE down
    NM_139002 ///
    NM_139003 ///
    NM_139004 ///
    NM_139005 ///
    NM_139006 ///
    NM_139007 ///
    NM_139008 ///
    NM_139009 ///
    NM_139010 ///
    NM_139011
    212041_at NM_004691 ATPase, H+ ATP6V0D1 up
    transporting, lysosomal
    38 kDa, V0 subunit d
    isoform 1
    212517_at NM_012070 /// attractin ATRN down
    NM_139321 ///
    NM_139322
    213106_at NM_006095 ATPase, ATP8A1 down
    aminophospholipid
    transporter (APLT),
    Class I, type 8A,
    member 1
    213212_x_at AI632181 Similar to FLJ40113 down
    protein
    213919_at AW024467 down
    214153_at NM_021814 ELOVL family member ELOVL5 down
    5, elongation of long
    chain fatty acids
    (FEN1/Elo2,
    SUR4/Elo3-like, yeast)
    214599_at NM_005547.1 involucrin IVL down
    214722_at NM_203458 similar to NOTCH2 N2N down
    protein
    214763_at NM_015547 /// thiosterase, adipose THEA down
    NM_147161 associated
    214833_at AB007958.1 KIAA0792 gene product KIAA0792 down
    214902_x_at NM_207488 FLJ42393 protein FLJ42393 down
    215067_x_at NM_005809 /// peroxiredoxin 2 PRDX2 down
    NM_181737 ///
    NM_181738
    215336_at NM_016248 /// A kinase (PRKA) AKAP11 down
    NM_144490 anchor protein
    215373_x_at AK022213.1 hypothetical protein FLJ12151 down
    FLJ12151
    215387_x_at NM_005708 Glypican 6 GPC6 down
    215600_x_at NM_207102 F-box and WD-40 FBXW12 down
    domain protein 12
    215609_at AK023895 down
    215645_at NM_144606 /// Hypothetical protein FLCN down
    NM_144997 MGC13008
    215659_at NM_018530 Gasdermin-like GSDML down
    215892_at AK021474 down
    216012_at U43604.1 human unidentified down
    mRNA, partial sequence
    216110_x_at AU147017 down
    216187_x_at AF222691.1 Homo sapiens Alu LNX1 down
    repeat
    216745_x_at NM_015116 Leucine-rich repeats and LRCH1 down
    calponin homology (CH)
    domain containing 1
    216922_x_at NM_001005375 /// deleted in azoospermia DAZ2 down
    NM_001005785 ///
    NM_001005786 ///
    NM_004081 ///
    NM_020363 ///
    NM_020364 ///
    NM_020420
    217313_at AC004692 down
    217336_at NM_001014 ribosomal protein S10 RPS10 down
    217371_s_at NM_000585 /// interleukin 15 IL15 down
    NM_172174 ///
    NM_172175
    217588_at NM_054020 /// cation channel, sperm CATSPER2 down
    NM_172095 /// associated 2
    NM_172096 ///
    NM_172097
    217671_at BE466926 down
    218067_s_at NM_018011 hypothetical protein FLJ10154 down
    FLJ10154
    218265_at NM_024077 SECIS binding protein 2 SECISBP2 down
    218336_at NM_012394 prefoldin 2 PFDN2 up
    218425_at NM_019011 /// TRIAD3 protein TRIAD3 down
    NM_207111 ///
    NM_207116
    218617_at NM_017646 tRNA TRIT1 down
    isopentenyltransferase 1
    218976_at NM_021800 DnaJ (Hsp40) homolog, DNAJC12 up
    subfamily C, member 12
    219203_at NM_016049 chromosome 14 open C14orf122 up
    reading frame 122
    219290_x_at NM_014395 dual adaptor of DAPP1 down
    phosphotyrosine and 3-
    phosphoinositides
    219977_at NM_014336 aryl hydrocarbon AIPL1 down
    receptor interacting
    protein-like 1
    220071_x_at NM_018097 chromosome 15 open C15orf25 down
    reading frame 25
    220113_x_at NM_019014 polymerase (RNA) I POLR1B down
    polypeptide B, 128 kDa
    220215_at NM_024804 hypothetical protein FLJ12606 down
    FLJ12606
    220242_x_at NM_018260 hypothetical protein FLJ10891 down
    FLJ10891
    220459_at NM_018118 MCM3 MCM3APAS down
    minichromosome
    maintenace deficient 3
    (s. cerevisiae) associated
    protein, antisense
    220856_x_at NM_014128 down
    220934_s_at NM_024084 hypothetical protein MGC3196 down
    MGC3196
    221294_at NM_005294 G protein-coupled GPR21 down
    receptor 21
    221616_s_at AF077053 Phosphoglycerate kinase 1 PGK1 down
    221759_at NM_138387 glucose-6-phosphatase G6PC3 up
    catalytic subunit-related
    222155_s_at NM_024531 G protein-coupled GPR172A up
    receptor 172A
    222168_at NM_000693 Aldehyde ALDH1A3 down
    dehydrogenase 1 family,
    member A3
    222231_s_at NM_018509 hypothetical protein PRO1855 up
    PRO1855
    222272_x_at NM_033128 scinderin SCIN down
    222310_at NM_020706 splicing factor, SFRS15 down
    arginine/serine-rich 15
    222358_x_at AI523613 down
    64371_at NM_014884 splicing factor, SFRS14 down
    arginine/serine-rich 14
  • Table 2 shows one preferred 84 gene group that was identified as a group distinguishing smokers with cancer from smokers without cancer. The difference in expression is indicated at the column on the right as either “down”, which indicates that the expression of that particular transcript was lower in smokers with cancer than in smokers without cancer, and “up”, which indicates that the expression of that particular transcript was higher in smokers with cancer than smokers without cancer. These genes were identified using traditional Student's t-test analysis.
  • In one embodiment, the exemplary probes shown in the column “Affymetrix Id in the Human Genome U133 chip” can be used in the expression analysis.
  • TABLE 2
    84 Gene Group
    GenBank ID
    (unless otherwise Direction in Affymetrix
    mentioned) Gene Name Description Cancer ID
    NM_030757.1 MKRN4 makorin, ring finger down 208082_x_at
    protein, 4 /// makorin,
    ring finger protein, 4
    R83000 BTF3 basic transcription down 214800_x_at
    factor
    3
    AK021571.1 MUC20 mucin 20 down 215208_x_at
    NM_014182.1 ORMDL2 ORM1-like 2 (S. cerevisiae) up 218556_at
    NM_17932.1 FLJ20700 hypothetical protein down 207730_x_at
    FLJ20700
    U85430.1 NFATC3 nuclear factor of down 210556_at
    activated T-cells,
    cytoplasmic,
    calcineurin-dependent 3
    AI683552 down 217679_x_at
    BC002642.1 CTSS cathepsin S down 202901_x_at
    AW024467 RIPX rap2 interacting protein x down 213939_s_at
    NM_030972.1 MGC5384 hypothetical protein down 208137_x_at
    MGC5384 ///
    hypothetical protein
    MGC5384
    BC021135.1 INADL InaD-like protein down 214705_at
    AL161952.1 GLUL glutamate-ammonia down 215001_s_at
    ligase (glutamine
    synthase)
    AK026565.1 FLJ10534 hypothetical protein down 218155_x_at
    FLJ10534
    AK023783.1 Homo sapiens cDNA down 215604_x_at
    FLJ13721 fis, clone
    PLACE2000450.
    BF218804 AFURS1 ATPase family homolog down 212297_at
    up-regulated in
    senescence cells
    NM_001281.1 CKAP1 cytoskeleton associated up 201804_x_at
    protein 1
    NM_024006.1 IMAGE3455200 hypothetical protein up 217949_s_at
    IMAGE3455200
    AK023843.1 PGF placental growth factor, down 215179_x_at
    vascular endothelial
    growth factor-related
    protein
    BC001602.1 CFLAR CASP8 and FADD-like down 211316_x_at
    apoptosis regulator
    BC034707.1 Homo sapiens down 217653_x_at
    transcribed sequence
    with weak similarity to
    protein
    ref: NP_060312.1
    (H. sapiens)
    hypothetical protein
    FLJ20489 [Homo
    sapiens]
    BC064619.1 CD24 CD24 antigen (small down 266_s_at
    cell lung carcinoma
    cluster 4 antigen)
    AY280502.1 EPHB6 EphB6 down 204718_at
    BC059387.1 MYO1A myosin IA down 211916_s_at
    Homo sapiens down 215032_at
    transcribed sequences
    AF135421.1 GMPPB GDP-mannose up 219920_s_at
    pyrophosphorylase B
    BC061522.1 MGC70907 similar to MGC9515 down 211996_s_at
    protein
    L76200.1 GUK1 guanylate kinase 1 up 200075_s_at
    U50532.1 CG005 hypothetical protein down 214753_at
    from BCRA2 region
    BC006547.2 EEF2 eukaryotic translation down 204102_s_at
    elongation factor 2
    BC008797.2 FVT1 follicular lymphoma down 202419_at
    variant translocation 1
    BC000807.1 ZNF160 zinc finger protein 160 down 214715_x_at
    AL080112.1 down 216859_x_at
    BC033718.1 /// C21orf106 chromosome 21 open down 215529_x_at
    BC046176.1 /// reading frame 106
    BC038443.1
    NM_000346.1 SOX9 SRY (sex determining up 202936_s_at
    region Y)-box 9
    (campomelic dysplasia,
    autosomal sex-reversal)
    BC008710.1 SUI1 putative translation up 212130_x_at
    initiation factor
    Hs.288575 Homo sapiens cDNA down 215204_at
    (UNIGENE ID) FLJ14090 fis, clone
    MAMMA1000264.
    AF020591.1 AF020591 zinc finger protein down 218735_s_at
    BC000423.2 ATP6V0B ATPase, H+ up 200078_s_at
    transporting, lysosomal
    21 kDa, V0 subunit c″ ///
    ATPase, H+
    transporting, lysosomal
    21 kDa, V0 subunit c″
    BC002503.2 SAT spermidine/spermine down 203455_s_at
    N1-acetyltransferase
    BC008710.1 SUI1 putative translation up 212227_x_at
    initiation factor
    Homo sapiens down 222282_at
    transcribed sequences
    BC009185.2 DCLRE1C DNA cross-link repair down 219678_x_at
    1C (PSO2 homolog, S. cerevisiae)
    Hs.528304 ADAM28 a disintegrin and down 208268_at
    (UNIGENE ID) metalloproteinase
    domain 28
    U50532.1 CG005 hypothetical protein down 221899_at
    from BCRA2 region
    BC013923.2 SOX2 SRY (sex determining down 213721_at
    region Y)-box 2
    BC031091 ODAG ocular development- down 214718_at
    associated gene
    NM_007062 PWP1 nuclear phosphoprotein up 201608_s_at
    similar to S. cerevisiae
    PWP1
    Hs.249591 FLJ20686 hypothetical protein down 205684_s_at
    (Unigene ID) FLJ20686
    BC075839.1 /// KRT8 keratin 8 up 209008_x_at
    BC073760.1
    BC072436.1 /// HYOU1 hypoxia up-regulated 1 up 200825_s_at
    BC004560.2
    BC001016.2 NDUFA8 NADH dehydrogenase up 218160_at
    (ubiquinone) 1 alpha
    subcomplex, 8, 19 kDa
    Hs.286261 FLJ20195 hypothetical protein down 57739_at
    (Unigene ID) FLJ20195
    AF348514.1 Homo sapiens fetal down 211921_x_at
    thymus prothymosin
    alpha mRNA, complete
    cds
    BC005023.1 CGI-128 CGI-128 protein up 218074_at
    BC066337.1 /// KTN1 kinectin 1 (kinesin down 200914_x_at
    BC058736.1 /// receptor)
    BC050555.1
    down 216384_x_at
    Hs.216623 ATP8B1 ATPase, Class I, type down 214594_x_at
    (Unigene ID) 8B, member 1
    BC072400.1 THOC2 THO complex 2 down 222122_s_at
    BC041073.1 PRKX protein kinase, X-linked down 204060_s_at
    U43965.1 ANK3 ankyrin 3, node of down 215314_at
    Ranvier (ankyrin G)
    down 208238_x_at
    BC021258.2 TRIM5 tripartite motif- down 210705_s_at
    containing 5
    BC016057.1 USH1C Usher syndrome 1C down 211184_s_at
    (autosomal recessive,
    severe)
    BC016713.1 /// PARVA parvin, alpha down 215418_at
    BC014535.1 ///
    AF237771.1
    BC000360.2 EIF4EL3 eukaryotic translation up 209393_s_at
    initiation factor 4E-like 3
    BC007455.2 SH3GLB1 SH3-domain GRB2-like up 210101_x_at
    endophilin B1
    BC000701.2 KIAA0676 KIAA0676 protein down 212052_s_at
    BC010067.2 CHC1 chromosome down 215011_at
    condensation 1
    BC023528.2 /// C14orf87 chromosome 14 open up 221932_s_at
    BC047680.1 reading frame 87
    BC064957.1 KIAA0102 KIAA0102 gene up 201239_s_at
    product
    Hs.156701 Homo sapiens cDNA down 215553_x_at
    (Unigene ID) FLJ14253 fis, clone
    OVARC1001376.
    BC030619.2 KIAA0779 KIAA0779 protein down 213351_s_at
    BC008710.1 SUI1 putative translation up 202021_x_at
    initiation factor
    U43965.1 ANK3 ankyrin 3, node of down 209442_x_at
    Ranvier (ankyrin G)
    BC066329.1 SDHC succinate up 210131_x_at
    dehydrogenase
    complex, subunit C,
    integral membrane
    protein, 15 kDa
    Hs.438867 Homo sapiens down 217713_x_at
    (Unigene ID) transcribed sequence
    with weak similarity to
    protein
    ref: NP_060312.1
    (H. sapiens)
    hypothetical protein
    FLJ20489 [Homo
    sapiens]
    BC035025.2 /// ALMS1 Alstrom syndrome 1 down 214707_x_at
    BC050330.1
    BC023976.2 PDAP2 PDGFA associated up 203272_s_at
    protein
    2
    BC074852.2 /// PRKY protein kinase, Y-linked down 206279_at
    BC074851.2
    Hs.445885 KIAA1217 Homo sapiens cDNA down 214912_at
    (Unigene ID) FLJ12005 fis, clone
    HEMBB1001565.
    BC008591.2 /// KIAA0100 KIAA0100 gene up 201729_s_at
    BC050440.1 /// product
    BC048096.1
    AF365931.1 ZNF264 zinc finger protein 264 down 205917_at
    AF257099.1 PTMA prothymosin, alpha down 200772_x_at
    (gene sequence 28)
    BC028912.1 DNAJB9 DnaJ (Hsp40) homolog, up 202842_s_at
    subfamily B, member 9
  • Table 3 shows one preferred 50 gene group that was identified as a group distinguishing smokers with cancer from smokers without cancer. The difference in expression is indicated at the column on the right as either “down”, which indicates that the expression of that particular transcript was lower in smokers with cancer than in smokers without cancer, and “up”, which indicates that the expression of that particular transcript was higher in smokers with cancer than smokers without cancer.
  • This gene group was identified using the GenePattern server from the Broad Institute, which includes the Weighted Voting algorithm. The default settings, i.e., the signal to noise ratio and no gene filtering, were used.
  • In one embodiment, the exemplary probes shown in the column “ Affymetrix Id in the Human Genome U133 chip” can be used in the expression analysis.
  • TABLE 3
    50 Gene Group
    Affymetrix Id in the
    Direction Human Genome
    GenBank ID Gene Name in Cancer U133 chip
    NM_007062.1 PWP1 up in cancer 201608_s_at
    NM_001281.1 CKAP1 up in cancer 201804_x_at
    BC000120.1 up in cancer 202355_s_at
    NM_014255.1 TMEM4 up in cancer 202857_at
    BC002642.1 CTSS up in cancer 202901_x_at
    NM_000346.1 SOX9 up in cancer 202936_s_at
    NM_006545.1 NPR2L up in cancer 203246_s_at
    BG034328 up in cancer 203588_s_at
    NM_021822.1 APOBEC3G up in cancer 204205_at
    NM_021069.1 ARGBP2 up in cancer 204288_s_at
    NM_019067.1 FLJ10613 up in cancer 205010_at
    NM_017925.1 FLJ20686 up in cancer 205684_s_at
    NM_017932.1 FLJ20700 up in cancer 207730_x_at
    NM_030757.1 MKRN4 up in cancer 208082_x_at
    NM_030972.1 MGC5384 up in cancer 208137_x_at
    AF126181.1 BCG1 up in cancer 208682_s_at
    U93240.1 up in cancer 209653_at
    U90552.1 up in cancer 209770_at
    AF151056.1 up in cancer 210434_x_at
    U85430.1 NFATC3 up in cancer 210556_at
    U51007.1 up in cancer 211609_x_at
    BC005969.1 up in cancer 211759_x_at
    NM_002271.1 up in cancer 211954_s_at
    AL566172 up in cancer 212041_at
    AB014576.1 KIAA0676 up in cancer 212052_s_at
    BF218804 AFURS1 down in cancer 212297_at
    AK022494.1 down in cancer 212932_at
    AA114843 down in cancer 213884_s_at
    BE467941 down in cancer 214153_at
    NM_003541.1 HIST1H4K down in cancer 214463_x_at
    R83000 BTF3 down in cancer 214800_x_at
    AL161952.1 GLUL down in cancer 215001_s_at
    AK023843.1 PGF down in cancer 215179_x_at
    AK021571.1 MUC20 down in cancer 215208_x_at
    AK023783.1 down in cancer 215604_x_at
    AU147182 down in cancer 215620_at
    AL080112.1 down in cancer 216859_x_at
    AW971983 down in cancer 217588_at
    AI683552 down in cancer 217679_x_at
    NM_024006.1 IMAGE3455200 down in cancer 217949_s_at
    AK026565.1 FLJ10534 down in cancer 218155_x_at
    NM_014182.1 ORMDL2 down in cancer 218556_at
    NM_021800.1 DNAJC12 down in cancer 218976_at
    NM_016049.1 CGI-112 down in cancer 219203_at
    NM_019023.1 PRMT7 down in cancer 219408_at
    NM_021971.1 GMPPB down in cancer 219920_s_at
    NM_014128.1 down in cancer 220856_x_at
    AK025651.1 down in cancer 221648_s_at
    AA133341 C14orf87 down in cancer 221932_s_at
    AF198444.1 down in cancer 222168_at
  • Table 4 shows one preferred 36 gene group that was identified as a group distinguishing smokers with cancer from smokers without cancer. The difference in expression is indicated at the column on the right as either “down”, which indicates that the expression of that particular transcript was lower in smokers with cancer than in smokers without cancer, and “up”, which indicates that the expression of that particular transcript was higher in smokers with cancer than smokers without cancer.
  • In one embodiment, the exemplary probes shown in the column “Affymetrix Id in the Human Genome U133 chip” can be used in the expression analysis.
  • TABLE 4
    36 Gene Group
    GenBank ID Gene Name Gene Description Affy ID
    NM_007062.1 PWP1 nuclear phosphoprotein 201608_s_at
    similar to S. cerevisiae
    PWP1
    NM_001281.1 CKAP1 cytoskeleton associated 201804_x_at
    protein 1
    BC002642.1 CTSS cathepsin S 202901_x_at
    NM_000346.1 SOX9 SRY (sex determining 202936_s_at
    region Y)-box 9
    (campomelic dysplasia,
    autosomal sex-reversal)
    NM_006545.1 NPR2L homologous to yeast 203246_s_at
    nitrogen permease
    (candidate tumor
    suppressor)
    BG034328 transcription factor Dp-2 203588_s_at
    (E2F dimerization partner
    2)
    NM_019067.1 FLJ10613 hypothetical protein 205010_at
    FLJ10613
    NM_017925.1 FLJ20686 hypothetical protein 205684_s_at
    FLJ20686
    NM_017932.1 FLJ20700 hypothetical protein 207730_x_at
    FLJ20700
    NM_030757.1 MKRN4 makorin, ring finger 208082_x_at
    protein, 4 /// makorin, ring
    finger protein, 4
    NM_030972.1 MGC5384 hypothetical protein 208137_x_at
    MGC5384
    NM_002268 /// KPNA4 karyopherin alpha 4 209653_at
    NM_032771 (importin alpha 3)
    NM_007048 /// BTN3A1 butyrophilin, subfamily 3, 209770_at
    NM_194441 member A1
    NM_006694 JBT jumping translocation 210434_x_at
    breakpoint
    U85430.1 NFATC3 nuclear factor of activated 210556_at
    T-cells, cytoplasmic,
    calcineurin-dependent 3
    NM_004691 ATP6V0D1 ATPase, H+ transporting, 212041_at
    lysosomal 38 kDa, V0
    subunit d isoform 1
    AB014576.1 KIAA0676 KIAA0676 protein 212052_s_at
    BF218804 AFURS1 ATPase family homolog 212297_at
    up-regulated in senescence
    cells
    BE467941 EVOVL family member 5, 214153_at
    elongation of long chain
    fatty acids (FEN1/Elo2,
    SUR4/Elo3-like, yeast)
    R83000 BTF3 basic transcription factor 3 214800_x_at
    AL161952.1 GLUL glutamate-ammonia ligase 215001_s_at
    (glutamine synthase)
    AK023843.1 PGF placental growth factor, 215179_x_at
    vascular endothelial
    growth factor-related
    protein
    AK021571.1 MUC20 mucin 20 215208_x_at
    AK023783.1 Homo sapiens cDNA 215604_x_at
    FLJ13721 fis, clone
    PLACE2000450.
    AL080112.1 216859_x_at
    AW971983 cation, sperm associated 2 217588_at
    AI683552 217679_x_at
    NM_024006.1 IMAGE3455200 hypothetical protein 217949_s_at
    IMAGE3455200
    AK026565.1 FLJ10534 hypothetical protein 218155_x_at
    FLJ10534
    NM_014182.1 ORMDL2 ORM1-like 2 (S. cerevisiae) 218556_at
    NM_021800.1 DNAJC12 J Domain containing 218976_at
    protein
    1
    NM_016049.1 CGI-112 comparative gene 219203_at
    identification transcript
    112
    NM_021971.1 GMPPB GDP-mannose 219920_s_at
    pyrophosphorylase B
    NM_014128.1 220856_x_at
    AA133341 C14orf87 chromosome 14 open 221932_s_at
    reading frame
    87
    AF198444.1 Homo sapiens 10q21 222168_at
    mRNA sequence
  • In one embodiment, the gene group of the present invention comprises at least, for example, 5, 10, 15, 20, 25, 30, more preferably at least 36, still more preferably at least about 40, still more preferably at least about 50, still more preferably at least about 60, still more preferably at least about 70, still more preferably at least about 80, still more preferably at least about 86, still more preferably at least about 90, still more preferably at least about 96 of the genes as shown in Tables 1-4.
  • In one preferred embodiment, the gene group comprises 36-180 genes selected from the group consisting of the genes listed in Tables 1-4.
  • In one embodiment, the invention provides group of genes the expression of which is lower in individuals with cancer.
  • Accordingly, in one embodiment, the invention provides of a group of genes useful in diagnosing lung diseases, wherein the expression of the group of genes is lower in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-30, still more preferably at least about 30-40, still more preferably at least about 40-50, still more preferably at least about 50-60, still more preferably at least about 60-70, still more preferably about 72 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 1): NM_003335; NM_001319; NM_021145.1; NM_001003698 /// NM_001003699 ///; NM_002955; NM_002853.1; NM_019067.1; NM_024917.1; NM_020979.1; NM_005597.1; NM_007031.1; NM_009590.1; NM_020217.1; NM_025026.1; NM_014709.1; NM_014896.1; AF010144; NM_005374.1; NM_006534 /// NM_181659; NM_014033; NM_016138; NM_007048 /// NM_194441; NM_000051 /// NM_138292 /// NM_138293; NM_000410 /// NM_139002 /// NM_139003 /// NM_139004 /// NM_139005 /// NM_139006 /// NM_139007 /// NM_139008 /// NM_139009 /// NM_139010 /// NM_139011; NM_012070 /// NM_139321 /// NM_139322; NM_006095; AI632181; AW024467; NM_021814; NM_005547.1; NM_203458; NM_015547 /// NM_147161; AB007958.1; NM_207488; NM_005809 /// NM_181737 /// NM_181738; NM_016248 /// NM_144490; AK022213.1; NM_005708; NM_207102; AK023895; NM_144606 /// NM_144997; NM_018530; AK021474; U43604.1; AU147017; AF222691.1; NM_015116; NM_001005375 /// NM_001005785 /// NM_001005786 /// NM_004081 /// NM_020363 /// NM_020364 /// NM_020420; AC004692; NM_001014; NM_000585 /// NM_172174 /// NM_172175; NM_054020 /// NM_172095 /// NM_172096 /// NM_172097; BE466926; NM_018011; NM_024077; NM_019011 /// NM_207111 /// NM_207116; NM_017646; NM_014395; NM_014336; NM_018097; NM_019014; NM_024804; NM_018260; NM_018118; NM_014128; NM_024084; NM_005294; AF077053; NM_000693; NM_033128; NM_020706; AI523613; and NM_014884.
  • In another embodiment, the invention provides of a group of genes useful in diagnosing lung diseases wherein the expression of the group of genes is lower in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-30, still more preferably at least about 30-40, still more preferably at least about 40-50, still more preferably at least about 50-60, still more preferably about 63 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 2): NM_030757.1; R83000; AK021571.1; NM_17932.1; U85430.1; AI683552; BC002642.1; AW024467; NM_030972.1; BC021135.1; AL161952.1; AK026565.1; AK023783.1; BF218804; AK023843.1; BC001602.1; BC034707.1; BC064619.1; AY280502.1; BC059387.1; BC061522.1; U50532.1; BC006547.2; BC008797.2; BC000807.1; AL080112.1; BC033718.1 /// BC046176.1 ///; BC038443.1; Hs.288575 (UNIGENE ID); AF020591.1; BC002503.2; BC009185.2; Hs.528304 (UNIGENE ID); U50532.1; BC013923.2; BC031091; Hs.249591 (Unigene ID); Hs.286261 (Unigene ID); AF348514.1; BC066337.1 /// BC058736.1 /// BC050555.1; Hs.216623 (Unigene ID); BC072400.1; BC041073.1; U43965.1; BC021258.2; BC016057.1; BC016713.1 /// BC014535.1 /// AF237771.1; BC000701.2; BC010067.2; Hs.156701 (Unigene ID); BC030619.2; U43965.1; Hs.438867 (Unigene ID); BC035025.2 /// BC050330.1; BC074852.2 /// BC074851.2; Hs.445885 (Unigene ID); AF365931.1; and AF257099.1
  • In another embodiment, the invention provides of a group of genes useful in diagnosing lung diseases wherein the expression of the group of genes is lower in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-25, still more preferably about 25 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 3): BF218804; AK022494.1; AA114843; BE467941; NM_003541.1; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AU147182; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_019023.1; NM_021971.1; NM_014128.1; AK025651.1; AA133341; and AF198444.1.
  • In another embodiment, the invention provides of a group of genes useful in diagnosing lung diseases wherein the expression of the group of genes is higher in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least to 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-25, still more preferably about 25 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 1): NM_000918; NM_006430.1; NM_001416.1; NM_004090; NM_006406.1; NM_003001.2; NM_006545.1; NM_002437.1; NM_006286; NM_001123 /// NM_006721; NM_024824; NM_004935.1; NM_001696; NM_005494 /// NM_058246; NM_006368; NM_002268 /// NM_032771; NM_006694; NM_004691; NM_012394; NM_021800; NM_016049; NM_138387; NM_024531; and NM_018509.
  • In another embodiment, the invention provides of a group of genes useful in diagnosing lung diseases wherein the expression of the group of genes is higher in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least to 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-23, still more preferably about 23 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 2): NM_014182.1; NM_001281.1; NM_024006.1; AF135421.1; L76200.1; NM_000346.1; BC008710.1; BC000423.2; BC008710.1; NM_007062; BC075839.1 /// BC073760.1; BC072436.1 /// BC004560.2; BC001016.2; BC005023.1; BC000360.2; BC007455.2; BC023528.2 /// BC047680.1; BC064957.1; BC008710.1; BC066329.1; BC023976.2; BC008591.2 /// BC050440.1 /// BC048096.1; and BC028912.1.
  • In another embodiment, the invention provides of a group of genes useful in diagnosing lung diseases wherein the expression of the group of genes is higher in individuals exposed to air pollutants with cancer as compared to individuals exposed to the same air pollutant who do not have cancer, the group comprising probes that hybridize at least to 5, preferably at least about 5-10, still more preferably at least about 10-20, still more preferably at least about 20-25, still more preferably about 25 genes consisting of transcripts (transcripts are identified using their GenBank ID or Unigene ID numbers and the corresponding gene names appear in Table 3): NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC005969.1; NM_002271.1; AL566172; and AB014576.1.
  • In one embodiment, the invention provides a method of diagnosing lung disease comprising the steps of measuring the expression profile of a gene group in an individual suspected of being affected or being at high risk of a lung disease (i.e. test individual), and comparing the expression profile (i.e. control profile) to an expression profile of an individual without the lung disease who has also been exposed to similar air pollutant than the test individual (i.e. control individual), wherein differences in the expression of genes when compared between the afore mentioned test individual and control individual of at least 10, more preferably at least 20, still more preferably at least 30, still more preferably at least 36, still more preferably between 36-180, still more preferably between 36-96, still more preferably between 36-84, still more preferably between 36-50, is indicative of the test individual being affected with a lung disease. Groups of about 36 genes as shown in table 4, about 50 genes as shown in table 3, about 84 genes as shown in table 2 and about 96 genes as shown in table 1 are preferred. The different gene groups can also be combined, so that the test individual can be screened for all, three, two, or just one group as shown in tables 1-4.
  • For example, if the expression profile of a test individual exposed to cigarette smoke is compared to the expression profile of the 50 genes shown in table 3, using the Affymetrix inc probe set on a gene chip as shown in table 3, the expression profile that is similar to the one shown in FIG. 10 for the individuals with cancer, is indicative that the test individual has cancer. Alternatively, if the expression profile is more like the expression profile of the individuals who do not have cancer in FIG. 10, the test individual likely is not affected with lung cancer.
  • The group of 50 genes was identified using the GenePattern server from the Broad Institute, which includes the Weighted Voting algorithm. The default settings, i.e., the signal to noise ratio and no gene filtering, were used. GenePattern is available through the World Wide Wed at location broad.mit.edu/cancer/software/genepattern. This program allows analysis of data in groups rather than as individual genes. Thus, in one preferred embodiment, the expression of substantially all 50 genes of Table 3, are analyzed together. The expression profile of lower that normal expression of genes selected from the group consisting of BF218804; AK022494.1; AA114843; BE467941; NM_003541.1; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AU147182; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_019023.1; NM_021971.1; NM_014128.1; AK025651.1; AA133341; and AF198444.1, and the gene expression profile of higher than normal expression of genes selected from the group consisting of NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC005969.1; NM_002271.1; AL566172; and AB014576.1, is indicative of the individual having or being at high risk of developing lung disease, such as lung cancer. In one preferred embodiment, the expression pattern of all the genes in the Table 3 is analyzed. In one embodiment, in addition to analyzing the group of predictor genes of Table 3, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10-15, 15-20, 20-30, or more of the individual predictor genes identified using the t-test analysis are analyzed. Any combination of, for example, 5-10 or more of the group predictor genes and 5-10, or more of the individual genes can also be used.
  • The term “expression profile” as used herein, refers to the amount of the gene product of each of the analyzed individual genes in the sample. The “expression profile” is like a signature expression map, like the one shown for each individual in FIG. 10, on the Y-axis.
  • The term “lung disease”, as used herein, refers to disorders including, but not limited to, asthma, chronic bronchitis, emphysema, bronchietasis, primary pulmonary hypertension and acute respiratory distress syndrome. The methods described herein may also be used to diagnose or treat lung disorders that involve the immune system including, hypersensitivity pneumonitis, eosinophilic pneumonias, and persistent fungal infections, pulmonary fibrosis, systemic sclerosis, idiopathic pulmonary hemosiderosis, pulmonary alveolar proteinosis, cancers of the lung such as adenocarcinoma, squamous cell carcinoma, small cell and large cell carcinomas, and benign neoplasm of the lung including bronchial adenomas and hamartomas. In one preferred embodiment, the lung disease is lung cancer.
  • The biological samples useful according to the present invention include, but are not limited to tissue samples, cell samples, and excretion samples, such as sputum or saliva, of the airways. The samples useful for the analysis methods according to the present invention can be taken from the mouth, the bronchial airways, and the lungs.
  • The term “air pollutants”, as used herein, refers to any air impurities or environmental airway stress inducing agents, such as cigarette smoke, cigar smoke, smog, asbestos, and other air pollutants that have suspected or proven association to lung diseases.
  • The term “individual”, as used herein, preferably refers to human However, the methods are not limited to humans, and a skilled artisan can use the diagnostic/prognostic gene groupings of the present invention in, for example, laboratory test animals, preferably animals that have lungs, such as non-human primates, murine species, including, but not limited to rats and mice, dogs, sheep, pig, guinea pigs, and other model animals
  • The term “field defect” as used throughout the specification means that the transcription pattern of epithelial cells lining the entire airway including the mouth buccal mucosa, airways, and lung tissue changes in response to airway pollutants. Therefore, the present invention provides methods to identify epithelial cell gene expression patterns that are associated with diseases and disorders of the lung.
  • The phrase “altered expression” as used herein, refers to either increased or decreased expression in an individual exposed to air pollutant, such as a smoker, with cancer when compared to an expression pattern of the lung cells from an individual exposed to similar air pollutant, such as smoker, who does not have cancer. Tables 1 and 2 show the preferred expression pattern changes of the invention. The terms “up” and “down” in the tables refer to the amount of expression in a smoker with cancer to the amount of expression in a smoker without cancer Similar expression pattern changes are likely associated with development of cancer in individuals who have been exposed to other airway pollutants.
  • In one embodiment, the group of genes the expression of which is analyzed in diagnosis and/or prognosis of lung cancer are selected from the group of 80 genes as shown in Table 5. Any combination of genes can be selected from the 80 genes. In one embodiment, the combination of 20 genes shown in Table 7 is selected. In one embodiment, a combination of genes from Table 6 is selected.
  • TABLE 5
    Group of 80 genes for prognostic and diagnostic testing of lung cancer.
    Number of
    runs the gene is
    Affymetrix probe indicated in
    ID No. that can be cancer samples Signal to noise in a cancer sample.
    used to identify as differentially Negative values indicate increase
    the gene/nucleic expressed out of expression in lung cancer,
    acid sequence in of 1000 test positive values indicate decrease of
    the next column Gene symbol runs expression in lung cancer.
    200729_s_at ACTR2 736 −0.22284
    200760_s_at ARL6IP5 483 −0.21221
    201399_s_at TRAM1 611 −0.21328
    201444_s_at ATP6AP2 527 −0.21487
    201635_s_at FXR1 458 −0.2162
    201689_s_at TPD52 565 −0.22292
    201925_s_at DAF 717 −0.25875
    201926_s_at DAF 591 −0.23228
    201946_s_at CCT2 954 −0.24592
    202118_s_at CPNE3 334 −0.21273
    202704_at TOB1 943 −0.25724
    202833_s_at SERPINA1 576 −0.20583
    202935_s_at SOX9 750 −0.25574
    203413_at NELL2 629 −0.23576
    203881_s_at DMD 850 −0.24341
    203908_at SLC4A4 887 −0.23167
    204006_s_at FCGR3A /// 207 −0.20071
    FCGR3B
    204403_x_at KIAA0738 923 0.167772
    204427_s_at RNP24 725 −0.2366
    206056_x_at SPN 976 0.196398
    206169_x_at RoXaN 984 0.259637
    207730_x_at HDGF2 969 0.169108
    207756_at 855 0.161708
    207791_s_at RAB1A 823 −0.21704
    207953_at AD7C-NTP 1000 0.218433
    208137_x_at 996 0.191938
    208246_x_at TK2 982 0.179058
    208654_s_at CD164 388 −0.21228
    208892_s_at DUSP6 878 −0.25023
    209189_at FOS 935 −0.27446
    209204_at LMO4 78 0.158674
    209267_s_at SLC39A8 228 −0.24231
    209369_at ANXA3 384 −0.19972
    209656_s_at TMEM47 456 −0.23033
    209774_x_at CXCL2 404 −0.2117
    210145_at PLA2G4A 475 −0.26146
    210168_at C6 458 −0.24157
    210317_s_at YWHAE 803 −0.29542
    210397_at DEFB1 176 −0.22512
    210679_x_at 970 0.181718
    211506_s_at IL8 270 −0.3105
    212006_at UBXD2 802 −0.22094
    213089_at LOC153561 649 0.164097
    213736_at COX5B 505 0.155243
    213813_x_at 789 0.178643
    214007_s_at PTK9 480 −0.21285
    214146_s_at PPBP 593 −0.24265
    214594_x_at ATP8B1 962 0.284039
    214707_x_at ALMS1 750 0.164047
    214715_x_at ZNF160 996 0.198532
    215204_at SENP6 211 0.169986
    215208_x_at RPL35A 999 0.228485
    215385_at FTO 164 0.187634
    215600_x_at FBXW12 960 0.17329
    215604_x_at UBE2D2 998 0.224878
    215609_at STARD7 940 0.191953
    215628_x_at PPP2CA 829 0.16391
    215800_at DUOX1 412 0.160036
    215907_at BACH2 987 0.178338
    215978_x_at LOC152719 645 0.163399
    216834_at 633 −0.25508
    216858_x_at 997 0.232969
    217446_x_at 942 0.182612
    217653_x_at 976 0.270552
    217679_x_at 987 0.265918
    217715_x_at ZNF354A 995 0.223881
    217826_s_at UBE2J1 812 −0.23003
    218155_x_at FLJ10534 998 0.186425
    218976_at DNAJC12 486 −0.22866
    219392_x_at FLJ11029 867 0.169113
    219678_x_at DCLRE1C 877 0.169975
    220199_s_at FLJ12806 378 −0.20713
    220389_at FLJ23514 102 0.239341
    220720_x_at FLJ14346 989 0.17976
    221191_at DKFZP434A0131 616 0.185412
    221310_at FGF14 511 −0.19965
    221765_at 319 −0.25025
    222027_at NUCKS 547 0.171954
    222104_x_at GTF2H3 981 0.186025
    222358_x_at 564 0.194048
  • TABLE 6
    Group of 535 genes useful in prognosis or diagnosis of lung cancer.
    Number of
    runs the gene is
    Affymetrix probe indicated in
    ID No. that can be cancer samples Signal to noise in a cancer sample.
    used to identify as differentially Negative values indicate increase
    the gene/nucleic expressed out of expression in lung cancer,
    acid sequence in of 1000 test positive values indicate decrease of
    the next column Gene symbol runs expression in lung cancer.
    200729_s_at ACTR2 736 −0.22284
    200760_s_at ARL6IP5 483 −0.21221
    201399_s_at TRAM1 611 −0.21328
    201444_s_at ATP6AP2 527 −0.21487
    201635_s_at FXR1 458 −0.2162
    201689_s_at TPD52 565 −0.22292
    201925_s_at DAF 717 −0.25875
    201926_s_at DAF 591 −0.23228
    201946_s_at CCT2 954 −0.24592
    202118_s_at CPNE3 334 −0.21273
    202704_at TOB1 943 −0.25724
    202833_s_at SERPINA1 576 −0.20583
    202935_s_at SOX9 750 −0.25574
    203413_at NELL2 629 −0.23576
    203881_s_at DMD 850 −0.24341
    203908_at SLC4A4 887 −0.23167
    204006_s_at FCGR3A /// 207 −0.20071
    FCGR3B
    204403_x_at KIAA0738 923 0.167772
    204427_s_at RNP24 725 −0.2366
    206056_x_at SPN 976 0.196398
    206169_x_at RoXaN 984 0.259637
    207730_x_at HDGF2 969 0.169108
    207756_at 855 0.161708
    207791_s_at RAB1A 823 −0.21704
    207953_at AD7C-NTP 1000 0.218433
    208137_x_at 996 0.191938
    208246_x_at TK2 982 0.179058
    208654_s_at CD164 388 −0.21228
    208892_s_at DUSP6 878 −0.25023
    209189_at FOS 935 −0.27446
    209204_at LMO4 78 0.158674
    209267_s_at SLC39A8 228 −0.24231
    209369_at ANXA3 384 −0.19972
    209656_s_at TMEM47 456 −0.23033
    209774_x_at CXCL2 404 −0.2117
    210145_at PLA2G4A 475 −0.26146
    210168_at C6 458 −0.24157
    210317_s_at YWHAE 803 −0.29542
    210397_at DEFB1 176 −0.22512
    210679_x_at 970 0.181718
    211506_s_at IL8 270 −0.3105
    212006_at UBXD2 802 −0.22094
    213089_at LOC153561 649 0.164097
    213736_at COX5B 505 0.155243
    213813_x_at 789 0.178643
    214007_s_at PTK9 480 −0.21285
    214146_s_at PPBP 593 −0.24265
    214594_x_at ATP8B1 962 0.284039
    214707_x_at ALMS1 750 0.164047
    214715_x_at ZNF160 996 0.198532
    215204_at SENP6 211 0.169986
    215208_x_at RPL35A 999 0.228485
    215385_at FTO 164 0.187634
    215600_x_at FBXW12 960 0.17329
    215604_x_at UBE2D2 998 0.224878
    215609_at STARD7 940 0.191953
    215628_x_at PPP2CA 829 0.16391
    215800_at DUOX1 412 0.160036
    215907_at BACH2 987 0.178338
    215978_x_at LOC152719 645 0.163399
    216834_at 633 −0.25508
    216858_x_at 997 0.232969
    217446_x_at 942 0.182612
    217653_x_at 976 0.270552
    217679_x_at 987 0.265918
    217715_x_at ZNF354A 995 0.223881
    217826_s_at UBE2J1 812 −0.23003
    218155_x_at FLJ10534 998 0.186425
    218976_at DNAJC12 486 −0.22866
    219392_x_at FLJ11029 867 0.169113
    219678_x_at DCLRE1C 877 0.169975
    220199_s_at FLJ12806 378 −0.20713
    220389_at FLJ23514 102 0.239341
    220720_x_at FLJ14346 989 0.17976
    221191_at DKFZP434A0131 616 0.185412
    221310_at FGF14 511 −0.19965
    221765_at 319 −0.25025
    222027_at NUCKS 547 0.171954
    222104_x_at GTF2H3 981 0.186025
    222358_x_at 564 0.194048
    202113_s_at SNX2 841 −0.20503
    207133_x_at ALPK1 781 0.155812
    218989_x_at SLC30A5 765 −0.198
    200751_s_at HNRPC 759 −0.19243
    220796_x_at SLC35E1 691 0.158199
    209362_at SURB7 690 −0.18777
    216248_s_at NR4A2 678 −0.19796
    203138_at HAT1 669 −0.18115
    221428_s_at TBL1XR1 665 −0.19331
    218172_s_at DERL1 665 −0.16341
    215861_at FLJ14031 651 0.156927
    209288_s_at CDC42EP3 638 −0.20146
    214001_x_at RPS10 634 0.151006
    209116_x_at HBB 626 −0.12237
    215595_x_at GCNT2 625 0.136319
    208891_at DUSP6 617 −0.17282
    215067_x_at PRDX2 616 0.160582
    202918_s_at PREI3 614 −0.17003
    211985_s_at CALM1 614 −0.20103
    212019_at RSL1D1 601 0.152717
    216187_x_at KNS2 591 0.14297
    215066_at PTPRF 587 0.143323
    212192_at KCTD12 581 −0.17535
    217586_x_at 577 0.147487
    203582_s_at RAB4A 567 −0.18289
    220113_x_at POLR1B 563 0.15764
    217232_x_at HBB 561 −0.11398
    201041_s_at DUSP1 560 −0.18661
    211450_s_at MSH6 544 −0.15597
    202648_at RPS19 533 0.150087
    202936_s_at SOX9 533 −0.17714
    204426_at RNP24 526 −0.18959
    206392_s_at RARRES1 517 −0.18328
    208750_s_at ARF1 515 −0.19797
    202089_s_at SLC39A6 512 −0.19904
    211297_s_at CDK7 510 −0.15992
    215373_x_at FLJ12151 509 0.146742
    213679_at FLJ13946 492 −0.10963
    201694_s_at EGR1 490 −0.19478
    209142_s_at UBE2G1 487 −0.18055
    217706_at LOC220074 483 0.11787
    212991_at FBXO9 476 0.148288
    201289_at CYR61 465 −0.19925
    206548_at FLJ23556 465 0.141583
    202593_s_at MIR16 462 −0.17042
    202932_at YES1 461 −0.17637
    220575_at FLJ11800 461 0.116435
    217713_x_at DKFZP566N034 452 0.145994
    211953_s_at RANBP5 447 −0.17838
    203827_at WIPI49 447 −0.17767
    221997_s_at MRPL52 444 0.132649
    217662_x_at BCAP29 434 0.116886
    218519_at SLC35A5 428 −0.15495
    214833_at KIAA0792 428 0.132943
    201339_s_at SCP2 426 −0.18605
    203799_at CD302 422 −0.16798
    211090_s_at PRPF4B 421 −0.1838
    220071_x_at C15orf25 420 0.138308
    203946_s_at ARG2 415 −0.14964
    213544_at ING1L 415 0.137052
    209908_s_at 414 0.131346
    201688_s_at TPD52 410 −0.18965
    215587_x_at BTBD14B 410 0.139952
    201699_at PSMC6 409 −0.13784
    214902_x_at FLJ42393 409 0.140198
    214041_x_at RPL37A 402 0.106746
    203987_at FZD6 392 −0.19252
    211696_x_at HBB 392 −0.09508
    218025_s_at PECI 389 −0.18002
    215852_x_at KIAA0889 382 0.12243
    209458_x_at HBA1 /// 380 −0.09796
    HBA2
    219410_at TMEM45A 379 −0.22387
    215375_x_at 379 0.148377
    206302_s_at NUDT4 376 −0.18873
    208783_s_at MCP 372 −0.15076
    211374_x_at 364 0.131101
    220352_x_at MGC4278 364 0.152722
    216609_at TXN 363 0.15162
    201942_s_at CPD 363 −0.1889
    202672_s_at ATF3 361 −0.12935
    204959_at MNDA 359 −0.21676
    211996_s_at KIAA0220 358 0.144358
    222035_s_at PAPOLA 353 −0.14487
    208808_s_at HMGB2 349 −0.15222
    203711_s_at HIBCH 347 −0.13214
    215179_x_at PGF 347 0.146279
    213562_s_at SQLE 345 −0.14669
    203765_at GCA 340 −0.1798
    214414_x_at HBA2 336 −0.08492
    217497_at ECGF1 336 0.123255
    220924_s_at SLC38A2 333 −0.17315
    218139_s_at C14orf108 332 −0.15021
    201096_s_at ARF4 330 −0.18887
    220361_at FLJ12476 325 −0.15452
    202169_s_at AASDHPPT 323 −0.15787
    202527_s_at SMAD4 322 −0.18399
    202166_s_at PPP1R2 320 −0.16402
    204634_at NEK4 319 −0.15511
    215504_x_at 319 0.145981
    202388_at RGS2 315 −0.14894
    215553_x_at WDR45 315 0.137586
    200598_s_at TRA1 314 −0.19349
    202435_s_at CYP1B1 313 0.056937
    216206_x_at MAP2K7 313 0.10383
    212582_at OSBPL8 313 −0.17843
    216509_x_at MLLT10 312 0.123961
    200908_s_at RPLP2 308 0.136645
    215108_x_at TNRC9 306 −0.1439
    213872_at C6orf62 302 −0.19548
    214395_x_at EEF1D 302 0.128234
    222156_x_at CCPG1 301 −0.14725
    201426_s_at VIM 301 −0.17461
    221972_s_at Cab45 299 −0.1511
    219957_at 298 0.130796
    215123_at 295 0.125434
    212515_s_at DDX3X 295 −0.14634
    203357_s_at CAPN7 295 −0.17109
    211711_s_at PTEN 295 −0.12636
    206165_s_at CLCA2 293 −0.17699
    213959_s_at KIAA1005 289 −0.16592
    215083_at PSPC1 289 0.147348
    219630_at PDZK1IP1 287 −0.15086
    204018_x_at HBA1 /// 286 −0.08689
    HBA2
    208671_at TDE2 286 −0.17839
    203427_at ASF1A 286 −0.14737
    215281_x_at POGZ 286 0.142825
    205749_at CYP1A1 285 0.107118
    212585_at OSBPL8 282 −0.13924
    211745_x_at HBA1 /// 281 −0.08437
    HBA2
    208078_s_at SNF1LK 278 −0.14395
    218041_x_at SLC38A2 276 −0.17003
    212588_at PTPRC 270 −0.1725
    212397_at RDX 270 −0.15613
    208268_at ADAM28 269 0.114996
    207194_s_at ICAM4 269 0.127304
    222252_x_at 269 0.132241
    217414_x_at HBA2 266 −0.08974
    207078_at MED6 261 0.1232
    215268_at KIAA0754 261 0.13669
    221387_at GPR147 261 0.128737
    201337_s_at VAMP3 259 −0.17284
    220218_at C9orf68 259 0.125851
    222356_at TBL1Y 259 0.126765
    208579_x_at H2BFS 258 −0.16608
    219161_s_at CKLF 257 −0.12288
    202917_s_at S100A8 256 −0.19869
    204455_at DST 255 −0.13072
    211672_s_at ARPC4 254 −0.17791
    201132_at HNRPH2 254 −0.12817
    218313_s_at GALNT7 253 −0.179
    218930_s_at FLJ11273 251 −0.15878
    219166_at C14orf104 250 −0.14237
    212805_at KIAA0367 248 −0.16649
    201551_s_at LAMP1 247 −0.18035
    202599_s_at NRIP1 247 −0.16226
    203403_s_at RNF6 247 −0.14976
    214261_s_at ADH6 242 −0.1414
    202033_s_at RB1CC1 240 −0.18105
    203896_s_at PLCB4 237 −0.20318
    209703_x_at DKFZP586A0522 234 0.140153
    211699_x_at HBA1 /// 232 −0.08369
    HBA2
    210764_s_at CYR61 231 −0.13139
    206391_at RARRES1 230 −0.16931
    201312_s_at SH3BGRL 225 −0.12265
    200798_x_at MCL1 221 −0.13113
    214912_at 221 0.116262
    204621_s_at NR4A2 217 −0.10896
    217761_at MTCBP-1 217 −0.17558
    205830_at CLGN 216 −0.14737
    218438_s_at MED28 214 −0.14649
    207475_at FABP2 214 0.097003
    208621_s_at VIL2 213 −0.19678
    202436_s_at CYP1B1 212 0.042216
    202539_s_at HMGCR 210 −0.15429
    210830_s_at PON2 209 −0.17184
    211906_s_at SERPINB4 207 −0.14728
    202241_at TRIB1 207 −0.10706
    203594_at RTCD1 207 −0.13823
    215863_at TFR2 207 0.095157
    221992_at LOC283970 206 0.126744
    221872_at RARRES1 205 −0.11496
    219564_at KCNJ16 205 −0.13908
    201329_s_at ETS2 205 −0.14994
    214188_at HIS1 203 0.1257
    201667_at GJA1 199 −0.13848
    201464_x_at JUN 199 −0.09858
    215409_at LOC254531 197 0.094182
    202583_s_at RANBP9 197 −0.13902
    215594_at 197 0.101007
    214326_x_at JUND 196 −0.1702
    217140_s_at VDAC1 196 −0.14682
    215599_at SMA4 195 0.133438
    209896_s_at PTPN11 195 −0.16258
    204846_at CP 195 −0.14378
    222303_at 193 −0.10841
    218218_at DIP13B 193 −0.12136
    211015_s_at HSPA4 192 −0.13489
    208666_s_at ST13 191 −0.13361
    203191_at ABCB6 190 0.096808
    202731_at PDCD4 190 −0.1545
    209027_s_at ABI1 190 −0.15472
    205979_at SCGB2A1 189 −0.15091
    216351_x_at DAZ1 /// 189 0.106368
    DAZ3 ///
    DAZ2 ///
    DAZ4
    220240_s_at C13orf11 188 −0.16959
    204482_at CLDN5 187 0.094134
    217234_s_at VIL2 186 −0.16035
    214350_at SNTB2 186 0.095723
    201693_s_at EGR1 184 −0.10732
    212328_at KIAA1102 182 −0.12113
    220168_at CASC1 181 −0.1105
    203628_at IGF1R 180 0.067575
    204622_x_at NR4A2 180 −0.11482
    213246_at C14orf109 180 −0.16143
    218728_s_at HSPC163 180 −0.13248
    214753_at PFAAP5 179 0.130184
    206336_at CXCL6 178 −0.05634
    201445_at CNN3 178 −0.12375
    209886_s_at SMAD6 176 0.079296
    213376_at ZBTB1 176 −0.17777
    213887_s_at POLR2E 175 −0.16392
    204783_at MLF1 174 −0.13409
    218824_at FLJ10781 173 0.1394
    212417_at SCAMP1 173 −0.17052
    202437_s_at CYP1B1 171 0.033438
    217528_at CLCA2 169 −0.14179
    218170_at ISOC1 169 −0.14064
    206278_at PTAFR 167 0.087096
    201939_at PLK2 167 −0.11049
    200907_s_at KIAA0992 166 −0.18323
    207480_s_at MEIS2 166 −0.15232
    201417_at SOX4 162 −0.09617
    213826_s_at 160 0.097313
    214953_s_at APP 159 −0.1645
    204897_at PTGER4 159 −0.08152
    201711_x_at RANBP2 158 −0.17192
    202457_s_at PPP3CA 158 −0.18821
    206683_at ZNF165 158 −0.08848
    214581_x_at TNFRSF21 156 −0.14624
    203392_s_at CTBP1 155 −0.16161
    212720_at PAPOLA 155 −0.14809
    207758_at PPM1F 155 0.090007
    220995_at STXBP6 155 0.106749
    213831_at HLA-DQA1 154 0.193368
    212044_s_at 153 0.098889
    202434_s_at CYP1B1 153 0.049744
    206166_s_at CLCA2 153 −0.1343
    218343_s_at GTF3C3 153 −0.13066
    202557_at STCH 152 −0.14894
    201133_s_at PJA2 152 −0.18481
    213605_s_at MGC22265 151 0.130895
    210947_s_at MSH3 151 −0.12595
    208310_s_at C7orf28A /// 151 −0.15523
    C7orf28B
    209307_at 150 −0.1667
    215387_x_at GPC6 148 0.114691
    213705_at MAT2A 147 0.104855
    213979_s_at 146 0.121562
    212731_at LOC157567 146 −0.1214
    210117_at SPAG1 146 −0.11236
    200641_s_at YWHAZ 145 −0.14071
    210701_at CFDP1 145 0.151664
    217152_at NCOR1 145 0.130891
    204224_s_at GCH1 144 −0.14574
    202028_s_at 144 0.094276
    201735_s_at CLCN3 144 −0.1434
    208447_s_at PRPS1 143 −0.14933
    220926_s_at C1orf22 142 −0.17477
    211505_s_at STAU 142 −0.11618
    221684_s_at NYX 142 0.102298
    206906_at ICAM5 141 0.076813
    213228_at PDE8B 140 −0.13728
    217202_s_at GLUL 139 −0.15489
    211713_x_at KIAA0101 138 0.108672
    215012_at ZNF451 138 0.13269
    200806_s_at HSPD1 137 −0.14811
    201466_s_at JUN 135 −0.0667
    211564_s_at PDLIM4 134 −0.12756
    207850_at CXCL3 133 −0.17973
    221841_s_at KLF4 133 −0.1415
    200605_s_at PRKAR1A 132 −0.15642
    221198_at SCT 132 0.08221
    201772_at AZIN1 131 −0.16639
    205009_at TFF1 130 −0.17578
    205542_at STEAP1 129 −0.08498
    218195_at C6orf211 129 −0.14497
    213642_at 128 0.079657
    212891_s_at GADD45GIP1 128 −0.09272
    202798_at SEC24B 127 −0.12621
    222207_x_at 127 0.10783
    202638_s_at ICAM1 126 0.070364
    200730_s_at PTP4A1 126 −0.15289
    219355_at FLJ10178 126 −0.13407
    220266_s_at KLF4 126 −0.15324
    201259_s_at SYPL 124 −0.16643
    209649_at STAM2 124 −0.1696
    220094_s_at C6orf79 123 −0.12214
    221751_at PANK3 123 −0.1723
    200008_s_at GDI2 123 −0.15852
    205078_at PIGF 121 −0.13747
    218842_at FLJ21908 121 −0.08903
    202536_at CHMP2B 121 −0.14745
    220184_at NANOG 119 0.098142
    201117_s_at CPE 118 −0.20025
    219787_s_at ECT2 117 −0.14278
    206628_at SLC5A1 117 −0.12838
    204007_at FCGR3B 116 −0.15337
    209446_s_at 116 0.100508
    211612_s_at IL13RA1 115 −0.17266
    220992_s_at C1orf25 115 −0.11026
    221899_at PFAAP5 115 0.11698
    221719_s_at LZTS1 115 0.093494
    201473_at JUNB 114 −0.10249
    221193_s_at ZCCHC10 112 −0.08003
    215659_at GSDML 112 0.118288
    205157_s_at KRT17 111 −0.14232
    201001_s_at UBE2V1 /// 111 −0.16786
    Kua-UEV
    216789_at 111 0.105386
    205506_at VIL1 111 0.097452
    204875_s_at GMDS 110 −0.12995
    207191_s_at ISLR 110 0.100627
    202779_s_at UBE2S 109 −0.11364
    210370_s_at LY9 109 0.096323
    202842_s_at DNAJB9 108 −0.15326
    201082_s_at DCTN1 107 −0.10104
    215588_x_at RIOK3 107 0.135837
    211076_x_at DRPLA 107 0.102743
    210230_at 106 0.115001
    206544_x_at SMARCA2 106 −0.12099
    208852_s_at CANX 105 −0.14776
    215405_at MYO1E 105 0.086393
    208653_s_at CD164 104 −0.09185
    206355_at GNAL 103 0.1027
    210793_s_at NUP98 103 −0.13244
    215070_x_at RABGAP1 103 0.125029
    203007_x_at LYPLA1 102 −0.17961
    203841_x_at MAPRE3 102 −0.13389
    206759_at FCER2 102 0.081733
    202232_s_at GA17 102 −0.11373
    215892_at 102 0.13866
    214359_s_at HSPCB 101 −0.12276
    215810_x_at DST 101 0.098963
    208937_s_at ID1 100 −0.06552
    213664_at SLC1A1 100 −0.12654
    219338_s_at FLJ20156 100 −0.10332
    206595_at CST6 99 −0.10059
    207300_s_at F7 99 0.082445
    213792_s_at INSR 98 0.137962
    209674_at CRY1 98 −0.13818
    40665_at FMO3 97 −0.05976
    217975_at WBP5 97 −0.12698
    210296_s_at PXMP3 97 −0.13537
    215483_at AKAP9 95 0.125966
    212633_at KIAA0776 95 −0.16778
    206164_at CLCA2 94 −0.13117
    216813_at 94 0.089023
    208925_at C3orf4 94 −0.1721
    219469_at DNCH2 94 −0.12003
    206016_at CXorf37 93 −0.11569
    216745_x_at LRCH1 93 0.117149
    212999_x_at HLA-DQB1 92 0.110258
    216859_x_at 92 0.116351
    201636_at 92 −0.13501
    204272_at LGALS4 92 0.110391
    215454_x_at SFTPC 91 0.064918
    215972_at 91 0.097654
    220593_s_at FLJ20753 91 0.095702
    222009_at CGI-14 91 0.070949
    207115_x_at MBTD1 91 0.107883
    216922_x_at DAZ1 /// 91 0.086888
    DAZ3 ///
    DAZ2 ///
    DAZ4
    217626_at AKR1C1 /// 90 0.036545
    AKR1C2
    211429_s_at SERPINA1 90 −0.11406
    209662_at CETN3 90 −0.10879
    201629_s_at ACP1 90 −0.14441
    201236_s_at BTG2 89 −0.09435
    217137_x_at 89 0.070954
    212476_at CENTB2 89 −0.1077
    218545_at FLJ11088 89 −0.12452
    208857_s_at PCMT1 89 −0.14704
    221931_s_at SEH1L 88 −0.11491
    215046_at FLJ23861 88 −0.14667
    220222_at PRO1905 88 0.081524
    209737_at AIP1 87 −0.07696
    203949_at MPO 87 0.113273
    219290_x_at DAPP1 87 0.111366
    205116_at LAMA2 86 0.05845
    222316_at VDP 86 0.091505
    203574_at NFIL3 86 −0.14335
    207820_at ADH1A 86 0.104444
    203751_x_at JUND 85 −0.14118
    202930_s_at SUCLA2 85 −0.14884
    215404_x_at FGFR1 85 0.119684
    216266_s_at ARFGEF1 85 −0.12432
    212806_at KIAA0367 85 −0.13259
    219253_at 83 −0.14094
    214605_x_at GPR1 83 0.114443
    205403_at IL1R2 82 −0.19721
    222282_at PAPD4 82 0.128004
    214129_at PDE4DIP 82 −0.13913
    209259_s_at CSPG6 82 −0.12618
    216900_s_at CHRNA4 82 0.105518
    221943_x_at RPL38 80 0.086719
    215386_at AUTS2 80 0.129921
    201990_s_at CREBL2 80 −0.13645
    220145_at FLJ21159 79 −0.16097
    221173_at USH1C 79 0.109348
    214900_at ZKSCAN1 79 0.075517
    203290_at HLA-DQA1 78 −0.20756
    215382_x_at TPSAB1 78 −0.09041
    201631_s_at IER3 78 −0.12038
    212188_at KCTD12 77 −0.14672
    220428_at CD207 77 0.101238
    215349_at 77 0.10172
    213928_s_at HRB 77 0.092136
    221228_s_at 77 0.0859
    202069_s_at IDH3A 76 −0.14747
    208554_at POU4F3 76 0.107529
    209504_s_at PLEKHB1 76 −0.13125
    212989_at TMEM23 75 −0.11012
    216197_at ATF7IP 75 0.115016
    204748_at PTGS2 74 −0.15194
    205221_at HGD 74 0.096171
    214705_at INADL 74 0.102919
    213939_s_at RIPX 74 0.091175
    203691_at PI3 73 −0.14375
    220532_s_at LR8 73 −0.11682
    209829_at C6orf32 73 −0.08982
    206515_at CYP4F3 72 0.104171
    218541_s_at C8orf4 72 −0.09551
    210732_s_at LGALS8 72 −0.13683
    202643_s_at TNFAIP3 72 −0.16699
    218963_s_at KRT23 72 −0.10915
    213304_at KIAA0423 72 −0.12256
    202768_at FOSB 71 −0.06289
    205623_at ALDH3A1 71 0.045457
    206488_s_at CD36 71 −0.15899
    204319_s_at RGS10 71 −0.10107
    217811_at SELT 71 −0.16162
    202746_at ITM2A 70 −0.06424
    221127_s_at RIG 70 0.110593
    209821_at C9orf26 70 −0.07383
    220957_at CTAGE1 70 0.092986
    215577_at UBE2E1 70 0.10305
    214731_at DKFZp547A023 70 0.102821
    210512_s_at VEGF 69 −0.11804
    205267_at POU2AF1 69 0.101353
    216202_s_at SPTLC2 69 −0.11908
    220477_s_at C20orf30 69 −0.16221
    205863_at S100A12 68 −0.10353
    215780_s_at SET /// 68 −0.10381
    LOC389168
    218197_s_at OXR1 68 −0.14424
    203077_s_at SMAD2 68 −0.11242
    222339_x_at 68 0.121585
    200698_at KDELR2 68 −0.15907
    210540_s_at B4GALT4 67 −0.13556
    217725_x_at PAI-RBP1 67 −0.14956
    217082_at 67 0.086098
  • TABLE 7
    Group of 20 genes useful in prognosis and/or diagnosis of lung cancer.
    Number of runs
    Affymetrix probe the gene is
    ID No. that can be indicated in Signal to noise in a cancer sample.
    used to identify cancer samples Negative values indicate increase of
    the gene/nucleic as differentially expression in lung cancer, positive values
    acid sequence in expressed out of indicate decrease of expression in lung
    the next column Gene symbol 1000 test runs cancer.
    207953_at AD7C-NTP 1000 0.218433
    215208_x_at RPL35A 999 0.228485
    215604_x_at UBE2D2 998 0.224878
    218155_x_at FLJ10534 998 0.186425
    216858_x_at 997 0.232969
    208137_x_at 996 0.191938
    214715_x_at ZNF160 996 0.198532
    217715_x_at ZNF354A 995 0.223881
    220720_x_at FLJ14346 989 0.17976
    215907_at BACH2 987 0.178338
    217679_x_at 987 0.265918
    206169_x_at RoXaN 984 0.259637
    208246_x_at TK2 982 0.179058
    222104_x_at GTF2H3 981 0.186025
    206056_x_at SPN 976 0.196398
    217653_x_at 976 0.270552
    210679_x_at 970 0.181718
    207730_x_at HDGF2 969 0.169108
    214594_x_at ATP8B1 962 0.284039
  • One can use the above tables to correlate or compare the expression of the transcript to the expression of the gene product. Increased expression of the transcript as shown in the table corresponds to increased expression of the gene product. Similarly, decreased expression of the transcript as shown in the table corresponds to decreased expression of the gene product
  • The analysis of the gene expression of one or more genes and/or transcripts of the groups or their subgroups of the present invention can be performed using any gene expression method known to one skilled in the art. Such methods include, but are not limited to expression analysis using nucleic acid chips (e.g. Affymetrix chips) and quantitative RT-PCR based methods using, for example real-time detection of the transcripts. Analysis of transcript levels according to the present invention can be made using total or messenger RNA or proteins encoded by the genes identified in the diagnostic gene groups of the present invention as a starting material. In the preferred embodiment the analysis is an immunohistochemical analysis with an antibody directed against proteins comprising at least about 10-20, 20-30, preferably at least 36,_at least 36-50, 50, about 50-60, 60-70, 70-80, 80-90, 96, 100-180, 180-200, 200-250, 250-300, 300-350, 350-400, 400-450, 450-500, 500-535 proteins encoded by the genes and/or transcripts as shown in Tables 1-7.
  • The methods of analyzing transcript levels of the gene groups in an individual include Northern-blot hybridization, ribonuclease protection assay, and reverse transcriptase polymerase chain reaction (RT-PCR) based methods. The different RT-PCR based techniques are the most suitable quantification method for diagnostic purposes of the present invention, because they are very sensitive and thus require only a small sample size which is desirable for a diagnostic test. A number of quantitative RT-PCR based methods have been described and are useful in measuring the amount of transcripts according to the present invention. These methods include RNA quantification using PCR and complementary DNA (cDNA) arrays (Shalon et al., Genome Research 6(7):639-45, 1996; Bernard et al., Nucleic Acids Research 24(8):1435-42, 1996), real competitive PCR using a MALDI-TOF Mass spectrometry based approach (Ding et al, PNAS, 100: 3059-64, 2003), solid-phase mini-sequencing technique, which is based upon a primer extension reaction (U.S. Pat. No. 6,013,431, Suomalainen et al. Mol. Biotechnol. June; 15(2):123-31, 2000), ion-pair high-performance liquid chromatography (Doris et al. J. Chromatogr. A May 8;806(1):47-60, 1998), and 5′ nuclease assay or real-time RT-PCR (Holland et al. Proc Natl Acad Sci USA 88: 7276-7280, 1991).
  • Methods using RT-PCR and internal standards differing by length or restriction endonuclease site from the desired target sequence allowing comparison of the standard with the target using gel electrophoretic separation methods followed by densitometric quantification of the target have also been developed and can be used to detect the amount of the transcripts according to the present invention (see, e.g., U.S. Pat. Nos. 5,876,978; 5,643,765; and 5,639,606.
  • The samples are preferably obtained from bronchial airways using, for example, endoscopic cytobrush in connection with a fiber optic bronchoscopy. In one embodiment, the cells are obtained from the individual's mouth buccal cells, using, for example, a scraping of the buccal mucosa.
  • In one preferred embodiment, the invention provides a prognostic and/or diagnostic immunohistochemical approach, such as a dip-stick analysis, to determine risk of developing lung disease. Antibodies against proteins, or antigenic epitopes thereof, that are encoded by the group of genes of the present invention, are either commercially available or can be produced using methods well know to one skilled in the art.
  • Some aspects of the present invention also provide prognostic, diagnostic, and therapeutic tools for the disorders of lung, particularly, lung cancer. The invention is based on the identification of a “field defect” phenomenon and specific expression patterns related to airway epithelial cell exposure to pollutants, such as cigarette smoke. The airway expression patterns of the present invention can be analyzed using nucleic acids and/or proteins from a biological sample of the airways.
  • For example, lung cancer involves histopathological and molecular progression from normal to premalignant to cancer. Gene expression arrays of lung tumors have been used to characterize expression profiles of lung cancers, and to show the progression of molecular changes from non-malignant lung tissue to lung cancer. However, for the screening and early diagnostic purpose, it is not practicable to obtain samples from the lungs. Therefore, the present invention provides for the first time, a method of obtaining cells from other parts of the airways to identify the epithelial gene expression pattern in an individual.
  • The ability to determine which individuals have molecular changes in their airway epithelial cells and how these changes relate to a lung disorder, such as premalignant and malignant changes is a significant improvement for determining risk and for diagnosing a lung disorder such as cancer at a stage when treatment can be more effective, thus reducing the mortality and morbidity rates of lung cancer. The ease with which airway epithelial cells can be obtained, such as bronchoscopy and buccal mucosal scrapings, shows that this approach has wide clinical applicability and is a useful tool in a standard clinical screening for the large number of subjects at risk for developing disorders of the lung.
  • Lung disorders which may be diagnosed or treated by methods described herein include, but are not limited to, asthma, chronic bronchitis, emphysema, bronchietasis, primary pulmonary hypertension and acute respiratory distress syndrome. The methods described herein may also be used to diagnose or treat lung disorders that involve the immune system including, hypersensitivity pneumonitis, eosinophilic pneumonias, and persistent fungal infections, pulmonary fibrosis, systemic sclerosis, ideopathic pulmonary hemosiderosis, pulmonary alveolar proteinosis, cancers of the lung such as adenocarcinoma, squamous cell carcinoma, small cell and large cell carcinomas, and benign neoplasms of the lung including bronchial adenomas and hamartomas.
  • The biological samples useful according to the present invention include, but are not limited to tissue samples, cell samples, and excretion samples, such as sputum or saliva, of the airways. The samples useful for the analysis methods according to the present invention can be taken from the mouth, the bronchial airways, and the lungs.
  • In one embodiment, the invention provides an “airway transcriptome” the expression pattern of which is useful in prognostic, diagnostic and therapeutic applications as described herein. The airway transcriptome of the present invention comprises 85 genes the expression of which differs significantly between healthy smokers and healthy non-smokers. The airway transcriptome according to the present invention comprises 85 genes, corresponding to 97 probesets, as a number of genes are represented by more than one probeset on the affymetrix array, identified from the about 7100 probesets the expression of which was statistically analyzed using epithelial cell RNA samples from smokers and non-smokers. Therefore, the invention also provides proteins that are encoded by the 85 genes. The 85 identified airway transcriptome genes are listed on the following
  • TABLE 8
    1. HLF hepatic leukemia factor (OMIM#142385)
    2. CYFIP2 CYTOPLASMIC FMRP-INTERACTING PROTEIN 2 (OMIM#606323)
    3. MGLL monoglyceride lipase (GenBank gi: 47117287)
    4. HSPA2 HEAT-SHOCK 70-KD PROTEIN 2 (OMIM#140560)
    5. DKFZP586B2420 GeneCards ™ database (Weitzman Institute of Science, Rehovot,
    Israel) at http://www6.unito.it/cgi-bin/cards/carddisp?DKFZPS86B2420
    6. SLIT1 SLIT, DROSOPHILA, HOMOLOG OF, 1 (OMIM#603742)
    7. SLIT2 SLIT, DROSOPHILA, HOMOLOG OF, 2 (OMIM#603746)
    8. C14orf132 hypothetical protein (GeneCards ™ database Id No. GC14P094495 at
    http://bioinfo.enic.es/cgi-bin/db/genecards/carddisp?C14orf132
    9. TU3A DOWNREGULATED IN RENAL CELL CARCINOMA 1 (OMIM#608295)
    10. MMP10 MATRIX METTALLOPROTEIN 10 (OMIM#185260)
    11. CCND2 CYCLIN D2: CCND2 (OMIM#123833)
    12. CX3CL1 CHEMOKINE, CX3C MOTIF, LIGAND 1 (OMIM#601880)
    13. MGC5560 MuDB database at http://mutdb.org/AnnoSNP/data/48/S1/DE/AC.nt.html
    14. MT1F METALLOTHIONEN 1F (OMIM#156352)
    15. RNAHP Homo sapiens RNA helicase-related protein (Unigene/Hs. 8765)
    16. MT1X METALLOTHIONEIN 1X (OMIM#156359)
    17. MT1L METALLOTHIONEIN 1L (OMIM#156358)
    18. MT1G METALLOTHIONEIN 1G (OMIM#156353)
    19. PEC1 GenBank ID No. AI541256
    20. TNFSF13 TUMOR NECROSIS FACTOR LIGAND SUPERFAMILY, MEMBER
    13 (OMIM#604472)
    21. GMDS GDP-MANNOSE 4,6-DEHYDRATASE (OMIM#602884)
    22. ZNF232 ZINC FINGER PROTEIN 2 (OMIM#194500)
    23. GALNT12 UDP-N-ACETYL-ALPHA-D-GALACTOSAMINE:POLYPEPTIDE N-
    ACETYLGALACTOSAMINYLTRANSFERASE 13 (OMIM#608369)
    24. AP2B1 ADAPTOR-RELATED PROTEIN COMPLEX 2, BETA-1 SUBUNIT
    (OMIM#601925)
    25. HN1 HUMANIN (OMIM#606120)
    26. ABCC1 ATP-BINDING CASSETTE, SUBFAMILY C, MEMBER 1
    (OMIM#158343)
    27. RAB11A RAS FAMILY, MEMBER RAB11A (OMIM#605570)
    28. MSMB MICROSEMINOPROTEIN, BETA (OMIM#157145)
    29. MAFG V-MAF AVIAN MUSCULOAPONEUROTIC FIBROSARCOMA ONCOGENE
    FAMILY, PROTEIN G (OMIM#602020)
    30. ABHD2 GeneCards ™ ID No. GC15P087361
    31. ANXA3 ANNEXIN A3 (OMIM#106490)
    32. VMD2 VITELLIFORM MACULAR DYSTROPHY GENE 2 (OMIM#607854)
    33. FTH1 FERRITIN HEAVY CHAIN 1 (OMIM#134770)
    34. UGT1A3 UDP-GLYCOSYLTRANSFERASE1 FAMILY, POLPEPTIDE A3
    (OMIM#606428)
    35. TSPAN-1 tetraspan 1 (GeneID: 10103 at Entrez Gene, NCBI Database)
    36. CTGF CONNECTIVE TISSUE GROWTH FACTOR (OMIM#121009)
    37. PDG phosphoglycerate dehydrogenase (GeneID: 26227 at Entrez Gene, NCBI
    Database)
    38. HTATIP2 HIV-1 TAT-INTERACTING PROTEIN 2, 30-KD (OMIM#605628)
    39. CYP4F11 CYTOCHROME P450, SUBFAMILY IVF, POLYPEPTIDE 11
    40. GCLM GLUTAMATE-CYSTEIN LIGASE, MODIFIER SUBUNIT (OMIM#601176)
    41. ADH7 ALCOHOL DEHYDROGENASE 7 (OMIM#600086)
    42. GCLC GLUTAMATE-CYSTEINE LIGASE, CATALYTIC SUBUNIT
    (OMIM#606857)
    43. UPK1B UROPLAKIN 1B (OMIM#602380)
    44. PLEKHB2 pleckstrin homology domain containing, family B (evectins) member 2,
    GENEATLAS GENE DATABASE AT http://www.dsi.univ-paris5.fr/genatlas/fiche1.php?symbol=PLEKHB2
    45. TCN1 TRANSCOBALAMIN 1 (OMIM#189905)
    46. TRIM16 TRIPARTITE MOTIF-CONTAINING PROTEIN 16
    47. UGT1A9 UDP-GLYCOSYLTRANSFERASE 1 FAMILY, POLYPEPTIDE A9
    (OMIM#606434)
    48. UGT1A1 UDP-GLYCOSYLTRANSFERASE 1 FAMILY, POLYPEPTIDE A1
    (OMIM#191740)
    49. UGT1A6 UDP-GLYCOSYLTRANSFERASE 1 FAMILY, POLYPEPTIDE A6
    (OMIM#606431)
    50. NQ01 NAD(P)H dehydrogenase, quinone 1 (OMIM#125860)
    51. TXNRD1 THIOREDOXIN REDUCTASE 1 (OMIM#601112)
    52. PRDX1 PEROXIREDOXIN 1 (OMIM#176763)
    53. ME1 MALIC ENZYME 1 (OMIM#154250)
    54. PIR PIRIN (OMIM# 603329)
    55. TALDO1 TRANSALDOLASE 1 (OMIM#602063)
    56. GPX2 GLUTATHIONE PEROXIDASE 2 (OMIM#138319)
    57. AKR1C3 ALDO-KETO REDUCTASE FAMILY 1, MEMBER C3
    (OMIM#603966)
    58. AKR1C1 ALDO-KETO REDUCTASE FAMILY 1, MEMBER 1 (OMIM#600449)
    59. AKR1C1-pseudo ALDO-KETO REDUCTASE FAMILY 1, pseudo gene,
    GeneCards ™ No. GC10U990141
    60. AKR1C2 ALDO-KETO REDUCTASE FAMILY 1, MEMBER C2
    (OMIM#600450)
    61. ALDG3A1 ALDEHYDE DEHYDROGENASE, FAMILY 3, SUBFAMILY A,
    MEMBER 1 (OMIM#100660)
    62. CLDN10 CLAUDIN 10 (GeneCards ™ ID: GC13P093783)
    63. TXN thioredoxin (OMIM#187700)
    64. TKT TRANSKETOLASE (OMIM#606781)
    65. CYP1B1 CYTOCHROME P450, SUBFAMILY 1, POLYPEPTIDE 1
    (OMIM#601771)
    66. CBR1 CARBONYL REDUCTASE 1 (OMIM#114830)
    67. AKR1B1 ALDO-KETO REDUCTASE FAMILY 1, MEMBER B1
    (OMIM#103880)
    68. NET6 Transmembrane 4 superfamily member 13 (GenBank ID gi: 11135162)
    69. NUDT4 nudix (nucleoside diphosphate linked moiety X)-type motif 4 (Entrez
    GeneID: 378990)
    70. GALNT3 UDP-N-ACETYL-APLHA-D-GALACTOSAMINE:POLYPEPTIDE N-
    ACETYLGALACTOSAMINYLTRANSFERASE 3 (OMIM#601756)
    71. GALNT7 UDP-N-ACETYL-APLHA-D-GALACTOSAMINE:POLYPEPTIDE N-
    ACETYLGALACTOSAMINYLTRANSFERASE 7 (OMIM#605005)
    72. CEACAM6 CARCINOEMBRYONIC ANTIGEN-RELATED CELL ADHESION
    MOLECULE 6 (OMIM#163980)
    73. AP1G1 ADAPTOR-RELATED PROTEIN COMPLEX 1, GAMMA-1 SUBUNIT
    (OMIM#603533)
    74. CA12 CARBONIC ANHYDRASE XII (OMIM#603263)
    75. FLJ20151 hypothetical protein (GeneCards ™ ID: GC15MO61330)
    76. BCL2L13 apoptosis facilitator (GeneID: 23786, Entrez)
    77. SRPUL Homo sapenes subhi-repeat protein (MutDB at
    http://mutdb.org/AnnoSNP/data/DD/S0/9U/AC.nt.html)
    78. FLJ13052 Homo sapiens NAD kinase (GenBank ID gi: 20070325)
    79. GALNT6 UDP-N-ACETYL-ALPHA-D-GALACTOSAMINE:POLYPETIDE N-
    ACETYLGALACTOSAMINYLTRANSFERASE 6 (OMIM#605148)
    80. OASIS cAMP responsive element binding protein 3-like 1 (GeneBank ID gi: 21668501)
    81. MUC5B MUCIN 5, SUBTYPE B, TRACHEOBRONCHIAL (OMIM#600770)
    82. S100P S100 CALCIUM-BINDING PROTEIN P (OMIM#600614)
    83. SDR1 dehydrogenease/reductase (SDR family) member 3 (GeneID: 9249, Entrez)
    84. PLA2G10 PHOSPHOLIPASE A2, GROUP X (OMIM#603603)
    85. DPYSL3 DIHYDROPYRIMIDINASE-LIKE 3 (OMIM#601168)
  • The invention further provides a lung cancer diagnostic airway transcriptome comprising at least 208 genes that are differentially expressed between smokers with lung cancer and smokers witout lung cancer. The genes identified as being part of the diagnostic airway transcriptome are 208238_x_at-probeset; 216384x_at-probeset; 217679_x_at-probeset; 216859_x_at-probeset; 211200_s_at-probeset; PDPKI; ADAM28; ACACB; ASMTL; ACVR2B; ADAT1; ALMS1; ANK3; ANK3; OARS; AFURS1; ATP8B1; ABCC1; BTF3; BRD4; CELSR2; CALM31 CAPZB; CAPZB1 CFLAR; CTSS; CD24; CBX3; C21orf106; C6orf111; C6orf62; CHC1; DCLRE1C; EML2; EMS 1 EPHB6; EEF2; FGFR3; FLJ20288; FVT1; GGTLA4; GRP; GLUL; HDGF; Homo sapiens cDNA FLJ1 1452 fis, clone HEMBA1001435; Homo sapiens cDNA FLJ12005 fis, clone HEMBB1001565; Homo sapiens cDNA FLJ13721 fis, clone PLACE2000450; Homo sapiens cDNA FLJ14090 fis, clone MAMMA1000264; Homo sapiens cDNA FLJ14253 fis, clone OVARC1001376; Homo sapiens fetal thymus prothymosin alpha mRNA, complete eds Homo sapiens fetal thymus prothymosin alpha mRNA; Homo sapiens transcribed sequence with strong similarity to protein ref:NP_004726.1 (H. sapiens) leucine rich repeat (in FL1I) interacting protein 1; Homo sapiens transcribed sequence with weak similarity to protein ref:NP_060312.1 (H. sapiens) hypothetical protein FLJ20489; Homo sapiens transcribed sequence with weak similarity to protein ref:NP_060312.1 (H. sapiens) hypothetical protein FLJ20489; 222282_at-probeset corresponding to Homo sapiens transcribed sequences; 215032_at-probeset corresponding to Homo sapiens transcribed sequences; 81811_at-probeset corresponding to Homo sapiens transcribed sequences; OKFp4547K1 113; ET; FLJ10534; FLJ10743; FLJ13171; FLJ14639; FLJ14675; FLJ20195; FLJ20686; FLJ20700; CG005; CG005; MGC5384; IMP-2; INAOL; INHBC; KIAA0379; KIA A0676; KIAA0779; KIAA1193; KTNI; KLF5; LRRFIP1; MKRN4; MAN1C1; MVK; MUC20; MPZL1; MYO1A; MRLC2; NFATC3; OOAG; PARVA; PASK; PIK3C2B; PGF; PKP4; PRKX; PRKY; PTPRF; PTMA; PTMA; PHTF2; RAB14; ARHGEF6; RIPX; REC8L1; RIOK3; SEMA3F; SRRM21 MGC709071 SMT3H2; SLC28A3; SAT; SFRS111 SOX2; THOC2; TRIM51 USP7; USP9X; USHIC; AF020591; ZNF131; ZNF160; ZNF264; 217414_x_at-probeset;; 217232_x_at-probeset;; ATF3; ASXL2; ARF4L; APG5L; ATP6V0B; BAG1; BTG2; COMT; CTSZ; CGI-128; C14orf87; CLDN3; CYR61; CKAP1; OAF; OAF; OSIP1; OKFZP564G2022; ONAJB9; OOOST; OUSP1; DUSP6; DKC1; EGR1; EIF4EL3; EXT2; GMPPB; GSN; GUK1; HSPA8; Homo sapiens PRO2275 mRNA, complete eds; Homo sapiens transcribed sequence with strong similarity to protein ref:NP_006442.2, polyadenylate binding protein-interacting protein 1; HAX1; DKFZP434KO46; IMAGE3455200, HYOUI; IDN3; JUNB; KRT8; KIAA01O0; KIAA0102; APH-1A; LSM4; MAGED2; MRPS7; MOCS2; MNDA; NOUFA8; NNT; NFIL3; PWP1; NR4A2; NUDT4; ORMOL2; POAP2; PPIH; PBX3; P4HA2; PPP1R15A; PRGII P2RX4; SUi1; SUi1; SUi1; RABSC; ARHB; RNASE4; RNH; RNPC4; SEC23B; SERPINA1; SH3GLB1; SLC35B1; SOX9; SOX9; STCH; SDHC; TINF2; TCF8; E2-EPF; FOS; JUN; ZFP36; ZNF500; and ZDHHC4.
  • Deviation in the expression compared to control group can be increased expression or decreased expression of one or more of the 208 genes. Preferably, downregulation of expression of at least one, preferably at least 10, 15, 25, 30, 50, 60, 75, 80, 90, 100, 110, or all of the 121 genes consisting of 208238_x_at-probeset; 216384_x_at-probeset; 217679_x_at-probeset; 216859_x_at-probeset; 211200_s_at-probeset; PDPK1 ADAM28; ACACB; ASMTL; ACVR2B; ADAT1; ALMS1; ANK3; ANK3; OARS; AFURSI; ATP8B1; ABCC1; BTF3; BRD4; CELSR2; CALM31 CAPZB; CAPZB1 CFLAR; CTSS; CD24; CBX3; C21orf106; C6orf111; C6orf62; CHC1; DCLREIC; EMIL2; EMS1, EPHB6; EEF2; FGFR3; FLJ20288; FVT1; GGTLA4; GRP; GLUL HDGF; Homo sapiens cDNA FLJI 1452 fis, clone HEMBA1001435; Homo sapiens cDNA FLJ12005 fis, cloneHEMBB1001565; Homo sapiens cDNA FLJ13721 fis, clone PLACE2000450; Homo sapiens cDNA FLJ14090 fis, clone MAMMA1000264; Homo sapiens cDNA FLJ14253 fis, clone OVARC1001376; Homo sapiens fetal thymus prothymosin alpha mRNA, complete eds; Homo sapiens transcribed sequence with strong similarity to protein ref:NP_004726.1 (H. sapiens) leucine rich repeat (in FLU) interacting protein 1; Homo sapiens transcribed sequence with weak similarity to protein ref:NP_060312.1 (H. sapiens) hypothetical protein FLJ20489; Homo sapiens transcribed sequence with weak similarity to protein ref:NP_060312.1 (H. sapiens) hypothetical protein FLJ20489; 222282_at-probeset corresponding to Homo sapiens transcribed sequences; 215032_at-probeset corresponding to Homo sapiens transcribed sequences; 81811_at-probeset corresponding to Homo sapiens transcribed sequences; DKFZp547K1 113; ET; FLJ10534;FLJ10743;FLJ13171;FLJ14639;FLJ14675;FLJ20195;FLJ20686;FLJ20700; CG005; CG005; MGC5384; IMP-2; INADL; INHBC; KIAA0379; KIAA0676; K1AA0779; KIAA1193; KTN1; KLF5; LRRFIP1; MKRN4; MAN1C1; MVK; MUC20; MPZL1; MYO1A; MRLC2; NFATC3; ODAG; PARVA; PASK; PIK3C2B; PGF; PKP4; PRKX; PRKY; PTPRF; PTMA; PTMA; PHTF2; RAB14; ARHGEF6; RIPX; REC8L1; RIOK3; SEMA3F; SRRM21 MGC709071 SMT3H2; SLC28A3; SAT; SFRS111 SOX2; THOC2; TRIM51USP7; USP9X; USH1C; AF020591; ZNF131; ZNF160; and ZNF264, when compared to a control group is indicative of lung cancer.
  • Preferably increase, or up-regulation of expression of at least one, preferably at least 10, 15, 25, 30, 50, 60, 75, 80, or all of the 87 genes consisting of of217414_x_at-probeset;; 217232_x_at-probeset;;ATF3; ASXL2; ARF4L; APG5L; ATP6VOB; BAG1; BTG2; COMT; CTSZ; CGI-128; C14orf87; CLDN3; CYR61; CKAP1; DAF; DAF; DSIPI; DKFZP564G2022; DNAJB9; DDOST; DUSP1; DUSP6; DKC1; EGR1; EIF4EL3; EXT2; GMPPB; GSN; GUK1; HSPA8; Homo sapiens PRO2275 mRNA, complete eds; Homo sapiens transcribed sequence with strong similarity to protein ref:NP_006442.2, polyadenylate binding protein-interacting protein 1; HAX1; DKFZP434K046; IMAGE3455200; HYOU1; IDN3; JUNB; KRT8; KIAA0100; KIAA0102; APH-1A; LSM4; MAGED2; MRPS7; MOCS2; MNDA; NDUFA8; NNT; NFIL3; PWP1; NR4A2; NUDT4; ORMDL2; PDAP2; PPIH; PBX3; P4HA2; PPP1R15A; PRGII P2RX4; SUi1; SUi1; SUi1; RAB5C; ARHB; RNASE4; RNH; RNPC4; SEC23B; SERPINA1; SH3GLB1; SLC35B1; SOX9; SOX9; STCH; SDHC; TINF2; TCF8; E2-EPF; FOS; JUN; ZFP36; ZNF500; and ZDHHC4 as compared to a control group indicated that the individual is affected with lung cancer.
  • The probeset numbers as referred to herein and throughout the specification, refer to the Affymetrix probesets.
  • The methods to identify the airway transcriptomes can be used to identify airway transcriptomes in other animals than humans by performing the statistical comparisons as provided in the Examples below in any two animal groups, wherein one group is exposed to an airway pollutant and the other group is not exposed to such pollutant and performing the gene expression analysis of any large probeset, such as the probeset of 7119 genes used in some of the Examples. Therefore, the subject or individual as described herein and throughout the specification is not limited to human, but encompasses other mammals and animals, such as murine, bovine, swine, and other primates. This methodology can also be carried out with lung disorders to create new clusters of genes wherein change in their expression is related to specific disorders.
  • We identified a subset of three current smokers who did not upregulate expression of a number of predominantly redox/xenobiotic genes to the same degree as other smokers. One of these smokers developed lung cancer within 6 months of the analysis. In addition, there is a never smoker, who is an outlier among never smokers and expresses a subset of genes at the level of current smokers (see FIG. 24 and associated Figure legend). These outlier genes are as shown on Table 9 below.
  • TABLE 9
    GENBANK ID HUGO ID GENBANK DESCRIPTION
    NM 001353.2 AKR1C1 aldo-keto reductase family 1,
    member C1 (dihydrodiol dehydrogenase 1;
    20-alpha (3-alpha)-hydroxysteroid
    dehydrogenase)
    NM 002443.1 MSMB microseminoprotein, beta-
    AI346835 TM4SF1 transmembrane 4 superfamily member 1
    NM 006952.1 UPK1B uroplakin 1B
    AI740515 FLJ20152 hypothetical protein FLJ20152
    AC004832 SEC14L3 SEC14-like 3 (S. cerevisiae)
    NM 020685.1 HT021 HT021
    NM 007210.2 GALNT6 UDP-N-acetyl-alpha-D-
    galactosamine:polypeptide
    N-acetylgalactosaminyltransferase 6
    (GalNAc-T6)
    NM 001354 AKR1C2 aldo-keto reductase family 1, member C2
  • These divergent patterns of gene expression in a small subset of smokers represent a failure of these smokers to mount an appropriate response to cigarette exposure and indicate a linkage to increased risk for developing lung cancer. As a result, these “outlier” genes can thus serve as biomarkers for susceptibility to the carcinogenic effects of cigarette smoke and other air pollutants.
  • Therefore, in one embodiment, the invention provides a method of determining an increased risk of lung disease, such as lung cancer, in a smoker comprising taking an airway sample from the individual, analyzing the expression of at least one, preferably at least two, still more preferably at least 4, still more preferably at least 5, still more preferably at least 6, still more preferably at least 7, still more preferably at least 8, still more preferably at least 8, and still more preferably at least all 9 of the outlier genes including AKRICI; MSNIB; TM4SF1; UPKIB; FLJ20152; SEC14L3; HT021; GALNT6; and AKRIC2, wherein deviation of the expression of at least one, preferably at least two, still more preferably at least 4, still more preferably at least 5, still more preferably at least 6, still more preferably at least 7, still more preferably at least 8, still more preferably at least 8, and still more preferably at least all 9 as compared to a control group is indicative of the smoker being at increased risk of developing a lung disease, for example, lung cancer.
  • FIG. 41 shows a hierarchical clustering plot of all current smokers according to the expression of 9 genes considered to be statistical outliers among at least 3 patients by Grubb's test. These 9 genes were selected from the 361 genes found to be differentially expressed between current and never smokers at p<0.001. Darker gray=high level of expression, lighter grey=low level of expression, black=mean level of expression. It can be clearly seen that the “outlier” individuals have significantly different expression pattern of these 9 nine genes.
  • We have shown that if the cells in the airways of an individual exposed to pollutant, such as cigarette smoke, do not turn on, or increase the expression of one or more of the certain genes encoding proteins associated with detoxification, and genes encoding mucins and cell adhesion molecules, this individual is at increased risk of developing lung diseases.
  • We have also shown that if the cells in the airways of an individual exposed to pollutant, such as cigarette smoke, do not turn off, or decrease the transcription of genes encoding one or more of certain proteins associated with immune regulation and metallothioneins, the individual has an increased risk of developing lung disease.
  • We have also shown that if the cells in the airways of an individual exposed to pollutant, such as cigarette smoke, do not turn off one or more tumor suppressor genes or turn on one or more protooncogenes, the individual is at increased risk of developing lung disease.
  • The methods disclosed herein can also be used to show exposure of a non-smoker to environmental pollutants by showing increased expression in a biological sample taken from the airways of the non-smoker of genes encoding proteins associated with detoxification, and genes encoding mucins and cell adhesion molecules or decreased expression of genes encoding certain proteins associated with immune regulation and metallothioneins. If such changes are observed, an entire group of individuals at work or home environment of the exposed individual may be analyzed and if any of them does not show the indicative increases and decreases in the expression of the airway transcriptome, they may be at greater risk of developing a lung disease and susceptible for intervention. These methods can be used, for example, in a work place screening analyses, wherein the results are useful in assessing working environments, wherein the individuals may be exposed to cigarette smoke, mining fumes, drilling fumes, asbestos and/or other chemical and/or physical airway pollutants. Screening can be used to single out high risk workers from the risky environment to transfer to a less risky environment.
  • Accordingly, in one embodiment, the invention provides prognostic and diagnostic methods to screen for individuals at risk of developing diseases of the lung, such as lung cancer, comprising screening for changes in the gene expression pattern of the airway transcriptome. The method comprises obtaining a cell sample from the airways of an individual and measuring the level of expression of 1-85 gene transcripts of the airway transcriptome as provided herein. Preferably, the level of at least two, still more preferably at least 3, 4, 5, 6, 7, 8, 9, 10 transcripts, and still more preferably, the level of at least 10-15, 15-20, 20-50, or more transcripts, and still more preferably all of the 97 trasncripts in the airway transcriptome are measured, wherein difference in the expression of at least one, preferably at least two, still more preferably at least three, and still more preferably at least 4, 5, 6, 7, 8, 9, 10, 10-15, 15-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80-85 genes present in the airway transcriptome compared to a normal airway transcriptome is indicative of increased risk of a lung disease. The control being at least one, preferably a group of more than one individual exposed to the same pollutant and having a normal or healthy response to the exposure.
  • In one embodiment, difference in at least one of the detoxification related genes, mucin genes, and/or cell adhesion related genes compared to the level of these genes expressed in a control, is indicative of the individual being at an increased risk of developing diseases of the lung. The differences in expression of at least one immune system regulation and/or metallothionein regulation related genes compared to the level of these genes expressed in a control group indicates that the individual is at risk of developing diseases of the lung.
  • In one embodiment, the invention provides a prognostic method for lung diseases comprising detecting gene expression changes in at least on of the mucin genes of the airway transcriptome, wherein increase in the expression compared with control group is indicative of an increased risk of developing a lung disease. Examples of mucin genes include muc 5 subtypes A, B, and. C.
  • In one preferred embodiment, the invention provides a tool for screening for changes in the airway transcriptome during long time intervals, such as weeks, months, or even years. The airway transcriptome expression analysis is therefore performed at time intervals, preferably two or more time intervals, such as in connection with an annual physical examination, so that the changes in the airway transcriptome expression pattern can be tracked in individual basis. The screening methods of the invention are useful in following up the response of the airways to a variety of pollutants that the subject is exposed to during extended periods. Such pollutants include direct or indirect exposure to cigarette smoke or other air pollutants. The control as used herein is a healthy individual, whose responses to airway pollutants are in the normal range of a smoker as provided by, for example, the transcription patterns shown in FIG. 24.
  • Analysis of transcript levels according to the present invention can be made using total or messenger RNA or proteins encoded by the genes identified in the airway trascriptome of the present invention as a starting material. In the preferred embodiment the analysis is an immunohistochemical analysis with an antibody directed against at least one, preferably at least two, still more preferably at least 4-10 proteins encoded by the genes of the airway transcriptome
  • The methods of analyzing transcript levels of one or more of the 85 transcripts in an individual include Northern-blot hybridization, ribonuclease protection assay, and reverse transcriptase polymerase chain reaction (RT-PCR) based methods. The different RT-PCR based techniques are the most suitable quantification method for diagnostic purposes of the present invention, because they are very sensitive and thus require only a small sample size which is desirable for a diagnostic test. A number of quantitative RT-PCR based methods have been described and are useful in measuring the amount of transcripts according to the present invention. These methods include RNA quantification using PCR and complementary DNA (cDNA) arrays (Shalon et al., Genome Research 6(7):639-45, 1996; Bernard et al., Nucleic Acids Research 24(8):1435-42, 1996), solid-phase mini-sequencing technique, which is based upon a primer extension reaction (U.S. Pat. No. 6,013,431, Suomalainen et al. Mol. Biotechnol. June; 15(2):123-31, 2000), ion-pair high-performance liquid chromatography (Doris et al. J. Chromatogr. A May 8; 806(1):47-60, 1998), and 5′ nuclease assay or real-time RT-PCR (Holland et al. Proc Natl Acad Sci USA 88: 7276 7280, 1991).
  • Methods using RT-PCR and internal standards differing by length or restriction endonuclease site from the desired target sequence allowing comparison of the standard with the target using gel electrophoretic separation methods followed by densitometric quantification of the target have also been developed and can be used to detect the amount of the transcripts according to the present invention (see, e.g., U.S. Pat. Nos. 5,876,978; 5,643,765; and 5,639,606).
  • For example, the present invention provides a method for detecting risk of developing lung cancer in a subject exposed to cigarette smoke comprising measuring the level of 1-97 proteins encoded by the airway transcriptome in a biological sample of the subject. Preferably at least one, still more preferably at least two, still more preferably at least three, and still more preferably at least 4-10, or more of the proteins encoded by the airway transcriptome in a biological sample of the subject are analyzed. The method comprises binding an antibody against one or more of the proteins encoded by the airway transcriptome (the “protein”) to a solid support chosen from the group consisting of dip-stick and membrane; incubating the solid support in the presence of the sample to be analyzed under conditions where antibody-antigen complexes form; incubating the support with an anti-protein antibody conjugated to a detectable moeity which produces a signal; visually detecting said signal, wherein said signal is proportional to the amount of protein in said sample; and comparing the signal in said sample to a standard, wherein a difference in the amount of the protein in the sample compared to said standard of at feast one, preferably at least two, still more preferably at least 3-5, still more preferably at least 5-10, proteins is indicative of an increased risk of developing lung cancer. The standard levels are measured to indicate expression levels in a normal airway exposed to cigarette smoke, as exemplified in the smoker transcript pattern shown, for example on FIG. 24.
  • The assay reagents, pipettes/dropper, and test tubes may be provided in the form of a kit. Accordingly, the invention further provides a test kit for visual detection of one or more proteins encoded by the airway transcriptome, wherein detection of a level that differs from a pattern in a control individual is considered indicative of an increased risk of developing lung disease in the subject. The test kit comprises one or more solutions containing a known concentration of one or more proteins encoded by the airway transcriptome (the “protein”) to serve as a standard; a solution of a anti-protein antibody bound to an enzyme; a chrotnogen which changes color or shade by the action of the enzyme; a solid support chosen from the group consisting of dip-stick and membrane carrying on the surface thereof an antibody to the protein.
  • The inventions disclosed herein contemplate either one dipstick capable of detecting all the diagnostically important gene products or alternatively, a series of dipsticks capable of detecting the amount proteins of a smaller sub-group of diagnostic proteins of the present invention.
  • Antibodies can be prepared by means well known in the art. The term “antibodies” is meant to include monoclonal antibodies, polyclonal antibodies and antibodies prepared by recombinant nucleic acid techniques that are selectively reactive with a desired antigen. Antibodies against the proteins encoded by any of the genes in the diagnostic gene groups of the present invention are either known or can be easily produced using the methods well known in the art. Internet sites such as Biocompare through the World Wide Web at “biocompare.com/abmatrix.asp?antibody=y” provide a useful tool to anyone skilled in the art to locate existing antibodies against any of the proteins provided according to the present invention.
  • Antibodies against the diagnostic proteins according to the present invention can be used in standard techniques such as Western blotting or immunohistochemistry to quantify the level of expression of the proteins of the diagnostic airway proteome. This is quantified according to the expression of the gene transcript, i.e. the increased expression of transcript corresponds to increased expression of the gene product, i.e. protein. Similarly decreased expression of the transcript corresponds to decreased expression of the gene product or protein. Detailed guidance of the increase or decrease of expression of preferred transcripts in lung disease, particularly lung cancer, is set forth in the tables. For example, Tables 5 and 6 describe a group of genes the expression of which is altered in lung cancer.
  • Immunohistochemical applications include assays, wherein increased presence of the protein can be assessed, for example, from a saliva or sputum sample.
  • The immunohistochemical assays according to the present invention can be performed using methods utilizing solid supports. The solid support can be a any phase used in performing immunoassays, including dipsticks, membranes, absorptive pads, beads, microtiter wells, test tubes, and the like. Preferred are test devices which may be conveniently used by the testing personnel or the patient for self-testing, having minimal or no previous training Such preferred test devices include dipsticks, membrane assay systems as described in U.S. Pat. No. 4,632,901. The preparation and use of such conventional test systems is well described in the patent, medical, and scientific literature. If a stick is used, the anti-protein antibody is bound to one end of the stick such that the end with the antibody can be dipped into the solutions as described below for the detection of the protein. Alternatively, the samples can be applied onto the antibody-coated dipstick or membrane by pipette or dropper or the like.
  • The antibody against proteins encoded by the diagnostic airway transcriptome (the “protein”) can be of any isotype, such as IgA, IgG or IgM, Fab fragments, or the like. The antibody may be a monoclonal or polyclonal and produced by methods as generally described, for example, in Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory, 1988, incorporated herein by reference. The antibody can be applied to the solid support by direct or indirect means. Indirect bonding allows maximum exposure of the protein binding sites to the assay solutions since the sites are not themselves used for binding to the support. Preferably, polyclonal antibodies are used since polyclonal antibodies can recognize different epitopes of the protein thereby enhancing the sensitivity of the assay.
  • The solid support is preferably non-specifically blocked after binding the protein antibodies to the solid support. Non-specific blocking of surrounding areas can be with whole or derivatized bovine serum albumin, or albumin from other animals, whole animal serum, casein, non-fat milk, and the like.
  • The sample is applied onto the solid support with bound protein-specific antibody such that the protein will be bound to the solid support through said antibodies. Excess and unbound components of the sample are removed and the solid support is preferably washed so the antibody-antigen complexes are retained on the solid support. The solid support may be washed with a washing solution which may contain a detergent such as Tween-20, Tween-80 or sodium dodecyl sulfate.
  • After the protein has been allowed to bind to the solid support, a second antibody which reacts with protein is applied. The second antibody may be labeled, preferably with a visible label. The labels may be soluble or particulate and may include dyed immunoglobulin binding substances, simple dyes or dye polymers, dyed latex beads, dye-containing liposomes, dyed cells or organisms, or metallic, organic, inorganic, or dye solids. The labels may be bound to the protein antibodies by a variety of means that are well known in the art. In some embodiments of the present invention, the labels may be enzymes that can be coupled to a signal producing system. Examples of visible labels include alkaline phosphatase, beta-galactosidase, horseradish peroxidase, and biotin. Many enzyme-chromogen or enzyme-substrate-chromogen combinations are known and used for enzyme-linked assays. Dye labels also encompass radioactive labels and fluorescent dyes.
  • Simultaneously with the sample, corresponding steps may be carried out with a known amount or amounts of the protein and such a step can be the standard for the assay. A sample from a healthy individual exposed to a similar air pollutant such as cigarette smoke, can be used to create a standard for any and all of the diagnostic gene group encoded proteins.
  • The solid support is washed again to remove unbound labeled antibody and the labeled antibody is visualized and quantified. The accumulation of label will generally be assessed visually. This visual detection may allow for detection of different colors, for example, red color, yellow color, brown color, or green color, depending on label used. Accumulated label may also be detected by optical detection devices such as reflectance analyzers, video image analyzers and the like. The visible intensity of accumulated label could correlate with the concentration of protein in the sample. The correlation between the visible intensity of accumulated label and the amount of the protein may be made by comparison of the visible intensity to a set of reference standards. Preferably, the standards have been assayed in the same way as the unknown sample, and more preferably alongside the sample, either on the same or on a different solid support.
  • The concentration of standards to be used can range from about 1 mg of protein per liter of solution, up to about 50 mg of protein per liter of solution. Preferably, two or more different concentrations of an airway gene group encoded proteins are used so that quantification of the unknown by comparison of intensity of color is more accurate.
  • For example, the present invention provides a method for detecting risk of developing lung cancer in a subject exposed to cigarette smoke comprising measuring the transcription profile of the proteins encoded by one or more groups of genes of the invention in a biological sample of the subject. Preferably at least about 30, still more preferably at least about 36, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, or about 180 of the proteins encoded by the airway transcriptome in a biological sample of the subject are analyzed. The method comprises binding an antibody against each protein encoded by the gene in the gene group (the “protein”) to a solid support chosen from the group consisting of dip-stick and membrane; incubating the solid support in the presence of the sample to be analyzed under conditions where antibody-antigen complexes form; incubating the support with an anti-protein antibody conjugated to a detectable moiety which produces a signal; visually detecting said signal, wherein said signal is proportional to the amount of protein in said sample; and comparing the signal in said sample to a standard, wherein a difference in the amount of the protein in the sample compared to said standard of the same group of proteins, is indicative of diagnosis of or an increased risk of developing lung cancer. The standard levels are measured to indicate expression levels in an airway exposed to cigarette smoke where no cancer has been detected.
  • The assay reagents, pipettes/dropper, and test tubes may be provided in the form of a kit. Accordingly, the invention further provides a test kit for visual detection of the proteins encoded by the airway gene groups, wherein detection of a level that differs from a pattern in a control individual is considered indicative of an increased risk of developing lung disease in the subject. The test kit comprises one or more solutions containing a known concentration of one or more proteins encoded by the airway transcriptome (the “protein”) to serve as a standard; a solution of a anti-protein antibody bound to an enzyme; a chromogen which changes color or shade by the action of the enzyme; a solid support chosen from the group consisting of dip-stick and membrane carrying on the surface thereof an antibody to the protein. Instructions including the up or down regulation of the each of the genes in the groups as provided by the Tables 1 and 2 are included with the kit.
  • The practice of the present invention may employ, unless otherwise indicated, conventional techniques and descriptions of organic chemistry, polymer technology, molecular biology (including recombinant techniques), cell biology, biochemistry, and immunology, which are within the skill of the art. Such conventional techniques include polymer array synthesis, hybridization, ligation, and detection of hybridization using a label. Specific illustrations of suitable techniques can be had by reference to the example herein below. However, other equivalent conventional procedures can, of course, also be used. Such conventional techniques and descriptions can be found in standard laboratory manuals such as Genome Analysis: A Laboratory Manual Series (Vols. I-IV), Using Antibodies: A Laboratory Manual, Cells: A Laboratory Manual, PCR Primer: A Laboratory Manual, and Molecular Cloning: A Laboratory Manual (all from Cold Spring Harbor Laboratory Press), Stryer, L. (1995) Biochemistry (4th Ed.) Freeman, N.Y., Gait, “Oligonucleotide Synthesis: A Practical Approach” 1984, IRL Press, London, Nelson and Cox (2000), Lehninger, Principles of Biochemistry 3rd Ed., W.H. Freeman Pub., New York, N.Y. and Berg et al. (2002) Biochemistry, 5th Ed., W.H. Freeman Pub., New York, N.Y., all of which are herein incorporated in their entirety by reference for all purposes.
  • The methods of the present invention can employ solid substrates, including arrays in some preferred embodiments. Methods and techniques applicable to polymer (including protein) array synthesis have been described in U.S. Ser. No. 09/536,841, WO 00/58516, U.S. Pat. Nos. 5,143,854, 5,242,974, 5,252,743, 5,324,633, 5,384,261, 5,405,783, 5,424,186, 5,451,683, 5,482,867, 5,491,074, 5,527,681, 5,550,215, 5,571,639, 5,578,832, 5,593,839, 5,599,695, 5,624,711, 5,631,734, 5,795,716, 5,831,070, 5,837,832, 5,856,101, 5,858,659, 5,936,324, 5,968,740, 5,974,164, 5,981,185, 5,981,956, 6,025,601, 6,033,860, 6,040,193, 6,090,555, 6,136,269, 6,269,846 and 6,428,752, in PCT Applications Nos. PCT/US99/00730 (International Publication Number WO 99/36760) and PCT/US01/04285, which are all incorporated herein by reference in their entirety for all purposes.
  • Patents that describe synthesis techniques in specific embodiments include U.S. Pat. Nos. 5,412,087, 6,147,205, 6,262,216, 6,310,189, 5,889,165, and 5,959,098. Nucleic acid arrays are described in many of the above patents, but the same techniques are applied to polypeptide and protein arrays.
  • Nucleic acid arrays that are useful in the present invention include, but are not limited to those that are commercially available from Affymetrix (Santa Clara, Calif.) under the brand name GeneChip7. Example arrays are shown on the website at affymetrix.com.
  • Examples of gene expression monitoring, and profiling methods that are useful in the methods of the present invention are shown in U.S. Pat. Nos. 5,800,992, 6,013,449, 6,020,135, 6,033,860, 6,040,138, 6,177,248 and 6,309,822. Other examples of uses are embodied in U.S. Pat. Nos. 5,871,928, 5,902,723, 6,045,996, 5,541,061, and 6,197,506.
  • The present invention also contemplates sample preparation methods in certain preferred embodiments. Prior to or concurrent with expression analysis, the nucleic acid sample may be amplified by a variety of mechanisms, some of which may employ PCR. See, e.g., PCR Technology: Principles and Applications for DNA Amplification (Ed. H. A. Erlich, Freeman Press, NY, N.Y., 1992); PCR Protocols: A Guide to Methods and Applications (Eds. Innis, et al., Academic Press, San Diego, Calif., 1990); Mattila et al., Nucleic Acids Res. 19, 4967 (1991); Eckert et al., PCR Methods and Applications 1, 17 (1991); PCR (Eds. McPherson et al., IRL Press, Oxford); and U.S. Pat. Nos. 4,683,202, 4,683,195, 4,800,159 4,965,188, and 5,333,675, and each of which is incorporated herein by reference in their entireties for all purposes. The sample may be amplified on the array. See, for example, U.S. Pat. No. 6,300,070 and U.S. patent application Ser. No. 09/513,300, which are incorporated herein by reference.
  • Other suitable amplification methods include the ligase chain reaction (LCR) (e.g., Wu and Wallace, Genomics 4, 560 (1989), Landegren et al., Science 241, 1077 (1988) and Barringer et al. Gene 89:117 (1990)), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA 86, 1173 (1989) and WO88/10315), self-sustained sequence replication (Guatelli et al., Proc. Nat. Acad. Sci. USA, 87, 1874 (1990) and WO90/06995), selective amplification of target polynucleotide sequences (U.S. Pat. No. 6,410,276), consensus sequence primed polymerase chain reaction (CP-PCR) (U.S. Pat. No. 4,437,975), arbitrarily primed polymerase chain reaction (AP-PCR) (U.S. Pat. No. 5,413,909, 5,861,245) and nucleic acid based sequence amplification (NABSA). (U.S. Pat. Nos. 5,409,818, 5,554,517, and 6,063,603). Other amplification methods that may be used are described in, U.S. Pat. Nos. 5,242,794, 5,494,810, 4,988,617 and in U.S. Ser. No. 09/854,317, each of which is incorporated herein by reference.
  • Additional methods of sample preparation and techniques for reducing the complexity of a nucleic sample are described, for example, in Dong et al., Genome Research 11, 1418 (2001), in U.S. Pat. No. 6,361,947, 6,391,592 and U.S. Patent application Ser. Nos. 09/916,135, 09/920,491, 09/910,292, and 10/013,598.
  • Methods for conducting polynucleotide hybridization assays have been well developed in the art. Hybridization assay procedures and conditions will vary depending on the application and are selected in accordance with the general binding methods known including those referred to in: Maniatis et al. Molecular Cloning: A Laboratory Manual (2nd Ed. Cold Spring Harbor, N.Y. 1989); Berger and Kimmel Methods in Enzymology, Vol. 152, Guide to Molecular Cloning Techniques (Academic Press, Inc., San Diego, Calif., 1987); Young and Davism, P.N.A.S, 80: 1194 (1983). Methods and apparatus for carrying out repeated and controlled hybridization reactions have been described, for example, in U.S. Pat. Nos. 5,871,928, 5,874,219, 6,045,996 and 6,386,749, 6,391,623 each of which are incorporated herein by reference
  • The present invention also contemplates signal detection of hybridization between the sample and the probe in certain embodiments. See, for example, U.S. Pat. Nos. 5,143,854, 5,578,832; 5,631,734; 5,834,758; 5,936,324; 5,981,956; 6,025,601; 6,141,096; 6,185,030; 6,201,639; 6,218,803; and 6,225,625, in provisional U.S. Patent application 60/364,731 and in PCT Application PCT/US99/06097 (published as WO99/47964).
  • Examples of methods and apparatus for signal detection and processing of intensity data are disclosed in, for example, U.S. Pat. Nos. 5,143,854, 5,547,839, 5,578,832, 5,631,734, 5,800,992, 5,834,758; 5,856,092, 5,902,723, 5,936,324, 5,981,956, 6,025,601, 6,090,555, 6,141,096, 6,185,030, 6,201,639; 6,218,803; and 6,225,625, in U.S. Patent application 60/364,731 and in PCT Application PCT/US99/06097 (published as WO99/47964).
  • The practice of the present invention may also employ conventional biology methods, software and systems. Computer software products of the invention typically include computer readable medium having computer-executable instructions for performing the logic steps of the method of the invention. Suitable computer readable medium include floppy disk, CD-ROM/DVD/DVD-ROM, hard-disk drive, flash memory, ROM/RAM, magnetic tapes and etc. The computer executable instructions may be written in a suitable computer language or combination of several languages. Basic computational biology methods are described in, e.g. Setubal and Meidanis et al., Introduction to Computational Biology Methods (PWS Publishing Company, Boston, 1997); Salzberg, Searles, Kasif, (Ed.), Computational Methods in Molecular Biology, (Elsevier, Amsterdam, 1998); Rashidi and Buehler, Bioinformatics Basics: Application in Biological Science and Medicine (CRC Press, London, 2000) and Ouelette and Bzevanis Bioinformatics: A Practical Guide for Analysis of Gene and Proteins (Wiley & Sons, Inc., 2nd ed., 2001).
  • The present invention also makes use of various computer program products and software for a variety of purposes, such as probe design, management of data, analysis, and instrument operation. See, for example, U.S. Pat. Nos. 5,593,839, 5,795,716, 5,733,729, 5,974,164, 6,066,454, 6,090,555, 6,185,561, 6,188,783, 6,223,127, 6,229,911 and 6,308,170.
  • Additionally, the present invention may have embodiments that include methods for providing gene expression profile information over networks such as the Internet as shown in, for example, U.S. patent applications Ser. Nos. 10/063,559, 60/349,546, 60/376,003, 60/394,574, 60/403,381.
  • Throughout this specification, various aspects of this invention are presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the invention. Accordingly, the description of a range should be considered to have specifically disclosed all the possible sub-ranges as well as individual numerical values within that range. For example, description of a range such as from 10-20 should be considered to have specifically disclosed sub-ranges such as from 10-13, from 10-14, from 10-15, from 11-14, from 11-16, etc., as well as individual numbers within that range, for example, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, and 20. This applies regardless of the breadth of the range. In addition, the fractional ranges are also included in the exemplified amounts that are described. Therefore, for example, a range of 1-3 includes fractions such as 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, etc. This applies particularly to the amount of increase or decrease of expression of any particular gene or transcript.
  • Methods and systems of the present disclosure may be combined with or modified by other methods or systems, such as, for example, those described in WO/2005/000098, which is entirely incorporated herein by reference.
  • The present invention has many preferred embodiments and relies on many patents, applications and other references for details known to those of the art. Therefore, when a patent, application, or other reference is cited or repeated throughout the specification, it should be understood that it is incorporated by reference in its entirety for all purposes as well as for the proposition that is recited.
  • EXAMPLES Example 1
  • In this study, we used three study groups: 1) normal non-smokers (n=23); 2) smokers without cancer (active v. former smokers) (n=52); 3) smokers with suspect cancer (n=98: 45 cancer, 53 no cancer).
  • We obtained epithelial nucleic acids (RNA/DNA) from epithelial cells in mouth and airway (bronchoscopy). We also obtained nucleic acids from blood to provide one control.
  • We analyzed gene expression using RNA and U133A Affymetrix array that represents transcripts from about 22,500 genes.
  • The microarray data analysis was performed as follows. We first scanned the Affymetrix chips that had been hybridized with the study group samples. The obtained microarray raw data consisted of signal strength and detection p-value. We normalized or scaled the data, and filtered the poor quality chips based on images, control probes, and histograms according to standard Affymetrix instructions. We also filtered contaminated specimens which contained non-epithelial cells. Lastly, the genes of importance were filtered using detection p-value. This resulted in identification of transcripts present in normal airways (normal airway transcriptome), with variability and multiple regression analysis. This also resulted in identification of effects of smoking on airway epithelial cell transcription. For this, we used T-test and Pearson correlation analysis. We also identified a group or a set of transcripts that were differentially expressed in samples with lung cancer and samples without cancer. This analysis was performed using class prediction models.
  • We used weighted voting method. The weighted voting method ranks, and gives a weight “p” to all genes by the signal to noise ration of gene expression between two classes: P=mean(class 1)−mean(class 2)/sd(class 1)=sd(class 2). Committees of variable sizes of the top ranked genes were used to evaluate test samples, but genes with more significant p-values were more heavily weighed. Each committee genes in test sample votes for one class or the other, based on how close that gene expression level is to the class 1 mean or the class 2 mean. V(gene A)=P(gene A), i.e. level of expression in test sample less the average of the mean expression values in the two classes. Votes for each class were tallied and the winning class was determined along with prediction strength as PS=Vwin−Vlose/Vwin+Vlose. Finally, the accuracy was validated using cross-validation +/− independent samples.
  • FIG. 8 shows diagrams of the class prediction model analysis used in the Example 1.
  • The results of the weighted voting method for a 50 gene group analysis (50 gene committee) were as follows. Cross-validation (n=74) resulted in accuracy of 81%, with sensitivity of 76% and specificity of 85%. In an independent dataset (n=24) the accuracy was 88%, with sensitivity of 75% and specificity of 100%.
  • We note that with sensitivity to bronchoscopy alone only 18/45 (40%) of cancers were diagnosed at the time of bronchoscopy using brushings, washings, biopsy or Wang.
  • We performed a gene expression analysis of the human genome using isolated nucleic acid samples comprising lung cell transcripts from individuals. The chip used was the Human Genome U133 Set. We used Microarray Suite 5.0 software to analyze raw data from the chip (i.e. to convert the image file into numerical data). Both the chip and the software are proprietary materials from Affymetrix. Bronchoscopy was performed to obtain nucleic acid samples from 98 smoker individuals.
  • We performed a Student's t-test using gene expression analysis of 45 smokers with lung cancer and 53 smokers without lung cancer. We identified several groups of genes that showed significant variation in their expression between smokers with cancer and smokers without cancer. We further identified at least three groups of genes that, when their expression was analyzed in combination, the results allowed us to significantly increase diagnostic power in identifying cancer carrying smokers from smokers without cancer.
  • The predictor groups of genes were identified using the GenePattern server from the Broad Institute, which includes the Weighted Voting algorithm. The default settings, i.e., the signal to noise ratio and no gene filtering, were used. GenePattern is available at World Wide Web from broad.mitedu/cancer/software/genepattern. This program allows analysis of data in groups rather than as individual genes.
  • Table 1 shows the top 96 genes from our analysis with different expression patterns in smokers with cancer and smokers without cancer.
  • Table 2 shows the 84 genes that were also identified in our previous screens as individual predictors of lung cancer.
  • Table 4 shows a novel group of 36 genes the expression of which was different between the smokers with cancer and smokers without cancer.
  • Table 3 shows a group of 50 genes that we identified as most predictive of development of cancer in smokers. That is, that when the expression of these genes was analyzed and reflected the pattern (expression down or up) as shown in Table 3, we could identify the individuals who will develop cancer based on this combined expression profile of these genes. When used in combination, the expression analysis of these 50 genes was predictive of a smoker developing lung cancer in over 70% of the samples. Accuracy of diagnosis of lung cancer in our sample was 80-85% on cross-validation and independent dataset (accuracy includes both the sensitivity and specificity). The sensitivity (percent of cancer cases correctly diagnosed) was approximately 75% as compared to sensitivity of 40% using standard bronchoscopy technique. (Specificity is percent of non-cancer cases correctly diagnosed).
  • These data show the dramatic increase of diagnostic power that can be reached using the expression profiling of the gene groups as identified in the present study.
  • Example 2
  • We report here a gene expression profile, derived from histologically normal large airway epithelial cells of current and former smokers with clinical suspicion of lung cancer that is highly sensitive and specific for the diagnosis of lung cancer. This airway signature is effective in diagnosing lung cancer at an early and potentially resectable stage. When combined with results from bronchoscopy (i.e. washings, brushings, and biopsies of the affected area), the expression profile is diagnostic of lung cancer in 95% of cases. We further show that the airway epithelial field of injury involves a number of genes that are differentially expressed in lung cancer tissue, providing potential information about pathways that may be involved in the genesis of lung cancer.
  • Patient Population: We obtained airway brushings from current and former smokers (n=208) undergoing fiber optic bronchoscopy as a diagnostic study for clinical suspicion of lung cancer between January 2003 and May 2005. Patients were recruited from 4 medical centers: Boston University Medical Center, Boston, Mass.; Boston Veterans Administration, West Roxbury, Mass.; Lahey Clinic, Burlington, Mass.; and Trinity College, Dublin, Ireland. Exclusion criteria included never smokers, cigar smokers and patients on a mechanical ventilator at the time of their bronchoscopy. Each subject was followed clinically, post-bronchoscopy, until a final diagnosis of lung cancer or an alternate benign diagnosis was made. Subjects were classified as having lung cancer if their bronchoscopy studies (brushing, bronchoalveolar lavage or endobronchial biopsy) or a subsequent lung biopsy (transthoracic biopsy or surgical lung biopsy) yielded tumor cells on pathology/cytology. Subjects were classified with an alternative benign diagnosis if the bronchoscopy or subsequent lung biopsy yielded a non-lung cancer diagnosis or if their radiographic abnormality resolved on follow up chest imaging. The study was approved by the Institutional Review Boards of all 4 medical centers and all participants provided written informed consent.
  • Airway epithelial cell collection: Following completion of the standard diagnostic bronchoscopy studies, bronchial airway epithelial cells were obtained from the “uninvolved” right mainstem bronchus with an endoscopic cytobrush (Cellebrity Endoscopic Cytobrush, Boston Scientific, Boston, Mass.). If a suspicious lesion (endobronchial or submucosal) was seen in the right mainstem bronchus, cells were then obtained from the uninvolved left mainstem bronchus. The brushes were immediately placed in TRIzol reagent (Invitrogen, Carlsbad, Calif.) after removal from the bronchoscope and kept at −80° C. until RNA isolation was performed. RNA was extracted from the brushes using TRIzol Reagent (Invitrogen) as per the manufacturer protocol, with a yield of 8-15 μg of RNA per patient. Integrity of the RNA was confirmed by denaturing gel electrophoresis. Epithelial cell content and morphology of representative bronchial brushing samples was quantified by cytocentrifugation (ThermoShandon Cytospin, Pittsburgh, Pa.) of the cell pellet and staining with a cytokeratin antibody (Signet, Dedham Mass.). These samples were reviewed by a pathologist who was blinded to the diagnosis of the patient.
  • Microarray data acquisition and preprocessing: 6-8 μg of total RNA was processed, labeled, and hybridized to Affymetrix HG-U133A GeneChips containing approximately 22,215 human transcripts as described previously(17). We obtained sufficient quantity of high quality RNA for microarray studies from 152 of the 208 samples. The quantity of RNA obtained improved during the course of the study so that 90% of brushings yielded sufficient high quality RNA during the latter half of the study. Log-normalized probe-level data was obtained from CEL files using the Robust Multichip Average (RMA) algorithm(18). A z-score filter was employed to filter out arrays of poor quality (see supplement for details), leaving 129 samples with a final diagnosis available for analysis.
  • Microarray Data Analysis: Class Prediction
  • To develop and test a gene expression predictor capable of distinguishing smokers with and without lung cancer, 60% of samples (n=77) representing a spectrum of clinical risk for lung cancer and approximately equal numbers of cancer and no cancer subjects were randomly assigned to a training set (see Supplement). Using the training set samples, the 22,215 probesets were filtered via ANCOVA using pack-years as the covariate; probesets with a p-value greater than 0.05 for the difference between the two groups were excluded. This training-set gene filter was employed to control for the potential confounding effect of cumulative tobacco exposure, which differed between subjects with and without cancer (see Table 1a).
  • Cancer NonCancer
    Samples
    60 69
    Age** 64.1 +/− 9.0 49.8 +/− 15.2
    Smoking Status 51.7% F, 48..3% C 37.7% F, 62..3% C
    Gender
    80% M, 20% F 73.9% M, 26.1% F
    PackYears**   57.4 +/− 25..6  29.4 +/− 27..3
    Age Started 15.2 +/− 4.2 16.7 +/− 6.8 
    Smoking intensity  1.3 +/− 0.45 0.9 +/− 0.5
    (PPD): Currents*
    Months Quit:  113 +/− 118 158 +/− 159
    Formers
    *Two classes statistically different: p < 0.05
    **Two classes statistically different: p < 0.001
  • Table 1a shows demographic features and characteristics of the two patient classes being studied. Statistical differences between the two patient classes and associated p values were calculated using T-tests, Chi-square tests and Fisher's exact tests where appropriate.
  • Gene selection was conducted through internal cross-validation within the training set using the weighted voting algorithm(19). The internal cross-validation was repeated 50 times, and the top 40 up-and top 40 down-regulated probesets in cancer most frequently chosen during internal cross-validation runs were selected as the final gene committee of 80 features (see sections, infra, for details regarding the algorithm and the number of genes selected for the committee).
  • The accuracy, sensitivity, and specificity of the biomarker were assessed on the independent test set of 52 samples. This was accomplished by using the weighted vote algorithm to predict the class of each test set sample based on the gene expression of the 80 probesets and the probe set weights derived from the 77 samples in the training set. To assess the performance of our classifier, we first created 1000 predictors from the training set where we randomized the training set class labels. We evaluated the performance of these “class-randomized” classifiers for predicting the sample class of the test set samples and compared these to our classifier using ROC analysis. To assess whether the performance of our gene expression profile depends on the specific training and test sets from which it was derived and tested, we next created 500 new training and test sets with our 129 samples and derived new “sample-randomized” classifiers from each of these training sets which were then tested on the corresponding test set. To assess the specificity of our classifier genes, we next created 500 classifiers each composed of 80 randomly selected genes. We then tested the ability of these “gene-randomized” classifiers to predict the class of samples in the test set. To evaluate the robustness of our class prediction algorithm and data preprocessing, we also used these specific 80 genes to generate predictive models with an alternate class prediction algorithm (Prediction Analysis of Microarrays (PAM)(20)) and with MAS 5.0 generated expression data instead of RMA. Finally, the performance of our predictor was compared to the diagnostic yield of bronchoscopy.
  • Quantitative PCR Validation: Real time PCR (QRT-PCR) was used to confirm the differential expression of a select number of genes in our predictor. Primer sequences were designed with Primer Express software (Applied Biosystems, Foster City, Calif.). Forty cycles of amplification, data acquisition, and data analysis were carried out in an ABI Prism 7700 Sequence Detector (Applied Biosystems, Foster City, Calif.). All real time PCR experiments were carried out in triplicate on each sample (see sections infra).
  • Linking to lung cancer tissue microarray data: The 80-gene lung cancer biomarker derived from airway epithelium gene expression was evaluated for its ability to distinguish between normal and cancerous lung tissue using an Affymetrix HGU95Av2 dataset published by Bhattacharjee et al(21) that we processed using RMA. By mapping Unigene identifiers, 64 HGU95Av2 probesets were identified that measure the expression of genes that corresponded to the 80 probesets in our airway classifier. This resulted in a partial airway epithelium signature that was then used to classify tumor and normal samples from the dataset. In addition, PCA analysis of the lung tissue samples was performed using the expression of these 64 probesets.
  • To further assess the statistical significance of the relationship between datasets, Gene Set Enrichment Analysis(22) was performed to determine if the 64 biomarker genes are non-randomly distributed within the HGU95Av2 probesets ordered by differential expression between normal and tumor tissue. Finally, a two-tailed Fisher Exact Test was used to test if the proportion of biomarker genes among the genes differentially expressed between normal and tumor lung tissue is different from the overall proportion of differentially expressed genes (see sections, infra).
  • Statistical Analysis: RMA was performed in BioConductor. The upstream gene filtering by ANCOVA, and the implementation of the weighted voted algorithm and internal cross validation used to generate the data were executed through an R script we wrote for this purpose. The PAM algorithm was carried out using the ‘pamr’ library in R. All other statistical analyses including Student's T-Tests, Fisher's exact tests, ROC curves and PCA were performed using the R statistical package.
  • Study Population and Epithelial samples: 129 subjects that had microarrays passing the quality control filter described above were included in the class prediction analysis (see Supplemental FIG. 1). Demographic data on these subjects, including 60 smokers with primary lung cancer and 69 smokers without lung cancer is presented in Table 1. Cell type and stage information for all cancer patients is shown in Supplemental Table 1. Bronchial brushings yielded 90% epithelial cells, as determined by cytokeratin staining, with the majority being ciliated cells with normal bronchial airway morphology. No dysplastic or cancer cells were seen on any representative brushings obtained from smokers with or without cancer.
  • Class Prediction analysis: Comparison of demographic features for 77 subjects in the training set vs. the 52 samples in the test set is shown in Supplemental Table 2. An 80 gene class prediction committee capable of distinguishing smokers with and without cancer was built on the training set of 77 samples and tested on the independent sample set (FIG. 14). The accuracy, sensitivity and specificity of this model was 83%(43/52), 80% (16/20) and 84% (27/32) respectively. When samples predicted with a low degree of confidence (as defined by a Prediction Strength metric <0.3; see Supplement for details) were considered non-diagnostic, the overall accuracy of the model on the remaining 43 samples in the test set increased to 88% (93% sensitivity, 86% specificity). Hierarchical clustering of the 80 genes selected for the diagnostic biomarker in the test set samples is shown in FIG. 15. Principal Component Analysis of all cancer samples according to the expression of these 80 genes did not reveal grouping by cell type (FIG. 10). The accuracy of this 80-gene classifier was similar when microarray data was preprocessed in MAS 5.0 and when the PAM class prediction algorithm was used (see Supplemental Table 3).
  • The 80-gene predictor's accuracy, sensitivity and specificity on the 52 sample test set was significantly better than the performance of classifiers derived from randomizing the class labels of the training set (p=0.004; empiric p-value for random classifier AUC>true classifier AUC; FIG. 16). The performance of the classifier was not dependent on the particular composition of the training and test set on which it was derived and tested: 500 training and test sets (derived from the 129 samples) resulted in classifiers with similar accuracy as the classifier derived from our training set (FIG. 11). Finally, we demonstrated that the classifier is better able to distinguish the two sample classes than 500 classifiers derived by randomly selecting genes (see FIG. 12).
  • Real time PCR: Differential expression of select genes in our diagnostic airway profile was confirmed by real time PCR (see FIG. 13).
  • Linking to lung cancer tissue: Our airway biomarker was also able to correctly classify lung cancer tissue from normal lung tissue with 98% accuracy. Principal Component Analysis demonstrated separation of non-cancerous samples from cancerous samples in the Bhattacharjee dataset according to the expression of our airway signature (see FIG. 17). Furthermore, our class prediction genes were statistically overrepresented among genes differentially expressed between cancer vs. no cancer in the Bhattacharjee dataset by Fisher exact test (p<0.05) and Gene Enrichment Analysis (FDR<0.25, see Supplement for details).
  • Synergy with Bronchoscopy: Bronchoscopy was diagnostic (via endoscopic brushing, washings or biopsy of the affected region) in 32/60 (53%) of lung cancer patients and 5/69 non-cancer patients. Among non-diagnostic bronchoscopies (n=92), our class prediction model had an accuracy of 85% with 89% sensitivity and 83% specificity. Combining bronchoscopy with our gene expression signature resulted in a 95% diagnostic sensitivity (57/60) across all cancer subjects. Given the approximate 50% disease prevalence in our cohort, a negative bronchoscopy and negative gene expression signature for lung cancer resulted in a 95% negative predictive value (NPV) for disease (FIG. 18). In patients with a negative bronchoscopy, the positive predictive value of our gene expression profile for lung cancer was approximately 70% (FIG. 18).
  • Stage and cell type subgroup analysis: The diagnostic yield of our airway gene expression signature vs. bronchoscopy according to stage and cell type of the lung cancer samples is shown in FIG. 19.
  • Lung cancer is the leading cause of death from cancer in the United States, in part because of the lack of sensitive and specific diagnostic tools that are useful in early-stage disease. With approximately 90 million former and current smokers in the U.S., physicians increasingly encounter smokers with clinical suspicion for lung cancer on the basis of an abnormal radiographic imaging study and/or respiratory symptoms. Flexible bronchoscopy represents a relatively noninvasive initial diagnostic test to employ in this setting. This study was undertaken in order to develop a gene expression-based diagnostic, that when combined with flexible bronchoscopy, would provide a sensitive and specific one-step procedure for the diagnosis of lung cancer. Based on the concept that cigarette smoking creates a respiratory tract “field defect”, we examined the possibility that profiles of gene expression in relatively easily accessible large airway epithelial cells would serve as an indicator of the amount and type of cellular injury induced by smoking and might provide a diagnostic tool in smokers who were being evaluated for the possibility of lung cancer.
  • We have previously shown that smoking induces a number of metabolizing and anti-oxidant genes, induces expression of several putative oncogenes and suppresses expression of several potential tumor suppressor genes in large airway epithelial cells(17). We show here that the pattern of airway gene expression in smokers with lung cancer differs from smokers without lung cancer, and the expression profile of these genes in histologically normal bronchial epithelial cells can be used as a sensitive and specific predictor of the presence of lung cancer. We found that the expression signature was particularly useful in early stage disease where bronchoscopy was most often negative and where most problems with diagnosis occur. Furthermore, combining the airway gene expression signature with bronchoscopy results in a highly sensitive diagnostic approach capable of identifying 95% of lung cancer cases.
  • Given the unique challenges to developing biomarkers for disease using DNA microarrays(23), we employed a rigorous computational approach in the evaluation of our dataset. The gene expression biomarker reported in this paper was derived from a training set of samples obtained from smokers with suspicion of lung cancer and was tested on an independent set of samples obtained from four tertiary medical centers in the US and Ireland. The robust nature of this approach was confirmed by randomly assigning samples into separate training and test sets and demonstrating a similar overall accuracy (FIG. 11). In addition, the performance of our biomarker was significantly better than biomarkers obtained via randomization of class labels in the training set (FIG. 16) or via random 80 gene committees (FIG. 8). Finally, the performance of our 80-gene profile remained unchanged when microarray data was preprocessed via a different algorithm or when a second class prediction algorithm was employed.
  • In terms of limitations, our study was not designed to assess performance as a function of disease stage or subtype. Our gene expression predictor, however, does appear robust in early stage disease compared with bronchoscopy (see FIG. 19). Our profile was able to discriminate between cancer and no cancer across all subtypes of lung cancer (see FIG. 10). 80% of the cancers in our dataset were NSCLC and our biomarker was thus trained primarily on events associated with that cell type. However, given the high yield for bronchoscopy alone in the diagnosis of small cell lung cancer, this does not limit the sensitivity and negative predictive value of the combined bronchoscopy and gene expression signature approach. A large-scale clinical trial is needed to validate our signature across larger numbers of patients and establish its efficacy in early stage disease as well as its ability to discriminate between subtypes of lung cancer.
  • In addition to serving as a diagnostic biomarker, profiling airway gene expression across smokers with and without lung cancer can also provide insight into the nature of the “field of injury” reported in smokers and potential pathways implicated in lung carcinogenesis. Previous studies have demonstrated allelic loss and methylation of tumor suppressor genes in histologically normal bronchial epithelial cells from smokers with and without lung cancer(12; 13; 15). Whether these changes are random mutational effects or are directly related to lung cancer has been unclear. The finding that our airway gene signature was capable of distinguishing lung cancer tissue from normal lung (FIG. 4) suggests that the airway biomarker is, at least in part, reflective of changes occurring in the cancerous tissue and may provide insights into lung cancer biology.
  • Among the 80 genes in our diagnostic signature, a number of genes associated with the RAS oncogene pathway, including Rab 1a and FOS, are up regulated in the airway of smokers with lung cancer. Rab proteins represent a family of at least 60 different Ras-like GTPases that have crucial roles in vesicle trafficking, signal transduction, and receptor recycling, and dysregulation of RAB gene expression has been implicated in tumorigenesis(24). A recent study by Shimada et al.(25) found a high prevalence of Rab1A-overexpression in head and neck squamous cell carcinomas and also in premalignant tongue lesions, suggesting that it may be an early marker of smoking-related respiratory tract carcinogenesis.
  • In addition to these RAS pathway genes, the classifier contained several pro-inflammatory genes, including Interleukin-8 (IL-8) and beta-defensin 1 that were up regulated in smokers with lung cancer. IL-8, originally discovered as a chemotactic factor for leukocytes, has been shown to contribute to human cancer progression through its mitogenic and angiogenic properties(26; 27). Beta defensins, antimicrobial agents expressed in lung epithelial cells, have recently found to be elevated in the serum of patients with lung cancer as compared to healthy smokers or patients with pneumonia(28). Higher levels of these mediators of chronic inflammation in response to tobacco exposure may result in increased oxidative stress and contribute to tumor promotion and progression in the lung(29; 30)
  • A number of key antioxidant defense genes were found to be decreased in airway epithelial cells of subjects with lung cancer, including BACH2 and dual oxidase 1, along with a DNA repair enzyme, DNA repair protein 1C. BACH-2, a transcription factor, promotes cell apoptosis in response to high levels of oxidative-stress(31). We have previously found that a subset of healthy smokers respond differently to tobacco smoke, failing to induce a set of detoxification enzymes in their normal airway epithelium, and that these individuals may be predisposed to its carcinogenic effects(17). Taken together, these data suggest that a component of the airway “field defect” may reflect whether a given smoker is appropriately increasing expression of protective genes in response to the toxin. This inappropriate response may reflect a genetic susceptibility to lung cancer or alternatively, epigenetic silencing or deletion of that gene by the carcinogen.
  • In summary, our study has identified an airway gene expression biomarker that has the potential to directly impact the diagnostic evaluation of smokers with suspect lung cancer. These patients usually undergo fiberoptic bronchoscopy as their initial diagnostic test. Gene expression profiling can be performed on normal-appearing airway epithelial cells obtained in a simple, non-invasive fashion at the time of the bronchoscopy, prolonging the procedure by only 3-5 minutes, without adding significant risks. Our data strongly suggests that combining results from bronchoscopy with the gene expression biomarker substantially improves the diagnostic sensitivity for lung cancer (from 53% to 95%). In a setting of 50% disease prevalence, a negative bronchoscopy and negative gene expression signature for lung cancer results in a 95% negative predictive value (NPV), allowing these patients to be followed non-aggressively with repeat imaging studies. For patients with a negative bronchoscopy and positive gene expression signature, the positive predictive value is ˜70%, and these patients would likely require further invasive testing (i.e. transthoracic needle biopsy or open lung biopsy) to confirm the presumptive lung cancer diagnosis. However, this represents a substantial reduction in the numbers of patients requiring further invasive diagnostic testing compared to using bronchoscopy alone. In our study, 92/129 patients were bronchoscopy negative and would have required further diagnostic work up. However, the negative predictive gene expression profile in 56 of these 92 negative bronchoscopy subjects would leave only 36 subjects who would require further evaluation (see FIG. 18).
  • The cross-sectional design of our study limits interpretation of the false positive rate for our signature. Given that the field of injury may represent whether a smoker is appropriately responding to the toxin, derangements in gene expression could precede the development of lung cancer or indicate a predisposition to the disease. Long-term follow-up of the false positive cases is needed (via longitudinal study) to assess whether they represent smokers who are at higher risk for developing lung cancer in the future. If this proves to be true, our signature could serve as a screening tool for lung cancer among healthy smokers and have the potential to identify candidates for chemoprophylaxis trials.
  • Study Patients and Sample Collection
  • A. Primary sample set: We recruited current and former smokers undergoing flexible bronchoscopy for clinical suspicion of lung cancer at four tertiary medical centers. All subjects were older than 21 years of age and had no contraindications to flexible bronchoscopy including hemodynamic instability, severe obstructive airway disease, unstable cardiac or pulmonary disease (i.e. unstable angina, congestive heart failure, respiratory failure) inability to protect airway or altered level of consciousness and inability to provide informed consent. Never smokers and subjects who only smoked cigars were excluded from the study. For each consented subject, we collected data regarding their age, gender, race, and a detailed smoking history including age started, age quit, and cumulative tobacco exposure. Former smokers were defined as patients who had not smoked a cigarette for at least one month prior to entering our study. All subjects were followed, post-bronchoscopy, until a final diagnosis of lung cancer or an alternative diagnosis was made (mean follow-up time=52 days). For those patients diagnosed with lung cancer, the stage and cell type of their tumor was recorded. The clinical data collected from each subject in this study can be accessed in a relational database at http://pulm.bumc.bu.edu/CancerDx/. The stage and cell type of the 60 cancer samples used to train and test the class prediction model is shown in Supplemental Table 1 below.
  • Cell Type Stage
    NSCLC 48 NSCLC staging
    Squamous Cell
    23 IA 2
    Adenocarcinoma 11 IB 9
    Large Cell 4 IIA 2
    Not classified 10 IIB 0
    Small Cell 11 IIIA 9
    Unknown 1 IIIB 9
    IV 17
  • Supplemental Table 1 above shows cell type and staging information for 60 lung cancer patients in the 129 primary sample set used to build and test the class prediction model. Staging information limited to the 48 non-small cell samples.
  • The demographic features of the samples in training and test shown are shown in Supplemental Table 2 below. The Table shows patient demographics for the primary dataset (n=129) according to training and test set status. Statistical differences between the two patient classes and associated p values were calculated using T-tests, Chi-square tests and Fisher's exact tests where appropriate. PPD=packs per day, F=former smokers, C=current smokers, M=male, F=female.
  • Training set Test set
    Samples 77 52
    Age 59.3 +/− 13.1 52.1 +/− 15.6
    Smoking Status 41.6% F, 58.4% C 48.1% F, 51.9% C
    Gender* 83.1% M, 16.9% F 67.3% M, 32.7% F
    PackYears 45.6 +/− 31   37.7 +/− 27.8
    Age Started 16.2 +/− 6.3  15.8 +/− 5.3 
    Smoking intensity  1.1 +/− 0.53   1 +/− 0.5
    (PPD): Currents
    Months Quit: 128 +/− 139 139 +/ 141
    Formers
    *Two classes statistically different: p < 0.05
  • While our study recruited patients whose indication for bronchoscopy included a suspicion for lung cancer, each patient's clinical pre-test probability for disease varied. In order to ensure that our class prediction model was trained on samples representing a spectrum of lung cancer risk, three independent pulmonary clinicians, blinded to the final diagnoses, evaluated each patient's clinical history (including age, smoking status, cumulative tobacco exposure, co-morbidities, symptoms/signs and radiographic findings) and assigned a pre-bronchoscopy probability for lung cancer. Each patient was classified into one of three risk groups: low (<10% probability of lung cancer), medium (10-50% probability of lung cancer) and high (>50% probability of lung cancer). The final risk assignment for each patient was decided by the majority opinion.
  • Prospective Sample Set:
  • After completion of the primary study, a second set of samples was collected from smokers undergoing flexible bronchoscopy for clinical suspicion of lung cancer at 5 medical centers (St. Elizabeth's Hospital in Boston, Mass. was added to the 4 institutions used for the primary dataset). Inclusion and exclusion criteria were identical to the primary sample set. Forty additional subjects were included in this second validation set. Thirty-five subjects had microarrays that passed our quality-control filter. Demographic data on these subjects, including 18 smokers with primary lung cancer and 17 smokers without lung cancer, is presented in Supplemental Table 3. There was no statistical difference in age or cumulative tobacco exposure between case and controls in this prospective cohort (as opposed to the primary dataset; see Table 1a).
  • Supplemental Table 3 below shows patient demographics for the prospective validation set (n=35) by cancer status. Statistical differences between the two patient classes and associated p values were calculated using T-tests, Chi-square tests and Fisher's exact tests where appropriate. PPD=packs per day, F=former smokers, C=current smokers, M=male, F=female.
  • Cancer No Cancer
    Samples
    18 17
    Age 66.1 +/− 11.4 62.2 +/− 11.1
    Smoking Status 66.7% F, 33.3% C 52.9% F, 47.1% C
    Gender* 66.6% M, 33.3% F 70.6% M, 29.4% F
    PackYears 46.7 +/− 28.8   60 +/− 44.3
    Age Started 16.4 +/− 7.3  14.2 +/− 3.8 
    Smoking intensity  1.1 +/− 0.44 1.2 +/− 0.9
    (PPD): Currents
    Months Quit: 153 +/− 135  93 +/− 147
    Formers
    *Two classes statistically different: p < 0.05
  • Airway Epithelial Cell Collection:
  • Bronchial airway epithelial cells were obtained from the subjects described above via flexible bronchoscopy. Following local anesthesia with 2% topical lidocaine to the oropharynx, flexible bronchoscopy was performed via the mouth or nose. Following completion of the standard diagnostic bronchoscopy studies (i.e. bronchoalveolar lavage, brushing and endo/transbronchial biopsy of the affected region), brushings were obtained via three endoscopic cytobrushes from the right mainstem bronchus. The cytobrush was rubbed over the surface of the airway several times and then retracted from the bronchoscope so that epithelial cells could be placed immediately in TRIzol solution and kept at −80° C. until RNA isolation was performed.
  • Given that these patients were undergoing bronchoscopy for clinical indications, the risks from our study were minimal, with less than a 5% risk of a small amount of bleeding from these additional brushings. The clinical bronchoscopy was prolonged by approximately 3-4 minutes in order to obtain the research samples. All participating subjects were recruited by IRB-approved protocols for informed consent, and participation in the study did not affect subsequent treatment. Patient samples were given identification numbers in order to protect patient privacy.
  • Microarray Data Acquisition and Preprocessing
  • Microarray data acquisition: 6-8 μg of total RNA from bronchial epithelial cells were converted into double-stranded cDNA with SuperScript II reverse transcriptase (Invitrogen) using an oligo-dT primer containing a T7 RNA polymerase promoter (Genset, Boulder, Colo.). The ENZO Bioarray RNA transcript labeling kit (Enzo Life Sciences, Inc, Farmingdale, N.Y.) was used for in vitro transcription of the purified double stranded cDNA. The biotin-labeled cRNA was then purified using the RNeasy kit (Qiagen) and fragmented into fragments of approximately 200 base pairs by alkaline treatment. Each cRNA sample was then hybridized overnight onto the Affymetrix HG-U133A array followed by a washing and staining protocol. Confocal laser scanning (Agilent) was then performed to detect the streptavidin-labeled fluor.
  • Preprocessing of array data via RMA: The Robust Multichip Average (RMA) algorithm was used for background adjustment, normalization, and probe-level summarization of the microarray samples in this study (Irizarry R A, et al., Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res 2003; 31(4):e15.). RMA expression measures were computed using the R statistical package and the justRMA function in the Affymetrix Bioconductor package. A total of 296 CEL files from airway epithelial samples included in this study as well as those previously processed in our lab were analyzed using RMA. RMA was chosen for probe-level analysis instead of Microarray Suite 5.0 because it maximized the correlation coefficients observed between 7 pairs of technical replicates (Supplemental Table 4).
  • TABLE 4
    Supplemental
    Pearson Correlation Coefficients (22,215 probe-sets)
    Affy log2Affy RMA
    Average 0.972 0.903 0.985
    SD 0.017 0.029 0.009
    Median 0.978 0.912 0.987
  • Supplemental Table 4 shows the Average Pearson Correlations between 7 pairs of replicate samples where probe-set gene expression values were determined using Microarray Suite 5.0 (Affy), logged data from Microarray Suite 5.0 (log 2 Affy), and RMA. RMA maximizes the correlation between replicate samples.
  • Sample filter: To filter out arrays of poor quality, each probeset on the array was z-score normalized to have a mean of zero and a standard deviation of 1 across all 152 samples. These normalized gene-expression values were averaged across all probe-sets for each sample. The assumption explicit in this analysis is that poor-quality samples will have probeset intensities that consistently trend higher or lower across all samples and thus have an average z-score that differs from zero. This average z-score metric correlates with Affymetrix MAS 5.0 quality metrics such as percent present (FIG. 7) and GAPDH 3′/5′ ratio. Microarrays that had an average z-score with a value greater than 0.129 (˜15% of the 152 samples) were filtered out. The resulting sample set consisted of 60 smokers with cancer and 69 smokers without cancer.
  • Prospective validation test set: CEL files for the additional 40 samples were added to the collection of airway epithelial CEL files described above, and the entire set was analyzed using RMA to derive expression values for the new samples. Microarrays that had an average z-score with a value greater than 0.129 (5 of the 40 samples) were filtered out. Class prediction of the 35 remaining prospective samples was conducted using the vote weights for the 80-predictive probesets derived from the training set of 77 samples using expression values computed in the section above.
  • Microarray Data Analysis
  • Class Prediction Algorithm: The 129-sample set (60 cancer samples, 69 no cancer samples) was used to develop a class-prediction algorithm capable of distinguishing between the two classes. One potentially confounding difference between the two groups is a difference in cumulative tobacco-smoke exposure as measured by pack-years. To insure that the genes chosen for their ability to distinguish patients with and without cancer in the training set were not simply distinguishing this difference in tobacco smoke exposure, the pack-years each patient smoked was included as a covariate in the training set ANCOVA gene filter.
  • In addition, there are differences in the pre-bronchoscopy clinical risk for lung cancer among the 129 patients. Three physicians reviewed each patient's clinical data (including demographics, smoking histories, and radiographic findings) and divided the patients into three groups: high, medium, and low pre-bronchoscopy risk for lung cancer (as described above). In order to control for differences in pre-bronchoscopy risk for lung cancer between the patients with and without a final diagnosis of lung cancer, the training set was constructed with roughly equal numbers of cancer and no cancer samples from a spectrum of lung cancer risk.
  • The weighted voting algorithm (Golub T R, et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 1999; 286(5439):531-537) was implemented as the class prediction method, with several modifications to the gene-selection methodology. Genes that varied between smokers with and without cancer in the training set samples after adjusting for tobacco-smoke exposure (p<0.05) were identified using an ANCOVA with pack-years as the covariate. Further gene selection was performed using the signal to noise metric and internal cross-validation where the 40 most consistently up- and the 40 most consistently down-regulated probesets were identified. The internal cross validation involved leaving 30% of the training samples out of each round of cross-validation, and selecting genes based on the remaining 70% of the samples. The final gene committee consisted of eighty probesets that were identified as being most frequently up-regulated or down-regulated across 50 rounds of internal cross-validation. The parameters of this gene-selection algorithm were chosen to maximize the average accuracy, sensitivity and specificity obtained from fifty runs. This algorithm was implemented in R and yields results that are comparable to the original implementation of the weighted-voted algorithm in GenePattern when a specific training, test, and gene set are given as input.
  • After determination of the optimal gene-selection parameters, the algorithm was run using a training set of 77 samples to arrive at a final set of genes capable of distinguishing between smokers with and without lung cancer. The accuracy, sensitivity and specificity of this classifier were tested against 52 samples that were not included in the training set. The performance of this classifier in predicting the class of each test-set sample was assessed by comparing it to runs of the algorithm where either: 1) different training/tests sets were used; 2) the cancer status of the training set of 77 samples were randomized; or 3) the genes in the classifier were randomly chosen (see randomization section below for details).
  • Randomization: The accuracy, sensitivity, specificity, and area under the ROC curve (using the signed prediction strength as a continuous cancer predictor) for the 80-probeset predictor (above) were compared to 1000 runs of the algorithm using three different types of randomization. First, the class labels of the training set of 77 samples were permuted and the algorithm, including gene selection, was re-run 1000 times (referred to in Supplemental Table 5 as Random 1).
  • Supplemental Table 5 below shows results of a comparison between the actual classifier and random runs (explained above). Accur=Accuracy, Sens=Sensitivity, Spec=Specificity, AUC=area under the curve, and sd=standard deviation. All p-value are empirically derived.
  • SUPPLEMENTAL TABLE 5
    p- p- p- p-
    Accur sd(Accur) value Sens sd(Sens) value Spec sd(Spec) value AUC sd(AUC) value
    Actual 0.827 0.8  0.844 0.897
    Classifier
    Random 1 0.491 0.171 0.018 0.487 0.219 0.114 0.493 0.185 0.015 0.487 0.223 0.004
    Random 2 0.495 0.252 0.078 0.496 0.249 0.173 0.495 0.263 0.073 0.495 0.309 0.008
    Random 3 0.495 0.193 0.021 0.491 0.268 0.217 0.498 0.17  0.006 0.492 0.264 0.007
  • The second randomization used the 80 genes in the original predictor but permuted the class labels of the training set samples over 1000 runs to randomize the gene weights used in the classification step of the algorithm (referred to in Supplemental Table 5 as Random 2).
  • In both of these randomization methods, the class labels were permuted such that half of the training set samples was labeled correctly. The third randomization method involved randomly selecting 80 probesets for each of 1000 random classifiers (referred to in Supplemental Table 5 as Random 3).
  • The p-value for each metric and randomization method shown indicate the percentage of 1000 runs using that randomization method that exceeded or was equal to the performance of the actual classifier.
  • In addition to the above analyses, the actual classifier was compared to 1000 runs of the algorithm where different training/test sets were chosen but the correct sample labels were retained. Empirically derived p-values were also computed to compare the actual classifier to the 1000 runs of the algorithm (see Supplemental Table 6). These data indicate that the actual classifier was derived using a representative training and test set.
  • SUPPLEMENTAL TABLE 6
    p- p- p- p-
    Accur sd(Accur) value Sens sd(Sens) value Spec sd(Spec) value AUC sd(AUC) value
    Actual 0.827 0.8  0.844 0.897
    Classifier
    1000 Runs 0.784 0.054 0.283 0.719 0.104 0.245 0.83  0.06 0.407 0.836 0.053 0.108
  • Supplemental Table 6 above shows a comparison of actual classifier to 1000 runs of the algorithm with different training/test sets.
  • Finally, these 1000 runs of the algorithm were also compared to 1000 runs were the class labels of different training sets were randomized in the same way as described above. Empirically derived p-values were computed to compare 1000 runs to 1000 random runs (Supplemental Table 7).
  • SUPPLEMENTAL TABLE 7
    p- p- p- p-
    Accur sd(Accur) value Sens sd(Sens) value Spec sd(Spec) value AUC sd(AUC) value
    1000 Runs 0.784 0.054 0.719 0.104 0.83  0.06  0.836 0.053
    1000 Random 0.504 0.126 0.002 0.501 0.154 0.025 0.506 0.154 0.003 0.507 0.157 0.001
    Runs
  • Supplemental Table 7 above shows comparison of runs of the algorithm using different training/test sets to runs where the class labels of the training sets were randomized (1000 runs were conducted).
  • The distribution of the prediction accuracies summarized in Supplemental Tables 6 and 7 is shown in FIG. 8.
  • Characteristics of the 1000 additional runs of the algorithm: The number of times a sample in the test set was classified correctly and its average prediction strength was computed across the 1000 runs of the algorithm. The average prediction strength when a sample was classified correctly was 0.54 for cancers and 0.61 for no cancers, and the average prediction strength when a sample was misclassified was 0.31 for cancer and 0.37 for no cancers. The slightly higher prediction strength for smokers without cancer is reflective of the fact that predictors have a slightly higher specificity on average. Supplemental FIG. 3 shows that samples that are consistently classified correctly or classified incorrectly are classified with higher confidence higher average prediction strength). Interestingly, 64% of the samples that are consistently classified incorrectly (incorrect greater than 95% of the time, n=22 samples) are samples from smokers that do not currently have a final diagnosis of cancer. This significantly higher false-positive rate might potentially reflect the ability of the biomarker to predict future cancer occurrence or might indicate that a subset of smokers with a cancer-predisposing gene-expression phenotype are protected from developing cancer through some unknown mechanism.
  • In order to further assess the stability of the biomarker gene committee, the number of times the 80-predictive probesets used in the biomarker were selected in each of the 1000 runs (Supplemental Table 6) was examined. (See FIG. 10A) The majority of the 80-biomarker probesets were chosen frequently over the 1000 runs (37 probesets were present in over 800 runs, and 58 of the probesets were present in over half of the runs). For purposes of comparison, when the cancer status of the training set samples are randomized over 1000 runs (Supplemental Table 7), the most frequently selected probeset is chosen 66 times, and the average is 7.3 times. (See FIG. 10B).
  • Comparison of RMA vs. MAS 5.0 and weighted voting vs. PAM: To evaluate the robustness of our ability to use airway gene expression to classify smokers with and without lung cancer, we examined the effect of different class-prediction and data preprocessing algorithms. We tested the 80-probesets in our classifier to generate predictive models using the Prediction Analysis of Microarrays (PAM) algorithm (Tibshirani R, et al., Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc Natl Acad Sci USA 2002; 99(10):6567-6572), and we also tested the ability of the WV algorithm to use probeset level data that had been derived using the MAS 5.0 algorithm instead of RMA. The accuracy of the classifier was similar when microarray data was preprocessed in MAS 5.0 and when the PAM class prediction algorithm was used (see Supplemental Table 8).
  • TABLE 8
    Supplemental
    Accuracy Sensitivity Specificity
    WV—RMA data 82.69% 80% 84.38%
    PAM—RMA data 86.54% 90% 84.38%
    WV—MASS data 82.69% 80% 84.38%
    PAM—MASS data 86.54% 95% 81.25%
  • Supplemental Table 8 shows a comparison of accuracy, sensitivity and specificity for our 80 probeset classifier on the 52 sample test set using alternative microarray data preprocessing algorithms and class prediction algorithms.
  • Prediction strength: The Weighted voting algorithm predicts a sample's class by summing the votes each gene on the class prediction committee gives to one class versus the other. The level of confidence with which a prediction is made is captured by the Prediction Strength (PS) and is calculated as follows:
  • P S = V winning - V losing V winning + V losing
  • Vwinning refers to the total gene committee votes for the winning class and Vlosing refers to the total gene committee votes for the losing class. Since Vwinning is always greater than Vlosing, PS confidence varies from 0 (arbitrary) to 1 (complete confidence) for any given sample.
  • In our test set, the average PS for our gene profile's correct predictions (43/52 test samples) is 0.73(+/−0.27), while the average PS for the incorrect predictions (9/52 test samples) is much lower: 0.49(+/−0.33; p<z; Student T-Test). This result shows that, on average, the Weighted Voting algorithm is more confident when it is making a correct prediction than when it is making an incorrect prediction. This result holds across 1000 different training/test set pairs (FIG. 11):
  • Cancer cell type: To determine if the tumor cell subtype affects the expression of genes that distinguish airway epithelium from smokers with and without lung cancer, Principal Component Analysis (PCA) was performed on the gene-expression measurements for the 80 probesets in our predictor and all of the airway epithelium samples from patients with lung cancer (FIG. 12). Gene expression measurements were Z(0,1) normalized prior to PCA. There is no apparent separation of the samples with regard to cancer subtype.
  • Link to Lung Cancer Tissue Microarray Dataset
  • Preprocessing of Bhattacharjee data: The 254 CEL files from HgU95Av2 arrays used by Bhattacharjee et al. (Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses. Proc Natl Acad Sci USA 2001; 98(24):13790-13795) were downloaded from the MIT Broad Institute's database available through internet (broad.mit.edu/mpr/lung). RMA-derived expression measurements were computed using these CEL files as described above. Technical replicates were filtered by choosing one at random to represent each patient. In addition, arrays from carcinoid samples and patients who were indicated to have never smoked were excluded, leaving 151 samples. The z-score quality filter described above was applied to this data set resulting in 128 samples for further analysis (88 adenocarcinomas, 3 small cell, 20 squamous, and 17 normal lung samples).
  • Probesets were mapped between the HGU133A array and HGU95Av2 array using Chip Comparer at the Duke University's database available through the world wide web at tenero.duhs.duke.edu/genearray/perl/chip/chipcomparer.pl. 64 probesets on the HGU95Av2 array mapped to the 80-predictive probesets. The 64 probesets on the HGU95Av2 correspond to 48 out of the 80 predictive probesets (32/80 predictive probesets have no clear corresponding probe on the HGU95Av2 array).
  • Analyses of Bhattacharjee dataset: In order to explore the expression of genes that we identified as distinguishing large airway epithelial cells from smokers with and without lung cancer in lung tumors profiled by Bhattacharjee, two different analyses were performed. Principal component analysis was used to organize the 128 Bhattacharjee samples according to the expression of the 64 mapped probesets. Principal component analysis was conducted in R using the package prcomp on the z-score normalized 128 samples by 64 probeset matrix. The normal and malignant samples in the Bhattacharjee dataset appear to separate along principal component 1 (see FIG. 17). To assess the significance of this result, the principal component analysis was repeated using the 128 samples and 1000 randomly chosen sets of 64 probesets. The mean difference between normal and malignant samples was calculated based on the projected values for principal component 1 for the actual 64 probesets and for each of the 1000 random sets of 64 probesets. The mean difference between normal and malignant from the 1000 random gene sets was used to generate a null distribution. The observed difference between the normal and malignant samples using the biomarker probesets was greater than the difference observed using randomly selected genes (p=0.026 for mean difference and p=0.034 for median difference).
  • The second analysis involved using the weighted voted algorithm to predict the class of 108 samples in the Bhattacharjee dataset using the 64 probe sets and a training set of 10 randomly chosen normal tissues and 10 randomly chosen tumor tissues. The samples were classified with 89.8% accuracy, 89.1% sensitivity, and 100% specificity (see Supplemental Table 9 below, Single Run). To examine the significance of these results, the weighted voted algorithm was re-run using two types of data randomization. First, the class labels of the training set of 20 samples were permuted and the algorithm, including gene selection, was re-run 1000 times (referred to in Supplemental Table 9 as Random 1). The second randomization involved permuting the class labels of the training set of 20 samples and re-running the algorithm 1000 times keeping the list of 64-probsets constant (referred to in Supplemental Table 9 as Random 2). In the above two types of randomization, the class labels were permuted such that half the samples were correctly labeled. The p-value for each metric and randomization method shown indicate the percentage of 1000 runs using that randomization method that exceeded or were equal to the performance of the actual classifier. Genes that distinguish between large airway epithelial cells from smokers with and without cancer are significantly better able to distinguish lung cancer tissue from normal lung tissue than any random run where the class labels of the training set are randomized.
  • SUPPLEMENTAL TABLE 9
    p- p- p- p-
    Accur sd(Accur) value Sens sd(Sens) value Spec sd(Spec) value AUC sd(AUC) value
    Single 0.898 0.891 1 0.984
    Run
    Random
    1 0.486 0.218 0.007 0.486 0.217 0.008 0.484 0.352 0.131 0.481 0.324 0.005
    Random 2 0.498 0.206 0.009 0.499 0.201 0.011 0.494 0.344 0.114 0.494 0.324 0.014
  • Supplemental Table 9 above shows results of a comparison between the predictions of the Bhattacharjee samples using the 64 probesets that map to a subset of the 80-predictive probesets and random runs (explained above). Accur=Accuracy, Sens=Sensitivity, Spec=Specificity, AUC=area under the curve, and sd=standard deviation.
  • Real Time PCR: Quantitative RT-PCR analysis was used to confirm the differential expression of a seven genes from our classifier. Primer sequences for the candidate genes and a housekeeping gene, the 18S ribosomal subunit, were designed with PRIMER EXPRESS® software (Applied Biosystems) (see Supplemental Table 10).
  • SUPPLEMENTAL TABLE 10
    Candidate and housekeeping
    gene primers for real time PCR assay
    Gene
    Symbol Affy ID Ensembl ID Name Forward Primer Reverse Primer
    BACH2 215907_at ENSG00000112182 BTB and CNC TGGCAAAACCGCATC ACCACCATGCCCAGC
    homology 1, basic TCTAC TAA
    leucine zipper (SEQ ID No. 1) (SEQ ID No. 2)
    transcription
    factor 2
    DCLRE1C 219678_x_at ENSG00000152457 DNA cross-link GCACTTTGAGGTGGG CCAGGCTGGTGTCGA
    repair 1C CAAT ACTC
    (SEQ ID No. 3) (SEQ ID No. 4)
    DUOX1 215800_at ENSG00000137857 dual oxidase 1 GAGAGAAAGCAAAGG CATGTGAGTCTGAAA
    AGTGAACTT TTACAGCATT
    (SEQ ID No. 5) (SEQ ID No. 6)
    FOS 209189_at ENSG00000170345 v-fos FBJ murine AGATGTAGCAAAACG CTCTGAAGTGTCACT
    osteosarcoma CATGGA GGGAACA
    viral oncogene (SEQ ID No. 7) (SEQ ID No. 8)
    homolog
    IL8 211506_s_at ENSG00000169429 interleukin 8 GCTAAAGAACTTAGA GGTGGAAAGGTTTGG
    TGTCAGTGCAT AGTATGTC
    (SEQ ID No. 9) (SEQ ID No. 10)
    RAB1A 207791_s_at ENSG00000138069 RAB1A, member GGAGCCCATGGCATC TTGAAGGACTCCTGA
    RAS oncogene ATA TCTGTCA
    family (SEQ ID No. 11) (SEQ ID No. 12)
    TPD52 201689_s_at ENSG00000076554 tumor protein D52 TGACTTGAGAGTGGA TTACTGTCACAAACG
    ACCTCCTA GTGCTAAA
    (SEQ ID No. 13) (SEQ ID No. 14)
    18S TTTCGGAACTGAGGC TTTCGCTCTGGTCCG
    CATG TCTT
    (SEQ ID No. 15) (SEQ ID No. 16)
    GAPDH TGCACCACCAACTGC GGCATGGACTGTGGT
    TTAGC CATGAG
    (SEQ ID No. 17) (SEQ ID No. 18)
    HPRT1 TGACACTGGCAAAAC GGTCCTTTTCACCAG
    AATGCA CAAGCT
    (SEQ ID No. 19) (SEQ ID No. 20)
    SDHA TGGGAACAAGAGGGC CCACCACTGCATCAA
    ATCTG ATTCATG
    (SEQ ID No. 21) (SEQ ID No. 22)
    TBP TGCACAGGAGCCAAG CACATCACAGCTCCC
    AGTGAA CACCA
    (SEQ ID No. 23) (SEQ ID No. 24)
    YWHAZ ACTTTTGGTACATTG CCGCCAGGACAAACC
    TGGCTTCAA AGTAT
    (SEQ ID No. 25) (SEQ ID No. 26)
  • Primer sequences for five other housekeeping genes (HPRT1, SDHA, YWHAZ, GAPDH, and TBP) were adopted from Vandesompele et al. (Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol 2002; 3(7)). RNA samples (1 μg of the RNA used in the microarray experiment) were treated with DNAfree (Ambion, Austin, Tex.), according to the manufacturer's protocol, to remove contaminating genomic DNA. Total RNA was reverse-transcribed using random hexamers (Applied Biosystems) and SuperScript II reverse transcriptase (Invitrogen). The resulting first-strand cDNA was diluted with nuclease-free water (Ambion) to 5 ng/μl. PCR amplification mixtures (25 μl) contained 10 ng template cDNA, 12.5 μl of 2× SYBR Green PCR master mix (Applied Biosystems) and 300 nM forward and reverse primers. Forty cycles of amplification and data acquisition were carried out in an Applied Biosystems 7500 Real Time PCR System. Threshold determinations were automatically performed by Sequence Detection Software (version 1.2.3) (Applied Biosystems) for each reaction. All real-time PCR experiments were carried out in triplicate on each sample (6 samples total; 3 smokers with lung cancer and 3 smokers without lung cancer).
  • Data analysis was performed using the geNorm tool (Id.). Three genes (YWHAZ, GAPDH, and TBP) were determined to be the most stable housekeeping genes and were used to normalize all samples. Data from the QRT-PCR for 7 genes along with the microarray results for these genes is shown in FIG. 13.
  • REFERENCES
  • (1) Parkin D M, et al., C A Cancer J Clin 2005; 55(2):74-108.
  • (2) Shields P G. Ann Oncol 1999; 10 Suppl 5:S7-11.
  • (3) Hirsch F R, et al., Clin Cancer Res 2001; 7(1):5-22.
  • (4) Jett J R. Clin Cancer Res 2005; 11(13 Pt 2):49885-4992s.
  • (5) Macredmond R, et al., Thorax 2006; 61(1):54-56.
  • (6) Postmus P E. Chest 2005; 128(1):16-18.
  • (7) Mazzone P, et al., Clin Chest Med 2002; 23(1):137-58, ix.
  • (8) Schreiber G, and McCrory DC. Chest 2003; 123(1 Suppl):115S-128S.
  • (9) Janssen-Heijnen M L, et al., Epidemiology 2001; 12(2):256-258.
  • (10) Salomaa E R, et al., Chest 2005; 128(4):2282-2288.
  • (11) Auerbach O, et al., Arch Environ Health 1970; 21(6):754-768.
  • (12) Powell C A, et al., Clin Cancer Res 1999; 5(8):2025-2034.
  • (13) Wistuba I I, et al., J Natl Cancer Inst 1997; 89(18):1366-1373.
  • (14) Franklin W A, et al., J Clin Invest 1997; 100(8):2133-2137.
  • (15) Guo M, et al., Clin Cancer Res 2004; 10(15):5131-5136.
  • (16) Miyazu Y M, et al., Cancer Res 2005; 65(21):9623-9627.
  • (17) Spira A, et al., Proc Natl Acad Sci USA 2004; 101(27):10143-10148.
  • (18) Bolstad B M, et al., Bioinformatics 2003; 19(2):185-193.
  • (19) Golub T R, et al, Science 1999; 286(5439):531-537.
  • (20) Tibshirani R, et al., Proc Natl Acad Sci USA 2002; 99(10):6567-6572.
  • (21) Bhattacharjee A, et al., Proc Natl Acad Sci USA 2001; 98(24):13790-13795.
  • (22) Subramanian A, et al., Proc Natl Acad Sci USA 2005; 102(43):15545-15550.
  • (23) Simon R, et al., J Natl Cancer Inst 2003; 95(1):14-18.
  • (24) Cheng K W, et al., Cancer Res 2005; 65(7):2516-2519.
  • (25) Shimada K, et al., Br J Cancer 2005; 92(10):1915-1921.
  • (26) Xie K. Cytokine Growth Factor Rev 2001; 12(4):375-391.
  • (27) Campa D, et al., Cancer Epidemiol Biomarkers Prey 2005; 14(10):2457-2458.
  • (28) Arimura Y, et al., Anticancer Res. 24, 4051-4057. 2004.
  • (29) Coussens L M, and Werb Z. Nature 2002; 420(6917):860-867.
  • (30) Godschalk R, et al., Carcinogenesis 2002; 23(12):2081-2086.
  • (31) Kamio T, et al., Blood 2003; 102(9):3317-3322
  • Example 4
  • Primary lung tumors and histologically normal lung tissue were collected from the tumor bank of Brigham and Women's Hospital. Research specimens were snap frozen on dry ice and stored at −140° C. Each sample was accompanied by an adjacent section embedded in Optimum Cutting Temperature Compound for histological confirmation The thoracic surgery clinical data-base was abstracted for details of smoking history, clinical staging and other demographic details. From the tumor bank, six cases of adenocarcinoma in life-time never smokers were selected and six cases of adenocarcinoma from cigarette smokers were then chosen for comparison by matching for the following criteria in a descending hierarchy of priority: (1) cell type; (2) histological stage of differentiation; (3) pathologic TNM stage; and (4) patient age (Table 6). All of the subjects except for one smoker were female. The collection of anonymous discarded tumor specimens was approved by the Brigham and Womens Institutional Review Board Hospital and the study was approved by the Human Studies Committee of Boston University Medical Center. Once the cases were selected, specimens and clinical data were de-identified in accordance with the discarded tissue protocol governing the study; thus, linkage of each paired tumor and normal tissue sample with specific additional clinical characteristics other than smoking status, cell type, differentiation and gender was not possible.
  • Histological sections were reviewed by a pathologist, blinded to original pathological diagnosis. Tumor histology agreed in all cases and the mean percentage of tumor in each sample was 60%, DNA was extracted from tumor and non-involved samples using QIAamp Tissue Kit (Qiagen, Valencia, Calif.). LOH studies were performed using fluorescent microsatellite LOH analysis as described previously (Powell Calif., et al., Clin. Cancer Res., 5:2025-34 (1999)).
  • Tumor and normal lung DNA templates from samples were amplified with a panel of 52 fluorescent PCR primers from ten chromosomal regions that have been reported to harbor lung cancer tumor suppressor genes or have demonstrated LOH in lung tumors or bronchial epithelium of cigarette smokers. Based on our prior studies and results of other investigators using fluorescent methods to detect LOH, we defined LOH as a >20% change in normalized allele height ratio (FIG. 22) (Liloglou T, et al., Cancer Res., 61:1624-1628 (2001); Liloglou T, et al., Int. J. Oneal., 16:5-14 (2000)). All instances of LOH were verified by repetition and the mean allele height ratio was used for data analysis. LOH was measured by comparing tumor DNA to nonmalignant lung DNA rather than to lymphocyte DNA, which was unavailable for this study. Thus, LOH represented allelic loss between two somatic sites in the same lung, rather than between tumor tissue and constitutional genomic DNA.
  • The extent of LOH was expressed as fractional allelic loss (FAL) which equals the number of primers with LOH per template/number of informative primers. Fisher exact test and ×2 were used to determine the difference in FAL in smokers compared with nonsmokers.
  • Results. All tumors demonstrated LOH in at least one microsatellite on each of the ten chromosomal arms evaluated in this study (Table 7). With respect to nonmalignant lung epithelium, LOH was more frequent in the tumors of nonsmokers than in those of smokers (FIG. 23). FAL ranged from 6 to 93% with a mean of 46%, in nonsmokers, and from 2 to 60% with a mean of 28%, in smokers (P<0.05). In the pairwise comparison of nonsmokers and clinically matched smokers, LOH was more frequent in five of six nonsmokers.
  • Chromosomes 1.0p, 9p, and 5q were the most frequent sites of LOH in nonsmokers' tumors while 9p and 5q were the most frequent sites in smokers. Increased FAL in nonsmokers was most pronounced at five chromosomal arms: 3p, 8p, 9p, 10p, and 18q with FAL ranging from 55 to 87%. These microsatellites harbor several known or candidate tumor suppressor genes such as FHIT, DLCL (Daigo Y, et al., Cancer Res., 59:1966-1972 (1999)), RASSFI (Dammam R, el al., Nat. Genet., 25:315-319 (2000)) (chromosome 3p), PRK (Li B, et al., J Biol. Chem., 271:19402-19408 (1996) (chromosome 8p), p16 (chromosome 9p), SMAD2 and SMAD4 (Takei K, etal., Cancer Res., 58:3700-3705 (1998)) (chromosome 18q).
  • In most tumors, there were instances of microsatellites demonstrating LOH interspersed with microsatellites that retained heterozygosity (see chromosome 1p in subject S3, Table 7). This pattern of discontinuous allelic loss was evident on all chromosomes that were evaluated, and is considered a potential mutational signature of lung carcinogenesis attributable to mitotic recombination (Wistuba, II, Behrens C, et al., Cancer Res., 60:19491960 (2000)).
  • However, in other instances there was LOH at a number of contiguous loci suggesting larger chromosomal deletions (see chromosome 3p in subject NS3, Table 7). This was particularly true on 3p, a fragile site previously found to be involved in smokers with and without tumors.
  • Example 5
  • Methods. Samples of epithelial cells, obtained by brushing airway surfaces, were obtained from intra- and extra-pulmonary airways in 11 normal non-smokers (NS), 15 smokers without lung cancer (S), and 9 smokers with lung cancer (SC). 5-10 ug of RNA was extracted using standard trizol-based methods, quality of RNA was assayed in gels, and the RNA was processed using standard protocols developed by Affymetrix for the U133 human array. Expression profiles, predictive algorithms, and identification of critical genes are made using bioinformatic methods.
  • Results. There are 5169 genes in the NS Transcriptome, 4960 genes in the S Transcriptome, and 5518 genes in the SC Transcriptome. There are 4344 genes in common between the 3 Transcriptomes. There are 327 unique genes in the NS Transcriptome, 149 unique genes in the S Transcriptome, and 551 unique genes in the SC Transcriptome. FIGS. 20A-20F show a list of genes which are differentially expressed in smokers and non-smokers. FIGS. 21A-21B show a list of genes which are differentially expressed in smokers and smokers with lung cancer. T-test statistical results are shown.
  • Example 6
  • There are approximately 1.25 billion daily cigarette smokers in the world (1). Cigarette smoking is responsible for 90% of all lung cancers, the leading cause of cancer deaths in the US and the world (2, 3). Smoking is also the major cause of chronic obstructive pulmonary disease (COPD), the fourth leading cause of death in the US (4). Despite the well-established causal role of cigarette smoking in lung cancer and COPD, only 10-20% of smokers actually develop these diseases (5). There are few indicators of which smokers are at highest risk for developing either lung cancer or COPD, and it is unclear why individuals remain at high risk decades after they have stopped smoking (6).
  • Given the burden of lung disease created by cigarette smoking, surprisingly few studies(7, 8) have been done in humans to determine how smoking affects the epithelial cells of the pulmonary airways that are exposed to the highest concentrations of cigarette smoke or what smoking-induced changes in these cells are reversible when subjects stop smoking. With the two exceptions noted above, which examine a specific subset of genes in humans, studies investigating the effects of tobacco on airway epithelial cells have been in cultured cells, in human alveolar lavage samples in which alveolar macrophages predominate, or in rodent smoking models (summarized in Gebel et al(9)).
  • A number of recent studies have used DNA microarray technology to study normal and cancerous whole lung tissue and have identified molecular profiles that distinguish the various subtypes of lung cancer as well as predict clinical outcome in a subset of these patients(10-13).
  • Based on the concept that genetic alterations in airway epithelial cells of smokers represent a “field defect”(14, 15), we obtained human epithelial cells at bronchoscopy from brushings of the right main bronchus proximal to the right upper lobe of the lung, and defined profiles of gene expression in these cells using the U133A GeneChip® array (Affymetrix Inc., Santa Clara, Calif.). We here describe the subset of genes expressed in large airway epithelial cells (the airway transcriptome) of healthy never smokers, thereby gaining insights into the biological functions of these cells.
  • Surprisingly, we identified a large number of genes whose expression is altered by cigarette smoking, defined genes whose expression correlates with cumulative pack years of smoking, and identified genes whose expression does and does not return to normal when subjects discontinue smoking.
  • In addition, we identified a subset of smokers who were “outliers” expressing some genes in a fashion that significantly differed from most smokers. One of these “outliers” developed lung cancer within 6 months of expression profiling, suggesting that gene expression profiles of smokers with cancer differ from that of smokers without lung cancer.
  • Materials and Methods:
  • Study Population and Sample Collection: We recruited non-smoking and smoking subjects (n=93) to undergo fiberoptic bronchoscopy at Boston Medical Center between November 2001 and June 2003. Non-smoking volunteers with significant environmental cigarette exposure and subjects with respiratory symptoms or regular use of inhaled medications were excluded. For each subject, a detailed smoking history was obtained including number of pack-years, number of packs per day, age started, age quit, and environmental tobacco exposure.
  • All subjects in our study underwent fiberoptic bronchoscopy between November 2001 and June 2003. Risks from the procedure were minimized by carefully screening volunteers (medical history, physical exam, chest X-ray, spirometry and EKG), by minimizing topical lidocaine anesthesia, and by monitoring the EKG and SaO2 throughout the procedure. After passage of the bronchoscope through the vocal cords, brushings were obtained via 3 cytobrushes (CELEBRITY Endoscopy Cytology Brush, Boston Scientific, Boston, Mass.) from the right upper lobe bronchus.
  • Bronchial airway epithelial cells were obtained from brushings of the right mainstem bronchus taken during fiberoptic bronchoscopy using an endoscopic cytobrush (CELEBRITY Endoscopy Cytology Brush, Boston Scientific, Boston, Mass.). The brushes were immediately placed in TRizol reagent (Invitrogen, Carlsbad, Calif.) after removal from the bronchoscope and kept at −80° C. until RNA isolation was performed. Any other RNA protection protocol known to one skilled in the art can also be used. RNA was extracted from the brushes using TRizol Reagent (Invitrogen) as per the manufacturer protocol, with a yield of 8-15 μg of RNA per patient. Other methods of RNA isolation or purification can be used to isolate RNA from the samples. Integrity of the RNA was confirmed by running it on a RNA denaturing gel. Epithelial cell content of representative bronchial brushing samples was quantified by cytocentrifugation (ThermoShandon Cytospin, Pittsburgh, Pa.) of the cell pellet and staining with a cytokeratin antibody (Signet, Dedham Mass.). The study was approved by the Institutional Review Board of Boston University Medical Center and all participants provided written informed consent.
  • Microarray Data Acquisition and Preprocessing: We obtained sufficient quantity of good quality RNA for microarray studies from 85 of the 93 subjects recruited into our study. Total RNA was processed, labeled, and hybridized to Affymetrix HG-U133A GeneChips containing approximately 22,500 human genes, any other type of nucleic acid or protein array may also be used. Six to eight μg of total RNA from bronchial epithelial cells was converted into double-stranded cDNA with the SuperScript II reverse transcriptase (Invitrogen) using an oligo-dT primer containing a T7 RNA polymerase promoter (Genset, Boulder, Colo.). The ENZO Bioarray RNA transcript labeling kit (Affymetrix) was used for in vitro transcription of the purified double stranded cDNA. The biotin-labeled cRNA was purified using the RNeasy kit (Qiagen) and fragmented into approximately 200 base pairs by alkaline treatment (200mM Tris-acetate, pH 8.2, 500 mM potassium acetate, 150 mM magnesium acetate). Each verified cRNA sample was then hybridized overnight onto the Affymetrix HG-U133A array and confocal laser scanning (Agilent) was then performed to detect the streptavidin-labeled fluor. A single weighted mean expression level for each gene along with a p(detection)-value (which indicates whether the transcript was reliably detected) was derived using Microarray Suite 5.0 software (Affymetrix, SantaClara, Calif.).
  • Using a one-sided Wilcoxon Signed rank test, the MAS 5.0 software also generated a detection p-value (p(detection)-value) for each gene which indicates whether the transcript was reliably detected. We scaled the data from each array in order to normalize the results for inter-array comparisons. Microarray data normalization was accomplished in MAS 5.0, where the mean intensity for each array (top and bottom 2% of genes excluded) was corrected (by a scaling factor) to a set target intensity of 100. The list of genes on this array is available at hap://www.affymetrix.com/analysis/download center.affx.
  • Arrays of poor quality were excluded based on several quality control measures. Each array's scanned image was required to be free of any significant artifacts and the bacterial genes spiked into the hybridization mix had to have a P(detection)-value below 0.05 (called present). If an array passed this criteria, it was evaluated based on three other quality measures: the 3′ to 5′ ratio of the intensity for Glyceraldehyde-3-phosphate dehydrogenase (GAPDH), the percent of genes detected as present, and the percent of “outlier” genes as determined by a computational algorithm we developed (see httn://pulm.bumc.bu.edu/aged/supplemental.html for further details, which are herein incorporated by reference).
  • In addition to the above set of rules, one further quality control measure was applied to each array. While cytokeratin stains of selected specimens reveal that approximately 90% of nucleated cells are epithelial, we developed a gene filter to exclude specimens potentially contaminated with inflammatory cells. A group of genes on the U133A array was identified that should be expressed in bronchial epithelial cells as well as a list of genes that are specific for various lineages of white blood cells and distal alveolar epithelial cells (see FIGS. 32 and 33). Arrays whose 90th percentile for the p(detection)-value was more than 0.05 for genes that should be detected in epithelial cells or whose 80th percentile p(detection)-value was less than 0.05 for genes that should not be expressed in bronchial epithelial cells were excluded from the study 10 of the 85 samples were excluded based on the quality control filter and the epithelial content filter described above (see http://pulm.bumc.bu.edu/aged/supplemental.html for details regarding excluded samples).
  • In addition to filtering out poor quality arrays, a gene filter was applied to remove genes that were not reliably detected. From the complete set of ˜22500 probesets on the U133 array, we filtered out probesets whose p(detection)-value was not less than 0.05 in at least 20% of all samples. 9968 probesets passed our filter and were used in all further statistical analyses for the dataset.
  • Microarray Data Analysis: Clinical information and array data as well as gene annotations are stored in an interactive MYSQL database coded in Pert available at http://pulm.bumc.bu.edu/aged/index.html. All statistical analyses below and within the database were performed using R software version 1.6.2 (available at http://r-project.org). The gene annotations used for each probe set were from the October 2003 NetAffx HG-U13 3A Annotation Files.
  • Technical, spatial (right and left bronchus from same subject) and temporal (baseline and at 3 months from same subject) replicates were obtained from selected subjects for quality control. Pearson correlations were calculated for technical, spatial and temporal replicate samples from the same individual. RNA isolated from the epithelial cells of one patient was divided in half and processed separately as detailed in the methods for the technical replicates (data not shown). Different brushings were obtained from the right and left airways of the same patient and processed separately for the spatial replicates (FIG. 27A). Brushings of the right airway were obtained approximately 3 months apart and processed separately for the temporal replicates (FIG. 27B).
  • In addition to the correlation graphs in FIGS. 27A and 27B, two systematic approaches were implemented to assess the variability between replicates versus the variability between unrelated samples. Pearson correlation coefficients were computed between replicates as well as between unrelated samples within a group (never or current smoker) and between groups (never versus current smoker) using the filtered gene list (9968 genes). FIG. 35 reports the mean R squared values for each of the four comparisons. The results demonstrate that the mean correlation among replicates is higher than between two unrelated samples, and that the within group correlations between unrelated samples are higher than the between group correlations between unrelated samples.
  • The second approach uses a different methodology, but yields similar results to those described in FIG. 35. For each of the 9968 genes, a differential gene expression ratio was computed between replicate samples and between all possible combinations of two unrelated samples (Lenburg M, Liou L, Gerry N, Frampton G, Cohen H & Christman M. (2003) BMC Cancer 3 , 31). A histogram of the log base 2 ratio values or fold changes is displayed in FIG. 27C. The number of fold changes computed for the replicate samples is less than the number of fold changes computed for unrelated samples, therefore, the frequencies in the histogram are calculated as a percent of the total fold changes calculated. As expected, the histogram clearly shows that there is less variability among the replicate samples. In the replicate samples there is a higher frequency of genes having a fold change close to or equal to one compared to unrelated samples.
  • An unsupervised analysis of the microarray data was performed by hierarchal clustering the top 1000 most variable probe sets (determined by coefficient of variation) across all samples using log transformed z-score normalized data. The analysis was performed using a Pearson correlation (uncentered) similarity metric and average linkage clustering with CLUSTER and TREEVIEW software programs obtained at http://rana.1bI.gov/EisenSoftware.htm (see FIG. 28).
  • The normal large airway transcriptome was defined by the genes whose median p(detection)-value was less than 0.05 across all 23 healthy never smokers (7119 genes expressed across majority of subjects), as well as a subset of these 7119 genes whose p(detection)-value was less than 0.05 in all 23 subjects (2382 genes expressed across all subjects). The coefficient of variation for each gene in the transcriptome was calculated. as the standard deviation divided by the mean expression level multiplied by 100 for that gene across all nonsmoking individuals. In order to identify functional categories that were over- or underrepresented within the airway transcriptome, the GOMINER software (16) was used to functionally classify the genes expressed across all nonsmokers (2382 probesets) by the molecular function categories within Gene Ontology (GO). Multiple linear regressions were performed on the top ten percent most variable probesets (712 probesets, as measured by the coefficient of variation) in the normal airway transcriptome (7119 probesets) in order to study the effects of age, gender, and race on gene expression.
  • It should be noted, that genes expressed at low levels are not necessarily accurately detected by microarray technology. The probe sets which define the normal airway transcriptome, therefore, will represent genes which are expressed at a measurable level in either the majority or all of the nonsmoking healthy subjects. One of the limitations to this approach, however, is that we will be excluding genes expressed at low levels in the normal airway transcriptome.
  • Multiple linear regressions were performed on the top ten percent most variable genes (712 genes, as measured by the coefficient of variation, defined here as sd/mean*100) in the normal airway transcriptome (7119 genes) in order to study the effects of age, gender, and race on gene expression (see FIGS. 36A-36C) using R statistical software version 1.6.2. FIG. 29 shows that the majority of genes in the normal airway transcriptome have coefficients of variation below 50. As a result, we choose to focus on a smaller subset of the 7119 genes, specifically the top ten percent most variable genes, in order to explore whether or not various demographic variables could explain the patterns of gene expression. The coefficients of variation for the top ten percent most variable genes ranged from 50.78 to 273.04. A general linear model was used to explore the relationship between gene expression and age (numerical variable), race (categorical variable with two groups Caucasian or Other), and gender (categorical variable). The model included the three main effects plus the three possible two-way interactions. Models having a p-value less than 0.01 (83 genes) were chosen for further analysis. For each of these models, the following diagnostic plots were assessed: residuals versus the fitted values plot, normal Q-Q plot, and Cook's distance plot. Based on the graphs, 13 models were removed because the residuals were not normally distributed or had unequal variance. The regression results for the remaining 70 genes are included in FIGS. 36A-36C as well as the p-values for the significant regressors (p<=0.01). The age:race interaction term is absent from the table because none of the models had p-values less than 0.01 for this term.
  • To examine the effect of smoking on the airway, a two-sample t-test was used to test for genes differentially expressed between current smokers (n=34) and never smokers (n=23). In order to quantify how well a given gene's expression level correlates with number of pack-years of smoking among current smokers, Pearson correlation coefficients were calculated (see supplementary information). For multiple comparison correction, a permutation test was used to assess the significance of our p-value threshold for any given gene's comparison between two groups (p(t-test)-value) or between a clinical variable (p(t-test)-value) (see supporting information for details). In order to further characterize the behavior of current smokers, two-dimensional hierarchical clustering of all never smokers and current smokers using the genes that were differentially expressed between current vs. never smokers was performed. Hierarchical clustering of the genes and samples was performed using log transformed z-score normalized data using a Pearson correlation (uncentered) similarity metric and average linkage clustering using CLUSTER and TREEVIEW software programs.
  • Multidimensional scaling and principal component analysis were used to characterize the behavior of former smokers (n=18) based on the set genes differentially expressed between current and never smokers using Partek 5.0 software (http://www.partek.com). In addition, we executed an unsupervised hierarchical clustering analysis of all 18 former smokers according to the expression of the genes differentially expressed between current and never smoker. In order to identify genes irreversibly altered by cigarette smoking, we performed at-test between former smokers (n=18) and never smokers (n=23) across the genes that were considered differentially expressed between current and never smokers. Coefficients of variation (sd/mean*100) were computed across never, former, and current smoker subjects for each of the 9968 probesets. The top 1000 most variable probesets (%CV>56.52) were selected and hierarchical clustering of these probesets and samples was performed using log transformed z-score normalized data using a Pearson correlation (uncentered) similarity metric and average linkage clustering using CLUSTER and TREEVIEW software programs obtained at http://rana.1bI.gov/EisenSoftware.htm. The clustering dendogram of the samples is displayed in FIG. 28. The samples do not cluster according to their classification of never, former, or current smokers, and therefore, a supervised approach was needed (see below). In addition, the dendogram does not reveal a clustering pattern that is related to technical variation in the processing of the samples. Table 7 below List of genes whose expression did not return to normal even after about 20 years of smoking:
  • TABLE 7
    Affymetrix ID Gene Symbol
    213455 at LOC92689
    823_at CX3CL1
    204755_x_at HLF
    204058_at MEl
    217755_at HNl
    207547_s_at TU3A
    211657_at CEACAM6
    213629_x_at MTlF
    214106_s_at GMDS
    207222_at PLA2G10
    204326_x_at MT1X
    201431_s_at DPYSL3
    204754_at HLF
    208581_x_at MT1X
    215785_s_at CYFIP2
  • Given the invasive nature of the bronchoscopy procedure, we were unable to recruit age-, race- and gender-matched patients for the smoker vs. nonsmoker comparison. Due to baseline differences in age, gender, and race between never and current smoker groups (see FIG. 34), we performed an ANCOVA to test the effect of smoking status (never or current) on gene expression while controlling for the effects of age (the covariate). In addition, a two way ANOVA was performed to test the effect of smoking status (never or current) on gene expression while controlling for the fixed effects of race (encoded as three racial groups: Caucasian, African American, and other) or gender and the interaction terms of status:race or status:gender. Both the ANCOVA and two-way ANOVA were performed with Partek 5.0 software.
  • Genes that distinguish smokers with and without cancer. In order to identify airway gene expression profiles diagnostic of lung cancer, a two-sample t-test was performed to test for genes differentially expressed between smokers with lung cancer (n=23) and smokers without lung cancer (n=45). 202 genes were differentially expressed. between the groups at p<0.001 (see table 6). In order to correct for multiple comparisons, we calculated a q-value (Storey J D & Tibshirani R (2003). Proc. Natl. Acad. Sci. U.S.A 100, 9449-9445) for each gene, which represents the proportion of false positives present in the group of genes with smaller p-values than the gene.
  • Outlier genes among current smokers: Among airway epithelial genes altered by cigarette smoke, there are a number of genes expressed at extremely high or low levels among a subset of current smokers. In order to identify these “outlier genes,” we performed a Grubbs test on the 320 genes differentially expressed between current (n=34) and never (n=23) smokers at p<0.001. Nine genes were found to be outliers in 3 or more of the current smokers (see table 7). These divergent patterns of gene expression in a small subset of smokers represent a failure to mount an appropriate response to cigarette exposure and may be linked to increased risk for developing lung cancer. As a result, these “outlier” genes can thus serve as biomarkers for susceptibility to the carcinogenic effects of cigarette smoke.
  • Quantitative PCR Validation: Real time PCR(QRT-PCR) was used to confirm the differential expression of a select number of genes. Primer sequences were designed with Primer Express software (Applied Biosystems, Foster City, Calif.). Forty cycles of amplification, data acquisition, and data analysis were carried out in an ABI Prism 7700 Sequence Detector (Applied Biosystems, Foster City, Calif.). All real time PCR experiments were carried out in triplicate on each sample.
  • In further detail, real time PCR (QRT-PCR) primer sequences were designed with Primer Express software (Applied Biosystems, Foster City, Calif.) based on alignments of candidate gene sequences. RNA samples (500 ng of residual sample from array experiment) were treated with DNAfree (Ambion), as per the manufacturer protocol, to remove contaminating genomic DNA. Total RNA was reverse transcribed using Superscript II (Gibco). Five microliters of the reverse transcription reaction was added to 45 μl of SYBR Green PCR master mix (Applied Biosystems). Forty cycles of amplification, data acquisition, and data analysis were carried out in an ABI Prism 7700 Sequence Detector (PE Applied Biosystems). Threshold determinations were automatically performed by the instrument for each reaction. The cycle at which a sample crosses the threshold (a PCR cycle where the fluorescence emission exceeds that of nontemplate controls) is called the threshold cycle, or CT. A high CT value corresponds to a small amount of template DNA, and a low CT corresponds to a large amount of template present initially. All real time PCR experiments were carried out in triplicate on each sample (mean of the triplicate shown). Data from the QRT-PCR for 5 genes that changed in response to cigarette exposure along with the tnicroarray results for these genes is shown in FIGS. 31A-31E.
  • Additional Information: Additional information from this study including the raw image data from all microarray samples (.DAT files), expression levels for all genes in all samples (stored in a relational database), user-defined statistical and graphical analysis of data and clinical data on all subjects is available at http://pulm.bumc.bu.edu/aged/. Data from our microarray experiments has also been deposited in NCBI's Gene Expression Omnibus under accession GSE994.
  • Results and Discussion: Study Population and replicate samples. Microarrays from 75 subjects passed the quality control filters described above and are included in this study. Demographic data on these subjects, including 23 never smokers, 34 current smokers, and 18 former smokers, is presented in FIG. 34. Bronchial brushings yielded 90% epithelial cells, as determined by cytokeratin staining, with the majority being ciliated cells. Samples taken from the right and left main bronchi in the same individual were highly reproducible with an R2 value of 0.92, as were samples from the same individual taken 3 months apart with an R2 value of 0.85 (see FIGS. 27A-27C).
  • The Normal Airway Transcriptome: 7119 genes were expressed at measurable levels in the majority of never smokers and 2382 genes were expressed in all of the 23 healthy never smokers. There was relatively little variation in expression levels of the 7119 genes; 90% had a coefficient of variation (SD/mean) of <50% (see FIG. 29). Only a small part of the variation between subjects could be explained by age, gender or race on multiple linear regression analysis (see FIGS. 36A-36C).
  • Table 6 depicts the GOMINER molecular functions(16) of the 2382 genes expressed in large airway epithelial cells of all healthy never smokers. Genes associated with oxidant stress, ion and electron transport, chaperone activity, vesicular transport, ribosomal structure and binding functions are over-represented. Genes associated with transcriptional regulation, signal transduction, pores and channels are under-represented as well as immune, cytokine and chemokine genes. Upper airway epithelial cells, at least in normal subjects, appear to serve as an oxidant and detoxifying defense system for the lung, but serve few other complex functions in the basal state.
  • Table 6: GOMINER molecular functions of genes in airway epithelial cells. Major molecular functional categories and subcategories of 2382 genes expressed in all never smoker subjects. Over- or under-representation of categories is deteiniined using Fisher's Exact Test. The null hypothesis is that the number of genes in our flagged set belonging to a category divided by the total number of genes in the category is equal to the number of flagged genes NOT in the category divided by the total number of genes NOT in the category. Equivalency in these two proportions is consistent with a random distribution of genes into functional categories and indicates no enrichment or depletion of genes in the category being tested. Categories considered to be statistically (p (GO)<0.05) over- or under-represented by GOMINER are shown.
  • Cells/arrays refers to the ratio of the number of genes expressed in epithelial cells divided by the number of genes on U133A array in each functional category. Actual numbers are in parentheses.
  • TABLE 6
    Over represented Under represented
    Molecular Functions (cells/array) (cells/array)
    Binding Activity
    RNA binding 0.76 (273/366)
    Translation 0.72 (72/101)
    Transcription 0.30 (214/704)
    GTP binding 0.55 (106/194)
    GTPase 0.55 (83/152)
    G nucleotide 0.52 (128/246)
    Receptor 0.20 (79/396)
    Chaperone 0.62 (80/119)
    Chemokine 0.24 (10/42)
    Cytokine 0.20 (39/194)
    Enzyme activity 0.46 (1346/2925)
    Oxidoreductase 0.54 (225/417)
    Isomerase 0.56 (48/82)
    Signal transduction 0.29 (490/1716)
    Structural 0.46 (253/548)
    Transcription 0.35 (321/917)
    regulator
    Transporter
    Carrier 0.48 (175/363)
    Ion 0.56 (130/231)
    Anion 0.26 (15/61)
    Cation 0.64 (116/180
    Metal 0.68 (42/62)
    Electron 0.58 (131/226)
    Channel/pore 0.16 (43/269)
  • Effects of Cigarette Smoking on the Airway Transcriptome: Smoking altered the airway epithelial cell expression of a large number of genes. Ninety-seven genes were found to he differentially expressed by t-test between current and never smokers at p<1.06*10−5. This (p(t-test)-value) threshold was selected based on a permutation analysis performed to address the multiple comparison problem inherent in ally microarray analysis (see supporting information for further details). We chose a very stringent multiple comparison correction and (p(t-test)-value) threshold in order to identify a subset of genes altered by cigarette smoking with only a small probability of having a false positive. Of the 97 genes that passed the permutation analysis, 68 (73%) represented increased gene expression among current smokers. The greatest increases were in genes that coded for xenobiotic functions such as CYP1B1 (30 fold) and DBDD (5 fold), antioxidants such as GPX2 (3 fold), and. ALDH3A1 (6 fold) and genes involved in electron transport such as NADPH (4 fold). In addition, several cell adhesion molecules, CEACAM6 (2 fold) and claudin 10 (3 fold), were increased in smokers, perhaps in response to the increased permeability that has been found on exposure to cigarette smoke(17). Genes that decreased included TU3A (−4 fold), MMP10 (−2 fold), HLF (−2 fold), and CX3CL1 (−2 fold). In general, genes that were increased in smokers tended to be involved in regulation of oxidant stress and glutathione metabolism, xenobiotic metabolism, and secretion. Expression of several putative oncogenes (pirin, CA12, and CEACAM6) were also increased. Genes that decreased in smokers tended to be involved in regulation of inflammation, although expression of several putative tumor suppressor genes (TU3A, SLIT1 and 2, GAS6) were decreased. Changes in the expression of select genes were confirmed by real time RT-PCR (see FIGS. 31A-31E).
  • FIG. 24 shows two-dimensional hierarchical clustering of all the current and never smokers based on the 97 genes that are differentially expressed between the two groups (tree for genes not shown). There were three current smokers (patients 456, 4147 and 4164) whose expression of a subset of genes was similar to that of never smokers. These three smokers, who were similar clinically to other smokers, also segregated in the same fashion when clusters were based on the top 361 genes differentially expressed. between never and current smokers (p<0.001), Expression of a number of redox-related and xenobiotic genes was not increased in these 3 smokers (147C, 164C, 56C), and therefore, their profile resembled that of never smokers despite their substantial and continuing exposure to cigarette smoke. Thus, these individuals failed to increase expression of a number of genes that serve as protective detoxification and anti-oxidant genes, potentially putting them a risk of more severe smoking-related damage. Whether or not these differences represent genetic polymorphisms, and whether these individuals represent the 10-15% of smokers who ultimately develop lung cancer is uncertain. However, one of these subjects (147C) subsequently developed lung cancer during one year follow up, suggesting some link between the divergent patterns of gene expression and presence of or risk for developing lung cancer. There was also a subset of four additional current smokers who clustered with current smokers, but did not up-regulate expression of a cluster of predominantly redox/xenobiotic genes to the same degree as other smokers, although none of these smokers had developed lung cancer in six months of follow up. In addition, there is a never smoker (167N) who is an outlier among never smokers and expresses a subset of genes at the level of current smokers. We reviewed this subject's clinical history and were unable to identify any obvious environmental exposures (i.e. second hand smoke exposure) that might explain the divergent pattern of gene expression.
  • As might be expected, changes in gene expression were also correlated with cumulative cigarette exposure (pack-years). While 159 and 661 genes correlated with cumulative smoking history at p<0.001 and p<0.01 levels respectively (see FIGS. 37A-37B), only 5 genes correlated with pack-years at the p<3.1×106threshold (based on permutation analysis; see supporting information for details). They include cystatin, which has been shown to correlate with tumor growth and inflammation(18), HBP17 has been shown to enhance FGF growth factor activity(19), and BRD2, which is a transcription factor that acts with E2F proteins to induce a number of cell cycle-related genes(20). Among the genes that were correlated at the p <0.0001 level, there were a number of genes that decreased with increasing cumulative smoking history including genes that are involved in DNA repair (RPA1).
  • Due to baseline differences in age, sex, and race between never and current smoker groups, ANCOVA and 2-way ANOVA were performed to test the effect of smoking status on gene expression while controlling for the effects of age, gender, race and two-way interactions. Many of the genes found to be modulated by smoking in this analysis were also found using the simpler t-test. Age and gender had little effect on gene expression changes induced by smoking, while race appeared to influence the effect of smoking on the expression of a number of genes. The ANOVA analysis controlling for race yielded 16 genes, not included in the set of 97 genes differentially expressed between current and never smokers (see FIGS. 39A-39B). Given the relatively small sample size for this subgroup analysis, these observations must be confirmed in a larger study but may account in part for the reported increased incidence of lung cancer in African American cigarette smokers(21).
  • Thus, the general effect of smoking on large airway epithelial cells was to induce expression of xenobiotic metabolism and redox stress-related genes and to decrease expression of some genes associated with regulation of inflammation. Several putative oncogenes were upregulated and tumor suppressor genes were downregulated although their roles, in smoking-induced lung cancer remain to be determined. Risk for developing lung cancer in smokers has been shown to increase with cumulative pack-years of exposure(22), and a number of putative oncogenes correlate positively with pack-years, while putative tumor suppressor genes correlate negatively.
  • It is unlikely that the alterations we observed in smokers were due to a change in cell types obtained at bronchoscopy. Several dynein genes were expressed at high levels in never smokers in our study, consistent with the predominance of ciliated cells in our samples. The level of expression of various dynein genes, and therefore the balance of cell types being sampled, did not change in smokers. This is consistent with a previous study of antioxidant gene expression in airway epithelial cells from never and current smokers that showed no change in histologic types of cells obtained from smokers(8). Our findings that drug metabolism and antioxidant genes are induced by smoking in airway epithelial cells is consistent with in vitro and in vivo animal studies (summarized in (9)). The high density arrays used in our studies allowed us to define the effect of cigarette smoking on a large number of genes not previously described as being affected by smoking.
  • Two sample unequal variance t-tests were performed to find differentially expressed genes between never and current smokers. Due to the presence of multiple comparisons in array data, there is the potential problem of finding genes differentially expressed between the 2 groups when no difference actually exists(Benjamini, Y. & Hochberg, Y. (1995) Journal of the Royal Statistical Society Series B 57, 289-300). Current methods available to adjust for multiple comparisons, such as the Bonferroni correction (where the (p(t-test)-value) threshold is divided by the number of hypotheses tested), are often too conservative when applied to microarray data (MacDonald, T. J., Brown, K. M., LaFleur, B., Peterson, K., Lawlor, C., Chen, Y., Packer, R. J., Cogen, P. & Stephan, D. A. (2001) Nat. Genet. 29, 143-152). However, we chose to employ a very stringent multiple comparison correction and (p(t-test)-value) threshold in order to identify a subset of genes altered by cigarette smoking with only a small probability of having a false positive. The Bonferroni correction controls the probability of committing even one error in all the hypotheses tested; however, the correction assumes independence of the different tests which is unlikely to hold true in the microarray setting where multiple genes are co-regulated (Tusher, V. G., Tibshirani, R. & Chu, G. (2001) Proc. Natl. Acad Sci. U.S.A 98, 5116-5121). Therefore, we have elected to employ a permutation-based correction (coded in PERL in our database) to assess the significance of the (p(t-test)-value) for any given gene. The permutation test is similar to the Bonferroni correction in that it controls the probability of finding even one gene by chance in the hypotheses tested, however, a permutation-based correction is data dependent. After calculating at-test statistic and (p(t-test)-value) for each gene, we permute the group assignments of all samples 1000 times and calculate for each permutation the t-statistic and corresponding (p(t-test)-value) for each gene. After all permutations are completed, the result is a 9968 (# of genes) by 1000 (# of permutations) matrix of (p(t-test)-values). For each permutation, a gene's actual (p(t-test)-value) is compared to all other permuted (p(t-test)-values) to determine if the any of the permuted (p(t-test)-values) is equal to or lower than the actual gene's (p(t-test)-value). An adjusted (p(t-test)-value) is computed for each gene based on the permutation test. The adjusted (p(t-test)-value) is the probability of observing at least as small a (p(t-test)-value) (in any gene) as the gene's actual (p(t-test)-value) in any random permutation. A gene is considered significant if less than 50 out of 1000 permutations (0.05) yield a gene with a permuted (p(t-test)-value) equal to or lower than the actual gene's (p(t-test)-value).
  • For our t-test comparing current vs. never smokers, the permuted (p(t-test)-value) threshold was found to be 1.06*10−5. Ninety-seven genes were considered differentially expressed between current and never smokers at this threshold. One shortcoming of this methodology is that is impossible to compute all possible permutations of the group assignments for large sample sizes. As a result, we repeated the permutation analysis 15 times yielding an average (p(t-test)-value) of 1.062*10−5 (sd=1.52*10−6). The mean (p(t-test)-value) was used as a cutoff and yielded a gene list of ninety-seven genes. In this case, the distribution of the data is such that the permuted P<t-test)-value threshold is slightly less strict than the equivalent Bonferroni cutoff.
  • By only focusing on the list of 97 genes that pass the (p(t-test)-value) threshold of 1.06*10−5, we recognize that we are ignoring a number of genes differentially expressed between never and current smokers (false negatives), but we wanted to be very confident regarding biological conclusions derived from genes that were considered differentially expressed. A broader list of genes was defined by calculating the q-value for each gene in the analysis as proposed by Storey J D & Tibshirani R (2003). Proc. Nail. Acrid Sci. US.A 100, 9449-9445. A given gene's q-value is the proportion of false positives present in the group of genes with smaller p-values than the gene. The q-value of the 97th gene was 0.005, which means that among all 97 t-tests that we designate as significant only 0.5% of them will be false positives. A less strict (p(t-test)-value) cutoff of 4.06*10−4 q-value=0.01) yields 261 genes with approximately 3 false positive genes. The q-values were calculated using the program Q-Value which can be downloaded from http://faculty.washington.edu/˜istorey/gvalue/. Larger lists of genes can be accessed through our database by selecting a less restrictive (p(t-test)-value) threshold (http://pulm.bumc.bu.edu/aged).
  • In order to further characterize the effect of tobacco smoke on bronchial epithelial cells, we wanted to explore how genes' expression changes with amount of smoking. Pearson correlation calculations exploring the relationship between gene expression among current smokers and pack-years of smoking were computed. A less strict permutation analysis was performed to correct for multiple Pearson correlation calculations. The analysis is analogous to the procedure described above, except only the genes having a correlation with a (p(correlation)-value) of less than 0.05 are permuted (2099 probesets instead of 9968 probesets). In addition, instead of permuting the class labels as described above, the pack-years were permuted (in a given permutation, gene expression values for a gene are assigned randomly to pack-year values). Using the less strict permutation analysis, the threshold was found to be 3.19*106 genes falling below this threshold. Supplementary Table 6 displays the top 51 genes with unadjusted (p(correlation)-value) below 0.0001. The (p(correlation)-value) threshold found using the permutation based multiple comparison correction is more strict than the Bonferroni threshold of 2.4*10−5 because the correction is data dependent and pack-year values in our study are quite variable. The current smokers in our study have an average number of pack-years of 22, but there are 3 “outlier” current smokers with extremely high pack-year histories (>70 pack-years). These smokers with extremely high pack years underpin the linear fit and result in better correlations even for random permutations, and thus lead to a stricter multiple comparison correction threshold.
  • Effects of Smoking Cessation: There is relatively little information about how smoking cessation alters the effects of smoking on airways. Cough and sputum production decreases rapidly in smokers with bronchitis who cease to smoke(23). The accelerated decline in forced expiratory volume (FEVI), that characterizes smokers with COPD, reverts to an ace appropriate decline of FEVI when smoking is discontinued(24). However, the allelic loss in airway epithelial cells obtained at biopsy, changes relatively little in former smokers and the risk for developing lung cancer remains high for at least 20 years after smoking cessation(6).
  • FIG. 25A shows a multidimensional scaling plot of never and current smokers according to the expression of the 97 genes that distinguish current smokers from never smokers. FIG. 25B shows that former smokers who discontinued smoking less than 2 years prior to this study tend to cluster with current smokers, whereas former smokers who discontinued smoking for more than 2 years group more closely with never smokers. Hierarchical clustering of all 18 former smokers according to the expression of these same 97 genes also reveals 2 subgroups of former smokers, with the length of smoking cessation being the only clinical variable that was statistically different between the 2 subgroups (see FIG. 30). Reversible genes were predominantly drug metabolizing and antioxidant genes.
  • There were 13 genes that did not return to normal levels in former smokers, even those who had discontinued smoking 20-30 year prior to testing (p<9*10−4; threshold determined by permutation analysis). These genes include a number of potential tumor suppressor genes, e.g. TU3A and CX3CL1, that are permanently decreased, and several putative oncogenes, e.g. CEACAM6 and HN1, which are permanently increased (see FIG. 26). Three metallothionein genes remain decreased in former smokers. Metallothioneins have metal binding, detoxification and antioxidant properties and have been reported to affect cell proliferation and apoptosis(25). The metallothionein genes that remained abnormal in former smokers are located at 16q13, suggesting that this may represent a fragile site for DNA injury in smokers. The persistence of abnormal expression of select genes after smoking cessation may provide growth advantages to a subset of epithelial cells allowing for clonal expansion and perpetuation of these cells years after smoking had been discontinued. These permanent changes might explain the persistent risk of lung cancer in former smokers.
  • We performed an unsupervised hierarchical clustering analysis of all 18 former smokers according to the expression of the 97 genes differentially expressed between current and never smoker (FIG. 30). In addition, a multidimensional scaling (MDS) plot was constructed of all samples according to the expression of these 97 genes (FIGS. 25A-25B). The MDS plot in FIG. 25 was constructed from the raw expression data for the 97 genes across all the samples using orthogonal initialization and euclidean distance as the similarity metric. Principal component analysis using the same data yielded similar results. Hierarchical clustering of the genes and samples was performed using log transformed z-score normalized data using a Pearson correlation (uncentered) similarity metric and average linkage clustering using CLUSTER and TREEVIEW software programs obtained at http://rana.1bI.gov/EisenSoftware.htmMDS and PCA were performed using Partek 5.0 software obtained at www.partek.com.
  • In order to identify genes irreversibly altered by cigarette smoking, we performed a t-test between former smokers (n=18) and never smokers (n=23) across the 97 genes that were considered differentially expressed between current and never smokers. A permutation analysis (as described above) was used to determine the (p(t-test)-value) threshold of 9.8*10−4 . Using this threshold, 15 of the 97 probesets were found to be significantly irreversible altered by cigarette smoking. In order to strengthen the argument that the 15 irreversibly altered probesets are related to smoking, the analysis was expanded to all 9968 genes. At-test was performed between former and never smoker across all 9968 genes, and 44 genes were found to have a (p(t-test)-value) threshold below 0.00098. While the permuted (p(t-test)-value) threshold for this extension of our t-test should have been computed across all 9968 genes, the former smokers are the smallest group in our study and thus we chose a less restrictive (p(t-test)-value) threshold. Although a there was about a 100-fold increase in the amount of genes analyzed there was only about a 3-fold increase in the number of genes found to be significantly different between never and former smokers. Therefore, most genes that are significantly different between never and former smokers are also significantly different between current and never smokers. Also, in addition to the 15 genes, 12 more genes had a (p(t-test)-value) between current and never smokers of less than 0.001, and only 7 of the 44 genes had (p(t-test)-values) between current and never smokers of greater than 0.05 (FIGS. 38A-38B).
  • We have, for the first time, characterized the genes expressed, and by extrapolation, defined the functions of a specific set of epithelial cells from a complex organ across a broad cross section of normal individuals. Large airway epithelial cells appear to serve antioxidant, metabolizing, and host defense functions.
  • Cigarette smoking, a major cause of lung disease, induces xenobiotic and redox regulating genes as well as several oncogenes, and decreases expression of several tumor suppressor genes and genes that regulate airway inflammation. We also identified a subset of three smokers who respond differently to cigarette smoke, i.e. individuals who do not turn on the genes needed to deal with getting rid of the pollutants, i.e., their airway transcriptome expression pattern resembles that of a non-smoker, and these smokers are thus predisposed to the carcinogenic effects.
  • Finally, we have explored the reversibility of altered gene expression when smoking was discontinued. The expression level of smoking induced genes among former smokers began to resemble that of never smokers after two years of smoking cessation. Genes that reverted to normal within two years of cessation tended to serve metabolizing and antioxidant functions.
  • Several genes, including potential oncogenes and tumor suppressor genes, failed to revert to never smoker levels years after cessation of smoking. Without wishing to be bound by a theory, these later findings explain the continued risk for developing lung cancer many years after individuals have ceased to smoke. In addition, results from this study show that the airway gene expression profile in smokers serves as a biomarker for lung cancer.
  • 50WVprobesetseq.txt
    Probes sequences for 201804_x_at:
    >probe:HG-U133A:201804_x_at:238:233; Interrogation_Position = 450; Antisense;
    CTGAAGCGCAGAAAGCTCGGCCGGT
    >probe:HG-U133A:201804_x_at:675:489; Interrogation_Position = 605; Antisense;
    GGGCACCGTCATGTATGTAGGTCTC
    >probe:HG-U133A:201804_x_at:533:507; Interrogation_Position = 624; Antisense;
    GGTCTCACAGATTTCAAGCCTGGCT
    >probe:HG-U133A:201804_x_at:276:295; Interrogation_Position = 641; Antisense;
    GCCTGGCTACTGGATTGGTGTCCGC
    >probe:HG-U133A:201804_x_at:302:481; Interrogation_Position = 658; Antisense;
    GTGTCCGCTATGATGAGCCACTGGG
    >probe:HG-U133A:201804_x_at:363:157; Interrogation_Position = 721; Antisense;
    AATGCCAGGCCAAGTATGGCGCCTT
    >probe:HG-U133A:201804_x_at:671:39; Interrogation_Position = 736; Antisense;
    ATGGCGCCTTTGTCAAGCCAGCAGT
    >probe:HG-U133A:201804_x_at:4:289; Interrogation_Position = 752; Antisense;
    GCCAGCAGTCGTGACGGTGGGGGAC
    >probe:HG-U133A:201804_x_at:375:585; Interrogation_Position = 810; Antisense;
    TGACACCTAAGGAATTCCCCTGCTT
    >probe:HG-U133A:201804_x_at:22:621; Interrogation_Position = 888; Antisense;
    TCTCCTGACCCCATTTTAATTTTAT
    >probe:HG-U133A:201804_x_at:679:679; Interrogation_Position = 908; Antisense;
    TTTATTCATTTTTTCCTTTGCCATT
    Probes sequences for 203588_s_at:
    >probe:HG-U133A:203588_s_at:530:623; Interrogation_Position = 787; Antisense;
    TCTACCATTCAGCTGCCATTCATAA
    >probe:HG-U133A:203588_s_at:152:331; Interrogation_Position = 845; Antisense;
    GCAGCATCTCCAGTGACAAGTTTGA
    >probe:HG-U133A:203588_s_at:382:39; Interrogation_Position = 931; Antisense;
    ATGGGAATGTCGTTTGGCCTGGAGT
    >probe:HG-U133A:203588_s_at:223:521; Interrogation_Position = 951; Antisense;
    GGAGTCAGGCAAATGCTCTCTGGAG
    >probe:HG-U133A:203588_s_at:268:585; Interrogation_Position = 980; Antisense;
    TGAAACTTGCGAAATCCCTGGTGCC
    >probe:HG-U133A:203588_s_at:662:13; Interrogation_Position = 1031; Antisense;
    ATATCTCCACAGGACCTTCTTGGTT
    >probe:HG-U133A:203588_s_at:123:633; Interrogation_Position = 1059; Antisense;
    TCAGGGACTACTTCTGAACTCTACC
    >probe:HG-U133A:203588_s_at:251:163; Interrogation_Position = 1096; Antisense;
    AATTTAGACCTGACCACTGGTGCCA
    >probe:HG-U133A:203588_s_at:341:51; Interrogation_Position = 1167; Antisense;
    AGTGGCCTTAGCAACTGGGCAGTTC
    >probe:HG-U133A:203588_s_at:238:173; Interrogation_Position = 1199; Antisense;
    CAAACAGTCACCAGTCCAGCAGTGC
    >probe:HG-U133A:203588_s_at:667:1; Interrogation_Position = 1301; Antisense;
    ATTCCTCCTCCCCAGAATAAAGACA
    Probes sequences for 219920_s_at:
    >probe:HG-U133A:219920_s_at:202:549; Interrogation_Position = 1024; Antisense;
    GGCCAGAACTGCAGCATTGGCCCCA
    >probe:HG-U133A:219920_s_at:305:703; Interrogation_Position = 1040; Antisense;
    TTGGCCCCAATGTGAGCCTGGGACC
    >probe:HG-U133A:219920_s_at:646:265; Interrogation_Position = 1063; Antisense;
    CCTGGCGTGGTGGTCGAAGATGGTG
    >probe:HG-U133A:219920_s_at:425:415; Interrogation_Position = 1237; Antisense;
    GATGAGCTCTACCTCAACGGAGCCA
    >probe:HG-U133A:219920_s_at:283:595; Interrogation_Position = 1265; Antisense;
    TGCTGCCCCACAAGTCTATTGGCGA
    >probe:HG-U133A:219920_s_at:216:281; Interrogation_Position = 1297; Antisense;
    CCAGAGCCTCGTATCATCATGTGAG
    >probe:HG-U133A:219920_s_at:246:55; Interrogation_Position = 1370; Antisense;
    AGTGCTGGCCTGACACATCAGAAGA
    >probe:HG-U133A:219920_s_at:455:201; Interrogation_Position = 1385; Antisense;
    CATCAGAAGACCCTGGACTTGTCAT
    >probe:HG-U133A:219920_s_at:612:459; Interrogation_Position = 1405; Antisense;
    GTCATTATTTGTCTGGGGGGCACTG
    >probe:HG-U133A:219920_s_at:15:485; Interrogation_Position = 1466; Antisense;
    GTGGACATCATCTGGCAGGATCCCT
    >probe:HG-U133A:219920_s_at:618:249; Interrogation_Position = 1511; Antisense;
    CCCACTCCCTCAAGAAGGGCCAGGG
    Probes sequences for 201608_s_at:
    >probe:HG-U133A:201608_s_at:302:159; Interrogation_Position = 1287; Antisense;
    AATCAAGGGCTGTCTCGTGACTGCT
    >probe:HG-U133A:201608_s_at:706:619; Interrogation_Position = 1299; Antisense;
    TCTCGTGACTGCTTCAGCTGACAAA
    >probe:HG-U133A:201608_s_at:170:631; Interrogation_Position = 1408; Antisense;
    TCATGTTGCCCTGATTTGCCATTTA
    >probe:HG-U133A:201608_s_at:555:61; Interrogation_Position = 1455; Antisense;
    AGAAGGGCTTCGGGTCTGGGATATA
    >probe:HG-U133A:201608_s_at:685:397; Interrogation_Position = 1517; Antisense;
    GAGAGAGGCTTGTTCTTGGGAGTGC
    >probe:HG-U133A:201608_s_at:676:627; Interrogation_Position = 1549; Antisense;
    TCATCTATTAGTGGCCCTTTTGGCA
    >probe:HG-U133A:201608_s_at:10:189; Interrogation_Position = 1572; Antisense;
    CAGCAGGAGCTCAGATACACCCATG
    >probe:HG-U133A:201608_s_at:231:23; Interrogation_Position = 1614; Antisense;
    ATCTAATTTCCTGCTTACCTTAACT
    >probe:HG-U133A:201608_s_at:278:35; Interrogation_Position = 1665; Antisense;
    ATGTTCCATGCGTGGCAGCAACCAT
    >probe:HG-U133A:201608_s_at:697:479; Interrogation_Position = 1745; Antisense;
    GTGGCACCACAAATATCCGGTCTTT
    >probe:HG-U133A:201608_s_at:235:705; Interrogation_Position = 1768; Antisense;
    TTGTGCTTGCTCTTCAGATGGATGG
    Probes sequences for 202857_at:
    >probe:HG-U133A:202857_at:316:499; Interrogation_Position = 433; Antisense;
    GGGGAACAGATTGATCCTTCCACCC
    >probe:HG-U133A:202857_at:308:601; Interrogation_Position = 451; Antisense;
    TCCACCCATCGCAAGAACTACGTAC
    >probe:HG-U133A:202857_at:646:647; Interrogation_Position = 469; Antisense;
    TACGTACGTGTAGTGGGCCGGAATG
    >probe:HG-U133A:202857_at:607:383; Interrogation_Position = 511; Antisense;
    GACCTACAAGGCATCCGAATCGACT
    >probe:HG-U133A:202857_at:432:615; Interrogation_Position = 530; Antisense;
    TCGACTCAGATATTAGCGGCACCCT
    >probe:HG-U133A:202857_at:375:243; Interrogation_Position = 546; Antisense;
    CGGCACCCTCAAGTTTGCGTGTGAG
    >probe:HG-U133A:202857_at:75:587; Interrogation_Position = 594; Antisense;
    TGAACTCATTGAATTCTTTTCCCGA
    >probe:HG-U133A:202857_at:321:247; Interrogation_Position = 655; Antisense;
    CGAACAGATCTTTGTGACCATGCCC
    >probe:HG-U133A:202857_at:689:87; Interrogation_Position = 671; Antisense;
    ACCATGCCCTGCACATATCGCATGA
    >probe:HG-U133A:202857_at:103:391; Interrogation_Position = 697; Antisense;
    GAGCTATGAACCACTGGAGCAGCCC
    >probe:HG-U133A:202857_at:57:303; Interrogation_Position = 718; Antisense;
    GCCCACACTGGCTTGATGGATCACC
    Probes sequences for 219203_at:
    >probe:HG-U133A:219203_at:405:265; Interrogation_Position = 335; Antisense;
    CCTGTCCGTCATGTTGGAGGTCGCC
    >probe:HG-U133A:219203_at:608:613; Interrogation_Position = 355; Antisense;
    TCGCCCTCAACCAGGTGGATGTGTG
    >probe:HG-U133A:219203_at:32:463; Interrogation_Position = 394; Antisense;
    GTCTGGTGGTGGCTGGTTACTACCA
    >probe:HG-U133A:219203_at:536:437; Interrogation_Position = 409; Antisense;
    GTTACTACCATGCCAATGCAGCTGT
    >probe:HG-U133A:219203_at:118:333; Interrogation_Position = 426; Antisense;
    GCAGCTGTGAACGATCAGAGCCCTG
    >probe:HG-U133A:219203_at:240:341; Interrogation_Position = 486; Antisense;
    GAATTCTTCCCTGATGCAGTACTTA
    >probe:HG-U133A:219203_at:649:619; Interrogation_Position = 581; Antisense;
    TCTCCGCTGGGTCCCTAAGGATAAG
    >probe:HG-U133A:219203_at:411:539; Interrogation_Position = 640; Antisense;
    GGCAGATGGTGGGAGCTCTACTGGA
    >probe:HG-U133A:219203_at:9:653; Interrogation_Position = 658; Antisense;
    TACTGGAAGATCGGGCCCACCAGCA
    >probe:HG-U133A:219203_at:15:189; Interrogation_Position = 678; Antisense;
    CAGCACCTTGTGGACTTTGACTGCC
    >probe:HG-U133A:219203_at:539:599; Interrogation_Position = 699; Antisense;
    TGCCACCTTGATGACATCCGGCAGG
    Probes sequences for 211759_x_at:
    >probe:HG-U133A:211759_x_at:485:313; Interrogation_Position = 429; Antisense;
    GCTCTTTCCTGAAGCGCAGCAAGCT
    >probe:HG-U133A:211759_x_at:674:489; Interrogation_Position = 592; Antisense;
    GGGCACCGTCATGTATGTAGGTCTC
    >probe:HG-U133A:211759_x_at:532:507; Interrogation_Position = 611; Antisense;
    GGTCTCACAGATTTCAAGCCTGGCT
    >probe:HG-U133A:211759_x_at:275:295; Interrogation_Position = 628; Antisense;
    GCCTGGCTACTGGATTGGTGTCCGC
    >probe:HG-U133A:211759_x_at:301:481; Interrogation_Position = 645; Antisense;
    GTGTCCGCTATGATGAGCCACTGGG
    >probe:HG-U133A:211759_x_at:362:157; Interrogation_Position = 708; Antisense;
    AATGCCAGGCCAAGTATGGCGCCTT
    >probe:HG-U133A:211759_x_at:670:39; Interrogation_Position = 723; Antisense;
    ATGGCGCCTTTGTCAAGCCAGCAGT
    >probe:HG-U133A:211759_x_at:3:289; Interrogation_Position = 739; Antisense;
    GCCAGCAGTCGTGACGGTGGGGGAC
    >probe:HG-U133A:211759_x_at:374:585; Interrogation_Position = 797; Antisense;
    TGACACCTAAGGAATTCCCCTGCTT
    >probe:HG-U133A:211759_x_at:21:621; Interrogation_Position = 875; Antisense;
    TCTCCTGACCCCATTTTAATTTTAT
    >probe:HG-U133A:211759_x_at:678:679; Interrogation_Position = 895; Antisense;
    TTTATTCATTTTTTCCTTTGCCATT
    Probes sequences for 218556_at:
    >probe:HG-U133A:218556_at:45:657; Interrogation_Position = 38; Antisense;
    TAGCCGGACGGGGATCTGAGCTGGC
    >probe:HG-U133A:218556_at:593:645; Interrogation_Position = 94; Antisense;
    TAAACCCCAACACCCGAGTGATGAA
    >probe:HG-U133A:218556_at:682:591; Interrogation_Position = 163; Antisense;
    TGCATATGGTTCTACTCAGCATCCC
    >probe:HG-U133A:218556_at:356:211; Interrogation_Position = 191; Antisense;
    CTTCAGCATTCCTGTTGTCTGGACC
    >probe:HG-U133A:218556_at:483:233; Interrogation_Position = 216; Antisense;
    CTGACCAACGTCATCCATAACCTGG
    >probe:HG-U133A:218556_at:147:645; Interrogation_Position = 233; Antisense;
    TAACCTGGCTACGTATGTCTTCCTT
    >probe:HG-U133A:218556_at:689:151; Interrogation_Position = 295; Antisense;
    AAGGAAAGGCTCGGCTACTGACACA
    >probe:HG-U133A:218556_at:478:553; Interrogation_Position = 331; Antisense;
    TGGACTATGGGCTCCAGTTTACCTC
    >probe:HG-U133A:218556_at:265:609; Interrogation_Position = 397; Antisense;
    TCCTGGCCAGCTTCTATACCAAGTA
    >probe:HG-U133A:218556_at:407:295; Interrogation_Position = 450; Antisense;
    GCCTCATTGCTAAGTGTACTGCTGC
    >probe:HG-U133A:218556_at:117:435; Interrogation_Position = 488; Antisense;
    GTTCCATGGGGTTCGTGTCTTTGGC
    Probes sequences for 214463_x_at:
    >probe:HG-U133A:214463_x_at:264:35; Interrogation_Position = 46; Antisense;
    ATGTCTGGCCGCGGCAAAGGCGGGA
    >probe:HG-U133A:214463_x_at:659:315; Interrogation_Position = 91; Antisense;
    GCTAAGCGCCACCGTAAAGTACTGC
    >probe:HG-U133A:214463_x_at:115:303; Interrogation_Position = 116; Antisense;
    GCGACAATATCCAGGGCATCACCAA
    >probe:HG-U133A:214463_x_at:692:207; Interrogation_Position = 157; Antisense;
    CTTGCTCGCCGCGGCGGCGTGAAGC
    >probe:HG-U133A:214463_x_at:706:25; Interrogation_Position = 184; Antisense;
    ATCTCCGGCCTCATCTACGAGGAGA
    >probe:HG-U133A:214463_x_at:629:379; Interrogation_Position = 207; Antisense;
    GACTCGCGGGGTGCTGAAGGTGTTC
    >probe:HG-U133A:214463_x_at:20:435; Interrogation_Position = 228; Antisense;
    GTTCCTGGAGAACGTGATCCGGGAC
    >probe:HG-U133A:214463_x_at:588:385; Interrogation_Position = 258; Antisense;
    GACCTATACAGAGCACGCCAAGCGC
    >probe:HG-U133A:214463_x_at:215:459; Interrogation_Position = 289; Antisense;
    GTCACCGCCATGGATGTGGTCTACG
    >probe:HG-U133A:214463_x_at:269:631; Interrogation_Position = 340; Antisense;
    TACGGTTTCGGTGGTTGAGCGTCCT
    >probe:HG-U133A:214463_x_at:55:581; Interrogation_Position = 355; Antisense;
    TGAGCGTCCTTTTCTACCAATAAAA
    Probes sequences for 210434_x_at:
    >probe:HG-U133A:210434_x_at:506:41; Interrogation_Position = 579; Antisense;
    ATGGCCTGGGGCTGGCATTTATCTT
    >probe:HG-U133A:210434_x_at:281:685; Interrogation_Position = 597; Antisense;
    TTATCTTTCCTTTCAGCAAGCACCT
    >probe:HG-U133A:210434_x_at:552:333; Interrogation_Position = 616; Antisense;
    GCACCTCAAATTTGCCATGCTGGCT
    >probe:HG-U133A:210434_x_at:392:145; Interrogation_Position = 664; Antisense;
    AAGAGTGCTCTCCATGCTCTAATTT
    >probe:HG-U133A:210434_x_at:656:653; Interrogation_Position = 701; Antisense;
    TACCCCTGAGTGTGGTCCCACAGGA
    >probe:HG-U133A:210434_x_at:142:231; Interrogation_Position = 776; Antisense;
    CTGCCGCTCAGCTTTGATGGAACAA
    >probe:HG-U133A:210434_x_at:169:349; Interrogation_Position = 819; Antisense;
    GAAGGGGCTGTCGTGTGTGTGGCCC
    >probe:HG-U133A:210434_x_at:73:707; Interrogation_Position = 854; Antisense;
    TTGTCTTGTCATCATTCGTCAGCGA
    >probe:HG-U133A:210434_x_at:501:615; Interrogation_Position = 919; Antisense;
    TCGAGTCCATATAGCTACATTCCAC
    >probe:HG-U133A:210434_x_at:269:283; Interrogation_Position = 940; Antisense;
    CCACCCTTGTATCCTGGGTCTTAGA
    >probe:HG-U133A:210434_x_at:68:577; Interrogation_Position = 998; Antisense;
    TGATTTGCACTCTTGGTTCTTTGGA
    Probes sequences for 209653_at:
    >probe:HG-U133A:209653_at:158:457; Interrogation_Position = 1252; Antisense;
    GTAATTGATGCCAATCTTGTACCAA
    >probe:HG-U133A:209653_at:298:515; Interrogation_Position = 1293; Antisense;
    GGATAAGGGGGATTTTGGCACTCAA
    >probe:HG-U133A:209653_at:297:355; Interrogation_Position = 1321; Antisense;
    GAAGCTGCTTGGGCCATAAGTAACT
    >probe:HG-U133A:209653_at:274:423; Interrogation_Position = 1366; Antisense;
    GATCAAGTGGCTTACCTTATCCAAC
    >probe:HG-U133A:209653_at:627:613; Interrogation_Position = 1400; Antisense;
    TCCCACCTTTTTGCAACTTGCTGAC
    >probe:HG-U133A:209653_at:594:375; Interrogation_Position = 1576; Antisense;
    GACATCTACAAATTGGCCTATGAGA
    >probe:HG-U133A:209653_at:3:423; Interrogation_Position = 1599; Antisense;
    GATCATTGATCAGTTCTTCTCTTCA
    >probe:HG-U133A:209653_at:651:425; Interrogation_Position = 1627; Antisense;
    GATATTGATGAAGACCCTAGCCTTG
    >probe:HG-U133A:209653_at:520:255; Interrogation_Position = 1641; Antisense;
    CCCTAGCCTTGTTCCAGAGGCAATT
    >probe:HG-U133A:209653_at:29:541; Interrogation_Position = 1669; Antisense;
    GGCGGAACATTTGGTTTCAATTCAT
    >probe:HG-U133A:209653_at:445:639; Interrogation_Position = 1685; Antisense;
    TCAATTCATCTGCCAATGTACCAAC
    Probes sequences for 217949_s_at:
    >probe:HG-U133A:217949_s_at:236:317; Interrogation_Position = 332; Antisense;
    GCTATTGTTAGGTTGCCTGCGGACA
    >probe:HG-U133A:217949_s_at:226:627; Interrogation_Position = 433; Antisense;
    TCTTCGTGCTCTATGATTTCTGCAT
    >probe:HG-U133A:217949_s_at:659:617; Interrogation_Position = 451; Antisense;
    TCTGCATTGTTTGTATCACCACCTA
    >probe:HG-U133A:217949_s_at:35:87; Interrogation_Position = 468; Antisense;
    ACCACCTATGCTATCAACGTGAGCC
    >probe:HG-U133A:217949_s_at:556:485; Interrogation_Position = 497; Antisense;
    GTGGCTCAGTTTCCGGAAGGTCCAA
    >probe:HG-U133A:217949_s_at:241:543; Interrogation_Position = 536; Antisense;
    GGCTAAGAGGCACTGAGCCCTCAAC
    >probe:HG-U133A:217949_s_at:309:325; Interrogation_Position = 594; Antisense;
    GCATGTGAGCCTTGCCTAAGGGGGC
    >probe:HG-U133A:217949_s_at:133:247; Interrogation_Position = 607; Antisense;
    GCCTAAGGGGGCATATCTGGGTCCC
    >probe:HG-U133A:217949_s_at:507:615; Interrogation_Position = 622; Antisense;
    TCTGGGTCCCTAGAAGGCCCTAGAT
    >probe:HG-U133A:217949_s_at:255:411; Interrogation_Position = 644; Antisense;
    GATGTGGGGCTTCTAGATTACCCCC
    >probe:HG-U133A:217949_s_at:651:229; Interrogation_Position = 674; Antisense;
    CTGCCATACCCGCACATGACAATGG
    Probes sequences for 203246_s_at:
    >probe:HG-U133A:203246_s_at:500:709; Interrogation_Position = 770; Antisense;
    TTGTGACACTGGTGTCCATCCTCCA
    >probe:HG-U133A:203246_s_at:95:453; Interrogation_Position = 795; Antisense;
    GTACTCCAATGTATACTGCCCAACG
    >probe:HG-U133A:203246_s_at:199:63; Interrogation_Position = 858; Antisense;
    AGAGGCATGTCTATCCTACGTGACC
    >probe:HG-U133A:203246_s_at:291:619; Interrogation_Position = 906; Antisense;
    TCTCCGGGATGTGTTCCAGCTATAC
    >probe:HG-U133A:203246_s_at:435:309; Interrogation_Position = 1011; Antisense;
    GCTGATCCAGTTCGGGCTTATGAAG
    >probe:HG-U133A:203246_s_at:519:353; Interrogation_Position = 1032; Antisense;
    GAAGAACCTCATCAGGCGACTACAG
    >probe:HG-U133A:203246_s_at:587:377; Interrogation_Position = 1074; Antisense;
    GACTCGGGAAGAGCAGAGCCACCCT
    >probe:HG-U133A:203246_s_at:675:541; Interrogation_Position = 1103; Antisense;
    GGCTTTATACAGGCTGCCACAGCTA
    >probe:HG-U133A:203246_s_at:248:363; Interrogation_Position = 1183; Antisense;
    GAAAATGACCCCAACATCATCATCT
    >probe:HG-U133A:203246_s_at:540:111; Interrogation_Position = 1239; Antisense;
    ACACATTGCTGTGGGTAGTCCCTCC
    >probe:HG-U133A:203246_s_at:388:47; Interrogation_Position = 1268; Antisense;
    AGGAGGCTTGTCATACTGTCTAGAG
    Probes sequences for 219408_at:
    >probe:HG-U133A:219408_at:67:307; Interrogation_Position = 1843; Antisense;
    GCTGTGGTTGTGGAGTTCAGGGACC
    >probe:HG-U133A:219408_at:298:239; Interrogation_Position = 1887; Antisense;
    CTGTGGTGACTGCGAAGGCTTCGAC
    >probe:HG-U133A:219408_at:450:319; Interrogation_Position = 1904; Antisense;
    GCTTCGACGTGCACATCATGGACGA
    >probe:HG-U133A:219408_at:257:375; Interrogation_Position = 1927; Antisense;
    GACATGATTAAGCGTGCCCTGGACT
    >probe:HG-U133A:219408_at:375:257; Interrogation_Position = 2066; Antisense;
    CCCTGTGTGCCGAGGGCACCGTGGA
    >probe:HG-U133A:219408_at:320:183; Interrogation_Position = 2082; Antisense;
    CACCGTGGAGCTCAGAAGGCCCGGG
    >probe:HG-U133A:219408_at:123:191; Interrogation_Position = 2107; Antisense;
    CAGAGCCACGCAGCGGTGCTATGGA
    >probe:HG-U133A:219408_at:78:289; Interrogation_Position = 2111; Antisense;
    GCCACGCAGCGGTGCTATGGATGGA
    >probe:HG-U133A:219408_at:445:663; Interrogation_Position = 2126; Antisense;
    TATGGATGGAGTACCACCTGACCCC
    >probe:HG-U133A:219408_at:75:281; Interrogation_Position = 2254; Antisense;
    CCAGATCCCAGAGCACTGCTGGGTG
    >probe:HG-U133A:219408_at:356:521; Interrogation_Position = 2337; Antisense;
    GGAGTTCAGGCATGCAGATACCCCA
    Probes sequences for 208682_s_at:
    >probe:HG-U133A:208682_s_at:558:451; Interrogation_Position = 1353; Antisense;
    GTACCTGGACTATGCCAGAGTCCCC
    >probe:HG-U133A:208682_s_at:639:29; Interrogation_Position = 1364; Antisense;
    ATGCCAGAGTCCCCAATAGCAATCC
    >probe:HG-U133A:208682_s_at:445:295; Interrogation_Position = 1412; Antisense;
    GCCTGCGCTCTTACTATGAGACCAG
    >probe:HG-U133A:208682_s_at:283:539; Interrogation_Position = 1494; Antisense;
    GGCAGCTCAGTACCGAGAGGCGATG
    >probe:HG-U133A:208682_s_at:379:545; Interrogation_Position = 1545; Antisense;
    GGCTGCAGCTGAAGCCAAGGCTAGG
    >probe:HG-U133A:208682_s_at:608:79; Interrogation_Position = 1581; Antisense;
    AGCTCGAATGGGCATTGGGCTCGGC
    >probe:HG-U133A:208682_s_at:402:235; Interrogation_Position = 1643; Antisense;
    CTGATATCGGACCCTGGGCCAAAGC
    >probe:HG-U133A:208682_s_at:661:249; Interrogation_Position = 1661; Antisense;
    CCAAAGCCCGGATCCAGGCGGGAGC
    >probe:HG-U133A:208682_s_at:677:449; Interrogation_Position = 1742; Antisense;
    GTACCAATAACAGTGCCAGTGCCAG
    >probe:HG-U133A:208682_s_at:372:53; Interrogation_Position = 1765; Antisense;
    AGTGCCAGCACCAGTGGTGGCTTCA
    >probe:HG-U133A:208682_s_at:196:185; Interrogation_Position = 1860; Antisense;
    CACCAGTGGCAGCTCTGGTGCCTGT
    Probes sequences for 211609_x_at:
    >probe:HG-U133A:211609_x_at:96:563; Interrogation_Position = 746; Antisense;
    TGGGTCTTGGTGCCAGTGACTTTGA
    >probe:HG-U133A:211609_x_at:435:447; Interrogation_Position = 778; Antisense;
    GTAGATCCCAGTGCTGATCCTGAGC
    >probe:HG-U133A:211609_x_at:115:187; Interrogation_Position = 869; Antisense;
    CAGCTGCAGCTTCTGCTGCTGAGGC
    >probe:HG-U133A:211609_x_at:387:383; Interrogation_Position = 925; Antisense;
    GACGATGCCCTGCTGAAGATGACCA
    >probe:HG-U133A:211609_x_at:27:75; Interrogation_Position = 956; Antisense;
    AGCAAGAGTTTGGCCGCACTGGGCT
    >probe:HG-U133A:211609_x_at:418:593; Interrogation_Position = 1017; Antisense;
    TGCTTATGCCATGCAGATGTCCCTG
    >probe:HG-U133A:211609_x_at:161:445; Interrogation_Position = 1053; Antisense;
    GTTTGGCCAGGCGGAATCAGCAGAC
    >probe:HG-U133A:211609_x_at:458:601; Interrogation_Position = 1083; Antisense;
    TGCCAGCTCAGCTATGGACACATCC
    >probe:HG-U133A:211609_x_at:345:109; Interrogation_Position = 1100; Antisense;
    ACACATCCGAGCCAGCCAAGGAGGA
    >probe:HG-U133A:211609_x_at:374:575; Interrogation_Position = 1128; Antisense;
    TGATTACGACGTGATGCAGGACCCC
    >probe:HG-U133A:211609_x_at:670:399; Interrogation_Position = 1153; Antisense;
    GAGTTCCTTCAGAGTGTCCTAGAGA
    Probes sequences for 202936_s_at:
    >probe:HG-U133A:202936_s_at:240:449; Interrogation_Position = 3363; Antisense;
    GTAGTGTATCACTGAGTCATTTGCA
    >probe:HG-U133A:202936_s_at:32:573; Interrogation_Position = 3389; Antisense;
    TGTTTTCTGCCACAGACCTTTGGGC
    >probe:HG-U133A:202936_s_at:240:563; Interrogation_Position = 3409; Antisense;
    TGGGCTGCCTTATATTGTGTGTGTG
    >probe:HG-U133A:202936_s_at:140:143; Interrogation_Position = 3470; Antisense;
    AAGCATGTGTCATCCATATTTCTCT
    >probe:HG-U133A:202936_s_at:490:701; Interrogation_Position = 3505; Antisense;
    TTGGAGTGAGGGAGGCTACCTGGAG
    >probe:HG-U133A:202936_s_at:157:217; Interrogation_Position = 3520; Antisense;
    CTACCTGGAGGGGATCAGCCCACTG
    >probe:HG-U133A:202936_s_at:593:181; Interrogation_Position = 3535; Antisense;
    CAGCCCACTGACAGACCTTAATCTT
    >probe:HG-U133A:202936_s_at:456:9; Interrogation_Position = 3561; Antisense;
    ATTACTGCTGTGGCTAGAGAGTTTG
    >probe:HG-U133A:202936_s_at:528:15; Interrogation_Position = 3688; Antisense;
    ATATGGCATCCTTCAATTTCTGTAT
    >probe:HG-U133A:202936_s_at:77:455; Interrogation_Position = 3787; Antisense;
    GTAAAAGCTTTGGTTTGTGTTCGTG
    >probe:HG-U133A:202936_s_at:348:573; Interrogation_Position = 3854; Antisense;
    TGTTCTCTCCGTGAAACTTACCTTT
    Probes sequences for 211954_s_at:
    >probe:HG-U133A:211954_s_at:630:387; Interrogation_Position = 3671; Antisense;
    GAGCCACAAGCAGGAAGAGCAGCGC
    >probe:HG-U133A:211954_s_at:291:675; Interrogation_Position = 3713; Antisense;
    TTTCCATGGATTTCTACCAGACCAC
    >probe:HG-U133A:211954_s_at:500:653; Interrogation_Position = 3727; Antisense;
    TACCAGACCACTGAAGGAGTTCCTG
    >probe:HG-U133A:211954_s_at:403:301; Interrogation_Position = 3755; Antisense;
    GCCCTGCGGTAGCTAGCACTGAAGA
    >probe:HG-U133A:211954_s_at:448:653; Interrogation_Position = 3888; Antisense;
    TACTCAGCGTAGATGTGTGTTCACA
    >probe:HG-U133A:211954_s_at:148:479; Interrogation_Position = 3904; Antisense;
    GTGTTCACACAAATTGCTCTGCATT
    >probe:HG-U133A:211954_s_at:653:267; Interrogation_Position = 3981; Antisense;
    CCTCCTACCCAGCAAACCAGTAGAC
    >probe:HG-U133A:211954_s_at:688:673; Interrogation_Position = 4056; Antisense;
    TTTGATGTCACTTTGGTTCTTTTTC
    >probe:HG-U133A:211954_s_at:570:53; Interrogation_Position = 4096; Antisense;
    AGTGACTCAGTCGGGAAGCTTTCCA
    >probe:HG-U133A:211954_s_at:472:473; Interrogation_Position = 4138; Antisense;
    GTGAAGTGTCATTGGCATGTCTGGC
    >probe:HG-U133A:211954_s_at:525:37; Interrogation_Position = 4154; Antisense;
    ATGTCTGGCAGTAGTCTCTCATTCA
    Probes sequences for 213884_s_at:
    >probe:HG-U133A:213884_s_at:400:545; Interrogation_Position = 1073; Antisense;
    GGCTGACTGGGGCAACAGCCGCATC
    >probe:HG-U133A:213884_s_at:256:81; Interrogation_Position = 1089; Antisense;
    AGCCGCATCCAGGTATTCGACAGCT
    >probe:HG-U133A:213884_s_at:488:279; Interrogation_Position = 1097; Antisense;
    CCAGGTATTCGACAGCTCTGGCTCC
    >probe:HG-U133A:213884_s_at:539:275; Interrogation_Position = 1120; Antisense;
    CCTTCCTGTCCTATATCAACACATC
    >probe:HG-U133A:213884_s_at:661:21; Interrogation_Position = 1134; Antisense;
    ATCAACACATCTGCAGAACCACTGT
    >probe:HG-U133A:213884_s_at:362:25; Interrogation_Position = 1142; Antisense;
    ATCTGCAGAACCACTGTATGGTCCA
    >probe:HG-U133A:213884_s_at:86:181; Interrogation_Position = 1177; Antisense;
    CACTGACCTCGGATGGCCATGTGGT
    >probe:HG-U133A:213884_s_at:155:561; Interrogation_Position = 1201; Antisense;
    TGGTGGCTGATGCTGGCAACCACTG
    >probe:HG-U133A:213884_s_at:96:87; Interrogation_Position = 1219; Antisense;
    ACCACTGCTTTAAAGCCTATCGCTA
    >probe:HG-U133A:213884_s_at:650:271; Interrogation_Position = 1234; Antisense;
    CCTATCGCTACCTCCAGTAGCTGTA
    >probe:HG-U133A:213884_s_at:536:603; Interrogation_Position = 1246; Antisense;
    TCCAGTAGCTGTACAGAGGCCCTGC
    Probes sequences for 218976_at:
    >probe:HG-U133A:218976_at:118:251; Interrogation_Position = 619; Antisense;
    CCCAAGCCCCTAGAGAAGTCAGTCT
    >probe:HG-U133A:218976_at:587:55; Interrogation_Position = 639; Antisense;
    AGTCTCCCCGCAAAATTCAGATTCT
    >probe:HG-U133A:218976_at:195:567; Interrogation_Position = 678; Antisense;
    TGTGAATGGTTGGCACCTTCGTTTC
    >probe:HG-U133A:218976_at:389:215; Interrogation_Position = 805; Antisense;
    CTATGCTGCCAACATGCAGTCTTTG
    >probe:HG-U133A:218976_at:473:199; Interrogation_Position = 859; Antisense;
    CATGTCTGTGAATTGCTGAGTACTA
    >probe:HG-U133A:218976_at:461:651; Interrogation_Position = 879; Antisense;
    TACTAATTGATTCCTCCATCCTTGA
    >probe:HG-U133A:218976_at:551:343; Interrogation_Position = 902; Antisense;
    GAATCAGTTCTCATAATGCTTTTTA
    >probe:HG-U133A:218976_at:14:31; Interrogation_Position = 1007; Antisense;
    ATGCTTTTTCCTATTAATACTACTT
    >probe:HG-U133A:218976_at:395:5; Interrogation_Position = 1081; Antisense;
    ATTGACATACTGTGATCTCTATTAG
    >probe:HG-U133A:218976_at:29:575; Interrogation_Position = 1124; Antisense;
    TGTTTTCTTACCCTTGACTTACAAT
    >probe:HG-U133A:218976_at:496:473; Interrogation_Position = 1154; Antisense;
    GTGAAATTACTTGTCTGAACCCCGT
    Probes sequences for 204288_s_at:
    >probe:HG-U133A:204288_s_at:430:117; Interrogation_Position = 3526; Antisense;
    AAAGGTGCTGAGGACTACCCTGACC
    >probe:HG-U133A:204288_s_at:434:425; Interrogation_Position = 3577; Antisense;
    GATAGGATTCACAGCTTGAGCTCAA
    >probe:HG-U133A:204288_s_at:28:389; Interrogation_Position = 3594; Antisense;
    GAGCTCAAATAAGCCACAGCGTCCT
    >probe:HG-U133A:204288_s_at:705:81; Interrogation_Position = 3611; Antisense;
    AGCGTCCTGTGTTTACTCATGAAAA
    >probe:HG-U133A:204288_s_at:110:543; Interrogation_Position = 3663; Antisense;
    GGCTCTGTATAACTATACTCCCAGG
    >probe:HG-U133A:204288_s_at:203:475; Interrogation_Position = 3743; Antisense;
    GTGATGACGGCTGGTTTGTGGGGAC
    >probe:HG-U133A:204288_s_at:500:609; Interrogation_Position = 3797; Antisense;
    TCCCCGGAAACTACGTCAAGAGGCT
    >probe:HG-U133A:204288_s_at:279:145; Interrogation_Position = 3814; Antisense;
    AAGAGGCTGTGAATTGCGCTCCCTC
    >probe:HG-U133A:204288_s_at:570:607; Interrogation_Position = 3903; Antisense;
    TCCCGTTGTCATGCCTTACGGTTTC
    >probe:HG-U133A:204288_s_at:96:493; Interrogation_Position = 3997; Antisense;
    GGGACGACATGGCAGGCTGGTCCCC
    >probe:HG-U133A:204288_s_at:277:367; Interrogation_Position = 4028; Antisense;
    GAAAGTGTGGATTCCTACTTCCTGC
    Probes sequences for 202355_s_at:
    >probe:HG-U133A:202355_s_at:395:397; Interrogation_Position = 1207; Antisense;
    GAGAGCGACATTGACAGCGAGGCCT
    >probe:HG-U133A:202355_s_at:454:387; Interrogation_Position = 1276; Antisense;
    GAGCGGAAGCCGTCGGGAGGGAGCT
    >probe:HG-U133A:202355_s_at:701:331; Interrogation_Position = 1414; Antisense;
    GCAGCCAAGCGGTTGCGGCTGGACA
    >probe:HG-U133A:202355_s_at:94:67; Interrogation_Position = 1448; Antisense;
    AGAGCCTGTCTGGGAAGTCGACACC
    >probe:HG-U133A:202355_s_at:692:281; Interrogation_Position = 1477; Antisense;
    CCACCATCAGGCAAGACAACACCCA
    >probe:HG-U133A:202355_s_at:547:327; Interrogation_Position = 1515; Antisense;
    GCAGGTGACTGAGGATGCCGTGCGC
    >probe:HG-U133A:202355_s_at:602:81; Interrogation_Position = 1556; Antisense;
    AGCCCATGACCACTAAGGACCTGCT
    >probe:HG-U133A:202355_s_at:122:373; Interrogation_Position = 1623; Antisense;
    GACAGTGAACGTGTTGGCCCAGATC
    >probe:HG-U133A:202355_s_at:427:171; Interrogation_Position = 1659; Antisense;
    CAACCCCGAGCGCAAGATGATCAAC
    >probe:HG-U133A:202355_s_at:689:635; Interrogation_Position = 1679; Antisense;
    TCAACGACAAAATGCACTTCTCCCT
    >probe:HG-U133A:202355_s_at:630:321; Interrogation_Position = 1715; Antisense;
    GCTTGGTCCAATACATGGCTCTGCC
    Probes sequences for 212932_at:
    >probe:HG-U133A:212932_at:544:679; Interrogation_Position = 2939; Antisense;
    TTTAGACTTGCAGGTGCCTTTTCAT
    >probe:HG-U133A:212932_at:136:425; Interrogation_Position = 2982; Antisense;
    GATTCTTCTAGCATTACTCGTTGGT
    >probe:HG-U133A:212932_at:186:547; Interrogation_Position = 3079; Antisense;
    GGCCCTGTCCAGTTGGGTGATCAGG
    >probe:HG-U133A:212932_at:262:87; Interrogation_Position = 3110; Antisense;
    ACCAGCATCGGAAAGACTTCCCAGC
    >probe:HG-U133A:212932_at:1:77; Interrogation_Position = 3132; Antisense;
    AGCACCAAGCTTGAGCTGTGTCGTT
    >probe:HG-U133A:212932_at:367:559; Interrogation_Position = 3148; Antisense;
    TGTGTCGTTTCGTGGAGGGGGCAGC
    >probe:HG-U133A:212932_at:246:501; Interrogation_Position = 3165; Antisense;
    GGGGCAGCGAGGATGGGCTTGAGCT
    >probe:HG-U133A:212932_at:230:389; Interrogation_Position = 3185; Antisense;
    GAGCTGTTGAGAGATTTCTGCCCTA
    >probe:HG-U133A:212932_at:483:697; Interrogation_Position = 3200; Antisense;
    TTCTGCCCTAGAGATGGCCTTTGTA
    >probe:HG-U133A:212932_at:73:97; Interrogation_Position = 3275; Antisense;
    ACTGGCAGGACGGTGTTCATCGCAT
    >probe:HG-U133A:212932_at:352:387; Interrogation_Position = 3310; Antisense;
    GACCAGCCTCTAGGCTAGCGGCTGC
    Probes sequences for 212041_at:
    >probe:HG-U133A:212041_at:213:481; Interrogation_Position = 1104; Antisense;
    GTGTCGCAACATCGTGTGGATCGCT
    >probe:HG-U133A:212041_at:431:485; Interrogation_Position = 1119; Antisense;
    GTGGATCGCTGAATGTATCGCCCAG
    >probe:HG-U133A:212041_at:540:181; Interrogation_Position = 1147; Antisense;
    CACCGCGCCAAAATCGACAACTACA
    >probe:HG-U133A:212041_at:166:117; Interrogation_Position = 1157; Antisense;
    AAATCGACAACTACATCCCTATCTT
    >probe:HG-U133A:212041_at:424:543; Interrogation_Position = 1199; Antisense;
    GGCTCTCAATTGCACTCTTTGTGTG
    >probe:HG-U133A:212041_at:525:479; Interrogation_Position = 1259; Antisense;
    GTGTATGTGGTCTGTGACAAGCCTG
    >probe:HG-U133A:212041_at:164:239; Interrogation_Position = 1270; Antisense;
    CTGTGACAAGCCTGTGGCTCACCTG
    >probe:HG-U133A:212041_at:537:181; Interrogation_Position = 1397; Antisense;
    CACCAAGGATGGACGAAGACCCCCT
    >probe:HG-U133A:212041_at:143:635; Interrogation_Position = 1441; Antisense;
    TCAGCCCTGTGGTTACAGCCGCTGA
    >probe:HG-U133A:212041_at:515:109; Interrogation_Position = 1455; Antisense;
    ACAGCCGCTGATGTATCTAAGAAGC
    >probe:HG-U133A:212041_at:303:257; Interrogation_Position = 1655; Antisense;
    CCCTGCCCCTCTGAGACAATAAAAC
    Probes sequences for 221932_s_at:
    >probe:HG-U133A:221932_s_at:429:75; Interrogation_Position = 89; Antisense;
    AGCAGAGAGGGAGCCGTTCATGTCA
    >probe:HG-U133A:221932_s_at:382:437; Interrogation_Position = 104; Antisense;
    GTTCATGTCAGAGACTCACTGCCAG
    >probe:HG-U133A:221932_s_at:476:179; Interrogation_Position = 120; Antisense;
    CACTGCCAGAAAAGCCTTACCCATT
    >probe:HG-U133A:221932_s_at:375:707; Interrogation_Position = 157; Antisense;
    TTGAGACCGCAACTGCTTGCACTGA
    >probe:HG-U133A:221932_s_at:337:1; Interrogation_Position = 208; Antisense;
    TTTTAGTTGGTCTGGTGTTCGGGCT
    >probe:HG-U133A:221932_s_at:322:455; Interrogation_Position = 329; Antisense;
    GTCACTTATTCTTTGCCTGATTCAG
    >probe:HG-U133A:221932_s_at:519:343; Interrogation_Position = 373; Antisense;
    GAATCATTATTCATGACCCCTCTGC
    >probe:HG-U133A:221932_s_at:678:383; Interrogation_Position = 387; Antisense;
    GACCCCTCTGCAAATGTGTCAGTCT
    >probe:HG-U133A:221932_s_at:141:465; Interrogation_Position = 408; Antisense;
    GTCTCCAAAGAGAGTATCTCCCCCC
    >probe:HG-U133A:221932_s_at:623:25; Interrogation_Position = 423; Antisense;
    ATCTCCCCCCAAATTTTGTGTAGCT
    >probe:HG-U133A:221932_s_at:75:569; Interrogation_Position = 535; Antisense;
    TGTGCATGCCTGAGTTGATTCCGAA
    Probes sequences for 215208_x_at:
    >probe:HG-U133A:215208_x_at:526:465; Interrogation_Position = 1958; Antisense;
    GTCCAGGCTAAGGTGCAATGGCACG
    >probe:HG-U133A:215208_x_at:583:471; Interrogation_Position = 1970; Antisense;
    GTGCAATGGCACGATGGCTCACGCC
    >probe:HG-U133A:215208_x_at:339:315; Interrogation_Position = 2138; Antisense;
    GCTACTGGGGAAGCTGAGGTTGCAG
    >probe:HG-U133A:215208_x_at:352:491; Interrogation_Position = 2195; Antisense;
    GGGCAACAGAGCAACACTTTGTCAA
    >probe:HG-U133A:215208_x_at:710:461; Interrogation_Position = 2234; Antisense;
    GTCTACAGCATAAACATCTTTAGGC
    >probe:HG-U133A:215208_x_at:635:125; Interrogation_Position = 2265; Antisense;
    AAAAACTAATTTTGGCCTGGTGCAG
    >probe:HG-U133A:215208_x_at:516:265; Interrogation_Position = 2280; Antisense;
    CCTGGTGCAGTGACTTGCGCTATAA
    >probe:HG-U133A:215208_x_at:258:471; Interrogation_Position = 2289; Antisense;
    GTGACTTGCGCTATAATCCCAGCAC
    >probe:HG-U133A:215208_x_at:81:69; Interrogation_Position = 2358; Antisense;
    AGACCAGAGTGACCAACACTGTGAA
    >probe:HG-U133A:215208_x_at:619:251; Interrogation_Position = 2440; Antisense;
    CCCACCTCACTCGGAGGCTGAGGCA
    >probe:HG-U133A:215208_x_at:66:585; Interrogation_Position = 2476; Antisense;
    TGAACCCGGGAAGCAGAGATTACAG
    Probes sequences for 205684_s_at:
    >probe:HG-U133A:205684_s_at:583:349; Interrogation_Position = 2138: Antisense;
    GAAGTAACTCTTGGGGACAATATAT
    >probe:HG-U133A:205684_s_at:77:321; Interrogation_Position = 2183; Antisense;
    GCATTACCTTGAAATATGAAGTGCC
    >probe:HG-U133A:205684_s_at:394:55; Interrogation_Position = 2202; Antisense;
    AGTGCCATTTGAATGTCCCAGGGCT
    >probe:HG-U133A:205684_s_at:611:291; Interrogation_Position = 2205; Antisense;
    GCCATTTGAATGTCCCAGGGCTTAT
    >probe:HG-U133A:205684_s_at:312:703; Interrogation_Position = 2210; Antisense;
    TTGAATGTCCCAGGGCTTATTAATA
    >probe:HG-U133A:205684_s_at:259:147; Interrogation_Position = 2238; Antisense;
    AAGATTTTCAACCCCTGAACTGCTT
    >probe:HG-U133A:205684_s_at:144:621; Interrogation_Position = 2264; Antisense;
    TCTGCCTCTGTGGAAAACTACTTTG
    >probe:HG-U133A:205684_s_at:350:363; Interrogation_Position = 2276; Antisense;
    GAAAACTACTTTGGGATTCTTCAGT
    >probe:HG-U133A:205684_s_at:350:653; Interrogation_Position = 2282; Antisense;
    TACTTTGGGATTCTTCAGTATTTGT
    >probe:HG-U133A:205684_s_at:671:7; Interrogation_Position = 2337; Antisense;
    ATTCATTCTAGGCATTGTTTATATT
    >probe:HG-U133A:205684_s_at:242:217; Interrogation_Position = 2344; Antisense;
    CTAGGCATTGTTTATATTTGAAGTT
    Probes sequences for 208082_x_at:
    >probe:HG-U133A:208082_x_at:578:269; Interrogation_Position = 3103; Antisense;
    CCTTGGTTTAAGGCTGGGCGCGGTG
    >probe:HG-U133A:208082_x_at:391:305; Interrogation_Position = 3165; Antisense;
    GCGGCCGGATCACAAGGCCAGGAGA
    >probe:HG-U133A:208082_x_at:112:29; Interrogation_Position = 3189; Antisense;
    ATCGAGACCATCCTGTGAATGGCGA
    >probe:HG-U133A:208082_x_at:533:567; Interrogation_Position = 3202; Antisense;
    TGTGAATGGCGAAACCCTGTCTCTA
    >probe:HG-U133A:208082_x_at:378:411; Interrogation_Position = 3319; Antisense;
    GAGGCGGAGCTTGCAATGAACTGAG
    >probe:HG-U133A:208082_x_at:430:231; Interrogation_Position = 3339; Antisense;
    CTGAGAATGTGCCACTGGACTCCCA
    >probe:HG-U133A:208082_x_at:334:669; Interrogation_Position = 3494; Antisense;
    TATAGCAGTACTAGCTCTGTTTATA
    >probe:HG-U133A:208082_x_at:424:123; Interrogation_Position = 3552; Antisense;
    AAACTACCTGGTTATCCCATATATT
    >probe:HG-U133A:208082_x_at:379:153; Interrogation_Position = 3612; Antisense;
    AATGTTTTACGATCTAGCCTTTCCA
    >probe:HG-U133A:208082_x_at:592:291; Interrogation_Position = 3628; Antisense;
    GCCTTTCCAGTATAGGCACTTCCTG
    >probe:HG-U133A:208082_x_at:517:333; Interrogation_Position = 3643; Antisense;
    GCACTTCCTGAAAAACCTTTGTCCT
    Probes sequences for 214153_at:
    >probe:HG-U133A:214153_at:128:707; Interrogation_Position = 851; Antisense;
    TTGTTGCCAGTGCCTGGTAGCCATA
    >probe:HG-U133A:214153_at:434:451; Interrogation_Position = 867; Antisense;
    GTAGCCATAAGGCCTGCTGATCTGG
    >probe:HG-U133A:214153_at:135:147; Interrogation_Position = 981; Antisense;
    AAGTCATATACTTCTGGGCTTCATA
    >probe:HG-U133A:214153_at:646:487; Interrogation_Position = 996; Antisense;
    GGGCTTCATAGTTTATGGCTGCACT
    >probe:HG-U133A:214153_at:526:41; Interrogation_Position = 1010; Antisense;
    ATGGCTGCACTGTGGGACATACACC
    >probe:HG-U133A:214153_at:296:491; Interrogation_Position = 1023; Antisense;
    GGGACATACACCAGCTACAAGGAAG
    >probe:HG-U133A:214153_at:671:507; Interrogation_Position = 1054; Antisense;
    GGATAGGACCCGATGAAGCTGAAAT
    >probe:HG-U133A:214153_at:204:211; Interrogation_Position = 1096; Antisense;
    CTTATAGCAGCATAGCACTTTGTTC
    >probe:HG-U133A:214153_at:384:367; Interrogation_Position = 1208; Antisense;
    GAAAGACCAATTTGGCCACATAACC
    >probe:HG-U133A:214153_at:469:267; Interrogation_Position = 1233; Antisense;
    CCTGTCATCTGCCATCCAGATACTA
    >probe:HG-U133A:214153_at:350:297; Interrogation_Position = 1321; Antisense;
    GCCTGTGATTCAGTTAACTTACCTC
    Probes sequences for 210556_at:
    >probe:HG-U133A:210556_at:58:509; Interrogation_Position = 3142; Antisense;
    GGTAAGTTCATCTCTGATATGTTCT
    >probe:HG-U133A:210556_at:462:437; Interrogation_Position = 3147; Antisense;
    GTTCATCTCTGATATGTTCTTGAAG
    >probe:HG-U133A:210556_at:51:27; Interrogation_Position = 3151; Antisense;
    ATCTCTGATATGTTCTTGAAGTAGT
    >probe:HG-U133A:210556_at:385:427; Interrogation_Position = 3179; Antisense;
    GATTCAGGGACTTTATTCTCCCAAG
    >probe:HG-U133A:210556_at:462:627; Interrogation_Position = 3182; Antisense;
    TCAGGGACTTTATTCTCCCAAGTGT
    >probe:HG-U133A:210556_at:634:377; Interrogation_Position = 3187; Antisense;
    GACTTTATTCTCCCAAGTGTCATGA
    >probe:HG-U133A:210556_at:352:59; Interrogation_Position = 3215; Antisense;
    AGTTTCTATGGATTGCTTATTGGCA
    >probe:HG-U133A:210556_at:59:671; Interrogation_Position = 3232; Antisense;
    TATTGGCATATGGTTGGGCTTTTAA
    >probe:HG-U133A:210556_at:467:153; Interrogation_Position = 3283; Antisense;
    AATATATAACTTTGCCAGGTACCAC
    >probe:HG-U133A:210556_at:534:645; Interrogation_Position = 3289; Antisense;
    TAACTTTGCCAGGTACCACGGCTCA
    >probe:HG-U133A:210556_at:260:49; Interrogation_Position = 3503; Antisense;
    AGGTGGCAGTTGCAGGGAGCTAAGA
    Probes sequences for 217679_x_at:
    >probe:HG-U133A:217679_x_at:425:333; Interrogation_Position = 36; Antisense;
    GCACTTTGGGAAGCTGAGGCGGTGG
    >probe:HG-U133A:217679_x_at:665:95; Interrogation_Position = 38; Antisense;
    ACTTTGGGAAGCTGAGGCGGTGGAT
    >probe:HG-U133A:217679_x_at:207:213; Interrogation_Position = 39; Antisense;
    CTTTGGGAAGCTGAGGCGGTGGATC
    >probe:HG-U133A:217679_x_at:378:707; Interrogation_Position = 41; Antisense;
    TTGGGAAGCTGAGGCGGTGGATCAC
    >probe:HG-U133A:217679_x_at:49:565; Interrogation_Position = 42; Antisense;
    TGGGAAGCTGAGGCGGTGGATCACC
    >probe:HG-U133A:217679_x_at:55:237; Interrogation_Position = 49; Antisense;
    CTGAGGCGGTGGATCACCTGAGCTC
    >probe:HG-U133A:217679_x_at:41:579; Interrogation_Position = 50; Antisense;
    TGAGGCGGTGGATCACCTGAGCTCA
    >probe:HG-U133A:217679_x_at:174:267; Interrogation_Position = 258; Antisense;
    CCTGGGCGACAAAGCAAGACTCTTT
    >probe:HG-U133A:217679_x_at:238:237; Interrogation_Position = 259; Antisense;
    CTGGGCGACAAAGCAAGACTCTTTC
    >probe:HG-U133A:217679_x_at:689:563; Interrogation_Position = 260; Antisense;
    TGGGCGACAAAGCAAGACTCTTTCT
    >probe:HG-U133A:217679_x_at:260:375; Interrogation_Position = 319; Antisense;
    GAGACAGAAAAGATGGGTTTTTGTT
    Probes sequences for 207730_x_at:
    >probe:HG-U133A:207730_x_at:465:639; Interrogation_Position = 1307; Antisense;
    TCAAGGGGTTGCTCAGATGGGCCGG
    >probe:HG-U133A:207730_x_at:267:595; Interrogation_Position = 1316; Antisense;
    TGCTCAGATGGGCCGGGCATGGTGG
    >probe:HG-U133A:207730_x_at:701:455; Interrogation_Position = 1350; Antisense;
    GTAACCTCAGCACTGTGGGAGGCCA
    >probe:HG-U133A:207730_x_at:135:519; Interrogation_Position = 1367; Antisense;
    GGAGGCCAAGGGGGCAGATCACTTC
    >probe:HG-U133A:207730_x_at:115:61; Interrogation_Position = 1382; Antisense;
    AGATCACTTCAGGTCGGGAGTTCCA
    >probe:HG-U133A:207730_x_at:4:191; Interrogation_Position = 1405; Antisense;
    CAGACCAGCCTGTTCAACATGGCGA
    >probe:HG-U133A:207730_x_at:125:173; Interrogation_Position = 1419; Antisense;
    CAACATGGCGAAACCCCATTCTACC
    >probe:HG-U133A:207730_x_at:228:381; Interrogation_Position = 1495; Antisense;
    GACTGAGGAGGGGTCACCTGAGGTC
    >probe:HG-U133A:207730_x_at:78:517; Interrogation_Position = 1521; Antisense;
    GGATGTCAAGATCAGACTGGCCAAC
    >probe:HG-U133A:207730_x_at:585:287; Interrogation_Position = 1540; Antisense;
    GCCAACAGAATGAAACCCTGTCTCT
    >probe:HG-U133A:207730_x_at:227:69; Interrogation_Position = 1714; Antisense;
    AGAAAGTTAGCCGGGCGTGGCACCT
    Probes sequences for 214800_x_at:
    >probe:HG-U133A:214800_x_at:593:335; Interrogation_Position = 844; Antisense;
    GCAAACTGAATTGAGTCAACTTCTG
    >probe:HG-U133A:214800_x_at:342:73; Interrogation_Position = 883; Antisense;
    AGAAGTTACTGGGAGCTGCTATTTT
    >probe:HG-U133A:214800_x_at:221:57; Interrogation_Position = 959; Antisense;
    AGATCTCTAATATTTTTAAGCCCAA
    >probe:HG-U133A:214800_x_at:26:207; Interrogation_Position = 988; Antisense;
    CTTGGACACTGCAGCTCTTTTCAGT
    >probe:HG-U133A:214800_x_at:483:221; Interrogation_Position = 1002; Antisense;
    CTCTTTTCAGTTTTTGCTTATACAC
    >probe:HG-U133A:214800_x_at:614:111; Interrogation_Position = 1023; Antisense;
    ACACAATTCATTCTTTGCAGCTAAT
    >probe:HG-U133A:214800_x_at:70:689; Interrogation_Position = 1047; Antisense;
    TTAAGCCGAAGAAGCCTGGGAATCA
    >probe:HG-U133A:214800_x_at:390:435; Interrogation_Position = 1095; Antisense;
    GTTCTTTGCCTAGTATACAGTTTTA
    >probe:HG-U133A:214800_x_at:39:681; Interrogation_Position = 1124; Antisense;
    TTTATTTCATTGACACCGATCTGTA
    >probe:HG-U133A:214800_x_at:71:207; Interrogation_Position = 1131; Antisense;
    CATTGACACCGATCTGTACACAGTA
    >probe:HG-U133A:214800_x_at:545:161; Interrogation_Position = 1192; Antisense;
    AATATGGCTGATAACCTTTGGAATT
    Probes sequences for 202901_x_at:
    >probe:HG-U133A:202901_x_at:2:429; Interrogation_Position = 1174; Antisense;
    GATTAATGTGTATTTACTGTACTAA
    >probe:HG-U133A:202901_x_at:226:159; Interrogation_Position = 1205; Antisense;
    AATATAGTTTGAGGCCGGGCACGGT
    >probe:HG-U133A:202901_x_at:674:667; Interrogation_Position = 1519; Antisense;
    TATAGTTTGATTCTTCATTTTTTTA
    >probe:HG-U133A:202901_x_at:686:155; Interrogation_Position = 1552; Antisense;
    AATCTCAGGATAAAGTTTGCTAAGT
    >probe:HG-U133A:202901_x_at:55:437; Interrogation_Position = 1616; Antisense;
    GTTCAACCTAAAACAATCTGTAATT
    >probe:HG-U133A:202901_x_at:395:27; Interrogation_Position = 1631; Antisense;
    ATCTGTAATTGCTTATTGTTTTATT
    >probe:HG-U133A:202901_x_at:315:681; Interrogation_Position = 1650; Antisense;
    TTTATTGTATACTCTTTGTCTTTTA
    >probe:HG-U133A:202901_x_at:31:569; Interrogation_Position = 1666; Antisense;
    TGTCTTTTAAGACCCCTAATAGCCT
    >probe:HG-U133A:202901_x_at:677:253; Interrogation_Position = 1679; Antisense;
    CCCTAATAGCCTTTTGTAACTTGAT
    >probe:HG-U133A:202901_x_at:661:293; Interrogation_Position = 1687; Antisense;
    GCCTTTTGTAACTTGATGGCTTAAA
    >probe:HG-U133A:202901_x_at:252:83; Interrogation_Position = 1709; Antisense;
    AAAAATACTTAATAAATCTGCCATT
    Probes sequences for 220856_x_at:
    >probe:HG-U133A:220856_x_at:281:647; Interrogation_Position = 861; Antisense;
    TAAAAGCACAGACTTCAGGCCAGGC
    >probe:HG-U133A:220856_x_at:264:693; Interrogation_Position = 874; Antisense;
    TTCAGGCCAGGCACAATGACTCACA
    >probe:HG-U133A:220856_x_at:625:485; Interrogation_Position = 929; Antisense;
    GGGCTGATTCAGCCCCAGGAGTTCA
    >probe:HG-U133A:220856_x_at:309:9; Interrogation_Position = 1006; Antisense;
    ATTAGCCGGCAGTGGTGGCACATGC
    >probe:HG-U133A:220856_x_at:60:195; Interrogation_Position = 1034; Antisense;
    CAGTCCCAGCTACTCGAGAGGATGA
    >probe:HG-U133A:220856_x_at:575:279; Interrogation_Position = 1177; Antisense;
    CCAGCTGGGCATGGTGACTCACACT
    >probe:HG-U133A:220856_x_at:462:287; Interrogation_Position = 1224; Antisense;
    GCCAAGGAGGGCAGGTCATGAGGTC
    >probe:HG-U133A:220856_x_at:109:551; Interrogation_Position = 1268; Antisense;
    GGCCAATTTGTACCTGCAGGCCTCC
    >probe:HG-U133A:220856_x_at:114:615; Interrogation_Position = 1321; Antisense;
    TCGACCGCGATGATGTGGCTCTGGA
    >probe:HG-U133A:220856_x_at:272:617; Interrogation_Position = 1340; Antisense;
    TCTGGAAGGCGTGAGCCACTTCTTC
    >probe:HG-U133A:220856_x_at:457:211; Interrogation_Position = 1358; Antisense;
    CTTCTTCCGCGAATTGGCCGAGGAG
    Probes sequences for 208137_x_at:
    >probe:HG-U133A:208137_x_at:470:399; Interrogation_Position = 597; Antisense;
    GAGTTCAAGCCTTTATTGACATTCA
    >probe:HG-U133A:208137_x_at:273:145; Interrogation_Position = 634; Antisense;
    AAGAGGATGGGGCCAGGTGTGGTGG
    >probe:HG-U133A:208137_x_at:323:181; Interrogation_Position = 678; Antisense;
    CACGTCGGGAGGCCAAGGCACATAG
    >probe:HG-U133A:208137_x_at:628:535; Interrogation_Position = 694; Antisense;
    GGCACATAGGTCACTTGAGGTCAGG
    >probe:HG-U133A:208137_x_at:461:87; Interrogation_Position = 727; Antisense;
    ACCAGCATGGCCAACAGATGTGAGC
    >probe:HG-U133A:208137_x_at:429:63; Interrogation_Position = 742; Antisense;
    AGATGTGAGCCACTTTTCTCAGCCT
    >probe:HG-U133A:208137_x_at:653:695; Interrogation_Position = 757; Antisense;
    TTCTCAGCCTGTTTTTTGTTTCTTA
    >probe:HG-U133A:208137_x_at:76:423; Interrogation_Position = 880; Antisense;
    GATATGAGGCCAGGCACGGAGGCTC
    >probe:HG-U133A:208137_x_at:667:395; Interrogation_Position = 965; Antisense;
    GAGACTTTCCTGGCTAACAGGGTGA
    >probe:HG-U133A:208137_x_at:613:465; Interrogation_Position = 1048; Antisense;
    GTCCCAGCTACTCCAGATGCTGAGG
    >probe:HG-U133A:208137_x_at:75:695; Interrogation_Position = 1105; Antisense;
    TTGCAGGGAGTCGAGATCGCGCCAC
    Probes sequences for 215620_at:
    >probe:HG-U133A:215620_at:430:69; Interrogation_Position = 1369; Antisense;
    AGACATCCCTCCTTGTTATGTCTTA
    >probe:HG-U133A:215620_at:649:253; Interrogation_Position = 1375; Antisense;
    CCCTCCTTGTTATGTCTTACTTTGT
    >probe:HG-U133A:215620_at:51:687; Interrogation_Position = 1384; Antisense;
    TTATGTCTTACTTTGTTGCTGTTGC
    >probe:HG-U133A:215620_at:313:223; Interrogation_Position = 1455; Antisense;
    CTCTACTCCGACTGCAGTTGTAGAA
    >probe:HG-U133A:215620_at:100:539; Interrogation_Position = 1483; Antisense;
    GGCAGAGAAGGTATGACACCCCCTC
    >probe:HG-U133A:215620_at:501:611; Interrogation_Position = 1506; Antisense;
    TCCCCCATTATACCTCAGAAAGAAC
    >probe:HG-U133A:215620_at:181:559; Interrogation_Position = 1546; Antisense;
    TGAGTACGACTGGGAATTTGTTTTT
    >probe:HG-U133A:215620_at:450:671; Interrogation_Position = 1562; Antisense;
    TTTGTTTTTTCCACTATCTCTAGGA
    >probe:HG-U133A:215620_at:59:441; Interrogation_Position = 1594; Antisense;
    GTTTTGTAGAATCCATATCCTTAAT
    >probe:HG-U133A:215620_at:282:67; Interrogation_Position = 1638; Antisense;
    AGAGCATACCTAGTTCTCAGACCCA
    >probe:HG-U133A:215620_at:64:437; Interrogation_Position = 1650; Antisense;
    GTTCTCAGACCCAATTCCATAGATT
    Probes sequences for 221648_s_at:
    >probe:HG-U133A:221648_s_at:632:331; Interrogation_Position = 2363; Antisense;
    GCAGCTGGGCTCTTAATGTTACTTA
    >probe:HG-U133A:221648_s_at:554:99; Interrogation_Position = 2383; Antisense;
    ACTTATTGTAACTCACTTCCTGTAA
    >probe:HG-U133A:221648_s_at:592:455; Interrogation_Position = 2404; Antisense;
    GTAAGACTCAATCTACTCTTCAAGT
    >probe:HG-U133A:221648_s_at:25:627; Interrogation_Position = 2420; Antisense;
    TCTTCAAGTTGATGTACCCTCTATG
    >probe:HG-U133A:221648_s_at:278:453; Interrogation_Position = 2433; Antisense;
    GTACCCTCTATGACTGTAATTCCTG
    >probe:HG-U133A:221648_s_at:300:239; Interrogation_Position = 2473; Antisense;
    CTGTGGTGATCTTAAGCGTGTCTTG
    >probe:HG-U133A:221648_s_at:283:305; Interrogation_Position = 2488; Antisense;
    GCGTGTCTTGAGTTCCATGCAAATT
    >probe:HG-U133A:221648_s_at:206:481; Interrogation_Position = 2529; Antisense;
    GTGTCCATAATCAAATCATCATCTT
    >probe:HG-U133A:221648_s_at:300:147; Interrogation_Position = 2561; Antisense;
    AAGGGCTACCCCATAATTATCAGAC
    >probe:HG-U133A:221648_s_at:86:627; Interrogation_Position = 2602; Antisense;
    TCTTTTTCCATTCTGTAATCTTAAA
    >probe:HG-U133A:221648_s_at:72:635; Interrogation_Position = 2633; Antisense;
    TCAGCTTCAGTGCCTGTTCACAATA
    Probes sequences for 215001_s_at:
    >probe:HG-U133A:215001_s_at:698:545; Interrogation_Position = 4497; Antisense;
    GGCTGGTGAACTTGAGCATGTTACT
    >probe:HG-U133A:215001_s_at:518:17; Interrogation_Position = 4557; Antisense;
    ATAGCATGTCACTAAAGCAGGCCTT
    >probe:HG-U133A:215001_s_at:208:107; Interrogation_Position = 4661; Antisense;
    ACAGAATTGCTTGTTTGCTTCAACT
    >probe:HG-U133A:215001_s_at:125:107; Interrogation_Position = 4718; Antisense;
    ACAGGGCTGGAGTCAAAACACTTGT
    >probe:HG-U133A:215001_s_at:481:11; Interrogation_Position = 4844; Antisense;
    ATTTTCTGGTTCTTGTGTTGGCTGT
    >probe:HG-U133A:215001_s_at:19:437; Interrogation_Position = 4852; Antisense;
    GTTCTTGTGTTGGCTGTGGCAGGCC
    >probe:HG-U133A:215001_s_at:580:677; Interrogation_Position = 4886; Antisense;
    TTTCTTTTGCCATGACAACTTCTAA
    >probe:HG-U133A:215001_s_at:522:371; Interrogation_Position = 4899; Antisense;
    GACAACTTCTAATTGCCATGTACAG
    >probe:HG-U133A:215001_s_at:409:457; Interrogation_Position = 4934; Antisense;
    GTCAAATAACTCCTCATTGTAAACA
    >probe:HG-U133A:215001_s_at:111:113; Interrogation_Position = 4956; Antisense;
    ACAAACTGTGTAACTGCCCAAAGCA
    >probe:HG-U133A:215001_s_at:373:331; Interrogation_Position = 4978; Antisense;
    GCAGCACTTATAAATCAGCCTAACA
    Probes sequences for 222168_at:
    >probe:HG-U133A:222168_at:319:163: Interrogation_Position = 5797; Antisense;
    AATTCACTTATTTTCATGGGTTCTG
    >probe:HG-U133A:222168_at:278:385; Interrogation_Position = 5843; Antisense;
    GACCTAAAAATCTCAATGGCACCTA
    >probe:HG-U133A:222168_at:498:217; Interrogation_Position = 5846; Antisense;
    CTAAAAATCTCAATGGCACCTACCA
    >probe:HG-U133A:222168_at:142:219; Interrogation_Position = 5883; Antisense;
    CTAACACATAGTAGGGGCTCTACAT
    >probe:HG-U133A:222168_at:418:449; Interrogation_Position = 5893; Antisense;
    GTAGGGGCTCTACATGATTATCTGT
    >probe:HG-U133A:222168_at:548:543; Interrogation_Position = 5898; Antisense;
    GGCTCTACATGATTATCTGTAATTG
    >probe:HG-U133A:222168_at:8:519; Interrogation_Position = 5935; Antisense;
    GGAGGTGATGGTTTTCCTATTTGAT
    >probe:HG-U133A:222168_at:12:339; Interrogation_Position = 6004; Antisense;
    GCAACATAGTTCTAAGCCAGTGGTT
    >probe:HG-U133A:222168_at:82:559; Interrogation_Position = 6024; Antisense;
    TGGTTCTCCAACTTGAGTATGCATC
    >probe:HG-U133A:222168_at:219:621; Interrogation_Position = 6028; Antisense:
    TCTCCAACTTGAGTATGCATCAGAA
    >probe:HG-U133A:222168_at:183:129; Interrogation_Position = 6071; Antisense;
    AAAATACAGATGTCTGCCGGGCGCG
    Probes sequences for 212297_at:
    >probe:HG-U133A:212297_at:420:435; Interrogation_Position = 1883; Antisense;
    GTTCCCCATGTTTATGAAAGTCCTG
    >probe:HG-U133A:212297_at:329:113; Interrogation_Position = 2014; Antisense;
    AAATATTCATGCATGCAATTTTGAC
    >probe:HG-U133A:212297_at:23:565; Interrogation_Position = 2058; Antisense;
    TGTATATTTATGGTGGGAGGTGGTT
    >probe:HG-U133A:212297_at:25:163; Interrogation_Position = 2109; Antisense;
    AATTTTTGTACAGTCTGTGGGCATT
    >probe:HG-U133A:212297_at:127:465; Interrogation_Position = 2121; Antisense;
    GTCTGTGGGCATTTACACATTTTTA
    >probe:HG-U133A:212297_at:391:147; Interrogation_Position = 2188; Antisense;
    AAGTTACTTCTAGTTATGATTTGTG
    >probe:HG-U133A:212297_at:517:423; Interrogation_Position = 2205; Antisense;
    GATTTGTGAATTCCCTAAGACCTTG
    >probe:HG-U133A:212297_at:619:155; Interrogation_Position = 2257; Antisense;
    AATGATACTGCATCTTTATATTTTT
    >probe:HG-U133A:212297_at:516:113; Interrogation_Position = 2283; Antisense;
    AAATTGTATTGCTGCTCAAGAATGG
    >probe:HG-U133A:212297_at:38:19; Interrogation_Position = 2301; Antisense;
    AGAATGGTACCCTCTTGTCAAAAAG
    >probe:HG-U133A:212297_at:199:205; Interrogation_Position = 2331; Antisense;
    CATTCATAATTGTACATTCAGCATT
    Probes sequences for 215604_x_at:
    >probe:HG-U133A:215604_x_at:453:357; Interrogation_Position = 2315; Antisense;
    GAACCCAGGAAGCGGAAGTTGCAGT
    >probe:HG-U133A:215604_x_at:518:373; Interrogation_Position = 2375; Antisense;
    GACAGAGGGAGACTCCGTCTCCAAA
    >probe:HG-U133A:215604_x_at:484:89; Interrogation_Position = 2406; Antisense;
    AAAGTTCAGGCCGGGCACGGTAGCT
    >probe:HG-U133A:215604_x_at:393:545; Interrogation_Position = 2458; Antisense;
    GGCTGATGCAGGTGGATCACCGGAG
    >probe:HG-U133A:215604_x_at:171:181; Interrogation_Position = 2475; Antisense;
    CACCGGAGGTCAGGAACTAGAAATC
    >probe:HG-U133A:215604_x_at:242:531; Interrogation_Position = 2487; Antisense;
    GGAACTAGAAATCAGCCTGGCCAAC
    >probe:HG-U133A:215604_x_at:330:687; Interrogation_Position = 2547; Antisense;
    TTAGACAGGTGTGGTGGTGCGTGCC
    >probe:HG-U133A:215604_x_at:130:351; Interrogation_Position = 2597; Antisense;
    GAAGCAGAAAAATCACTTGAACCCA
    >probe:HG-U133A:215604_x_at:329:581; Interrogation_Position = 2644; Antisense;
    TGAGATCGTGCCATTGCATTCCAGC
    >probe:HG-U133A:215604_x_at:391:245; Interrogation_Position = 2674; Antisense;
    CGACAAGACTGAAACTCTATCCCCC
    >probe:HG-U133A:215604_x_at:17:645; Interrogation_Position = 2797; Antisense;
    TAAGAAACTATATTACCAGAGCAAG
    Probes sequences for 209770_at:
    >probe:HG-U133A:209770_at:45:515; Interrogation_Position = 2874; Antisense;
    GGAAATTTGGATGAAGGGAGCTAGA
    >probe:HG-U133A:209770_at:463:57; Interrogation_Position = 3087; Antisense;
    AGTACAGATGAGGGTTCACTATGTT
    >probe:HG-U133A:209770_at:565:181; Interrogation_Position = 3191; Antisense;
    CACCGGGTGACCGGCTTACAGGGAT
    >probe:HG-U133A:209770_at:17:447; Interrogation_Position = 3198; Antisense;
    TGACCGGCTTACAGGGATATTTTTA
    >probe:HG-U133A:209770_at:3:691; Interrogation_Position = 3220; Antisense;
    TTAATCCCGTTATGGACTCTGTCTC
    >probe:HG-U133A:209770_at:152:465; Interrogation_Position = 3240; Antisense;
    GTCTCCAGGAGAGGGGTCTATCCAC
    >probe:HG-U133A:209770_at:493:215; Interrogation_Position = 3257; Antisense;
    CTATCCACCCCTGCTCATTGGTGGA
    >probe:HG-U133A:209770_at:329:283; Interrogation_Position = 3261; Antisense;
    CCACCCCTGCTCATTGGTGGATGTT
    >probe:HG-U133A:209770_at:69:531; Interrogation_Position = 3319; Antisense;
    GGAAAAACTACTCCTCATTATCATC
    >probe:HG-U133A:209770_at:34:25; Interrogation_Position = 3341; Antisense;
    ATCATTATTATTGCTCTCCACTGTA
    >probe:HG-U133A:209770_at:3:223; Interrogation_Position = 3370; Antisense;
    CTCTACCTGGCATGTGCTTGTCAAG
    Probes sequences for 205010_at:
    >probe:HG-U133A:205010_at:243:311; Interrogation_Position = 1769; Antisense;
    GCTCTGGCATCTGCCCTGAAAAATA
    >probe:HG-U133A:205010_at:201:365; Interrogation_Position = 1807; Antisense;
    GAAACGTGCAGATAAAATCGCCAGC
    >probe:HG-U133A:205010_at:200:61; Interrogation_Position = 1816; Antisense;
    AGATAAAATCGCCAGCAAGCTGTCT
    >probe:HG-U133A:205010_at:96:29; Interrogation_Position = 1823; Antisense;
    ATCGCCAGCAAGCTGTCTGATTCCA
    >probe:HG-U133A:205010_at:90:413; Interrogation_Position = 1883; Antisense;
    GATGGTGTTGGTGACTAATCGACTG
    >probe:HG-U133A:205010_at:680:503; Interrogation_Position = 1886; Antisense;
    GGTGTTGGTGACTAATCGACTGATC
    >probe:HG-U133A:205010_at:127:475; Interrogation_Position = 1893; Antisense;
    GTGACTAATCGACTGATCTCACTTC
    >probe:HG-U133A:205010_at:709:283; Interrogation_Position = 1927; Antisense;
    CCAAGCACCAGTTCCGGTGGTACGG
    >probe:HG-U133A:205010_at:523:451; Interrogation_Position = 1946; Antisense;
    GTACGGGGGAATACCAGTGAAATAG
    >probe:HG-U133A:205010_at:678:141; Interrogation_Position = 1985; Antisense;
    AAGCATCTGCATATTGAAAGAACGC
    >probe:HG-U133A:205010_at:455:3; Interrogation_Position = 1997; Antisense;
    ATTGAAAGAACGCTTTCCCCACTGT
    Probes sequences for 215179_x_at:
    >probe:HG-U133A:215179_x_at:330:387; Interrogation_Position = 2996; Antisense;
    GACCAACACGGTGAAACCACATCTC
    >probe:HG-U133A:215179_x_at:310:9; Interrogation_Position = 3036; Antisense;
    ATTAGCTGGGCGTGTTGGTGCATGC
    >probe:HG-U133A:215179_x_at:147:137; Interrogation_Position = 3196; Antisense;
    AAGAATTCCAAAAGCCCATGTGCTC
    >probe:HG-U133A:215179_x_at:267:255; Interrogation_Position = 3220; Antisense;
    CCCTTGCCCATTCTTAAGTTGACAT
    >probe:HG-U133A:215179_x_at:312:57; Interrogation_Position = 3268; Antisense;
    AGTCAAGCCAGGCTCAGTACCTCAT
    >probe:HG-U133A:215179_x_at:13:59; Interrogation_Position = 3283; Antisense;
    AGTACCTCATGCCTGTAATTCTAGC
    >probe:HG-U133A:215179_x_at:383:247; Interrogation_Position = 3325; Antisense;
    CGAGTGGATCGCTTGAGGCCAGGAG
    >probe:HG-U133A:215179_x_at:545:263; Interrogation_Position = 3343; Antisense;
    CCAGGAGTTCAAGCCTAGCCGGGGC
    >probe:HG-U133A:215179_x_at:218:297; Interrogation_Position = 3360; Antisense;
    GCCGGGGCAACATGGCGAAACCTCG
    >probe:HG-U133A:215179_x_at:309:99; Interrogation_Position = 3407; Antisense;
    ACTAGCTGGACGTGGTGCCATGTGC
    >probe:HG-U133A:215179_x_at:69:483; Interrogation_Position = 3461; Antisense;
    GTGGAATGATCACCTGAGCCTGGGA
    Probes sequences for 212052_s_at:
    >probe:HG-U133A:212052_s_at:45:313; Interrogation_Position = 3981; Antisense;
    GCTGCCTTAGACAGATTCCCTGGGC
    >probe:HG-U133A:212052_s_at:337:181; Interrogation_Position = 4058; Antisense;
    CACCTTCCTTACACCTGGTGGGAGC
    >probe:HG-U133A:212052_s_at:179:663; Interrogation_Position = 4214; Antisense;
    TATGTGGTATGGGGGTCATTCCTGC
    >probe:HG-U133A:212052_s_at:599:147; Interrogation_Position = 4281; Antisense;
    AAGTGATGGAACCCTCAGGTGCTCT
    >probe:HG-U133A:212052_s_at:235:81; Interrogation_Position = 4311; Antisense;
    AGCCTGAACCTCCTGACTGAGGAAC
    >probe:HG-U133A:212052_s_at:554:177; Interrogation_Position = 4375; Antisense;
    CACAGGCGTGGCTGTACACGTGCTC
    >probe:HG-U133A:212052_s_at:680:221; Interrogation_Position = 4404; Antisense;
    CTCATCATCCTTTCCAGTAACTTTA
    >probe:HG-U133A:212052_s_at:212:123; Interrogation_Position = 4429; Antisense;
    AAAAAACATCCCTCAGGTCCTGATA
    >probe:HG-U133A:212052_s_at:183:507; Interrogation_Position = 4444; Antisense;
    GGTCCTGATATATTTCCTTGGATTC
    >probe:HG-U133A:212052_s_at:505:709; Interrogation_Position = 4476; Antisense;
    TTGGCTAGAAATTACACTGTGCTCA
    >probe:HG-U133A:212052_s_at:38:569; Interrogation_Position = 4493; Antisense;
    TGTGCTCAATGCCTTAATAAATCCC
    Probes sequences for 216859_x_at:
    >probe:HG-U133A:216859_x_at:585:283; Interrogation_Position = 973; Antisense;
    CCAACAGCTCACTGAGAACGGGTCA
    >probe:HG-U133A:216859_x_at:411:109; Interrogation_Position = 976; Antisense;
    ACAGCTCACTGAGAACGGGTCATGA
    >probe:HG-U133A:216859_x_at:560:77; Interrogation_Position = 978; Antisense;
    AGCTCACTGAGAACGGGTCATGATG
    >probe:HG-U133A:216859_x_at:390:179; Interrogation_Position = 982; Antisense;
    CACTGAGAACGGGTCATGATGACGA
    >probe:HG-U133A:216859_x_at:390:395; Interrogation_Position = 1053; Antisense;
    GAGAAATCAGATTGTTGCTGTGTCT
    >probe:HG-U133A:216859_x_at:177:431; Interrogation_Position = 1062; Antisense;
    GATTGTTGCTGTGTCTGTGTAGAAA
    >probe:HG-U133A:216859_x_at:371:349; Interrogation_Position = 1087; Antisense;
    GAAGTAGACACAGGAGACTCCACTT
    >probe:HG-U133A:216859_x_at:136:373; Interrogation_Position = 1093; Antisense;
    GACACAGGAGACTCCACTTTGTTCT
    >probe:HG-U133A:216859_x_at:305:105; Interrogation_Position = 1096; Antisense;
    ACAGGAGACTCCACTTTGTTCTGTA
    >probe:HG-U133A:216859_x_at:610:67; Interrogation_Position = 1367; Antisense;
    AGACCTCTGTTCACTTGTTTATTTG
    >probe:HG-U133A:216859_x_at:218:619; Interrogation_Position = 1372; Antisense;
    TCTGTTCACTTGTTTATTTGCTGAC
    Probes sequences for 204205_at:
    >probe:HG-U133A:204205_at:187:301; Interrogation_Position = 1177; Antisense;
    GCCCGCATCTATGATGATCAAGGAA
    >probe:HG-U133A:204205_at:312:145; Interrogation_Position = 1200; AntisenSe;
    AAGATGTCAGGAGGGGCTGCGCACC
    >probe:HG-U133A:204205_at:82:87; Interrogation_Position = 1515; Antisense;
    ACCAGCAAAGCAATGCACTCCTGAC
    >probe:HG-U133A:204205_at:197:341; Interrogation_Position = 1524; Antisense;
    GCAATGCACTCCTGACCAAGTAGAT
    >probe:HG-U133A:204205_at:600:333; Interrogation_Position = 1529; Antisense;
    GCACTCCTGACCAAGTAGATTCTTT
    >probe:HG-U133A:204205_at:480:7; Interrogation_Position = 1559; Antisense;
    ATTAGAGTGCATTACTTTGAATCAA
    >probe:HG-U133A:204205_at:234:649; Interrogation_Position = 1606; Antisense;
    TAAAGTACTAAGATTGTGCTCAATA
    >probe:HG-U133A:204205_at:402:441; Interrogation_Position = 1640; Antisense;
    GTTTCAAACCTACTAATCCAGCGAC
    >probe:HG-U133A:204205_at:113:123; Interrogation_Position = 1645; Antisense;
    AAACCTACTAATCCAGCGACAATTT
    >probe:HG-U133A:204205_at:186:29; Interrogation_Position = 1655; Antisense;
    ATCCAGCGACAATTTGAATCGGTTT
    >probe:HG-U133A:204205_at:33:343; Interrogation_Position = 1670; Antisense;
    GAATCGGTTTTGTAGGTAGAGGAAT
    Probes sequences for 217588_at:
    >probe:HG-U133A:217588_at:114:245: Interrogation_Position = 116; Antisense;
    CGTCGACGTAAGCACTTCTCAGAGA
    >probe:HG-U133A:217588_at:171:561; Interrogation_Position = 208; Antisense;
    TGGGTGCCTTGGGGTGTATTTTACT
    >probe:HG-U133A:217588_at:36:139; Interrogation_Position = 236; Antisense;
    AACCTGTGTGTTTTCCACAGTCCCT
    >probe:HG-U133A:217588_at:445:231; Interrogation_Position = 259; Antisense;
    CTGCTTCCCGAGGTTGTGGTATTGG
    >probe:HG-U133A:217588_at:133:479; Interrogation_Position = 282; Antisense;
    GGTAGGGGTAACAGGCCAATCGGTG
    >probe:HG-U133A:217588_at:144:199; Interrogation_Position = 293; Antisense;
    CAGGCCAATCGGTGTGGCTTCAGCT
    >probe:HG-U133A:217588_at:458:621; Interrogation_Position = 343; Antisense;
    TCTCTCAAACTCCTTGCACAATTCC
    >probe:HG-U133A:217588_at:137:99; Interrogation_Position = 351; Antisense;
    ACTCCTTGCACAATTCCGTCAAATT
    >probe:HG-U133A:217588_at:188:591; Interrogation_Position = 383; Antisense;
    TTATTTTGGTCCTGGTCAGGGCCCT
    >probe:HG-U133A:217588_at:492:195; Interrogation_Position = 399; Antisense;
    CAGGGCCCTCAAGGTGATTTGACTG
    >probe:HG-U133A:217588_at:39:299; Interrogation_Position = 535; Antisense;
    GCCGAGGGGGATGGGTCATGAGGTC
    Probes sequences for 218155_x_at:
    >probe:HG-U133A:218155_x_at:480:137; Interrogation_Position = 4603; Antisense;
    AACCACAACATTGCCTGCATGTCTA
    >probe:HG-U133A:218155_x_at:56:173; Interrogation_Position = 4608; Antisense;
    CAACATTGCCTGCATGTCTAAAAGA
    >probe:HG-U133A:218155_x_at:452:297; Interrogation_Position = 4727; Antisense;
    GCCTGGCCAACCTGGTGAAATGCAG
    >probe:HG-U133A:218155_x_at:379:91; Interrogation_Position = 4736; Antisense;
    ACCTGGTGAAATGCAGTGTCAACTA
    >probe:HG-U133A:218155_x_at:392:157; Interrogation_Position = 4763; Antisense;
    AATAGGAAAATCAGCCAGGCGTGGT
    >probe:HG-U133A:218155_x_at:126:343; Interrogation_Position = 4832; Antisense;
    GAATCACTTGAACTTGGGAGGGGTA
    >probe:HG-U133A:218155_x_at:488:91; Interrogation_Position = 4842; Antisense;
    AACTTGGGAGGGGTAGGTTGCAGTG
    >probe:HG-U133A:218155_x_at:517:563; Interrogation_Position = 4846; Antisense;
    TGGGAGGGGTAGGTTGCAGTGAGCT
    >probe:HG-U133A:218155_x_at:482:309; Interrogation_Position = 4868; Antisense;
    GCTGAGATCACACCAAAATGGGGTG
    >probe:HG-U133A:218155_x_at:373:169; Interrogation_Position = 4881; Antisense;
    CAAAATGGGGTGGGGCGCAGTGGCT
    >probe:HG-U133A:218155_x_at:329:545; Interrogation_Position = 4943; Antisense;
    GGCGGATCACGAGGTAGGGAGATCA
    180ttestprobesetseq.txt
    Probes sequences for 208082_x_at:
    >probe:HG-U133A:208082_x_at:578:269; Interrogation_Position = 3103; Antisense;
    CCTTGGTTTAAGGCTGGGCGCGGTG
    >probe:HG-U133A:208082_x_at:391:305; Interrogation_Position = 3165; Antisense;
    GCGGCCGGATCACAAGGCCAGGAGA
    >probe:HG-U133A:208082_x_at:112:29; Interrogation_Position = 3189; Antisense;
    ATCGAGACCATCCTGTGAATGGCGA
    >probe:HG-U133A:208082_x_at:533:567; Interrogation_Position = 3202; Antisense;
    TGTGAATGGCGAAACCCTGTCTCTA
    >probe:HG-U133A:208082_x_at:378:411; Interrogation_Position = 3319; Antisense;
    GAGGCGGAGCTTGCAATGAACTGAG
    >probe:HG-U133A:208082_x_at:430:231; Interrogation_Position = 3339; Antisense;
    CTGAGAATGTGCCACTGGACTCCCA
    >probe:HG-U133A:208082_x_at:334:669; Interrogation_Position = 3494; Antisense;
    TATAGCAGTACTAGCTCTGTTTATA
    >probe:HG-U133A:208082_x_at:424:123; Interrogation_Position = 3552; Antisense;
    AAACTACCTGGTTATCCCATATATT
    >probe:HG-U133A:208082_x_at:379:153; Interrogation_Position = 3612; Antisense;
    AATGTTTTACGATCTAGCCTTTCCA
    >probe:HG-U133A:208082_x_at:592:291; Interrogation_Position = 3628; Antisense;
    GCCTTTCCAGTATAGGCACTTCCTG
    >probe:HG-U133A:208082_x_at:517:333; Interrogation_Position = 3643; Antisense;
    GCACTTCCTGAAAAACCTTTGTCCT
    Probes sequences for 214800_x_at:
    >probe:HG-U133A:214800_x_at:593:335; Interrogation_Position = 844; Antisense;
    GCAAACTGAATTGAGTCAACTTCTG
    >probe:HG-U133A:214800_x_at:342:73; Interrogation_Position = 883; Antisense;
    AGAAGTTACTGGGAGCTGCTATTTT
    >probe:HG-U133A:214800_x_at:221:57; Interrogation_Position = 959; Antisense;
    AGATCTCTAATATTTTTAAGCCCAA
    >probe:HG-U133A:214800_x_at:26:207; Interrogation_Position = 988; Antisense;
    CTTGGACACTGCAGCTCTTTTCAGT
    >probe:HG-U133A:214800_x_at:483:221; Interrogation_Position = 1002; Antisense;
    CTCTTTTCAGTTTTTGCTTATACAC
    >probe:HG-U133A:214800_x_at:614:111; Interrogation_Position = 1023; Antisense;
    ACACAATTCATTCTTTGCAGCTAAT
    >probe:HG-U133A:214800_x_at:70:689; Interrogation_Position = 1047; Antisense;
    TTAAGCCGAAGAAGCCTGGGAATCA
    >probe:HG-U133A:214800_x_at:390:435; Interrogation_Position = 1095; Antisense;
    GTTCTTTGCCTAGTATACAGTTTTA
    >probe:HG-U133A:214800_x_at:39:681; Interrogation_Position = 1124; Antisense;
    TTTATTTCATTGACACCGATCTGTA
    >probe:HG-U133A:214800_x_at:71:207; Interrogation_Position = 1131; Antisense;
    CATTGACACCGATCTGTACACAGTA
    >probe:HG-U133A:214800_x_at:545:161; Interrogation_Position = 1192; Antisense;
    AATATGGCTGATAACCTTTGGAATT
    Probes sequences for 215208_x_at:
    >probe:HG-U133A:215208_x_at:526:465; Interrogation_Position = 1958; Antisense;
    GTCCAGGCTAAGGTGCAATGGCACG
    >probe:HG-U133A:215208_x_at:583:471; Interrogation_Position = 1970; Antisense;
    GTGCAATGGCACGATGGCTCACGCC
    >probe:HG-U133A:215208_x_at:339:315; Interrogation_Position = 2138; Antisense;
    GCTACTGGGGAAGCTGAGGTTGCAG
    >probe:HG-U133A:215208_x_at:352:491; Interrogation_Position = 2195; Antisense;
    GGGCAACAGAGCAACACTTTGTCAA
    >probe:HG-U133A:215208_x_at:710:461; Interrogation_Position = 2234; Antisense;
    GTCTACAGCATAAACATCTTTAGGC
    >probe:HG-U133A:215208_x_at:635:125; Interrogation_Position = 2265; Antisense;
    AAAAACTAATTTTGGCCTGGTGCAG
    >probe:HG-U133A:215208_x_at:516:265; Interrogation_Position = 2280; Antisense;
    CCTGGTGCAGTGACTTGCGCTATAA
    >probe:HG-U133A:215208_x_at:258:471; Interrogation_Position = 2289; Antisense;
    GTGACTTGCGCTATAATCCCAGCAC
    >probe:HG-U133A:215208_x_at:81:69; Interrogation_Position = 2359; Antisense;
    AGACCAGAGTGACCAACACTGTGAA
    >probe:HG-U133A:215208_x_at:619:251; Interrogation_Position = 2440; Antisense;
    CCCACCTCACTCGGAGGCTGAGGCA
    >probe:HG-U133A:215208_x_at:66:585; Interrogation_Position = 2476; Antisense;
    TGAACCCGGGAAGCAGAGATTACAG
    Probes sequences for 218556_at:
    >probe:HG-U133A:218556_at:45:657; Interrogation_Position = 38; Antisense;
    TAGCCGGACGGGGATCTGAGCTGGC
    >probe:HG-U133A:218556_at:593:645; Interrogation_Position = 94; Antisense;
    TAAACCCCAACACCCGAGTGATGAA
    >probe:HG-U133A:218556_at:682:591; Interrogation_Position = 163; Antisense;
    TGCATATGGTTCTACTCAGCATCCC
    >probe:HG-U133A:218556_at:356:211; Interrogation_Position = 191; Antisense;
    CTTCAGCATTCCTGTTGTCTGGACC
    >probe:HG-U133A:218556_at:483:233; Interrogation_Position = 216; Antisense;
    CTGACCAACGTCATCCATAACCTGG
    >probe:HG-U133A:218556_at:147:645; Interrogation_Position = 233; Antisense;
    TAACCTGGCTACGTATGTCTTCCTT
    >probe:HG-U133A:218556_at:689:151; Interrogation_Position = 295; Antisense;
    AAGGAAAGGCTCGGCTACTGACACA
    >probe:HG-U133A:218556_at:478:553; Interrogation_Position = 331; Antisense;
    TGGACTATGGGCTCCAGTTTACCTC
    >probe:HG-U133A:218556_at:265:609; Interrogation_Position = 397; Antisense;
    TCCTGGCCAGCTTCTATACCAAGTA
    >probe:HG-U133A:218556_at:407:295; Interrogation_Position = 450; Antisense;
    GCCTCATTGCTAAGTGTACTGCTGC
    >probe:HG-U133A:218556_at:117:435; Interrogation_Position = 488; Antisense;
    GTTCCATGGGGTTCGTGTCTTTGGC
    Probes sequences for 207730_x_at:
    >probe:HG-U133A:207730_x_at:465:639; Interrogation_Position = 1307; Antisense;
    TCAAGGGGTTGCTCAGATGGGCCGG
    >probe:HG-U133A:207730_x_at:267:595; Interrogation_Position = 1316; Antisense;
    TGCTCAGATGGGCCGGGCATGGTGG
    >probe:HG-U133A:207730_x_at:701:455; Interrogation_Position = 1350; Antisense;
    GTAACCTCAGCACTGTGGGAGGCCA
    >probe:HG-U133A:207730_x_at:135:519; Interrogation_Position = 1367; Antisense;
    GGAGGCCAAGGGGGCAGATCACTTC
    >probe:HG-U133A:207730_x_at:115:61; Interrogation_Position = 1382; Antisense;
    AGATCACTTCAGGTCGGGAGTTCCA
    >probe:HG-U133A:207730_x_at:4:191; Interrogation_Position = 1405; Antisense;
    CAGACCAGCCTGTTCAACATGGCGA
    >probe:HG-U133A:207730_x_at:125:173; Interrogation_Position = 1419; Antisense;
    CAACATGGCGAAACCCCATTCTACC
    >probe:HG-U133A:207730_x_at:228:381; Interrogation_Position = 1495; Antisense;
    GACTGAGGAGGGGTCACCTGAGGTC
    >probe:HG-U133A:207730_x_at:78:517; Interrogation_Position = 1521; Antisense;
    GGATGTCAAGATCAGACTGGCCAAC
    >probe:HG-U133A:207730_x_at:585:287; Interrogation_Position = 1540; Antisense;
    GCCAACAGAATGAAACCCTGTCTCT
    >probe:HG-U133A:207730_x_at:227:69; Interrogation_Position = 1714; Antisense;
    AGAAAGTTAGCCGGGCGTGGCACCT
    Probes sequences for 210556_at:
    >probe:HG-U133A:210556_at:58:509; Interrogation_Position = 3142; Antisense;
    GGTAAGTTCATCTCTGATATGTTCT
    >probe:HG-U133A:210556_at:462:437; Interrogation_Position = 3147; Antisense;
    GTTCATCTCTGATATGTTCTTGAAG
    >probe:HG-U133A:210556_at:51:27: Interrogation_Position = 3151; Antisense;
    ATCTCTGATATGTTCTTGAAGTAGT
    >probe:HG-U133A:210556_at:385:427; Interrogation_Position = 3179; Antisense;
    GATTCAGGGACTTTATTCTCCCAAG
    >probe:HG-U133A:210556_at:462:627; Interrogation_Position = 3182; Antisense;
    TCAGGGACTTTATTCTCCCAAGTGT
    >probe:HG-U133A:210556_at:634:377; Interrogation_Position = 3187; Antisense;
    GACTTTATTCTCCCAAGTGTCATGA
    >probe:HG-U133A:210556_at:352:59; Interrogation_Position = 3215; Antisense;
    AGTTTCTATGGATTGCTTATTGGCA
    >probe:HG-U133A:210556_at:59:671; Interrogation_Position = 3232; Antisense;
    TATTGGCATATGGTTGGGCTTTTAA
    >probe:HG-U133A:210556_at:467:153; Interrogation_Position = 3283; Antisense;
    AATATATAACTTTGCCAGGTACCAC
    >probe:HG-U133A:210556_at:534:645; Interrogation_Position = 3289; Antisense;
    TAACTTTGCCAGGTACCACGGCTCA
    >probe:HG-U133A:210556_at:260:49; Interrogation_Position = 3503; Antisense;
    AGGTGGCAGTTGCAGGGAGCTAAGA
    Probes sequences for 203588_s_at:
    >probe:HG-U133A:203588_s_at:530:623; Interrogation_Position = 787; Antisense;
    TCTACCATTCAGCTGCCATTCATAA
    >probe:HG-U133A:203588_s_at:152:331; Interrogation_Position = 845; Antisense;
    GCAGCATCTCCAGTGACAAGTTTGA
    >probe:HG-U133A:203588_s_at:382:39; Interrogation_Position = 931; Antisense;
    ATGGGAATGTCGTTTGGCCTGGAGT
    >probe:HG-U133A:203588_s_at:223:521; Interrogation_Position = 951; Antisense;
    GGAGTCAGGCAAATGCTCTCTGGAG
    >probe:HG-U133A:203588_s_at:268:585; Interrogation_Position = 980; Antisense;
    TGAAACTTGCGAAATCCCTGGTGCC
    >probe:HG-U133A:203588_s_at:662:13; Interrogation_Position = 1031; Antisense;
    ATATCTCCACAGGACCTTCTTGGTT
    >probe:HG-U133A:203588_s_at:123:633; Interrogation_Position = 1059; Antisense;
    TCAGGGACTACTTCTGAACTCTACC
    >probe:HG-U133A:203588_s_at:251:163; Interrogation_Position = 1096; Antisense;
    AATTTAGACCTGACCACTGGTGCCA
    >probe:HG-U133A:203588_s_at:341:51; Interrogation_Position = 1167; Antisense;
    AGTGGCCTTAGCAACTGGGCAGTTC
    >probe:HG-U133A:203588_s_at:238:173; Interrogation_Position = 1199; Antisense;
    CAAACAGTCACCAGTCCAGCAGTGC
    >probe:HG-U133A:203588_s_at:667:1; Interrogation_Position = 1301; Antisense;
    ATTCCTCCTCCCCAGAATAAAGACA
    Probes sequences for 217679_x_at:
    >probe:HG-U133A:217679_x_at:425:333; Interrogation_Position = 36; Antisense;
    GCACTTTGGGAAGCTGAGGCGGTGG
    >probe:HG-U133A:217679_x_at:665:95; Interrogation_Position = 38; Antisense;
    ACTTTGGGAAGCTGAGGCGGTGGAT
    >probe:HG-U133A:217679_x_at:207:213; Interrogation_Position = 39; Antisense;
    CTTTGGGAAGCTGAGGCGGTGGATC
    >probe:HG-U133A:217679_x_at:378:707; Interrogation_Position = 41; Antisense;
    TTGGGAAGCTGAGGCGGTGGATCAC
    >probe:HG-U133A:217679_x_at:49:565; Interrogation_Position = 42; Antisense;
    TGGGAAGCTGAGGCGGTGGATCACC
    >probe:HG-U133A:217679_x_at:55:237; Interrogation_Position = 49; Antisense;
    CTGAGGCGGTGGATCACCTGAGCTC
    >probe:HG-U133A:217679_x_at:41:579; Interrogation_Position = 50; Antisense;
    TGAGGCGGTGGATCACCTGAGCTCA
    >probe:HG-U133A:217679_x_at:174:267; Interrogation_Position = 258; Antisense;
    CCTGGGCGACAAAGCAAGACTCTTT
    >probe:HG-U133A:217679_x_at:238:237; Interrogation_Position = 259; Antisense;
    CTGGGCGACAAAGCAAGACTCTTTC
    >probe:HG-U133A:217679_x_at:689:563; Interrogation_Position = 260; Antisense;
    TGGGCGACAAAGCAAGACTCTTTCT
    >probe:HG-U133A:217679_x_at:260:375; Interrogation_Position = 319; Antisense;
    GAGACAGAAAAGATGGGTTTTTGTT
    Probes sequences for 202901_x_at:
    >probe:HG-U133A:202901_x_at:2:429; Interrogation_Position = 1174; Antisense;
    GATTAATGTGTATTTACTGTACTAA
    >probe:HG-U133A:202901_x_at:226:159; Interrogation_Position = 1205; Antisense;
    AATATAGTTTGAGGCCGGGCACGGT
    >probe:HG-U133A:202901_x_at:674:667; Interrogation_Position = 1519; Antisense;
    TATAGTTTGATTCTTCATTTTTTTA
    >probe:HG-U133A:202901_x_at:686:155; Interrogation_Position = 1552; Antisense;
    AATCTCAGGATAAAGTTTGCTAAGT
    >probe:HG-U133A:202901_x_at:55:437; Interrogation_Position = 1616; Antisense;
    GTTCAACCTAAAACAATCTGTAATT
    >probe:HG-U133A:202901_x_at:395:27; Interrogation_Position = 1631; Antisense;
    ATCTGTAATTGCTTATTGTTTTATT
    >probe:HG-U133A:202901_x_at:315:681; Interrogation_Position = 1650; Antisense;
    TTTATTGTATACTCTTTGTCTTTTA
    >probe:HG-U133A:202901_x_at:31:569; Interrogation_Position = 1666; Antisense;
    TGTCTTTTAAGACCCCTAATAGCCT
    >probe:HG-U133A:202901_x_at:677:253; Interrogation_Position = 1679; Antisense;
    CCCTAATAGCCTTTTGTAACTTGAT
    >probe:HG-U133A:202901_x_at:661:293; Interrogation_Position = 1687; Antisense;
    GCCTTTTGTAACTTGATGGCTTAAA
    >probe:HG-U133A:202901_x_at:252:83; Interrogation_Position = 1709; Antisense;
    AAAAATACTTAATAAATCTGCCATT
    Probes sequences for 213939_s_at:
    >probe:HG-U133A:213939_s_at:367:457; Interrogation_Position = 1364; Antisense;
    GTCACGTCGCAAGTCCGATATCACA
    >probe:HG-U133A:213939_s_at:696:437; Interrogation_Position = 1418; Antisense;
    GTTACTACATTTAAGCTCTGATTCT
    >probe:HG-U133A:213939_s_at:129:137; Interrogation_Position = 1690; Antisense;
    AACCCAGTCCTTGATAATACCAGAT
    >probe:HG-U133A:213939_s_at:319:429; Interrogation_Position = 1712; Antisense;
    GATTCTCGGTGCATTGTTCATGTCT
    >probe:HG-U133A:213939_s_at:454:437; Interrogation_Position = 1727; Antisense;
    GTTCATGTCTGTATCTGCAAGCCTG
    >probe:HG-U133A:213939_s_at:46:621; Interrogation_Position = 1753; Antisense;
    TCTGCCATTGCAGTGTGATGAACCT
    >probe:HG-U133A:213939_s_at:554:133; Interrogation_Position = 1784; Antisense;
    AACTACCATGTGAACTTGAACTCTT
    >probe:HG-U133A:213939_s_at:488:617; Interrogation_Position = 1819; Antisense;
    TCTGTGTCTCTTAGACCATGACCCC
    >probe:HG-U133A:213939_s_at:308:1; Interrogation_Position = 1858; Antisense;
    ATATAGGTTTAATCCCCAAGCCAAG
    >probe:HG-U133A:213939_s_at:163:59; Interrogation_Position = 1893; Antisense;
    AGTATTTCACTACACTTCTATTCTC
    >probe:HG-U133A:213939_s_at:625:429; Interrogation_Position = 1927; Antisense;
    GATTGCTTGCAACTTTTTTTGCCTT
    Probes sequences for 208137_x_at:
    >probe:HG-U133A:208137_x_at:470:399; Interrogation_Position = 597; Antisense;
    GAGTTCAAGCCTTTATTGACATTCA
    >probe:HG-U133A:208137_x_at:273:145; Interrogation_Position = 634; Antisense;
    AAGAGGATGGGGCCAGGTGTGGTGG
    >probe:HG-U133A:208137_x_at:323:181; Interrogation_Position = 678; Antisense;
    CACGTCGGGAGGCCAAGGCACATAG
    >probe:HG-U133A:208137_x_at:628:535; Interrogation_Position = 694; Antisense;
    GGCACATAGGTCACTTGAGGTCAGG
    >probe:HG-U133A:208137_x_at:461:87; Interrogation_Position = 727; Antisense;
    ACCAGCATGGCCAACAGATGTGAGC
    >probe:HG-U133A:208137_x_at:429:63; Interrogation_Position = 742; Antisense;
    AGATGTGAGCCACTTTTCTCAGCCT
    >probe:HG-U133A:208137_x_at:653:695; Interrogation_Position = 757; Antisense;
    TTCTCAGCCTGTTTTTTGTTTCTTA
    >probe:HG-U133A:208137_x_at:76:423; Interrogation_Position = 880; Antisense;
    GATATGAGGCCAGGCACGGAGGCTC
    >probe:HG-U133A:208137_x_at:667:395; Interrogation_Position = 965; Antisense;
    GAGACTTTCCTGGCTAACAGGGTGA
    >probe:HG-U133A:208137_x_at:613:465; Interrogation_Position = 1048; Antisense;
    GTCCCAGCTACTCCAGATGCTGAGG
    >probe:HG-U133A:208137_x_at:75:695; Interrogation_Position = 1105; Antisense;
    TTGCAGGGAGTCGAGATCGCGCCAC
    Probes sequences for 214705_at:
    >probe:HG-U133A:214705_at:346:129; Interrogation_Position = 4945; Antisense;
    AACAACTAGCAAGGTTCTTTCACTT
    >probe:HG-U133A:214705_at:281:513; Interrogation_Position = 4957; Antisense;
    GGTTCTTTCACTTGGAATTACTAAG
    >probe:HG-U133A:214705_at:31:1; Interrogation_Position = 4970; Antisense;
    GGAATTACTAAGATCGGAGTTTTGC
    >probe:HG-U133A:214705_at:362:573; Interrogation_Position = 5001; Antisense;
    TGATTAAAGCAACCATACCCAAGAA
    >probe:HG-U133A:214705_at:342:19; Interrogation_Position = 5015; Antisense;
    ATACCCAAGAAATAGCTAGCATCAA
    >probe:HG-U133A:214705_at:389:345; Interrogation_Position = 5040; Antisense;
    GAATGAGATTTATCCAATGTTGGGT
    >probe:HG-U133A:214705_at:132:497; Interrogation_Position = 5061; Antisense;
    GGGTCAAGAACATTGCTTCGACATG
    >probe:HG-U133A:214705_at:520:359; Interrogation_Position = 5068; Antisense;
    GAACATTGCTTCGACATGGAAATTA
    >probe:HG-U133A:214705_at:400:531; Interrogation_Position = 5097; Antisense;
    GGAACATTGCTTTTCGTGATACTGT
    >probe:HG-U133A:214705_at:118:425; Interrogation_Position = 5114; Antisense;
    GATACTGTTAATTTCATACTATGTT
    >probe:HG-U133A:214705_at:547:637; Interrogation_Position = 5175; Antisense;
    TAATTCTTCACGATAGTAGTTTTCT
    Probes sequences for 209703_x_at:
    >probe:HG-U133A:209703_x_at:112:237; Interrogation_Position = 1768; Antisense;
    CTGTTGCAGAGAGGGGTCCTGGAGA
    >probe:HG-U133A:209703_x_at:113:395; Interrogation_Position = 1789; Antisense;
    GAGAAATGGGTTACCCCAGTTGTCT
    >probe:HG-U133A:209703_x_at:309:687; Interrogation_Position = 1799; Antisense;
    TTACCCCAGTTGTCTTATTTAAATG
    >probe:HG-U133A:209703_x_at:485:153; Interrogation_Position = 1820; Antisense;
    AATGGTTACCCATCAGATTTTAATT
    >probe:HG-U133A:209703_x_at:507:11; Interrogation_Position = 1836; Antisense;
    ATTTTAATTTTATCTTCTCTTTGAG
    >probe:HG-U133A:209703_x_at:132:351; Interrogation_Position = 1874; Antisense;
    GAAGCACTTAAATCACTCCAAAGAA
    >probe:HG-U133A:209703_x_at:347:75; Interrogation_Position = 1986; Antisense;
    AGCACAGATTTTTTCTTTCTGCTTT
    >probe:HG-U133A:209703_x_at:242:143; Interrogation_Position = 2029; Antisense;
    AAGAGACAGGGGGCTGGGCGTGGTG
    >probe:HG-U133A:209703_x_at:152:499; Interrogation_Position = 2094; Antisense;
    GGTGGATCACGAGGTAGGAGTTAAA
    >probe:HG-U133A:209703_x_at:476:51; Interrogation_Position = 2264; Antisense;
    AGTGAACCGAGATCATGCCATTGTA
    >probe:HG-U133A:209703_x_at:325:275; Interrogation_Position = 2281; Antisense;
    CCATTGTACTCTAGCCTGGGTGACA
    Probes sequences for 215001_s_at:
    >probe:HG-U133A:215001_s_at:698:545; Interrogation_Position = 4497; Antisense;
    GGCTGGTCAACTTGAGCATGTTACT
    >probe:HG-U133A:215001_s_at:518:17; Interrogation_Position = 4557; Antisense;
    ATAGCATGTCACTAAAGCAGGCCTT
    >probe:HG-U133A:215001_s_at:208:107; Interrogation_Position = 4661; Antisense;
    ACAGAATTGCTTGTTTGCTTCAACT
    >probe:HG-U133A:215001_s_at:125:107; Interrogation_Position = 4718; Antisense;
    ACAGGGCTGGAGTCAAAACACTTGT
    >probe:HG-U133A:215001_s_at:481:11; Interrogation_Position = 4844; Antisense;
    ATTTTCTGGTTCTTGTGTTGGCTGT
    >probe:HG-U133A:215001_s_at:19:437; Interrogation_Position = 4852; Antisense;
    GTTCTTGTGTTGGCTGTGGCAGGCC
    >probe:HG-U133A:215001_s_at:580:677; Interrogation_Position = 4886; Antisense;
    TTTCTTTTGCCATGACAACTTCTAA
    >probe:HG-U133A:215001_s_at:522:371; Interrogation_Position = 4899; Antisense;
    GACAACTTCTAATTGCCATGTACAG
    >probe:HG-U133A:215001_s_at:409:457; Interrogation_Position = 4934; Antisense;
    GTCAAATAACTCCTCATTGTAAACA
    >probe:HG-U133A:215001_s_at:111:113; Interrogation_Position = 4956; Antisense;
    ACAAACTGTGTAACTGCCCAAAGCA
    >probe:HG-U133A:215001_s_at:373:331; Interrogation_Position = 4978; Antisense;
    GCAGCACTTATAAATCAGCCTAACA
    Probes sequences for 218155_x_at:
    >probe:HG-U133A:218155_x_at:480:137; Interrogation_Position = 4603; Antisense;
    AACCACAACATTGCCTGCATGTCTA
    >probe:HG-U133A:218155_x_at:56:173; Interrogation_Position = 4608; Antisense;
    CAACATTGCCTGCATGTCTAAAAGA
    >probe:HG-U133A:218155_x_at:452:297; Interrogation_Position = 4727; Antisense;
    GCCTGGCCAACCTGGTGAAATGCAG
    >probe:HG-U133A:218155_x_at:379:91; Interrogation_Position = 4736; Antisense;
    ACCTGGTGAAATGCAGTGTCAACTA
    >probe:HG-U133A:218155_x_at:392:157; Interrogation_Position = 4763; Antisense;
    AATAGGAAAATCAGCCAGGCGTGGT
    >probe:HG-U133A:218155_x_at:126:343; Interrogation_Position = 4832; Antisense;
    GAATCACTTGAACTTGGGAGGGGTA
    >probe:HG-U133A:218155_x_at:488:91; Interrogation_Position = 4842; Antisense;
    AACTTGGGAGGGGTAGGTTGCAGTG
    >probe:HG-U133A:218155_x_at:517:563; Interrogation_Position = 4846; Antisense;
    TGGGAGGGGTAGGTTGCAGTGAGCT
    >probe:HG-U133A:218155_x_at:482:309; Interrogation_Position = 4868; Antisense;
    GCTGAGATCACACCAAAATGGGGTG
    >probe:HG-U133A:218155_x_at:373:169; Interrogation_Position = 4881; Antisense;
    CAAAATGGGGTGGGGCGCAGTGGCT
    >probe:HG-U133A:218155_x_at:329:545; Interrogation_Position = 4943; Antisense;
    GGCGGATCACGAGGTAGGGAGATCA
    Probes sequences for 215604_x_at:
    >probe:HG-U133A:215604_x_at:453:357; Interrogation_Position = 2315; Antisense;
    GAACCCAGGAAGCGGAAGTTGCAGT
    >probe:HG-U133A:215604_x_at:518:373; Interrogation_Position = 2375; Antisense;
    GACAGAGGGAGACTCCGTCTCCAAA
    >probe:HG-U133A:215604_x_at:484:89; Interrogation_Position = 2406; Antisense;
    AAAGTTCAGGCCGGGCACGGTAGCT
    >probe:HG-U133A:215604_x_at:393:545; Interrogation_Position = 2458; Antisense;
    GGCTGATGCAGGTGGATCACCGGAG
    >probe:HG-U133A:215604_x_at:171:181; Interrogation_Position = 2475; Antisense;
    CACCGGAGGTCAGGAACTAGAAATC
    >probe:HG-U133A:215604_x_at:242:531; Interrogation_Position = 2487; Antisense;
    GGAACTAGAAATCAGCCTGGCCAAC
    >probe:HG-U133A:215604_x_at:330:687; Interrogation_Position = 2547; Antisense;
    TTAGACAGGTGTGGTGGTGCGTGCC
    >probe:HG-U133A:215604_x_at:130:351; Interrogation_Position = 2597; Antisense;
    GAAGCAGAAAAATCACTTGAACCCA
    >probe:HG-U133A:215604_x_at:329:581; Interrogation_Position = 2644; Antisense;
    TGAGATCGTGCCATTGCATTCCAGC
    >probe:HG-U133A:215604_x_at:391:245; Interrogation_Position = 2674; Antisense;
    CGACAAGACTGAAACTCTATCCCCC
    >probe:HG-U133A:215604_x_at:17:645; Interrogation_Position = 2797; Antisense;
    TAAGAAACTATATTACCAGAGCAAG
    Probes sequences for 212297_at:
    >probe:HG-U133A:212297_at:420:435; Interrogation_Position = 1883; Antisense;
    GTTCCCCATGTTTATGAAAGTCCTG
    >probe:HG-U133A:212297_at:329:113; Interrogation_Position = 2014; Antisense;
    AAATATTCATGCATGCAATTTTGAC
    >probe:HG-U133A:212297_at:23:565; Interrogation_Position = 2058; Antisense;
    TGTATATTTATGGTGGGAGGTGGTT
    >probe:HG-U133A:212297_at:25:163; Interrogation_Position = 2109; Antisense;
    AATTTTTGTACAGTCTGTGGGCATT
    >probe:HG-U133A:212297_at:127:465; Interrogation_Position = 2121; Antisense;
    GTCTGTGGGCATTTACACATTTTTA
    >probe:HG-U133A:212297_at:391:147; Interrogation_Position = 2188; Antisense;
    AAGTTACTTCTAGTTATGATTTGTG
    >probe:HG-U133A:212297_at:517:423; Interrogation_Position = 2205; Antisense;
    GATTTGTGAATTCCCTAAGACCTTG
    >probe:HG-U133A:212297_at:619:155; Interrogation_Position = 2257; Antisense;
    AATGATACTGCATCTTTATATTTTT
    >probe:HG-U133A:212297_at:516:113; Interrogation_Position = 2283; Antisense;
    AAATTGTATTGCTGCTCAAGAATGG
    >probe:HG-U133A:212297_at:38:19; Interrogation_Position = 2301; Antisense;
    AGAATGGTACCCTCTTGTCAAAAAG
    >probe:HG-U133A:212297_at:199:205; Interrogation_Position = 2331; Antisense;
    CATTCATAATTGTACATTCAGCATT
    Probes sequences for 201804_x_at:
    >probe:HG-U133A:201804_x_at:238:233; Interrogation_Position = 450; Antisense;
    CTGAAGCGCAGAAAGCTCGGCCGGT
    >probe:HG-U133A:201804_x_at:675:489; Interrogation_Position = 605; Antisense;
    GGGCACCGTCATGTATGTAGGTCTC
    >probe:HG-U133A:201804_x_at:533:507; Interrogation_Position = 624; Antisense;
    GGTCTCACAGATTTCAAGCCTGGCT
    >probe:HG-U133A:201804_x_at:276:295; Interrogation_Position = 641; Antisense;
    GCCTGGCTACTGGATTGGTGTCCGC
    >probe:HG-U133A:201804_x_at:302:481; Interrogation_Position = 658; Antisense;
    GTGTCCGCTATGATGAGCCACTGGG
    >probe:HG-U133A:201804_x_at:363:157; Interrogation_Position = 721; Antisense;
    AATGCCAGGCCAAGTATGGCGCCTT
    >probe:HG-U133A:201804_x_at:671:39; Interrogation_Position = 736; Antisense;
    ATGGCGCCTTTGTCAAGCCAGCAGT
    >probe:HG-U133A:201804_x_at:4:289; Interrogation_Position = 752; Antisense;
    GCCAGCAGTCGTGACGGTGGGGGAC
    >probe:HG-U133A:201804_x_at:375:585; Interrogation_Position = 810; Antisense;
    TGACACCTAAGGAATTCCCCTGCTT
    >probe:HG-U133A:201804_x_at:22:621; Interrogation_Position = 888; Antisense;
    TCTCCTGACCGCATTTTAATTTTAT
    >probe:HG-U133A:201804_x_at:679:679; Interrogation_Position = 908; Antisense;
    TTTATTCATTTTTTCCTTTGCCATT
    Probes sequences for 217313_at:
    >probe:HG-U133A:217313_at:31:587; Interrogation_Position = 14; Antisense;
    TGAAGGCCTCGGGTGCACTAAAAGA
    >probe:HG-U133A:217313_at:207:547; Interrogation_Position = 18; Antisense;
    GGCCTCGGGTGCACTAAAAGAGTAC
    >probe:HG-U133A:217313_at:388:417; Interrogation_Position = 177; Antisense;
    GATGACTTCAGGCTGGATTGTCTAC
    >probe:HG-U133A:217313_at:249:377; Interrogation_Position = 180; Antisense;
    GACTTCAGGCTGGATTGTCTACTGT
    >probe:HG-U133A:217313_at:455:545; Interrogation_Position = 187; Antisense;
    GGCTGGATTGTCTACTGTGGACAGG
    >probe:HG-U133A:217313_at:360:559; Interrogation_Position = 190; Antisense;
    TGGATTGTCTACTGTGGACAGGTGT
    >probe:HG-U133A:217313_at:493:347; Interrogation_Position = 206; Antisense;
    GACAGGTGTTTGAGAAGGTCCCCCT
    >probe:HG-U133A:217313_at:425:19; Interrogation_Position = 291; Antisense;
    ATACAGGGAATACTGGGACCTGACC
    >probe:HG-U133A:217313_at:557:301; Interrogation_Position = 340; Antisense;
    GCCCACTGCATCCAGATCATGAAGG
    >probe:HG-U133A:217313_at:236:485; Interrogation_Position = 364; Antisense;
    GTGGAGGAGATCGCAGCCAACAAGT
    >probe:HG-U133A:217313_at:343:47; Interrogation_Position = 368; Antisense;
    AGGAGATCGCAGCCAACAAGTGCCG
    Probes sequences for 217949_s_at:
    >probe:HG-U133A:217949_s_at:236:317; Interrogation_Position = 332; Antisense;
    GCTATTGTTAGGTTGCCTGCGGACA
    >probe:HG-U133A:217949_s_at:226:627; Interrogation_Position = 433; Antisense;
    TCTTCGTGCTCTATGATTTCTGCAT
    >probe:HG-U133A:217949_s_at:659:617; Interrogation_Position = 451; Antisense;
    TCTGCATTGTTTGTATCACCACCTA
    >probe:HG-U133A:217949_s_at:35:87; Interrogation_Position = 468; Antisense;
    ACCACCTATGCTATCAACGTGAGCC
    >probe:HG-U133A:217949_s_at:556:485; Interrogation_Position = 497; Antisense;
    GTGGCTCAGTTTCCGGAAGGTCCAA
    >probe:HG-U133A:217949_s_at:241:543; Interrogation_Position = 536; Antisense;
    GGCTAAGAGGCACTGAGCCCTCAAC
    >probe:HG-U133A:217949_s_at:309:325; Interrogation_Position = 594; Antisense;
    GCATGTGAGCCTTGCCTAAGGGGGC
    >probe:HG-U133A:217949_s_at:133:247; Interrogation_Position = 607; Antisense;
    GCCTAAGGGGGCATATCTGGGTCCC
    >probe:HG-U133A:217949_s_at:507:615; Interrogation_Position = 622; Antisense;
    TCTGGGTCCCTAGAAGGCCCTAGAT
    >probe:HG-U133A:217949_s_at:255:411; Interrogation_Position = 644; Antisense;
    GATGTGGGGCTTCTAGATTACCCCC
    >probe:HG-U133A:217949_s_at:651:229; Interrogation_Position = 674; Antisense;
    CTGCCATACCCGCACATGACAATGG
    Probes sequences for 215179_x_at:
    >probe:HG-U133A:215179_x_at:330:387; Interrogation_Position = 2996; Antisense;
    GACCAACACGGTGAAACCACATCTC
    >probe:HG-U133A:215179_x_at:310:9; Interrogation_Position = 3036; Antisense;
    ATTAGCTGGGCGTGTTGGTGCATGC
    >probe:HG-U133A:215179_x_at:147:137; Interrogation_Position = 3196; Antisense;
    AAGAATTCCAAAAGCCCATGTGCTC
    >probe:HG-U133A:215179_x_at:267:255; Interrogation_Position = 3220; Antisense;
    CCCTTGCCCATTCTTAAGTTGACAT
    >probe:HG-U133A:215179_x_at:312:57; Interrogation_Position = 3268; Antisense;
    AGTCAAGCCAGGCTCAGTACCTCAT
    >probe:HG-U133A:215179_x_at:13:59; Interrogation_Position = 3283; Antisense;
    AGTACCTCATGCCTGTAATTCTAGC
    >probe:HG-U133A:215179_x_at:383:247; Interrogation_Position = 3325; Antisense;
    CGAGTGGATCGCTTGAGGCCAGGAG
    >probe:HG-U133A:215179_x_at:545:263; Interrogation_Position = 3343; Antisense;
    CCAGGAGTTCAAGCCTAGCCGGGGC
    >probe:HG-U133A:215179_x_at:218:297; Interrogation_Position = 3360; Antisense;
    GCCGGGGCAACATGGCGAAACCTCG
    >probe:HG-U133A:215179_x_at:309:99; Interrogation_Position = 3407; Antisense;
    ACTAGCTGGACGTGGTGCCATGTGC
    >probe:HG-U133A:215179_x_at:69:483; Interrogation_Position = 3461; Antisense;
    GTGGAATGATCACCTGAGCCTGGGA
    Probes sequences for 217588_at:
    >probe:HG-U133A:217588_at:114:245; Interrogation_Position = 116; Antisense;
    CGTCGACGTAAGCACTTCTCAGAGA
    >probe:HG-U133A:217588_at:171:561; Interrogation_Position = 208; Antisense;
    TGGGTGCCTTGGGGTGTATTTTACT
    >probe:HG-U133A:217588_at:36:139; Interrogation_Position = 236; Antisense;
    AACCTGTGTGTTTTCCACAGTCCCT
    >probe:HG-U133A:217588_at:445:231; Interrogation_Position = 259; Antisense;
    CTGCTTCCCGAGGTTGTGGTATTGG
    >probe:HG-U133A:217588_at:133:479; Interrogation_Position = 282; Antisense;
    GGTAGGGGTAACAGGCCAATCGGTG
    >probe:HG-U133A:217588_at:144:199; Interrogation_Position = 293; Antisense;
    CAGGCCAATCGGTGTGGCTTCAGCT
    >probe:HG-U133A:217588_at:458:621; Interrogation_Position = 343; Antisense;
    TCTCTCAAACTCCTTGCACAATTCC
    >probe:HG-U133A:217588_at:137:99; Interrogation_Position = 351; Antisense;
    ACTCCTTGCACAATTCCGTCAAATT
    >probe:HG-U133A:217588_at:188:591; Interrogation_Position = 383; Antisense;
    TTATTTTGGTCCTGGTCAGGGCCCT
    >probe:HG-U133A:217588_at:492:195; Interrogation_Position = 399; Antisense;
    CAGGGCCCTCAAGGTGATTTGACTG
    >probe:HG-U133A:217588_at:39:299; Interrogation_Position = 535; Antisense;
    GCCGAGGGGGATGGGTCATGAGGTC
    Probes sequences for 214153_at:
    >probe:HG-U133A:214153_at:128:707; Interrogation_Position = 851; Antisense;
    TTGTTGCCAGTGCCTGGTAGCCATA
    >probe:HG-U133A:214153_at:434:451; Interrogation_Position = 867; Antisense;
    GTAGCCATAAGGCCTGCTGATCTGG
    >probe:HG-U133A:214153_at:135:147; Interrogation_Position = 981; Antisense;
    AAGTCATATACTTCTGGGCTTCATA
    >probe:HG-U133A:214153_at:646:487; Interrogation_Position = 996; Antisense;
    GGGCTTCATAGTTTATGGCTGCACT
    >probe:HG-U133A:214153_at:526:41; Interrogation_Position = 1010; Antisense;
    ATGGCTGCACTGTGGGACATACACC
    >probe:HG-U133A:214153_at:296:491; Interrogation_Position = 1023; Antisense;
    GGGACATACACCAGCTACAAGGAAG
    >probe:HG-U133A:214153_at:671:507; Interrogation_Position = 1054; Antisense;
    GGATAGGACCCGATGAAGCTGAAAT
    >probe:HG-U133A:214153_at:204:211; Interrogation_Position = 1096; Antisense;
    CTTATAGCAGCATAGCACTTTGTTC
    >probe:HG-U133A:214153_at:384:367; Interrogation_Position = 1208; Antisense;
    GAAAGACCAATTTGGCCACATAACC
    >probe:HG-U133A:214153_at:489:267; Interrogation_Position = 1233; Antisense;
    CCTGTCATCTGCCATCCAGATACTA
    >probe:HG-U133A:214153_at:350:297; Interrogation_Position = 1321; Antisense;
    GCCTGTGATTCAGTTAACTTACCTC
    Probes sequences for 222155_s_at:
    >probe:HG-U133A:222155_s_at:133:273; Interrogation_Position = 1260; Antisense;
    CCTAAGGCCTATCAGCTTCTATCAG
    >probe:HG-U133A:222155_s_at:347:153; Interrogation_Position = 1263; Antisense;
    AAGGCCTATCAGCTTCTATCAGCCC
    >probe:HG-U133A:222155_s_at:153:185; Interrogation_Position = 1319; Antisense;
    CACCAACGCGCTGACCAATGGCGTG
    >probe:HG-U133A:222155_s_at:666:85; Interrogation_Position = 1320; Antisense;
    ACCAACGCGCTGACCAATGGCGTGC
    >probe:HG-U133A:222155_s_at:318:303; Interrogation_Position = 1615; Antisense;
    GCGTGTTCTCCTACGTGAAGGTGGC
    >probe:HG-U133A:222155_s_at:158:479; Interrogation_Position = 1617; Antisense;
    GTGTTCTCCTACGTGAAGGTGGCAG
    >probe:HG-U133A:222155_s_at:404:433; Interrogation_Position = 1619; Antisense;
    GTTCTCCTACGTGAAGGTGGCAGCC
    >probe:HG-U133A:222155_s_at:104:227; Interrogation_Position = 1622; Antisense;
    CTCCTACGTGAAGGTGGCAGCCAGC
    >probe:HG-U133A:222155_s_at:158:217; Interrogation_Position = 1625; Antisense;
    CTACGTGAAGGTGGCAGCCAGCTCC
    >probe:HG-U133A:222155_s_at:494:203; Interrogation_Position = 1748; Antisense;
    CATCTATCACGTGTTCCACAGCAGA
    >probe:HG-U133A:222155_s_at:190:625; Interrogation_Position = 1750; Antisense;
    TCTATCACGTGTTCCACAGCAGAAA
    Probes sequences for 211316_x_at:
    >probe:HG-U133A:211316_x_at:71:545; Interrogation_Position = 1491; Antisense;
    GGCTGTGCACAGTTCACCGAGAAGC
    >probe:HG-U133A:211316_x_at:562:625; Interrogation_Position = 1521; Antisense;
    TCTTCTGGAGCCTGTGTACTGCGGA
    >probe:HG-U133A:211316_x_at:702:595; Interrogation_Position = 1554; Antisense;
    TGCTGGAGCAGTCTCACAGCTCACC
    >probe:HG-U133A:211316_x_at:593:179; Interrogation_Position = 1632; Antisense;
    CACTCCTGGATCTTCACATTGAACT
    >probe:HG-U133A:211316_x_at:233:665; Interrogation_Position = 1709; Antisense;
    TATGTTTGGCTGCAGCACACTCTGA
    >probe:HG-U133A:211316_x_at:664:361; Interrogation_Position = 1734; Antisense;
    GAAAGAAACTTATCCTCTCCTACAC
    >probe:HG-U133A:211316_x_at:159:551; Interrogation_Position = 1864; Antisense;
    GGCCAACATGGTAAACGCTGTCCCT
    >probe:HG-U133A:211316_x_at:518:483; Interrogation_Position = 1915; Antisense;
    GTGGGTGTGGGTACCTGTATTCCCA
    >probe:HG-U133A:211316_x_at:24:519; Interrogation_Position = 1961; Antisense;
    GGAGGATCTTTTGAACCCAGGAGTT
    >probe:HG-U133A:211316_x_at:221:325; Interrogation_Position = 1996; Antisense;
    GCATGCTGTGATTGTGCCTACGAAT
    >probe:HG-U133A:211316_x_at:415:293; Interrogation_Position = 2011; Antisense;
    GCCTACGAATAGCCACTGCATACCA
    Probes sequences for 217653_x_at:
    >probe:HG-U133A:217653_x_at:5:101; Interrogation_Position = 35; Antisense;
    ACTCATTGAGAGTTGGGACTATTTC
    >probe:HG-U133A:217653_x_at:147:401; Interrogation_Position = 44; Antisense;
    GAGTTGGGACTATTTCACACATACA
    >probe:HG-U133A:217653_x_at:264:265; Interrogation_Position = 73; Antisense;
    CCTGGCATGTAGAAGGGACTTAATG
    >probe:HG-U133A:217653_x_at:37:501; Interrogation_Position = 107; Antisense;
    GGGGAGGCATTTTAAAATCCACATC
    >probe:HG-U133A:217653_x_at:39:127; Interrogation_Position = 135; Antisense;
    AAAATGTTGTTCTGTTTGGGAGTGG
    >probe:HG-U133A:217653_x_at:431:521; Interrogation_Position = 221; Antisense;
    GGAGTTCGAGATCAACCTGAGCAAC
    >probe:HG-U133A:217653_x_at:197:61; Interrogation_Position = 229; Antisense;
    AGATCAACCTGAGCAACATGGTGAA
    >probe:HG-U133A:217653_x_at:206:649; Interrogation_Position = 273; Antisense;
    TACAAACATTAGCTGAGCATGGGGG
    >probe:HG-U133A:217653_x_at:520:315; Interrogation_Position = 316; Antisense;
    GCTACTTGGGAGGCTTAGGCACTTG
    >probe:HG-U133A:217653_x_at:578:41; Interrogation_Position = 326; Antisense;
    AGGCTTAGGCACTTGAATGAGAATC
    >probe:HG-U133A:217653_x_at:46:587; Interrogation_Position = 339; Antisense;
    TGAATGAGAATCACTTGGACCCAGG
    Probes sequences for 266_s_at:
    >probe:HG-U133A:266_s_at:638:691; Interrogation_Position = 1612; Antisense;
    TTCACAAACTTTTATACTCTTTCTG
    >probe:HG-U133A:266_s_at:595:465; Interrogation_Position = 1737; Antisense;
    GTCCTAAGCCTAAAAGTGGGCTTGA
    >probe:HG-U133A:266_s_at:346:53; Interrogation_Position = 1751; Antisense;
    AGTGGGCTTGATTCTGCAGTAAATC
    >probe:HG-U133A:266_s_at:376:191; Interrogation_Position = 1767; Antisense;
    CAGTAAATCTTTTACAACTGCCTCG
    >probe:HG-U133A:266_s_at:547:291; Interrogation_Position = 1786; Antisense;
    GCCTCGACACACATAAACCTTTTTA
    >probe:HG-U133A:266_s_at:617:109; Interrogation_Position = 1851; Antisense;
    ACACTGATGCTTAGATGTTCCAGTA
    >probe:HG-U133A:266_s_at:315:457; Interrogation_Position = 1873; Antisense;
    GTAATCTAATATGGCCACAGTAGTC
    >probe:HG-U133A:266_s_at:164:175; Interrogation_Position = 1888; Antisense;
    CACAGTAGTCTTGATGACCAAAGTC
    >probe:HG-U133A:266_s_at:236:675; Interrogation_Position = 1918; Antisense;
    TTTCCATCTTTAGAAAACTACATGG
    >probe:HG-U133A:266_s_at:413:529; Interrogation_Position = 1942; Antisense;
    GGAACAAACAGATCGAACAGTTTTG
    >probe:HG-U133A:266_s_at:220:217; Interrogation_Position = 1970; Antisense;
    CTACTGTGTGTGTGAATGAACACTC
    >probe:HG-U133A:266_s_at:32:131; Interrogation_Position = 1988; Antisense;
    AACACTCTTGCTTTATTCCAGAATG
    >probe:HG-U133A:266_s_at:329:671; Interrogation_Position = 2001; Antisense;
    TATTCCAGAATGCTGTACATCTATT
    >probe:HG-U133A:266_s_at:382:595; Interrogation_Position = 2011; Antisense;
    TGCTGTACATCTATTTTGGATTGTA
    >probe:HG-U133A:266_s_at:495:679; Interrogation_Position = 2051; Antisense;
    TTTACGCTTTGATTCATAGTAACTT
    >probe:HG-U133A:266_s_at:35:203; Interrogation_Position = 2065; Antisense;
    CATAGTAACTTCTTATGGAATTGAT
    Probes sequences for 204718_at:
    >probe:HG-U133A:204718_at:181:177; Interrogation_Position = 3453; Antisense;
    CACTTGGCAGAAGGACCGTGCccGG
    >probe:HG-U133A:204718_at:68:147; Interrogation_Position = 3514; Antisense;
    AAGATGATCCGCAAGCCAGATACCC
    >probe:HG-U133A:204718_at:468:139; Interrogation_Position = 3526; Antisense;
    AAGCCAGATACCCTGCAGGCTGGCG
    >probe:HG-U133A:204718_at:73:543; Interrogation_Position = 3632; Antisense;
    GGCTTTCAGCCATTGGACTGGAGTG
    >probe:HG-U133A:204718_at:210:529; Interrogation_Position = 3663; Antisense;
    GGACAACTTCTCCAAGTTTGGCCTC
    >probe:HG-U133A:204718_at:450:209; Interrogation_Position = 3693; Antisense;
    CTTCAGTGATGTGGCTCAGCTCAGC
    >probe:HG-U133A:204718_at:223:221; Interrogation_Position = 3707; Antisense;
    CTCAGCTCAGCCTAGAAGACCTGCC
    >probe:HG-U133A:204718_at:370:605; Interrogation_Position = 3788; Antisense;
    TCCTTCAGCAACACCTGAGGCAGCA
    >probe:HG-U133A:204718_at:600:343; Interrogation_Position = 3832; Antisense;
    GAATGACGATACCCGTGACTCAGCC
    >probe:HG-U133A:204718_at:329:149; Interrogation_Position = 3874; Antisense;
    AAGGGACATGTGGGACGTGAGCCGG
    >probe:HG-U133A:204718_at:361:131; Interrogation_Position = 3904; Antisense;
    AACAGCCTCTGTGAGAGATGCCCCA
    Probes sequences for 203704_s_at:
    >probe:HG-U133A:203704_s_at:621:417; Interrogation_Position = 8112; Antisense;
    GATGCTGAGATTCAATCACTACATG
    >probe:HG-U133A:203704_s_at:678:635; Interrogation_Position = 8127; Antisense;
    TCACTACATGAAACACCTGGCTGTG
    >probe:HG-U133A:203704_s_at:628:111; Interrogation_Position = 8160; Antisense;
    ACAACCCAGAGGGCTGTGTTCCAAG
    >probe:HG-U133A:203704_s_at:621:169; Interrogation_Position = 8181; Antisense;
    CAAGCAGCGCTGGGGAAGCTACGTA
    >probe:HG-U133A:203704_s_at:428:353; Interrogation_Position = 8195; Antisense;
    GAAGCTACGTAACAGTCGGATGCCA
    >probe:HG-U133A:203704_s_at:3:455; Interrogation_Position = 8203; Antisense;
    GTAACAGTCGGATGCCAGTTTTGGA
    >probe:HG-U133A:203704_s_at:690:553; Interrogation_Position = 8224; Antisense;
    TGGAAGATTCACCATGCGTTCTGAC
    >probe:HG-U133A:203704_s_at:340:577; Interrogation_Position = 8293; Antisense;
    TGATCCTAGTGATTTCAGCCCATGC
    >probe:HG-U133A:203704_s_at:19:127; Interrogation_Position = 8374; Antisense;
    AAAAACTGCTGCTACATGTTATGTA
    >probe:HG-U133A:203704_s_at:297:121; Interrogation_Position = 8401; Antisense;
    AAACTGGTTTATGCCACATGAACAG
    >probe:HG-U133A:203704_s_at:317:435; Interrogation_Position = 8456; Antisense;
    GTTCCTCTTTGTATTCAGTTGTATA
    Probes sequences for 220934_s_at:
    >probe:HG-U133A:220934_s_at:253:587; Interrogation_Position = 146; Antisense;
    TGAAGCAGGTATCTTGCATGGCCCA
    >probe:HG-U133A:220934_s_at:251:601; Interrogation_Position = 186; Antisense;
    TGCCATGCTACCTCTGAAAGTCAAA
    >probe:HG-U133A:220934_s_at:48:149; Interrogation_Position = 203; Antisense;
    AAGTCAAAGGCCGACGCTTCTATTT
    >probe:HG-U133A:220934_s_at:202:91; Interrogation_Position = 216; Antisense;
    ACGCTTCTATTTCCTCTTGGACAAA
    >probe:HG-U133A:220934_s_at:328:171; Interrogation_Position = 237; Antisense;
    CAAAACTGGACACTTCCCTAACACA
    >probe:HG-U133A:220934_s_at:203:583; Interrogation_Position = 270; Antisense;
    TGACAATACTGTGGGTGCCTACCGG
    >probe:HG-U133A:220934_s_at:213:469; Interrogation_Position = 284; Antisense;
    GTGCCTACCGGAGCTTGTGAAGAAA
    >probe:HG-U133A:220934_s_at:344:117; Interrogation_Position = 306; Antisense;
    AAATGACCTCAAGTCACTCACCTCT
    >probe:HG-U133A:220934_s_at:93:459; Interrogation_Position = 318; Antisense;
    GTCACTCACCTCTCCAAGAGGAGGA
    >probe:HG-U133A:220934_s_at:391:477; Interrogation_Position = 370; Antisense;
    GTGTTGGTTCACACCTGTTGTAATC
    >probe:HG-U133A:220934_s_at:284:647; Interrogation_Position = 560; Antisense;
    TAAAGGAGCAGTCATCCCTAGGGCA
    Probes sequences_for 211916_s_at:
    >probe:HG-U133A:211916_s_at:92:117; Interrogation_Position = 2550; Antisense;
    AAAGCTCTGTGCCAGTGAACTGTTC
    >probe:HG-U133A:211916_s_at:644:49; Interrogation_Position = 2576; Antisense;
    AGGGCAAGAAGGCTTCATATCCCCA
    >probe:HG-U133A:211916_s_at:400:611; Interrogation_Position = 2606; Antisense;
    TCCCCATTCCATTCTGTGGTGACTA
    >probe:HG-U133A:211916_s_at:544:483; Interrogation_Position = 2621; Antisense;
    GTGGTGACTACATTGGGCTGCAAGG
    >probe:HG-U133A:211916_s_at:107:339: Interrogation_Position = 2640; Antisense;
    GCAAGGGAACCCCAAGCTGCAGAAG
    >probe:HG-U133A:211916_s_at:303:41; Interrogation_Position = 2729; Antisense;
    ATGGCAAGACTTCTTCTCGGATTCT
    >probe:HG-U133A:211916_s_at:668:607; Interrogation_Position = 2753; Antisense;
    TCCTCCTGACCAAGGGCCATGTGAT
    >probe:HG-U133A:211916_s_at:369:349; Interrogation_Position = 2793; Antisense;
    GAAGTCCCAGGCCAAAATTGTCATT
    >probe:HG-U133A:211916_s_at:242:415; Interrogation_Position = 2860; Antisense;
    GATGGGCTCTTTAGCTTGCATCTGA
    >probe:HG-U133A:211916_s_at:122:383; Interrogation_Position = 2952; Antisense;
    GACCAAAATGTACCGGGCTGTGCTG
    >probe:HG-U133A:211916_s_at:153:181; Interrogation_Position = 2982; Antisense;
    CACGCAGAGGCAGCTTACAGTCACC
    Probes sequences for 206929_s_at:
    >probe:HG-U133A:206929_s_at:38:285; Interrogation_Position = 953; Antisense;
    CCACGAGTAGCAGCCGCAACTGGAC
    >probe:HG-U133A:206929_s_at:269:451; Interrogation_Position = 959; Antisense;
    GTAGCAGCCGCAACTGGACGGAGGA
    >probe:HG-U133A:206929_s_at:666:331; Interrogation_Position = 968; Antisense;
    GCAACTGGACGGAGGACATGGAAGG
    >probe:HG-U133A:206929_s_at:465:105; Interrogation_Position = 983; Antisense;
    ACATGGAAGGAGGCATCTCGTCCCC
    >probe:HG-U133A:206929_s_at:88:325; Interrogation_Position = 995; Antisense;
    GCATCTCGTCCCCGGTGAAGAAGAC
    >probe:HG-U133A:206929_s_at:62:623; Interrogation_Position = 998; Antisense;
    TCTCGTCCCCGGTGAAGAAGACAGA
    >probe:HG-U133A:206929_s_at:256:555; Interrogation_Position = 1025; Antisense;
    TGGACAAGTCACCATTCAACAGCCC
    >probe:HG-U133A:206929_s_at:646:465; Interrogation_Position = 1305; Antisense;
    GTCCTGGTATCTGGGATAGCAAAGG
    >probe:HG-U133A:206929_s_at:323:239; Interrogation_Position = 1315; Antisense;
    CTGGGATAGCAAAGGTCTTCTTCCC
    >probe:HG-U133A:206929_s_at:166:661; Interrogation_Position = 1416; Antisense;
    TAGAGTGAACAAGAACACCCCTGCC
    >probe:HG-U133A:206929_s_at:317:125; Interrogation_Position = 1466; Antisense;
    AAACACATAGACGCACACACTCAGG
    Probes sequences for 215032_at:
    >probe:HG-U133A:215032_at:282:695; Interrogation_Position = 1833; Antisense;
    TTCTTATCTCTTGGCATGTTTGGGA
    >probe:HG-U133A:215032_at:262:569; Interrogation_Position = 1903; Antisense;
    TGTCCAAGGTCACGCAATCTGAACT
    >probe:HG-U133A:215032_at:222:253; Interrogation_Position = 2004; Antisense;
    CCCAGGGACACAGAGTTTGTGATCA
    >probe:HG-U133A:215032_at:324:97; Interrogation_Position = 2031; Antisense;
    ACTGAATAATTCTGCCACGTCTTTA
    >probe:HG-U133A:215032_at:312:697; Interrogation_Position = 2040; Antisense;
    TTCTGCCACGTCTTTATTGTCATGC
    >probe:HG-U133A:215032_at:417:679; Interrogation_Position = 2052; Antisense;
    TTTATTGTCATGCCTTCATCACCTA
    >probe:HG-U133A:215032_at:457:599; Interrogation_Position = 2062; Antisense;
    TGCCTTCATCACCTACTACGTATGA
    >probe:HG-U133A:215032_at:135:407; Interrogation_Position = 2103; Antisense;
    GAGGGATTTAAGGGTGAGGGAGAGA
    >probe:HG-U133A:215032_at:633:373; Interrogation_Position = 2136; Antisense;
    GACAGACACAGGCATAAGAGACAAA
    >probe:HG-U133A:215032_at:445:16l; Interrogation_Position = 2183; Antisense;
    AATTCAAGCCAGGTGTAGGGGTGCA
    >probe:HG-U133A:215032_at:648:175; Interrogation_Position = 2206; Antisense;
    CACACCTGTAGCCCTAGTTACTAGG
    Probes sequences for 219920_s_at:
    >probe:HG-U133A:219920_s_at:202:549; Interrogation_Position = 1024; Antisense;
    GGCCAGAACTGCAGCATTGGCCCCA
    >probe:HG-U133A:219920_s_at:305:703; Interrogation_Position = 1040; Antisense;
    TTGGCCCCAATGTGAGCCTGGGACC
    >probe:HG-U133A:219920_s_at:646:265; Interrogation_Position = 1063; Antisense;
    CCTGGCGTGGTGGTCGAAGATGGTG
    >probe:HG-U133A:219920_s_at:425:415; Interrogation_Position = 1237; Antisense;
    GATGAGCTCTACCTCAACGGAGCCA
    >probe:HG-U133A:219920_s_at:283:595; Interrogation_Position = 1265; Antisense;
    TGCTGCCCCACAAGTCTATTGGCGA
    >probe:HG-U133A:219920_s_at:216:281; Interrogation_Position = 1297; Antisense;
    CCAGAGCCTCGTATCATCATGTGAG
    >probe:HG-U133A:219920_s_at:246:55; Interrogation_Position = 1370; Antisense;
    AGTGCTGGCCTGACACATCAGAAGA
    >probe:HG-U133A:219920_s_at:455:201; Interrogation_Position = 1385; Antisense;
    CATCAGAAGACCCTGGACTTGTCAT
    >probe:HG-U133A:219920_s_at:612:459; Interrogation_Position = 1405; Antisense;
    GTCATTATTTGTCTGGGGGGCACTG
    >probe:HG-U133A:219920_s_at:15:485; Interrogation_Position = 1466; Antisense;
    GTGGACATCATCTGGCAGGATCCCT
    >probe:HG-U133A:219920_s_at:618:249; Interrogation_Position = 1511; Antisense;
    CCCACTCCCTCAAGAAGGGCCAGGG
    Probes sequences for 211996_s_at:
    >probe:HG-U133A:211996_s_at:589:121; Interrogation_Position = 2359; Antisense;
    AAACCACGCAAACCCAAGAGGCAGA
    >probe:HG-U133A:211996_s_at:73:65; Interrogation_Position = 2381; Antisense;
    AGAGGGCGGCTGAGATGGAACCACC
    >probe:HG-U133A:211996_s_at:110:167; Interrogation_Position = 2415; Antisense;
    CAAGAGGCGGAGGGTCGGTGACGTG
    >probe:HG-U133A:211996_s_at:452:137; Interrogation_Position = 2441; Antisense;
    AACCGTCACGCAAACCCAAGAGGCG
    >probe:HG-U133A:211996_s_at:634:167; Interrogation_Position = 2457; Antisense;
    CAAGAGGCGGAGGGCCGCTGACGTG
    >probe:HG-U133A:211996_s_at:401:63; Interrogation_Position = 2504; Antisense;
    AGAGGCGGAGGGTCGGTGATGTGGA
    >probe:HG-U133A:211996_s_at:218:577; Interrogation_Position = 2520; Antisense;
    TGATGTGGAACCGTCACGCAAACCC
    >probe:HG-U133A:211996_s_at:704:285; Interrogation_Position = 2588; Antisense;
    CCAAGAGGCGGAGGGTCGGTGACGT
    >probe:HG-U133A:211996_s_at:549:487; Interrogation_Position = 2601; Antisense;
    GGTCGGTGACGTGGAACCGTCACGC
    >probe:HG-U133A:211996_s_at:363:299; Interrogation_Position = 2644; Antisense;
    GCCGCTGACGTGGAACCATCATTAC
    >probe:HG-U133A:211996_s_at:248:577; Interrogation_Position = 2695; Antisense;
    TGAGAAGAGGCCAGTGCACTCAAGC
    Probes sequences for 200075_s_at:
    >probe:HG-U133A:200075_s_at:466:263; Interrogation_Position = 547; Antisense;
    CCGCATCTGTGTGCTGGACGTGGAC
    >probe:HG-U133A:200075_s_at:550:525; Interrogation_Position = 568; Antisense;
    GGACCTGCAGGGTGTGCGGAACATC
    >probe:HG-U133A:200075_s_at:339:469; Interrogation_Position = 581; Antisense;
    GTGCGGAACATCAAGGCCACCGATC
    >probe:HG-U133A:200075_s_at:470:305; Interrogation_Position = 664; Antisense;
    GCGGCAGCGCAACACTGAAACCGAG
    >probe:HG-U133A:200075_s_at:491:597; Interrogation_Position = 715; Antisense;
    TGCCCAGGCCGACATGGAGAGCAGC
    >probe:HG-U133A:200075_s_at:596:413; Interrogation_Position = 758; Antisense;
    GATGTGGTCATCATTAACGACAGCC
    >probe:HG-U133A:200075_s_at:710:627; Interrogation_Position = 765; Antisense;
    TCATCATTAACGACAGCCTGGACCA
    >probe:HG-U133A:200075_s_at:587:347; Interrogation_Position = 805; Antisense;
    GAAGGAGGCGCTCTCTGAGGAAATC
    >probe:HG-U133A:200075_s_at:515:153; Interrogation_Position = 826; Antisense;
    AATCAAGAAAGCTCAAAGGACCGGC
    >probe:HG-U133A:200075_s_at:220:255; Interrogation_Position = 950; Antisense;
    CCCTTGGCCAGCATGTGGAGTGGAG
    >probe:HG-U133A:200075_s_at:219:637; Interrogation_Position = 1054; Antisense;
    TCACTCTGGACCCAGGGCTGACATC
    Probes sequences for 214753_at:
    >probe:HG-U133A:214753_at:269:613; Interrogation_Position = 1869; Antisense;
    TCCCTTGCCTTCACTGTAATGCTTA
    >probe:HG-U133A:214753_at:696:641; Interrogation_Position = 1885; Antisense;
    TAATGCTTAATGGTTGTGTAGTCTT
    >probe:HG-U133A:214753_at:163:463; Interrogation_Position = 1905; Antisense;
    GTCTTATACGTGACTCCTGACTTCA
    >probe:HG-U133A:214753_at:303:267; Interrogation_Position = 1920; Antisense;
    CCTGACTTCAAGGATCCTGGTCTGT
    >probe:HG-U133A:214753_at:255:29; Interrogation_Position = 1933; Antisense;
    ATCCTGGTCTGTACCTCTTTAGGTC
    >probe:HG-U133A:214753_at:700:647; Interrogation_Position = 1944; Antisense;
    TACCTCTTTAGGTCAACACGTTTTG
    >probe:HG-U133A:214753_at:176:95; Interrogation_Position = 1961; Antisense;
    ACGTTTTGAGTGAACTGGTGTTGGT
    >probe:HG-U133A:214753_at:216:535; Interrogation_Position = 2025; Antisense;
    GGAATGGCTTCATATAGGAGTTCAC
    >probe:HG-U133A:214753_at:289:401; Interrogation_Position = 2098; Antisense;
    GAGTTGAGCTGATTGGAGGACCAAA
    >probe:HG-U133A:214753_at:189:385; Interrogation_Position = 2116; Antisense;
    GACCAAATTAAAAGACTGGCTGGGC
    >probe:HG-U133A:214753_at:477:83; Interrogation_Position = 2219; Antisense;
    AGCCTAGATAACCTGGGTATCCCAG
    Probes sequences for 220459_at:
    >probe:HG-U133A:220459_at:20:361; Interrogation_Position = 1969; Antisense;
    GAAATGTGACTTTGTAATCCCAGGA
    >probe:HG-U133A:220459_at:353:333; Interrogation_Position = 2008; Antisense;
    GCACCAAGAAGTTACTGTTACCTCA
    >probe:HG-U133A:220459_at:106:559; Interrogation_Position = 2047; Antisense;
    TGGATGGCTCTTATCATCATCTTTA
    >probe:HG-U133A:220459_at:196:471; Interrogation_Position = 2181; Antisense;
    GTGCCCACTACTCAATTTGACTGGC
    >probe:HG-U133A:220459_at:546:69; Interrogation_Position = 2241; Antisense;
    AGACATAATACTTGCTTCTGAGCCT
    >probe:HG-U133A:220459_at:126:77; Interrogation_Position = 2296; Antisense;
    AGCAGTATTTGCATGGTTCCAGTGA
    >probe:HG-U133A:220459_at:471:613; Interrogation_Position = 2379; Antisense;
    TGACCTTTATACACACGTTAACAGA
    >probe:HG-U133A:220459_at:628:107; Interrogation_Position = 2399; Antisense;
    ACAGAAATCATCTGATCCCCTTTGC
    >probe:HG-U133A:220459_at:386:419; Interrogation_Position = 2412; Antisense;
    GATCCCCTTTGCTGCTAAGTGAGTT
    >probe:HG-U133A:220459_at:335:363; Interrogation_Position = 2437; Antisense;
    GAAAAGCGAAAGCGTCTGAATCCCA
    >probe:HG-U133A:220459_at:74:345; Interrogation_Position = 2454; Antisense;
    GAATCCCACCAGCATCGCTGGATGT
    Probes sequences for 215645_at:
    >probe:HG-U133A:215645_at:494:517; Interrogation_Position = 1525; Antisense;
    GGAGGAAGGGCTGAAATGCTTTCTA
    >probe:HG-U133A:215645_at:72:425; Interrogation_Position = 1553; Antisense;
    GATACTATCTGGGCATATTACTTCC
    >probe:HG-U133A:215645_at:515:27; Interrogation_Position = 1559; Antisense;
    ATCTGGGCATATTACTTCCTGTGGT
    >probe:HG-U133A:215645_at:436:489; Interrogation_Position = 1563; Antisense;
    GGGCATATTACTTCCTGTGGTTCAC
    >probe:HG-U133A:215645_at:109:669; Interrogation_Position = 1568; Antisense;
    TATTACTTCCTGTGGTTCACTGTCT
    >probe:HG-U133A:215645_at:179:569; Interrogation_Position = 1588; Antisense;
    TGTCTGGGTGACAGGATTCATAGAA
    >probe:HG-U133A:215645_at:406:195; Interrogation_Position = 1599; Antisense;
    CAGGATTCATAGAAGCCCAAACTTT
    >probe:HG-U133A:215645_at:668:91; Interrogation_Position = 1639; Antisense;
    ACCCTTGTAACAAAGCCGCACACGT
    >probe:HG-U133A:215645_at:554:699; Interrogation_Position = 1643; Antisense;
    TTGTAACAAAGCCGCACACGTACGC
    >probe:HG-U133A:215645_at:641:107; Interrogation_Position = 1659; Antisense;
    CACGTACGCCCTCAAGCTAAAACAA
    >probe:HG-U133A:215645_at:227:297; Interrogation_Position = 1739; Antisense;
    GCCTGGCAGATAACGGTGAAACCCC
    Probes sequences for 204102_s_at:
    >probe:HG-U133A:204102_s_at:165:293; Interrogation_Position = 2727; Antisense;
    GCCATCACTCAACCATAACACTTGA
    >probe:HG-U133A:204102_s_at:229:171; Interrogation_Position = 2736; Antisense;
    CAACCATAACACTTGATGCCGTTTC
    >probe:HG-U133A:204102_s_at:587:19; Interrogation_Position = 2741; Antisense;
    ATAACACTTGATGCCGTTTCTTTCA
    >probe:HG-U133A:204102_s_at:309:111; Interrogation_Position = 2744; Antisense;
    ACACTTGATGCCGTTTCTTTCAATA
    >probe:HG-U133A:204102_s_at:311:419; Interrogation_Position = 2750; Antisense;
    GATGCCGTTTCTTTCAATATTTATT
    >probe:HG-U133A:204102_s_at:333:639; Interrogation_Position = 2763; Antisense;
    TCAATATTTATTTCCAGAGTCCGGA
    >probe:HG-U133A:204102_s_at:118:681; Interrogation_Position = 2769; Antisense;
    TTTATTTCCAGAGTCCGGAGGCAGC
    >probe:HG-U133A:204102_s_at:379:281; Interrogation_Position = 2792; Antisense;
    GCAGACACGCCCTCTTAGTAGGGAC
    >probe:HG-U133A:204102_s_at:80:373; Interrogation_Position = 2795; Antisense;
    GACACGCCCTCTTAGTAGGGACTTA
    >probe:HG-U133A:204102_s_at:385:181; Interrogation_Position = 2797; Antisense;
    CACGCCCTCTTAGTAGGGACTTAAT
    >probe:HG-U133A:204102_s_at:353:655; Interrogation_Position = 2807; Antisense;
    TAGTAGGGACTTAATGGGCCGGTCG
    Probes sequences for 202419_at:
    >probe:HG-U133A:202419_at:508:83; Interrogation_Position = 1685; Antisense;
    AGCCCTGTGTTGTGCTCAGGACTCA
    >probe:HG-U133A:202419_at:608:595; Interrogation_Position = 1713; Antisense;
    TGCTGCTGGTGGAAACTCATGGCTT
    >probe:HG-U133A:202419_at:161:623; Interrogation_Position = 1741; Antisense;
    TCTCTCTTTGATCCCATAAAGCTAC
    >probe:HG-U133A:202419_at:71:157; Interrogation_Position = 1827; Antisense;
    AATGCTTTCTTGTCTTTAGACTCAA
    >probe:HG-U133A:202419_at:198:319; Interrogation_Position = 1854; Antisense;
    GCTTAGGGAACGTTTCATTTCTCAT
    >probe:HG-U133A:202419_at:44:153; Interrogation_Position = 1888; Antisense;
    AAGGCAGCCTCCTTAAATGTTTTCT
    >probe:HG-U133A:202419_at:430:679; Interrogation_Position = 1943; Antisense;
    TTTACAGTTCCTTCAATAACCATGA
    >probe:HG-U133A:202419_at:606:583; Interrogation_Position = 1973; Antisense;
    TGAAGTTCACCTATCCCATTTTAGC
    >probe:HG-U133A:202419_at:36:223; Interrogation_Position = 2017; Antisense;
    CTCTTCCTTTCCAATTTTGCTTATA
    >probe:HG-U133A:202419_at:430:385; Interrogation_Position = 2085; Antisense;
    GACCAGCTAAAATTTTCGACTTGAC
    >probe:HG-U133A:202419_at:520:149; Interrogation_Position = 2157; Antisense;
    AAGTGTGACTTTTTCTCGGAGCATA
    Probes sequences for 214715_x_at:
    >probe:HG-U133A:214715_x_at:545:287; Interrogation_Position = 5145; Antisense;
    GCCAAGGCAAGCAGATCACCTGAAG
    >probe:HG-U133A:214715_x_at:96:91; Interrogation_Position = 5162; Antisense;
    ACCTGAAGTCAGGAGAGACCAGCCT
    >probe:HG-U133A:214715_x_at:646:551; Interrogation_Position = 5186; Antisense;
    TGGCCAGCATAGTGAAACCCCATCT
    >probe:HG-U133A:214715_x_at:239:127; Interrogation_Position = 5217; Antisense;
    AAAATACAAACTTAGCTGGGCTTAG
    >probe:HG-U133A:214715_x_at:682:307; Interrogation_Position = 5231; Antisense;
    GCTGGGCTTAGTGGTGCATGCCTGT
    >probe:HG-U133A:214715_x_at:267:321; Interrogation_Position = 5292; Antisense;
    GCTTGAATCCAAGAGGCAGAGGTTC
    >probe:HG-U133A:214715_x_at:382:521; Interrogation_Position = 5324; Antisense;
    GGAGATCATTCCATTACACTCCAGC
    >probe:HG-U133A:214715_x_at:183:239; Interrogation_Position = 5349; Antisense;
    CTGGGCACCAGGAACGAAACTCGTT
    >probe:HG-U133A:214715_x_at:102:127; Interrogation_Position = 5398; Antisense;
    AAAACGTTGGCTGGGCATGGTGGCT
    >probe:HG-U133A:214715_x_at:11:411; Interrogation_Position = 5454; Antisense;
    GAGGCAGGTGGATTACTTAAGGTCA
    >probe:HG-U133A:214715_x_at:609:357; Interrogation_Position = 5610; Antisense;
    GAACCCAGGTGGTGGAGGCTGCAGT
    Probes sequences for 216859_x_at:
    >probe:HG-U133A:216859_x_at:585:283; Interrogation_Position = 973; Antisense;
    CCAACAGCTCACTGAGAACGGGTCA
    >probe:HG-U133A:216859_x_at:411:109; Interrogation_Position = 976; Antisense;
    ACAGCTCACTGAGAACGGGTCATGA
    >probe:HG-U133A:216859_x_at:560:77; Interrogation_Position = 978; Antisense;
    AGCTCACTGAGAACGGGTCATGATG
    >probe:HG-U133A:216859_x_at:390:179; Interrogation_Position = 982; Antisense;
    CACTGAGAACGGGTCATGATGACGA
    >probe:HG-U133A:216859_x_at:390:395; Interrogation_Position = 1053; Antisense;
    GAGAAATCAGATTGTTGCTGTGTCT
    >probe:HG-U133A:216859_x_at:177:431; Interrogation_Position = 1062; Antisense;
    GATTGTTGCTGTGTCTGTGTAGAAA
    >probe:HG-U133A:216859_x_at:371:349; Interrogation_Position = 1087; Antisense;
    GAAGTAGACACAGGAGACTCCACTT
    >probe:HG-U133A:216859_x_at:136:373; Interrogation_Position = 1093; Antisense;
    GACACAGGAGACTCCACTTTGTTCT
    >probe:HG-U133A:216859_x_at:305:105; Interrogation_Position = 1096; Antisense;
    ACAGGAGACTCCACTTTGTTCTGTA
    >probe:HG-U133A:216859_x_at:610:67; Interrogation_Position = 1367; Antisense;
    AGACCTCTGTTCACTTGTTTATTTG
    >probe:HG-U133A:216859_x_at:218:619; Interrogation_Position = 1372; Antisense;
    TCTGTTCACTTGTTTATTTGCTGAC
    Probes sequences for 215529_x_at:
    >probe:HG-U133A:215529_x_at:613:673; Interrogation_Position = 2539; Antisense;
    TTTGATATCTCATTTCCATGTAACA
    >probe:HG-U133A:215529_x_at:466:119; Interrogation_Position = 2589; Antisense;
    AAAGATGTTCTATTCTAGTTGGAGA
    >probe:HG-U133A:215529_x_at:616:473; Interrogation_Position = 2652; Antisense;
    GTGACTAATAACTGTAAGACTCTCT
    >probe:HG-U133A:215529_x_at:143:567; Interrogation_Position = 2779; Antisense;
    TGTGTTGACTCAAATTCATTTCTGA
    >probe:HG-U133A:215529_x_at:575:141; Interrogation_Position = 2829; Antisense;
    AAGCACCAAGTTTCAGGCCAGGAGT
    >probe:HG-U133A:215529_x_at:678:577; Interrogation_Position = 2891; Antisense;
    TGAGGCAGGCGGATTACCTGAGGTC
    >probe:HG-U133A:215529_x_at:382:529; Interrogation_Position = 2935; Antisense;
    GGACAATATGATGAGACCCCGTCTC
    >probe:HG-U133A:215529_x_at:300:115; Interrogation_Position = 2974; Antisense;
    AAATTAGCTGGCTGGTGGTGCGCGC
    >probe:HG-U133A:215529_x_at:221:709; Interrogation_Position = 3002; Antisense;
    TTGTCCCGGCTACTTGGGAGGTTGA
    >probe:HG-U133A:215529_x_at:152:407; Interrogation_Position = 3019; Antisense;
    GAGGTTGAGGCACAAGAATCGCTTG
    >probe:HG-U133A:215529_x_at:285:519; Interrogation_Position = 3056; Antisense;
    GGAGGTTGCAGCGAGCCAAGATCTC
    Probes sequences for 202936_s_at:
    >probe:HG-U133A:202936_s_at:240:449; Interrogation_Position = 3363; Antisense;
    GTAGTGTATCACTGAGTCATTTGCA
    >probe:HG-U133A:202936_s_at:32:573; Interrogation_Position = 3389; Antisense;
    TGTTTTCTGCCACAGACCTTTGGGC
    >probe:HG-U133A:202936_s_at:240:563; Interrogation_Position = 3409; Antisense;
    TGGGCTGCCTTATATTGTGTGTGTG
    >probe:HG-U133A:202936_s_at:140:143; Interrogation_Position = 3470; Antisense;
    AAGCATGTGTCATCCATATTTCTCT
    >probe:HG-U133A:202936_s_at:490:701; Interrogation_Position = 3505; Antisense;
    TTGGAGTGAGGGAGGCTACCTGGAG
    >probe:HG-U133A:202936_s_at:157:217; Interrogation_Position = 3520; Antisense;
    CTACCTGGAGGGGATCAGCCCACTG
    >probe:HG-U133A:202936_s_at:593:181; Interrogation_Position = 3535; Antisense;
    CAGCCCACTGACAGACCTTAATCTT
    >probe:HG-U133A:202936_s_at:456:9; Interrogation_Position = 3561; Antisense;
    ATTACTGCTGTGGCTAGAGAGTTTG
    >probe:HG-U133A:202936_s_at:528:15; Interrogation_Position = 3688; Antisense;
    ATATGGCATCCTTCAATTTCTGTAT
    >probe:HG-U133A:202936_s_at:77:455; Interrogation_Position = 3787; Antisense;
    GTAAAAGCTTTGGTTTGTGTTCGTG
    >probe:HG-U133A:202936_s_at:348:573; Interrogation_Position = 3854; Antisense;
    TGTTCTCTCCGTGAAACTTACCTTT
    Probes sequences for 212130_x_at:
    >probe:HG-U133A:212130_x_at:326:691; Interrogation_Position = 13; Antisense;
    TTCACTTTCTAGACGATTACAAACA
    >probe:HG-U133A:212130_x_at:552:189; Interrogation_Position = 71; Antisense;
    CAGAGTTCCCTACCCTAAGAGAATG
    >probe:HG-U133A:212130_x_at:677:427; Interrogation_Position = 95; Antisense;
    GTTACCACCTGAACAGTCCTCGGTG
    >probe:HG-U133A:212130_x_at:406:535; Interrogation_Position = 145; Antisense;
    GGCAGAAGCACCAGCTGTACTACTA
    >probe:HG-U133A:212130_x_at:649:503; Interrogation_Position = 198; Antisense;
    GGTGTCTCCAACCTGACTAGGTGGA
    >probe:HG-U133A:212130_x_at:356:301; Interrogation_Position = 237; Antisense;
    GCCCTCTTACCGCTAGCGAGGTGAT
    >probe:HG-U133A:212130_x_at:609:477; Interrogation_Position = 257; Antisense;
    GTGATAGGACATCTGGCTTGCCACA
    >probe:HG-U133A:212130_x_at:294:321; Interrogation_Position = 272; Antisense;
    GCTTGCCACAAAGGTCTGTTCGACC
    >probe:HG-U133A:212130_x_at:543:435; Interrogation_Position = 289; Antisense;
    GTTCGACCAGACATATCCTAGCTAA
    >probe:HG-U133A:212130_x_at:106:347; Interrogation_Position = 331; Antisense;
    GAATGTGAGGCCAACCTTCTATCAG
    >probe:HG-U133A:212130_x_at:315:187; Interrogation_Position = 439; Antisense;
    CAGCTGCCCAATGCCATGTGAAGTA
    Probes sequences for 217336_at:
    >probe:HG-U133A:217336_at:432:71; Interrogation_Position = 29; Antisense;
    AGAACCAGATTGCCATTTATGACTC
    >probe:HG-U133A:217336_at:423:87; Interrogation_Position = 32; Antisense;
    ACCAGATTGCCATTTATGACTCCTT
    >probe:HG-U133A:217336_at:336:631; Interrogation_Position = 70; Antisense;
    TCATGGTGGCCAAGGACATCCACAT
    >probe:HG-U133A:217336_at:336:287; Interrogation_Position = 78; Antisense;
    GCCAAGGACATCCACATGCCTAAGC
    >probe:HG-U133A:217336_at:30:267; Interrogation_Position = 104; Antisense;
    CCTGGAGCTGTCAGACAAGAATGTG
    >probe:HG-U133A:217336_at:582:467; Interrogation_Position = 128; Antisense;
    GTCCAACCTTCCTGTCATAAAGGCC
    >probe:HG-U133A:217336_at:558:137; Interrogation_Position = 132; Antisense;
    AACCTTCCTGTCATAAAGGCCATGC
    >probe:HG-U133A:217336_at:307:609; Interrogation_Position = 137; Antisense;
    TCCTGTCATAAAGGCCATGCAGTCT
    >probe:HG-U133A:217336_at:419:349; Interrogation_Position = 335; Antisense;
    GAAGGGTGAGTGACCTGCAAGACTC
    >probe:HG-U133A:217336_at:454:49; Interrogation_Position = 165; Antisense;
    AGGGAAAGCCGACAGAGATACCTAC
    >probe:HG-U133A:217336_at:179:493; Interrogation_Position = 435; Antisense;
    GGGACTGGGTCAGCAACTGAATTCC
    Probes sequences for 203301_s_at:
    >probe:HG-U133A:203301_s_at:155:453; Interrogation_Position = 3214; Antisense;
    GTAACAAAACCCACCAGAATCTAAG
    >probe:HG-U133A:203301_s_at:653:187; Interrogation_Position = 3228; Antisense;
    CAGAATCTAAGCAGTTTTCACCCCC
    >probe:HG-U133A:203301_s_at:361:259; Interrogation_Position = 3248; Antisense;
    CCCCCTCAGAAACCACTGTCATTAG
    >probe:HG-U133A:203301_s_at:591:55; Interrogation_Position = 3316; Antisense;
    AGTAATGTTACCTATCCTTGATACC
    >probe:HG-U133A:203301_s_at:662:215; Interrogation_Position = 3327; Antisense;
    CTATCCTTGATACCATGACCATTTA
    >probe:HG-U133A:203301_s_at:705:341; Interrogation_Position = 3382; Antisense;
    GAATAGTTTGTCATCCACTTAGTGT
    >probe:HG-U133A:203301_s_at:590:197; Interrogation_Position = 3399; Antisense;
    CTTAGTGTGTTAGCTGGTGGGGTAC
    >probe:HG-U133A:203301_s_at:317:499; Interrogation_Position = 3417; Antisense;
    GGGGTACAATATAACCTCTCATCTC
    >probe:HG-U133A:203301_s_at:451:217; Interrogation_Position = 3492; Antisense;
    CTAAGAATCATTCCTGTACTACAGA
    >probe:HG-U133A:203301_s_at:649:419; Interrogation_Position = 3626; Antisense;
    GATCTTCACAAAGTTGTCTTTTCAC
    >probe:HG-U133A:203301_s_at:51:179; Interrogation_Position = 3648; Antisense;
    CACTGTGTTTTGTCAACGTGAAATT
    Probes sequences for 207283_at:
    >probe:HG-U133A:207283_at:610:673; Interrogation_Position = 1734; Antisense;
    TTTCAAACCCAGTAGAGTTACCTCA
    >probe:HG-U133A:207283_at:166:129; Interrogation_Position = 1739; Antisense;
    AACCCAGTAGAGTTACCTCAAAGAT
    >probe:HG-U133A:207283_at:410:673; Interrogation_Position = 1778; Antisense;
    TTTCAGAAGATGGCACCGAAAGTGA
    >probe:HG-U133A:207283_at:260:331; Interrogation_Position = 1790; Antisense;
    GCACCGAAAGTGAAGAAGGAAGCTC
    >probe:HG-U133A:207283_at:593:281; Interrogation_Position = 2031; Antisense;
    CCACTGAGTTTGCTATGAAGAAGAT
    >probe:HG-U133A:207283_at:709:311; Interrogation_Position = 2131; Antisense;
    GCTCTGTGACATTGATGGGGCCAAG
    >probe:HG-U133A:207283_at:702:235; Interrogation_Position = 2134; Antisense;
    CTGTGACATTGATGGGGCCAAGGTC
    >probe:HG-U133A:207283_at:305:417; Interrogation_Position = 2175; Antisense;
    GATGAAGGCATATGTTCCACTGGCT
    >probe:HG-U133A:207283_at:560:421; Interrogation_Position = 2238; Antisense;
    GATCATCCAAACTGAGTCCATCTGG
    >probe:HG-U133A:207283_at:221:135; Interrogation_Position = 2247; Antisense;
    AACTGAGTCCATCTGGCTAATTCTA
    >probe:HG-U133A:207283_at:163:469; Interrogation_Position = 2253; Antisense;
    GTCCATCTGGCTAATTCTAAATATA
    Probes sequences for 222168_at:
    >probe:HG-U133A:222168_at:319:163; Interrogation_Position = 5797; Antisense;
    AATTCACTTATTTTCATGGGTTCTG
    >probe:HG-U133A:222168_at:278:385; Interrogation_Position = 5843; Antisense;
    GACCTAAAAATCTCAATGGCACCTA
    >probe:HG-U133A:222168_at:498:217; Interrogation_Position = 5846; Antisense;
    CTAAAAATCTCAATGGCACCTACCA
    >probe:HG-U133A:222168_at:142:219; Interrogation_Position = 5883; Antisense;
    CTAACACATAGTAGGGGCTCTACAT
    >probe:HG-U133A:222168_at:418:449; Interrogation_Position = 5893; Antisense;
    GTAGGGGCTCTACATGATTATCTGT
    >probe:HG-U133A:222168_at:548:543; Interrogation_Position = 5898; Antisense;
    GGCTCTACATGATTATCTGTAATTG
    >probe:HG-U133A:222168_at:8:519; Interrogation_Position = 5935; Antisense;
    GGAGGTGATGGTTTTCCTATTTGAT
    >probe:HG-U133A:222168_at:12:339; Interrogation_Position = 6004; Antisense;
    GCAACATAGTTCTAAGCCAGTGGTT
    >probe:HG-U133A:222168_at:82:559; Interrogation_Position = 6024; Antisense;
    TGGTTCTCCAACTTGAGTATGCATC
    >probe:HG-U133A:222168_at:219:621; Interrogation_Position = 6028; Antisense;
    TCTCCAACTTGAGTATGCATCAGAA
    >probe:HG-U133A:222168_at:183:129; Interrogation_Position = 6071; Antisense;
    AAAATACAGATGTCTGCCGGGCGCG
    Probes sequences for 222272_x_at:
    >probe:HG-U133A:222272_x_at:88:647; Interrogation_Position = 650; Antisense;
    TAAAACAGGGCCATGAGCCACCCAC
    >probe:HG-U133A:222272_x_at:507:205 Interrogation_Position = 674; Antisense;
    CATTCACAGGCTGGTTCCTGGGCTG
    >probe:HG-U133A:222272_x_at:437:539; Interrogation_Position = 753; Antisense;
    GGCAGTTATCTCATTGCTGTTTTGG
    >probe:HG-U133A:222272_x_at:324:319; Interrogation_Position = 794; Antisense;
    GCTTTTTGCTTATTTGTCTTTTGAA
    >probe:HG-U133A:222272_x_at:208:577; Interrogation_Position = 872; Antisense;
    TGAGGTAGGCGGATCACTGGGGTCA
    >probe:HG-U133A:222272_x_at:421:163; Interrogation_Position = 957; Antisense;
    AATTAGCTGCGCGTGGTGGTGCACG
    >probe:HG-U133A:222272_x_at:360:267; Interrogation_Position = 982; Antisense;
    CCTGTAGTCCCTGCTACTTGGAAGG
    >probe:HG-U133A:222272_x_at:90:383; Interrogation_Position = 1128; Antisense;
    GACGCGGCCGCGAATTTAGTAGTAG
    >probe:HG-U133A:222272_x_at:494:447; Interrogation_Position = 1146; Antisense;
    GTAGTAGTAGGCGGCCGCTCTAGAG
    >probe:HG-U133A:222272_x_at:633:61; Interrogation_Position = 1167; Antisense;
    AGAGGATCCAAGCTTACGTACGCGT
    >probe:HG-U133A:222272_x_at:398:655; Interrogation_Position = 1185; Antisense;
    TACGCGTGCATGCGACGTCATAGCT
    Probes sequences for 219290_x_at:
    >probe:HG-U133A:219290_x_at:108:73; Interrogation_Position = 2447; Antisense;
    AGAAGTTACTGGCCAGGAGCGGCGG
    >probe:HG-U133A:219290_x_at:178:455; Interrogation_Position = 2481; Antisense;
    GTAATCCCAGGACTTTGGTAGGCCA
    >probe:HG-U133A:219290_x_at:496:507; Interrogation_Position = 2497; Antisense;
    GGTAGGCCAAGACAAGCAGATCACT
    >probe:HG-U133A:219290_x_at:528:195; Interrogation_Position = 2528; Antisense;
    CAGGAGTTCAACATCAGCCTGGCCA
    >probe:HG-U133A:219290_x_at:161:551; Interrogation_Position = 2548; Antisense;
    GGCCAACATGATGAAACCTTGTCTT
    >probe:HG-U133A:219290_x_at:668:591; Interrogation_Position = 2677; Antisense;
    TGCAGTGAACCAAGATCGCGGCGCT
    >probe:HG-U133A:219290_x_at:460:505; Interrogation_Position = 2715; Antisense;
    GGTGACAGAGTCAGACTCCGTCCCA
    >probe:HG-U133A:219290_x_at:105:669; Interrogation_Position = 2804; Antisense;
    TATAGTTTTATGTTTGTTTTCTTAG
    >probe:HG-U133A:219290_x_at:400:37; Interrogation_Position = 2835; Antisense;
    ATGTGTTTCTTTGGGTGGGTAATAT
    >probe:HG-U133A:219290_x_at:681:671; Interrogation_Position = 2857; Antisense;
    TATTGTGTTTTACTATGTTTACCTT
    >probe:HG-U133A:219290_x_at:606:441; Interrogation_Position = 2898; Antisense;
    GTTTATTTATATTCTTTGGCTTTGT
    Probes sequences for 204119_s_at:
    >probe:HG-U133A:204119_s_at:473:119; Interrogation_Position = 645; Antisense;
    AAAGGTGGCTCACCATGCTTCTGAA
    >probe:HG-U133A:204119_s_at:488:101; Interrogation_Position = 685; Antisense;
    ACTTTGAATCTATCTGCACCGTTTA
    >probe:HG-U133A:204119_s_at:526:333; Interrogation_Position = 700; Antisense;
    GCACCGTTTATTAGCCAGTTCTACA
    >probe:HG-U133A:204119_s_at:564:107; Interrogation_Position = 778; Antisense;
    ACAGAAGCTGCCACTTTTGCTAGAG
    >probe:HG-U133A:204119_s_at:310:371; Interrogation_Position = 846; Antisense;
    GACACAAGCCCTGCCAAAGATGAAC
    >probe:HG-U133A:204119_s_at:62:45; Interrogation_Position = 877; Antisense;
    AGGCAGCGAATCGTGATCTTCACCC
    >probe:HG-U133A:204119_s_at:516:669; Interrogation_Position = 1013; Antisense;
    TTGGAGGTTTTCTGTCTCAACTGGT
    >probe:HG-U133A:204119_s_at:558:169; Interrogation_Position = 1030; Antisense;
    CAACTGGTCTCTGACAAGCCTCTGA
    >probe:HG-U133A:204119_s_at:540:295; Interrogation_Position = 1047; Antisense;
    GCCTCTGACTGAATGTATCCGTGCT
    >probe:HG-U133A:204119_s_at:291:667; Interrogation_Position = 1062; Antisense;
    TATCCGTGCTGGCCACTATGCAGCA
    >probe:HG-U133A:204119_s_at:120:613; Interrogation_Position = 1094; Antisense;
    TAATTAGACGGACTGGCTGCACCTT
    Probes sequences for 215204_at:
    >probe:HG-U133A:215204_at:49:199; Interrogation_Position = 1792; Antisense;
    CATGGTGAAACCCTAACTCTCCAAA
    >probe:HG-U133A:215204_at:105:659; Interrogation_Position = 2013; Antisense;
    TATGGAGGACTTCCTTACAACTTTT
    >probe:HG-U133A:215204_at:176:9; Interrogation_Position = 2124; Antisense;
    ATTACTGTACTGGTACGTGCTTTTT
    >probe:HG-U133A:215204_at:113:453; Interrogation_Position = 2130; Antisense;
    GTACTGGTACGTGCTTTTTATATAT
    >probe:HG-U133A:215204_at:544:683; Interrogation_Position = 2147; Antisense;
    TTATATATTTGATCTATTTCTTTGC
    >probe:HG-U133A:215204_at:492:625; Interrogation_Position = 2159; Antisense;
    TCTATTTCTTTGCATCAGATCCCTA
    >probe:HG-U133A:215204_at:38:13; Interrogation_Position = 2162; Antisense;
    ATTTCTTTGCATCAGATCCCTAGAA
    >probe:HG-U133A:215204_at:290:575; Interrogation_Position = 2206; Antisense;
    TGATATACAGATTTTTAAGACTTGA
    >probe:HG-U133A:215204_at:462:641; Interrogation_Position = 2221; Antisense;
    TAAGACTTGATACGTTAATGCCAAA
    >probe:HG-U133A:215204_at:686:423; Interrogation_Position = 2229; Antisense;
    GATACGTTAATGCCAAAATGCCTTG
    >probe:HG-U133A:215204_at:68:129; Interrogation_Position = 2243; Antisense;
    AAAATGCCTTGTGTGGCCAGGCGCG
    Probes sequences for 215387_x_at:
    >probe:HG-U133A:215387_x_at:562:303; Interrogation_Position = 1850; Antisense;
    GCGATGGGCAGGTAGAAGGGAATGA
    >probe:HG-U133A:215387_x_at:212:491; Interrogation_Position = 1867; Antisense;
    GGGAATGAGATGGGCCAGGCGCAGT
    >probe:HG-U133A:215387_x_at:424:333; Interrogation_Position = 1912; Antisense;
    GCACTTTAGGAGGCCGAGGCTGGCA
    >probe:HG-U133A:215387_x_at:173:299; Interrogation_Position = 1924; Antisense;
    GCCGAGGCTGGCAGATCAGAAGGCC
    >probe:HG-U133A:215387_x_at:33:115; Interrogation_Position = 2011; Antisense;
    AAATTAGCTGGGCACGGGGGCATGC
    >probe:HG-U133A:215387_x_at:23:551; Interrogation_Position = 2226; Antisense;
    TGGTTGATGAGTCTTTAGGGAGTTC
    >probe:HG-U133A:215387_x_at:410:437; Interrogation_Position = 2247; Antisense;
    GTTCATAGACTATTCTGGGTTGTGT
    >probe:HG-U133A:215387_x_at:188:513; Interrogation_Position = 2264; Antisense;
    GGTTGTGTATAGTTTCTGAATTTCC
    >probe:HG-U133A:215387_x_at:339:397; Interrogation_Position = 2311; Antisense;
    GAGAGAAAGAGAGTTGGCCAGCACA
    >probe:HG-U133A:215387_x_at:269:65; Interrogation_Position = 2318; Antisense;
    AGAGAGTTGGCCAGCACAGTGCCTC
    >probe:HG-U133A:215387_x_at:130:701; Interrogation_Position = 2362; Antisense;
    TTGGGATGCTGAGGTGGGCGGATCA
    Probes sequences for 222358_x_at:
    >probe:HG-U133A:222358_x_at:87:687; Interrogation_Position = 15; Antisense;
    TTAGTGGCTGGACATGGTGGCTCAT
    >probe:HG-U133A:222358_x_at:596:201; Interrogation_Position = 37; Antisense;
    CATGCCTCTAATTCCAGCACATTGT
    >probe:HG-U133A:222358_x_at:55:407; Interrogation_Position = 75; Antisense;
    GAGGATAGCTTTAACCCAGGAGTTC
    >probe:HG-U133A:222358_x_at:430:411; Interrogation_Position = 237; Antisense;
    GAGGCTGCAGTCAGCTGTGTTCACA
    >probe:HG-U133A:222358_x_at:209:425; Interrogation_Position = 361; Antisense;
    GATTTATTCCTTTTTTACCATGTCA
    >probe:HG-U133A:222358_x_at:163:15; Interrogation_Position = 413; Antisense;
    ATATTGCTGCTTTTCCTAGAGGGAG
    >probe:HG-U133A:222358_x_at:111:461; Interrogation_Position = 440; Antisense;
    GTCATGATTACTACTGGATCCACAG
    >probe:HG-U133A:222358_x_at:88:383; Interrogation_Position = 482; Antisense;
    GACGCGGCCGCGAATTTAGTAGTAG
    >probe:HG-U133A:222358_x_at:491:447; Interrogation_Position = 500; Antisense;
    GTAGTAGTAGGCGGCCGCTCTAGAG
    >probe:HG-U133A:222358_x_at:632:61; Interrogation_Position = 521; Antisense;
    AGAGGATCCAAGCTTACGTACGCGT
    >probe:HG-U133A:222358_x_at:65:649; Interrogation_Position = 535; Antisense;
    TACGTACGCGTGCATGCGACGTCAT
    Probes sequences for 218735_s_at:
    >probe:HG-U133A:218735_s_at:165:141; Interrogation_Position = 3062; Antisense;
    AAGCTTCCAGTTATGATACTTTCCT
    >probe:HG-U133A:218735_s_at:19:35; Interrogation_Position = 3074; Antisense;
    ATGATACTTTCCTTATTCAACATGA
    >probe:HG-U133A:218735_s_at:59:145; Interrogation_Position = 3133; Antisense;
    AAGATGGAGCCGGGTGCAGTTGCCA
    >probe:HG-U133A:218735_s_at:619:21; Interrogation_Position = 3382; Antisense;
    ATCACACCGCTGCCTTGTGAGAGAT
    >probe:HG-U133A:218735_s_at:155:413; Interrogation_Position = 3435; Antisense;
    GATGGGAAATTTCCCTGCCAGACCT
    >probe:HG-U133A:218735_s_at:527:195; Interrogation_Position = 3474; Antisense;
    CAGGTGGCCAAAACATGACTCTCAG
    >probe:HG-U133A:218735_s_at:593:223; Interrogation_Position = 3492; Antisense;
    CTCTCAGAGTGGGGCTTCATGACCA
    >probe:HG-U133A:218735_s_at:225:487; Interrogation_Position = 3503; Antisense;
    GGGCTTCATGACCATGTGCATCAGA
    >probe:HG-U133A:218735_s_at:160:183; Interrogation_Position = 3524; Antisense;
    CAGAATTGCCTGGAGTGTGCACTGA
    >probe:HG-U133A:218735_s_at:364:135; Interrogation_Position = 3549; Antisense;
    AACTGTGTATTACCAAGCTCACTCT
    >probe:HG-U133A:218735_s_at:583:655; Interrogation_Position = 3559; Antisense;
    TACCAAGCTCACTCTAGCCAACTAA
    Probes sequences for 200078_s_at:
    >probe:HG-U133A:200078_s_at:545:285; Interrogation_Position = 453; Antisense;
    CCAAGGCCATCGGCCATCGGAACTA
    >probe:HG-U133A:200078_s_at:202:25; Interrogation_Position = 468; Antisense;
    ATCGGAACTACCATGCAGGCTACTC
    >probe:HG-U133A:200078_s_at:360:591; Interrogation_Position = 481; Antisense;
    TGCAGGCTACTCCATGTTTGGGGCT
    >probe:HG-U133A:200078_s_at:442:463; Interrogation_Position = 523; Antisense;
    GTCTAACCTCTTCTGTGGAGTCTGC
    >probe:HG-U133A:200078_s_at:21:55; Interrogation_Position = 541; Antisense;
    AGTCTGCGTGGGCATCGTGGGCAGT
    >probe:HG-U133A:200078_s_at:205:203; Interrogation_Position = 616; Antisense;
    CATCGTGGAGATCTTTGGCAGCGCC
    >probe:HG-U133A:200078_s_at:350:461; Interrogation_Position = 656; Antisense;
    GTCATCGTCGCAATTCTTCAGACCT
    >probe:HG-U133A:200078_s_at:429:255; Interrogation_Position = 822; Antisense;
    CCCTCTTGCTGCGTGTTGATTTGGA
    >probe:HG-U133A:200078_s_at:160:427; Interrogation_Position = 839; Antisense;
    GATTTGGAGGCACTGCAGTCCAGGC
    >probe:HG-U133A:200078_s_at:701:593; Interrogation_Position = 892; Antisense;
    TGCTGCTGACTCTGTGCAGCTGCGC
    >probe:HG-U133A:200078_s_at:376:639; Interrogation_Position = 940; Antisense;
    TCAACCCATCTTCCTAGTGTTTGTG
    Probes sequences for 205010_at:
    >probe:HG-U133A:205010_at:243:311; Interrogation_Position = 1769; Antisense;
    GCTCTGGCATCTGCCCTGAAAAATA
    >probe:HG-U133A:205010_at:201:365; Interrogation_Position = 1867; Antisense;
    GAAACGTGCAGATAAAATCGCCAGC
    >probe:HG-U133A:205010_at:200:61; Interrogation_Position = 1816; Antisense;
    AGATAAAATCGCCAGCAAGCTGTCT
    >probe:HG-U133A:205010_at:96:29; Interrogation_Position = 1823; Antisense;
    ATCGCCAGCAAGCTGTCTGATTCCA
    >probe:HG-U133A:205010_at:90:413; Interrogation_Position = 1883; Antisense;
    GATGGTGTTGGTGACTAATCGACTG
    >probe:HG-U133A:205010_at:680:503; Interrogation_Position = 1886; Antisense;
    GGTGTTGGTGACTAATCGACTGATC
    >probe:HG-U133A:205010_at:127:475; Interrogation_Position = 1893; Antisense;
    GTGACTAATCGACTGATCTCACTTC
    >probe:HG-U133A:205010_at:709:283; Interrogation_Position = 1927; Antisense;
    CCAAGCACCAGTTCCGGTGGTACGG
    >probe:HG-U133A:205010_at:523:451; Interrogation_Position = 1946; Antisense;
    GTACGGGGGAATACCAGTGAAATAG
    >probe:HG-U133A:205010_at:678:141; Interrogation_Position = 1985; Antisense;
    AAGCATCTGCATATTGAAAGAACGC
    >probe:HG-U133A:205010_at:455:3; Interrogation_Position = 1997; Antisense;
    ATTGAAAGAACGCTTTCCCCACTGT
    Probes sequences for 203455_s_at:
    >probe:HG-U133A:203455_s_at:456:579; Interrogation_Position = 839; Antisense;
    TGAGGTCTGTTTAAAGTGGCAATCT
    >probe:HG-U133A:203455_s_at:276:485; Interrogation_Position = 854; Antisense;
    GTGGCAATCTCAGATGCAGTTTGGA
    >probe:HG-U133A:203455_s_at:350:63; Interrogation_Position = 878; Antisense;
    AGAGTCAGATCTTTCTCCTTGAATA
    >probe:HG-U133A:203455_s_at:530:421; Interrogation_Position = 885; Antisense;
    GATCTTTCTCCTTGAATATCTTTCG
    >probe:HG-U133A:203455_s_at:482:677; Interrogation_Position = 889; Antisense;
    TTTCTCCTTGAATATCTTTCGATAA
    >probe:HG-U133A:203455_s_at:426:503; Interrogation_Position = 921; Antisense;
    GGTGGTGTGATCTTAATATATTTGA
    >probe:HG-U133A:203455_s_at:230:125; Interrogation_Position = 946; Antisense;
    AAAAAACTTCATTCTCGTGAGTCAT
    >probe:HG-U133A:203455_s_at:391:7; Interrogation_Position = 956; Antisense;
    ATTCTCGTGAGTCATTTAAATGTGT
    >probe:HG-U133A:203455_s_at:550:571; Interrogation_Position = 978; Antisense;
    TGTACAATGTACACACTGGTACTTA
    >probe:HG-U133A:203455_s_at:417:111; Interrogation_Position = 988; Antisense;
    ACACACTGGTACTTAGAGTTTCTGT
    >probe:HG-U133A:203455_s_at:287:453; Interrogation_Position = 996; Antisense;
    GTACTTAGAGTTTCTGTTTGATTCT
    Probes sequences for 1316_at:
    >probe:HG-U133A:1316_at:17:461; Interrogation_Position = 1951; Antisense;
    GTGTTCTGCAGTTCCCAGGACCCCA
    >probe:HG-U133A:1316_at:133:345; Interrogation_Position = 2017; Antisense;
    GACAAGCCACCTTGACCGTAGGGAA
    >probe:HG-U133A:1316_at:273:261; Interrogation_Position = 2080; Antisense;
    CCCCTACACACACATGAGAGAGAGC
    >probe:HG-U133A:1316_at:302:219; Interrogation_Position = 2083: Antisense;
    CTACACACACATGAGAGAGAGCCCC
    >probe:HG-U133A:1316_at:537:279; Interrogation_Position = 2107; Antisense;
    CCACCCAGTTCCTTGGCCTAGGTCT
    >probe:HG-U133A:1316_at:443:61; Interrogation_Position = 2164; Antisense;
    AGATGCCTGGGTGCAAAGAACGGCT
    >probe:HG-U133A:1316_at:320:595; Interrogation_Position = 2167; Antisense;
    TGCCTGGGTGCAAAGAACGGCTTGG
    >probe:HG-U133A:1316_at:41:337; Interrogation_Position = 2176; Antisense;
    GCAAAGAACGGCTTGGCTTGGCTCC
    >probe:HG-U133A:1316_at:125:709; Interrogation_Position = 2188; Antisense;
    TTGGCTTGGCTCCTCCTCTGGAGTT
    >probe:HG-U133A:1316_at:363:321; Interrogation_Position = 2191; Antisense;
    GCTTGGCTCCTCCTCTGGAGTTAAA
    >probe:HG-U133A:1316_at:167:163; Interrogation_Position = 2215; Antisense;
    AATTTATAGTCATTCTAACTGCACT
    >probe:HG-U133A:1316_at:484:661; Interrogation_Position = 2221; Antisense;
    TAGTCATTCTAACTGCACTTTGGAA
    >probe:HG-U133A:1316_at:349:627; Interrogation_Position = 2224; Antisense;
    TCATTCTAACTGCACTTTGGAAACC
    >probe:HG-U133A:1316_at:395:691; Interrogation_Position = 2227; Antisense;
    TTCTAACTGCACTTTGGAAACCAAG
    >probe:HG-U133A:1316_at:68:231; Interrogation_Position = 2233; Antisense;
    CTGCACTTTGGAAACCAAGCAAGGG
    >probe:HG-U133A:1316_at:596:101; Interrogation_Position = 2236; Antisense;
    CACTTTGGAAACCAAGCAAGGGGAG
    Probes sequences for 212227_x_at;
    >probe:HG-U133A:212227_x_at:689:73; Interrogation_Position = 1621; Antisense;
    AGCAATACCGTCATGTTTCAGCCAA
    >probe:HG-U133A:212227_x_at:222:17l; Interrogation_Position = 1643; Antisense;
    CAAGCCCAGAGCCCTAAGATTACAA
    >probe:HG-U133A:212227_x_at:553:189; Interrogation_Position = 1704; Antisense;
    CAGAGTTCCCTACCCTAAGAGAATG
    >probe:HG-U133A:212227_x_at:676:427; Interrogation_Position = 1728; Antisense;
    GTTACCACCTGAACAGTCCTCGGTG
    >probe:HG-U133A:212227_x_at:405:535; Interrogation_Position = 1778; Antisense;
    GGCAGAAGCACCAGCTGTACTACTA
    >probe:HG-U133A:212227_x_at:650:503; Interrogation_Position = 1831; Antisense;
    GGTGTCTCCAACCTGACTAGGTGGA
    >probe:HG-U133A:212227_x_at:610:477; Interrogation_Position = 1891; Antisense;
    GTGATAGGACATCTGGCTTGCCACA
    >probe:HG-U133A:212227_x_at:295:321; Interrogation_Position = 1906; Antisense;
    GCTTGCCACAAAGGTCTGTTCGACC
    >probe:HG-U133A:212227_x_at:544:435; Interrogation_Position = 1923; Antisense;
    GTTCGACCAGACATATCCTAGCTAA
    >probe:HG-U133A:212227_x_at:107:347; Interrogation_Position = 1965; Antisense;
    GAATGTGAGGCCAACCTTCTATCAG
    >probe:HG-U133A:212227_x_at:316:187; Interrogation_Position = 2073; Antisense;
    CAGCTGCCCAATGCCATGTGAAGTA
    Probes sequences for 216187_x_at:
    >probe:HG-U133A:216187_x_at:432:521; Interrogation_Position = 1606; Antisense;
    GGAGTTCAAGATCAACCAACATAGC
    >probe:HG-U133A:216187_x_at:174:169; Interrogation_Position = 1618; Antisense;
    CAACCAACATAGCAAGACCCTGTCT
    >probe:HG-U133A:216187_x_at:139:643; Interrogation_Position = 1828; Antisense;
    TAAGCTACCTAAAGTGTTATGAGAA
    >probe:HG-U133A:216187_x_at:12:149; Interrogation_Position = 1856; Antisense;
    AATGAGGCCGGACATGGTGGCTCAC
    >probe:HG-U133A:216187_x_at:544:157; Interrogation_Position = 1887; Antisense;
    AATCCCAACACTTTGCGAGGCTAAG
    >probe:HG-U133A:216187_x_at:326:595; Interrogation_Position = 1900; Antisense;
    TGCGAGGCTAAGGCGGGTGGGTCAC
    >probe:HG-U133A:216187_x_at:191:387; Interrogation_Position = 1943; Antisense;
    GACCAGCCTGACCAAGAATGATAAG
    >probe:HG-U133A:216187_x_at:113:345; Interrogation_Position = 1958; Antisense;
    GAATGATAAGACCCTGTCTCTACTA
    >probe:HG-U133A:216187_x_at:474:283; Interrogation_Position = 2028; Antisense;
    CCAACTACTAGGGAGGCTGAGGCAG
    >probe:HG-U133A:216187_x_at:704:191; Interrogation_Position = 2076; Antisense;
    CAGAGGTTGCAATGAGCCAAGATGA
    >probe:HG-U133A:216187_x_at:370:59; Interrogation_Position = 2095; Antisense;
    AGATGATGCCATTGCATGCCAGCCC
    Probes sequences for 208678_at:
    >probe:HG-U133A:208678_at:434:415; Interrogation_Position = 719; Antisense;
    GATGATGCCAGAAGTCCGGGGAGCC
    >probe:HG-U133A:208678_at:614:691; Interrogation_Position = 786; Antisense;
    TTCAGGAGGTGGAGCTCGTCGTCAG
    >probe:HG-U133A:208678_at:682:129; Interrogation_Position = 850; Antisense;
    AACACGAATGTCTCTGTAGCTTCCT
    >probe:HG-U133A:208678_at:320:231; Interrogation_Position = 880; Antisense;
    CTGCCCCAGTATTGCTCTGTATTTA
    >probe:HG-U133A:208678_at:144:619; Interrogation_Position = 895; Antisense;
    TCTGTATTTATCAGCGATGCCCCTC
    >probe:HG-U133A:208678_at:288:147; Interrogation_Position = 1002; Antisense;
    AAGTGTGACAGTGTCCAGCCGGTTC
    >probe:HG-U133A:208678_at:168:99; Interrogation_Position = 1080; Antisense;
    ACTCTCCGTCTGGCCTAAAGGTGAT
    >probe:HG-U133A:208678_at:419:573; Interrogation_Position = 1104; Antisense;
    TGTATTTGGTGTTTGGCCCTGCAGT
    >probe:HG-U133A:208678_at:603:609; Interrogation_Position = 1128; Antisense;
    TCCCCACTCTTGAGGCTTAAGGCGC
    >probe:HG-U133A:208678_at:150:689; Interrogation_Position = 1144; Antisense;
    TTAAGGCGCATGTGGCACACCACTC
    >probe:HG-U133A:208678_at:131:465; Interrogation_Position = 1181; Antisense;
    GTCGCTTTACTGTTACCTGTTTAGG
    Probes sequences for 222310_at:
    >probe:HG-U133A:222310_at:220:113; Interrogation_Position = 248; Antisense;
    ACAAGGTATATTTCTCTCACATAAC
    >probe:HG-U133A:222310_at:290:183; Interrogation_Position = 279; Antisense;
    CACCATTTATATCACTCAACTTGAG
    >probe:HG-U133A:222310_at:638:683; Interrogation_Position = 304; Antisense;
    TTATGGAAATTATCACTTCTGTATC
    >probe:HG-U133A:222310_at:149:699; Interrogation_Position = 331; Antisense;
    TTCTGTGGATTGTTCATTATGTCGT
    >probe:HG-U133A:222310_at:228:431; Interrogation_Position = 338; Antisense;
    GATTGTTCATTATGTCGTTTTGTAA
    >probe:HG-U133A:222310_at:22:453; Interrogation_Position = 359; Antisense;
    GTAATTTGAGAGATTTTCCCCCTCA
    >probe:HG-U133A:222310_at:624:361; Interrogation_Position = 388; Antisense;
    GAAAACATCGATTATTTTCCCTGGT
    >probe:HG-U133A:222310_at:617:565; Interrogation_Position = 419; Antisense;
    TGTGATATGTGCTCAGTGCAAAAAT
    >probe:HG-U133A:222310_at:625:37; Interrogation_Position = 425; Antisense;
    ATGTGCTCAGTGCAAAAATTTCCAG
    >probe:HG-U133A:222310_at:266:169; Interrogation_Position = 436; Antisense;
    GCAAAAATTTCCAGGGTTTGAAGCT
    >probe:HG-U133A:222310_at:252:1; Interrogation_Position = 448; Antisense;
    AGGGTTTGAAGCTGAATTTACTAGT
    Probes sequences for 210434_x_at:
    >probe:HG-U133A:210434_x_at:506:41; Interrogation_Position = 579; Antisense;
    ATGGCCTGGGGCTGGCATTTATCTT
    >probe:HG-U133A:210434_x_at:281:685; Interrogation_Position = 597; Antisense;
    TTATCTTTCCTTTCAGCAAGCACCT
    >probe:HG-U133A:210434_x_at:552:333; Interrogation_Position = 616; Antisense;
    GCACCTCAAATTTGCCATGCTGGCT
    >probe:HG-U133A:210434_x_at:392:145; Interrogation_Position = 664; Antisense;
    AAGAGTGCTCTCCATGCTCTAATTT
    >probe:HG-U133A:210434_x_at:656:653; Interrogation_Position = 701; Antisense;
    TACCCCTGAGTGTGGTCCCACAGGA
    >probe:HG-U133A:210434_x_at:142:231; Interrogation_Position = 776; Antisense;
    CTGCCGCTCAGCTTTGATGGAACAA
    >probe:HG-U133A:210434_x_at:169:349; Interrogation_Position = 819; Antisense;
    GAAGGGGCTGTCGTGTGTGTGGCCC
    >probe:HG-U133A:210434_x_at:73:707; Interrogation_Position = 854; Antisense;
    TTGTCTTGTCATCATTCGTCAGCGA
    >probe:HG-U133A:210434_x_at:501:615; Interrogation_Position = 919; Antisense;
    TCGAGTCCATATAGCTACATTCCAC
    >probe:HG-U133A:210434_x_at:269:283; Interrogation_Position = 940; Antisense;
    CCACCCTTGTATCCTGGGTCTTAGA
    >probe:HG-U133A:210434_x_at:68:577; Interrogation_Position = 998; Antisense;
    TGATTTGCACTCTTGGTTCTTTGGA
    Probes sequences for 222282_at:
    >probe:HG-U133A:222282_at:328:433; Interrogation_Position = 324; Antisense;
    GTTGACAGACTCTTGTTTTCTGGAG
    >probe:HG-U133A:222282_at:81:557; Interrogation_Position = 344; Antisense;
    TGGAGGTTAACCTTGTCTCTGGTCC
    >probe:HG-U133A:222282_at:561:707; Interrogation_Position = 421; Antisense;
    TTGTCCACAATCTGTTATGAACCCT
    >probe:HG-U133A:222282_at:438:661; Interrogation_Position = 436; Antisense;
    TATGAACCCTGGACTGGTGAGCCTT
    >probe:HG-U133A:222282_at:449:475; Interrogation_Position = 452; Antisense;
    GTGAGCCTTGTTATCTGTGGCCAAC
    >probe:HG-U133A:222282_at:167:27; Interrogation_Position = 464; Antisense;
    ATCTGTGGCCAACCTAGGGAATTTT
    >probe:HG-U133A:222282_at:106:575; Interrogation_Position = 570; Antisense;
    TGTTGTCTCTCATAGTTTTTCCAGC
    >probe:HG-U133A:222282_at:139:269; Interrogation_Position = 595; Antisense;
    CCTCCTAGTCACAATCCCCAGAGGC
    >probe:HG-U133A:222282_at:302:29; Interrogation_Position = 608; Antisense;
    ATCCCCAGAGGCAGCTACTTCAATT
    >probe:HG-U133A:222282_at:123:211; Interrogation_Position = 625; Antisense;
    CTTCAATTCTTCTGGCTACCTAGTT
    >probe:HG-U133A:222282_at:704:539; Interrogation_Position = 638; Antisense;
    GGCTACCTAGTTTTACTTTACTCTG
    Probes sequences for 219678_x_at:
    >probe:HG-U133A:219678_x_at:293:579; Interrogation_Position = 2054; Antisense;
    TGAGAAGCTGGCAACTGGTGAGAGT
    >probe:HG-U133A:219678_x_at:30:363; Interrogation_Position = 2095; Antisense;
    GAAAATGCTCACTCTTAGATACCTA
    >probe:HG-U133A:219678_x_at:113:141; Interrogation_Position = 2128; Antisense;
    AAGCGTTTCAACCTAGAGCAACCAC
    >probe:HG-U133A:219678_x_at:385:169; Interrogation_Position = 2146; Antisense;
    CAACCACTAAAAAACCTGCACAGAG
    >probe:HG-U133A:219678_x_at:381:645; Interrogation_Position = 2222; Antisense;
    TAACCTGGTTGGGTGTGGTGGCTCA
    >probe:HG-U133A:219678_x_at:473:281; Interrogation_Position = 2258; Antisense;
    CCAGCACTTTGAGGTGGGCAATGGC
    >probe:HG-U133A:219678_x_at:399:501; Interrogation_Position = 2270; Antisense;
    GGTGGGCAATGGCTTGAGCCCAGGA
    >probe:HG-U133A:219678_x_at:200:107; Interrogation_Position = 2317; Antisense;
    ACAGTGAAATGTGTCTCTACTTACA
    >probe:HG-U133A:219678_x_at:598:39; Interrogation_Position = 2369; Antisense;
    ATGGTGGTGGGCTACTCTGGAGGCC
    >probe:HG-U133A:219678_x_at:614:283; Interrogation_Position = 2447; Antisense;
    CCACTGCACCTCTAGTCTGGGTGAC
    >probe:HG-U133A:219678_x_at:455:389; Interrogation_Position = 2592; Antisense;
    GAGCTGAGATTGTGTCATTGCACTC
    Probes sequences for 208268_at:
    >probe:HG-U133A:208268_at:192:677; Interrogation_Position = 1635; Antisense;
    TTTCCCCTGTGCATGTGCGAAGGAA
    >probe:HG-U133A:208268_at:469:415; Interrogation_Position = 1670; Antisense;
    GATGACAGTGTTTAACCATGGTCAA
    >probe:HG-U133A:208268_at:676:605; Interrogation_Position = 1707; Antisense;
    TCCTATCCTTCTTAGAAGCTTCGAA
    >probe:HG-U133A:208268_at:571:353; Interrogation_Position = 1721; Antisense;
    GAAGCTTCGAACTCAAAATCATGGA
    >probe:HG-U133A:208268_at:207:707; Interrogation_Position = 1855; Antisense;
    TTGTTTTTTGTCTCAGCATCAGTAT
    >probe:HG-U133A:208268_at:401:703; Interrogation_Position = 1862; Antisense;
    TTGTCTCAGCATCAGTATATCCCAT
    >probe:HG-U133A:208268_at:225:203; Interrogation_Position = 1871; Antisense;
    CATCAGTATATCCCATGCAATATTT
    >probe:HG-U133A:208268_at:506:407; Interrogation_Position = 1896; Antisense;
    GAGGTGTGCTCATACTAAAATTATT
    >probe:HG-U133A:208268_at:545:337; Interrogation_Position = 1970; Antisense;
    GCAACCCTACTAAGATCATAAACCC
    >probe:HG-U133A:208268_at:285:259; Interrogation_Position = 1975; Antisense;
    CCTACTAAGATCATAAACCCTTGGA
    >probe:HG-U133A:208268_at:289:137; Interrogation_Position = 1990; Antisense;
    AACCCTTGGAAATCTGTGTGTGTGC
    Probes sequences for 220242_x_at:
    >probe:HG-U133A:220242_x_at:231:639; Interrogation_Position = 2279; Antisense;
    TAATAGTTACAAATGCGGTGAGCAC
    >probe:HG-U133A:220242_x_at:160:243; Interrogation_Position = 2294; Antisense;
    CGGTGAGCACAGCAAACCATCAAGG
    >probe:HG-U133A:220242_x_at:83:65; Interrogation_Position = 2412; Antisense;
    AGAGGATTGGGCCAGGCGTGTGGCT
    >probe:HG-U133A:220242_x_at:629:279; Interrogation_Position = 2450; Antisense;
    CCAGCACTTTAGGAGCCCAAGGTGG
    >probe:HG-U133A:220242_x_at:113:631; Interrogation_Position = 2490; Antisense;
    TCATGAGTTTGAGATCAGCCTGGCC
    >probe:HG-U133A:220242_x_at:482:83; Interrogation_Position = 2506; Antisense;
    AGCCTGGCCAACAGACGTGAGTCAC
    >probe:HG-U133A:220242_x_at:685:105; Interrogation_Position = 2516; Antisense;
    ACAGACGTGAGTCACTTTTCCCAGC
    >probe:HG-U133A:220242_x_at:143:185; Interrogation_Position = 2537; Antisense;
    CAGCCTGCTTTTTGTTTCTTTAACA
    >probe:HG-U133A:220242_x_at:161:345; Interrogation_Position = 2595; Antisense;
    GAATCTGAATCACATTGGCTTATAT
    >probe:HG-U133A:220242_x_at:589:671; Interrogation_Position = 2636; Antisense;
    TTTCCAAACCATCAATGTGGGTTGT
    >probe:HG-U133A:220242_x_at:139:475; Interrogation_Position = 2709; Antisense;
    GTGAAATTCTCAGTTTTTTTATGTT
    Probes sequences for 207287_at:
    >probe:HG-U133A:207287_at:188:545; Interrogation_Position = 1250; Antisense;
    GGCTGTCAGATGGCCTTGAGCGGCA
    >probe:HG-U133A:207287_at:217:525; Interrogation_Position = 1271; Antisense;
    GGCACCAAGTAGAAAACGCGCTCCC
    >probe:HG-U133A:207287_at:302:211; Interrogation_Position = 1305; Antisense;
    CTTCTCCTCAGCTTCATTGTGAGAC
    >probe:HG-U133A:207287_at:380:567; Interrogation_Position = 1322; Antisense;
    TGTGAGACCTCAAGTTCCTCAGCTT
    >probe:HG-U133A:207287_at:280:321; Interrogation_Position = 1343; Antisense;
    GCTTCCAGGATGATCAACCTAGCTG
    >probe:HG-U133A:207287_at:587:251; Interrogation_Position = 1406; Antisense;
    CCCAGCCAGGGAGACCAGGTGTTGT
    >probe:HG-U133A:207287_at:649:481; Interrogation_Position = 1547; Antisense;
    GTGTGTTCTGGATCTCTAGAGGGGT
    >probe:HG-U133A:207287_at:641:415; Interrogation_Position = 1557; Antisense;
    GATCTCTAGAGGGGTTTGGTTTGGG
    >probe:HG-U133A:207287_at:708:509; Interrogation_Position = 1574; Antisense;
    GGTTTGGGCCAAGTAGTGCTTAGTT
    >probe:HG-U133A:207287_at:75:595; Interrogation_Position = 1646; Antisense;
    TGCTCAGCTGTCATATCCTGCAAGG
    >probe:HG-U133A:207287_at:57:63; Interrogation_Position = 1673; Antisense;
    AGAGGAAAGATGTGGGCCGTGCGCG
    Probes sequences for 221899_at:
    >probe:HG-U133A:221899_at:618:667; Interrogation_Position = 158; Antisense;
    TATTTTCAGCTTAGGTTTTTACCAG
    >probe:HG-U133A:221899_at:701:233; Interrogation_Position = 183; Antisense;
    CTGAATGACTGTAGGCAAGCCGTTT
    >probe:HG-U133A:221899_at:88:319; Interrogation_Position = 213; Antisense;
    GCTTTGAGACCTAATTTCTCCATCT
    >probe:HG-U133A:221899_at:439:697; Interrogation_Position = 228; Antisense;
    TTCTCCATCTGTTAGAGGTGGTGAT
    >probe:HG-U133A:221899_at:659:639; Interrogation_Position = 252; Antisense;
    TAATTATACTTGCCCTACCTGATCA
    >probe:HG-U133A:221899_at:517:617; Interrogation_Position = 424; Antisense;
    TCTGTACCATTGTTATGTAGTCCCA
    >probe:HG-U133A:221899_at:418:157; Interrogation_Position = 468; Antisense;
    AATGCTAATGTTTTCCAAGGCTTAA
    >probe:HG-U133A:221899_at:98:683; Interrogation_Position = 518; Antisense;
    TTTTCTCATTGATATTGCTCTACAA
    >probe:HG-U133A:221899_at:242:79; Interrogation_Position = 566; Antisense;
    AGCGGACATGGGTCAATTTCTGTTT
    >probe:HG-U133A:221899_at:443:319; Interrogation_Position = 604; Antisense;
    GTTCTTAACCTGGGCTACACTTTTG
    >probe:HG-U133A:221899_at:481:485; Interrogation_Position = 615; Antisense;
    GGGCTACACTTTTGAATTTCTTGGG
    Probes sequences for 213721_at:
    >probe:HG-U133A:213721_at:578:333; Interrogation_Position = 942; Antisense;
    GCACATGAACGGCTGGAGCAACGGC
    >probe:HG-U133A:213721_at:267:241; Interrogation_Position = 954; Antisense;
    CTGGAGCAACGGCAGCTACAGCATG
    >probe:HG-U133A:213721_at:410:315; Interrogation_Position = 968; Antisense;
    GCTACAGCATGATGCAGGACCAGCT
    >probe:HG-U133A:213721_at:678:265; Interrogation_Position = 1071; Antisense;
    CCTGCAGTACAACTCCATGACCAGC
    >probe:HG-U133A:213721_at:303:281; Interrogation_Position = 1091; Antisense;
    CCAGCTCGCAGACCTACATGAACGG
    >probe:HG-U133A:213721_at:408:313; Interrogation_Position = 1172; Antisense;
    GCTCCATGGGTTCGGTGGTCAAGTC
    >probe:HG-U133A:213721_at:293:695; Interrogation_Position = 1182; Antisense;
    TTCGGTGGTCAAGTCCGAGGCCAGC
    >probe:HG-U133A:213721_at:535:231; Interrogation_Position = 1395; Antisense;
    CTGCCCCTCTCACACATGTGAGGGC
    >probe:HG-U133A:213721_at:51:177; Interrogation_Position = 1407; Antisense;
    CACATGTGAGGGCCGGACAGCGAAC
    >probe:HG-U133A:213721_at:272:297; Interrogation_Position = 1418; Antisense;
    GCCGGACAGCGAACTGGAGGGGGGA
    >probe:HG-U133A:213721_at:525:321; Interrogation_Position = 1503; Antisense;
    GCATGGAGAAAACCCGGTACGCTCA
    Probes sequences for 214718_at:
    >probe:HG-U133A:214718_at:239:307; Interrogation_Position = 1170; Antisense;
    GCTGGGCAATGGAGTCAGATTCTCT
    >probe:HG-U133A:214718_at:245:515; Interrogation_Position = 1180; Antisense;
    GGAGTCAGATTCTCTTTCTTAAAAA
    >probe:HG-U133A:214718_at:54:97; Interrogation_Position = 1216; Antisense;
    ACTGGATTTCCAGTTCTCTAATATT
    >probe:HG-U133A:214718_at:24:437; Interrogation_Position = 1228; Antisense;
    GTTCTCTAATATTCTTAGTACCACA
    >probe:HG-U133A:214718_at:175:683; Interrogation_Position = 1242; Antisense;
    TTAGTACCACAAGATATGTCATAGG
    >probe:HG-U133A:214718_at:229:369; Interrogation_Position = 1278; Antisense;
    GAAATTCTTAGCTGGAAAAGTGACT
    >probe:HG-U133A:214718_at:257:681; Interrogation_Position = 1309; Antisense;
    TTTTTCTCCTGCTACCTAGTAATAA
    >probe:HG-U133A:214718_at:237:621; Interrogation_Position = 1313; Antisense;
    TCTCCTGCTACCTAGTAATAAACAA
    >probe:HG-U133A:214718_at:630:441; Interrogation_Position = 1344; Antisense;
    GTTTATTACTGGTCACTTAGAAAAT
    >probe:HG-U133A:214718_at:682:89; Interrogation_Position = 1443; Antisense;
    ACCTGAGGTCGGGAAGTGGATCGCC
    >probe:HG-U133A:214718_at:694:47; Interrogation_Position = 1448; Antisense;
    AGGTCGGGAAGTGGATCGCCTGAGG
    Probes sequences for 207953_at:
    >probe:HG-U133A:207953_at:353:479; Interrogation_Position = 869; Antisense;
    GTGTAGTGGTGCATGTCTTTGGTCC
    >probe:HG-U133A:207953_at:65:571; Interrogation_Position = 870; Antisense;
    TGTAGTGGTGCATGTCTTTGGTCCC
    >probe:HG-U133A:207953_at:613:99; Interrogation_Position = 974; Antisense;
    ACTTCATCCTGGGTAAGAGTGGGAC
    >probe:HG-U133A:207953_at:275:181; Interrogation_Position = 978; Antisense;
    CATCCTGGGTAAGAGTGGGACACCT
    >probe:HG-U133A:207953_at:398:265; Interrogation_Position = 981; Antisense;
    CCTGGGTAAGAGTGGGACACCTGTG
    >probe:HG-U133A:207953_at:173:403; Interrogation_Position = 990; Antisense;
    GAGTGGGACACCTGTGTCTTAAAAA
    >probe:HG-U133A:207953_at:370:387; Interrogation_Position = 1103; Antisense;
    GACCAGCTTGGGCACATCAGTGAAA
    >probe:HG-U133A:207953_at:243:187; Interrogation_Position = 1106; Antisense;
    CAGCTTGGGCACATCAGTGAAACCC
    >probe:HG-U133A:207953_at:521:77; Interrogation_Position = 1107; Antisense;
    AGCTTGGGCACATCAGTGAAACCCT
    >probe:HG-U133A:207953_at:112:687; Interrogation_Position = 1148; Antisense;
    TTAGCCAGGCAGAGCCGGGCATGGT
    >probe:HG-U133A:207953_at:622:279; Interrogation_Position = 1175; Antisense;
    CCAGTCCTGTATCTAGCACTTTGGG
    Probes sequences for 201608_s_at:
    >probe:HG-U133A:201608_s_at:302:159; Interrogation_Position = 1287; Antisense;
    AATCAAGGGCTGTCTCGTGACTGCT
    >probe:HG-U133A:201608_s_at:706:619; Interrogation_Position = 1299; Antisense;
    TCTCGTGACTGCTTCAGCTGACAAA
    >probe:HG-U133A:201608_s_at:170:631; Interrogation_Position = 1408; Antisense;
    TCATGTTGCCCTGATTTGCCATTTA
    >probe:HG-U133A:201608_s_at:555:61; Interrogation_Position = 1455; Antisense;
    AGAAGGGCTTCGGGTCTGGGATATA
    >probe:HG-U133A:201608_s_at:685:397; Interrogation_Position = 1517; Antisense;
    GAGAGAGGCTTGTTCTTGGGAGTGC
    >probe:HG-U133A:201608_s_at:676:627; Interrogation_Position = 1549; Antisense;
    TCATCTATTAGTGGCCCTTTTGGCA
    >probe:HG-U133A:201608_s_at:10:189; Interrogation_Position = 1572; Antisense;
    CAGCAGGAGCTCAGATACACCCATG
    >probe:HG-U133A:201608_s_at:231:23; Interrogation_Position = 1614; Antisense;
    ATCTAATTTCCTGCTTACCTTAACT
    >probe:HG-U133A:201608_s_at:278:35; Interrogation_Position = 1665; Antisense;
    ATGTTCCATGCGTGGCAGCAACCAT
    >probe:HG-U133A:201608_s_at:697:479; Interrogation_Position = 1745; Antisense;
    GTGGCACCACAAATATCCGGTCTTT
    >probe:HG-U133A:201608_s_at:235:705; Interrogation_Position = 1768; Antisense;
    TTGTGCTTGCTCTTCAGATGGATGG
    Probes sequences for 209015_s_at:
    >probe:HG-U133A:209015_s_at:294:433; Interrogation_Position = 1944; Antisense;
    GTTGAACTCATGTTTCAGTTCGCGA
    >probe:HG-U133A:209015_s_at:653:189; Interrogation_Position = 1959; Antisense;
    CAGTTCGCGAACATTGACTCCTTAC
    >probe:HG-U133A:209015_s_at:704:377; Interrogation_Position = 1974; Antisense;
    GACTCCTTACGAAAGTCACTTCATT
    >probe:HG-U133A:209015_s_at:25:629; Interrogation_Position = 1994; Antisense;
    TCATTCTAACTAGATGCGCCCACTT
    >probe:HG-U133A:209015_s_at:658:675; Interrogation_Position = 2135; Antisense;
    TTTCCTTCCCTTTCACATGAGGATC
    >probe:HG-U133A:209015_s_at:218:35; Interrogation_Position = 2151; Antisense;
    ATGAGGATCTGCCGTTCATGTTGCT
    >probe:HG-U133A:209015_s_at:70:289; Interrogation_Position = 2236; Antisense;
    GCCAGCAATTTTAATCTAGCAGTGT
    >probe:HG-U133A:209015_s_at:142:533; Interrogation_Position = 2269; Antisense;
    GGAATTTTTTGGCGCAATCCATGTA
    >probe:HG-U133A:209015_s_at:121:337; Interrogation_Position = 2282; Antisense;
    GCAATCCATGTAGCAGTGACCCAGG
    >probe:HG-U133A:209015_s_at:66:583; Interrogation_Position = 2298; Antisense;
    TGACCCAGGCTTGGGAGCCAGAAAC
    >probe:HG-U133A:209015_s_at:309:695; Interrogation_Position = 2417; Antisense;
    TTCTGTGGACAATGTAACCCTAAAC
    Probes sequences for 221759_at:
    >probe:HG-U133A:221759_at:227:185; Interrogation_Position = 917; Antisense;
    CAGCCTAGCCTTCAAGTGGTGTGAG
    >probe:HG-U133A:221759_at:472:503; Interrogation_Position = 934; Antisense;
    GGTGTGAGCGGCCTGAGTGGATACA
    >probe:HG-U133A:221759_at:399:53; Interrogation_Position = 949; Antisense;
    AGTGGATACACGTGGATAGCCGGCC
    >probe:HG-U133A:221759_at:238:599; Interrogation_Position = 977; Antisense;
    TGCCTCCCTGAGCCGTGACTCAGGG
    >probe:HG-U133A:221759_at:497:215; Interrogation_Position = 1040; Antisense;
    CTATGCCCAGGTGCGTCGGGCACAG
    >probe:HG-U133A:221759_at:258:73; Interrogation_Position = 1078; Antisense;
    AGAAGATAGCCTGCCTTGTGCTGGC
    >probe:HG-U133A:221759_at:160;635; Interrogation_Position = 1150; Antisense;
    TCAGCCTCTTCTACATTTTCAATTT
    >probe:HG-U133A:221759_at:694:225; Interrogation_Position = 1212; Antisense;
    CTCGTGCCCTGGGCAGTGCACATGT
    >probe:HG-U133A:221759_at:315:537; Interrogation_Position = 1223; Antisense;
    GGCAGTGCACATGTTCAGTGCCCAG
    >probe:HG-U133A:221759_at:491:175; Interrogation_Position = 1306; Antisense;
    CACAAAGCCAACACTCTGTGACCAC
    >probe:HG-U133A:221759_at:8:299; Interrogation_Position = 1417; Antisense;
    GCCCCAAAGATGGGCCTTCTCTCTC
    Probes sequences for 205684_s_at:
    >probe:HG-U133A:205684_s_at:583:349; Interrogation_Position = 2138; Antisense;
    GAAGTAACTCTTGGGGACAATATAT
    >probe:HG-U133A:205684_s_at:77:321; Interrogation_Position = 2183; Antisense;
    GCATTACCTTGAAATATGAAGTGCC
    >probe:HG-U133A:205684_s_at:394:55; Interrogation_Position = 2202; Antisense;
    AGTGCCATTTGAATGTCCCAGGGCT
    >probe:HG-U133A:205684_s_at:611:291; Interrogation_Position = 2205; Antisense;
    GCCATTTGAATGTCCCAGGGCTTAT
    >probe:HG-U133A:205684_s_at:312:703; Interrogation_Position = 2210; Antisense;
    TTGAATGTCCCAGGGCTTATTAATA
    >probe:HG-U133A:205684_s_at:259:147; Interrogation_Position = 2238; Antisense;
    AAGATTTTCAACCCCTGAACTGCTT
    >probe:HG-U133A:205684_s_at:144:621; Interrogation_Position = 2264; Antisense;
    TCTGCCTCTGTGGAAAACTACTTTG
    >probe:HG-U133A:205684_s_at:350:363; Interrogation_Position = 2276; Antisense;
    GAAAACTACTTTGGGATTCTTCAGT
    >probe:HG-U133A:205684_s_at:350:653; Interrogation_Position = 2282; Antisense;
    TACTTTGGGATTCTTCAGTATTTGT
    >probe:HG-U133A:205684_s_at:671:7; Interrogation_Position = 2337; Antisense;
    ATTCATTCTAGGCATTGTTTATATT
    >probe:HG-U133A:205684_s_at:242:217; Interrogation_Position = 2344; Antisense;
    CTAGGCATTGTTTATATTTGAAGTT
    Probes sequences for 209008_x_at:
    >probe:HG-U133A:209008_x_at:268:583; Interrogation_Position = 957; Antisense;
    TGAACCGGAACATCAGCCGGCTCCA
    >probe:HG-U133A:209008_x_at:504:297; Interrogation_Position = 972; Antisense;
    GCCGGCTCCAGGCTGAGATTGAGGG
    >probe:HG-U133A:209008_x_at:11:393; Interrogation_Position = 1043; Antisense;
    GAGCAGCGTGGAGAGCTGGCCATTA
    >probe:HG-U133A:209008_x_at:165:41; Interrogation_Position = 1068; Antisense;
    AGGATGCCAACGCCAAGTTGTCCGA
    >probe:HG-U133A:209008_x_at:626:487; Interrogation_Position = 1113; Antisense;
    GGGCCAAGCAGGACATGGCGCGGCA
    >probe:HG-U133A:209008_x_at:114:243; Interrogation_Position = 1133; Antisense;
    CGGCAGCTGCGTGAGTACCAGGAGC
    >probe:HG-U133A:209008_x_at:707:613; Interrogation_Position = 1191; Antisense;
    TCGCCACCTACAGGAAGCTGCTGGA
    >probe:HG-U133A:209008_x_at:448:305; Interrogation_Position = 1278; Antisense;
    GCGGCTATGCAGGTGGTCTGAGCTC
    >probe:HG-U133A:209008_x_at:562:605; Interrogation_Position = 1397; Antisense;
    TCCTCCAGGGCCGTGGTTGTGAAGA
    >probe:HG-U133A:209008_x_at:576:563; Interrogation_Position = 1438; Antisense;
    TGGGAAGCTGGTGTCTGAGTCCTCT
    >probe:HG-U133A:209008_x_at:40:303; Interrogation_Position = 1471; Antisense;
    GCCCAAGTGAACAGCTGCGGCAGCC
    Probes sequences for 200825_s_at:
    >probe:HG-U133A:200825_s_at:301:239; Interrogation_Position = 4074; Antisense;
    CTGGGCTGACCAAAATGTGCTTTCT
    >probe:HG-U133A:200825_s_at:334:155; Interrogation_Position = 4087; Antisense;
    AATGTGCTTTCTACTGTGAGTCCCT
    >probe:HG-U133A:200825_s_at:157:467; Interrogation_Position = 4106; Antisense;
    GTCCCTATCCCAAGATCCTGGGGAA
    >probe:HG-U133A:200825_s_at:160:589; Interrogation_Position = 4147; Antisense;
    TGAATGTAGAGATGCCACCTCCCTC
    >probe:HG-U133A:200825_s_at:215:223; Interrogation_Position = 4169; Antisense;
    CTCTCTCTGAGGCAGGCCTGTGGAT
    >probe:HG-U133A:200825_s_at:282:617; Interrogation_Position = 4222; Antisense;
    TCTGTGCATCACTCTGCTAGGTTGG
    >probe:HG-U133A:200825_s_at:459:607; Interrogation_Position = 4285; Antisense;
    TCCTCCAGTATTCCGTCTGTAGCAG
    >probe:HG-U133A:200825_s_at:607:343; Interrogation_Position = 4342; Antisense;
    GAATGAACCTGGCTGTGTCAGTCAT
    >probe:HG-U133A:200825_s_at:566:565; Interrogation_Position = 4355; Antisense;
    TGTGTCAGTCATTTTGTCTTTTCCT
    >probe:HG-U133A:200825_s_at:633:45; Interrogation_Position = 4470; Antisense;
    AGTAGCTGTTGATGCTGGTTGGACA
    >probe:HG-U133A:200825_s_at:253:453; Interrogation_Position = 4510; Antisense;
    GTACTTTGCTCCATTGTTAATTGAG
    Probes sequences for 218160_at:
    >probe:HG-U133A:218160_at:163:597; Interrogation_Position = 160; Antisense;
    TGCGGCCCATCACTATGGAGCTCAA
    >probe:HG-U133A:218160_at:588:165; Interrogation_Position = 199; Antisense;
    CAAGGAATTTATGCTCTGCCGCTGG
    >probe:HG-U133A:218160_at:686:95; Interrogation_Position = 353; Antisense;
    ACTGGCCAGCAGTTATTTCGTCACT
    >probe:HG-U133A:218160_at:527:191; Interrogation_Position = 477; Antisense;
    CAGATCGACCTTTACCGGAGAATCC
    >probe:HG-U133A:218160_at:99:523; Interrogation_Position = 493; Antisense;
    GGAGAATCCCTATCACTCAAGACCA
    >probe:HG-U133A:218160_at:360:77; Interrogation_Position = 507; Antisense;
    ACTCAAGACCAAGACCGGATCCCAG
    >probe:HG-U133A:218160_at:690:419; Interrogation_Position = 538; Antisense;
    GATCGAGGGAGATCTGCAGCCTGCC
    >probe:HG-U133A:218160_at:282:199; Interrogation_Position = 566; Antisense;
    CATGGCAGCCGCTTTTATTTCTGGA
    >probe:HG-U133A:218160_at:384:647; Interrogation_Position = 596; Antisense;
    TAAAGATGGGTCCGTGGCCCACACT
    >probe:HG-U133A:218160_at:317:109; Interrogation_Position = 616; Antisense;
    ACACTCGGTCATGTGCTCAGACAAC
    >probe:HG-U133A:218160_at:62:363; Interrogation_Position = 648; Antisense;
    GAAAACGCCCATGCGGTTTGCATCG
    Probes sequences for 220856_x_at:
    >probe:HG-U133A:220856_x_at:281:647; Interrogation_Position = 861; Antisense;
    TAAAAGCACAGACTTCAGGCCAGGC
    >probe:HG-U133A:220856_x_at:264:693; Interrogation_Position = 874; Antisense;
    TTCAGGCCAGGCACAATGACTCACA
    >probe:HG-U133A:220856_x_at:625:485; Interrogation_Position = 929; Antisense;
    GGGCTGATTCAGCCCCAGGAGTTCA
    >probe:HG-U133A:220856_x_at:309:9; Interrogation_Position = 1006; Antisense;
    ATTAGCCGGCAGTGGTGGCACATGC
    >probe:HG-U133A:220856_x_at:60:195; Interrogation_Position = 1034; Antisense;
    CAGTCCCAGCTACTCGAGAGGATGA
    >probe:HG-U133A:220856_x_at:575:279; Interrogation_Position = 1177; Antisense;
    CCAGCTGGGCATGGTGACTCACACT
    >probe:HG-U133A:220856_x_at:462:287; Interrogation_Position = 1224; Antisense;
    GCCAAGGAGGGCAGGTCATGAGGTC
    >probe:HG-U133A:220856_x_at:109:551; Interrogation_Position = 1268; Antisense;
    GGCCAATTTGTACCTGCAGGCCTCC
    >probe:HG-U133A:220856_x_at:114:615; Interrogation_Position = 1321; Antisense;
    TCGACCGCGATGATGTGGCTCTGGA
    >probe:HG-U133A:220856_x_at:272:617; Interrogation_Position = 1340; Antisense;
    TCTGGAAGGCGTGAGCCACTTCTTC
    >probe:HG-U133A:220856_x_at:457:211; Interrogation_Position = 1358; Antisense;
    CTTCTTCCGCGAATTGGCCGAGGAG
    Probes sequences for 57739_at:
    >probe:HG-U133A:57739_at:87:635; Interrogation_Position = 100; Antisense;
    TCAGCCTCCCTGCTGGGACAGGGAC
    >probe:HG-U133A:57739_at:16:257; Interrogation_Position = 107; Antisense;
    CCCTGCTGGGACAGGGACCTATGGC
    >probe:HG-U133A:57739_at:76:79; Interrogation_Position = 278; Antisense;
    AGCTTAGCATGAATCTTCTTTATTG
    >probe:HG-U133A:57739_at:546:319; Interrogation_Position = 279; Antisense;
    GCTTAGCATGAATCTTCTTTATTGT
    >probe:HG-U133A:57739_at:378:77; Interrogation_Position = 283; Antisense;
    AGCATGAATCTTCTTTATTGTCCTG
    >probe:HG-U133A:57739_at:419:325; Interrogation_Position = 284; Antisense;
    GCATGAATCTTCTTTATTGTCCTGA
    >probe:HG-U133A:57739_at:602:331; Interrogation_Position = 345; Antisense;
    GCAGCTGAGCCAAAGGGGTCAGAAA
    >probe:HG-U133A:57739_at:314:495; Interrogation_Position = 360; Antisense;
    GGGTCAGAAATTCTGCCCTTTGCCT
    >probe:HG-U133A:57739_at:227:509; Interrogation_Position = 361; Antisense;
    GGTCAGAAATTCTGCCCTTTGCCTC
    >probe:HG-U133A:57739_at:672:281; Interrogation_Position = 385; Antisense;
    CCACCACATGGCATTCTGGTTTTGG
    >probe:HG-U133A:57739_at:171:185; Interrogation_Position = 386; Antisense;
    CACCACATGGCATTCTGGTTTTGGT
    >probe:HG-U133A:57739_at:210:85; Interrogation_Position = 387; Antisense;
    ACCACATGGCATTCTGGTTTTGGTT
    >probe:HG-U133A:57739_at:87:285; Interrogation_Position = 388; Antisense;
    CCACATGGCATTCTGGTTTTGGTTT
    >probe:HG-U133A:57739_at:24:177; Interrogation_Position = 389; Antisense;
    CACATGGCATTCTGGTTTTGGTTTC
    >probe:HG-U133A:57739_at:199:105; Interrogation_Position = 390; Antisense;
    ACATGGCATTCTGGTTTTGGTTTCT
    >probe:HG-U133A:57739_at:348:199; Interrogation_Position = 391; Antisense;
    CATGGCATTCTGGTTTTGGTTTCTG
    Probes sequences for 200654_at:
    >probe:HG-U133A:200654_at:583:21; Interrogation_Position = 1982; Antisense;
    ATCAGGGTATTTGTTCCACCTTGGC
    >probe:HG-U133A:200654_at:315:265; Interrogation_Position = 2052; Antisense;
    CCGGACTGGACATGGTCACTCAGTA
    >probe:HG-U133A:200654_at:89:97; Interrogation_Position = 2056; Antisense;
    ACTGGACATGGTCACTCAGTACCGC
    >probe:HG-U133A:200654_at:535:267; Interrogation_Position = 2080; Antisense;
    CCTGCAGTGTCGCCATGACTGATCA
    >probe:HG-U133A:200654_at:675:33; Interrogation_Position = 2094; Antisense;
    ATGACTGATCATGGCTCTTGCATTT
    >probe:HG-U133A:200654_at:93:199; Interrogation_Position = 2103; Antisense;
    CATGGCTCTTGCATTTTTGGGTAAA
    >probe:HG-U133A:200654_at:33:375; Interrogation_Position = 2132; Antisense;
    GACTTCCGGATCCTGTCAGGGTGTC
    >probe:HG-U133A:200654_at:569:43; Interrogation_Position = 2266; Antisense;
    AGGCTCAGGCCAGGTCTGGACAGCT
    >probe:HG-U133A:200654_at:100:43; Interrogation_Position = 2272; Antisense;
    AGGCCAGGTCTGGACAGCTGTGACT
    >probe:HG-U133A:200654_at:170:87; Interrogation_Position = 2318; Antisense;
    ACCAGCCGGCTATGGGCACATTACG
    >probe:HG-U133A:200654_at:32:565; Interrogation_Position = 2330; Antisense;
    TGGGCACATTACGTGACCACTGGCC
    Probes sequences for 220071_x_at:
    >probe:HG-U133A:220071_x_at:588:657; Interrogation_Position = 1175; Antisense;
    TAGCCGACCATGGTGGTGCATGCCT
    >probe:HG-U133A:220071_x_at:25:509; Interrogation_Position = 1226; Antisense;
    GGTAAGAGGATCACCTAAGCCTGTG
    >probe:HG-U133A:220071_x_at:596:139; Interrogation_Position = 1242; Antisense;
    AAGCCTGTGAGGTCATGGTTGCAAT
    >probe:HG-U133A:220071_x_at:382:165; Interrogation_Position = 1263; Antisense;
    CAATGAGTCATGATCACGCCACTGC
    >probe:HG-U133A:220071_x_at:286:301; Interrogation_Position = 1300; Antisense;
    GCGACACAGTAAGACCCTGTCTCAA
    >probe:HG-U133A:220071_x_at:508:143; Interrogation_Position = 1329; Antisense;
    AAGAAGTGTGTTTCTGGCCAGGCAC
    >probe:HG-U133A:220071_x_at:145:519; Interrogation_Position = 1385; Antisense;
    GGAGGCCTAGGTGGGCAGATCATGA
    >probe:HG-U133A:220071_x_at:584:287; Interrogation_Position = 1434; Antisense;
    GCCAACATGGCGAAACACCTGTCTC
    >probe:HG-U133A:220071_x_at:271:101; Interrogation_Position = 1514; Antisense;
    ACTTTGGGAGGCCGACCCAGATGGG
    >probe:HG-U133A:220071_x_at:624:25; Interrogation_Position = 1522; Antisense;
    AGGCCGACCCAGATGGGTGGATCAC
    >probe:HG-U133A:220071_x_at:545:193; Interrogation_Position = 1706; Antisense;
    CAGTAAGCCAAGATCGTGCCTCTGC
    Probes sequences for 216745_x_at:
    >probe:HG-U133A:216745_x_at:222:565; Interrogation_Position = 1415; Antisense;
    TGTGGTGGCATGTGCCTATAGTTCC
    >probe:HG-U133A:216745_x_at:701:513; Interrogation_Position = 1465; Antisense;
    GGATTAGTTGAGCCCAGAAGTTCGA
    >probe:HG-U133A:216745_x_at:664:583; Interrogation_Position = 1533; Antisense;
    TGACATAGTGAGACCCTGCCTTAAA
    >probe:HG-U133A:216745_x_at:236:65; Interrogation_Position = 1572; Antisense;
    AGAGGGTAAGCATCATGACATCTCC
    >probe:HG-U133A:216745_x_at:34:27; Interrogation_Position = 1591; Antisense;
    ATCTCCTTACCCTGATTAATCTCAA
    >probe:HG-U133A:216745_x_at:217:91; Interrogation_Position = 1599; Antisense;
    ACCCTGATTAATCTCAAACTGTTCA
    >probe:HG-U133A:216745_x_at:706:13; Interrogation_Position = 1641; Antisense;
    ATATATACACACATCCGCATACCTT
    >probe:HG-U133A:216745_x_at:378:249; Interrogation_Position = 1656; Antisense;
    CGCATACCTTCGCATACACATATAC
    >probe:HG-U133A:216745_x_at:82:75; Interrogation_Position = 1697; Antisense;
    AGCAAATGCAGGCTGGGCACGGTTG
    >probe:HG-U133A:216745_x_at:534:687; Interrogation_Position = 1842; Antisense;
    TTAGCTGGATGTAGTGGCGGGCGCC
    >probe:HG-U133A:216745_x_at:261:433; Interrogation_Position = 1927; Antisense;
    GTTGCAGTGAACGGAGATGGCGCCA
    Probes sequences for 218976_at:
    >probe:HG-U133A:218976_at:118:251; Interrogation_Position = 619; Antisense;
    CCCAAGCCCCTAGAGAAGTCAGTCT
    >probe:HG-U133A:218976_at:587:55; Interrogation_Position = 639; Antisense;
    AGTCTCCCCGCAAAATTCAGATTGT
    >probe:HG-U133A:218976_at:195:567; Interrogation_Position = 678; Antisense;
    TGTGAATGGTTGGCACCTTCGTTTC
    >probe:HG-U133A:218976_at:389:215; Interrogation_Position = 805; Antisense;
    CTATGCTGCCAACATGCAGTCTTTG
    >probe:HG-U133A:218976_at:473:199; Interrogation_Position = 859; Antisense;
    CATGTCTGTGAATTGCTGAGTACTA
    >probe:HG-U133A:218976_at:461:651; Interrogation_Position = 879; Antisense;
    TACTAATTGATTCCTCCATCCTTGA
    >probe:HG-U133A:218976_at:551:343; Interrogation_Position = 902; Antisense;
    GAATCAGTTCTCATAATGCTTTTTA
    >probe:HG-U133A:218976_at:14:31; Interrogation_Position = 1007; Antisense;
    ATGCTTTTTCCTATTAATACTACTT
    >probe:HG-U133A:218976_at:395:5; Interrogation_Position = 1081; Antisense;
    ATTGACATACTGTGATCTCTATTAG
    >probe:HG-U133A:218976_at:29:575; Interrogation_Position = 1124; Antisense;
    TGTTTTCTTACCCTTGACTTACAAT
    >probe:HG-U133A:218976_at:496:473; Interrogation_Position = 1154; Antisense;
    GTGAAATTACTTGTCTGAACCCCGT
    Probes sequences for 211921_x_at:
    >probe:HG-U133A:211921_x_at:215:37; Interrogation_Position = 13; Antisense;
    ATGTCAGACGCAGCCGTAGACACCA
    >probe:HG-U133A:211921_x_at:563:77; Interrogation_Position = 37; Antisense;
    AGCTCCGAAATCACCACCAAGGACT
    >probe:HG-U133A:211921_x_at:447:361; Interrogation_Position = 94; Antisense;
    GAAAATGGAAGAGACGCCCCTGCTA
    >probe:HG-U133A:211921_x_at:314:93; Interrogation_Position = 107; Antisense;
    ACGCCCCTGCTAACGGGAATGCTAA
    >probe:HG-U133A:211921_x_at:43:327; Interrogation_Position = 147; Antisense;
    GCAGGAGGCTGACAATGAGGTAGAC
    >probe:HG-U133A:211921_x_at:528:141; Interrogation_Position = 254; Antisense;
    AAGCTGAGTCAGCTACGGGCAAGCG
    >probe:HG-U133A:211921_x_at:237:217; Interrogation_Position = 266; Antisense;
    CTACGGGCAAGCGGGCAGCTGAAGA
    >probe:HG-U133A:211921_x_at:532:79; Interrogation_Position = 275; Antisense;
    AGCGGGCAGCTGAAGATGATGAGGA
    >probe:HG-U133A:211921_x_at:204:409; Interrogation_Position = 295; Antisense;
    GAGGATGACGATGTCGATACCAAGA
    >probe:HG-U133A:211921_x_at:680:421; Interrogation_Position = 310; Antisense;
    GATACCAAGAAGCAGAAGACCGACG
    >probe:HG-U133A:211921_x_at:149:331; Interrogation_Position = 321; Antisense;
    GCAGAAGACCGACGAGGATGACTAG
    Probes sequences for 218074_at:
    >probe:HG-U133A:218074_at:78:189; Interrogation_Position = 164; Antisense;
    CAGCATCGACGCACGCGAGATCTTC
    >probe:HG-U133A:218074_at:486:245; Interrogation_Position = 188; Antisense;
    CGATCTGATTCGCTCCATCAATGAC
    >probe:HG-U133A:218074_at:384:391; Interrogation_Position = 216; Antisense;
    GAGCATCCACTGACGCTAGAGGAGT
    >probe:HG-U133A:218074_at:15:327; Interrogation_Position = 254; Antisense;
    GCAGGTGCGGGTTCAGGTTAGCGAC
    >probe:HG-U133A:218074_at:554:57; Interrogation_Position = 285; Antisense;
    AGTACAGTGGCTGTGGCTTTCACAC
    >probe:HG-U133A:218074_at:625:467; Interrogation_Position = 350; Antisense;
    GTCCATCAAGGTCAAGCTTCTGCGC
    >probe:HG-U133A:218074_at:471:145; Interrogation_Position = 393; Antisense;
    AAGATGGACGTGCACATTACTCCGG
    >probe:HG-U133A:218074_at:22:21; Interrogation_Position = 460; Antisense;
    ATAAGGAGCGGGTGGCAGCTGCCCT
    >probe:HG-U133A:218074_at:26:361; Interrogation_Position = 488; Antisense;
    GAACACCCACCTCTTGGAGGTTGTG
    >probe:HG-U133A:218074_at:52:221; Interrogation_Position = 556; Antisense;
    CTCAGCCTGCATACTGGTATCCTGG
    >probe:HG-U133A:218074_at:323:309; Interrogation_Position = 598; Antisense;
    GCTGTTACCGTTGTTTTCTTGAATC
    Probes sequences for 200914_x_at:
    >probe:HG-U133A:200914_x_at:224:577; Interrogation_Position = 106; Antisense;
    TTCCCGGCCGCACAGGGTTTTATAG
    >probe:HG-U133A:200914_x_at:191:333; Interrogation_Position = 115; Antisense;
    GCACAGGGTTTTATAGGATCACATT
    >probe:HG-U133A:200914_x_at:552:515; Interrogation_Position = 130; Antisense;
    GGATCACATTGACAAAAGTACCATG
    >probe:HG-U133A:200914_x_at:559:451; Interrogation_Position = 147; Antisense;
    GTACCATGGAGTTTTATGAGTCAGC
    >probe:HG-U133A:200914_x_at:16:403; Interrogation_Position = 164; Antisense;
    GAGTCAGCATATTTTATTGTTCTTA
    >probe:HG-U133A:200914_x_at:247:13; Interrogation_Position = 172; Antisense;
    ATATTTTATTGTTCTTATTCCTTCA
    >probe:HG-U133A:200914_x_at:318:435; Interrogation_Position = 182; Antisense;
    GTTCTTATTCCTTCAATAGTTATTA
    >probe:HG-U133A:200914_x_at:424:9; Interrogation_Position = 203; Antisense;
    ATTACAGTAATTTTCCTCTTCTTCT
    >probe:HG-U133A:200914_x_at:642:271; Interrogation_Position = 217; Antisense;
    CCTCTTCTTCTGGCTTTTCATGAAA
    >probe:HG-U133A:200914_x_at:589:393; Interrogation_Position = 279; Antisense;
    GAGAACAAAAGCTTATTCCTACCAA
    >probe:HG-U133A:200914_x_at:149:121; Interrogation_Position = 286; Antisense;
    AAAGCTTATTCCTACCAAAACAGAT
    Probes sequences for 214833_at:
    >probe:HG-U133A:214833_at:457:345; Interrogation_Position = 4954; Antisense;
    GAATGAGCTTGAAGTGCTCCCCTCT
    >probe:HG-U133A:214833_at:441:271; Interrogation_Position = 4996; Antisense;
    CCTCTGACAGTTTTGCTCTTTGGTT
    >probe:HG-U133A:214833_at:463:583; Interrogation_Position = 5000; Antisense;
    TGACAGTTTTGCTCTTTGGTTCACA
    >probe:HG-U133A:214833_at:421:625; Interrogation_Position = 5012; Antisense;
    TCTTTGGTTCACAAGGCTTCCTAAA
    >probe:HG-U133A:214833_at:407:559; Interrogation_Position = 5016; Antisense;
    TGGTTCACAAGGCTTCCTAAACTCA
    >probe:HG-U133A:214833_at:60:437; Interrogation_Position = 5018; Antisense;
    GTTCACAAGGCTTCCTAAACTCAGG
    >probe:HG-U133A:214833_at:320:153; Interrogation_Position = 5024; Antisense;
    AAGGCTTCCTAAACTCAGGATTGCT
    >probe:HG-U133A:214833_at:555:123; Interrogation_Position = 5034; Antisense;
    AAACTCAGGATTGCTTTGTGGGTCT
    >probe:HG-U133A:214833_at:165:197; Interrogation_Position = 5039; Antisense;
    CAGGATTGCTTTGTGGGTCTGTTCT
    >probe:HG-U133A:214833_at:205:43; Interrogation_Position = 5074; Antisense;
    AGGCCAGGGCAGGTGCAAGCTATGC
    >probe:HG-U133A:214833_at:236:551; Interrogation_Position = 5075; Antisense;
    GGCCAGGGCAGGTGCAAGCTATGCC
    Probes sequences for 216384_x_at:
    >probe:HG-U133A:216384_x_at:213:37; Interrogation_Position = 13; Antisense;
    ATGTCAGACGCAGCCGTAGACACCA
    >probe:HG-U133A:216384_x_at:141:227; Interrogation_Position = 39; Antisense;
    CTCCGAAATCACCACCGAGGACTTA
    >probe:HG-U133A:216384_x_at:671:347; Interrogation_Position = 72; Antisense;
    GAAGGAAGTTGTGGAAGAGGCGGAA
    >probe:HG-U133A:216384_x_at:448:361; Interrogation_Position = 94; Antisense;
    GAAAATGGAAGAGACGCCCCTGCTC
    >probe:HG-U133A:216384_x_at:650:299; Interrogation_Position = 109; Antisense;
    GCCCCTGCTCACGGGAATGCTAATG
    >probe:HG-U133A:216384_x_at:583:405; Interrogation_Position = 133; Antisense;
    GAGGAAAATGGGGAGCCGGAGGCTG
    >probe:HG-U133A:216384_x_at:47:387; Interrogation_Position = 145; Antisense;
    GAGCCGGAGGCTGACAACGAGGTAG
    >probe:HG-U133A:216384_x_at:669:497; Interrogation_Position = 187; Antisense;
    GGTGGGGAGGAAGAAGGTGATGGTG
    >probe:HG-U133A:216384_x_at:343:389; Interrogation_Position = 239; Antisense;
    GAGCTGAGTCAGCTACGGGCAAGCG
    >probe:HG-U133A:216384_x_at:455:95; Interrogation_Position = 253; Antisense;
    ACGGGCAAGCGGGCAGCTGAAGATG
    >probe:HG-U133A:216384_x_at:195:63; Interrogation_Position = 273; Antisense;
    AGATGATGAGGATGACGATGTCGAT
    Probes sequences for 214594_x_at:
    >probe:HG-U133A:214594_x_at:465:603; Interrogation_Position = 3960; Antisense;
    TCCTATTATTCGGCCGGGCGTGGTG
    >probe:HG-U133A:214594_x_at:122:273; Interrogation_Position = 3961; Antisense;
    CCTATTATTCGGCCGGGCGTGGTGG
    >probe:HG-U133A:214594_x_at:526:9; Interrogation_Position = 4261: Antisense;
    ATTTAAAAATAGGCCGGGCATGGTG
    >probe:HG-U133A:214594_x_at:141:677; Interrogation_Position = 4262; Antisense;
    TTTAAAAATAGGCCGGGCATGGTGG
    >probe:HG-U133A:214594_x_at:250:83; Interrogation_Position = 4390; Antisense;
    ACAAAAATTAGCTGGAAATCGCTTG
    >probe:HG-U133A:214594_x_at:118:125; Interrogation_Position = 4392; Antisense;
    AAAAATTAGCTGGAAATCGCTTGAA
    >probe:HG-U133A:214594_x_at:177:129; Interrogation_Position = 4393; Antisense;
    AAAATTAGCTGGAAATCGCTTGAAC
    >probe:HG-U133A:214594_x_at:182:115; Interrogation_Position = 4394; Antisense;
    AAATTAGCTGGAAATCGCTTGAACC
    >probe:HG-U133A:214594_x_at:421:161; Interrogation_Position = 4395; Antisense;
    AATTAGCTGGAAATCGCTTGAACCT
    >probe:HG-U133A:214594_x_at:421:5; Interrogation_Position = 4396; Antisense;
    ATTAGCTGGAAATCGCTTGAACCTG
    >probe:HG-U133A:214594_x_at:454:581; Interrogation_Position = 4438; Antisense;
    TGAGCCGAGATCGTGGCACTGCAGC
    Probes sequences for 222122_s_at:
    >probe:HG-U133A:222122_s_at:274:649; Interrogation_Position = 3047; Antisense;
    TACAAACTAACCAAGGCATCGGTAC
    >probe:HG-U133A:222122_s_at:478:323; Interrogation_Position = 3062; Antisense;
    GCATCGGTACATTGCCTTGAAACAG
    >probe:HG-U133A:222122_s_at:65:131; Interrogation_Position = 3082; Antisense;
    AACAGGCGAATATACTCACATCAGG
    >probe:HG-U133A:222122_s_at:73:395; Interrogation_Position = 3212; Antisense;
    GAGAAGAGGCCAGATCTATATGCAT
    >probe:HG-U133A:222122_s_at:598:487; Interrogation_Position = 3244; Antisense;
    GGGCTACTCTGGGCAGTTGAAAAGT
    >probe:HG-U133A:222122_s_at:23:155; Interrogation_Position = 3293; Antisense;
    AATGAGTTTCATCACAAAGACCCCC
    >probe:HG-U133A:222122_s_at:379:259; Interrogation_Position = 3315; Antisense;
    CCCCTCCGAGGAATGCAGTTGCCAG
    >probe:HG-U133A:222122_s_at:68:481; Interrogation_Position = 3339; Antisense;
    GTGTGCAAAATGGGCCTGGTGGTGG
    >probe:HG-U133A:222122_s_at:13:495; Interrogation_Position = 3435; Antisense;
    GGGAGAGATCTCAGTGTGGTGTGAA
    >probe:HG-U133A:222122_s_at:316:121; Interrogation_Position = 3470; Antisense;
    AAAGCTTCTAGTACCACACCTAAAG
    >probe:HG-U133A:222122_s_at:518:529; Interrogation_Position = 3509; Antisense;
    GGAAATAGTGGCTCTAACAGCAACA
    Probes sequences for 202004_x_at:
    >probe:HG-U133A:202004_x_at:666:33; Interrogation_Position = 749; Antisense;
    ATGAGGTGGCTGCAAAAACTCCCCT
    >probe:HG-U133A:202004_x_at:363:271; Interrogation_Position = 791; Antisense;
    CCTACTCTCGGCCTAGAAGCAGTTA
    >probe:HG-U133A:202004_x_at:334:439; Interrogation_Position = 812; Antisense;
    GTTATTCTCTCTCCATATTGGGCTT
    >probe:HG-U133A:202004_x_at:64:595; Interrogation_Position = 844; Antisense;
    TGCTGAGGGTCAGCTTTTGGCTCCT
    >probe:HG-U133A:202004_x_at:503:311; Interrogation_Position = 863; Antisense;
    GCTCCTTCTTCCTGAGACAGTGGAA
    >probe:HG-U133A:202004_x_at:503:129; Interrogation_Position = 887; Antisense;
    AACAATGCCAGCTCTGTGGCTTCTG
    >probe:HG-U133A:202004_x_at:577:431; Interrogation_Position = 941; Antisense;
    GTTGGGTGAAGCTTTGGGTTGCCAC
    >probe:HG-U133A:202004_x_at:375:231; Interrogation_Position = 965; Antisense;
    CTGCCTGTGGGTTTGCTGGCTTAAA
    >probe:HG-U133A:202004_x_at:381:251; Interrogation_Position = 1016; Antisense;
    CCCAGGCCATTAACAACTAACACAG
    >probe:HG-U133A:202004_x_at:426:533; Interrogation_Position = 1073; Antisense;
    GGAATTAGTCTGTCCCAGCTAGAGG
    >probe:HG-U133A:202004_x_at:250:59; Interrogation_Position = 1113; Antisense;
    AGTTAGTTCTTGGAGCAGCTGCTTT
    Probes sequences for 204060_s_at:
    >probe:HG-U133A:204060_s_at:669:479; Interrogation_Position = 5533; Antisense;
    GTGTGTTGGGAAGTTCTGTTGAGGC
    >probe:HG-U133A:204060_s_at:549:489; Interrogation_Position = 5540; Antisense;
    GGGAAGTTCTGTTGAGGCTTAGTTT
    >probe:HG-U133A:204060_s_at:48:433; Interrogation_Position = 5550; Antisense;
    GTTGAGGCTTAGTTTGATCTTCCAT
    >probe:HG-U133A:204060_s_at:306:213; Interrogation_Position = 5557; Antisense;
    CTTAGTTTGATCTTCCATGGTGGAC
    >probe:HG-U133A:204060_s_at:230:675; Interrogation_Position = 5642; Antisense;
    TTTCCACAGTCCTTGTGCTCTGAGA
    >probe:HG-U133A:204060_s_at:245:177; Interrogation_Position = 5646; Antisense;
    CACAGTCCTTGTGCTCTGAGATGGC
    >probe:HG-U133A:204060_s_at:105:469; Interrogation_Position = 5650; Antisense;
    GTCCTTGTGCTCTGAGATGGCTGTT
    >probe:HG-U133A:204060_s_at:332:581; Interrogation_Position = 5662; Antisense;
    TGAGATGGCTGTTGAGCTCTGCTCA
    >probe:HG-U133A:204060_s_at:570:175; Interrogation_Position = 5787; Antisense;
    CACACAGGCAGAGTGCCTTGGCAGG
    >probe:HG-U133A:204060_s_at:318:87; Interrogation_Position = 5819; Antisense;
    ACCCCTTGGCGAGCCAGAACTGGGA
    >probe:HG-U133A:204060_s_at:247:673; Interrogation_Position = 5937; Antisense;
    TTTGTACATATGCCCTAGGAAAATT
    Probes sequences for 209653_at:
    >probe:HG-U133A:209653_at:158:457; Interrogation_Position = 1252; Antisense;
    GTAATTGATGCCAATCTTGTACCAA
    >probe:HG-U133A:209653_at:298:515; Interrogation_Position = 1293; Antisense;
    GGATAAGGGGGATTTTGGCACTCAA
    >probe:HG-U133A:209653_at:297:355; Interrogation_Position = 1321; Antisense;
    GAAGCTGCTTGGGCCATAAGTAACT
    >probe:HG-U133A:209653_at:274:423; Interrogation_Position = 1366; Antisense;
    GATCAAGTGGCTTACCTTATCCAAC
    >probe:HG-U133A:209653_at:627:613; Interrogation_Position = 1400; Antisense;
    TCCCACCTTTTTGCAACTTGCTGAC
    >probe:HG-U133A:209653_at:594:375; Interrogation_Position = 1576; Antisense;
    GACATCTACAAATTGGCCTATGAGA
    >probe:HG-U133A:209653_at:3:423; Interrogation_Position = 1599; Antisense;
    GATCATTGATCAGTTCTTCTCTTCA
    >probe:HG-U133A:209653_at:651:425; Interrogation_Position = 1627; Antisense;
    GATATTGATGAAGACCCTAGCCTTG
    >probe:HG-U133A:209653_at:520:255; Interrogation_Position = 1641; Antisense;
    CCCTAGCCTTGTTCCAGAGGCAATT
    >probe:HG-U133A:209653_at:29:541; Interrogation_Position = 1669; Antisense;
    GGCGGAACATTTGGTTTCAATTCAT
    >probe:HG-U133A:209653_at:445:639; Interrogation_Position = 1685; Antisense;
    TCAATTCATCTGCCAATGTACCAAC
    Probes sequences for 210858_x_at:
    >probe:HG-U133A:210858_x_at:33:593; Interrogation_Position = 5328; Antisense;
    TGCTCATACAGCAGGCCATAGACCC
    >probe:HG-U133A:210858_x_at:263:199; Interrogation_Position = 5344; Antisense;
    CATAGACCCCAAAAATCTCAGCCGA
    >probe:HG-U133A:210858_x_at:302:633; Interrogation_Position = 5361; Antisense;
    TCAGCCGACTTTTCCCAGGATGGAA
    >probe:HG-U133A:210858_x_at:91:531; Interrogation_Position = 5382; Antisense;
    GGAAAGCTTGGGTGTGATCTTCAGT
    >probe:HG-U133A:210858_x_at:451:687; Interrogation_Position = 5414; Antisense;
    TTACCCTTTCATTCAGCCTTTAGAA
    >probe:HG-U133A:210858_x_at:680:291; Interrogation_Position = 5450; Antisense;
    GCCTTTATTTTTAACCTGCCAACAT
    >probe:HG-U133A:210858_x_at:309:643; Interrogation_Position = 5577; Antisense;
    TAAGGAACATCTCTGCTTTCACTCT
    >probe:HG-U133A:210858_x_at:645:115; Interrogation_Position = 5606; Antisense;
    AAATAATGGTCATTCGGGCTGGGCG
    >probe:HG-U133A:210858_x_at:535:687; Interrogation_Position = 5770; Antisense;
    TTAGCCGAGCATGGTGGCGGGCACC
    >probe:HG-U133A:210858_x_at:21:495; Interrogation_Position = 5844; Antisense;
    GGGAGGTGAAGGTTGCTGTGGGCCA
    >probe:HG-U133A:210858_x_at:58:483; Interrogation_Position = 5861; Antisense;
    GTGGGCCAAAATCATGCCATTGCAC
    Probes sequences for 215314_at:
    >probe:HG-U133A:215314_at:471:221; Interrogation_Position = 872; Antisense;
    CTCAGCCTCCTAAGGAGCTATTTTT
    >probe:HG-U133A:215314_at:195:471; Interrogation_Position = 1015; Antisense;
    GTGCCCAGCTAGATTCTTTACTTTT
    >probe:HG-U133A:215314_at:37:665; Interrogation_Position = 1110; Antisense;
    TATGCCTCTTTCTGTAGATCAAATG
    >probe:HG-U133A:215314_at:399:369; Interrogation_Position = 1134; Antisense;
    GAAATATGGCCCAATTTACAAGGGT
    >probe:HG-U133A:215314_at:266:497; Interrogation_Position = 1155; Antisense;
    GGGTATACACCTTCATGGTAGATGA
    >probe:HG-U133A:215314_at:15:427; Interrogation_Position = 1217; Antisense;
    GATTTAATTGTAGTTTCTGCTGCCA
    >probe:HG-U133A:215314_at:354:285; Interrogation_Position = 1238; Antisense;
    GCCAAAAAGTCTCCATAATTATTTT
    >probe:HG-U133A:215314_at:679:159; Interrogation_Position = 1265; Antisense;
    AATAACTTATAACTACCTTGATCTT
    >probe:HG-U133A:215314_at:319:643; Interrogation_Position = 1274; Antisense;
    TAACTACCTTGATCTTTACCTGTAC
    >probe:HG-U133A:215314_at:648:275; Interrogation_Position = 1280; Antisense;
    CCTTGATCTTTACCTGTACACACAC
    >probe:HG-U133A:215314_at:373:457; Interrogation_Position = 1330; Antisense;
    GTAATTTCACTTAGAATCCTGGTCA
    Probes sequences for 208238_x_at:
    >probe:HG-U133A:208238_x_at:281:401; Interrogation_Position = 75; Antisense;
    GAGTTGTCAGAAAGGTTTCCCTGCA
    >probe:HG-U133A:208238_x_at:158:357; Interrogation_Position = 84; Antisense;
    GAAAGGTTTCCCTGCAGATAAACTT
    >probe:HG-U133A:208238_x_at:235:119; Interrogation_Position = 111; Antisense;
    AAAGATCTCAACCTGTGGACGGGAA
    >probe:HG-U133A:208238_x_at:598:63; Interrogation_Position = 159; Antisense;
    AGAGTCCCCACTGGCACAGTGGCTC
    >probe:HG-U133A:208238_x_at:695:67; Interrogation_Position = 198; Antisense;
    AGCACTTCTAGAGGCTGAGGCGGGT
    >probe:HG-U133A:208238_x_at:288:43; Interrogation_Position = 215; Antisense;
    AGGCGGGTGTATTGCTTGAGCCCAG
    >probe:HG-U133A:208238_x_at:447:389; Interrogation_Position = 232; Antisense;
    GAGCCCAGAAGTTTGAGAGCAGCCT
    >probe:HG-U133A:208238_x_at:459:195; Interrogation_Position = 309; Antisense;
    CATGAGGGTACACACCTGTAGTCCC
    >probe:HG-U133A:208238_x_at:306:311; Interrogation_Position = 385; Antisense;
    GCTGCAGTGAGCCTAGGTGGTGCCA
    >probe:HG-U133A:208238_x_at:25:335; Interrogation_Position = 412; Antisense;
    GCACTCCAACCTGGGGGACAGAGGA
    >probe:HG-U133A:208238_x_at:699:491; Interrogation_Position = 426; Antisense;
    GGGACAGAGGAGGCCCTGTCCCGAG
    Probes sequences for 212041_at:
    >probe:HG-U133A:212041_at:213:481; Interrogation_Position = 1104; Antisense;
    GTGTCGCAACATCGTGTGGATCGCT
    >probe:HG-U133A:212041_at:431:485: Interrogation_Position = 1119; Antisense;
    GTGGATCGCTGAATGTATCGCCCAG
    >probe:HG-U133A:212041_at:540:181; Interrogation_Position = 1147; Antisense;
    CACCGCGCCAAAATCGACAACTACA
    >probe:HG-U133A:212041_at:166:117; Interrogation_Position = 1157; Antisense;
    AAATCGACAACTACATCCCTATCTT
    >probe:HG-U133A:212041_at:424:543; Interrogation_Position = 1199; Antisense;
    GGCTCTCAATTGCACTCTTTGTGTG
    >probe:HG-U133A:212041_at:525:479; Interrogation_Position = 1259; Antisense;
    GTGTATGTGGTCTGTGACAAGCCTG
    >probe:HG-U133A:212041_at:164:239; Interrogation_Position = 1270; Antisense;
    CTGTGACAAGCCTGTGGCTCACCTG
    >probe:HG-U133A:212041_at:537:181; Interrogation_Position = 1397; Antisense;
    CACCAAGGATGGACGAAGACCCCCT
    >probe:HG-U133A:212041_at:143:635; Interrogation_Position = 1441; Antisense;
    TCAGCCCTGTGGTTACAGCCGCTGA
    >probe:HG-U133A:212041_at:515:109; Interrogation_Position = 1455; Antisense;
    ACAGCCGCTGATGTATCTAAGAAGC
    >probe:HG-U133A:212041_at:303:257; Interrogation_Position = 1655; Antisense;
    CCCTGCCCCTCTGAGACAATAAAAC
    Probes sequences for 221294_at:
    >probe:HG-U133A:221294_at:480:607; Interrogation_Position = 464; Antisense;
    TCCTGATTTGGCTATACTCGACCCT
    >probe:HG-U133A:221294_at:484:691; Interrogation_Position = 545; Antisense;
    TTCAGTGGTGTGCGGAGTCCTGGCA
    >probe:HG-U133A:221294_at:413:217; Interrogation_Position = 579; Antisense;
    CTACTTCACCCTGTTCATCGTGATG
    >probe:HG-U133A:221294_at:694:577; Interrogation_Position = 599; Antisense;
    TGATGATGTTATATGCCCCAGCAGC
    >probe:HG-U133A:221294_at:289:549; Interrogation_Position = 741; Antisense;
    GGCCTGTCCTGATAAGCGCTATGCC
    >probe:HG-U133A:221294_at:99:209; Interrogation_Position = 759; Antisense;
    CTATGCCATGGTCCTGTTTCGAATC
    >probe:HG-U133A:221294_at:7:447; Interrogation_Position = 790; Antisense;
    GTATTTTACATCCTCTGGTTGCCAT
    >probe:HG-U133A:221294_at:568:513; Interrogation_Position = 806; Antisense;
    GGTTGCCATATATCATCTACTTCTT
    >probe:HG-U133A:221294_at:264:375; Interrogation_Position = 953; Antisense;
    GACTAAAGCGCCTCTCAGGGGCTAT
    >probe:HG-U133A:221294_at:514:195; Interrogation_Position = 968; Antisense;
    CAGGGGCTATGTGTACTTCTTGTGC
    >probe:HG-U133A:221294_at:658:285; Interrogation_Position = 1006; Antisense;
    GCCAACGACCCTTACACAGTTAGAA
    Probes sequences for 210705_s_at:
    >probe:HG-U133A:210705_s_at:460:197; Interrogation_Position = 1123; Antisense;
    CATGTTGGTTAGGCTCGTCTAGAAC
    >probe:HG-U133A:210705_s_at:501:437; Interrogation_Position = 1130; Antisense;
    GTTAGGCTCGTCTAGAACCTCTGAC
    >probe:HG-U133A:210705_s_at:394:21; Interrogation_Position = 1272; Antisense;
    ATAAGTGCTAATGTTTTATTGCATA
    >probe:HG-U133A:210705_s_at:476:5; Interrogation_Position = 1289; Antisense;
    ATTGCATAGTAGGGTGACTAGAGTT
    >probe:HG-U133A:210705_s_at:29:401; Interrogation_Position = 1309; Antisense;
    GAGTTAACAATAACCTATTGCATAT
    >probe:HG-U133A:210705_s_at:140:369; Interrogation_Position = 1381; Antisense;
    GAAATGACACATATTTGAGGTGATG
    >probe:HG-U133A:210705_s_at:592:1; Interrogation_Position = 1409; Antisense;
    ATGCTAATTACCCTGGTTCGGTTAT
    >probe:HG-U133A:210705_s_at:433:653; Interrogation_Position = 1417; Antisense;
    TACCCTGGTTCGGTTATTACGCAAT
    >probe:HG-U133A:210705_s_at:504:511; Interrogation_Position = 1423; Antisense;
    GGTTCGGTTATTACGCAATGTATAC
    >probe:HG-U133A:210705_s_at:179:127; Interrogation_Position = 1523; Antisense;
    AAAACCTTCATCTAATACTTTGGAT
    >probe:HG-U133A:210705_s_at:349:203; Interrogation_Position = 1531; Antisense;
    CATCTAATACTTTGGATCATTGTGA
    Probes sequences for 211184_s_at:
    >probe:HG-U133A:211184_s_at:685:463; Interrogation_Position = 1779; Antisense;
    GTCTGCCCCCCAAAGGAGTATGACG
    >probe:HG-U133A:211184_s_at:502:401; Interrogation_Position = 1794; Antisense;
    GAGTATGACGATGAGCTGACCTTCT
    >probe:HG-U133A:211184_s_at:182:111; Interrogation_Position = 1924; Antisense;
    ACACACACCAGATGGCATCCTTGGG
    >probe:HG-U133A:211184_s_at:554:25; Interrogation_Position = 1940; Antisense;
    ATCCTTGGGACCTGAATCTATCACC
    >probe:HG-U133A:211184_s_at:485:179; Interrogation_Position = 1961; Antisense;
    CACCCAGGAATCTCAAACTCCCTTT
    >probe:HG-U133A:211184_s_at:144:53; Interrogation_Position = 1997; Antisense;
    AGGGCCAGATAAGGAACAGCTCGGG
    >probe:HG-U133A:211184_s_at:37:421; Interrogation_Position = 2077; Antisense;
    GATCTCAAGGATCCAGACTCTCATT
    >probe:HG-U133A:211184_s_at:515:241; Interrogation_Position = 2110; Antisense;
    CTGGCCCAGTGAATTTGGTCTCTCC
    >probe:HG-U133A:211184_s_at:516:157; Interrogation_Position = 2164; Antisense;
    AATAAGACCCCACTGGAGTCTCTCT
    >probe:HG-U133A:211184_s_at:228:225; Interrogation_Position = 2211; Antisense;
    CTCTGCTCTAATTGCTGCCAGGATT
    >probe:HG-U133A:211184_s_at:490:599; Interrogation_Position = 2226; Antisense;
    TGCCAGGATTGTCACTCCAAACCTT
    Probes sequences for 215418_at:
    >probe:HG-U133A:215418_at:293:203; Interrogation_Position = 2973; Antisense;
    CATCTGTCCCAGTGACTACAGCTTT
    >probe:HG-U133A:215418_at:232:55; Interrogation_Position = 2983; Antisense;
    AGTGACTACAGCTTTTAACTCCTCC
    >probe:HG-U133A:215418_at:247:229; Interrogation_Position = 3001; Antisense;
    CTCCTCCAGCAGACATCTAGTCAGG
    >probe:HG-U133A:215418_at:573:339; Interrogation_Position = 3065; Antisense;
    GCAAGGTAGAACATGGCCACAGAGT
    >probe:HG-U133A:215418_at:572:343; Interrogation_Position = 3110; Antisense;
    GAATCTTGCCTCCAGGAATGGTTAC
    >probe:HG-U133A:215418_at:152:139; Interrogation_Position = 3137; Antisense;
    AACCATCCAGCATCACATCTATGCA
    >probe:HG-U133A:215418_at:602:25; Interrogation_Position = 3153; Antisense;
    ATCTATGCAACCCCACGTTAGGCAT
    >probe:HG-U133A:215418_at:53:91; Interrogation_Position = 3162; Antisense;
    ACCCCACGTTAGGCATTGCGTACAA
    >probe:HG-U133A:215418_at:504:329; Interrogation_Position = 3187; Antisense;
    GCAGATTAGATCATTTCACATTTCA
    >probe:HG-U133A:215418_at:546:519; Interrogation_Position = 3241; Antisense;
    GGAGGTGATGCTTCTTTTGTGAACT
    >probe:HG-U133A:215418_at:401:47; Interrogation_Position = 3288; Antisense;
    AGGTCTGAGGCAGGGCATACACATT
    Probes sequences for 207020_at:
    >probe:HG-U133A:207020_at:334:593; Interrogation_Position = 1353; Antisense;
    TGCTCGGCCACCAGGGTTTTGGTGA
    >probe:HG-U133A:207020_at:227:1; Interrogation_Position = 1369; Antisense;
    TTTTGGTGAAAATGCCGGTGTCCCT
    >probe:HG-U133A:207020_at:106:193; Interrogation_Position = 1400; Antisense;
    CAGATCCCTCATTTGATACTCCAAA
    >probe:HG-U133A:207020_at:634:285; Interrogation_Position = 1420; Antisense;
    CCAAAACCATCACCATGTACCATGT
    >probe:HG-U133A:207020_at:165:399; Interrogation_Position = 1522; Antisense;
    GAGAGTGCTTGTAGGTGGTTGGTTA
    >probe:HG-U133A:207020_at:333:409; Interrogation_Position = 1561; Antisense;
    GAGGCCATGAAAGAACCTGTCCTTC
    >probe:HG-U133A:207020_at:446:531; Interrogation_Position = 1587; Antisense;
    GGAAAAGTGGTCCATGTCTGTGCTG
    >probe:HG-U133A:207020_at:63:65; Interrogation_Position = 1618; Antisense;
    AGAGGGCTTGCTTAGGGCAGCTTCT
    >probe:HG-U133A:207020_at:637:385; Interrogation_Position = 1655; Antisense;
    GACCATGACCTGTAGGTTCACCTAT
    >probe:HG-U133A:207020_at:668:585; Interrogation_Position = 1693; Antisense;
    TGAATAACCATCTACCCCATTAGTC
    >probe:HG-U133A:207020_at:243:695; Interrogation_Position = 1753; Antisense;
    TTCATTCTCTCTCTAGTAATTGTAA
    Probes sequences for 209393_s_at:
    >probe:HG-U133A:209393_s_at:15:561; Interrogation_Position = 434; Antisense;
    TGGTGGCAAGTGGATTATTCGGCTG
    >probe:HG-U133A:209393_s_at:119:5; Interrogation_Position = 450; Antisense;
    ATTCGGCTGCGGAAGGGCTTGGCCT
    >probe:HG-U133A:209393_s_at:67:549; Interrogation_Position = 470; Antisense;
    GGCCTCCCGTTGCTGGGAGAATCTC
    >probe:HG-U133A:209393_s_at:533:393; Interrogation_Position = 486; Antisense;
    GAGAATCTCATTTTGGCCATGCTGG
    >probe:HG-U133A:209393_s_at:651:409; Interrogation_Position = 509; Antisense;
    GGGGGAACAGTTCATGGTTGGGGAG
    >probe:HG-U133A:209393_s_at:162:431; Interrogation_Position = 525; Antisense;
    GTTGGGGAGGAGATCTGTGGGGCTG
    >probe:HG-U133A:209393_s_at:650:479; Interrogation_Position = 552; Antisense;
    GTGTCTGTCCGCTTTCAGGAAGACA
    >probe:HG-U133A:209393_s_at:356:69; Interrogation_Position = 595; Antisense;
    AGACTGCCAGTGACCAAGCAACCAC
    >probe:HG-U133A:209393_s_at:455:107; Interrogation_Position = 618; Antisense;
    ACAGCCCGAATCCGGGACACACTTC
    >probe:HG-U133A:209393_s_at:36:371; Interrogation_Position = 633; Antisense;
    GACACACTTCGGCGAGTGCTTAACC
    >probe:HG-U133A:209393_s_at:361:19; Interrogation_Position = 680; Antisense;
    ATACAAAACTCACACCGACAGCATC
    Probes sequences for 210101_x_at:
    >probe:HG-U133A:210101_x_at:317:339; Interrogation_Position = 681; Antisense;
    GAATCAGCAGTACACATGCCCATCA
    >probe:HG-U133A:210101_x_at:397:201; Interrogation_Position = 701; Antisense;
    CATCACCTTCGCTGTCTGAATGACT
    >probe:HG-U133A:210101_x_at:701:81; Interrogation_Position = 733; Antisense;
    AGCCCAGATGACTTACTATGCACAG
    >probe:HG-U133A:210101_x_at:407:59; Interrogation_Position = 797; Antisense;
    AGTTTTCCATCCAATTATCTTAGTA
    >probe:HG-U133A:210101_x_at:689:21; Interrogation_Position = 828; Antisense;
    ATCAGACTTCTGTGACACCTGTACC
    >probe:HG-U133A:210101_x_at:415:371; Interrogation_Position = 841; Antisense;
    GACACCTGTACCATCAGTTTTACCA
    >probe:HG-U133A:210101_x_at:590:709; Interrogation_Position = 873; Antisense;
    TTGGTTCTTCTGCCATGGCTTCAAC
    >probe:HG-U133A:210101_x_at:616:167; Interrogation_Position = 897; Antisense;
    CAAGTGGCCTAGTAATCACCTCTCC
    >probe:HG-U133A:210101_x_at:202:273; Interrogation_Position = 920; Antisense;
    CCTTCCAACCTCAGTGACCTTAAGG
    >probe:HG-U133A:210101_x_at:148:119; Interrogation_Position = 961; Antisense;
    AAAGGCCAGGGTTCTCTATGATTAT
    >probe:HG-U133A:210101_x_at:633:149; Interrogation_Position = 1152; Antisense;
    AAGGTTGCCCATCATGACTTTGTAT
    Probes sequences for 212052_s_at:
    >probe:HG-U133A:212052_s_at:45:313; Interrogation_Position = 3981; Antisense;
    GCTGCCTTAGACAGATTCCCTGGGC
    >probe:HG-U133A:212052_s_at:337:181; Interrogation_Position = 4058; Antisense;
    CACCTTCCTTACACCTGGTGGGAGC
    >probe:HG-U133A:212052_s_at:179:663; Interrogation_Position = 4214; Antisense;
    TATGTGGTATGGGGGTCATTCCTGC
    >probe:HG-U133A:212052_s_at:599:147; Interrogation_Position = 4281; Antisense;
    AAGTGATGGAACCCTCAGGTGCTCT
    >probe:HG-U133A:212052_s_at:235:81; Interrogation_Position = 4311; Antisense;
    AGCCTGAACCTCCTGACTGAGGAAC
    >probe:HG-U133A:212052_s_at:554:177; Interrogation_Position = 4375; Antisense;
    CACAGGCGTGGCTGTACACGTGCTC
    >probe:HG-U133A:212052_s_at:680:221; Interrogation_Position = 4404; Antisense;
    CTCATCATCCTTTCCAGTAACTTTA
    >probe:HG-U133A:212052_s_at:212:123; Interrogation_Position = 4429; Antisense;
    AAAAAACATCCCTCAGGTCCTGATA
    >probe:HG-U133A:212052_s_at:183:507; Interrogation_Position = 4444; Antisense;
    GGTCCTGATATATTTCCTTGGATTC
    >probe:HG-U133A:212052_s_at:505:709; Interrogation_Position = 4476; Antisense;
    TTGGCTAGAAATTACACTGTGCTCA
    >probe:HG-U133A:212052_s_at:38:569; Interrogation_Position = 4493; Antisense;
    TGTGCTCAATGCCTTAATAAATCCC
    Probes sequences for 204461_x_at:
    >probe:HG-U133A:204461_x_at:265:417; Interrogation_Position = 1339; Antisense;
    GATGAAGTTCTTATTCTGAGTACAG
    >probe:HG-U133A:204461_x_at:617:401; Interrogation_Position = 1356; Antisense;
    GAGTACAGTACTCTTTGTCATTTCA
    >probe:HG-U133A:204461_x_at:227:523; Interrogation_Position = 1422; Antisense;
    GGAGCAAGGTCATGTACCCTAATAG
    >probe:HG-U133A:204461_x_at:361:461; Interrogation_Position = 1504; Antisense;
    GTCATAGATTATTCAGGACTGTCCT
    >probe:HG-U133A:204461_x_at:569:527; Interrogation_Position = 1519; Antisense;
    GGACTGTCCTTTAGTTCTGTCTTTT
    >probe:HG-U133A:204461_x_at:306:619; Interrogation_Position = 1534; Antisense;
    TCTGTCTTTTGAACTCATGGGAATA
    >probe:HG-U133A:204461_x_at:620:401; Interrogation_Position = 1564; Antisense;
    GAGTCAGCGTAACATTTCAAGAGTC
    >probe:HG-U133A:204461_x_at:540:1; Interrogation_Position = 1583; Antisense;
    AGAGTCTAAAGGTGGCCGGGTGTGG
    >probe:HG-U133A:204461_x_at:256:121; Interrogation_Position = 1670; Antisense;
    AAACCAGCCTGACCAACGTGGAGAA
    >probe:HG-U133A:204461_x_at:668:7; Interrogation_Position = 1722; Antisense;
    ATTAGCCGGACGTGGTGGCACATGC
    >probe:HG-U133A:204461_x_at:514:79; Interrogation_Position = 1817; Antisense;
    AGCTGAAATGACGCCATTGCACTCC
    Probes sequences for 215011_at:
    >probe:HG-U133A:215011_at:319:709; Interrogation_Position = 1806; Antisense;
    TTGTACTGAGCTGAGACCTTGCCAC
    >probe:HG-U133A:215011_at:250:709; Interrogation_Position = 1890; Antisense;
    TTGTATTTGTGCTTACCTGTGCCAG
    >probe:HG-U133A:215011_at:115:97; Interrogation_Position = 1917; Antisense;
    ACTGTTCTAGGTAAGCACTAAGTGG
    >probe:HG-U133A:215011_at:107:483; Interrogation_Position = 1938; Antisense;
    GTGGGCTTTAATACAGCATATTCCA
    >probe:HG-U133A:215011_at:355:323; Interrogation_Position = 1953; Antisense;
    GCATATTCCAATGGGGAATCCCAGG
    >probe:HG-U133A:215011_at:484:339; Interrogation_Position = 1968; Antisense;
    GAATCCCAGGAACCAAAAGACTAAT
    >probe:HG-U133A:215011_at:36:143; Interrogation_Position = 1984; Antisense;
    AAGACTAATTGTCCAAGTCCACAAC
    >probe:HG-U133A:215011_at:276:165; Interrogation_Position = 1990; Antisense;
    AATTGTCCAAGTCCACAACTAGAAG
    >probe:HG-U133A:215011_at:431:329; Interrogation_Position = 2025; Antisense;
    GCAGAAACAAGCATCAAATTCCCTG
    >probe:HG-U133A:215011_at:661:169; Interrogation_Position = 2032; Antisense;
    CAAGCATCAAATTCCCTGCTCAGGA
    >probe:HG-U133A:215011_at:531:113; Interrogation_Position = 2040; Antisense;
    AAATTCCCTGCTCAGGAAGAAGCCA
    Probes sequences for 205367_at:
    >probe:HG-U133A:205367_at:302:291; Interrogation_Position = 1563; Antisense;
    GCCAGTGTCACGTACAGCATCTGTG
    >probe:HG-U133A:205367_at:656:477; Interrogation_Position = 1567; Antisense;
    GTGTCACGTACAGCATCTGTGGTTC
    >probe:HG-U133A:205367_at:659:455; Interrogation_Position = 1569; Antisense;
    GTCACGTACAGCATCTGTGGTTCCA
    >probe:HG-U133A:205367_at:449:181; Interrogation_Position = 1571; Antisense;
    CACGTACAGCATCTGTGGTTCCAGT
    >probe:HG-U133A:205367_at:325:453; Interrogation_Position = 1574; Antisense;
    GTACAGCATCTGTGGTTCCAGTCTG
    >probe:HG-U133A:205367_at:686:187; Interrogation_Position = 1577; Antisense;
    CAGCATCTGTGGTTCCAGTCTGTGC
    >probe:HG-U133A:205367_at:80:77; Interrogation_Position = 1578; Antisense;
    AGCATCTGTGGTTCCAGTCTGTGCT
    >probe:HG-U133A:205367_at:395:183; Interrogation_Position = 1628; Antisense;
    CACCCCATCCCACTGGAGTCAGGGG
    >probe:HG-U133A:205367_at:404:261; Interrogation_Position = 1630; Antisense;
    CCCCATCCCACTGGAGTCAGGGGGC
    >probe:HG-U133A:205367_at:67:249; Interrogation_Position = 2008; Antisense;
    CGCCGTGGAGAACCAGTACTCCTTC
    >probe:HG-U133A:205367_at:2:249; Interrogation_Position = 2010; Antisense;
    CCGTGGAGAACCAGTACTCCTTCTA
    Probes sequences for 219203_at:
    >probe:HG-U133A:219203_at:405:265; Interrogation_Position = 335; Antisense;
    CCTGTCCGTCATGTTGGAGGTCGCC
    >probe:HG-U133A:219203_at:608:613; Interrogation_Position = 355; Antisense;
    TCGCCCTCAACCAGGTGGATGTGTG
    >probe:HG-U133A:219203_at:32:463; Interrogation_Position = 394; Antisense;
    GTCTGGTGGTGGCTGGTTACTACCA
    >probe:HG-U133A:219203_at:536:437; Interrogation_Position = 409; Antisense;
    GTTACTACCATGCCAATGCAGCTGT
    >probe:HG-U133A:219203_at:118:333; Interrogation_Position = 426; Antisense;
    GCAGCTGTGAACGATCAGAGCCCTG
    >probe:HG-U133A:219203_at:240:341; Interrogation_Position = 486; Antisense;
    GAATTCTTCCCTGATGCAGTACTTA
    >probe:HG-U133A:219203_at:649:619; Interrogation_Position = 581; Antisense;
    TCTCCGCTGGGTCCCTAAGGATAAG
    >probe:HG-U133A:219203_at:411:539; Interrogation_Position = 640; Antisense;
    GGCAGATGGTGGGAGCTCTACTGGA
    >probe:HG-U133A:219203_at:9:653; Interrogation_Position = 658; Antisense;
    TACTGGAAGATCGGGCCCACCAGCA
    >probe:HG-U133A:219203_at:15:189; Interrogation_Position = 678; Antisense;
    CAGCACCTTGTGGACTTTGACTGCC
    >probe:HG-U133A:219203_at:539:599; Interrogation_Position = 699; Antisense;
    TGCCACCTTGATGACATCCGGCAGG
    Probes sequences for 215067_x_at:
    >probe:HG-U133A:215067_x_at:438:129; Interrogation_Position = 2661; Antisense;
    AACAAACAGGCCGAGCGCGGTGGCT
    >probe:HG-U133A:215067_x_at:42:393; Interrogation_Position = 2859; Antisense;
    GAGACTCCCTTGAACCCGGGAGGAG
    >probe:HG-U133A:215067_x_at:543:193; Interrogation_Position = 2888; Antisense;
    CAGTAAGCCGAGATCACACCACTGC
    >probe:HG-U133A:215067_x_at:507:603; Interrogation_Position = 2915; Antisense;
    TCCAGCCTGGGCGCCAGTGTGAGAC
    >probe:HG-U133A:215067_x_at:154:449; Interrogation_Position = 2973; Antisense;
    GTAGGGGTGGGGCAGAGGGACAAAG
    >probe:HG-U133A:215067_x_at:615:197; Interrogation_Position = 3022; Antisense;
    CAGGAAGATGCTGTGTCTCATGCCT
    >probe:HG-U133A:215067_x_at:538:463; Interrogation_Position = 3036; Antisense;
    GTCTCATGCCTGTAGATGCTGTGTC
    >probe:HG-U133A:215067_x_at:504:599; Interrogation_Position = 3042; Antisense;
    TGCCTGTAGATGCTGTGTCTCATGC
    >probe:HG-U133A:215067_x_at:327:659; Interrogation_Position = 3070; Antisense;
    TAGTTCCACCTACTCTGGAGAGTGA
    >probe:HG-U133A:215067_x_at:237:45; Interrogation_Position = 3116; Antisense;
    AGGAATTTGAGTCGAGCCTAGGCAA
    >probe:HG-U133A:215067_x_at:583:657; Interrogation_Position = 3134; Antisense;
    TAGGCAACATAGCAATACCCCATCT
    Probes sequences for 221932_s_at:
    >probe:HG-U133A:221932_s_at:429:75; Interrogation_Position = 89; Antisense;
    AGCAGAGAGGGAGCCGTTCATGTCA
    >probe:HG-U133A:221932_s_at:382:437; Interrogation_Position = 104; Antisense;
    GTTCATGTCAGAGACTCACTGCCAG
    >probe:HG-U133A:221932_s_at:476:179; Interrogation_Position = 120; Antisense;
    CACTGCCAGAAAAGCCTTACCTTTT
    >probe:HG-U133A:221932_s_at:375:707; Interrogation_Position = 157; Antisense;
    TTGAGACCGCAACTGCTTGCACTGA
    >probe:HG-U133A:221932_s_at:337:1; Interrogation_Position = 208; Antisense;
    TTTTAGTTGGTCTGGTGTTCGGGCT
    >probe:HG-U133A:221932_s_at:322:455; Interrogation_Position = 329; Antisense;
    GTCACTTATTCTTTGCCTGATTCAG
    >probe:HG-U133A:221932_s_at:519:343; Interrogation_Position = 373; Antisense;
    GAATCATTATTCATGACCCCTCTGC
    >probe:HG-U133A:221932_s_at:678:383; Interrogation_Position = 387; Antisense;
    GACCCCTCTGCAAATGTGTCAGTCT
    >probe:HG-U133A:221932_s_at:141:465; Interrogation_Position = 408; Antisense;
    GTCTCCAAAGAGAGTATCTCCCCCC
    >probe:HG-U133A:221932_s_at:623:25; Interrogation_Position = 423; Antisense;
    ATCTCCCCCCAAATTTTGTGTAGCT
    >probe:HG-U133A:221932_s_at:75:569; Interrogation_Position = 535; Antisense;
    TGTGCATGCCTGAGTTGATTCCGAA
    Probes sequences for 201239_s_at:
    >probe:HG-U133A:201239_s_at:581:557; Interrogation_Position = 275; Antisense;
    TGGTCTAATTGATGGTCGCCTCACC
    >probe:HG-U133A:201239_s_at:305:201; Interrogation_Position = 299; Antisense;
    CATCTGTACAATCTCCTGTTTCTTT
    >probe:HG-U133A:201239_s_at:688:227; Interrogation_Position = 311; Antisense;
    CTCCTGTTTCTTTGCCATAGTGGCT
    >probe:HG-U133A:201239_s_at:9:671; Interrogation_Position = 340; Antisense;
    TTTGGGATTATATGCACCCCTTTCC
    >probe:HG-U133A:201239_s_at:547:601; Interrogation_Position = 362; Antisense;
    TCCAGAGTCCAAACCCGTTTTGGCT
    >probe:HG-U133A:201239_s_at:642:709; Interrogation_Position = 381; Antisense;
    TTGGCTTTGTGTGTCATATCCTATT
    >probe:HG-U133A:201239_s_at:498:393; Interrogation_Position = 447; Antisense;
    GAGAAGAGCATCTTTCTCGTGGCCC
    >probe:HG-U133A:201239_s_at:7:617; Interrogation_Position = 461; Antisense;
    TCTCGTGGCCCACAGGAAAGATCCT
    >probe:HG-U133A:201239_s_at:690:415; Interrogation_Position = 501; Antisense;
    GATGATATTTGGCAGCTGTCCTCCA
    >probe:HG-U133A:201239_s_at:18:357; Interrogation_Position = 597; Antisense;
    GAAGCCGAGTTCACAAAGTCCATTG
    >probe:HG-U133A:201239_s_at:328:603; Interrogation_Position = 678; Antisense;
    TCCAGGCTCCATGACAGTCTTGCCA
    Probes sequences for 212517_at:
    >probe:HG-U133A:212517_at:689:501; Interrogation_Position = 7478; Antisense;
    GGTGGTTTCTAGATAGCCAGGCCCA
    >probe:HG-U133A:212517_at:650:41; Interrogation_Position = 7496; Antisense;
    AGGCCCACCAAGAGATATTGCCCCT
    >probe:HG-U133A:212517_at:416:399; Interrogation_Position = 7507; Antisense;
    GAGATATTGCCCCTTGATGAGAGTC
    >probe:HG-U133A:212517_at:373:207; Interrogation_Position = 7519; Antisense;
    CTTGATGAGAGTCAAACACCCTGCC
    >probe:HG-U133A:212517_at:371:409; Interrogation_Position = 7569; Antisense;
    GAGGAAAATTGGCACCTCATCTTTT
    >probe:HG-U133A:212517_at:274:537; Interrogation_Position = 7579; Antisense;
    GGCACCTCATCTTTTAAAGGCAGTA
    >probe:HG-U133A:212517_at:210:109; Interrogation_Position = 7650; Antisense;
    ACACTAGTGAAGCCTGTTTCGTTGA
    >probe:HG-U133A:212517_at:96:687; Interrogation_Position = 7719; Antisense;
    TTACGATTTCGTTTGTTTGGATTCA
    >probe:HG-U133A:212517_at:698:11; Interrogation_Position = 7767; Antisense;
    ATTTAGCATCTATTACACTCATGTA
    >probe:HG-U133A:212517_at:295:625; Interrogation_Position = 7819; Antisense;
    TCATTGCGGGGATTGTGGGTGTTAT
    >probe:HG-U133A:212517_at:447:357; Interrogation_Position = 7918; Antisense;
    GAACTAAGGGAGGTCTGTGCATTTT
    Probes sequences for 220215_at:
    >probe:HG-U133A:220215_at:51:167; Interrogation_Position = 1319; Antisense;
    CAATGTGATCAAGCCTTCAGTCGCC
    >probe:HG-U133A:220215_at:404:611; Interrogation_Position = 1349; Antisense;
    TCCCTTCACCTCCACGAAAGAATTC
    >probe:HG-U133A:220215_at:295:635; Interrogation_Position = 1354; Antisense;
    TCACCTCCACGAAAGAATTCATACT
    >probe:HG-U133A:220215_at:63:155; Interrogation_Position = 1404; Antisense;
    AATGCGGTAAAGCCTACACTCGTTC
    >probe:HG-U133A:220215_at:211:57; Interrogation_Position = 1430; Antisense;
    AGTCACCTTACTCGCCATGAAAGAA
    >probe:HG-U133A:220215_at:69:89; Interrogation_Position = 1434; Antisense;
    ACCTTACTCGCCATGAAAGAAGTCA
    >probe:HG-U133A:220215_at:103:351; Interrogation_Position = 1452; Antisense;
    GAAGTCATGATATAGAGGCTGGGTG
    >probe:HG-U133A:220215_at:681:307; Interrogation_Position = 1469; Antisense;
    GCTGGGTGTAGTGACTCAGCCTATA
    >probe:HG-U133A:220215_at:2:503; Interrogation_Position = 1473; Antisense;
    GGTGTAGTGACTCAGCCTATAATCC
    >probe:HG-U133A:220215_at:203:85; Interrogation_Position = 1565; Antisense;
    AACAATGTGAAACAACCTGGGCAAC
    >probe:HG-U133A:220215_at:343:157; Interrogation_Position = 1615; Antisense;
    AATAAAATAATGCCGGGCACGGTGG
    Probes sequences for 201923_at:
    >probe:HG-U133A:201923_at:411:475; Interrogation_Position = 424; Antisense;
    GTGTCCAACTGAAATTATCGCTTTT
    >probe:HG-U133A:201923_at:470:451; Interrogation_Position = 491; Antisense;
    GTAGCATGCTCTGTTGATTCACAGT
    >probe:HG-U133A:201923_at:513:423; Interrogation_Position = 506; Antisense;
    GATTCACAGTTTACCCATTTGGCCT
    >probe:HG-U133A:201923_at:581:555; Interrogation_Position = 530; Antisense;
    TGGATTAATACCCCTCGAAGACAAG
    >probe:HG-U133A:201923_at:214:643; Interrogation_Position = 570; Antisense;
    TAAGGATTCCACTTCTTTCAGATTT
    >probe:HG-U133A:201923_at:40:385; Interrogation_Position = 595; Antisense;
    GACCCATCAGATCTCAAAGGACTAT
    >probe:HG-U133A:201923_at:612:41; Interrogation_Position = 640; Antisense;
    AGGCCACACTCTTAGAGGTCTCTTC
    >probe:HG-U133A:201923_at:264:653; Interrogation_Position = 697; Antisense;
    TACTCTGAATGATCTTCCTGTGGGT
    >probe:HG-U133A:201923_at:661:515; Interrogation_Position = 730; Antisense;
    GGATGAGACACTACGTTTGGTTCAA
    >probe:HG-U133A:201923_at:363:443; Interrogation_Position = 744; Antisense;
    GTTTGGTTCAAGCATTCCAGTACAC
    >probe:HG-U133A:201923_at:593:363; Interrogation_Position = 815; Antisense;
    GAAACAATAATCCCAGATCCAGCTG
    Probes sequences for 215553_x_at:
    >probe:HG-U133A:215553_x_at:478:333; Interrogation_Position = 1517; Antisense;
    GCACCATGCTGAGCCTGGGAGGTAA
    >probe:HG-U133A:215553_x_at:207:587; Interrogation_Position = 1550; Antisense;
    TGAATGAGTGGACCTTGTGGGTTCC
    >probe:HG-U133A:215553_x_at:436:75; Interrogation_Position = 1561; Antisense;
    ACCTTGTGGGTTCCATGCTAGTTAA
    >probe:HG-U133A:215553_x_at:692:645; Interrogation_Position = 1583; Antisense;
    TAAAGACACCCTATGAGACCAGCCT
    >probe:HG-U133A:215553_x_at:519:487; Interrogation_Position = 1817; Antisense;
    GGGCCAGGCAAGGTGGGTCACACCT
    >probe:HG-U133A:215553_x_at:130:343; Interrogation_Position = 1874; Antisense;
    GAATCACTTGAGTCCGGCAGTTTGA
    >probe:HG-U133A:215553_x_at:668:131; Interrogation_Position = 1912; Antisense;
    AACATAATGAGACCGCATCTTCACA
    >probe:HG-U133A:215553_x_at:325:395; Interrogation_Position = 1920; Antisense;
    GAGACCGCATCTTCACAAAAAATAT
    >probe:HG-U133A:215553_x_at:571:39; Interrogation_Position = 1962; Antisense;
    ATGGTGAAGTGTGCCTGTGGTCCCA
    >probe:HG-U133A:215553_x_at:80:401; Interrogation_Position = 2009; Antisense;
    GAGTATTGCCTGAGCCCAGGAGTTC
    >probe:HG-U133A:215553_x_at:307:545; Interrogation_Position = 2043; Antisense;
    GGCTGCAGTGAACTGTGATCACACC
    Probes sequences for 215609_at:
    >probe:HG-U133A:215609_at:252:417; Interrogation_Position = 1247; Antisense;
    GATGCTTCAGTAAGCCCAGATCACA
    >probe:HG-U133A:215609_at:191:33; Interrogation_Position = 1248; Antisense;
    ATGCTTCAGTAAGCCCAGATCACAC
    >probe:HG-U133A:215609_at:574:125; Interrogation_Position = 1320; Antisense;
    AAAAAGACATAGAGCCAGGCGCCTG
    >probe:HG-U133A:215609_at:224:287; Interrogation_Position = 1364; Antisense;
    GCCAAGGCGTGAGGACTGTTTGAGG
    >probe:HG-U133A:215609_at:216:29; Interrogation_Position = 1368; Antisense;
    AGGCGTGAGGACTGTTTGAGGTCAG
    >probe:HG-U133A:215609_at:353:305; Interrogation_Position = 1370; Antisense;
    GCGTGAGGACTGTTTGAGGTCAGGA
    >probe:HG-U133A:215609_at:386:85; Interrogation_Position = 1493; Antisense;
    AGCCACTGGCAAGGCTGAGGCACGA
    >probe:HG-U133A:215609_at:305:519; Interrogation_Position = 1540; Antisense;
    GGAGGCTGCGGTGAGGTGAGATCAC
    >probe:HG-U133A:215609_at:636:63; Interrogation_Position = 1590; Antisense;
    AGAGTGAGATTATGTCTTAAGAAAA
    >probe:HG-U133A:215609_at:78:121; Interrogation_Position = 1617; Antisense;
    AAAGAGTCTGGGCGTGATGTCTCAT
    >probe:HG-U133A:215609_at:267:403; Interrogation_Position = 1620; Antisense;
    GAGTCTGGGCGTGATGTCTCATGCC
    Probes sequences for 207984_s_at:
    >probe:HG-U133A:207984_s_at:679:289; Interrogation_Position = 2873; Antisense;
    GCGCCATTTACACTGTAGTCTGTAC
    >probe:HG-U133A:207984_s_at:345:113; Interrogation_Position = 2896; Antisense;
    ACAACCTGTGGTTCCACGTGCATGT
    >probe:HG-U133A:207984_s_at:232:237; Interrogation_Position = 2932; Antisense;
    CTGTGCCTCTGGCACCAGGTTGTGT
    >probe:HG-U133A:207984_s_at:574:19; Interrogation_Position = 3071; Antisense;
    ATCAGCCCTCATTTCTTATAATACC
    >probe:HG-U133A:207984_s_at:603:677; Interrogation_Position = 3086; Antisense;
    TTATAATACCCTGATCCCAGACTCC
    >probe:HG-U133A:207984_s_at:609:299; Interrogation_Position = 3170; Antisense;
    GCCCCCCTGCCATAACTTGGGGAGG
    >probe:HG-U133A:207984_s_at:550:155; Interrogation_Position = 3231; Antisense;
    AATGCTGTCCTAGATGTACTTGGGC
    >probe:HG-U133A:207984_s_at:259:613; Interrogation_Position = 3349; Antisense;
    TCCCAGTTCTTCCAAGTGTCTAACT
    >probe:HG-U133A:207984_s_at:72:219; Interrogation_Position = 3368; Antisense;
    CTAACTAGTCTTCGCTGCAGCGTCA
    >probe:HG-U133A:207984_s_at:391:249; Interrogation_Position = 3380; Antisense;
    CGCTGCAGCGTCAGCCAAAGCTGGC
    >probe:HG-U133A:207984_s_at:398:459; Interrogation_Position = 3417; Antisense;
    GTGTGCCCATTTCCTAGGGAAGGGG
    Probes sequences for 215373_x_at:
    >probe:HG-U133A:215373_x_at:419:35; Interrogation_Position = 1503; Antisense;
    ATGAGGCCAGGGGCAGTGGCTCTCT
    >probe:HG-U133A:215373_x_at:20:287; Interrogation_Position = 1537; Antisense;
    CCAACACTTCAGGAAAGCCGGGAGG
    >probe:HG-U133A:215373_x_at:423:365; Interrogation_Position = 1549; Antisense;
    GAAAGCCGGGAGGAAGGATCGCTTA
    >probe:HG-U133A:215373_x_at:639:347; Interrogation_Position = 1561; Antisense;
    GAAGGATCGCTTAAGGACAGGAGTT
    >probe:HG-U133A:215373_x_at:479:83; Interrogation_Position = 1593; Antisense;
    AGCCTAAGCAACAGATCCAGACCCT
    >probe:HG-U133A:215373_x_at:696:501; Interrogation_Position = 1643; Antisense;
    GGTGTGGTGGCGTACACCTTTGGTC
    >probe:HG-U133A:215373_x_at:631:347; Interrogation_Position = 1689; Antisense;
    GAAGGAGGAGGATTGCTGGAGCCCA
    >probe:HG-U133A:215373_x_at:77:389; Interrogation_Position = 1732; Antisense;
    GAGCCATGACTGTGACACTGCACTC
    >probe:HG-U133A:215373_x_at:194:363; Interrogation_Position = 1799; Antisense;
    GAAAACATGACCAGGCATGGTGGCT
    >probe:HG-U133A:215373_x_at:7:255; Interrogation_Position = 2011; Antisense;
    CCCAGGAGGTGGAGTTTGCAGTAGG
    >probe:HG-U133A:215373_x_at:166:675; Interrogation_Position = 2025; Antisense;
    TTTGCAGTAGGCCAAGATCGCACCA
    Probes sequences for 216110_x_at:
    >probe:HG-U133A:216110_x_at:476:7; Interrogation_Position = 1856; Antisense;
    ATTCTTCTAAATAACTCATGAGTCC
    >probe:HG-U133A:216110_x_at:499:351; Interrogation_Position = 1901; Antisense;
    GAAGAGTTCCTTCTCAGTTTCTAGC
    >probe:HG-U133A:216110_x_at:230:435; Interrogation_Position = 1906; Antisense;
    GTTCCTTCTCAGTTTCTAGCATATG
    >probe:HG-U133A:216110_x_at:260:689; Interrogation_Position = 1911; Antisense;
    TTCTCAGTTTCTAGCATATGATGAT
    >probe:HG-U133A:216110_x_at:271:161; Interrogation_Position = 1956; Antisense;
    AATAGTGAAAATGTCCTAGCTTTCT
    >probe:HG-U133A:216110_x_at:317:155; Interrogation_Position = 1965; Antisense;
    AATGTCCTAGCTTTCTTTAAATAAT
    >probe:HG-U133A:216110_x_at:470:407; Interrogation_Position = 1996; Antisense;
    GAGGGGCTGGGCATAATGGCTCACA
    >probe:HG-U133A:216110_x_at:616:249; Interrogation_Position = 2029; Antisense;
    CCCAACACTTTAAGAGGCTGAGATG
    >probe:HG-U133A:216110_x_at:264:399; Interrogation_Position = 2048; Antisense;
    GAGATGGAAGGATTCCTTCAGCCCA
    >probe:HG-U133A:216110_x_at:296:429; Interrogation_Position = 2058; Antisense;
    GATTCCTTCAGCCCAGGAGTTTGAA
    >probe:HG-U133A:216110_x_at:42:635; Interrogation_Position = 2065; Antisense;
    TCAGCCCAGGAGTTTGAAACCAGCC
    Probes sequences for 213351_s_at:
    >probe:HG-U133A:213351_s_at:344:555; Interrogation_Position = 3329; Antisense;
    TGGAAGTCACTTTGATAGATGGATT
    >probe:HG-U133A:213351_s_at:676:267; Interrogation_Position = 3428; Antisense;
    CCTCCAGCCAAATAGTCCCATGCTG
    >probe:HG-U133A:213351_s_at:339:115; Interrogation_Position = 3437; Antisense;
    AAATAGTCCCATGCTGACTTTCTAT
    >probe:HG-U133A:213351_s_at:546:119; Interrogation_Position = 3473; Antisense;
    AAACTGTCTTAGGAAGGACCTTCAG
    >probe:HG-U133A:213351_s_at:51:185; Interrogation_Position = 3541; Antisense;
    CACCAGACCCAGAAGTTGTACGCAT
    >probe:HG-U133A:213351_s_at:490:381; Interrogation_Position = 3546; Antisense;
    GACCCAGAAGTTGTACGCATTTAAT
    >probe:HG-U133A:213351_s_at:522:709; Interrogation_Position = 3674; Antisense;
    TTGTCTTGTCAGAGCTCTTCTATAA
    >probe:HG-U133A:213351_s_at:592:459; Interrogation_Position = 3681; Antisense;
    GTCAGAGCTCTTCTATAACCTCGCT
    >probe:HG-U133A:213351_s_at:266:627; Interrogation_Position = 3689; Antisense;
    TCTTCTATAACCTCGCTCTAATGGC
    >probe:HG-U133A:213351_s_at:549:137; Interrogation_Position = 3697; Antisense;
    AACCTCGCTCTAATGGCTTAACAGT
    >probe:HG-U133A:213351_s_at:652:639; Interrogation_Position = 3707; Antisense;
    TAATGGCTTAACAGTTGTTCTGGGT
    Probes sequences for 202021_x_at:
    >probe:HG-U133A:202021_x_at:688:73; Interrogation_Position = 751; Antisense;
    AGCAATACCGTCATGTTTCAGCCAA
    >probe:HG-U133A:202021_x_at:221:171; Interrogation_Position = 773; Antisense;
    CAAGCCCAGAGCCCTAAGATTACAA
    >probe:HG-U133A:202021_x_at:675:427; Interrogation_Position = 858; Antisense;
    GTTACCACCTGAACAGTCCTCGGTG
    >probe:HG-U133A:202021_x_at:407:535; Interrogation_Position = 908; Antisense;
    GGCAGAAGCACCAGCTGTACTACTA
    >probe:HG-U133A:202021_x_at:648:503; Interrogation_Position = 961; Antisense;
    GGTGTCTCCAACCTGACTAGGTGGA
    >probe:HG-U133A:202021_x_at:355:301; Interrogation_Position = 1000; Antisense;
    GCCCTCTTACCGCTAGCGAGGTGAT
    >probe:HG-U133A:202021_x_at:608:477; Interrogation_Position = 1020; Antisense;
    GTGATAGGACATCTGGCTTGCCACA
    >probe:HG-U133A:202021_x_at:293:321; Interrogation_Position = 1035; Antisense;
    GCTTGCCACAAAGGTCTGTTCGACC
    >probe:HG-U133A:202021_x_at:542:435; Interrogation_Position = 1052; Antisense;
    GTTCGACCAGACATATCCTAGCTAA
    >probe:HG-U133A:202021_x_at:105:347; Interrogation_Position = 1094; Antisense;
    GAATGTGAGGCCAACCTTCTATCAG
    >probe:HG-U133A:202021_x_at:314:187; Interrogation_Position = 1202; Antisense;
    CAGCTGCCCAATGCCATGTGAAGTA
    Probes sequences for 215600_x_at:
    >probe:HG-U133A:215600_x_at:146:389; Interrogation_Position = 1457; Antisense;
    GAGCCTGGGAGGTTGAGGCTGCAGT
    >probe:HG-U133A:215600_x_at:305:311; Interrogation_Position = 1474; Antisense;
    GCTGCAGTGAGTCAAGATTGCACCA
    >probe:HG-U133A:215600_x_at:447:707; Interrogation_Position = 1554; Antisense;
    TTGAGGGCAGGGTGGTGGCACACGC
    >probe:HG-U133A:215600_x_at:263:391; Interrogation_Position = 1623; Antisense;
    GAGCTCAGGAGGTTGAAGCTGCAGT
    >probe:HG-U133A:215600_x_at:361:141; Interrogation_Position = 1638; Antisense;
    AAGCTGCAGTGAGATGTGTTTGCGC
    >probe:HG-U133A:215600_x_at:341:53; Interrogation_Position = 1645; Antisense;
    AGTGAGATGTGTTTGCGCCACTGCA
    >probe:HG-U133A:215600_x_at:279:591; Interrogation_Position = 1666; Antisense;
    TGCACTCCAGCTTGGGTGACGGAGC
    >probe:HG-U133A:215600_x_at:187:685; Interrogation_Position = 1677; Antisense;
    TTGGGTGACGGAGCGAGACCCTGTC
    >probe:HG-U133A:215600_x_at:382:341; Interrogation_Position = 1726; Antisense;
    GAATTGCAGTAAAACCGTGTGCTTC
    >probe:HG-U133A:215600_x_at:532:123; Interrogation_Position = 1737; Antisense;
    AAACCGTGTGCTTCTATGAGGGGAT
    >probe:HG-U133A:215600_x_at:458:11; Interrogation_Position = 1767; Antisense;
    ATTTACTTGTGGCTGGGCACGGTGG
    Probes sequences for 209442_x_at:
    >probe:HG-U133A:209442_x_at:664:429; Interrogation_Position = 2894; Antisense;
    GATTCCATTACCTCATATCTCAAAG
    >probe:HG-U133A:209442_x_at:174:357; Interrogation_Position = 2946; Antisense;
    GAAGCCATACAGAAATCACTCCAGA
    >probe:HG-U133A:209442_x_at:146:171; Interrogation_Position = 2979; Antisense;
    CAAAATCTTACTTTCCAGAATCCCA
    >probe:HG-U133A:209442_x_at:283:75; Interrogation_Position = 3079; Antisense;
    AGCATCACCACTAGCAGCATATCAG
    >probe:HG-U133A:209442_x_at:185:73; Interrogation_Position = 3136; Antisense;
    AGAAGAGACTAAACCCTGTGTGCCT
    >probe:HG-U133A:209442_x_at:703:137; Interrogation_Position = 3147; Antisense;
    AACCCTGTGTGCCTGTCAGTATGAA
    >probe:HG-U133A:209442_x_at:93:527; Interrogation_Position = 3183; Antisense;
    GGACTTCTCCAGCAGATGGCAAGCC
    >probe:HG-U133A:209442_x_at:202:339; Interrogation_Position = 3201; Antisense;
    GCAAGCCAAGGCTTAGCCTCCATGA
    >probe:HG-U133A:209442_x_at:12:351; Interrogation_Position = 3227; Antisense;
    GAAGAGGGGTCCAGTGGGTCTGAGC
    >probe:HG-U133A:209442_x_at:510:387; Interrogation_Position = 3313; Antisense;
    GAGCCACTCGTAACAGCGAACGGTC
    >probe:HG-U133A:209442_x_at:632:349; Interrogation_Position = 3417; Antisense;
    GAAGTGTGTGTGTGTTCGCTGCTTC
    Probes sequences for 210131_x_at:
    >probe:HG-U133A:210131_x_at:665:33; Interrogation_Position = 736; Antisense;
    ATGAGGTGGCTGCAAAAACTCCCCT
    >probe:HG-U133A:210131_x_at:362:271; Interrogation_Position = 778; Antisense;
    CCTACTCTCGGCCTAGAAGCAGTTA
    >probe:HG-U133A:210131_x_at:333:439; Interrogation_Position = 799; Antisense;
    GTTATTCTCTCTCCATATTGGGCTT
    >probe:HG-U133A:210131_x_at:63:595; Interrogation_Position = 831; Antisense;
    TGCTGAGGGTCAGCTTTTGGCTCCT
    >probe:HG-U133A:210131_x_at:502:311; Interrogation_Position = 850; Antisense;
    GCTCCTTCTTCCTGAGACAGTGGAA
    >probe:HG-U133A:210131_x_at:502:129; Interrogation_Position = 874; Antisense;
    AACAATGCCAGCTCTGTGGCTTCTG
    >probe:HG-U133A:210131_x_at:412:709; Interrogation_Position = 929; Antisense;
    TTGGTGAGGCTTTGGGTGCCACTGC
    >probe:HG-U133A:210131_x_at:76:99; Interrogation_Position = 949; Antisense;
    ACTGCCTGTGGGTTGCTGGCTTAAA
    >probe:HG-U133A:210131_x_at:558:475; Interrogation_Position = 992; Antisense;
    GTGAGAGCCCAGGCCATTAACACCT
    >probe:HG-U133A:210131_x_at:425:533; Interrogation_Position = 1055; Antisense;
    GGAATTAGTCTGTCCCAGCTAGAGG
    >probe:HG-U133A:210131_x_at:249:59; Interrogation_Position = 1095; Antisense;
    AGTTAGTTCTTGGAGCAGCTGCTTT
    Probes sequences for 217713_x_at:
    >probe:HG-U133A:217713_x_at:601:225; Interrogation_Position = 26; Antisense;
    CTCCGTGGCTCGTGCTTTTAGCCAC
    >probe:HG-U133A:217713_x_at:46:487; Interrogation_Position = 30; Antisense;
    GTGGCTCGTGCTTTTAGCCACTAGG
    >probe:HG-U133A:217713_x_at:196:657; Interrogation_Position = 44; Antisense;
    TAGCCACTAGGCTTTGCTGTGTAAA
    >probe:HG-U133A:217713_x_at:579:141; Interrogation_Position = 100; Antisense;
    AAGAAAGCGTTGCAGGCTGGGCGTG
    >probe:HG-U133A:217713_x_at:297:483; Interrogation_Position = 165; Antisense;
    GTGGGCGGAGAATCACTTGAGCTCA
    >probe:HG-U133A:217713_x_at:597:599; Interrogation_Position = 274; Antisense;
    TGCCTGAAGTCCCAGCTTCTTGGGA
    >probe:HG-U133A:217713_x_at:11:585; Interrogation_Position = 278; Antisense;
    TGAAGTCCCAGCTTCTTGGGAGGCT
    >probe:HG-U133A:217713_x_at:218:411; Interrogation_Position = 303; Antisense;
    GAGGCAGCATAATTGCTTGAACCTG
    >probe:HG-U133A:217713_x_at:47:495; Interrogation_Position = 337; Antisense;
    GGGTTGCAGTGAGCCTAGATAGTGC
    >probe:HG-U133A:217713_x_at:92:389; Interrogation_Position = 347; Antisense;
    GAGCCTAGATAGTGCCACTACACTC
    >probe:HG-U133A:217713_x_at:555:365; Interrogation_Position = 431; Antisense;
    GAAAGCATTGCCATTTCTATGGATT
    Probes sequences for 214707_x_at:
    >probe:HG-U133A:214707_x_at:93:185; Interrogation_Position = 5780; Antisense;
    CACCAAGTGGTCGGGAGCTCTGGCT
    >probe:HG-U133A:214707_x_at:473:77; Interrogation_Position = 5795; Antisense;
    AGCTCTGGCTTGCACCCAGAATAAA
    >probe:HG-U133A:214707_x_at:141:403; Interrogation_Position = 5860; Antisense;
    GAGTCATTTCTCACTTATGGCACTG
    >probe:HG-U133A:214707_x_at:557:133; Interrogation_Position = 5937; Antisense;
    AACTTCTCACTAATTTGCATGTCTT
    >probe:HG-U133A:214707_x_at:684:197; Interrogation_Position = 5954; Antisense;
    CATGTCTTAGGCTCCTCTTTATTAG
    >probe:HG-U133A:214707_x_at:264:679; Interrogation_Position = 6004; Antisense;
    TTTAGGCCAGGTACGGTGGCTCATG
    >probe:HG-U133A:214707_x_at:182:685; Interrogation_Position = 6045; Antisense;
    TTAGGGAGGCCGAGGTGTGCAGATC
    >probe:HG-U133A:214707_x_at:577:363; Interrogation_Position = 6111; Antisense;
    GAAACTCCATCTCTACTCAAAATAC
    >probe:HG-U133A:214707_x_at:221:565; Interrogation_Position = 6150; Antisense;
    TGTGGAGGCACACGCCTGTAATCCC
    >probe:HG-U133A:214707_x_at:462:81; Interrogation_Position = 6192; Antisense;
    AGCCGGGAGGATCTCTTGAACCTGG
    >probe:HG-U133A:214707_x_at:292:391; Interrogation_Position = 6234; Antisense;
    GAGCTTAGATGGTGCCACTGTACTC
    Probes sequences for 216922_x_at:
    >probe:HG-U133A:216922_x_at:248:55; Interrogation_Position = 1021; Antisense;
    AGTCGCCACTGGATATCAGTTCCCT
    >probe:HG-U133A:216922_x_at:555:685; Interrogation_Position = 1054; Antisense;
    TTATCAGCCATTTCCTGCTTATCCA
    >probe:HG-U133A:216922_x_at:562:59; Interrogation_Position = 1079; Antisense;
    AGTTCACCATTTCAGGTCACTGCTG
    >probe:HG-U133A:216922_x_at:124:677; Interrogation_Position = 1136; Antisense;
    TTTCCTGCTTATCCAAATTCACCAT
    >probe:HG-U133A:216922_x_at:176:693; Interrogation_Position = 1153; Antisense;
    TTCACCATTTCAAGTCGCCACTGGA
    >probe:HG-U133A:216922_x_at:363:323; Interrogation_Position = 1205; Antisense;
    GCATTTCCTGCTTATCCAAATTCAC
    >probe:HG-U133A:216922_x_at:274:115; Interrogation_Position = 1222; Antisense;
    AAATTCACCAGTTCAGGTCACCACT
    >probe:HG-U133A:216922_x_at:212:147; Interrogation_Position = 1294; Antisense;
    AAGTTCACCATTTCAGGTCACCACT
    >probe:HG-U133A:216922_x_at:408:175; Interrogation_Position = 1365; Antisense;
    CAAATTCAGCAGTTCAGGTCACCAC
    >probe:HG-U133A:216922_x_at:7:667; Interrogation_Position = 1407; Antisense;
    TATACAATTACCAGATGCCACCGCA
    >probe:HG-U133A:216922_x_at:543:557; Interrogation_Position = 1481; Antisense;
    TGGTGGTATCTTGTCTGTTTAATCC
    Probes sequences for 215892_at:
    >probe:HG-U133A:215892_at:420:337; Interrogation_Position = 1102; Antisense;
    GCAAAAAGATAGCAATGGCCCATTT
    >probe:HG-U133A:215892_at:445:339; Interrogation_Position = 1113; Antisense;
    GCAATGGCCCATTTGGGAAAAACAC
    >probe:HG-U133A:215892_at:460:145; Interrogation_Position = 1180; Antisense;
    AAGTTCTTGCTTATATTACTGGGAA
    >probe:HG-U133A:215892_at:348:123; Interrogation_Position = 1203; Antisense;
    AAAAGAGGTCCTGATCAGATCCCAA
    >probe:HG-U133A:215892_at:699:63; Interrogation_Position = 1229; Antisense;
    AGAGTTCTTGGATCTCATACAGGAA
    >probe:HG-U133A:215892_at:408:43; Interrogation_Position = 1249; Antisense;
    AGGAAGGAATTCAAGACGAGTCGGA
    >probe:HG-U133A:215892_at:14:381; Interrogation_Position = 1263; Antisense;
    GACGAGTCGGAAAGTACAGTGCAAA
    >probe:HG-U133A:215892_at:191:549; Interrogation_Position = 1312; Antisense;
    GGCCAGCATGATGGCTTGGGCCTGT
    >probe:HG-U133A:215892_at:469:325; Interrogation_Position = 1317; Antisense;
    GCATGATGGCTTGGGCCTGTGGTCT
    >probe:HG-U133A:215892_at:174:563; Interrogation_Position = 1328; Antisense;
    TGGGCCTGTGGTCTAAGCACTTTGG
    >probe:HG-U133A:215892_at:602:409; Interrogation_Position = 1489; Antisense;
    GAGGCTGGATGGGACAGTCGCTTGA
    Probes sequences for 203272_s_at:
    >probe:HG-U133A:203272_s_at:594:303; Interrogation_Position = 78; Antisense;
    GCGGCTGGGGCAGGTTATGGTAGTG
    >probe:HG-U133A:203272_s_at:42:663; Interrogation_Position = 93; Antisense;
    TATGGTAGTGCGGACTGCGGTGTGA
    >probe:HG-U133A:203272_s_at:517:79; Interrogation_Position = 240; Antisense;
    AGCTGAGCAAGCTTTGGTGCGGCCT
    >probe:HG-U133A:203272_s_at:632:261; Interrogation_Position = 300; Antisense;
    CCGCGGCTCTATGTTCTATGATGAG
    >probe:HG-U133A:203272_s_at:106:495; Interrogation_Position = 329; Antisense;
    GGGATCTGGCTCACGAGTTCTATGA
    >probe:HG-U133A:203272_s_at:651:393; Interrogation_Position = 355; Antisense;
    GAGACAATCGTCACCAAGAACGGGC
    >probe:HG-U133A:203272_s_at:199:81; Interrogation_Position = 383; Antisense;
    AGCGGGCCAAGCTGAGGCGAGTGCA
    >probe:HG-U133A:203272_s_at:505:617; Interrogation_Position = 414; Antisense;
    TCTGATTCCTCAGGGCATCGTGAAG
    >probe:HG-U133A:203272_s_at:649:489; Interrogation_Position = 426; Antisense;
    GGGCATCGTGAAGCTGGATCACCCC
    >probe:HG-U133A:203272_s_at:384:13; Interrogation_Position = 464; Antisense;
    ATTTCCCTGTGATCCTCTATGAGGT
    >probe:HG-U133A:203272_s_at:654:305; Interrogation_Position = 581; Antisense;
    GCTGGGGCCACAACCCTGAATAAAC
    Probes sequences for 201530_x_at:
    >probe:HG-U133A:201530_x_at:61:419; Interrogation_Position = 901; Antisense;
    GATGCATGCTCGAGATTTCACTGTA
    >probe:HG-U133A:201530_x_at:14:679; Interrogation_Position = 916; Antisense;
    TTTCACTGTATCCGCCATGCATGGA
    >probe:HG-U133A:201530_x_at:688:33; Interrogation_Position = 971; Antisense;
    ATGAGGGAGTTTCGTTCTGGCTCTA
    >probe:HG-U133A:201530_x_at:262:401; Interrogation_Position = 999; Antisense;
    GAGTTTTGATTACCACTGACCTGCT
    >probe:HG-U133A:201530_x_at:560:97; Interrogation_Position = 1013; Antisense;
    ACTGACCTGCTGGCCAGAGGCATTG
    >probe:HG-U133A:201530_x_at:353:473; Interrogation_Position = 1040; Antisense;
    GTGCAGCAGGTTTCTTTAGTCATCA
    >probe:HG-U133A:201530_x_at:411:55; Interrogation_Position = 1057; Antisense;
    AGTCATCAACTATGACCTTCCCACC
    >probe:HG-U133A:201530_x_at:347:207; Interrogation_Position = 1186; Antisense;
    CATTGAGACCTTCTACAACACCTCC
    >probe:HG-U133A:201530_x_at:55:45; Interrogation_Position = 1215; Antisense;
    AGGAAATGCCCCTCAATGTTGCTGA
    >probe:HG-U133A:201530_x_at:620:309; Interrogation_Position = 1235; Antisense;
    GCTGACCTCATCTGAGGGGCTGTCC
    >probe:HG-U133A:201530_x_at:596:527; Interrogation_Position = 1339; Antisense;
    GGACATCTTGTCATTTTTTTTCTTT
    Probes sequences for 217371_s_at:
    >probe:HG-U133A:217371_s_at:633:351; Interrogation_Position = 171; Antisense;
    GAAGATCTTATTCAATCTATGCATA
    >probe:HG-U133A:217371_s_at:700:1; Interrogation_Position = 195; Antisense;
    ATTGATGCTACTTTATATACGGAAA
    >probe:HG-U133A:217371_s_at:635:529; Interrogation_Position = 215; Antisense;
    GGAAAGTGATGTTCACCCCAGTTGC
    >probe:HG-U133A:217371_s_at:426:395; Interrogation_Position = 222; Antisense;
    GATGTTCACCCCAGTTGCAAAGTAA
    >probe:HG-U133A:217371_s_at:209:109; Interrogation_Position = 246; Antisense;
    ACAGCAATGAAGTGCTTTCTCTTGG
    >probe:HG-U133A:217371_s_at:110:441; Interrogation_Position = 279; Antisense;
    GTTATTTCACTTGAGTCCGGAGATG
    >probe:HG-U133A:217371_s_at:543:403; Interrogation_Position = 291; Antisense
    GAGTCCGGAGATGCAAGTATTCATG
    >probe:HG-U133A:217371_s_at:681:447; Interrogation_Position = 321; Antisense;
    GTAGAAAATCTGATCATCCTAGCAA
    >probe:HG-U133A:217371_s_at:422:575; Interrogation_Position = 331; Antisense;
    TGATCATCCTAGCAAACAACAGTTT
    >probe:HG-U133A:217371_s_at:567:1; Interrogation_Position = 346; Antisense;
    ACAACAGTTTGTCTTCTAATGGGAA
    >probe:HG-U133A:217371_s_at:661:341; Interrogation_Position = 426; Antisense;
    GAATTTTTGCAGAGTTTTGTACATA
    Probes sequences for 206279_at:
    >probe:HG-U133A:206279_at:461:325; Interrogation_Position = 6682; Antisense;
    GCAGTACAGTCCTTCCTAATAACGG
    >probe:HG-U133A:206279_at:36:191; Interrogation_Position = 6688; Antisense;
    CAGTCCTTCCTAATAACGGGGATGC
    >probe:HG-U133A:206279_at:509:271; Interrogation_Position = 6696; Antisense;
    CCTAATAACGGGGATGCTCTGTCAT
    >probe:HG-U133A:206279_at:505:485; Interrogation_Position = 6782; Antisense;
    GTGGACATGTCTGTTCTGTATATAA
    >probe:HG-U133A:206279_at:74:327; Interrogation_Position = 6829; Antisense;
    GCAGGTGGGATTTGAAGCTTTTCCA
    >probe:HG-U133A:206279_at:582:425; Interrogation_Position = 6837; Antisense;
    GATTTGAAGCTTTTCCACAGTCCTT
    >probe:HG-U133A:206279_at:554:79; Interrogation_Position = 6882; Antisense;
    AGCTCTGCTCAGCTGTCACAGCTGA
    >probe:HG-U133A:206279_at:266:571; Interrogation_Position = 6895; Antisense;
    TGTCACAGCTGAATTCTTTTTTTGT
    >probe:HG-U133A:206279_at:388:341; Interrogation_Position = 6905; Antisense;
    GAATTTTTTTTTTGTGCTGAACACT
    >probe:HG-U133A:206279_at:645:203; Interrogation_Position = 6910; Antisense;
    CTTTTTTTGTGCTGAACACTGGGCA
    >probe:HG-U133A:206279_at:424:687; Interrogation_Position = 7106; Antisense;
    TTAATCTTGTTTGTTGCGCAATGGA
    Probes sequences for 222231_s_at:
    >probe:HG-U133A:222231_s_at:699:161; Interrogation_Position = 2375; Antisense;
    AATTAAGCATTTTCTTGCCTCCTTT
    >probe:HG-U133A:222231_s_at:373:215; Interrogation_Position = 2396; Antisense;
    CTTTGCTTCATCTTTTCACAACAGC
    >probe:HG-U133A:222231_s_at:625:39; Interrogation_Position = 2450; Antisense;
    ATGGTGCTCATTCACTGCAAACTCC
    >probe:HG-U133A:222231_s_at:32:591; Interrogation_Position = 2465; Antisense;
    TGCAAACTCCCAGTTGCAAGCTCCT
    >probe:HG-U133A:222231_s_at:403:293; Interrogation_Position = 2538; Antisense;
    GCCTTTTAGCCCTAATGACCTTTTG
    >probe:HG-U133A:222231_s_at:682:383; Interrogation_Position = 2554; Antisense;
    GACCTTTTGGATGGGACTGCAACTC
    >probe:HG-U133A:222231_s_at:180:381; Interrogation_Position = 2568; Antisense;
    GACTGCAACTCATGACTATCCTGAT
    >probe:HG-U133A:222231_s_at:97:277; Interrogation_Position = 2625; Antisense;
    CCATAGCTCTGCTGCGTAGGTCTAC
    >probe:HG-U133A:222231_s_at:553:447; Interrogation_Position = 2640; Antisense;
    GTAGGTCTACATCTTACTCAGAATC
    >probe:HG-U133A:222231_s_at:569:177; Interrogation_Position = 2664; Antisense;
    CACTACACATTCCTTTAGTCTTCCT
    >probe:HG-U133A:222231_s_at:29:703; Interrogation_Position = 2684; Antisense;
    TTCCTCCAAGCTCCAGAGCCATTGG
    Probes sequences for 214912_at:
    >probe:HG-U133A:214912_at:355:145; Interrogation_Position = 2236; Antisense;
    AAGAGACCCAGTTACAGAGCTATGT
    >probe:HG-U133A:214912_at:675:339; Interrogation_Position = 2273; Antisense;
    GAATGTCAATTCTAGGTAACGATCC
    >probe:HG-U133A:214912_at:136:49; Interrogation_Position = 2286; Antisense;
    AGGTAACGATCCTCAATGTCACAGG
    >probe:HG-U133A:214912_at:78:639; Interrogation_Position = 2298; Antisense;
    TCAATGTCACAGGTACTTCACGTAT
    >probe:HG-U133A:214912_at:346:181; Interrogation_Position = 2316; Antisense;
    CACGTATTAATACCTCTTTGCCATT
    >probe:HG-U133A:214912_at:252:293; Interrogation_Position = 2335; Antisense;
    GCCATTATACATATGTCCTTCTCCA
    >probe:HG-U133A:214912_at:106:469; Interrogation_Position = 2349; Antisense;
    GTCCTTCTCCAAATGATGGCTCTTT
    >probe:HG-U133A:214912_at:651:703; Interrogation_Position = 2385; Antisense;
    TTGTCTTCTAAGCTGGTTGTGTGGA
    >probe:HG-U133A:214912_at:605:217; Interrogation_Position = 2419; Antisense;
    CTACCTTCCACATTGTCTTATAGGA
    >probe:HG-U133A:214912_at:671:33; Interrogation_Position = 2471; Antisense;
    ATGACTCTAATTACAGCACTCTGGG
    >probe:HG-U133A:214912_at:255:657; Interrogation_Position = 2586; Antisense;
    TAGCTGGACCTAGTGGTGGGCGTCT
    Probes sequences for 201729_s_at:
    >probe:HG-U133A:201729_s_at:455:541; Interrogation_Position = 6382; Antisense;
    GGCTAGACTTTGCCATGGCTGTCAA
    >probe:HG-U133A:201729_s_at:303:117; Interrogation_Position = 6406; Antisense;
    AAAGGGACAGCCGCAAAGCCCTGGT
    >probe:HG-U133A:201729_s_at:430:147; Interrogation_Position = 6462; Antisense;
    AAGTCTGCAACAGGCTCTGAGGTCC
    >probe:HG-U133A:201729_s_at:687:255; Interrogation_Position = 6547; Antisense;
    CCCGGCTCCTCATTGGTTTAAGTGT
    >probe:HG-U133A:201729_s_at:186:53; Interrogation_Position = 6595; Antisense;
    AGTCCATCTTTGGCAGGCGCAAATG
    >probe:HG-U133A:201729_s_at:126:547; Interrogation_Position = 6624; Antisense;
    GGCGATTCGAGTGGCTGCAGTACAG
    >probe:HG-U133A:201729_s_at:170:329; Interrogation_Position = 6640; Antisense;
    GCAGTACAGGATCTGACTCTGGCTC
    >probe:HG-U133A:201729_s_at:567:197: Interrogation_Position = 6664; Antisense;
    CAGGCTCCAGGGACTTGTGGGGTGG
    >probe:HG-U133A:201729_s_at:331:321; Interrogation_Position = 6694; Antisense;
    GCTTCCCGTTATCCACGAGGATTTG
    >probe:HG-U133A:201729_s_at:345:479; Interrogation_Position = 6722; Antisense;
    GTGTCAGAGCCCATAGGCATCACTC
    >probe:HG-U133A:201729_s_at:32:61; Interrogation_Position = 6795; Antisense;
    AGTTCTGTCTCACACTGATCAGGCT
    Probes sequences for 218265_at:
    >probe:HG-U133A:218265_at:169:641; Interrogation_Position = 2820; Antisense;
    TAAGGAAACTTCCTCTCCATTGCAG
    >probe:HG-U133A:218265_at:321:621; Interrogation_Position = 2833; Antisense;
    TCTCCATTGCAGAATAGCTGAGCCA
    >probe:HG-U133A:218265_at:626:555; Interrogation_Position = 2901; Antisense;
    TGGAGGTTGCCATGGAGGGCCATTC
    >probe:HG-U133A:218265_at:169:289; Interrogation_Position = 2956; Antisense;
    GCCACTTGGCAGAAGGGTGCAGGGC
    >probe:HG-U133A:218265_at:199:317; Interrogation_Position = 3002; Antisense;
    GCTACAGGGAAAGGGCCCTTTCTCA
    >probe:HG-U133A:218265_at:521:273; Interrogation_Position = 3018; Antisense;
    CCTTTCTCAGGGGATGTAGCTTTTT
    >probe:HG-U133A:218265_at:624:489; Interrogation_Position = 3053; Antisense;
    GGGAACACTTGGAGGATTTGCTAAA
    >probe:HG-U133A:218265_at:353:557; Interrogation_Position = 3098; Antisense;
    TGGTTTTCTAACCTGTGACTTTTTG
    >probe:HG-U133A:218265_at:373:29; Interrogation_Position = 3125; Antisense;
    ATGAATTATTCCTTTCAGTCTTTAT
    >probe:HG-U133A:218265_at:659:35; Interrogation_Position = 3234; Antisense;
    ATGTGGTTTCTGTCTTAGACCAGGA
    >probe:HG-U133A:218265_at:95:519; Interrogation_Position = 3256; Antisense;
    GGAGGACAGAGTTTGCTTTCATATT
    Probes sequences for 201537_s_at:
    >probe:HG-U133A:201537_s_at:295:311; Interrogation_Position = 1344; Antisense;
    GCTGCTCTGGAATATTTCCCTTCGA
    >probe:HG-U133A:201537_s_at:483:25; Interrogation_Position = 1368; Antisense;
    ATCTTATCTCAGTCACTTCGTTTTT
    >probe:HG-U133A:201537_s_at:517:661; Interrogation_Position = 1513; Antisense;
    TAGTGTTGGCAAGCCCAAGCCACTC
    >probe:HG-U133A:201537_s_at:692:177; Interrogation_Position = 1533; Antisense;
    CACTCGTGCTAACTGCTTTTTGTCT
    >probe:HG-U133A:201537_5_at:646:211; Interrogation_Position = 1548; Antisense;
    CTTTTTGTCTCGGTTGCTATTCCAA
    >probe:HG-U133A:201537_s_at:441:431; Interrogation_Position = 1588; Antisense;
    GTTGGCCAATTACAGCGTGTGTGCA
    >probe:HG-U133A:201537_s_at:640:263; Interrogation_Position = 1634; Antisense;
    CCTCTCAGAAACGCGGCCAGAAGAC
    >probe:HG-U133A:201537_s_at:123:535; Interrogation_Position = 1665; Antisense;
    GGAAGTGAAAGGTCCCAGGCACACA
    >probe:HG-U133A:201537_s_at:698:301; Interrogation_Position = 1694; Antisense;
    GCCCATTGCAGGTGGCTCTTACAGC
    >probe:HG-U133A:201537_s_at:150:287; Interrogation_Position = 1727; Antisense;
    GCCAGCACGGGATCCCTGAAGTGAC
    >probe:HG-U133A:201537_s_at:78:187; Interrogation_Position = 1875; Antisense;
    CAGCCCTCACATGTGTATGCACATT
    Probes sequences for 221616_s_at:
    >probe:HG-U133A:221616_s_at:614:157; Interrogation_Position = 796; Antisense;
    AATCGTGGAACTAGTCTCCCTTGGT
    >probe:HG-U133A:221616_s_at:451:145; Interrogation_Position = 986; Antisense;
    AAGATTGGTCAGCACGACACTGGAT
    >probe:HG-U133A:221616_s_at:298:373; Interrogation_Position = 1001; Antisense;
    GACACTGGATTGCCAGTCTCACATC
    >probe:HG-U133A:221616_s_at:148:463; Interrogation_Position = 1016; Antisense;
    GTCTCACATCATCTGCATCAACATT
    >probe:HG-U133A:221616_s_at:452:661; Interrogation_Position = 1040; Antisense;
    TAGGTTTCTTAGCATGGCTCGAATA
    >probe:HG-U133A:221616_s_at:460:323; Interrogation_Position = 1072; Antisense;
    GCATCATCCAGAATTGTAGTCACAT
    >probe:HG-U133A:221616_s_at:68:19; Interrogation_Position = 1126; Antisense;
    ATAACCCTTGGTTCATACTCTGTGA
    >probe:HG-U133A:221616_s_at:200:437; Interrogation_Position = 1136; Antisense;
    GTTCATACTCTGTGATTCCCATATC
    >probe:HG-U133A:221616_s_at:444:565; Interrogation_Position = 1146; Antisense;
    TGTGATTCCCATATCCTTCAGGATC
    >probe:HG-U133A:221616_s_at:207:517; Interrogation_Position = 1166; Antisense;
    GGATCTGTGCCATCACCAAGGCATC
    >probe:HG-U133A:221616_s_at:195:203; Interrogation_Position = 1187; Antisense;
    CATCTCTCGGAGCGTTCTTGGGAGG
    Probes sequences for 213106_at:
    >probe:HG-U133A:213106_at:220:103; Interrogation_Position = 2437; Antisense;
    ACTTCTCAGCAAATAAATCTCCCTT
    >probe:HG-U133A:213106_at:486:701; Interrogation_Position = 2495; Antisense;
    TTGAATTAACAGCAACTTTCCACAG
    >probe:HG-U133A:213106_at:603:651; Interrogation_Position = 2612; Antisense;
    TACTATCTGTTGTTTAACTACTTTA
    >probe:HG-U133A:213106_at:514:291; Interrogation_Position = 2687; Antisense;
    GCCTTGGAAGGCAGCTACTACAGAA
    >probe:HG-U133A:213106_at:471:647; Interrogation_Position = 2734; Antisense;
    TAAAGTAGTATTTTCAGCTCCCTGA
    >probe:HG-U133A:213106_at:566:79; Interrogation_Position = 2749; Antisense;
    AGCTCCCTGAAAAACCATTCCTGCT
    >probe:HG-U133A:213106_at:438:133; Interrogation_Position = 2819; Antisense;
    AATCTGTGGTTATAGTAGTTTTCTC
    >probe:HG-U133A:213106_at:323:697; Interrogation_Position = 2839; Antisense;
    TTCTCAGGTTTGTTTATCTTGATGT
    >probe:HG-U133A:213106_at:106:629; Interrogation_Position = 2855; Antisense;
    TCTTGATGTTTGATGCACTGTGTTT
    >probe:HG-U133A:213106_at:559:439; Interrogation_Position = 2919; Antisense;
    GTTATGTGTCATTGGCCTTTTGTGA
    >probe:HG-U133A:213106_at:99:125; Interrogation_Position = 2971; Antisense;
    AAACATTTTGTCCTCTAATTGTTAT
    Probes sequences for 215336_at:
    >probe:HG-U133A:215336_at:602:595; Interrogation_Position = 1831; Antisense;
    TGCTGCGTGGCATTAAGGTTCTGGC
    >probe:HG-U133A:215336_at:515:551; Interrogation_Position = 1838; Antisense;
    TGGCATTAAGGTTCTGGCACAGAGA
    >probe:HG-U133A:215336_at:636:121; Interrogation_Position = 1862; Antisense;
    AAAAAGCGGGGCTCATCCAGGATTG
    >probe:HG-U133A:215336_at:227:279; Interrogation_Position = 1878; Antisense;
    CCAGGATTGGTGGTGTGCTCGTTGC
    >probe:HG-U133A:215336_at:388:471; Interrogation_Position = 1892; Antisense;
    GTGCTCGTTGCATATACAGTGTCCA
    >probe:HG-U133A:215336_at:91:15; Interrogation_Position = 1903; Antisense;
    ATATACAGTGTCCAGGGTCTCACCT
    >probe:HG-U133A:215336_at:47:197; Interrogation_Position = 1915; Antisense;
    CAGGGTCTCACCTCCAGAGATGAAG
    >probe:HG-U133A:215336_at:373:33; Interrogation_Position = 1944; Antisense;
    ATGAAGTCAGTGAAGGGTCCTGAGA
    >probe:HG-U133A:215336_at:30:417; Interrogation_Position = 1994; Antisense;
    GATGATTTCAGTGCAGGTGATTCAG
    >probe:HG-U133A:215336_at:422:191; Interrogation_Position = 2016; Antisense;
    CAGAGACCTCACTTGAAAATATATT
    >probe:HG-U133A:215336_at:571:95; Interrogation_Position = 2086; Antisense;
    ACTGTTCTTTTCTGCTACAGAGATC
    Probes sequences for 205917_at:
    >probe:HG-U133A:205917_at:639:485; Interrogation_Position = 5990; Antisense;
    GTGGCAGATCTTGTTTTACACTCCC
    >probe:HG-U133A:205917_at:513:675; Interrogation_Position = 6046; Antisense;
    TTTCCAGTCTATTTTCTGTCTTGTT
    >probe:HG-U133A:205917_at:436:683; Interrogation_Position = 6086; Antisense;
    TTATTGACCTTCCATGGTCCTCACT
    >probe:HG-U133A:205917_at:106:41; Interrogation_Position = 6099; Antisense;
    ATGGTCCTCACTGATTGTTTTCTTT
    >probe:HG-U133A:205917_at:298:441; Interrogation_Position = 6115; Antisense;
    GTTTTCTTTGTCATACCTAATCTGT
    >probe:HG-U133A:205917_at:301:499; Interrogation_Position = 6201; Antisense;
    GGTGGGTTCTTTTGTACACCTTCTG
    >probe:HG-U133A:205917_at:344:147; Interrogation_Position = 6287; Antisense;
    AAGTCATGGTTCTCTGGTGGTTTGT
    >probe:HG-U133A:205917_at:214:367; Interrogation_Position = 6353; Antisense;
    GAAAGCCCACAGTCTTCTGAGTTGT
    >probe:HG-U133A:205917_at:273:471; Interrogation_Position = 6376; Antisense;
    GTGCTACACCAATATTTCTATGAAC
    >probe:HG-U133A:205917_at:176:37; Interrogation_Position = 6456; Antisense;
    ATGTTTGTAAAATTCCCTAGCTCTC
    >probe:HG-U133A:205917_at:11:613; Interrogation_Position = 6469; Antisense;
    TCCCTAGCTCTCTGCACTGAGCTGA
    Probes sequences for 209770_at:
    >probe:HG-U133A:209770_at:45:515; Interrogation_Position = 2874; Antisense;
    GGAAATTTGGATGAAGGGAGCTAGA
    >probe:HG-U133A:209770_at:463:57; Interrogation_Position = 3087; Antisense;
    AGTACAGATGAGGGTTCACTATGTT
    >probe:HG-U133A:209770_at:565:181; Interrogation_Position = 3191; Antisense;
    CACCGGGTGACCGGCTTACAGGGAT
    >probe:HG-U133A:209770_at:17:447; Interrogation_Position = 3198; Antisense;
    TGACCGGCTTACAGGGATATTTTTA
    >probe:HG-U133A:209770_at:3:691; Interrogation_Position = 3220; Antisense;
    TTAATCCCGTTATGGACTCTGTCTC
    >probe:HG-U133A:209770_at:152:465; Interrogation_Position = 3240; Antisense;
    GTCTCCAGGAGAGGGGTCTATCCAC
    >probe:HG-U133A:209770_at:493:215; Interrogation_Position = 3257; Antisense;
    CTATCCACCCCTGCTCATTGGTGGA
    >probe:HG-U133A:209770_at:329:283; Interrogation_Position = 3261; Antisense;
    CCACCCCTGCTCATTGGTGGATGTT
    >probe:HG-U133A:209770_at:69:531; Interrogation_Position = 3319; Antisense;
    GGAAAAACTACTCCTCATTATCATC
    >probe:HG-U133A:209770_at:34:25; Interrogation_Position = 3341; Antisense;
    ATCATTATTATTGCTCTCCACTGTA
    >probe:HG-U133A:209770_at:3:223; Interrogation_Position = 3370; Antisense;
    CTCTACCTGGCATGTGCTTGTCAAG
    Probes sequences for 209061_at:
    >probe:HG-U133A:209061_at:429:25; Interrogation_Position = 7435; Antisense;
    ATCTGAGGTGAGTTGGGGGTATCTA
    >probe:HG-U133A:209061_at:666:541; Interrogation_Position = 7491; Antisense;
    GGCTTGATGTCCTAGAAGTTCTTTG
    >probe:HG-U133A:209061_at:576:57; Interrogation_Position = 7507; Antisense;
    AGTTCTTTGATCCAGAGGTGGGTGC
    >probe:HG-U133A:209061_at:47:407; Interrogation_Position = 7521; Antisense;
    GAGGTGGGTGCAGCTGAAAGTAAAC
    >probe:HG-U133A:209061_at:313:601; Interrogation_Position = 7555; Antisense;
    TGCCAGTTACATGTATGCCTGCCCA
    >probe:HG-U133A:209061_at:418:279; Interrogation_Position = 7577; Antisense;
    CCAGTTCCCTTTTTATTTGCAGAAG
    >probe:HG-U133A:209061_at:104:681; Interrogation_Position = 7609; Antisense;
    TTTTGTTCACAATTAGGTTCCTAGG
    >probe:HG-U133A:209061_at:269:137; Interrogation_Position = 7689; Antisense;
    AACGAGCAGGGTGTTTTCTCTCTTC
    >probe:HG-U133A:209061_at:509:197; Interrogation_Position = 7769; Antisense;
    CAGGAATTGACTTATACTCTTGAGA
    >probe:HG-U133A:209061_at:518:39; Interrogation_Position = 7889; Antisense;
    ATGGGGGATGGTGAGCTGTGACTGC
    >probe:HG-U133A:209061_at:701:97; Interrogation_Position = 7909; Antisense;
    ACTGCTTTGCTGACCATTTTGGATG
    Probes sequences for 202573_at:
    >probe:HG-U133A:202573_at:491:321; Interrogation_Position = 2264; Antisense;
    GCATTGTTGCCAGGGGTGAGGCCGT
    >probe:HG-U133A:202573_at:369:247; Interrogation_Position = 2596; Antisense;
    CGACACCCGGCAAGCAGCCGGAGAC
    >probe:HG-U133A:202573_at:76:329; Interrogation_Position = 2609; Antisense;
    GCAGCCGGAGACAAAACGCCTTAAA
    >probe:HG-U133A:202573_at:689:231; Interrogation_Position = 2649; Antisense;
    CTGCAGGTATATTGCAGGGGCCTGG
    >probe:HG-U133A:202573_at:221:545; Interrogation_Position = 2720; Antisense;
    GGCTGCCGGGCAGAGTGGAGCAGCT
    >probe:HG-U133A:202573_at:663:193; Interrogation_Position = 2757; Antisense;
    CAGGGCGGTGGCTGTGAGTCTAGTT
    >probe:HG-U133A:202573_at:257:567; Interrogation_Position = 2769; Antisense;
    TGTGAGTCTAGTTTTTGCTTTACCA
    >probe:HG-U133A:202573_at:345:679; Interrogation_Position = 2780; Antisense;
    TTTTTGCTTTACCAAGTGTACAGAA
    >probe:HG-U133A:202573_at:185:119; Interrogation_Position = 2803; Antisense;
    AAATGGCATTTACGTTTCTCTGATG
    >probe:HG-U133A:202573_at:369:323; Interrogation_Position = 2808; Antisense;
    GCATTTACGTTTCTCTGATGCTCCC
    >probe:HG-U133A:202573_at:260:619; Interrogation_Position = 2821; Antisense;
    TCTGATGCTCCCTTGAAGCCATAGA
    Probes sequences for 207064_s_at:
    >probe:HG-U133A:207064_s_at:344:45; Interrogation_Position = 2100; Antisense;
    AGGACATCCCAAACACAGTGACTCT
    >probe:HG-U133A:207064_s_at:648:531; Interrogation_Position = 2127; Antisense;
    GGAACAGAGTTGGCTTCTTGCTCCG
    >probe:HG-U133A:207064_s_at:550:233; Interrogation_Position = 2233; Antisense;
    CTGCAGCATCAATCCTGTGGCCTGC
    >probe:HG-U133A:207064_s_at:187:181; Interrogation_Position = 2306; Antisense;
    CACGGCTTCTAGTCCTGAGGGTGTG
    >probe:HG-U133A:207064_s_at:568:439; Interrogation_Position = 2343; Antisense;
    GTTAGGCACATGTACTTTTCCCTGT
    >probe:HG-U133A:207064_s_at:253:621; Interrogation_Position = 2381; Antisense;
    TCTCCGTGTTTTTATCACACCTGCT
    >probe:HG-U133A:207064_s_at:456:199; Interrogation_Position = 2450; Antisense;
    CATCAGTCCCTTTGGTTAATTCTTA
    >probe:HG-U133A:207064_s_at:171:213; Interrogation_Position = 2471; Antisense;
    CTTACTTCCTGTTCATCTCTAAAGT
    >probe:HG-U133A:207064_s_at:579:161; Interrogation_Position = 2534; Antisense;
    AATATCACAAATCCTACTACTCAGA
    >probe:HG-U133A:207064_s_at:434:637; Interrogation_Position = 2568; Antisense;
    TCACATTACATCAGACATCTCTTTA
    >probe:HG-U133A:207064_s_at:88:377; Interrogation_Position = 2581; Antisense;
    GACATCTCTTTATGCATGTGCATTC
    Probes sequences for 64371_at:
    >probe:HG-U133A:64371_at:374:429; Interrogation_Position = 22; Antisense;
    GATTCCCCAGCACTCTGCTGCGTGT
    >probe:HG-U133A:64371_at:449:99; Interrogation_Position = 33; Antisense;
    ACTCTGCTGCGTGTAACTCCACTCA
    >probe:HG-U133A:64371_at:79:311; Interrogation_Position = 38; Antisense;
    GCTGCGTGTAACTCCACTCAATTCT
    >probe:HG-U133A:64371_at:170:285; Interrogation_Position = 63; Antisense;
    CCACTCATCCTTCCTTGTGAAGCAG
    >probe:HG-U133A:64371_at:328:273; Interrogation_Position = 75; Antisense;
    CCTTGTGAAGCAGGATCGTTGAAGT
    >probe:HG-U133A:64371_at:300:529; Interrogation_Position = 121; Antisense;
    GGAAAACTTAGGATCCCTCTGACAC
    >probe:HG-U133A:64371_at:417:225; Interrogation_Position = 137; Antisense;
    CTCTGACACCCCAGGATTAGGGGAC
    >probe:HG-U133A:64371_at:419:183; Interrogation_Position = 143; Antisense;
    CACCCCAGGATTAGGGGACACAGCA
    >probe:HG-U133A:64371_at:354:175; Interrogation_Position = 161; Antisense;
    CACAGCAGTGGCTAGGGCATCAGCC
    >probe:HG-U133A:64371_at:53:109; Interrogation_Position = 162; Antisense;
    ACAGCAGTGGCTAGGGCATCAGCCA
    >probe:HG-U133A:64371_at:66:185; Interrogation_Position = 181; Antisense;
    CAGCCACAGAACTGAGCGGGAAATG
    >probe:HG-U133A:64371_at:35:243; Interrogation_Position = 197; Antisense;
    CGGGAAATGCCACTTGTATTGGCTG
    >probe:HG-U133A:64371_at:645:207; Interrogation_Position = 209; Antisense;
    CTTGTATTGGCTGTAAAGAAATCCT
    >probe:HG-U133A:64371_at:257:565; Interrogation_Position = 220; Antisense;
    TGTAAAGAAATCCTGGCTTTGGGCC
    >probe:HG-U133A:64371_at:495:453; Interrogation_Position = 221; Antisense;
    GTAAAGAAATCCTGGCTTTGGGCCA
    >probe:HG-U133A:64371_at:271:367; Interrogation_Position = 226; Antisense;
    GAAATCCTGGCTTTGGGCCAGGCAC
    Probes sequences for 219977_at:
    >probe:HG-U133A:219977_at:428:539; Interrogation_Position = 1193; Antisense;
    GGCAGGGAGCAAGTGGCCTGGTCAC
    >probe:HG-U133A:219977_at:311:265; Interrogation_Position = 1209; Antisense;
    CCTGGTCACTTCTGGTTCGATTGAC
    >probe:HG-U133A:219977_at:655:385; Interrogation_Position = 1231; Antisense;
    GACCAGGATCGTGGTGTCACTTTTT
    >probe:HG-U133A:219977_at:118:9; Interrogation_Position = 1418; Antisense;
    ATTCTTCTGGTAGTTTTCTCTATTT
    >probe:HG-U133A:219977_at:559:677; Interrogation_Position = 1465; Antisense;
    TTTCTAGATTTTACCCCATGTCAAT
    >probe:HG-U133A:219977_at:135:515; Interrogation_Position = 1499; Antisense;
    GGATTTGATGCTCTGATCCTTTCTC
    >probe:HG-U133A:219977_at:266:57; Interrogation_Position = 1560; Antisense;
    AGTTACATTTGGGGGTCATCTCGGT
    >probe:HG-U133A:219977_at:167:497; Interrogation_Position = 1572; Antisense;
    GGGTCATCTCGGTGATTTTTGTAAC
    >probe:HG-U133A:219977_at:641:653; Interrogation_Position = 1599; Antisense;
    TACGCAGGACACTTAGAGCTCTCTA
    >probe:HG-U133A:219977_at:21:225; Interrogation_Position = 1617; Antisense;
    CTCTCTAGAATCCCACTGACTTTAG
    >probe:HG-U133A:219977_at:248:97; Interrogation_Position = 1631; Antisense;
    ACTGACTTTAGTGGGGTCTTGATGT
    Probes sequences for 200772_x_at:
    >probe:HG-U133A:200772_x_at:480:363; Interrogation_Position = 177; Antisense;
    GAAAAGCCATCTTTGCATTGTTCCT
    >probe:HG-U133A:200772_x_at:173:585; Interrogation_Position = 291; Antisense;
    TGAACTCTCGCTTTCTTTTTAATCC
    >probe:HG-U133A:200772_x_at:544:183; Interrogation_Position = 342; Antisense;
    CACCATGTCAGACGCAGCCGTAGAC
    >probe:HG-U133A:200772_x_at:216:37; Interrogation_Position = 346; Antisense;
    ATGTCAGACGCAGCCGTAGACACCA
    >probe:HG-U133A:200772_x_at:564:77; Interrogation_Position = 370; Antisense;
    AGCTCCGAAATCACCACCAAGGACT
    >probe:HG-U133A:200772_x_at:460:21; Interrogation_Position = 379; Antisense;
    ATCACCACCAAGGACTTAAAGGAGA
    >probe:HG-U133A:200772_x_at:529:141; Interrogation_Position = 618; Antisense;
    AAGCTGAGTCAGCTACGGGCAAGCG
    >probe:HG-U133A:200772_x_at:422:187; Interrogation_Position = 627; Antisense;
    CAGCTACGGGCAAGCGGGCAGCTGA
    >probe:HG-U133A:200772_x_at:182:417; Interrogation_Position = 657; Antisense;
    GATGAGGATGACGATGTCGATACCA
    >probe:HG-U133A:200772_x_at:369:353; Interrogation_Position = 684; Antisense;
    GAAGCAGAAGACCGACGAGGATGAC
    >probe:HG-U133A:200772_x_at:292:67; Interrogation_Position = 692; Antisense;
    AGACCGACGAGGATGACTAGACAGC
    Probes sequences for 218617_at:
    >probe:HG-U133A:218617_at:455:329; Interrogation_Position = 1164; Antisense;
    GCAGAAAAGCTCCCACCATTTTCTT
    >probe:HG-U133A:218617_at:280:557; Interrogation_Position = 1195; Antisense;
    TGGTTTTAAAGTCTCACGTTCTCTA
    >probe:HG-U133A:218617_at:58:81; Interrogation_Position = 1243; Antisense;
    AGCTCCTTGTGTGGCTGATGTGTCT
    >probe:HG-U133A:218617_at:96:691; Interrogation_Position = 1327; Antisense;
    TTAAAAGCAGCACAGATTCCACATT
    >probe:HG-U133A:218617_at:147:409; Interrogation_Position = 1361; Antisense;
    GAGGATCTTCTTTGTGGTGAATACC
    >probe:HG-U133A:218617_at:423:17; Interrogation_Position = 1381; Antisense;
    ATACCAGGATTGACTGCATCCTTTT
    >probe:HG-U133A:218617_at:399:661; Interrogation_Position = 1416; Antisense;
    TATGTCCCTGACTCTGGCTAAAATT
    >probe:HG-U133A:218617_at:276:475; Interrogation_Position = 1481; Antisense;
    GTGAGCCACATATTGGGAGTTCTAG
    >probe:HG-U133A:218617_at:430:539; Interrogation_Position = 1518; Antisense;
    GGCAGGAAAGGGCCATCTCCATTGA
    >probe:HG-U133A:218617_at:280:69; Interrogation_Position = 1634; Antisense;
    AGACTTTGAAATTTGCGAGCTGCTC
    >probe:HG-U133A:218617_at:569:179; Interrogation_Position = 1675; Antisense;
    CACTGCTGTCTTTCTATTGAGTTAC
    Probes sequences for 214902_x_at:
    >probe:HG-U133A:214902_x_at:281:251; Interrogation_Position = 2475; Antisense;
    CCCAAAGTCTTAGTTGCAGGGCACT
    >probe:HG-U133A:214902_x_at:16:449; Interrogation_Position = 2569; Antisense;
    GTAGGAGAGCAATACCTTTGAAAAG
    >probe:HG-U133A:214902_x_at:533:431; Interrogation_Position = 2627; Antisense;
    GTTGTCAAAACTAATACTCGTGTAT
    >probe:HG-U133A:214902_x_at:676:11; Interrogation_Position = 2712; Antisense;
    ATTTTTGTCATTTCACTTTTGTGGA
    >probe:HG-U133A:214902_x_at:354:577; Interrogation_Position = 2762; Antisense;
    TGATGTGATCAGGCTGGGTGCGTTG
    >probe:HG-U133A:214902_x_at:594:37; Interrogation_Position = 2866; Antisense;
    ATGGTGAAGCCCTGTTTCTACTAGG
    >probe:HG-U133A:214902_x_at:642:137; Interrogation_Position = 2872; Antisense;
    AAGCCCTGTTTCTACTAGGAATACA
    >probe:HG-U133A:214902_x_at:248:545; Interrogation_Position = 2947; Antisense;
    GGCTGAAGCAGGAGAGTCGCTTGAA
    >probe:HG-U133A:214902_x_at:432:403; Interrogation_Position = 2960; Antisense;
    GAGTCGCTTGAACTTGGGAGGTGGA
    >probe:HG-U133A:214902_x_at:172:195; Interrogation_Position = 2990; Antisense;
    CAGTGAGACGAGATTGCGCCATTGC
    >probe:HG-U133A:214902_x_at:647:229; Interrogation_Position = 3016; Antisense;
    CTCCAGGCTGGGCGACAGAGTGAGA
    Probes sequences for 207436_x_at:
    >probe:HG-U133A:207436_x_at:347:471; Interrogation_Position = 3530; Antisense;
    GTGCTCTCAGAGTAGTAGTGTTTGT
    >probe:HG-U133A:207436_x_at:702:407; Interrogation_Position = 3596; Antisense;
    GAGGTCAGATGGCCTGGACACACAA
    >probe:HG-U133A:207436_x_at:360:365; Interrogation_Position = 3632; Antisense;
    GAAACTGAGTCTGCCATGATATTTT
    >probe:HG-U133A:207436_x_at:60:151; Interrogation_Position = 3658; Antisense;
    AAGGGAGCTATGTCAAAATGTGGTT
    >probe:HG-U133A:207436_x_at:68:205; Interrogation_Position = 3769; Antisense;
    CATAGAACTGCCTGGAGGGCTCCCT
    >probe:HG-U133A:207436_x_at:111:405; Interrogation_Position = 3783; Antisense;
    GAGGGCTCCCTGACAAAACAAAAGA
    >probe:HG-U133A:207436_x_at:653:67; Interrogation_Position = 3933; Antisense;
    AGACCAGCGTGGCCAACATGATGAA
    >probe:HG-U133A:207436_x_at:421:105; Interrogation_Position = 3948; Antisense;
    ACATGATGAAACCCAGTCTCTACTA
    >probe:HG-U133A:207436_x_at:581:197; Interrogation_Position = 3990; Antisense;
    CAGGCATGGTGGCAGGCCTCTGTAT
    >probe:HG-U133A:207436_x_at:509:295; Interrogation_Position = 4005; Antisense;
    GCCTCTGTATTCCCAGCTATTTGGG
    >probe:HG-U133A:207436_x_at:7:159; Interrogation_Position = 4045; Antisense;
    AATCGCTTGAATTCGGGAGGCAGAG
    Probes sequences for 215659_at:
    >probe:HG-U133A:215659_at:380:699; Interrogation_Position = 1554; Antisense;
    TTCCTAAATGGTCTTCCTTTTCCAT
    >probe:HG-U133A:215659_at:240:619; Interrogation_Position = 1598; Antisense;
    TCTGCTTTTAATTTAGCGAGCTCTT
    >probe:HG-U133A:215659_at:69:305; Interrogation_Position = 1613; Antisense;
    GCGAGCTCTTCTCATGTGTTTATCA
    >probe:HG-U133A:215659_at:181:405; Interrogation_Position = 1657; Antisense;
    GAGGGCAGTTTGCTTACTGGTTAAG
    >probe:HG-U133A:215659_at:24:151; Interrogation_Position = 1683; Antisense;
    AAGGATGCAGGCTTTAGGGCTGGAA
    >probe:HG-U133A:215659_at:379:657; Interrogation_Position = 1697; Antisense;
    TAGGGCTGGAAGCACCTGGTTTCAA
    >probe:HG-U133A:215659_at:534:73; Interrogation_Position = 1707; Antisense;
    AGCACCTGGTTTCAAAGCCTGGCTC
    >probe:HG-U133A:215659_at:678:221; Interrogation_Position = 1735; Antisense;
    CTCTTATCAGCTGCGTAACCTTTGG
    >probe:HG-U133A:215659_at:380:309; Interrogation_Position = 1744; Antisense;
    GCTGCGTAACCTTTGGACAAGTTGC
    >probe:HG-U133A:215659_at:563:433; Interrogation_Position = 1764; Antisense;
    GTTGCTTTATTGCTCTAAGTTTCAG
    >probe:HG-U133A:215659_at:612:219; Interrogation_Position = 1793; Antisense;
    CTCCTGTGTCAACTCTAGAGGACTG
    Probes sequences for 204216_s_at:
    >probe:HG-U133A:204216_s_at:391:63; Interrogation_Position = 1367; Antisense;
    AGATGAGTGAACTGAGTGTGGCACA
    >probe:HG-U133A:204216_s_at:327:683; Interrogation_Position = 1406; Antisense;
    TTTTGGAGCGCTGCAAGTACTGGCC
    >probe:HG-U133A:204216_s_at:313:309; Interrogation_Position = 1415; Antisense;
    GCTGCAAGTACTGGCCTGCTTGTAA
    >probe:HG-U133A:204216_s_at:604:39; Interrogation_Position = 1442; Antisense;
    ATGGGGATGAGTGTGCCTACCATCA
    >probe:HG-U133A:204216_s_at:29:589; Interrogation_Position = 1480; Antisense;
    TGCAAAGCCTTCCCCAATTGTAAAT
    >probe:HG-U133A:204216_s_at:367:477; Interrogation_Position = 1554; Antisense;
    GTGTACTAAACCAGATTGTCCCTTC
    >probe:HG-U133A:204216_s_at:221:707; Interrogation_Position = 1569; Antisense;
    TTGTCCCTTCACTCATGTGAGTAGA
    >probe:HG-U133A:204216_s_at:525:341; Interrogation_Position = 1595; Antisense;
    GAATTCCAGTACTGTCTCCAAAACC
    >probe:HG-U133A:204216_s_at:289:467; Interrogation_Position = 1691; Antisense;
    GTCCCTTCTATCATCCAAAACATTG
    >probe:HG-U133A:204216_s_at:96:219; Interrogation_Position = 1727; Antisense;
    CTCAATGTACAAGACCGGACTGCAC
    >probe:HG-U133A:204216_s_at:376:467; Interrogation_Position = 1831; Antisense;
    GTCCTGCCTGGCAGAAGATCATGCA
    Probes sequences for 214763_at:
    >probe:HG-U133A:214763_at:478:45; Interrogation_Position = 2603; Antisense;
    AGGACTCCAGCGTCAGGATGTTGCC
    >probe:HG-U133A:214763_at:565:303; Interrogation_Position = 2612; Antisense;
    GCGTCAGGATGTTGCCACTTGGAGT
    >probe:HG-U133A:214763_at:386:153; Interrogation_Position = 2643; Antisense;
    AATGGGCACGGGGTTTTCAGCCACA
    >probe:HG-U133A:214763_at:459:291; Interrogation_Position = 2679; Antisense;
    GCCAGTGTTCAGCAGGATCATGCCT
    >probe:HG-U133A:214763_at:127:189; Interrogation_Position = 2688; Antisense;
    CAGCAGGATCATGCCTTCTGTGTCT
    >probe:HG-U133A:214763_at:479:489; Interrogation_Position = 2784; Antisense;
    GGGCAGTGGGAGTTATGGGGTCATC
    >probe:HG-U133A:214763_at:456:561; Interrogation_Position = 2799; Antisense;
    TGGGGTCATCAAGGACCTTGCCTCT
    >probe:HG-U133A:214763_at:439:293; Interrogation_Position = 2818; Antisense;
    GCCTCTCTGGAATCTGTCAACCCAG
    >probe:HG-U133A:214763_at:183:545; Interrogation_Position = 2848; Antisense;
    GGCTCCAGGTGGATGGGTTGCTTTA
    >probe:HG-U133A:214763_at:624:469; Interrogation_Position = 2878; Antisense;
    GTGCCTGGTGCATGAGAAACTCTTG
    >probe:HG-U133A:214763_at:395:339; Interrogation_Position = 2963; Antisense;
    GCAATGTAGCTGAGGACTGGTCTGA
    Probes sequences for 200877_at:
    >probe:HG-U133A:200877_at:63:485; Interrogation_Position = 1343; Antisense;
    GTGGTATGGAATCCTACTGCGTTCG
    >probe:HG-U133A:200877_at:76:305; Interrogation_Position = 1361; Antisense;
    GCGTTCGTGCTTTTGCAGATGCTAT
    >probe:HG-U133A:200877_at:276:461; Interrogation_Position = 1390; Antisense;
    GTCATTCCATCTACACTAGCTGAAA
    >probe:HG-U133A:200877_at:541:217; Interrogation_Position = 1405; Antisense;
    CTAGCTGAAAATGCCGGCCTGAATC
    >probe:HG-U133A:200877_at:349:549; Interrogation_Position = 1420; Antisense;
    GGCCTGAATCCCATTTCTACAGTAA
    >probe:HG-U133A:200877_at:209:353; Interrogation_Position = 1447; Antisense;
    GAACTAAGAAACCGGCATGCCCAGG
    >probe:HG-U133A:200877_at:668:557; Interrogation_Position = 1523; Antisense;
    TGGAGGAACTGGTTGTCCAGCCTCT
    >probe:HG-U133A:200877_at:444:281; Interrogation_Position = 1539; Antisense;
    CCAGCCTCTGTTGGTATCAGTCAGT
    >probe:HG-U133A:200877_at:480:193; Interrogation_Position = 1560; Antisense;
    CAGTGCTCTGACTCTTGCAACTGAA
    >probe:HG-U133A:200877_at:138:379; Interrogation_Position = 1644; Antisense;
    GACTAGCACCATTATGATCACCAGT
    >probe:HG-U133A:200877_at:293:617; Interrogation_Position = 1735; Antisense;
    TCTGGGCTTGGTCTTCCAGTTGGCA
    Probes sequences for 218425_at:
    >probe:HG-U133A:218425_at:198:203; Interrogation_Position = 2557; Antisense;
    CATCACTCTGGCCCATTGACAGATG
    >probe:HG-U133A:218425_at:42:243; Interrogation_Position = 2564; Antisense;
    CTGGCCCATTGACAGATGAGAGGTC
    >probe:HG-U133A:218425_at:518:415; Interrogation_Position = 2578; Antisense;
    GATGAGAGGTCTGAAGCCTTCCTGG
    >probe:HG-U133A:218425_at:139:581; Interrogation_Position = 2669; Antisense;
    TGAGAGGCTGGGAGTTCACAGACCT
    >probe:HG-U133A:218425_at:494:59; Interrogation_Position = 2681; Antisense;
    AGTTCACAGACCTCAGACACAGCTG
    >probe:HG-U133A:218425_at:286:111; Interrogation_Position = 2697; Antisense;
    ACACAGCTGAGTCCGACAACCATTG
    >probe:HG-U133A:218425_at:315:369; Interrogation_Position = 2711; Antisense;
    GACAACCATTGGGGTGGGGCTGCAT
    >probe:HG-U133A:218425_at:316:231; Interrogation_Position = 2818; Antisense;
    CTGCCATTGGCTTCTCAGATGTATT
    >probe:HG-U133A:218425_at:55:543; Interrogation_Position = 2826; Antisense;
    GGCTTCTCAGATGTATTTCAAGGAC
    >probe:HG-U133A:218425_at:120:527; Interrogation_Position = 2847; Antisense;
    GGACTAAAGTGGGCTCTAAGATCTA
    >probe:HG-U133A:218425_at:65:219; Interrogation_Position = 2862; Antisense;
    CTAAGATCTAAGATGGCCCGGCGCG
    Probes sequences for 203246_s_at:
    >probe:HG-U133A:203246_s_at:500:709; Interrogation_Position = 770; Antisense;
    TTGTGACACTGGTGTCCATCCTCCA
    >probe:HG-U133A:203246_s_at:95:453; Interrogation_Position = 795; Antisense;
    GTACTCCAATGTATACTGCCCAACG
    >probe:HG-U133A:203246_s_at:199:63; Interrogation_Position = 858; Antisense;
    AGAGGCATGTCTATCCTACGTGACC
    >probe:HG-U133A:203246_s_at:291:619; Interrogation_Position = 906; Antisense;
    TCTCCGGGATGTGTTCCAGCTATAC
    >probe:HG-U133A:203246_s_at:435:309; Interrogation_Position = 1011; Antisense;
    GCTGATCCAGTTCGGGCTTATGAAG
    >probe:HG-U133A:203246_s_at:519:353; Interrogation_Position = 1032; Antisense;
    GAAGAACCTCATCAGGCGACTACAG
    >probe:HG-U133A:203246_s_at:587:377; Interrogation_Position = 1074; Antisense;
    GACTCGGGAAGAGCAGAGCCACCCT
    >probe:HG-U133A:203246_s_at:675:541; Interrogation_Position = 1103; Antisense;
    GGCTTTATACAGGCTGCCACAGCTA
    >probe:HG-U133A:203246_s_at:248:363; Interrogation_Position = 1183; Antisense;
    GAAAATGACCCCAACATCATCATCT
    >probe:HG-U133A:203246_s_at:540:111; Interrogation_Position = 1239; Antisense;
    ACACATTGCTGTGGGTAGTCCCTCC
    >probe:HG-U133A:203246_s_at:388:47; Interrogation_Position = 1268; Antisense;
    AGGAGGCTTGTCATACTGTCTAGAG
    Probes sequences for 203466_at:
    >probe:HG-U133A:203466_at:427:109; Interrogation_Position = 410; Antisense;
    ACAGCGGGATTATCCTGATGCCCTT
    >probe:HG-U133A:203466_at:27:173; Interrogation_Position = 440; Antisense;
    CAACTACTATCTATGGCCTGCTGTG
    >probe:HG-U133A:203466_at:649:591; Interrogation_Position = 463; Antisense;
    TGCAGTTAGCCAACTTCTACCTGGT
    >probe:HG-U133A:203466_at:691:609; Interrogation_Position = 487; Antisense;
    TCCCCCTTCATTACAGGTTGGCCGT
    >probe:HG-U133A:203466_at:241:91; Interrogation_Position = 544; Antisense;
    ACCTGTCCTGGAAGGCACATCGGCT
    >probe:HG-U133A:203466_at:229:591; Interrogation_Position = 600; Antisense;
    TGCAGTGATGCAGCTTGACCCTGGA
    >probe:HG-U133A:203466_at:241:457; Interrogation_Position = 628; Antisense;
    GTCAGACAACCTCCTCAAAGTGGGC
    >probe:HG-U133A:203466_at:469:145; Interrogation_Position = 692; Antisense;
    AAGAGGACTAGCACCTGCAATGCCC
    >probe:HG-U133A:203466_at:186:599; Interrogation_Position = 712; Antisense;
    TGCCCTCTTCACTCTAAATGTACAC
    >probe:HG-U133A:203466_at:146:111; Interrogation_Position = 733; Antisense;
    ACACTGACTGCTTTAGAGCCCTTGA
    >probe:HG-U133A:203466_at:602:15; Interrogation_Position = 837; Antisense;
    ATATCCAGCAAATGCCACTCATCCC
    Probes sequences for 204247_s_at:
    >probe:HG-U133A:204247_s_at:235:265; Interrogation_Position = 497; Antisense;
    CCGTCCGCTGTTACTCAGCTGAGGT
    >probe:HG-U133A:204247_s_at:476:607; Interrogation_Position = 551; Antisense;
    TCCTCTTTGGGGCCAAGCTGTACTC
    >probe:HG-U133A:204247_s_at:382:181; Interrogation_Position = 576; Antisense;
    CACGTCCATCGACATGTGGTCAGCC
    >probe:HG-U133A:204247_s_at:446:293; Interrogation_Position = 636; Antisense;
    GCCTCTTTTTCCCGGCAATGATGTC
    >probe:HG-U133A:204247_s_at:37:387; Interrogation_Position = 726; Antisense;
    GACCAAGCTGCCAGACTATAAGCCC
    >probe:HG-U133A:204247_s_at:302:655; Interrogation_Position = 742; Antisense;
    TATAAGCCCTATCCGATGTACCCGG
    >probe:HG-U133A:204247_s_at:507:109; Interrogation_Position = 769; Antisense;
    ACAACATCCCTGGTGAACGTCGTGC
    >probe:HG-U133A:204247_s_at:502:245; Interrogation_Position = 789; Antisense;
    CGTGCCCAAACTCAATGCCACAGGG
    >probe:HG-U133A:204247_s_at:279:289; Interrogation_Position = 805; Antisense;
    GCCACAGGGAGGGATCTGCTGCAGA
    >probe:HG-U133A:204247_s_at:165:87; Interrogation_Position = 830; Antisense;
    ACCTTCTGAAGTGTAACCCTGTCCA
    >probe:HG-U133A:204247_s_at:330:267; Interrogation_Position = 847; Antisense;
    CCTGTCCAGCGTATCTCAGCAGAAG
    Probes sequences for 216012_at:
    >probe:HG-U133A:216012_at:397:73; Interrogation_Position = 1126; Antisense;
    AGAAGTCTGTATCTGGCATCTTCCC
    >probe:HG-U133A:216012_at:642:527; Interrogation_Position = 1282; Antisense;
    GGAATTAGGATCTCCTTTCAGCCCT
    >probe:HG-U133A:216012_at:684:689; Interrogation_Position = 1298; Antisense;
    TTCAGCCCTATCTTGGGACTGACTT
    >probe:HG-U133A:216012_at:40:211; Interrogation_Position = 1320; Antisense;
    CTTACATTCCCACATGTGTCCATGT
    >probe:HG-U133A:216012_at:144:57; Interrogation_Position = 1366; Antisense;
    AGTCTAGCTTGCCACACATGTGAAA
    >probe:HG-U133A:216012_at:398:369; Interrogation_Position = 1387; Antisense;
    GAAATCATTCCCAGTTCACAGGTGG
    >probe:HG-U133A:216012_at:359:101; Interrogation_Position = 1462; Antisense;
    ACTTACATTTCATTGCTGCATCTGC
    >probe:HG-U133A:216012_at:231:67; Interrogation_Position = 1506; Antisense;
    AGAGCAACTTATTCCCTTGTGACAT
    >probe:HG-U133A:216012_at:661:375; Interrogation_Position = 1526; Antisense;
    GACATGATGACACACCTCTGGATAG
    >probe:HG-U133A:216012_at:306:613; Interrogation_Position = 1602; Antisense;
    TCCCTAGTATCCTATTACCTGTCTG
    >probe:HG-U133A:216012_at:583:453; Interrogation_Position = 1632; Antisense;
    GTACACTGTGTACCTTCGTATCACT
    Probes sequences for 211328_x_at:
    >probe:HG-U133A:211328_x_at:270:681; Interrogation_Position = 14; Antisense;
    TTTTACTGGGCATCTCCTGAGCCTA
    >probe:HG-U133A:211328_x_at:268:449; Interrogation_Position = 49; Antisense;
    GTAGGGTGACTTCTGGAGCCATCCC
    >probe:HG-U133A:211328_x_at:192:261; Interrogation_Position = 97; Antisense;
    GCGGAGACTTAACGGGGACGTGCGG
    >probe:HG-U133A:211328_x_at:498:63; Interrogation_Position = 124; Antisense;
    AGAGCTGGGGAAATGGGCCCGCGAG
    >probe:HG-U133A:211328_x_at:427:271; Interrogation_Position = 231; Antisense;
    CCTCTTCATGGGTGCCTCAGAGCAG
    >probe:HG-U133A:211328_x_at:548:189; Interrogation_Position = 248; Antisense;
    CAGAGCAGGACCTTGGTCTTTCCCT
    >probe:HG-U133A:211328_x_at:48:463; Interrogation_Position = 263; Antisense;
    GTCTTTCCCTGTTTGAAGCTTTGGG
    >probe:HG-U133A:211328_x_at:430:563; Interrogation_Position = 284; Antisense;
    TGGGCTACGTGGATGACCTCGTGAT
    >probe:HG-U133A:211328_x_at:499:231; Interrogation_Position = 358; Antisense;
    CTGCCCAGCCGTCAAAAGAGTCTTA
    >probe:HG-U133A:211328_x_at:485:663; Interrogation_Position = 414; Antisense;
    TATGTTACTACATGCACTTGGCTGC
    >probe:HG-U133A:211328_x_at:682:51; Interrogation_Position = 468; Antisense;
    AGGGCAGGTGCTTCAGGATACCATA
    Probes sequences for 218336_at:
    >probe:HG-U133A:218336_at:404:243; Interrogation_Position = 101; Antisense;
    CGGTGTCCGCAGAGCAGGTGATTGC
    >probe:HG-U133A:218336_at:411:75; Interrogation_Position = 113; Antisense;
    AGCAGGTGATTGCTGGCTTCAACCG
    >probe:HG-U133A:218336_at:429:539; Interrogation_Position = 143; Antisense;
    GGCAGGAACAGCGAGGCCTGGCATC
    >probe:HG-U133A:218336_at:91:393; Interrogation_Position = 199; Antisense;
    GAGCACAGCCTAGTGATCGATACAC
    >probe:HG-U133A:218336_at:424:133; Interrogation_Position = 240; Antisense;
    AACTCGTAAGTGCTACCGCATGGTT
    >probe:HG-U133A:218336_at:469:403; Interrogation_Position = 269; Antisense;
    GAGTGCTGGTGGAGCGAACTGTCAA
    >probe:HG-U133A:218336_at:387:595; Interrogation_Position = 299; Antisense;
    TGCTGCCCGCTTTGGAGAACAACAA
    >probe:HG-U133A:218336_at:639:3; Interrogation_Position = 343; Antisense;
    ATTGAGACACTGACACAGCAGCTTC
    >probe:HG-U133A:218336_at:516:121; Interrogation_Position = 402; Antisense;
    AAAGCACAACATTCGTCTCATGGGA
    >probe:HG-U133A:218336_at:131:315; Interrogation_Position = 482; Antisense;
    GCTCAGCTGGAGTGTTGGTCTCCTA
    >probe:HG-U133A:218336_at:289:229; Interrogation_Position = 501; Antisense;
    CTCCTAGGGACCAAGGCCTTTGCAT
    Probes sequences for 209746_s_at:
    >probe:HG-U133A:209746_s_at:165:27; Interrogation_Position = 55; Antisense;
    ATCCGGGTGGATCATGCAGGCGAAT
    >probe:HG-U133A:209746_s_at:390:107; Interrogation_Position = 223; Antisense;
    ACAGTTCTGATGCCCTTGTGGAACG
    >probe:HG-U133A:209746_s_at:406:383; Interrogation_Position = 273; Antisense;
    GACCGCCTTGCTCGGGAAGGAAGGT
    >probe:HG-U133A:209746_s_at:199:535; Interrogation_Position = 287; Antisense;
    GGAAGGAAGGTGCCATGGCCTGCAC
    >probe:HG-U133A:209746_s_at:335:549; Interrogation_Position = 302; Antisense;
    TGGCCTGCACCGTGGCGGTGGAAGA
    >probe:HG-U133A:209746_s_at:556:397; Interrogation_Position = 325; Antisense;
    GAGAGCATAGCACATCACTACAACA
    >probe:HG-U133A:209746_s_at:258:391; Interrogation_Position = 430; Antisense;
    GAGCTTGAGCACCATGACATAGGCC
    >probe:HG-U133A:209746_s_at:358:581; Interrogation_Position = 444; Antisense;
    TGACATAGGCCTCGACCATGATGCA
    >probe:HG-U133A:209746_s_at:314:591; Interrogation_Position = 465; AntisenSe;
    TGCAGAATTGGCTCCAGCCTATGCC
    >probe:HG-U133A:209746_s_at:260:185; Interrogation_Position = 479; Antisense;
    CAGCCTATGCCGTCCTGAAGAGCAT
    >probe:HG-U133A:209746_s_at:302:391; Interrogation_Position = 498; Antisense;
    GAGCATTATCCAGGCCGGATGCAGA
    Probes sequences for 214722_at:
    >probe:HG-U133A:214722_at:302:51; Interrogation_Position = 3851; Antisense;
    AGTGGTTGTCAGAGACTGTAAGAAG
    >probe:HG-U133A:214722_at:514:489; Interrogation_Position = 3878; Antisense;
    GGGAATTGGAGAGTGACTGCCCATA
    >probe:HG-U133A:214722_at:252:165; Interrogation_Position = 3881; Antisense;
    AATTGGAGAGTGACTGCCCATAGGT
    >probe:HG-U133A:214722_at:501:555; Interrogation_Position = 3884; Antisense;
    TGGAGAGTGACTGCCCATAGGTACA
    >probe:HG-U133A:214722_at:467:483; Interrogation_Position = 3951; Antisense;
    GTGGTGATGGTTGCAGAACTTTTGG
    >probe:HG-U133A:214722_at:533:219; Interrogation_Position = 4141; Antisense;
    CTCAGTGGTTTACTCTTGTAATCCC
    >probe:HG-U133A:214722_at:436:503; Interrogation_Position = 4278; Antisense;
    GGTGATGTGTGCCTAGAGTTCCAAC
    >probe:HG-U133A:214722_at:191:415; Interrogation_Position = 4281; Antisense;
    GATGTGTGCCTAGAGTTCCAACTAC
    >probe:HG-U133A:214722_at:56:481; Interrogation_Position = 4284; Antisense;
    GTGTGCCTAGAGTTCCAACTACTTG
    >probe:HG-U133A:214722_at:293:513; Interrogation_Position = 4350; Antisense;
    GGTTGCAGTGATCTATAATCACCAC
    >probe:HG-U133A:214722_at:66:55; Interrogation_Position = 4356; Antisense;
    AGTGATCTATAATCACCACTGCACT
    Probes sequences for 214599_at:
    >probe:HG-U133A:214599_at:400:195; Interrogation_Position = 1846; Antisense;
    CATAAATAACCACCCGCAGTGTCCA
    >probe:HG-U133A:214599_at:465:301; Interrogation_Position = 1874; Antisense;
    GCCCTCAGATCGTCTCATACAAGGG
    >probe:HG-U133A:214599_at:235:281; Interrogation_Position = 1916; Antisense;
    TCCACTTATTTCGGGTCCGCTAGGT
    >probe:HG-U133A:214599_at:140:467; Interrogation_Position = 1969; Antisense;
    GTCCCTCTACATGTCTCTTTAATGG
    >probe:HG-U133A:214599_at:263:525; Interrogation_Position = 2014; Antisense;
    GGAATTATTGTCCAGTGCCAACCCC
    >probe:HG-U133A:214599_at:709:137; Interrogation_Position = 2054; Antisense;
    AACCTCAGGTGAGCAGAGCCTCTAC
    >probe:HG-U133A:214599_at:80:329; Interrogation_Position = 2066; Antisense;
    GCAGAGCCTCTACTTGAGGGACTAT
    >probe:HG-U133A:214599_at:296:669; Interrogation_Position = 2097; Antisense;
    TATAGGAATCCTTACTTCCCCAGTA
    >probe:HG-U133A:214599_at:427:11; Interrogation_Position = 2199; Antisense;
    ATTTTCTTCTATCACACCACATAAA
    >probe:HG-U133A:214599_at:77:265; Interrogation_Position = 2226; Antisense;
    CCTGTGTATGGGTCAATGGCTGCAA
    >probe:HG-U133A:214599_at:485:639; Interrogation_Position = 2238; Antisense;
    TCAATGGCTGCAAGAGACTCCCACG
    Probes sequences for 220113_x_at:
    >probe:HG-U133A:220113_x_at:707:41; Interrogation_Position = 4117; Antisense;
    AGGCTTAGCAGGATGGGCGCAGTGG
    >probe:HG-U133A:220113_x_at:487:181; Interrogation_Position = 4144; Antisense;
    CACGCCACTAATCCCAACATTTTAG
    >probe:HG-U133A:220113_x_at:218:43; Interrogation_Position = 4176; Antisense;
    AGGCAGGAGCAATCACTTGTGCCTG
    >probe:HG-U133A:220113_x_at:96:471; Interrogation_Position = 4194; Antisense;
    GTGCCTGGGAGTTCTAGACTAGCCT
    >probe:HG-U133A:220113_x_at:471:239; Interrogation_Position = 4217; Antisense;
    CTGGGCGAGACTTCATCTCTACAAA
    >probe:HG-U133A:220113_x_at:582:657; Interrogation_Position = 4266; Antisense;
    TAGCCAAGCATAGTGGCACACCCCT
    >probe:HG-U133A:220113_x_at:178:77; Interrogation_Position = 4398; Antisense;
    AGCAGGACCCTGTCTCTATTTTATA
    >probe:HG-U133A:220113_x_at:663:27; Interrogation_Position = 4460; Antisense;
    ATCCCAACACTTTGGCTCAGCAGAT
    >probe:HG-U133A:220113_x_at:243:543; Interrogation_Position = 4473; Antisense;
    GGCTCAGCAGATTGCTTGAACCCAG
    >probe:HG-U133A:220113_x_at:70:345; Interrogation_Position = 4610; Antisense;
    GAATCTCCTGAGCCTGGAAGGTCCA
    >probe:HG-U133A:220113_x_at:625:533; Interrogation_Position = 4625; Antisense;
    GGAAGGTCCAGGCAGTGAGCCAAGA
    Probes sequences for 213212_x_at:
    >probe:HG-U133A:213212_x_at:593:361; Interrogation_Position = 2042; Antisense;
    GAAAATGTAAATCAACCCTATCCAT
    >probe:HG-U133A:213212_x_at:544:137; Interrogation_Position = 2055; Antisense;
    AACCCTATCCATAATAGATTCTCTA
    >probe:HG-U133A:213212_x_at:508:133; Interrogation_Position = 2081; Antisense;
    AACTTTATCTTACAGTCACTTTCAA
    >probe:HG-U133A:213212_x_at:517:455; Interrogation_Position = 2122; Antisense;
    GTAACTGCTATATTAACGTCTTAAA
    >probe:HG-U133A:213212_x_at:235:367; Interrogation_Position = 2221; Antisense;
    GAAATGCCAATTCCAGTCCAAAGCT
    >probe:HG-U133A:213212_x_at:17:119; Interrogation_Position = 2240; Antisense;
    AAAGCTGTATTTGCCAAGTTTTCTT
    >probe:HG-U133A:213212_x_at:648:681; Interrogation_Position = 2258; Antisense;
    TTTTCTTAGAATGACTTTTACCGAT
    >probe:HG-U133A:213212_x_at:309:655; Interrogation_Position = 2276; Antisense;
    TACCGATTTATGAATTCTTATACAC
    >probe:HG-U133A:213212_x_at:26:655; Interrogation_Position = 2318; Antisense;
    TACTGATTTTTGTCTAAAGTGGCAT
    >probe:HG-U133A:213212_x_at:666:149; Interrogation_Position = 2334; Antisense;
    AAGTGGCATTATTGACTGCTGCTGT
    >probe:HG-U133A:213212_x_at:696:671; Interrogation_Position = 2343; Antisense;
    TATTGACTGCTGCTGTGATGCTACT
    Probes sequences for 202842_s_at:
    >probe:HG-U133A:202842_s_at:225:443; Interrogation_Position = 1452; Antisense;
    GTTTGATATTTACCACAGCGCTGTG
    >probe:HG-U133A:202842_s_at:350:303; Interrogation_Position = 1469; Antisense;
    GCGCTGTGCCTTTCTACAGTAGAAC
    >probe:HG-U133A:202842_s_at:615:511; Interrogation_Position = 1509; Antisense;
    GGTTTTATTGCCCATAGTCATTTAG
    >probe:HG-U133A:202842_s_at:167:15; Interrogation_Position = 1649; Antisense;
    ATATTTCTTTCTTAGTTGTTGGCAC
    >probe:HG-U133A:202842_s_at:248:433; Interrogation_Position = 1663; Antisense;
    GTTGTTGGCACTCTTAGGTCTTAGT
    >probe:HG-U133A:202842_s_at:124:483; Interrogation_Position = 1704; Antisense;
    GTGTGTGTGTAGTTTATCCTCTCTC
    >probe:HG-U133A:202842_s_at:661:429; Interrogation_Position = 1744; Antisense;
    GATTGACTGATACCTCATTCTGTTT
    >probe:HG-U133A:202842_s_at:222:163; Interrogation_Position = 1784; Antisense;
    AATTTCTGTGCAACCTTACTATGTG
    >probe:HG-U133A:202842_s_at:670:479; Interrogation_Position = 1833; Antisense;
    GTGTGCTTTTGTTTTCGGATAGACT
    >probe:HG-U133A:202842_s_at:627:11; Interrogation_Position = 1859; Antisense;
    ATTTCTTTAGTTCTGCACTTTTCCA
    >probe:HG-U133A:202842_s_at:217:179; Interrogation_Position = 1874; Antisense;
    CACTTTTCCACATTATACTCCATAT
    Probes sequences for 217671_at:
    >probe:HG-U133A:217671_at:584:315; Interrogation_Position = 16; Antisense;
    GCTAAGAATGCTGTCCTTCACTTTT
    >probe:HG-U133A:217671_at:217:45; Interrogation_Position = 60; Antisense;
    AGGAATAGTCAAGTTCCCCTATCTC
    >probe:HG-U133A:217671_at:337:155; Interrogation_Position = 125; Antisense;
    AATACTTACTATCTTTACCACTCAC
    >probe:HG-U133A:217671_at:195:657; Interrogation_Position = 140; Antisense;
    TACCACTCACTGCTCTATTTTGAGA
    >probe:HG-U133A:217671_at:313:127; Interrogation_Position = 190; Antisense;
    AAAATGCCCTGTTTGCATACCTTCT
    >probe:HG-U133A:217671_at:79:593; Interrogation_Position = 203; Antisense;
    TGCATACCTTCTCTTCTTTGTATAC
    >probe:HG-U133A:217671_at:278:89; Interrogation_Position = 208; Antisense;
    ACCTTCTCTTCTTTGTATACTGTAA
    >probe:HG-U133A:217671_at:389:19; Interrogation_Position = 224; Antisense;
    ATACTGTAAACTCCTGTATAATAGG
    >probe:HG-U133A:217671_at:365:439; Interrogation_Position = 271; Antisense;
    GTTACTCTTTAAGGATATTTTCATT
    >probe:HG-U133A:217671_at:184:5; Interrogation_Position = 293; Antisense;
    ATTGAACTTGTAATATATCCAGATT
    >probe:HG-U133A:217671_at:587:427; Interrogation_Position = 314; Antisense;
    GATTAACATATGGGCCAAGTGCAGC
    Probes sequences for 207365_x_at:
    >probe:HG-U133A:207365_x_at:115:45; Interrogation_Position = 5312; Antisense;
    AGGTTCCTCCCACAACAATGGGAAT
    >probe:HG-U133A:207365_x_at:79:563; Interrogation_Position = 5365; Antisense;
    TGGGTGGAGACACGGCCAAACTGTA
    >probe:HG-U133A:207365_x_at:74:373; Interrogation_Position = 5373; Antisense;
    GACACGGCCAAACTGTATCACATTG
    >probe:HG-U133A:207365_x_at:78:193; Interrogation_Position = 5527; Antisense;
    CAGATGTTAATTCCTGTGACTGACT
    >probe:HG-U133A:207365_x_at:284:361; Interrogation_Position = 5559; Antisense;
    GAAAGGTAACCTAAGGCTGGGCACG
    >probe:HG-U133A:207365_x_at:106:457; Interrogation_Position = 5577; Antisense;
    GGGCACGGTGGCTCGTGCCTATAGT
    >probe:HG-U133A:207365_x_at:69:295; Interrogation_Position = 5593; Antisense;
    GCCTATAGTCCTGGCACTTTGAGTG
    >probe:HG-U133A:207365_x_at:101:499; Interrogation_Position = 5623; Antisense;
    GGTGGGTCGATTGCTTGAAACCAGG
    >probe:HG-U133A:207365_x_at:579:365; Interrogation_Position = 5676; Antisense;
    GAAACCTCATCCCTACAGAAAATAC
    >probe:HG-U133A:207365_x_at:647:175; Interrogation_Position = 5725; Antisense;
    CACACCTGTAGTCTCAGCTACTCGG
    >probe:HG-U133A:207365_x_at:520:387; Interrogation_Position = 5774; Antisense;
    GAGCTCAGGAGGTCGAGGCTTCAGT
    Probes sequences for 218067_s_at:
    >probe:HG-U133A:218067_s_at:632:321; Interrogation_Position = 1198; Antisense;
    GCTTGTTCTAATAGGGGCTATGCTC
    >probe:HG-U133A:218067_s_at:402:683; Interrogation_Position = 1245; Antisense;
    TTTTCCTTCCACTAAGTCAAATCCT
    >probe:HG-U133A:218067_s_at:619:457; Interrogation_Position = 1260; Antisense;
    GTCAAATCCTTATCAGATCATTGTT
    >probe:HG-U133A:218067_s_at:455:681; Interrogation_Position = 1306; Antisense;
    TTTTCACCTGTTTGGATTCTATATT
    >probe:HG-U133A:218067_s_at:611:517; Interrogation_Position = 1369; Antisense;
    GGATGGTCTGATAAGGCTTTTACTG
    >probe:HG-U133A:218067_s_at:294:251; Interrogation_Position = 1396; Antisense;
    CCCACTGACTTCAGAGTTATACTCT
    >probe:HG-U133A:218067_s_at:581:641; Interrogation_Position = 1434; Antisense;
    TAATGCTGGTTTTGCTGACTTTTTG
    >probe:HG-U133A:218067_s_at:277:685; Interrogation_Position = 1490; Antisense;
    TTGGTGATTGCATTGGGAAATTCCC
    >probe:HG-U133A:218067_s_at:79:3; Interrogation_Position = 1509; Antisense;
    ATTCCCAGGGTATTACTGGACCTAT
    >probe:HG-U133A:218067_s_at:468:571; Interrogation_Position = 1543; Antisense;
    TGTTAAACCAGTGTCCTTGTGATAC
    >probe:HG-U133A:218067_s_at:392:281; Interrogation_Position = 1688; Antisense;
    CCAGCATTCCCCATTGTGTAAATAA
    Probes sequences for 205238_at:
    >probe:HG-U133A:205238_at:576:305; Interrogation_Position = 2148; Antisense;
    GCGGCTGACTTAACCTTAGATTTCA
    >probe:HG-U133A:205238_at:626:691; Interrogation_Position = 2169; Antisense;
    TTCAGATTCCCTTTCAATCATGCCC
    >probe:HG-U133A:205238_at:2:159; Interrogation_Position = 2184; Antisense;
    AATCATGCCCGTTTGCTTGGGTTTG
    >probe:HG-U133A:205238_at:552:607; Interrogation_Position = 2221; Antisense;
    TCCTGGATCTGGACTGTTTCTTCCT
    >probe:HG-U133A:205238_at:383:595; Interrogation_Position = 2248; Antisense;
    TGCTGTCAGCCTTGGTTCTAGTTTA
    >probe:HG-U133A:205238_at:257:625; Interrogation_Position = 2264; Antisense;
    TCTAGTTTATAGTCTTGTCTCCTCC
    >probe:HG-U133A:205238_at:649:269; Interrogation_Position = 2284; Antisense;
    CCTCCGTTTCTGCTTATGGTGTGGA
    >probe:HG-U133A:205238_at:75:97; Interrogation_Position = 2350; Antisense;
    ACTGGAGCCTGTGCATTTGCTCAGA
    >probe:HG-U133A:205238_at:226:441; Interrogation_Position = 2532; Antisense;
    GTTTTCTTGGTTTTAAGGCATCTCG
    >probe:HG-U133A:205238_at:316:321; Interrogation_Position = 2549; Antisense;
    GCATCTCGAATGCCCCTTGAAAATG
    >probe:HG-U133A:205238_at:83:453; Interrogation_Position = 2572; Antisense;
    TGTAGTTTTCCCTGGTGGTGGGCAC
    Probes sequences for 209432_s_at:
    >probe:HG-U133A:209432_s_at:656:409; Interrogation_Position = 906; Antisense;
    GAGGAAACTCCAGGCCATGGTGATT
    >probe:HG-U133A:209432_s_at:667:317; Interrogation_Position = 1009; Antisense;
    GCTATGTACTCCTCTGACACAAGGG
    >probe:HG-U133A:209432_s_at:519:79; Interrogation_Position = 1044; Antisense;
    AGCTGAGCATGGAGTGTTGTCCCGC
    >probe:HG-U133A:209432_s_at:553:279; Interrogation_Position = 1085; Antisense;
    CCAGTGAGGACCCTTACCAGCTGGA
    >probe:HG-U133A:209432_s_at:626:599; Interrogation_Position = 1112; Antisense;
    TGCCTGCCCTGCAGTCAGAAGTGCC
    >probe:HG-U133A:209432_s_at:43:337; Interrogation_Position = 1145; Antisense;
    GCACACACCAGTGGTTGGACGGCTC
    >probe:HG-U133A:209432_s_at:258:555; Interrogation_Position = 1160; Antisense;
    TGGACGGCTCAGACTGTGTACTCCA
    >probe:HG-U133A:209432_s_at:651:597; Interrogation_Position = 1204; Antisense;
    TGCCTGCTGCATTACATGCCTCAGG
    >probe:HG-U133A:209432_s_at:387:259; Interrogation_Position = 1305; Antisense;
    CCCCCTGCAGGCAAATCTCACAAGG
    >probe:HG-U133A:209432_s_at:263:517; Interrogation_Position = 1333; Antisense;
    GGAGGATGGCTTCCTACTGGTAGCC
    >probe:HG-U133A:209432_s_at:519:299; Interrogation_Position = 1355; Antisense;
    GCCCCTCTGTCATTTTGCAGGACAG
    Probes sequences for 213919_at:
    >probe:HG-U133A:213919_at:660:31; Interrogation_Position = 89; Antisense;
    ATGAGGAGGAGGCAGTACCCCAGCA
    >probe:HG-U133A:213919_at:511:655; Interrogation_Position = 104; Antisense;
    TACCCCAGCACTTGTTTGTTTTGTT
    >probe:HG-U133A:213919_at:685:485; Interrogation_Position = 159; Antisense;
    GTGGCCTCACGCTGTGAAACTGGGA
    >probe:HG-U133A:213919_at:324:525; Interrogation_Position = 211; Antisense;
    GGAGCTGTGTGTTTGGTGGGCAGAC
    >probe:HG-U133A:213919_at:187:207; Interrogation_Position = 235; Antisense;
    CTTGTCATGGACTGTGGTTCGTGGA
    >probe:HG-U133A:213919_at:547:485; Interrogation_Position = 255; Antisense;
    GTGGAGACTTTGGGGGACTACCTGA
    >probe:HG-U133A:213919_at:326:379; Interrogation_Position = 270; Antisense;
    GACTACCTGAGCGGAGCTGGTCATC
    >probe:HG-U133A:213919_at:193:181; Interrogation_Position = 315; Antisense;
    CACGGCTGAGCACACGGTATGCCTC
    >probe:HG-U133A:213919_at:639:633; Interrogation_Position = 342; Antisense;
    TCAGCTCCACAAAGCGGCTGTGCAG
    >probe:HG-U133A:213919_at:308:69; Interrogation_Position = 417; Antisense;
    AGAAAGCTCGTTTAACTTCCTCAGT
    >probe:HG-U133A:213919_at:53:633; Interrogation_Position = 437; Antisense;
    TCAGTGCTGGCACCAGGATGCACCC
    Probes sequences for 221810_at:
    >probe:HG-U133A:221810_at:571:491; Interrogation_Position = 2193; Antisense;
    GGGAAGTAGGTGATGCCAGCCCTCA
    >probe:HG-U133A:221810_at:522:235; Interrogation_Position = 2221; Antisense;
    CTGTCTTCAGCCAGGGACTTGAGAA
    >probe:HG-U133A:221810_at:481:195; Interrogation_Position = 2257; Antisense;
    CAGTGGCTCCAATCTGTGGACCAGT
    >probe:HG-U133A:221810_at:375:347; Interrogation_Position = 2351; Antisense;
    GAAGGACTATATTTGTACTGTACCC
    >probe:HG-U133A:221810_at:429:427; Interrogation_Position = 2482; Antisense;
    GATTTTAGGGTGCAGCTACGCTCAC
    >probe:HG-U133A:221810_at:504:647; Interrogation_Position = 2509; Antisense;
    TAAACTTTTGGTGGCCTGGGGCATG
    >probe:HG-U133A:221810_at:447:501; Interrogation_Position = 2526; Antisense;
    GGGGCATGTCTTGAGGCCCAGACTG
    >probe:HG-U133A:221810_at:476:595; Interrogation_Position = 2597; Antisense:
    TGCTGTCTTGAGACTCCATCCAGCC
    >probe:HG-U133A:221810_at:489:283; Interrogation_Position = 2649; Antisense;
    CCACTATCTCCCTGTGACGGGTGAA
    >probe:HG-U133A:221810_at:548:487; Interrogation_Position = 2668; Antisense;
    GGTGAACTTCGTGTACTGTGTCTCG
    >probe:HG-U133A:221810_at:312:453; Interrogation_Position = 2680; Antisense;
    GTACTGTGTCTCGGGTCCATATATG
    Probes sequences for 217923_at:
    >probe:HG-U133A:217923_at:277:501; Interrogation_Position = 1062; Antisense;
    GGGGCTGAGGCCACACAGATAGGAG
    >probe:HG-U133A:217923_at:624:341; Interrogation_Position = 1112; Antisense;
    GAATGTCCTGATGGCCATGAGCAGT
    >probe:HG-U133A:217923_at:66:201; Interrogation_Position = 1127; Antisense;
    CATGAGCAGTTGAGTGGCACAGCCT
    >probe:HG-U133A:217923_at:74:539; Interrogation_Position = 1152; Antisense;
    GGCACCAGGAGCAGGTCCTTGTAAT
    >probe:HG-U133A:217923_at:38:59; Interrogation_Position = 1179; Antisense;
    AGTTAGTGTCCAGTCAGCTGAGCTC
    >probe:HG-U133A:217923_at:681:193; Interrogation_Position = 1215; Antisense;
    CAGTGGTGAGTGTTCATCGGCCTGT
    >probe:HG-U133A:217923_at:511:23; Interrogation_Position = 1230; Antisense;
    ATCGGCCTGTTACCGTTAGTACCTG
    >probe:HG-U133A:217923_at:393:87; Interrogation_Position = 1264; Antisense;
    ACCAGGCCATCCTGTCAAACGAGCC
    >probe:HG-U133A:217923_at:203:421; Interrogation_Position = 1327; Antisense;
    GATCTGTCTATGGGACCAGTGGCTT
    >probe:HG-U133A:217923_at:494:619; Interrogation_Position = 1356; Antisense;
    TCTGCCACACCCATAAATCCTTGTG
    >probe:HG-U133A:217923_at:695:705; Interrogation_Position = 1376; Antisense;
    TTGTGTGTTAACTTCTAGCTGCCTG
    Probes sequences for 210297_s_at:
    >probe:HG-U133A:210297_s_at:238:155; Interrogation_Position = 23; Antisense;
    AATGAATGTTCTCCTGGGCAGCGTT
    >probe:HG-U133A:210297_s_at:649:563; Interrogation_Position = 37; Antisense;
    TGGGCAGCGTTGTGATCTTTGCCAC
    >probe:HG-U133A:210297_s_at:239:89; Interrogation_Position = 60; Antisense;
    ACCTTCGTGACTTTATGCAATGCAT
    >probe:HG-U133A:210297_s_at:689:165; Interrogation_Position = 77; Antisense;
    CAATGCATCATGCTATTTCATACCT
    >probe:HG-U133A:210297_s_at:337:611; Interrogation_Position = 94; Antisense;
    TCATACCTAATGAGGGAGTTCCAGG
    >probe:HG-U133A:210297_s_at:84:443; Interrogation_Position = 134; Antisense;
    GTTTCTACACCTGTGGGTTATGACA
    >probe:HG-U133A:210297_s_at:22:511; Interrogation_Position = 149; Antisense;
    GGTTATGACAAAGACAACTGCCAAA
    >probe:HG-U133A:210297_s_at:316:525; Interrogation_Position = 190; Antisense;
    GGACTGCAAGTATATCGTGGTGGAG
    >probe:HG-U133A:210297_s_at:161:237; Interrogation_Position = 235; Antisense;
    CTGTTCTGTCAGTGAATGGATAATC
    >probe:HG-U133A:210297_s_at:653:591; Interrogation_Position = 265; Antisense;
    TGCTTCTAGTAGGCACAGGGCTCCC
    >probe:HG-U133A:210297_s_at:21:243; Interrogation_Position = 309; Antisense;
    CTGGCCTCTAATAGTCAATGATTGT
    Probes sequences for 202475_at:
    >probe:HG-U133A:202475_at:161:153; Interrogation_Position = 302; Antisense;
    AAGGCGGCATCTATGACTTCATTGG
    >probe:HG-U133A:202475_at:696:273; Interrogation_Position = 375; Antisense;
    CCTTGTCATGTCCCGGAATGCCGGC
    >probe:HG-U133A:202475_at:159:595; Interrogation_Position = 444; Antisense;
    TGCTGAGCTTATTATGTCCCGCTGC
    >probe:HG-U133A:202475_at:29:83; Interrogation_Position = 486; Antisense;
    AGCCCGGGGCATTGAGTTTGACTGG
    >probe:HG-U133A:202475_at:298:23; Interrogation_Position = 541; Antisense;
    ATCAGTCTGGGTCCATACATCGTCG
    >probe:HG-U133A:202475_at:424:249; Interrogation_Position = 564; Antisense;
    CGCGTCTGCTCAGGTCTGGATGATA
    >probe:HG-U133A:202475_at:69:647; Interrogation_Position = 587; Antisense;
    TAACACGCTATGATCTGTACCACAC
    >probe:HG-U133A:202475_at:427:227; Interrogation_Position = 628; Antisense;
    CTCCTGCTGATGTTCCTCAGAGTTT
    >probe:HG-U133A:202475_at:701:661; Interrogation_Position = 666; Antisense;
    TATGGAGACGTTTGTCCACCTATGC
    >probe:HG-U133A:202475_at:414:703; Interrogation_Position = 770; Antisense;
    TTGCCGTTGTCAATGTGCACTCCTA
    >probe:HG-U133A:202475_at:193:607; Interrogation_Position = 790; Antisense;
    TCCTAGGCTTGGTGTCTCAGACATT
    Probes sequences for 206160_at:
    >probe:HG-U133A:206160_at:564:47; Interrogation_Position = 576; Antisense;
    AGGAGGCTGGCTGTAAACTGCGCAT
    >probe:HG-U133A:206160_at:194:643; Interrogation_Position = 589; Antisense;
    TAAACTGCGCATCATGAAGCCCCAG
    >probe:HG-U133A:206160_at:30:83; Interrogation_Position = 606; Antisense;
    AGCCCCAGGACTTCGAATATGTCTG
    >probe:HG-U133A:206160_at:521:449; Interrogation_Position = 739; Antisense;
    GTAGGGCAACTGGGCTTTGCCTCAC
    >probe:HG-U133A:206160_at:138:649; Interrogation_Position = 800; Antisense;
    TACAGCCATCTGGGACATGCCTGTC
    >probe:HG-U133A:206160_at:470:599; Interrogation_Position = 817; Antisense;
    TGCCTGTCTTCCTAATACCATTTGG
    >probe:HG-U133A:206160_at:75:165; Interrogation_Position = 866; Antisense;
    CAATCATACTGGACAAGGCCCTTAG
    >probe:HG-U133A:206160_at:239:139; Interrogation_Position = 995; Antisense;
    AACCATACATGGGCTCCAGTCAACT
    >probe:HG-U133A:206160_at:286:381; Interrogation_Position = 1024; Antisense;
    GACTGAAGGTCCTAATTGCTCACCC
    >probe:HG-U133A:206160_at:557:703; Interrogation_Position = 1039; Antisense;
    TTGCTCACCCAAGGGGGCTGCTTAA
    >probe:HG-U133A:206160_at:265:111; Interrogation_Position = 1063; Antisense;
    ACACAAACAGCCTCAGACCCGAGGT
    Probes sequences for 215275_at:
    >probe:HG-U133A:215275_at:220:661; Interrogation_Position = 19; Antisense;
    TAGGGACTCCAAAGGCAGCTGACAG
    >probe:HG-U133A:215275_at:356:309; Interrogation_Position = 36; Antisense;
    GCTGACAGCATCTGGCTTTCAGTTC
    >probe:HG-U133A:215275_at:660:53; Interrogation_Position = 64; Antisense;
    AGTCACCACTACTTTGTACCAAATT
    >probe:HG-U133A:215275_at:682:449; Interrogation_Position = 79: Antisense;
    GTACCAAATTCACTGTTTTGGCTCT
    >probe:HG-U133A:215275_at:597:517; Interrogation_Position = 128; Antisense;
    GGATGTCTGCATTGCTCATGCAAAT
    >probe:HG-U133A:215275_at:552:107; Interrogation_Position = 175; Antisense;
    ACACCATCACCACCCAAATGAAAAG
    >probe:HG-U133A:215275_at:496:629; Interrogation_Position = 215; Antisense;
    TCATCAGCCTACTAATGTCATCTCC
    >probe:HG-U133A:215275_at:87:227; Interrogation_Position = 256; Antisense;
    CTCCATCCCAAAAAAGCATCCAGTT
    >probe:HG-U133A:215275_at:475:75; Interrogation_Position = 270; Antisense;
    AGCATCCAGTTCAGAATTGCCCACT
    >probe:HG-U133A:215275_at:497:217; Interrogation_Position = 488; Antisense;
    CTGAGAAACCTACCCTGGTGACCAA
    >probe:HG-U133A:215275_at:326:341; Interrogation_Position = 519; Antisense;
    GAATTGGCAGGTTTTGCTTCAAAAT
    Probes sequences for 214463_x_at:
    >probe:HG-U133A:214463_x_at:264:35; Interrogation_Position = 46; Antisense;
    ATGTCTGGCCGCGGCAAAGGCGGGA
    >probe:HG-U133A:214463_x_at:659:315; Interrogation_Position = 91; Antisense;
    GCTAAGCGCCACCGTAAAGTACTGC
    >probe:HG-U133A:214463_x_at:115:303; Interrogation_Position = 116; Antisense;
    GCGACAATATCCAGGGCATCACCAA
    >probe:HG-U133A:214463_x_at:692:207; Interrogation_Position = 157; Antisense;
    CTTGCTCGCCGCGGCGGCGTGAAGC
    >probe:HG-U133A:214463_x_at:706:25; Interrogation_Position = 184; Antisense;
    ATCTCCGGCCTCATCTACGAGGAGA
    >probe:HG-U133A:214463_x_at:629:379; Interrogation_Position = 207; Antisense;
    GACTCGCGGGGTGCTGAAGGTGTTC
    >probe:HG-U133A:214463_x_at:20:435; Interrogation_Position = 228; Antisense;
    GTTCCTGGAGAACGTGATCCGGGAC
    >probe:HG-U133A:214463_x_at:588:385; Interrogation_Position = 258; Antisense;
    GACCTATACAGAGCACGCCAAGCGC
    >probe:HG-U133A:214463_x_at:215:459; Interrogation_Position = 289; Antisense;
    GTCACCGCCATGGATGTGGTCTACG
    >probe:HG-U133A:214463_x_at:269:631; Interrogation_Position = 340; Antisense;
    TACGGTTTCGGTGGTTGAGCGTCCT
    >probe:HG-U133A:214463_x_at:55:581; Interrogation_Position = 355; Antisense;
    TGAGCGTCCTTTTCTACCAATAAAA
    Probes sequences for 201145_at:
    >probe:HG-U133A:201145_at:697:577; Interrogation_Position = 614; Antisense;
    TGATGCAAGAAGTGAATCCCCCCAA
    >probe:HG-U133A:201145_at:457:415; Interrogation_Position = 681; Antisense;
    GATGATGTATGGCCTATGGACCCCC
    >probe:HG-U133A:201145_at:384:67; Interrogation_Position = 799; Antisense;
    AGAGCATCTCTGTGACCAAGATCAC
    >probe:HG-U133A:201145_at:77:563; Interrogation_Position = 869; Antisense;
    TGAGGGCCGGACAGAGACTACAGTA
    >probe:HG-U133A:201145_at:283:67; Interrogation_Position = 883; Antisense;
    AGACTACAGTAACCCGACACGAAGC
    >probe:HG-U133A:201145_at:693:421; Interrogation_Position = 909; Antisense;
    GATAGCAGTCCTAGGGGTGATCCAG
    >probe:HG-U133A:201145_at:636:299; Interrogation_Position = 951; Antisense;
    GCCCTGGATGATGCCTTTTCCATCC
    >probe:HG-U133A:201145_at:585:601; Interrogation_Position = 969; Antisene;
    TCCATCCTGGACTTATTCCTGGGAC
    >probe:HG-U133A:201145_at:46:451; Interrogation_Position = 1010; Antisense;
    GTAGCCTTGTTAACCCTCAGAGGCC
    >probe:HG-U133A:201145_at:439:703; Interrogation_Position = 1090; Antisense;
    TTGCCACCTCAGGGGCTTGGATATG
    >probe:HG-U133A:201145_at:603:671; Interrogation_Position = 1142; Antisense;
    TTTGTCACTCACCCAAACTGACCAA

Claims (23)

We claim:
1. A method of diagnosing lung cancer in an individual comprising the steps of:
a) measuring a biological sample comprising lung epithelial tissue from the individual for the expression of at least 20 gene transcripts from Table 6;
b) comparing the expression of the at least 20 gene transcripts to a control sample of those transcripts from individuals without cancer,
wherein increased expression of the gene transcripts as indicated by a negative score in the last column of Table 6 and/or decreased expression of the gene transcripts as indicated by a positive score in the last column of Table 6 is indicative of the individual having lung cancer.
2. The method of claim 1, wherein at least 40 gene transcripts are measured.
3. The method of claim 1, wherein at least 60 gene transcripts are measured.
4. The method of claim 1, wherein at least 70 gene transcripts are measured.
5. The method of claim 1, wherein the gene transcript measured is set forth in Table 5.
6. The method of claim 1, wherein the gene transcript measured is set forth in Table 7.
7. The method of claim 1, wherein the gene transcript measured is set forth in Table 1 wherein the measurement of the gene transcript relative to the control uses the third column of Table 1 setting forth direction of expression in lung cancer to determine if the individual has lung cancer.
8. The method of claim 7, wherein the transcript measured is at least Table 3.
9. The method of claim 7, wherein the transcript used is at least the transcripts set forth in Table 4.
10. A method of diagnosing a lung disease in an individual exposed to air pollutant comprising the steps of
a) measuring expression profile of a gene group in the sample from a test individual; and
b) comparing the expression profile of the test individual to an expression profile of a first control individual exposed to similar air pollutant who does not have the lung disease and the expression profile of a second control individual exposed to similar air pollutants who has the lung disease,
wherein similarity of the expression profile of the test individual with the expression profile of the first control rather than that of the second control is indicative of the test individual being not affected with the lung disease and wherein similarity of the expression profile of the test individual with the expression profile of the second control individual rather than that of the first control individual is indicative of the test individual being affected or at high risk of developing the lung disease.
11. The method of claim 7, wherein the gene group comprises at least 30, sequences of genes selected from the group consisting with GenBank identification Nos. NM_003335; NM_000918; NM_006430.1; NM_001416.1; NM_004090; NM_006406.1; NM_003001.2; NM_001319; NM_006545.1; NM_021145.1; NM_002437.1; NM_006286; NM_001003698 /// NM_001003699 /// NM_002955; NM_001123 /// NM_006721; NM_024824; NM_004935.1; NM_002853.1; NM_019067.1; NM_024917.1; NM_020979.1; NM_005597.1; NM_007031.1; NM_009590.1; NM_020217.1; NM_025026.1; NM_014709.1; NM_014896.1; AF010144; NM_005374.1; NM_001696; NM_005494 /// NM_058246; NM_006534 /// NM_181659; NM_006368; NM_002268 /// NM_032771; NM_014033; NM_016138; NM_007048 /// NM_194441; NM_006694; NM_000051 /// NM _138292 /// NM_138293; NM_000410 /// NM_139002 /// NM_139003 /// NM_139004 /// NM_139005 /// NM_139006 /// NM_139007 /// NM_139008 /// NM_139009 /// NM_139010 /// NM_139011; NM_004691; NM_012070 /// NM_139321 /// NM_139322; NM_006095; AI632181; AW024467; NM_021814; NM_005547.1; NM_203458; NM_015547 /// NM_147161; AB007958.1; NM_207488; NM_005809 /// NM_181737 /// NM_181738; NM_016248 /// NM_144490; AK022213.1; NM_005708; NM_207102; AK023895; NM_144606 /// NM_144997; NM_018530; AK021474; U43604.1; AU147017; AF222691.1; NM_015116; NM_001005375 /// NM_001005785 /// NM_001005786 /// NM_004081 /// NM_020363 /// NM_020364 /// NM_020420; AC004692; NM_001014; NM_000585 /// NM_172174 /// NM_172175; NM_054020 /// NM_172095 /// NM_172096 /// NM_172097; BE466926; NM_018011; NM_024077; NM_012394; NM_019011 /// NM_207111 /// NM_207116; NM_017646; NM_021800; NM_016049; NM_014395; NM_014336; NM_018097; NM_019014; NM_024804; NM_018260; NM_018118; NM_014128; NM_024084; NM_005294; AF077053; NM_138387; NM_024531; NM_000693; NM_018509; NM_033128; NM_020706; AI523613; and NM_014884.
12. The method of claim 1, wherein the gene group comprises sequences of genes selected from the group consisting of genes with GenBank identification Nos. NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC005969.1; NM_002271.1; AL566172; AB014576.1; BF218804; AK022494.1; AA114843; BE467941; NM_003541.1; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AU147182; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_019023.1; NM_021971.1; NM_014128.1; AK025651.1; AA133341; and AF198444.1.
13. The method of claim 1, wherein the gene group comprises sequences of genes selected from the group consisting of genes with GenBank identification Nos. NM_007062.1; NM_001281.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; NM_002268 /// NM_032771; NM_007048 /// NM_194441; NM_006694; U85430.1; NM_004691; AB014576.1; BF218804; BE467941; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_021971.1; NM_014128.1; AA133341; and AF198444.1
14. The method of claim 1, wherein the group comprises sequences of genes selected from the group consisting of genes with GenBank or Unigene identification Nos. NM_030757.1; R83000; AK021571.1; NM_014182.1; NM_17932.1; U85430.1; AI683552; BC002642.1; AW024467; NM_030972.1; BC021135.1; AL161952.1; AK026565.1; AK023783.1; BF218804; NM_001281.1; NM_024006.1; AK023843.1; BC001602.1; BC034707.1; BC064619.1; AY280502.1; BC059387.1; AF135421.1; BC061522.1; L76200.1; U50532.1; BC006547.2; BC008797.2; BC000807.1; AL080112.1; BC033718.1 /// BC046176.1 /// BC038443.1; NM_000346.1; BC008710.1; Hs.288575 (UNIGENE ID); AF020591.1; BC000423.2; BC002503.2; BC008710.1; BC009185.2; Hs.528304 (UNIGENE ID); U50532.1; BC013923.2; BC031091; NM_007062; Hs.249591 (Unigene ID); BC075839.1 /// BC073760.1; BC072436.1 /// BC004560.2; BC001016.2; Hs.286261 (Unigene ID); AF348514.1; BC005023.1; BC066337.1 /// BC058736.1 /// BC050555.1; Hs.216623 (Unigene ID); BC072400.1; BC041073.1; U43965.1; BC021258.2; BC016057.1; BC016713.1 /// BC014535.1 /// AF237771.1; BC000360.2; BC007455.2; BC000701.2; BC010067.2; BC023528.2 /// BC047680.1; BC064957.1; Hs.156701 (Unigene ID); BC030619.2; BC008710.1; U43965.1; BC066329.1; Hs.438867 (Unigene ID); BC035025.2 /// BC050330.1; BC023976.2; BC074852.2 /// BC074851.2; Hs.445885 (Unigene ID); BC008591.2 /// BC050440.1 ///; BC048096.1; AF365931.1; AF257099.1; and BC028912.1.
15. The method of claim 1, wherein the group comprises sequences of genes selected from the group consisting of genes with GenBank or Unigene identification Nos: NM_003335; NM_001319; NM_021145.1; NM_001003698 /// NM_001003699 ///; NM_002955; NM_002853.1; NM_019067.1; NM_024917.1; NM_020979.1; NM_005597.1; NM_007031.1; NM_009590.1; NM020217.1; NM_025026.1; NM_014709.1; NM_014896.1; AF010144; NM_005374.1; NM_006534 /// NM_181659; NM_014033; NM_016138; NM_007048 /// NM_194441; NM_000051 /// NM_138292 /// NM_138293; NM_000410 /// NM_139002 /// NM_139003 /// NM_139004 /// NM_139005 /// NM_139006 /// NM_139007 /// NM_139008 /// NM_139009 /// NM_139010 /// NM_139011; NM_012070 /// NM_139321 /// NM_139322; NM_006095; AI632181; AW024467; NM_021814; NM_005547.1; NM_203458; NM_015547 /// NM_147161; AB007958.1; NM_207488; NM_005809 /// NM_181737 /// NM_181738; NM_016248 /// NM_144490; AK022213.1; NM_005708; NM_207102; AK023895; NM_144606 /// NM_144997; NM_018530; AK021474; U43604.1; AU147017; AF222691.1; NM_015116; NM_001005375 /// NM_001005785 /// NM_001005786 /// NM_004081 /// NM_020363 /// NM_020364 /// NM_020420; AC004692; NM_001014; NM_000585 /// NM_172174 /// NM172175; NM_054020 /// NM_172095 /// NM_172096 /// NM_172097; BE466926; NM_018011; NM_024077; NM_019011 /// NM207111 /// NM207116; NM_017646; NM_014395; NM_014336; NM_018097; NM_019014; NM_024804; NM_018260; NM_018118; NM_014128; NM_024084; NM_005294; AF077053; NM_000693; NM_033128; NM_020706; AI523613; and NM_014884, and wherein decrease in expression of at least 5 of these genes is indicative of the individual being affected with a lung disease.
16. The method of claim 1, wherein the group comprises sequences of genes selected from the group consisting of genes with GenBank or Unigene identification Nos NM_030757.1; R83000; AK021571.1; NM_17932.1; U85430.1; AI683552; BC002642.1; AW024467; NM_030972.1; BC021135.1; AL161952.1; AK026565.1; AK023783.1; BF218804; AK023843.1; BC001602.1; BC034707.1; BC064619.1; AY280502.1; BC059387.1; BC061522.1; U50532.1; BC006547.2; BC008797.2; BC000807.1; AL080112.1; BC033718.1 /// BC046176.1 ///; BC038443.1; Hs.288575 (UNIGENE ID); AF020591.1; BC002503.2; BC009185.2; Hs.528304 (UNIGENE ID); U50532.1; BC013923.2; BC031091; Hs.249591 (Unigene ID); Hs.286261 (Unigene ID); AF348514.1; BC066337.1 /// BC058736.1 /// BC050555.1; Hs.216623 (Unigene ID); BC072400.1; BC041073.1; U43965.1; BC021258.2; BC016057.1; BC016713.1 /// BC014535.1 /// AF237771.1; BC000701.2; BC010067.2; Hs.156701 (Unigene ID); BC030619.2; U43965.1; Hs.438867 (Unigene ID); BC035025.2 /// BC050330.1; BC074852.2 /// BC074851.2; Hs.445885 (Unigene ID); AF365931.1; and AF257099.1, and wherein decrease in expression of at least 5 of these genes is indicative of the individual being affected with a luncg disease.
17. The method of claim 1, wherein the group comprises sequences of genes selected from the group consisting of genes with GenBank or Unigene identification Nos BF218804; AK022494.1; AA114843; BE467941; NM_003541.1; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AU147182; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_019023.1; NM_021971.1; NM_014128.1; AK025651.1; AA133341; and AF198444.1, and wherein decrease in expression of at least 5 of these genes is indicative of the individual being affected with a lung disease.
18. The method of claim 1, wherein the group comprises sequences of genes selected from the group consisting of genes with GenBank or Unigene identification Nos NM_000918; NM_006430.1; NM_001416.1; NM_004090; NM_006406.1; NM_003001.2; NM_006545.1; NM_002437.1; NM_006286; NM_001123 /// NM_006721; NM_024824; NM_004935.1; NM_001696; NM_005494 /// NM_058246; NM_006368; NM_002268 /// NM _032771; NM_006694; NM_004691; NM_012394; NM_021800; NM_016049; NM_138387; NM_024531; and NM_018509, and wherein increase in expression of at least 5 of these genes is indicative of the individual being affected with a lung disease.
19. The method of claim 1, wherein the group comprises sequences of genes selected from the group consisting of genes with GenBank or Unigene identification Nos NM_014182.1; NM_001281.1; NM_024006.1; AF135421.1; L76200.1; NM_000346.1; BC008710.1; BC000423.2; BC008710.1; NM_007062; BC075839.1 /// BC073760.1; BC072436.1 /// BC004560.2; BC001016.2; BC005023.1; BC000360.2; BC007455.2; BC023528.2 /// BC047680.1; BC064957.1; BC008710.1; BC066329.1; BC023976.2; BC008591.2 /// BC050440.1 /// BC048096.1; and BC028912.1, and wherein increase in expression of at least 5 of these genes is indicative of the individual being affected with a lung disease.
20. The method of claim 1, wherein the group comprises sequences of genes selected from the group consisting of genes with GenBank or Unigene identification Nos NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC005969.1; NM_002271.1; AL566172; and AB014576.1, and wherein increase in expression of at least 5 of these genes is indicative of the individual being affected with a lung disease.
21. The method of claim 1, wherein the group comprises 5-9 sequences of genes selected from group 1 and group 2, group 1 consisting of genes with GenBank or Unigene identification Nos. NM_003335; NM_001319; NM_021145.1; NM_001003698 /// NM_001003699 ///; NM_002955; NM_002853.1; NM_019067.1; NM_024917.1; NM_020979.1; NM_005597.1; NM_007031.1; NM_009590.1; NM_020217.1; NM_025026.1; NM_014709.1; NM_014896.1; AF010144; NM_005374.1; NM_006534 /// NM_181659; NM_014033; NM_016138; NM_007048 /// NM_194441; NM_000051 /// NM _138292 /// NM_138293; NM_000410 /// NM_139002 /// NM_139003 /// NM_139004 /// NM_139005 /// NM_139006 /// NM_139007 /// NM_139008 /// NM_139009 /// NM_139010 /// NM_139011; NM_012070 /// NM_139321 /// NM_139322; NM_006095; AI632181; AW024467; NM_021814; NM_005547.1; NM_203458; NM_015547 /// NM_147161; AB007958.1; NM_207488; NM_005809 /// NM_181737 /// NM_181738; NM_016248 /// NM_144490; AK022213.1; NM_005708; NM_207102; AK023895; NM_144606 /// NM_144997; NM_018530; AK021474; U43604.1; AU147017; AF222691.1; NM_015116; NM_001005375 /// NM_001005785 /// NM_001005786 /// NM_004081 /// NM_020363 /// NM_020364 /// NM_020420; AC004692; NM_001014; NM_000585 /// NM_172174 /// NM_172175; NM_054020 /// NM_172095 /// NM_172096 /// NM_172097; BE466926; NM_018011; NM_024077; NM_019011 /// NM_207111 /// NM_207116; NM_017646; NM_014395; NM_014336; NM_018097; NM_019014; NM_024804; NM_018260; NM_018118; NM_014128; NM_024084; NM_005294; AF077053; NM_000693; NM_033128; NM_020706; AI523613; and NM_014884, and group 2 consisting of genes with GenBank or Unigene identification Nos. NM_000918; NM_006430.1; NM_001416.1; NM_004090; NM_006406.1; NM_003001.2; NM_006545.1; NM_002437.1; NM_006286; NM_001123 /// NM_006721; NM_024824; NM_004935.1; NM_001696; NM_005494 /// NM_058246; NM_006368; NM_002268 /// NM_032771; NM_006694; NM_004691; NM_012394; NM_021800; NM_016049; NM_138387; NM_024531; and NM_018509, and a group of at least 20 genes selected from group 3 and group 4, group 3 consisting of genes with GenBank or Unigene identification Nos BF218804; AK022494.1; AA114843; BE467941; NM_003541.1; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AU147182; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_019023.1; NM_021971.1; NM_014128.1; AK025651.1; AA133341; and AF198444.1 and group 4 consisting of genes with GenBank or Unigene identification Nos. NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC005969.1; NM_002271.1; AL566172; and AB014576.1.
22. The method of claim 20, wherein decrease in the expression of any one of the group 1 genes and increase in the expression of any one of the group 2 genes, and decrease of the group 3 genes and increase of the group 4 genes is indicative of the individual being affected with a lung disease.
23. The method of claim 1, wherein the group comprises 5-9 sequences of genes selected from group 5 and group 6, group 5 consisting of genes with GenBank or Unigene identification Nos. NM_030757.1; R83000; AK021571.1; NM_17932.1; U85430.1; AI683552; BC002642.1; AW024467; NM_030972.1; BC021135.1; AL161952.1; AK026565.1; AK023783.1; BF218804; AK023843.1; BC001602.1; BC034707.1; BC064619.1; AY280502.1; BC059387.1; BC061522.1; U50532.1; BC006547.2; BC008797.2; BC000807.1; AL080112.1; BC033718.1 /// BC046176.1 ///; BC038443.1; Hs.288575 (UNIGENE ID); AF020591.1; BC002503.2; BC009185.2; Hs.528304 (UNIGENE ID); U50532.1; BC013923.2; BC031091; Hs.249591 (Unigene ID); Hs.286261 (Unigene ID); AF348514.1; BC066337.1 /// BC058736.1 /// BC050555.1; Hs.216623 (Unigene ID); BC072400.1; BC041073.1; U43965.1; BC021258.2; BC016057.1; BC016713.1 /// BC014535.1 /// AF237771.1; BC000701.2; BC010067.2; Hs.156701 (Unigene ID); BC030619.2; U43965.1; Hs.438867 (Unigene ID); BC035025.2 /// BC050330.1; BC074852.2 /// BC074851.2; Hs.445885 (Unigene ID); AF365931.1; and AF257099.1, and group 6 consisting of genes with GenBank or Unigene identification Nos. NM_014182.1; NM_001281.1; NM_024006.1; AF135421.1; L76200.1; NM_000346.1; BC008710.1; BC000423.2; BC008710.1; NM_007062; BC075839.1 /// BC073760.1; BC072436.1 /// BC004560.2; BC001016.2; BC005023.1; BC000360.2; BC007455.2; BC023528.2 /// BC047680.1; BC064957.1; BC008710.1; BC066329.1; BC023976.2; BC008591.2 /// BC050440.1 ///; BC048096.1; and BC028912.1, and a group of at least 20 genes selected from group 3 and group 4, group 3 consisting of genes with GenBank or Unigene identification Nos BF218804; AK022494.1; AA114843; BE467941; NM_003541.1; R83000; AL161952.1; AK023843.1; AK021571.1; AK023783.1; AU147182; AL080112.1; AW971983; AI683552; NM_024006.1; AK026565.1; NM_014182.1; NM_021800.1; NM_016049.1; NM_019023.1; NM_021971.1; NM_014128.1; AK025651.1; AA133341; and AF198444.1 and group 4 consisting of genes with GenBank or Unigene identification Nos. NM_007062.1; NM_001281.1; BC000120.1; NM_014255.1; BC002642.1; NM_000346.1; NM_006545.1; BG034328; NM_021822.1; NM_021069.1; NM_019067.1; NM_017925.1; NM_017932.1; NM_030757.1; NM_030972.1; AF126181.1; U93240.1; U90552.1; AF151056.1; U85430.1; U51007.1; BC005969.1; NM_002271.1; AL566172; and AB014576.1, wherein decrease in the expression of the group 5 genes and increase in the expression of the group 6 genes, and decrease the group 3 genes and increase of the group 4 genes is indicative of the individual being affected with a lung disease.
US16/510,584 2005-04-14 2019-07-12 Diagnostic for lung disorders using class prediction Abandoned US20190376148A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US16/510,584 US20190376148A1 (en) 2005-04-14 2019-07-12 Diagnostic for lung disorders using class prediction

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
US67124305P 2005-04-14 2005-04-14
PCT/US2006/014132 WO2006113467A2 (en) 2005-04-14 2006-04-14 Diagnostic for lung disorders using class prediction
US91858808A 2008-02-08 2008-02-08
US12/869,525 US20120041686A1 (en) 2005-04-14 2010-08-26 Diagnostic for lung disorders using class prediction
US13/524,749 US20130023437A1 (en) 2005-04-14 2012-06-15 Diagnostic for lung disorders using class prediction
US14/613,210 US9920374B2 (en) 2005-04-14 2015-02-03 Diagnostic for lung disorders using class prediction
US15/888,831 US10808285B2 (en) 2005-04-14 2018-02-05 Diagnostic for lung disorders using class prediction
US16/510,584 US20190376148A1 (en) 2005-04-14 2019-07-12 Diagnostic for lung disorders using class prediction

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US15/888,831 Continuation US10808285B2 (en) 2005-04-14 2018-02-05 Diagnostic for lung disorders using class prediction

Publications (1)

Publication Number Publication Date
US20190376148A1 true US20190376148A1 (en) 2019-12-12

Family

ID=37115745

Family Applications (6)

Application Number Title Priority Date Filing Date
US11/918,588 Abandoned US20100035244A1 (en) 2005-04-14 2006-04-14 Diagnostic for lung disorders using class prediction
US12/869,525 Abandoned US20120041686A1 (en) 2005-04-14 2010-08-26 Diagnostic for lung disorders using class prediction
US13/524,749 Abandoned US20130023437A1 (en) 2005-04-14 2012-06-15 Diagnostic for lung disorders using class prediction
US14/613,210 Active US9920374B2 (en) 2005-04-14 2015-02-03 Diagnostic for lung disorders using class prediction
US15/888,831 Active US10808285B2 (en) 2005-04-14 2018-02-05 Diagnostic for lung disorders using class prediction
US16/510,584 Abandoned US20190376148A1 (en) 2005-04-14 2019-07-12 Diagnostic for lung disorders using class prediction

Family Applications Before (5)

Application Number Title Priority Date Filing Date
US11/918,588 Abandoned US20100035244A1 (en) 2005-04-14 2006-04-14 Diagnostic for lung disorders using class prediction
US12/869,525 Abandoned US20120041686A1 (en) 2005-04-14 2010-08-26 Diagnostic for lung disorders using class prediction
US13/524,749 Abandoned US20130023437A1 (en) 2005-04-14 2012-06-15 Diagnostic for lung disorders using class prediction
US14/613,210 Active US9920374B2 (en) 2005-04-14 2015-02-03 Diagnostic for lung disorders using class prediction
US15/888,831 Active US10808285B2 (en) 2005-04-14 2018-02-05 Diagnostic for lung disorders using class prediction

Country Status (6)

Country Link
US (6) US20100035244A1 (en)
EP (6) EP2360279A1 (en)
JP (2) JP2008537120A (en)
AU (1) AU2006236621B2 (en)
CA (1) CA2605158A1 (en)
WO (1) WO2006113467A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11639527B2 (en) 2014-11-05 2023-05-02 Veracyte, Inc. Methods for nucleic acid sequencing
US11976329B2 (en) 2013-03-15 2024-05-07 Veracyte, Inc. Methods and systems for detecting usual interstitial pneumonia

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1633892A4 (en) * 2003-06-10 2007-10-24 Univ Boston Detection methods for disorders of the lung
EP2360279A1 (en) 2005-04-14 2011-08-24 Trustees Of Boston University Diagnostic for lung disorders using class prediction
WO2007103541A2 (en) * 2006-03-09 2007-09-13 The Trustees Of Boston University Diagnostic and prognostic methods for lung disorders using gene expression profiles from nose epithelial cells
GB0620255D0 (en) * 2006-10-12 2006-11-22 Fusion Antibodies Ltd Antibody and uses thereof
EP2087140A2 (en) * 2006-11-13 2009-08-12 Source Precision Medicine, Inc. Gene expression profiling for identification, monitoring, and treatment of lung cancer
US20090203011A1 (en) * 2007-01-19 2009-08-13 Epigenomics Ag Methods and nucleic acids for analyses of cell proliferative disorders
EP2193211A4 (en) * 2007-09-19 2010-12-08 Univ Boston Identification of novel pathways for drug development for lung disease
WO2009121070A1 (en) * 2008-03-28 2009-10-01 The Trustees Of Boston University Multifactorial methods for detecting lung disorders
US9495515B1 (en) 2009-12-09 2016-11-15 Veracyte, Inc. Algorithms for disease diagnostics
US20100240546A1 (en) * 2009-03-20 2010-09-23 Samuel Chun Lap Lo Use of biomarkers for the diagnosis and prognosis of lung cancer
US10318877B2 (en) * 2010-10-19 2019-06-11 International Business Machines Corporation Cohort-based prediction of a future event
WO2012119150A2 (en) * 2011-03-03 2012-09-07 Cornell University Lung cancer-relevant human embryonic stem cell signature
WO2012125712A2 (en) * 2011-03-14 2012-09-20 Respira Health, Llc Lung tumor classifier for current and former smokers
US20150088430A1 (en) * 2012-04-26 2015-03-26 Allegro Diagnostics Corp Methods for evaluating lung cancer status
CN104885102A (en) * 2012-10-16 2015-09-02 里昂水务法国公司 Method for the real-time estimation of the total consumption of a fluid distributed to users, and a distribution network implementing said method
WO2014186036A1 (en) 2013-03-14 2014-11-20 Allegro Diagnostics Corp. Methods for evaluating copd status
US20160130656A1 (en) 2014-07-14 2016-05-12 Allegro Diagnostics Corp. Methods for evaluating lung cancer status
WO2017079571A1 (en) 2015-11-05 2017-05-11 Arphion Diagnostics Process for the indentication of patients at risk for oscc
US10370667B2 (en) * 2015-11-18 2019-08-06 Rosalind Franklin University Of Medicine And Science Antisense compounds targeting leucine-rich repeat kinase 2 (LRRK2) for the treatment of parkinsons disease
CN105653900B (en) * 2015-12-25 2019-03-26 北京百迈客生物科技有限公司 Without ginseng transcriptome analysis system and method
WO2017123648A1 (en) * 2016-01-12 2017-07-20 Arizona Board Of Regents On Behalf Of Arizona State University Plasma autoantibody biomarkers for diagnosis of lung cancer
KR102552778B1 (en) * 2016-06-29 2023-07-10 (주)아모레퍼시픽 Biomarker for identifying exposure to particulate matter and identification method using the same
US10927417B2 (en) 2016-07-08 2021-02-23 Trustees Of Boston University Gene expression-based biomarker for the detection and monitoring of bronchial premalignant lesions
JP7179766B2 (en) * 2017-05-12 2022-11-29 ラボラトリー コーポレイション オブ アメリカ ホールディングス Systems and methods for biomarker identification
US10463945B2 (en) * 2017-05-25 2019-11-05 Nike, Inc. Shin guard with sock engaging feature
CN108388768A (en) * 2018-02-08 2018-08-10 南京恺尔生物科技有限公司 Utilize the biological nature prediction technique for the neural network model that biological knowledge is built
WO2020041243A1 (en) * 2018-08-20 2020-02-27 Trustees Of Boston University Methods related to bronchial premalignant lesion severity and progression
WO2020092993A1 (en) * 2018-11-02 2020-05-07 The Trustees Of Columbia University In The City Of New York Pharmacologic treatment for right ventricular failure
US20230113252A1 (en) * 2020-03-13 2023-04-13 Terumo Kabushiki Kaisha Method of determining development of myocarditis or risk of myocarditis development
CN113109569B (en) * 2021-03-05 2022-08-19 李朴 Application of GSDMD as biomarker in differential diagnosis and curative effect evaluation of pleural effusion related diseases
CN113252896B (en) * 2021-05-13 2022-08-09 四川大学华西医院 Kit for predicting acute respiratory distress syndrome prognosis by using ND1 in alveolar lavage fluid

Family Cites Families (151)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US63603A (en) 1867-04-09 Improvement in animal traps
US1359808A (en) 1916-03-20 1920-11-23 Martin R Jacobus Poultry-feeder
US1359801A (en) 1920-03-03 1920-11-23 Goldner Joe Combination mattress and pillow
US3640268A (en) 1965-10-23 1972-02-08 Hugh J Davis Method and device for biopsy specimen collecting and handling
US4437975A (en) 1977-07-20 1984-03-20 Mobil Oil Corporation Manufacture of lube base stock oil
US4632901A (en) 1984-05-11 1986-12-30 Hybritech Incorporated Method and apparatus for immunoassays
US4641662A (en) 1984-09-28 1987-02-10 Jaicks John R Endocervical curette system
US5242794A (en) 1984-12-13 1993-09-07 Applied Biosystems, Inc. Detection of specific sequences in nucleic acids
US5333675C1 (en) 1986-02-25 2001-05-01 Perkin Elmer Corp Apparatus and method for performing automated amplification of nucleic acid sequences and assays using heating and cooling steps
US4965188A (en) 1986-08-22 1990-10-23 Cetus Corporation Process for amplifying, detecting, and/or cloning nucleic acid sequences using a thermostable enzyme
US4683195A (en) 1986-01-30 1987-07-28 Cetus Corporation Process for amplifying, detecting, and/or-cloning nucleic acid sequences
US4683202A (en) 1985-03-28 1987-07-28 Cetus Corporation Process for amplifying nucleic acid sequences
US4800896A (en) 1985-11-08 1989-01-31 Jalowayski Alfredo A Cell sample collector probe
US4800159A (en) 1986-02-07 1989-01-24 Cetus Corporation Process for amplifying, detecting, and/or cloning nucleic acid sequences
IL86724A (en) 1987-06-19 1995-01-24 Siska Diagnostics Inc Method and kits for the amplification and detection of nucleic acid sequences
CA1340843C (en) 1987-07-31 1999-12-07 J. Lawrence Burg Selective amplification of target polynucleotide sequences
JP2650159B2 (en) 1988-02-24 1997-09-03 アクゾ・ノベル・エヌ・ベー Nucleic acid amplification method
CA1340807C (en) 1988-02-24 1999-11-02 Lawrence T. Malek Nucleic acid amplification process
US4988617A (en) 1988-03-25 1991-01-29 California Institute Of Technology Method of detecting a nucleotide change in nucleic acids
HU216105B (en) 1988-12-16 1999-04-28 Akzo Nobel N.V. Amplification method with self-sustained sequence replication system
US5856092A (en) 1989-02-13 1999-01-05 Geneco Pty Ltd Detection of a nucleic acid sequence or a change therein
US5424186A (en) 1989-06-07 1995-06-13 Affymax Technologies N.V. Very large scale immobilized polymer synthesis
US5871928A (en) 1989-06-07 1999-02-16 Fodor; Stephen P. A. Methods for nucleic acid analysis
US6346413B1 (en) 1989-06-07 2002-02-12 Affymetrix, Inc. Polymer arrays
US5744101A (en) 1989-06-07 1998-04-28 Affymax Technologies N.V. Photolabile nucleoside protecting groups
US5242974A (en) 1991-11-22 1993-09-07 Affymax Technologies N.V. Polymer reversal on solid surfaces
US5143854A (en) 1989-06-07 1992-09-01 Affymax Technologies N.V. Large scale photolithographic solid phase synthesis of polypeptides and receptor binding screening thereof
US6040138A (en) 1995-09-15 2000-03-21 Affymetrix, Inc. Expression monitoring by hybridization to high density oligonucleotide arrays
US5547839A (en) 1989-06-07 1996-08-20 Affymax Technologies N.V. Sequencing of surface immobilized polymers utilizing microflourescence detection
US6309822B1 (en) 1989-06-07 2001-10-30 Affymetrix, Inc. Method for comparing copy number of nucleic acid sequences
US5527681A (en) 1989-06-07 1996-06-18 Affymax Technologies N.V. Immobilized molecular synthesis of systematically substituted compounds
US5800992A (en) 1989-06-07 1998-09-01 Fodor; Stephen P.A. Method of detecting nucleic acids
US5252743A (en) 1989-11-13 1993-10-12 Affymax Technologies N.V. Spatially-addressable immobilization of anti-ligands on surfaces
US6013431A (en) 1990-02-16 2000-01-11 Molecular Tool, Inc. Method for determining specific nucleotide variations by primer extension in the presence of mixture of labeled nucleotides and terminators
US5494810A (en) 1990-05-03 1996-02-27 Cornell Research Foundation, Inc. Thermostable ligase-mediated DNA amplifications system for the detection of genetic disease
EP0561796B1 (en) 1990-08-24 1997-12-29 The University Of Tennessee Research Corporation Dna amplification fingerprinting
WO1992007095A1 (en) 1990-10-15 1992-04-30 Stratagene Arbitrarily primed polymerase chain reaction method for fingerprinting genomes
US5726060A (en) 1991-09-17 1998-03-10 Bridges; Michael Anthony Method for culturing mammalian respiratory epithelial cells
JP3939338B2 (en) 1991-11-22 2007-07-04 アフィメトリックス, インコーポレイテッド Combinatorial strategies for polymer synthesis.
US5412087A (en) 1992-04-24 1995-05-02 Affymax Technologies N.V. Spatially-addressable immobilization of oligonucleotides and other biological polymers on surfaces
US5324633A (en) 1991-11-22 1994-06-28 Affymax Technologies N.V. Method and apparatus for measuring binding affinity
US5384261A (en) 1991-11-22 1995-01-24 Affymax Technologies N.V. Very large scale immobilized polymer synthesis using mechanically directed flow paths
US5550215A (en) 1991-11-22 1996-08-27 Holmes; Christopher P. Polymer reversal on solid surfaces
US5541061A (en) 1992-04-29 1996-07-30 Affymax Technologies N.V. Methods for screening factorial chemical libraries
US5422273A (en) 1993-03-23 1995-06-06 Baal Medical Products, Inc. Cell collection apparatus
US5491074A (en) 1993-04-01 1996-02-13 Affymax Technologies Nv Association peptides
US5639606A (en) 1993-04-06 1997-06-17 The University Of Rochester Method for quantitative measurement of gene expression using multiplex competitive reverse transcriptase-polymerase chain reaction
US5876978A (en) 1993-04-06 1999-03-02 Medical College Of Ohio Method for quantitative measurement of gene expression using multiplex competitive reverse transcriptase-polymerase chain reaction
US5643765A (en) 1993-04-06 1997-07-01 University Of Rochester Method for quantitative measurement of gene expression using multiplex competitive reverse transcriptase-polymerase chain reaction
US5477863A (en) 1993-04-14 1995-12-26 Grant; Michael A. Collection kit with a sample collector
US5858659A (en) 1995-11-29 1999-01-12 Affymetrix, Inc. Polymorphism detection
US5837832A (en) 1993-06-25 1998-11-17 Affymetrix, Inc. Arrays of nucleic acid probes on biological chips
US6045996A (en) 1993-10-26 2000-04-04 Affymetrix, Inc. Hybridization assays on oligonucleotide arrays
US5440942A (en) 1994-02-02 1995-08-15 Hubbard; Stephen H. Biological sample collecting and holding device
US5578832A (en) 1994-09-02 1996-11-26 Affymetrix, Inc. Method and apparatus for imaging a sample on a device
US5631734A (en) 1994-02-10 1997-05-20 Affymetrix, Inc. Method and apparatus for detection of fluorescently labeled materials
US6090555A (en) 1997-12-11 2000-07-18 Affymetrix, Inc. Scanned image alignment systems and methods
EP0758403B1 (en) 1994-05-05 1998-06-24 Beckman Instruments, Inc. Oligonucleotide repeat arrays
US5571639A (en) 1994-05-24 1996-11-05 Affymax Technologies N.V. Computer-aided engineering system for design of sequence arrays and lithographic masks
US5795716A (en) 1994-10-21 1998-08-18 Chee; Mark S. Computer-aided visualization and analysis system for sequence evaluation
US5959098A (en) 1996-04-17 1999-09-28 Affymetrix, Inc. Substrate preparation process
US5599695A (en) 1995-02-27 1997-02-04 Affymetrix, Inc. Printing molecular library arrays using deprotection agents solely in the vapor phase
US5795782A (en) 1995-03-17 1998-08-18 President & Fellows Of Harvard College Characterization of individual polymer molecules based on monomer-interface interactions
US5624711A (en) 1995-04-27 1997-04-29 Affymax Technologies, N.V. Derivatization of solid supports and methods for oligomer synthesis
US5545531A (en) 1995-06-07 1996-08-13 Affymax Technologies N.V. Methods for making a device for concurrently processing multiple biological chip assays
US5968740A (en) 1995-07-24 1999-10-19 Affymetrix, Inc. Method of Identifying a Base in a Nucleic Acid
US5733729A (en) 1995-09-14 1998-03-31 Affymetrix, Inc. Computer-aided probability base calling for arrays of nucleic acid probes on chips
US6147205A (en) 1995-12-15 2000-11-14 Affymetrix, Inc. Photocleavable protecting groups and methods for their use
US6114122A (en) 1996-03-26 2000-09-05 Affymetrix, Inc. Fluidics station with a mounting system and method of using
US5981956A (en) 1996-05-16 1999-11-09 Affymetrix, Inc. Systems and methods for detection of labeled materials
JP3402972B2 (en) 1996-11-14 2003-05-06 東京エレクトロン株式会社 Method for manufacturing semiconductor device
DE69823206T2 (en) 1997-07-25 2004-08-19 Affymetrix, Inc. (a Delaware Corp.), Santa Clara METHOD FOR PRODUCING A BIO-INFORMATICS DATABASE
DE69827154T2 (en) 1997-08-15 2006-03-09 Affymetrix, Inc. (n.d.Ges.d.Staates Delaware), Santa Clara POLYMORPHISM IDENTIFICATION WITH HELP CLUSTER ANALYSIS
DE69829402T2 (en) 1997-10-31 2006-04-13 Affymetrix, Inc. (a Delaware Corp.), Santa Clara EXPRESSION PROFILES IN ADULTS AND FOLDS ORGANS
US6013449A (en) 1997-11-26 2000-01-11 The United States Of America As Represented By The Department Of Health And Human Services Probe-based analysis of heterozygous mutations using two-color labelling
US6201639B1 (en) 1998-03-20 2001-03-13 James W. Overbeck Wide field of view and high speed scanning microscopy
US6428752B1 (en) 1998-05-14 2002-08-06 Affymetrix, Inc. Cleaning deposit devices that form microarrays and the like
US6269846B1 (en) 1998-01-13 2001-08-07 Genetic Microsystems, Inc. Depositing fluid specimens on substrates, resulting ordered arrays, techniques for deposition of arrays
US6185030B1 (en) 1998-03-20 2001-02-06 James W. Overbeck Wide field of view and high speed scanning microscopy
US6020135A (en) 1998-03-27 2000-02-01 Affymetrix, Inc. P53-regulated genes
US5936324A (en) 1998-03-30 1999-08-10 Genetic Microsystems Inc. Moving magnet scanner
US6085907A (en) 1998-05-08 2000-07-11 Institute Of Legal Medicine, University Of Bern Foldable cardboard box for contact-free drying and long-term storage of biological evidence recovered on cotton swabs and forensic evidence collection kit including same
JP3688585B2 (en) 1998-05-21 2005-08-31 ダイアデクスアス・インコーポレーテッド A novel method for diagnosing, monitoring and staging lung cancer
JP3565025B2 (en) 1998-07-07 2004-09-15 日産自動車株式会社 Jig changing apparatus and jig changing method
US6204375B1 (en) 1998-07-31 2001-03-20 Ambion, Inc. Methods and reagents for preserving RNA in cell and tissue samples
US6185561B1 (en) 1998-09-17 2001-02-06 Affymetrix, Inc. Method and apparatus for providing and expression data mining database
US6262216B1 (en) 1998-10-13 2001-07-17 Affymetrix, Inc. Functionalized silicon compounds and methods for their synthesis and use
AU2144000A (en) 1998-10-27 2000-05-15 Affymetrix, Inc. Complexity management and analysis of genomic dna
US20040241728A1 (en) * 1999-01-06 2004-12-02 Chondrogene Limited Method for the detection of lung disease related gene transcripts in blood
US6177248B1 (en) 1999-02-24 2001-01-23 Affymetrix, Inc. Downstream genes of tumor suppressor WT1
EP1165839A2 (en) 1999-03-26 2002-01-02 Whitehead Institute For Biomedical Research Universal arrays
US6218803B1 (en) 1999-06-04 2001-04-17 Genetic Microsystems, Inc. Position sensing with variable capacitance transducers
US6300070B1 (en) 1999-06-04 2001-10-09 Mosaic Technologies, Inc. Solid phase methods for amplifying multiple nucleic acids
US6746846B1 (en) 1999-06-30 2004-06-08 Corixa Corporation Methods for diagnosing lung cancer
EP1223866A1 (en) 1999-10-18 2002-07-24 Denis Marshall-Fraser Sample taking device
US6386749B1 (en) 2000-06-26 2002-05-14 Affymetrix, Inc. Systems and methods for heating and mixing fluids
US6383804B1 (en) 2000-07-13 2002-05-07 International Bioproducts, Inc. Sampling device with snap-off head and method of use
US6797471B2 (en) * 2000-08-04 2004-09-28 Board Of Regents, The University Of Texas System Detection and diagnosis of smoking related cancers
AU2002220009A1 (en) 2000-11-20 2002-05-27 Diadexus, Inc. Compositions and methods relating to lung specific genes and proteins
WO2002044331A2 (en) 2000-11-29 2002-06-06 Cangen International Dap-kinase and hoxa9, two human genes associated with genesis, progression, and aggressiveness of non-small cell lung cancer
US6391592B1 (en) 2000-12-14 2002-05-21 Affymetrix, Inc. Blocker-aided target amplification of nucleic acids
US6972181B2 (en) 2001-01-12 2005-12-06 Mount Sinai School Of Medicine Of New York University Differential diagnosis of cancer and other conditions based on expression of p63
WO2003040325A2 (en) 2001-02-01 2003-05-15 Curagen Corporation Novel proteins and nucleic acids encoding same
JP2004526154A (en) 2001-03-12 2004-08-26 モノジェン インコーポレイテッド Cell-based detection and disease state discrimination
WO2002072866A2 (en) 2001-03-14 2002-09-19 Medical College Of Ohio Method for quantitative measurement of gene expression for identifying individuals at risk for bronchogenic carcinoma
CA2444691A1 (en) 2001-04-18 2002-10-31 Protein Design Labs, Inc. Methods of diagnosis of lung cancer, compositions and methods of screening for modulators of lung cancer
US6494845B2 (en) 2001-05-04 2002-12-17 Oralscan Laboratories, Inc. Retractable brush for use with endoscope for brush biopsy
JP2005503145A (en) 2001-08-16 2005-02-03 ザ ユナイテッド ステイツ オブ アメリカ リプレゼンティッド バイ ザ シークレタリー デパートメント オブ ヘルス アンド ヒューマン サービシーズ Molecular characteristics of non-small cell lung cancer
EP1444361A4 (en) * 2001-09-28 2006-12-27 Whitehead Biomedical Inst Classification of lung carcinomas using gene expression analysis
WO2003040317A2 (en) * 2001-11-02 2003-05-15 Pfizer Products Inc. Lung cancer therapeutics and diagnostics
US20040005294A1 (en) 2002-02-25 2004-01-08 Ho-Young Lee IGFBP-3 in the diagnosis and treatment of cancer
EP1534854A4 (en) * 2002-03-28 2006-11-15 Ohio Med College Method and compositions for the diagnosis and treatment of non-small cell lung cancer using gene expression profiles
DE10219117C1 (en) 2002-04-29 2003-10-30 Adnagen Ag Use of lithium dodecyl sulfate for stabilizing RNA in solution, particularly during purification of RNA from cell lysate
WO2004005891A2 (en) 2002-07-10 2004-01-15 The Regents Of The University Of Michigan Expression profile of lung cancer
JP4790266B2 (en) 2002-09-30 2011-10-12 バイエル・シエリング・フアーマ・アクチエンゲゼルシヤフト Condensed azolepyrimidine derivatives
WO2004070062A2 (en) 2003-02-04 2004-08-19 Wyeth Compositions and methods for diagnosing and treating cancers
US20040241725A1 (en) 2003-03-25 2004-12-02 Wenming Xiao Lung cancer detection
WO2004091511A2 (en) 2003-04-10 2004-10-28 Genzyme Corporation Compositions and methods to diagnose and treat lung cancer
US20070105105A1 (en) * 2003-05-23 2007-05-10 Mount Sinai School Of Medicine Of New York University Surrogate cell gene expression signatures for evaluating the physical state of a subject
EP1633892A4 (en) * 2003-06-10 2007-10-24 Univ Boston Detection methods for disorders of the lung
WO2004111197A2 (en) * 2003-06-10 2004-12-23 Trustees Of Boston University Gene expression signatures, methods and compositions for diagnosing disorders of the lung
WO2005047451A2 (en) 2003-11-12 2005-05-26 Trustees Of Boston University Isolation of nucleic acid from mouth epithelial cells
EP1753873A4 (en) * 2004-03-05 2008-11-05 Ohio Med College Methods and compositions for assessing nucleic acids and alleles
WO2006056080A1 (en) 2004-11-29 2006-06-01 Diagnocure Inc. Gpx2 a specific and sensitive target for lung cancer diagnosis, prognosis and/or theranosis
US7811562B2 (en) 2004-12-03 2010-10-12 Schering Corporation Biomarkers for pre-selection of patients for anti-IGF1R therapy
US7862995B2 (en) 2004-12-10 2011-01-04 Targeted Molecular Diagnostics Methods and materials for predicting responsiveness to treatment with dual tyrosine kinase inhibitor
WO2006079033A2 (en) * 2005-01-21 2006-07-27 Gene Express, Inc. Methods and compositions for assessing nucleic acids
EP2360279A1 (en) 2005-04-14 2011-08-24 Trustees Of Boston University Diagnostic for lung disorders using class prediction
WO2007028162A2 (en) * 2005-09-02 2007-03-08 The University Of Toledo Methods and compositions for identifying biomarkers useful in diagnosis and/or treatment of biological states
WO2007103541A2 (en) 2006-03-09 2007-09-13 The Trustees Of Boston University Diagnostic and prognostic methods for lung disorders using gene expression profiles from nose epithelial cells
EP2193211A4 (en) 2007-09-19 2010-12-08 Univ Boston Identification of novel pathways for drug development for lung disease
US20090246779A1 (en) 2008-02-15 2009-10-01 University Of Washington Increasing genomic instability during premalignant neoplastic progression revealed through high resolution array-cgh
WO2009121070A1 (en) 2008-03-28 2009-10-01 The Trustees Of Boston University Multifactorial methods for detecting lung disorders
US9513300B2 (en) 2008-05-05 2016-12-06 Cornell University Determination of serum anti-mullerian hormone as a diagnostic test for spay in companion animals
WO2010054233A1 (en) 2008-11-08 2010-05-14 The Wistar Institute Of Anatomy And Biology Biomarkers in peripheral blood mononuclear cells for diagnosing or detecting lung cancers
CN102725632A (en) 2009-08-28 2012-10-10 奥斯瑞根公司 MiRNA biomarkers of lung disease
WO2012094744A1 (en) 2011-01-11 2012-07-19 University Health Network Prognostic signature for oral squamous cell carcinoma
US20120288860A1 (en) 2011-05-12 2012-11-15 George Mason University Differential gene expression for detecting and/or differentiating lung disease
US20150080243A1 (en) 2011-09-01 2015-03-19 Allegro Diagnostics Corp. Methods and compositions for detecting cancer based on mirna expression profiles
WO2013049152A2 (en) 2011-09-26 2013-04-04 Allegro Diagnostics Corp. Methods for evaluating lung cancer status
CA2865335A1 (en) 2012-03-09 2013-09-12 Caris Life Sciences Luxembourg Holdings, S.A.R.L. Biomarker compositions and methods
US20150088430A1 (en) 2012-04-26 2015-03-26 Allegro Diagnostics Corp Methods for evaluating lung cancer status
US9677138B2 (en) 2012-05-20 2017-06-13 Trustees Of Boston University Methods and systems for monitoring, diagnosing, and treating chronic obstructive pulmonary disease
WO2014186036A1 (en) 2013-03-14 2014-11-20 Allegro Diagnostics Corp. Methods for evaluating copd status
EP2971128B1 (en) 2013-03-15 2023-03-01 Veracyte, Inc. Biomarkers for diagnosis of lung diseases and methods of use thereof
US20160130656A1 (en) 2014-07-14 2016-05-12 Allegro Diagnostics Corp. Methods for evaluating lung cancer status
CN105321933B (en) 2014-08-01 2019-08-09 乾坤科技股份有限公司 Semiconductor package assembly and a manufacturing method thereof with conformal electromagnetic armouring structure
CN106252749B (en) 2015-06-04 2020-12-29 松下知识产权经营株式会社 Control method for battery pack and battery pack
WO2017197335A1 (en) 2016-05-12 2017-11-16 Trustees Of Boston University Nasal epithelium gene expression signature and classifier for the prediction of lung cancer
US10927417B2 (en) 2016-07-08 2021-02-23 Trustees Of Boston University Gene expression-based biomarker for the detection and monitoring of bronchial premalignant lesions
EP3510173A4 (en) 2016-09-07 2020-05-20 Veracyte, Inc. Methods and systems for detecting usual interstitial pneumonia

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11976329B2 (en) 2013-03-15 2024-05-07 Veracyte, Inc. Methods and systems for detecting usual interstitial pneumonia
US11639527B2 (en) 2014-11-05 2023-05-02 Veracyte, Inc. Methods for nucleic acid sequencing

Also Published As

Publication number Publication date
US20120041686A1 (en) 2012-02-16
JP2013179945A (en) 2013-09-12
US20100035244A1 (en) 2010-02-11
JP2008537120A (en) 2008-09-11
EP3770278A1 (en) 2021-01-27
WO2006113467A3 (en) 2007-04-05
US20130023437A1 (en) 2013-01-24
US10808285B2 (en) 2020-10-20
US9920374B2 (en) 2018-03-20
US20150354008A1 (en) 2015-12-10
US20180171418A1 (en) 2018-06-21
WO2006113467A9 (en) 2008-03-13
AU2006236621B2 (en) 2011-10-06
AU2006236621A1 (en) 2006-10-26
EP2360279A1 (en) 2011-08-24
EP2360278A1 (en) 2011-08-24
EP2390347A1 (en) 2011-11-30
EP3211093A1 (en) 2017-08-30
WO2006113467A2 (en) 2006-10-26
EP1874962A2 (en) 2008-01-09
CA2605158A1 (en) 2006-10-26

Similar Documents

Publication Publication Date Title
US20190376148A1 (en) Diagnostic for lung disorders using class prediction
US20200115763A1 (en) Detection methods for disorders of the lung
US20200248274A1 (en) Diagnostic for lung disorders using class prediction
EP2227568B1 (en) Molecular in vitro diagnosis of breast cancer
EP1934377B1 (en) Methods for identifying biomarkers useful in diagnosis of biological states
US20170328908A1 (en) Diagnostic and prognostic methods for lung disorders using gene expression profiles from nose epithelial cells
JP2021525069A (en) Cell-free DNA for assessing and / or treating cancer
US20230366034A1 (en) Compositions and methods for diagnosing lung cancers using gene expression profiles
CA2504403A1 (en) Prognostic for hematological malignancy
WO2017119510A1 (en) Test method, gene marker, and test agent for diagnosing breast cancer
EP2436763B1 (en) Method for assessing lymph node metastasis of cancer or the risk thereof, and rapid assessment kit for said method

Legal Events

Date Code Title Description
STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

AS Assignment

Owner name: TRUSTEES OF BOSTON UNIVERSITY, MASSACHUSETTS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BRODY, JEROME S.;SPIRA, AVRUM;BEANE-EBEL, JENNIFER E.;AND OTHERS;SIGNING DATES FROM 20201113 TO 20210311;REEL/FRAME:055998/0937

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: ADVISORY ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION