ZA200610644B - Oligonucleotides for cancer diagnosis - Google Patents
Oligonucleotides for cancer diagnosis Download PDFInfo
- Publication number
- ZA200610644B ZA200610644B ZA200610644A ZA200610644A ZA200610644B ZA 200610644 B ZA200610644 B ZA 200610644B ZA 200610644 A ZA200610644 A ZA 200610644A ZA 200610644 A ZA200610644 A ZA 200610644A ZA 200610644 B ZA200610644 B ZA 200610644B
- Authority
- ZA
- South Africa
- Prior art keywords
- cancer
- probes
- proteins
- genes
- sample
- Prior art date
Links
- 206010028980 Neoplasm Diseases 0.000 title claims description 153
- 108091034117 Oligonucleotide Proteins 0.000 title claims description 151
- 201000011510 cancer Diseases 0.000 title claims description 122
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 title claims description 81
- 238000003745 diagnosis Methods 0.000 title claims description 22
- 108090000623 proteins and genes Proteins 0.000 claims description 270
- 239000000523 sample Substances 0.000 claims description 257
- 238000000034 method Methods 0.000 claims description 145
- 230000014509 gene expression Effects 0.000 claims description 108
- 210000004027 cell Anatomy 0.000 claims description 102
- 239000002299 complementary DNA Substances 0.000 claims description 70
- 108020004999 messenger RNA Proteins 0.000 claims description 64
- 102000004169 proteins and genes Human genes 0.000 claims description 57
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 53
- 229920001184 polypeptide Polymers 0.000 claims description 52
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 52
- 201000010099 disease Diseases 0.000 claims description 50
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 50
- 108020005187 Oligonucleotide Probes Proteins 0.000 claims description 48
- 239000002751 oligonucleotide probe Substances 0.000 claims description 48
- 238000012360 testing method Methods 0.000 claims description 45
- 230000000295 complement effect Effects 0.000 claims description 37
- 239000007787 solid Substances 0.000 claims description 37
- 238000011835 investigation Methods 0.000 claims description 22
- 238000012544 monitoring process Methods 0.000 claims description 21
- 206010006187 Breast cancer Diseases 0.000 claims description 20
- 208000026310 Breast neoplasm Diseases 0.000 claims description 20
- 102000002278 Ribosomal Proteins Human genes 0.000 claims description 20
- 108010000605 Ribosomal Proteins Proteins 0.000 claims description 20
- 210000001519 tissue Anatomy 0.000 claims description 18
- 230000014616 translation Effects 0.000 claims description 16
- 102000005962 receptors Human genes 0.000 claims description 15
- 108020003175 receptors Proteins 0.000 claims description 15
- 102100040247 Tumor necrosis factor Human genes 0.000 claims description 14
- 230000001105 regulatory effect Effects 0.000 claims description 13
- 230000002441 reversible effect Effects 0.000 claims description 12
- 238000013519 translation Methods 0.000 claims description 10
- 108020004414 DNA Proteins 0.000 claims description 9
- 230000028993 immune response Effects 0.000 claims description 9
- 230000004044 response Effects 0.000 claims description 9
- 210000001124 body fluid Anatomy 0.000 claims description 8
- 239000010839 body fluid Substances 0.000 claims description 8
- 108010077544 Chromatin Proteins 0.000 claims description 7
- 102000008857 Ferritin Human genes 0.000 claims description 7
- 108050000784 Ferritin Proteins 0.000 claims description 7
- 238000008416 Ferritin Methods 0.000 claims description 7
- 108010063738 Interleukins Proteins 0.000 claims description 7
- 102000015696 Interleukins Human genes 0.000 claims description 7
- 210000003483 chromatin Anatomy 0.000 claims description 7
- 230000005764 inhibitory process Effects 0.000 claims description 7
- 229940047122 interleukins Drugs 0.000 claims description 7
- 239000002699 waste material Substances 0.000 claims description 7
- 241000282414 Homo sapiens Species 0.000 claims description 6
- 102000001253 Protein Kinase Human genes 0.000 claims description 6
- 210000003719 b-lymphocyte Anatomy 0.000 claims description 6
- -1 protcosome subunits Proteins 0.000 claims description 6
- 108060006633 protein kinase Proteins 0.000 claims description 6
- 238000013518 transcription Methods 0.000 claims description 6
- 230000035897 transcription Effects 0.000 claims description 6
- 102000034286 G proteins Human genes 0.000 claims description 5
- 108091006027 G proteins Proteins 0.000 claims description 5
- 102000002812 Heat-Shock Proteins Human genes 0.000 claims description 5
- 108010004889 Heat-Shock Proteins Proteins 0.000 claims description 5
- 108010033040 Histones Proteins 0.000 claims description 5
- 206010021143 Hypoxia Diseases 0.000 claims description 5
- 108060003951 Immunoglobulin Proteins 0.000 claims description 5
- 230000033228 biological regulation Effects 0.000 claims description 5
- 238000002405 diagnostic procedure Methods 0.000 claims description 5
- 239000003623 enhancer Substances 0.000 claims description 5
- 230000007954 hypoxia Effects 0.000 claims description 5
- 102000018358 immunoglobulin Human genes 0.000 claims description 5
- 239000003112 inhibitor Substances 0.000 claims description 5
- 210000005259 peripheral blood Anatomy 0.000 claims description 5
- 239000011886 peripheral blood Substances 0.000 claims description 5
- 210000003705 ribosome Anatomy 0.000 claims description 5
- 102000014914 Carrier Proteins Human genes 0.000 claims description 4
- 102000011727 Caspases Human genes 0.000 claims description 4
- 108010076667 Caspases Proteins 0.000 claims description 4
- 102000003951 Erythropoietin Human genes 0.000 claims description 4
- 108090000394 Erythropoietin Proteins 0.000 claims description 4
- 102000002241 Eukaryotic Initiation Factors Human genes 0.000 claims description 4
- 108010014863 Eukaryotic Initiation Factors Proteins 0.000 claims description 4
- 102000043136 MAP kinase family Human genes 0.000 claims description 4
- 108091054455 MAP kinase family Proteins 0.000 claims description 4
- 241000124008 Mammalia Species 0.000 claims description 4
- 102000007456 Peroxiredoxin Human genes 0.000 claims description 4
- 102000019197 Superoxide Dismutase Human genes 0.000 claims description 4
- 108010012715 Superoxide dismutase Proteins 0.000 claims description 4
- 230000004913 activation Effects 0.000 claims description 4
- 108091008324 binding proteins Proteins 0.000 claims description 4
- 229940105423 erythropoietin Drugs 0.000 claims description 4
- 230000036542 oxidative stress Effects 0.000 claims description 4
- 108030002458 peroxiredoxin Proteins 0.000 claims description 4
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 claims description 4
- 238000001243 protein synthesis Methods 0.000 claims description 4
- 102000005701 Calcium-Binding Proteins Human genes 0.000 claims description 3
- 108010045403 Calcium-Binding Proteins Proteins 0.000 claims description 3
- 102000016736 Cyclin Human genes 0.000 claims description 3
- 108050006400 Cyclin Proteins 0.000 claims description 3
- 102000004127 Cytokines Human genes 0.000 claims description 3
- 108090000695 Cytokines Proteins 0.000 claims description 3
- 102000000541 Defensins Human genes 0.000 claims description 3
- 108010002069 Defensins Proteins 0.000 claims description 3
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 claims description 3
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 claims description 3
- 108010050904 Interferons Proteins 0.000 claims description 3
- 102000014150 Interferons Human genes 0.000 claims description 3
- 206010033128 Ovarian cancer Diseases 0.000 claims description 3
- 206010061535 Ovarian neoplasm Diseases 0.000 claims description 3
- 108010044843 Peptide Initiation Factors Proteins 0.000 claims description 3
- 102000005877 Peptide Initiation Factors Human genes 0.000 claims description 3
- 108091023040 Transcription factor Proteins 0.000 claims description 3
- 102000040945 Transcription factor Human genes 0.000 claims description 3
- 102000018594 Tumour necrosis factor Human genes 0.000 claims description 3
- 108050007852 Tumour necrosis factor Proteins 0.000 claims description 3
- 210000000481 breast Anatomy 0.000 claims description 3
- 239000003102 growth factor Substances 0.000 claims description 3
- 229940079322 interferon Drugs 0.000 claims description 3
- ZIIUUSVHCHPIQD-UHFFFAOYSA-N 2,4,6-trimethyl-N-[3-(trifluoromethyl)phenyl]benzenesulfonamide Chemical compound CC1=CC(C)=CC(C)=C1S(=O)(=O)NC1=CC=CC(C(F)(F)F)=C1 ZIIUUSVHCHPIQD-UHFFFAOYSA-N 0.000 claims description 2
- KISWVXRQTGLFGD-UHFFFAOYSA-N 2-[[2-[[6-amino-2-[[2-[[2-[[5-amino-2-[[2-[[1-[2-[[6-amino-2-[(2,5-diamino-5-oxopentanoyl)amino]hexanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]-3-hydroxypropanoyl]amino]-5-oxopentanoyl]amino]-5-(diaminomethylideneamino)p Chemical compound C1CCN(C(=O)C(CCCN=C(N)N)NC(=O)C(CCCCN)NC(=O)C(N)CCC(N)=O)C1C(=O)NC(CO)C(=O)NC(CCC(N)=O)C(=O)NC(CCCN=C(N)N)C(=O)NC(CO)C(=O)NC(CCCCN)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 KISWVXRQTGLFGD-UHFFFAOYSA-N 0.000 claims description 2
- 102100022600 40S ribosomal protein S3a Human genes 0.000 claims description 2
- 102100022406 60S ribosomal protein L10a Human genes 0.000 claims description 2
- 102100022289 60S ribosomal protein L13a Human genes 0.000 claims description 2
- 102100021690 60S ribosomal protein L18a Human genes 0.000 claims description 2
- 102100023247 60S ribosomal protein L23a Human genes 0.000 claims description 2
- 102100021927 60S ribosomal protein L27a Human genes 0.000 claims description 2
- 101710187898 60S ribosomal protein L28 Proteins 0.000 claims description 2
- 102100022276 60S ribosomal protein L35a Human genes 0.000 claims description 2
- 102100031002 60S ribosomal protein L36a Human genes 0.000 claims description 2
- 102100036126 60S ribosomal protein L37a Human genes 0.000 claims description 2
- 102100036630 60S ribosomal protein L7a Human genes 0.000 claims description 2
- 102000052866 Amino Acyl-tRNA Synthetases Human genes 0.000 claims description 2
- 108700028939 Amino Acyl-tRNA Synthetases Proteins 0.000 claims description 2
- 102100022014 Angiopoietin-1 receptor Human genes 0.000 claims description 2
- 102000006996 Aryldialkylphosphatase Human genes 0.000 claims description 2
- 108010008184 Aryldialkylphosphatase Proteins 0.000 claims description 2
- 102000000905 Cadherin Human genes 0.000 claims description 2
- 108050007957 Cadherin Proteins 0.000 claims description 2
- 102100035882 Catalase Human genes 0.000 claims description 2
- 108010053835 Catalase Proteins 0.000 claims description 2
- 102000016362 Catenins Human genes 0.000 claims description 2
- 108010067316 Catenins Proteins 0.000 claims description 2
- 102000005600 Cathepsins Human genes 0.000 claims description 2
- 108010084457 Cathepsins Proteins 0.000 claims description 2
- 108010012236 Chemokines Proteins 0.000 claims description 2
- 102000019034 Chemokines Human genes 0.000 claims description 2
- 206010009944 Colon cancer Diseases 0.000 claims description 2
- 108010058546 Cyclin D1 Proteins 0.000 claims description 2
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 claims description 2
- 102000003849 Cytochrome P450 Human genes 0.000 claims description 2
- 102220519085 Cytochrome P450 1A2_S18C_mutation Human genes 0.000 claims description 2
- 101710088194 Dehydrogenase Proteins 0.000 claims description 2
- 102000002045 Endothelin Human genes 0.000 claims description 2
- 108050009340 Endothelin Proteins 0.000 claims description 2
- 108010092408 Eosinophil Peroxidase Proteins 0.000 claims description 2
- 102100028471 Eosinophil peroxidase Human genes 0.000 claims description 2
- 108090000386 Fibroblast Growth Factor 1 Proteins 0.000 claims description 2
- 102100031706 Fibroblast growth factor 1 Human genes 0.000 claims description 2
- 201000003741 Gastrointestinal carcinoma Diseases 0.000 claims description 2
- 102000005731 Glucose-6-phosphate isomerase Human genes 0.000 claims description 2
- 108010070600 Glucose-6-phosphate isomerase Proteins 0.000 claims description 2
- 102000017278 Glutaredoxin Human genes 0.000 claims description 2
- 108050005205 Glutaredoxin Proteins 0.000 claims description 2
- 108010063907 Glutathione Reductase Proteins 0.000 claims description 2
- 102100036442 Glutathione reductase, mitochondrial Human genes 0.000 claims description 2
- 108010036164 Glutathione synthase Proteins 0.000 claims description 2
- 102100034294 Glutathione synthetase Human genes 0.000 claims description 2
- 102000005720 Glutathione transferase Human genes 0.000 claims description 2
- 108010070675 Glutathione transferase Proteins 0.000 claims description 2
- 102100029242 Hexokinase-2 Human genes 0.000 claims description 2
- 101710198385 Hexokinase-2 Proteins 0.000 claims description 2
- 102000006947 Histones Human genes 0.000 claims description 2
- 101000691550 Homo sapiens 39S ribosomal protein L13, mitochondrial Proteins 0.000 claims description 2
- 101000755323 Homo sapiens 60S ribosomal protein L10a Proteins 0.000 claims description 2
- 101000681240 Homo sapiens 60S ribosomal protein L13a Proteins 0.000 claims description 2
- 101000752293 Homo sapiens 60S ribosomal protein L18a Proteins 0.000 claims description 2
- 101001115494 Homo sapiens 60S ribosomal protein L23a Proteins 0.000 claims description 2
- 101001110988 Homo sapiens 60S ribosomal protein L35a Proteins 0.000 claims description 2
- 101001127203 Homo sapiens 60S ribosomal protein L36a Proteins 0.000 claims description 2
- 101001092424 Homo sapiens 60S ribosomal protein L37a Proteins 0.000 claims description 2
- 101000853243 Homo sapiens 60S ribosomal protein L7a Proteins 0.000 claims description 2
- 101000753291 Homo sapiens Angiopoietin-1 receptor Proteins 0.000 claims description 2
- 101000882390 Homo sapiens Histone acetyltransferase p300 Proteins 0.000 claims description 2
- 101001134134 Homo sapiens Oxidation resistance protein 1 Proteins 0.000 claims description 2
- 101000980827 Homo sapiens T-cell surface glycoprotein CD1a Proteins 0.000 claims description 2
- 101000738335 Homo sapiens T-cell surface glycoprotein CD3 zeta chain Proteins 0.000 claims description 2
- 101001115218 Homo sapiens Ubiquitin-40S ribosomal protein S27a Proteins 0.000 claims description 2
- 102000016878 Hypoxia-Inducible Factor 1 Human genes 0.000 claims description 2
- 108010028501 Hypoxia-Inducible Factor 1 Proteins 0.000 claims description 2
- 108091054729 IRF family Proteins 0.000 claims description 2
- 102000016854 Interferon Regulatory Factors Human genes 0.000 claims description 2
- 108010002352 Interleukin-1 Proteins 0.000 claims description 2
- 102000011845 Iodide peroxidase Human genes 0.000 claims description 2
- 108010036012 Iodide peroxidase Proteins 0.000 claims description 2
- 102000004856 Lectins Human genes 0.000 claims description 2
- 108090001090 Lectins Proteins 0.000 claims description 2
- 102000016267 Leptin Human genes 0.000 claims description 2
- 108010092277 Leptin Proteins 0.000 claims description 2
- 102000004058 Leukemia inhibitory factor Human genes 0.000 claims description 2
- 108090000581 Leukemia inhibitory factor Proteins 0.000 claims description 2
- 206010058467 Lung neoplasm malignant Diseases 0.000 claims description 2
- 102000047918 Myelin Basic Human genes 0.000 claims description 2
- 101710107068 Myelin basic protein Proteins 0.000 claims description 2
- 102000003896 Myeloperoxidases Human genes 0.000 claims description 2
- 108090000235 Myeloperoxidases Proteins 0.000 claims description 2
- 102100029156 Nucleolar protein of 40 kDa Human genes 0.000 claims description 2
- 101710159068 Nucleolar protein of 40 kDa Proteins 0.000 claims description 2
- 108090000630 Oncostatin M Proteins 0.000 claims description 2
- 102000004140 Oncostatin M Human genes 0.000 claims description 2
- 102100034219 Oxidation resistance protein 1 Human genes 0.000 claims description 2
- 102400000050 Oxytocin Human genes 0.000 claims description 2
- XNOPRXBHLZRZKH-UHFFFAOYSA-N Oxytocin Natural products N1C(=O)C(N)CSSCC(C(=O)N2C(CCC2)C(=O)NC(CC(C)C)C(=O)NCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(CCC(N)=O)NC(=O)C(C(C)CC)NC(=O)C1CC1=CC=C(O)C=C1 XNOPRXBHLZRZKH-UHFFFAOYSA-N 0.000 claims description 2
- 101800000989 Oxytocin Proteins 0.000 claims description 2
- 102000015439 Phospholipases Human genes 0.000 claims description 2
- 108010064785 Phospholipases Proteins 0.000 claims description 2
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 claims description 2
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 claims description 2
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 claims description 2
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 claims description 2
- 102000001938 Plasminogen Activators Human genes 0.000 claims description 2
- 108010001014 Plasminogen Activators Proteins 0.000 claims description 2
- 206010060862 Prostate cancer Diseases 0.000 claims description 2
- 102000003923 Protein Kinase C Human genes 0.000 claims description 2
- 108090000315 Protein Kinase C Proteins 0.000 claims description 2
- 102000002727 Protein Tyrosine Phosphatase Human genes 0.000 claims description 2
- 102000057361 Pseudogenes Human genes 0.000 claims description 2
- 108091008109 Pseudogenes Proteins 0.000 claims description 2
- 102000017143 RNA Polymerase I Human genes 0.000 claims description 2
- 108010013845 RNA Polymerase I Proteins 0.000 claims description 2
- 102000044126 RNA-Binding Proteins Human genes 0.000 claims description 2
- 108700020471 RNA-Binding Proteins Proteins 0.000 claims description 2
- 102000006382 Ribonucleases Human genes 0.000 claims description 2
- 108010083644 Ribonucleases Proteins 0.000 claims description 2
- 101000677219 Schizosaccharomyces pombe (strain 972 / ATCC 24843) 60S ribosomal protein L33-A Proteins 0.000 claims description 2
- 101000691164 Schizosaccharomyces pombe (strain 972 / ATCC 24843) 60S ribosomal protein L43-A Proteins 0.000 claims description 2
- 101000732892 Schizosaccharomyces pombe (strain 972 / ATCC 24843) 60S ribosomal protein L8 Proteins 0.000 claims description 2
- 102100023921 Serine/threonine-protein kinase ATR Human genes 0.000 claims description 2
- 101710178061 Serine/threonine-protein kinase ATR Proteins 0.000 claims description 2
- 102000043912 Sestrin Human genes 0.000 claims description 2
- 208000000453 Skin Neoplasms Diseases 0.000 claims description 2
- 208000005718 Stomach Neoplasms Diseases 0.000 claims description 2
- 108091008874 T cell receptors Proteins 0.000 claims description 2
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 claims description 2
- 102100024219 T-cell surface glycoprotein CD1a Human genes 0.000 claims description 2
- 102100037906 T-cell surface glycoprotein CD3 zeta chain Human genes 0.000 claims description 2
- 102000013090 Thioredoxin-Disulfide Reductase Human genes 0.000 claims description 2
- 108010079911 Thioredoxin-disulfide reductase Proteins 0.000 claims description 2
- 102000002689 Toll-like receptor Human genes 0.000 claims description 2
- 108020000411 Toll-like receptor Proteins 0.000 claims description 2
- 102100023341 Ubiquitin-40S ribosomal protein S27a Human genes 0.000 claims description 2
- 239000012190 activator Substances 0.000 claims description 2
- 210000001072 colon Anatomy 0.000 claims description 2
- 208000029742 colonic neoplasm Diseases 0.000 claims description 2
- ZUBDGKVDJUIMQQ-UBFCDGJISA-N endothelin-1 Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)NC(=O)[C@H]1NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@@H](CC=2C=CC(O)=CC=2)NC(=O)[C@H](C(C)C)NC(=O)[C@H]2CSSC[C@@H](C(N[C@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N2)=O)NC(=O)[C@@H](CO)NC(=O)[C@H](N)CSSC1)C1=CNC=N1 ZUBDGKVDJUIMQQ-UBFCDGJISA-N 0.000 claims description 2
- 238000013467 fragmentation Methods 0.000 claims description 2
- 238000006062 fragmentation reaction Methods 0.000 claims description 2
- 206010017758 gastric cancer Diseases 0.000 claims description 2
- 102000006495 integrins Human genes 0.000 claims description 2
- 108010044426 integrins Proteins 0.000 claims description 2
- 201000002313 intestinal cancer Diseases 0.000 claims description 2
- 239000002523 lectin Substances 0.000 claims description 2
- 229940039781 leptin Drugs 0.000 claims description 2
- NRYBAZVQPHGZNS-ZSOCWYAHSA-N leptin Chemical compound O=C([C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)CCSC)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CS)C(O)=O NRYBAZVQPHGZNS-ZSOCWYAHSA-N 0.000 claims description 2
- 210000000265 leukocyte Anatomy 0.000 claims description 2
- 239000003446 ligand Substances 0.000 claims description 2
- 210000004072 lung Anatomy 0.000 claims description 2
- 201000005202 lung cancer Diseases 0.000 claims description 2
- 208000020816 lung neoplasm Diseases 0.000 claims description 2
- XNOPRXBHLZRZKH-DSZYJQQASA-N oxytocin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CSSC[C@H](N)C(=O)N1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(N)=O)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 XNOPRXBHLZRZKH-DSZYJQQASA-N 0.000 claims description 2
- 229960001723 oxytocin Drugs 0.000 claims description 2
- 229940127126 plasminogen activator Drugs 0.000 claims description 2
- 230000008488 polyadenylation Effects 0.000 claims description 2
- 210000002307 prostate Anatomy 0.000 claims description 2
- 108020000494 protein-tyrosine phosphatase Proteins 0.000 claims description 2
- 108010078070 scavenger receptors Proteins 0.000 claims description 2
- 102000014452 scavenger receptors Human genes 0.000 claims description 2
- 108700020508 sestrin Proteins 0.000 claims description 2
- 210000003491 skin Anatomy 0.000 claims description 2
- 201000000849 skin cancer Diseases 0.000 claims description 2
- 210000002784 stomach Anatomy 0.000 claims description 2
- 201000011549 stomach cancer Diseases 0.000 claims description 2
- 102100024113 40S ribosomal protein S15a Human genes 0.000 claims 1
- 102000006311 Cyclin D1 Human genes 0.000 claims 1
- 102100038885 Histone acetyltransferase p300 Human genes 0.000 claims 1
- 101001118566 Homo sapiens 40S ribosomal protein S15a Proteins 0.000 claims 1
- 102220588437 Keratin, type I cytoskeletal 18_S18A_mutation Human genes 0.000 claims 1
- 102000007999 Nuclear Proteins Human genes 0.000 claims 1
- 108010089610 Nuclear Proteins Proteins 0.000 claims 1
- 230000027455 binding Effects 0.000 description 51
- 238000004458 analytical method Methods 0.000 description 31
- 239000000047 product Substances 0.000 description 24
- 210000004369 blood Anatomy 0.000 description 20
- 239000008280 blood Substances 0.000 description 20
- 239000011159 matrix material Substances 0.000 description 18
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 17
- 238000002790 cross-validation Methods 0.000 description 17
- 150000007523 nucleic acids Chemical class 0.000 description 16
- 238000010238 partial least squares regression Methods 0.000 description 16
- 238000004519 manufacturing process Methods 0.000 description 15
- 102000039446 nucleic acids Human genes 0.000 description 15
- 108020004707 nucleic acids Proteins 0.000 description 15
- 239000003550 marker Substances 0.000 description 14
- 238000013459 approach Methods 0.000 description 13
- 238000009396 hybridization Methods 0.000 description 13
- 239000012528 membrane Substances 0.000 description 13
- 230000003321 amplification Effects 0.000 description 12
- 238000003199 nucleic acid amplification method Methods 0.000 description 12
- 238000003786 synthesis reaction Methods 0.000 description 11
- 108020004635 Complementary DNA Proteins 0.000 description 10
- 230000015572 biosynthetic process Effects 0.000 description 10
- 238000002493 microarray Methods 0.000 description 9
- 108091000080 Phosphotransferase Proteins 0.000 description 8
- 102000020233 phosphotransferase Human genes 0.000 description 8
- 239000013598 vector Substances 0.000 description 8
- 102100032420 Protein S100-A9 Human genes 0.000 description 7
- 238000013145 classification model Methods 0.000 description 7
- 238000001514 detection method Methods 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 7
- 239000012634 fragment Substances 0.000 description 7
- 238000010200 validation analysis Methods 0.000 description 7
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 238000002372 labelling Methods 0.000 description 6
- 230000011664 signaling Effects 0.000 description 6
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 229940088598 enzyme Drugs 0.000 description 5
- 238000011068 loading method Methods 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 238000003752 polymerase chain reaction Methods 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 241000894007 species Species 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- 108010052495 Calgranulin B Proteins 0.000 description 4
- 238000000018 DNA microarray Methods 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 108091093037 Peptide nucleic acid Proteins 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 239000012472 biological sample Substances 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 238000010195 expression analysis Methods 0.000 description 4
- 239000012530 fluid Substances 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 238000010606 normalization Methods 0.000 description 4
- 229910052760 oxygen Inorganic materials 0.000 description 4
- 239000003642 reactive oxygen metabolite Substances 0.000 description 4
- 238000010839 reverse transcription Methods 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- 101150029115 HOPX gene Proteins 0.000 description 3
- 239000004677 Nylon Substances 0.000 description 3
- 101710156990 Protein S100-A9 Proteins 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 230000002159 abnormal effect Effects 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 150000007513 acids Chemical class 0.000 description 3
- 238000003491 array Methods 0.000 description 3
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 3
- 230000008827 biological function Effects 0.000 description 3
- 238000001574 biopsy Methods 0.000 description 3
- 210000004556 brain Anatomy 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 239000007850 fluorescent dye Substances 0.000 description 3
- 239000007789 gas Substances 0.000 description 3
- 238000003384 imaging method Methods 0.000 description 3
- 230000003100 immobilizing effect Effects 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 208000024312 invasive carcinoma Diseases 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 229920001778 nylon Polymers 0.000 description 3
- 239000001301 oxygen Substances 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000004416 surface enhanced Raman spectroscopy Methods 0.000 description 3
- 238000012549 training Methods 0.000 description 3
- ABEXEQSGABRUHS-UHFFFAOYSA-N 16-methylheptadecyl 16-methylheptadecanoate Chemical compound CC(C)CCCCCCCCCCCCCCCOC(=O)CCCCCCCCCCCCCCC(C)C ABEXEQSGABRUHS-UHFFFAOYSA-N 0.000 description 2
- 208000024827 Alzheimer disease Diseases 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 102100030801 Elongation factor 1-alpha 1 Human genes 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 102100020760 Ferritin heavy chain Human genes 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 101000920078 Homo sapiens Elongation factor 1-alpha 1 Proteins 0.000 description 2
- 101001002987 Homo sapiens Ferritin heavy chain Proteins 0.000 description 2
- 208000037396 Intraductal Noninfiltrating Carcinoma Diseases 0.000 description 2
- 241000764238 Isis Species 0.000 description 2
- 241000042032 Petrocephalus catostoma Species 0.000 description 2
- 239000004698 Polyethylene Substances 0.000 description 2
- 239000004793 Polystyrene Substances 0.000 description 2
- 238000012952 Resampling Methods 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 101710119418 Superoxide dismutase [Mn] Proteins 0.000 description 2
- 102100032891 Superoxide dismutase [Mn], mitochondrial Human genes 0.000 description 2
- 101710202572 Superoxide dismutase [Mn], mitochondrial Proteins 0.000 description 2
- 102100040103 Upstream stimulatory factor 2 Human genes 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 239000008186 active pharmaceutical agent Substances 0.000 description 2
- 238000000540 analysis of variance Methods 0.000 description 2
- 238000010420 art technique Methods 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 238000003705 background correction Methods 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000007621 cluster analysis Methods 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 2
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 2
- 238000007405 data analysis Methods 0.000 description 2
- 238000000354 decomposition reaction Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000009795 derivation Methods 0.000 description 2
- 238000012631 diagnostic technique Methods 0.000 description 2
- 208000028715 ductal breast carcinoma in situ Diseases 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 235000013601 eggs Nutrition 0.000 description 2
- 201000003373 familial cold autoinflammatory syndrome 3 Diseases 0.000 description 2
- 125000000524 functional group Chemical group 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 230000003394 haemopoietic effect Effects 0.000 description 2
- 238000005417 image-selected in vivo spectroscopy Methods 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 238000012739 integrated shape imaging system Methods 0.000 description 2
- 229910052742 iron Inorganic materials 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 230000003211 malignant effect Effects 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 238000000491 multivariate analysis Methods 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 230000000704 physical effect Effects 0.000 description 2
- 229920002223 polystyrene Polymers 0.000 description 2
- 238000007639 printing Methods 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 238000000611 regression analysis Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 238000007619 statistical method Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 229910052717 sulfur Inorganic materials 0.000 description 2
- 230000009897 systematic effect Effects 0.000 description 2
- 230000009885 systemic effect Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 210000002700 urine Anatomy 0.000 description 2
- ZRZROXNBKJAOKB-GFVHOAGBSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[2-[[2-[[(2s)-2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]propanoyl]amino]acetyl]amino]acetyl]amino]-3-carboxypropanoyl]amino]propanoyl]amino]-3-hydroxypropanoyl]amino]acetyl]amino]pentanedioic acid Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 ZRZROXNBKJAOKB-GFVHOAGBSA-N 0.000 description 1
- KVUYSIIZYQUVMT-UHFFFAOYSA-N 1,2-diisothiocyanatoethane Chemical compound S=C=NCCN=C=S KVUYSIIZYQUVMT-UHFFFAOYSA-N 0.000 description 1
- PFFIDZXUXFLSSR-UHFFFAOYSA-N 1-methyl-N-[2-(4-methylpentan-2-yl)-3-thienyl]-3-(trifluoromethyl)pyrazole-4-carboxamide Chemical compound S1C=CC(NC(=O)C=2C(=NN(C)C=2)C(F)(F)F)=C1C(C)CC(C)C PFFIDZXUXFLSSR-UHFFFAOYSA-N 0.000 description 1
- BPPVUXSMLBXYGG-UHFFFAOYSA-N 4-[3-(4,5-dihydro-1,2-oxazol-3-yl)-2-methyl-4-methylsulfonylbenzoyl]-2-methyl-1h-pyrazol-3-one Chemical compound CC1=C(C(=O)C=2C(N(C)NC=2)=O)C=CC(S(C)(=O)=O)=C1C1=NOCC1 BPPVUXSMLBXYGG-UHFFFAOYSA-N 0.000 description 1
- 102100023415 40S ribosomal protein S20 Human genes 0.000 description 1
- 102100027271 40S ribosomal protein SA Human genes 0.000 description 1
- FHVDTGUDJYJELY-UHFFFAOYSA-N 6-{[2-carboxy-4,5-dihydroxy-6-(phosphanyloxy)oxan-3-yl]oxy}-4,5-dihydroxy-3-phosphanyloxane-2-carboxylic acid Chemical compound O1C(C(O)=O)C(P)C(O)C(O)C1OC1C(C(O)=O)OC(OP)C(O)C1O FHVDTGUDJYJELY-UHFFFAOYSA-N 0.000 description 1
- ZCYVEMRRCGMTRW-UHFFFAOYSA-N 7553-56-2 Chemical compound [I] ZCYVEMRRCGMTRW-UHFFFAOYSA-N 0.000 description 1
- BSFODEXXVBBYOC-UHFFFAOYSA-N 8-[4-(dimethylamino)butan-2-ylamino]quinolin-6-ol Chemical compound C1=CN=C2C(NC(CCN(C)C)C)=CC(O)=CC2=C1 BSFODEXXVBBYOC-UHFFFAOYSA-N 0.000 description 1
- 241001556567 Acanthamoeba polyphaga mimivirus Species 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 241000219498 Alnus glutinosa Species 0.000 description 1
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonium chloride Substances [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 1
- 102100036013 Antigen-presenting glycoprotein CD1d Human genes 0.000 description 1
- 102100028449 Arginine-glutamic acid dipeptide repeats protein Human genes 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 241000534000 Berula erecta Species 0.000 description 1
- 208000003174 Brain Neoplasms Diseases 0.000 description 1
- 101100501250 Caenorhabditis elegans elo-3 gene Proteins 0.000 description 1
- 101100074328 Caenorhabditis elegans lec-7 gene Proteins 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 241000700198 Cavia Species 0.000 description 1
- 241001091551 Clio Species 0.000 description 1
- 239000004971 Cross linker Substances 0.000 description 1
- 102000001493 Cyclophilins Human genes 0.000 description 1
- 108010068682 Cyclophilins Proteins 0.000 description 1
- 102000000634 Cytochrome c oxidase subunit IV Human genes 0.000 description 1
- 108090000365 Cytochrome-c oxidases Proteins 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 101710096438 DNA-binding protein Proteins 0.000 description 1
- 101150105088 Dele1 gene Proteins 0.000 description 1
- 108010051088 Delta Sleep-Inducing Peptide Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 101100278839 Drosophila melanogaster sw gene Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241001658031 Eris Species 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000238558 Eucarida Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000628997 Flos Species 0.000 description 1
- 102100024165 G1/S-specific cyclin-D1 Human genes 0.000 description 1
- 102100036263 Glutamyl-tRNA(Gln) amidotransferase subunit C, mitochondrial Human genes 0.000 description 1
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000000039 Heat Shock Transcription Factor Human genes 0.000 description 1
- 108050008339 Heat Shock Transcription Factor Proteins 0.000 description 1
- 244000043261 Hevea brasiliensis Species 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 102100028798 Homeodomain-only protein Human genes 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101001114932 Homo sapiens 40S ribosomal protein S20 Proteins 0.000 description 1
- 101000694288 Homo sapiens 40S ribosomal protein SA Proteins 0.000 description 1
- 101000716121 Homo sapiens Antigen-presenting glycoprotein CD1d Proteins 0.000 description 1
- 101001061654 Homo sapiens Arginine-glutamic acid dipeptide repeats protein Proteins 0.000 description 1
- 101001001786 Homo sapiens Glutamyl-tRNA(Gln) amidotransferase subunit C, mitochondrial Proteins 0.000 description 1
- 101001041393 Homo sapiens Serine protease HTRA1 Proteins 0.000 description 1
- 101000633680 Homo sapiens Tetratricopeptide repeat protein 37 Proteins 0.000 description 1
- 101001010792 Homo sapiens Transcriptional regulator ERG Proteins 0.000 description 1
- 101000671649 Homo sapiens Upstream stimulatory factor 2 Proteins 0.000 description 1
- 241000257303 Hymenoptera Species 0.000 description 1
- 206010065042 Immune reconstitution inflammatory syndrome Diseases 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 206010073094 Intraductal proliferative breast lesion Diseases 0.000 description 1
- 102100038609 Lactoperoxidase Human genes 0.000 description 1
- 108010023244 Lactoperoxidase Proteins 0.000 description 1
- 244000211187 Lepidium sativum Species 0.000 description 1
- 235000007849 Lepidium sativum Nutrition 0.000 description 1
- 241000337544 Limnoriidae Species 0.000 description 1
- 229920001410 Microfiber Polymers 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 241000428199 Mustelinae Species 0.000 description 1
- 101100108611 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) alg-8 gene Proteins 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108010068113 Octamer Transcription Factor-6 Proteins 0.000 description 1
- 241001024099 Olla Species 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 102100026458 POU domain, class 3, transcription factor 1 Human genes 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- 241001674048 Phthiraptera Species 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 102100032421 Protein S100-A6 Human genes 0.000 description 1
- 240000004350 Prunus spinosa Species 0.000 description 1
- 235000010829 Prunus spinosa Nutrition 0.000 description 1
- 101150107341 RERE gene Proteins 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 102000009661 Repressor Proteins Human genes 0.000 description 1
- 108010034634 Repressor Proteins Proteins 0.000 description 1
- 102000009738 Ribosomal Protein S6 Kinases Human genes 0.000 description 1
- 108010034782 Ribosomal Protein S6 Kinases Proteins 0.000 description 1
- 108010005260 S100 Calcium Binding Protein A6 Proteins 0.000 description 1
- 102100037658 STING ER exit protein Human genes 0.000 description 1
- 101710198240 STING ER exit protein Proteins 0.000 description 1
- 101000679735 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) 60S ribosomal protein L16-A Proteins 0.000 description 1
- 102100037205 Sal-like protein 2 Human genes 0.000 description 1
- 101710192308 Sal-like protein 2 Proteins 0.000 description 1
- 235000017276 Salvia Nutrition 0.000 description 1
- 241001072909 Salvia Species 0.000 description 1
- 102100021119 Serine protease HTRA1 Human genes 0.000 description 1
- 102220580081 Serrate RNA effector molecule homolog_S18D_mutation Human genes 0.000 description 1
- 241000033695 Sige Species 0.000 description 1
- 102000011990 Sirtuin Human genes 0.000 description 1
- 108050002485 Sirtuin Proteins 0.000 description 1
- 102100030584 Small integral membrane protein 1 Human genes 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 238000000692 Student's t-test Methods 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 206010051379 Systemic Inflammatory Response Syndrome Diseases 0.000 description 1
- 239000004809 Teflon Substances 0.000 description 1
- 229920006362 Teflon® Polymers 0.000 description 1
- 102100030784 Telomeric repeat-binding factor 2 Human genes 0.000 description 1
- 108050002561 Telomeric repeat-binding factor 2 Proteins 0.000 description 1
- 241000897276 Termes Species 0.000 description 1
- 102100029210 Tetratricopeptide repeat protein 37 Human genes 0.000 description 1
- 241000863032 Trieres Species 0.000 description 1
- 241001504505 Troglodytes troglodytes Species 0.000 description 1
- 102100033649 Tumor necrosis factor alpha-induced protein 8 Human genes 0.000 description 1
- 101710098767 Tumor necrosis factor alpha-induced protein 8 Proteins 0.000 description 1
- 101710184190 Upstream stimulatory factor 2 Proteins 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 229940072056 alginate Drugs 0.000 description 1
- 229920000615 alginic acid Polymers 0.000 description 1
- 235000010443 alginic acid Nutrition 0.000 description 1
- 239000001132 aluminium potassium sulphate Substances 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 239000002870 angiogenesis inducing agent Substances 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 239000003146 anticoagulant agent Substances 0.000 description 1
- 229940127219 anticoagulant drug Drugs 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 235000013405 beer Nutrition 0.000 description 1
- 210000000941 bile Anatomy 0.000 description 1
- 230000008499 blood brain barrier function Effects 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 210000001218 blood-brain barrier Anatomy 0.000 description 1
- 210000004958 brain cell Anatomy 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000004281 calcium formate Substances 0.000 description 1
- 239000000378 calcium silicate Substances 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 235000009120 camo Nutrition 0.000 description 1
- 244000213578 camo Species 0.000 description 1
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 1
- 229960003669 carbenicillin Drugs 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 229920002301 cellulose acetate Polymers 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 239000013256 coordination polymer Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 230000001351 cycling effect Effects 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000012502 diagnostic product Substances 0.000 description 1
- 238000012850 discrimination method Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 201000007273 ductal carcinoma in situ Diseases 0.000 description 1
- 230000008846 dynamic interplay Effects 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 238000002571 electroretinography Methods 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000013401 experimental design Methods 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 239000004225 ferrous lactate Substances 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 235000021474 generally recognized As safe (food) Nutrition 0.000 description 1
- 235000021473 generally recognized as safe (food ingredients) Nutrition 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000008240 homogeneous mixture Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 238000010191 image analysis Methods 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 201000004933 in situ carcinoma Diseases 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 230000028709 inflammatory response Effects 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- 206010073095 invasive ductal breast carcinoma Diseases 0.000 description 1
- 201000010985 invasive ductal carcinoma Diseases 0.000 description 1
- 206010073096 invasive lobular breast carcinoma Diseases 0.000 description 1
- 239000011630 iodine Substances 0.000 description 1
- 229910052740 iodine Inorganic materials 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 229940057428 lactoperoxidase Drugs 0.000 description 1
- 239000004816 latex Substances 0.000 description 1
- 229920000126 latex Polymers 0.000 description 1
- 244000144972 livestock Species 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 238000009607 mammography Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000004066 metabolic change Effects 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 229910052618 mica group Inorganic materials 0.000 description 1
- 238000010208 microarray analysis Methods 0.000 description 1
- 239000003658 microfiber Substances 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 238000005312 nonlinear dynamic Methods 0.000 description 1
- 238000001543 one-way ANOVA Methods 0.000 description 1
- 238000007410 oral glucose tolerance test Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 210000002826 placenta Anatomy 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 229920000915 polyvinyl chloride Polymers 0.000 description 1
- 239000004800 polyvinyl chloride Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 230000002685 pulmonary effect Effects 0.000 description 1
- 238000010791 quenching Methods 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 238000000163 radioactive labelling Methods 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000000284 resting effect Effects 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 239000011343 solid material Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 229910021653 sulphate ion Inorganic materials 0.000 description 1
- 238000012706 support-vector machine Methods 0.000 description 1
- 238000011477 surgical intervention Methods 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 229920005613 synthetic organic polymer Polymers 0.000 description 1
- 238000012353 t test Methods 0.000 description 1
- 210000003411 telomere Anatomy 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 125000002088 tosyl group Chemical group [H]C1=C([H])C(=C([H])C([H])=C1C([H])([H])[H])S(*)(=O)=O 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 208000007004 trichohepatoenteric syndrome Diseases 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 238000012285 ultrasound imaging Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/112—Disease subtyping, staging or classification
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10T—TECHNICAL SUBJECTS COVERED BY FORMER US CLASSIFICATION
- Y10T436/00—Chemistry: analytical and immunological testing
- Y10T436/10—Composition for standardization, calibration, simulation, stabilization, preparation or preservation; processes of use in preparation for chemical testing
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Engineering & Computer Science (AREA)
- Immunology (AREA)
- Pathology (AREA)
- Analytical Chemistry (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Physics & Mathematics (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Hospice & Palliative Care (AREA)
- Biophysics (AREA)
- Oncology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Description
OLIGONUCLEOTIDES FOR CANCER DIAGNOSIS
The present invention relates to oligonucleotide probes, for use in assessing gene transcript levels in a cell, which may be used in analytical techniques, particularly diagnostic techniques. Conveniently the probes are provided in kit form. Different sets of probes may be used in techniques to prepare gene expression patterns and identify, diagnose or monitor different cancers or stages thereof.
The identification of quick and easy methods of sample analysis for, for example, diagnostic applications, remains the goal of many researchers. End users seek methods which are cost effective, produce statistically significant results and which may be implemented routinely without the need for highly skilled individuals.
The analysis of gene expression within cells has been used to provide information on the state of those cells and importantly the state of the individual from which the cells are derived. The relative expression of various genes in a cell has been identified as reflecting a particular state within a body. For example, cancer cells are known to exhibit altered expression of various proteins and the transcripts or the expressed proteins may therefore be used as markers of that disease state.
Thus biopsy tissue may be analysed for the presence of these markers and cells originating from the site of the disease may be identified in other tissues or fluids of the body by the presence of the markers. Furthermore, products of the altered expression may be released into the blood stream and these products may be analysed. In addition cells which have contacted disease cells may be affected by their direct contact with those cells resulting in altered gene expression and their expression or products of expression may be similarly analysed.
However, there are some limitations with these methods. For example, the use of specific tumour markers for identifying cancer suffers from a variety’ of defects, such as lack of specificity or sensitivity, association of the marker with disease states besides the specific type of cancer, and difficulty of detection in asymptomatic individuals.
In addition to the analysis of one or two marker transcripts or proteins, more recently, gene expression patterns have been analysed. Most of the work involving large-scale gene expression analysis with implications in disease diagnosis has involved clinical samples originating from diseased tissues or cells. For example, several recent publications, which demonstrate that gene expression data can be used to distinguish between similar cancer types, have used clinical samples from diseased tissues or cells (Alon et al. 1999, PNAS, 96, p6745-6750; Golub et al. 1999, Science, 286, p531 -537,; Alizadeh et al, 2000,
Nature, 403, p503-511; Bittner et al., 2000, Nature, 406, p536-540).
However, these methods have relied on analysis of a sample containing diseased cells or products of those cells or cells which have been contacted by disease cells. Analysis of such samples relies on knowledge of the presence ofa disease and its location, which may be difficult in asymptomatic patients.
Furthermore, samples can not always be taken from the disease site, e.g. in diseases of the brain.
In a finding of great significance, the present inventors identified the previously untapped potential of all cells within a body to provide information relating to the state of the organism from which the cells were derived.
W098/49342 describes the analysis of the gene expression of cells distant from the site of disease, e.g. peripheral blood collected distant from a cancer site.
PCT/GB03/005102, incorporated herein by reference, describes specific probes for the diagnosis of breast cancer and Alzheimer's disease and discusses protocols for identifying other appropriate probes for that purpose and for diagnosing other diseases.
This finding is based on the premise that the different parts of an organism's body exist in dynamic interaction with each other. When a disease affects one part of the body, other parts of the body are also affected. The interaction results from a wide spectrum of biochemical signals that are released from the diseased area, affecting other areas in the body. Although, the nature of the biochemical and physiological changes induced by the released signals can vary in the different body parts, the changes can be measured at the level of gene expression and used for diagnostic purposes.
The physiological state of a cell in an organism is determined by the pattern with which genes are expressed in it. The pattern depends upon the internal and external biological stimuli to which said cell is exposed, and any change either in the extent or in the nature of these stimuli can lead to a change in the pattern with which the different genes are expressed in the cell. There is a growing understanding that by analysing the systemic changes in gene expression patterns in cells in biological samples, it is possible to provide information on the type and nature of the biological stimuli that are acting on them. Thus, for example, by monitoring the expression of a large number of genes in cells in a test sample, it is possible to determine whether their genes are expressed with a pattern characteristic for a particular disease, condition or stage thereof. Measuring changes in gene activities in cells, e.g. from tissue or body fluids is therefore emerging as a powerful tool for disease diagnosis.
Such methods have various advantages. Often, obtaining clinical : samples from certain areas in the body that is diseased can be difficult and may involve undesirable invasions in the body, for example biopsy is often used to obtain samples for cancer. In some cases, such as in Alzheimer's disease the diseased brain specimen can only be obtained post-mortem. Furthermore, the tissue specimens which are obtained are often heterogeneous and may contain a mixture of both diseased and non-diseased cells, making the analysis of generated gene expression data both complex and difficult.
It has been suggested that a pool of tumour tissues that appear to be pathogenetically homogeneous with respect to morphological appearances of the tumour may well be highly heterogeneous at the molecular level (Alizadeh, 2000, supra), and in fact might contain tumours representing essentially different diseases (Alizadeh, 2000, supra; Golub, 1999, supra). For the purpose of identifying a disease, condition, or a stage thereof, any method that does not : require clinical samples to originate directly from diseased tissues or cells is highly desirable since clinical samples representing a homogeneous mixture of cell types can be obtained from an easily accessible region in the body.
We have now identified a family of sequences which allow the derivation of a set of probes of surprising utility for identifying cancer, particularly breast cancer. Thus, we now describe families of genes whose expression is altered in "the cells of blood samples from cancer patients, which may be used to generate probes for use in methods of identifying, diagnosing or monitoring cancer or stages thereof.
In work leading up to this invention, the inventors examined the level of expression of a large number of genes in cancer patients relative to normal patients. Not only were a large number of genes found to exhibit altered expression, but, in addition, those which exhibited altered expression were found to fall within discrete families of genes, by virtue of their function. As such these genes provide a pool from which corresponding probes may be generated which can be used collectively to generate a fingerprint of the expression of these genes in an individual. Since the expression of these genes is altered in the cancer individual, and may hence be considered informative for that state, the generated fingerprint from the collection of probes is indicative of the disease relative to the normal state.
The families of genes that have been identified as being differentially expressed in cancer patients may be summarized as follows: (i) genes encoding proteins involved in protein synthesis and/or stability; (ii) genes encoding proteins involved in the regulation of defence and/or chromatin remodelling.
Family (i) includes: (a) genes encoding ribosomal proteins and ribosomal activation proteins (ie. proteins comprising components of ribosomal proteins or involved in modification of their function and are found to be down-regulated in cancer patients). These encoded proteins include ribosomal proteins L1-
L56, L7A, L10A, L13A, L18A, L23A, L27A, L35A, L36A, L37A, PO,
P1, P2, S2-S29, S31, S33-S36, S3A, S154, S18A, S18B, S18C, S27A, 63, 115 (and pseudogenes), ribosomal protein kinases (e.g. S6 kinase), ribonucleases, putative S1 RNA binding domain protein, eukaryotic translation initiation factors and guanine nucleotide binding protein G;
(b) genes encoding translation inhibition and initiation factors (ie. proteins involved in the translation of mRNA to a protein product and are found to be down-regulated in cancer patients). These encoded proteins include eukaryotic translation elongation factors, tRNA synthetases, 5 RNA binding proteins, polyadenylation element binding proteins, tyrosine phosphatases, eukaryotic translation initiation factors, and RNA polymerase I, III transcription factors; (c) genes encoding other modulators of transcription or translation such as cyclin D-type binding protein and guanine nucleotide binding protein.
Family (ii) includes: (a) genes encoding immune response related proteins (ie. proteins which are up-regulated in response to immune stimulation, and which include proteins upregulated in response to inflammation or in generating an inflammatory response, and are found to be up-regulated in cancer patients). These encoded proteins include T-cell receptor and associated components, e.g. protein kinases, various cytokines, including the interleukins and their receptors (such as IL-1, 2, 3,4, 5,6, 7,8, 9, 10, 11, 12, 13, 15, 17, 18, 20, 22, 24), tumour necrosis factor and its receptor and its superfamily (e.g. TNF superfamily members 2,3, 4,5,6,7, 8,9, 11, 12, 13, 14, 15), interferon regulatory factors, oncostatin M, Leukemia inhibitory factor, chemokine ligand and receptor family (e.g. numbers 1- 28), complement components, interferon stimulated factors such as transcription factors, MHC (e.g HLA) class I or II (or related components) (e.g. DQ, DR, DO, DP, DM alpha or beta), adhesion proteins (e.g. CD1A, CDIC, CD1D, CD3Z, 6, 8, 11, 14, 18, 24, 27, 28, 29, 40, 44, 50, 54, 59, 74, 79B, 80, 81, 83, 86, 96, ICAM), nuclear factor of kappa polypeptide gene enhancer in B-cells, myelin basic protein, cathepsin, toll-like receptor, proteosome subunits, ferritin, protein kinases or phosphatases as well as their activators and inhibitors, leukocyte immunoglobulin-like receptor, immunoglobulin components, e.g. heavy chain or Fc fragments, e.g. of IgG, IgE or IgA or their superfamily, defensin, oxytocin, S100 calcium binding protein, lectin and its receptor and superfamily, leptin, phospholipase and growth factors (such as endothelial cell growth factor or erythropoietin); (b) genes encoding TNF-induced proteins (ie. proteins which are induced in an individual in response to exposure to TNF and are found to be up- regulated in cancer patients). These encoded proteins include TNF alpha-induced protein 8, integrin, inhibitor of kappa light polypeptide gene enhancer in B-cells, TNF-associated factor 2, 5, nuclear factor of kappa light polypeptide gene enhancer in B-cells, MAP kinases, protein kinase C, ubiquitous kinase, cadherin, caspase, cyclin D1, superoxide dismutase and interleukins; (c) genes encoding hypoxia-induced proteins (ie. proteins which are induced when the individual or a part thereof is in a state of hypoxia and are found to be up-regulated in cancer patients). These encoded proteins include sestrin, B1A binding protein p300, endothelin, ataxia telangiectasia and Rad3 related protein, hexokinase 2, TEK tyrosine kinase, DNA. fragmentation factor, caspase, plasminogen activator, hypoxia-inducible factor 1 and glucose phosphate isomerase; (d) genes encoding oxidative stress proteins (ie. proteins which are induced in an individual or part thereof under oxidative stress and are found to be up-regulated in cancer patients). These encoded proteins include superoxide dismutase, glutathione synthetase, catalase, lactoperoxidase, thyroid peroxidase, myeloperoxidase, eosinophil peroxidase, oxidation resistance 1, peroxiredoxin, cytochrome P450, scavenger receptor, paraoxonase, glutathione reductase, NAD(P)H dehydrogenase, glutathione S-transferase, catenin, glutaredoxin, heat shock proteins (such as heat shock transcription factors), mitogen- activated protein kinases, enolase, thioredoxin reductase and peroxiredoxin; (e) genes encoding proteins involved in chromatin remodelling (ie. proteins which are instrumental in maintaining or modifying chromatin structure and may be essential for gene regulation). These encoded proteins include histone replacement proteins, e.g. H3.3A or H3.3B family.
Appropriate gene sequences falling within the families described above may be identified by interrogation of appropriate databases using as keywords the family name, e.g. "immune response" on gene or protein databases at
National Centre for Biotechnology Information, Norway. For confirmation of the utility of such gene sequences for the development of oligonucleotides for the tests described herein, the expression of a particular gene sequence may be assessed in a test cancer patient versus a normal patient. Variation in expression above or below control levels is indicative of the utility of the sequence for probe derivation.
Generally the genes encoding the above (i) families are down-regulated in cancer versus normal patients and in the case of (ii) families the encoding genes are up-regulated.
It is speculated that in cancer patients the systematic decreased expression of genes involved in ribosome production and translation control may indicate that blood cells are responding to a new condition in those patients by decreasing the rate of protein synthesis which may be a cellular adaption to an environment of low oxygen and energy deficiency. This is supported by the observation that genes involved in defence against reactive oxygen species (ROS) such as MnSOD and ferritin are upregulated in cancer samples. Low erythropoietin may explain the low oxygen levels in cancer patients. TNF activation is also believed to be a route for the changes in the families of genes described above since TNF is known to up regulate expression of e.g. ferritin, defensin, MnSOD and calgranulin B. TNF also inhibits BPO production which can itself cause a low oxygen condition in the blood environment. Hypoxia is "Known to induce TNF levels. These changes may be triggered by angiogenic factors entering the bloodstream. Although not wishing to be bound by theory, the hypothesis underlying the above described effects is shown in Figure 1.
Thus the invention provides a set of oligonucleotide probes which correspond to genes in a cell whose expression is affected in a pattern characteristic of a particular cancer or stage of, wherein said genes are systemically affected by said cancer or stage thereof. Preferably said genes are constitutively moderately or highly expressed. Preferably the genes are moderately or highly expressed in the cells of the sample but not in cells from disease cells or in cells having contacted such disease cells.
Such probes, particularly when isolated from cells distant to the site of disease, do not rely on the development of disease to clinically recognizable levels and allow detection of a cancer or stage thereof very early after the onset of said cancer, even years before other subjective or objective symptoms appear.
As used herein "systemically" affected genes refers to genes whose expression is affected in the body without direct contact with a disease cell or disease site and the cells under investigation are not disease cells. "Contact" as referred to herein refers to cells coming into close proximity with one another such that the direct effect of one cell on the other may be observed, e.g. an immune response, wherein these responses are not mediated by secondary molecules released from the first cell over a large distance to affect the second cell. Preferably contact refers to physical contact, or contact that is as close as is sterically possible, conveniently, cells which contact one another are found in the same unit volume, for example within 1cm’.
A "disease cell” is a cell manifesting phenotypic changes and is present at the disease site at some time during its life-span, e.g. a tumour cell at the tumour site or which has disseminated from the tumour, or a brain cell in the case of cancer of the brain. "Moderately or highly" expressed genes refers to those present in resting cells in a copy number of more than 30-100 copies/cell (assuming an average 3x10° mRNA molecules in a cell).
Specific probes having the above described properties are provided "herein.
Thus in one aspect, the present invention provides a set of oligonucleotide probes, wherein said set comprises at least 10 oligonucleotides selected from: an oligonucleotide corresponding to a gene sequence from family (i) or (ii) as defined hereinbefore or derived from such a sequence, or an
S oligonucleotide with a complementary sequence, or a functionally equivalent oligonucleotide.
The invention further provides a method of preparing a set of oligonucleotides for use in the methods described herein, comprising the step of selecting one or more oligonucleotides corresponding to a gene sequence from family (i) and one or more oligonucleotides corresponding to a gene sequence from family (ii). Preferably more than 1 oligonucleotides is selected from each family (e.g. from different sub-families) and the selected oligonucleotides are from preferred genes as described herein.
The invention also provides one or more oligonucleotide probes, wherein each oligonucleotide probe is selected from the oligonucleotides listed in Table 2, 3 or 4 (e.g. from Table 2) or derived from a sequence described in Table 2, 3 or 4, or a complementary sequence thereof. Said derived oligonucleotides include oligonucleotides derived from the genes corresponding to the sequences provided in those tables, e.g. the genes set forth in Tables 2, 5 or 6 (see the
Accession numbers), or the complementary sequences thereof. The use of such probes in products and methods of the invention, form further aspects of the invention.
As referred to herein an "oligonucleotide" is a nucleic acid molecule having at least 6 monomers in the polymeric structure, ie. nucleotides or modified forms thereof The nucleic acid molecule may be DNA, RNA or PNA (peptide nucleic acid) or hybrids thereof or modified versions thereof, e.g. chemically modified forms, e.g. LNA (Locked Nucleic acid), by methylation or made up of modified or non-natural bases during synthesis, providing they retain their ability to bind to complementary sequences. Such oligonucleotides are used in accordance with the invention to probe target sequences and are thus "referred to herein also as oligonucleotide probes or simply as probes.
An oligonucleotide corresponding to a gene sequence from family (i) or (ii) refers to an oligonucleotide corresponding to all or a part of said gene sequence or its transcript. When a part of the gene sequence is used, it satisfies the requirements of the oligonucleotide probes as described herein, e.g. in length and function. Preferably said parts have the size described hereinafter. Said oligonucleotide is referred to hereinafter as the primary oligonucleotide. A derived oligonucleotide refers to an oligonucleotide which is a part of the primary oligonucleotide but satisfies the requirements for probes as described herein. ) Preferably the oligonucleotide probes forming said set are at least 15 bases in length to allow binding of target molecules. Especially preferably said oligonucleotide probes are from 20 to 200 bases in length, e.g. from 30 to 150 bases, preferably 50-100 bases in length. . As referred to herein the term "complementary sequences" refers to 10 sequences with consecutive complementary bases (ie. T:A, G:C) and which complementary sequences are therefore able to bind to one another through their complementarity.
Reference to "10 oligonucleotides” refers to 10 different oligonucleotides. Whilst an oligonucleotide from a gene sequence family as described herein, a derived oligonucleotide and their functional equivalent are considered different oligonucleotides, complementary oligonucleotides are not considered different. Preferably however, the at least 10 oligonucleotides correspond to 10 different gene sequences within the described gene sequence families (or derived oligonucleotides or their functional equivalents). Thus said 10 different oligonucleotides are preferably able to bind to 10 different transcripts.
Preferably the at least 10 oligonucleotides are made up of a combination of oligonucleotides from family (i) and (ii), e.g. 5 oligonucleotides from each family may be used, or 4 from one family and 6 from the other family. This advantageously allows the use of genes which are up and down-regulated in cancer relative to normal patients. Conveniently, one or more oligonucleotides "from different sub-families may be used, e.g. 2 probes each from (i)a, (Db, (ic, (ii)a and (ii)b. Especially preferably said set of oligonucleotides includes oligonucleotides from family (i)a, (ii)a and (ii)e. 30 . Preferred proteins encoded by family (i)a genes are ribosomal proteins and preferably each set includes an oligonucleotide from a gene encoding such a protein.
Preferred immune response proteins encoded by family (ii)a genes include adhesion proteins, interleukins their receptors and superfamily, TNF its receptor and superfamily, immunoglobulin components and erythropoietin.
Particularly preferably said set includes oligonucleotides from genes encoding one or more ribosomal proteins and optionally one or more histones and optionally ferritin.
Preferably said oligonucleotides are as described in Table 2 or 3 or are derived from a sequence described in Table 2 or 3, e.g. as described in Table 2.
Said set may additionally comprise one or more oligonucleotide probes listed in
Table 4, or derived from a sequence described in Table 4, or a complementary sequence thereof. Said derived oligonucleotides include oligonucleotides derived from the genes corresponding to the sequences provided in those tables, e.g. the genes set forth in Tables 2, 5 or 6 (see the Accession numbers), or the complementary sequences thereof.
A "set" as described refers to a collection of unique oligonucleotide probes (ie. having a distinct sequence) and preferably consists of less than 1000 oligonucleotide probes, especially less than 500 probes, e.g. preferably from 10 to 500, e.g. 10 to 100, 200 or 300, especially preferably 20 to 100, e.g. 30 to 100 probes. In some cases less than 10 probes may be used, e.g. from 2 to 9 probes, e.g. 5 to 9 probes.
Tt will be appreciated that increasing the number of probes will prevent the possibility of poor analysis, €.8. misdiagnosis by comparison to other diseases which could similarly alter the expression of the particular genes in question. Other oligonucleotide probes not described herein may also be present, particularly if they aid the ultimate use of the set of oligonucleotide probes. However, preferably said set consists only of the oligonucleotides described herein, or a sub-set thereof (e.g. of the size as described above).
Multiple copies of each unique oligonucleotide probe, e.g. 10 or more copies, may be present in each set, but constitute only a single probe.
A set of oligonucleotide probes, which may preferably be immobilized on a solid support or have means for such immobilization, comprises the at least 10 oligonucleotide probes selected from those described hereinbefore. As mentioned above, these 10 probes must be unique and have different sequences.
Having said this however, two separate probes may be used which recognize the same gene but reflect different splicing events. However oligonucleotide probes which are complementary to, and bind to distinct genes are preferred.
As described herein a "functionally equivalent" or derived oligonucleotide refers to an oligonucleotide which is capable of identifying the same gene as an oligonucleotide from a sequence in the gene sequence families described herein ie. it can bind to the same mRNA molecule (or DNA) transcribed from a gene (target nucleic acid molecule) as the primary oligonucleotide or the derived oligonucleotide (or its complementary sequence).
Thus in a preferred feature said derived or functionally equivalent oligonucleotide is a part of a gene sequence as defined in Table 2, 5 or 6, or the complementary sequence thereof. Preferably said functionally equivalent oligonucleotide is capable of recognizing, ie. binding to the same splicing product as a primary oligonucleotide or a derived oligonucleotide. Preferably said mRNA molecule is the full length mRNA molecule which corresponds to the primary oligonucleotide or the derived oligonucleotide.
As referred to herein "capable of binding" or "binding" refers to the ability to hybridize under conditions described hereinafter.
Alternatively expressed, functionally equivalent oligonucleotides (or complementary sequences) have sequence identity or will hybridize, as described hereinafter, to a region of the target molecule to which molecule a primary oligonucleotide or a derived oligonucleotide or a complementary oligonucleotide binds. Preferably, functionally equivalent oligonucleotides (or their complementary sequences) hybridize to one of the mRNA sequences which corresponds to a primary oligonucleotide or a derived oligonucleotide under the conditions described hereinafter or has sequence identity to a part of one of the mRNA sequences which corresponds to a primary oligonucleotide or a derived oligonucleotide. A "part" in this context refers to a stretch of at least 5, e.g. at least 10 or 20 bases, such as from 5 to 100, e.g. 10 to 50 or 15 to 30 bases.
In a particularly preferred aspect, the functionally equivalent oligonucleotide binds to all or a part of the region of a target nucleic acid molecule (mRNA or cDNA) to which the primary oligonucleotide or derived oligonucleotide binds. A "target" nucleic acid molecule is the gene transcript or related product e.g. mRNA, or cDNA, or amplified product thereof. Said "region" of said target molecule to which said primary oligonucleotide or derived oligonucleotide binds is the stretch over which complementarity exists.
At its largest this region is the whole length of the primary oligonucleotide or derived oligonucleotide, but may be shorter if the entire primary sequence or derived oligonucleotide is not complementary to a region of the target sequence.
Preferably said part of said region of said target molecule is a stretch of at least 5, e.g. at least 10 or 20 bases, such as from 5 to 100, e.g. 10to 50 or 15 to 30 bases. This may for example be achieved by said functionally equivalent oligonucleotide having several identical bases to the bases of the primary oligonucleotide or the derived oligonucleotide. These bases may be identical over consecutive stretches, e.g. in a part of the functionally equivalent oligonucleotide, or may be present non-consecutively, but provide sufficient complementarity to allow binding to the target sequence.
Thus in a preferred feature, said functionally equivalent oligonucleotide hybridizes under conditions of high stringency to a primary oligonucleotide or a derived oligonucleotide or the complementary sequence thereof. Alternatively expressed, said functionally equivalent oligonucleotide exhibits high sequence identity to all or part of a primary oligonucleotide. Preferably said functionally equivalent oligonucleotide has at least 70% sequence identity, preferably at least 80%, e.g. at least 90, 95, 98 or 99%, to all of a primary oligonucleotide or a part thereof. As used in this context, a "part" refers to a stretch of at least 5, e.g. at least 10 or 20 bases, such as from 5 to 100, e.g. 10 to 50 or 15to 30 bases, in said primary oligonucleotide. Bspecially preferably when sequence identity to only a part of said primary oligonucleotide is present, the sequence identity is high, e.g. at least 80% as described above.
Functionally equivalent oligonucleotides which satisfy the above stated functional requirements include those which are derived from the primary oligonucleotides and also those which have been modified by single or multiple nucleotide base (or equivalent) substitution, addition and/or deletion, but which nonetheless retain functional activity, e.g. bind to the same target molecule as the primary oligonucleotide or the derived oligonucleotide from which they are further derived or modified. Preferably said modification is of from 1 to 50, e.g. from 10 to 30, preferably from 1 to 5 bases. Especially preferably only minor modifications are present, e.g. variations in less than 10 bases, e.g. less than 5 base changes.
Within the meaning of "addition" equivalents are included oligonucleotides containing additional sequences which are complementary to the consecutive stretch of bases on the target molecule to which the primary oligonucleotide or the derived oligonucleotide binds. Alternatively the addition may comprise a different, unrelated sequence, which may for example confer a further property, e.g. to provide a means for immobilization such as a linker to bind the oligonucleotide probe to a solid support.
Particularly preferred are naturally occurring equivalents such as biological variants, e.g. allelic, geographical or allotypic variants, e.g. oligonucleotides which correspond to a genetic variant, for example as present in a different species.
Functional equivalents include oligonucleotides with modified bases, e.g. using non-naturally occurring bases. Such derivatives may be prepared during synthesis or by post production modification. “Hybridizing" sequences which bind under conditions of low stringency are those which bind under non-stringent conditions (for example, 6x SSC/50% formamide at room temperature) and remain bound when washed under conditions of low stringency (2 X SSC, room temperature, more preferably 2 X
SSC, 42°C). Hybridizing under high stringency refers to the above conditions in which washing is performed at 2 X SSC, 65°C (where SSC = 0.15M NaCl, " 0.015M sodium citrate, pH 7.2). “Sequence identity" as referred to herein refers to the value obtained when assessed using ClustalW (Thompson et al., 1994, Nucl. Acids Res., 22, p4673-4680) with the following parameters:
Pairwise alignment parameters - Method: accurate, Matrix: TUB, Gap open penalty: 15.00, Gap extension penalty: 6.66;
Multiple alignment parameters - Matrix: TUB, Gap open penalty: 15.00, % identity for delay: 30, Negative matrix: no, Gap extension penalty: 6.66, DNA transitions weighting: 0.5.
Sequence identity at a particular base is intended to include identical bases which have simply been derivatized.
The invention also extends to polypeptides encoded by the mRNA sequence to which a Table 2, 3 or 4 oligonucleotide or a Table 2, 3 or 4 derived oligonucleotide (e.g. having a sequence as defined in Table 2, Sor6 ora complementary sequence thereto) binds. The invention further extends to antibodies which bind to any of said polypeptides.
As described above, conveniently said set of oligonucleotide probes may be immobilized on one or more solid supports. Single or preferably multiple copies of each unique probe are attached to said solid supports, e.g. 10 or more, e.g. at least 100 copies of each unique probe are present.
One or more unique oligonucleotide probes may be associated with separate solid supports which together form a set of probes immobilized on multiple solid support, e.g. one or more unique probes may be immobilized on multiple beads, membranes, filters, biochips etc. which together form a set of probes, which together form modules of the kit described hereinafter. The solid support of the different modules are conveniently physically associated although the signals associated with each probe (generated as described hereinafter) must be separately determinable. Alternatively, the probes may be immobilized on discrete portions of the same solid support, e.g. each unique oligonucleotide probe, e.g. in multiple copies, may be immobilized fo a distinct and discrete portion or region of a single filter or membrane, e.g. to generate an array.
A combination of such techniques may also be used, e.g. several solid supports may be used which each immobilize several unique probes.
The expression "solid support" shall mean any solid material able to bind oligonucleotides by hydrophobic, ionic or covalent bridges. "Immobilization" as used herein refers to reversible or irreversible association of the probes to said solid support by virtue of such binding. If reversible, the probes remain associated with the solid support for a time sufficient for methods of the invention to be carried out.
Numerous solid supports suitable as immobilizing moieties according to the invention, are well known in the art and widely described in the literature and generally speaking, the solid support may be any of the well-known supports or matrices which are currently widely used or proposed for immobilization, separation etc. in chemical or biochemical procedures. Such materials include, but are not limited to, any synthetic organic polymer such as polystyrene, polyvinylchloride, polyethylene; or nitrocellulose and cellulose acetate; or tosyl activated surfaces; or glass or nylon or any surface carrying a group suited for covalent coupling of nucleic acids. The immobilizing moieties may take the form of particles, sheets, gels, filters, membranes, microfibre strips, tubes or plates, fibres or capillaries, made for example of a polymeric material e.g. agarose, cellulose, alginate, teflon, latex or polystyrene or magnetic beads. Solid supports allowing the presentation of an array, preferably in a single dimension are preferred, e.g. sheets, filters, membranes, plates or biochips.
Attachment of the nucleic acid molecules to the solid support may be performed directly or indirectly. For example if a filter is used, attachment may be performed by UV-induced crosslinking. Alternatively, attachment may be performed indirectly by the use of an attachment moiety carried on the oligonucleotide probes and/or solid support. Thus for example, a pair of affinity binding partners may be used, such as avidin, streptavidin or biotin, DNA or
DNA binding protein (e.g. either the lac I repressor protein or the lac operator sequence to which it binds), antibodies (which may be mono- or polyclonal), antibody fragments or the epitopes or haptens of antibodies. In these cases, one partner of the binding pair is attached to (or is inherently part of) the solid support and the other partner is attached to (or is inherently part of) the nucleic acid molecules.
As used herein an "affinity binding pair" refers to two components which recognize and bind to one another specifically (ic. in preference to binding to other molecules). Such binding pairs when bound together form a complex.
Attachment of appropriate functional groups to the solid support may be performed by methods well known in the art, which include for example, attachment through hydroxyl, carboxyl, aldehyde or amino groups which may be provided by treating the solid support to provide suitable surface coatings. Solid supports presenting appropriate moieties for attachment of the binding partner may be produced by routine methods known in the art.
Attachment of appropriate functional groups to the oligonucleotide probes of the invention may be performed by ligation or introduced during synthesis or amplification, for example using primers carrying an appropriate moiety, such as biotin or a particular sequence for capture.
Conveniently, the set of probes described hereinbefore is provided in kit form.
Thus viewed from a further aspect the present invention provides a kit comprising a set of oligonucleotide probes as described hereinbefore immobilized on one or more solid supports.
Preferably, said probes are immobilized on a single solid support and each unique probe is attached to a different region of said solid support.
However, when attached to multiple solid supports, said multiple solid supports form the modules which make up the kit. Especially preferably said solid support is a sheet, filter, membrane, plate or biochip.
Optionally the kit may also contain information relating to the signals generated by normal or diseased samples (as discussed in more detail hereinafter in relation to the use of the kits), standardizing materials, e.g. mRNA or cDNA from normal and/or diseased samples for comparative purposes, labels for incorporation into cDNA, adapters for introducing nucleic acid sequences for amplification purposes, primers for amplification and/or appropriate enzymes, "buffers and solutions. Optionally said kit may also contain a package insert describing how the method of the invention should be performed, optionally providing standard graphs, data or software for interpretation of results obtained when performing the invention.
The use of such kits to prepare a standard diagnostic gene transcript pattern as described hereinafter forms a further aspect of the invention.
The set of probes as described herein have various uses. Principally however they are used to assess the gene expression state of a test cell to provide information relating to the organism from which said cell is derived. Thus the probes are useful in diagnosing, identifying or monitoring a cancer or stage thereof in an organism.
Thus in a further aspect the invention provides the use of a set of oligonucleotide probes or a kit as described hereinbefore to determine the gene expression pattern of a cell which pattern reflects the level of gene expression of genes to which said oligonucleotide probes bind, comprising at least the steps of: a) isolating mRNA from said cell, which may optionally be reverse transcribed to cDNA; b) hybridizing the mRNA or cDNA of step (a) to a set of ol igonucleotide probes or a kit as defined herein, and c) assessing the amount of mRNA or cDNA hybridizing to each of said probes to produce said pattern.
The mRNA and cDNA as referred to in this method, and the methods hereinafter, encompass derivatives or copies of said molecules, e.g. copies of such molecules such as those produced by amplification or the preparation of complementary strands, but which retain the identity of the mRNA sequence, ie. would hybridize to the direct transcript (or its complementary sequence) by virtue of precise complementarity, or sequence identity, over at least a region of said molecule. It will be appreciated that complementarity will not exist over the entire region where techniques have been used which may truncate the transcript or introduce new sequences, e.g. by primer amplification. For convenience, said mRNA or cDNA is preferably amplified prior to step b). As with the oligonucleotides described herein said molecules may be modified, e.g. by using non-natural bases during synthesis providing complementarity remains. Such molecules may also carry additional moieties such as signalling or immobilizing means.
The various steps involved in the method of preparing such a pattern are described in more detail hereinafter.
As used herein "gene expression” refers to transcription of a particular gene to produce a specific mRNA product (ie. a particular splicing product).
The level of gene expression may be determined by assessing the level of transcribed mRNA molecules or cDNA molecules reverse transcribed from the mRNA molecules or products derived from those molecules, e.g. by amplification.
The "pattern" created by this technique refers to information which, for example, may be represented in tabular or graphical form and conveys information about the signal associated with two or more oligonucleotides.
Preferably said pattern is expressed as an array of numbers relating to the expression level associated with each probe.
Preferably, said pattern is established using the following linear model: y=Xb+f Equation 1 wherein, X is the matrix of gene expression data and y is the response variable, b is the regression coefficient vector and f the estimated residual vector. Although many different methods can be used to establish the relationship provided in equation 1, especially preferably the partial Least Squares Regression (PLSR) method is used for establishing the relationship in equation 1.
The probes are thus used to generate a pattern which reflects the gene expression of a cell at the time of its isolation. The pattern of expression 1s characteristic of the circumstances under which that cells finds itself and depends on the influences to which the cell has been exposed. Thus, a characteristic gene transcript pattern standard or fingerprint (standard probe pattern) for cells from an individual with a particular cancer may be prepared and used for comparison to transcript patterns of test cells. This has clear applications in diagnosing, monitoring or identifying whether an organism is suffering from a particular cancer or stage thereof.
The standard pattern is prepared by determining the extent of binding of total mRNA (or cDNA or related product), from cells from a sample of one or more organisms with the cancer or stage thereof, to the probes. This reflects the level of transcripts which are present which correspond to each unique probe.
The amount of nucleic acid material which binds to the different probes is assessed and this information together forms the gene transcript pattern standard of that cancer or stage thereof. Each such standard pattern is characteristic of the cancer or stage thereof.
In a further aspect therefore, the present invention provides a method of preparing a standard gene transcript pattern characteristic of a cancer or stage thereof in an organism comprising at least the steps of: a) isolating mRNA from the cells of a sample of one or more organisms having the cancer or stage thereof, which may optionally be reverse transcribed to cDNA; b) hybridizing the mRNA or cDNA of step (a) to a set of oligonucleotides or a kit as described hereinbefore specific for said cancer or stage thereof in an organism and sample thereof corresponding to the organism and sample thereof under investigation; and c) assessing the amount of mRNA or cDNA hybridizing to each of said probes to produce a characteristic pattem reflecting the level of gene expression of genes to which said oligonucleotides bind, in the sample with the cancer or stage thereof.
For convenience, said oligonucleotides are preferably immobilized on one or more solid supports.
The standard pattern for a great number of cancers and different stages thereof using particular probes may be accumulated in databases and be made available to laboratories on request. "Disease" samples and organisms or "cancer" samples and organisms as referred to herein refer to organisms (or samples from the same) with abnormal cell proliferation e.g, in a solid mass such as a tumour. Such organisms are known to have, or which exhibit, the cancer or stage thereof under study. "Stages" thereof refer to different stages of the cancer which may or may not exhibit particular physiological or metabolic changes, but do exhibit changes at the genetic level which may be detected as altered gene expression. It will be appreciated that during the course of a cancer the expression of different transcripts may vary. Thus at different stages, altered expression may not be exhibited for particular transcripts compared to "normal" samples. However,
combining information from several transcripts which exhibit altered expression at one or more stages through the course of the cancer can be used to provide a characteristic pattern which is indicative of a particular stage of the cancer. Thus for example different stages in cancer, €.g. pre-stage 1, stage I, stage II, II or v can be identified. "Normal" as used herein refers to organisms or samples which are used for comparative purposes. Preferably, these are "normal” in the sense that they do not exhibit any indication of, or are not believed to have, any disease or condition that would affect gene expression, particularly in respect of cancer for which they are to be used as the normal standard. However, it will be appreciated that different stages of a cancer may be compared and in such cases, the "normal" sample may correspond to the earlier stage of the cancer.
As used herein a "sample" refers to any material obtained from the organism, e.g. human or non-human animal under investigation which contains cells and includes, tissues, body fluid or body waste or in the case of prokaryotic organisms, the organism itself. "Body fluids" include blood, saliva, spinal fluid, semen, lymph. "Body waste" includes urine, expectorated matter (pulmonary patients), faeces etc. "Tissue samples" include tissue obtained by biopsy, by surgical interventions or by other means e.g. placenta. Preferably however, the samples which are examined are from areas of the body not apparently affected by the cancer. The cells in such samples are not disease cells, i.e. cancer cells, have not been in contact with such disease cells and do not originate from the site of the cancer. The "site of disease" is considered to be that area of the body which manifests the disease in a way which may be objectively determined, e.g. a tumour. Thus for example peripheral blood may be used for the diagnosis of non-haematopoietic cancers, and the blood does not require the presence of malignant or disseminated cells from the cancer in the blood. Similarly in diseases of the brain, in which no diseased cells are found in the blood due to the blood-brain barrier, peripheral blood may still be used in the methods of the invention.
It will however be appreciated that the method of preparing the standard transcription pattern and other methods of the invention are also applicable for use on living parts of eukaryotic organisms such as cell lines and organ cultures and explants.
As used herein, reference to "corresponding" sample etc. refers to cells preferably from the same tissue, body fluid or body waste, but also includes cells 5 . from tissue, body fluid or body waste which are sufficiently similar for the purposes of preparing the standard or test pattem. When used in reference to genes "corresponding" to the probes, this refers to genes which are related by sequence (which may be complementary) to the probes although the probes may reflect different splicing products of expression. "Assessing" as used herein refers to both quantitative and qualitative assessment which may be determined in absolute or relative terms.
The invention may be put into practice as follows.
To prepare a standard transcript pattern for a particular cancer or stage thereof, sample mRNA is extracted from the cells of tissues, body fluid or body waste according to known techniques (see for example Sambrook et. al. (1989),
Molecular Cloning : A laboratory manual, 2nd Ed., Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, N.Y.) from a diseased individual or organism.
Owing to the difficulties in working with RNA, the RNA is preferably reverse transcribed at this stage to form first strand cDNA. Cloning of the cDNA or selection from, or using, a cDNA library is not however necessary in this or other methods of the invention. Preferably, the complementary strands of the first strand cDNAs are synthesized, ie. second strand cDNAs, but this will depend on which relative strands are present in the oligonucleotide probes. The
RNA may however alternatively be used directly without reverse transcription and may be labelled if so required.
Preferably the cDNA strands are amplified by known amplification techniques such as the polymerase chain reaction (PCR) by the use of appropriate primers. Alternatively, the cDNA strands may be cloned with a vector, used to transform a bacteria such as E. coli which may then be grown to multiply the nucleic acid molecules. ‘When the sequence of the cDNAs are not known, primers may be directed to regions of the nucleic acid molecules which have been introduced. Thus for example, adapters may be ligated to the cDNA molecules and primers directed to these portions for amplification of the cDNA molecules. Alternatively, in the case of eukaryotic samples, advantage may be taken of the polyA tail and cap of the RNA to prepare appropriate primers.
S To produce the standard diagnostic gene transcript pattern or fingerprint for a particular cancer or stage thereof, the above described oligonucleotide probes are used to probe mRNA or cDNA of the diseased sample to produce a signal for hybridization to each particular oligonucleotide probe species, ie. each unique probe. A standard control gene transcript pattern may also be prepared if desired using mRNA or cDNA from a normal sample. Thus, mRNA or cDNA is brought into contact with the oligonucleotide probe under appropriate conditions to allow hybridization.
When multiple samples are probed, this may be performed consecutively using the same probes, e.g. on one or more solid supports, ie. on probe kit modules, or by simultaneously hybridizing to corresponding probes, e.g. the modules of a corresponding probe kit.
To identify when hybridization occurs and obtain an indication of the number of transcripts/cDNA molecules which become bound to the oligonucleotide probes, it is necessary to identify a signal produced when the transcripts (or related molecules) hybridize (e.g. by detection of double stranded nucleic acid molecules or detection of the number of molecules which become bound, after removing unbound molecules, e.g. by washing).
In order to achieve a signal, either or both components which hybridize (ie. the probe and the transcript) carry or form a signalling means or a part thereof. This "signalling means" is any moiety capable of direct or indirect detection by the generation or presence of a signal. The signal may be any "detectable physical characteristic such as conferred by radiation emission, scattering or absorption properties, magnetic properties, or other physical properties such as charge, size or binding properties of existing molecules (e.g. labels) or molecules which may be generated (e.g. gas emission etc.).
Techniques are preferred which allow signal amplification, e.g. which produce multiple signal events from a single active binding site, e.g. by the catalytic action of enzymes to produce multiple detectable products.
Conveniently the signalling means may be a label which itself provides a detectable signal. Conveniently this may be achieved by the use of a radioactive or other label which may be incorporated during cDNA production, the preparation of complementary cDNA strands, during amplification of the target mRNA/cDNA or added directly to target nucleic acid molecules.
Appropriate labels are those which directly or indirectly allow detection or measurement of the presence of the transcripts/cDNA. Such labels include for example radiolabels, chemical labels, for example chromophores or fluorophores (e.g. dyes such as fluorescein and rhodamine), or reagents of high electron density such as ferritin, haemocyanin or colloidal gold. Alternatively, the label may be an enzyme, for example peroxidase or alkaline phosphatase, wherein the presence of the enzyme is visualized by its interaction with a suitable entity, for example a substrate. The label may also form part of a signalling pair wherein the other member of the pair is found on, or in close proximity to, the oligonucleotide probe to which the transcript/cDNA binds, for example, a fluorescent compound and a quench fluorescent substrate may be used. A label may also be provided on a different entity, such as an antibody, which recognizes a peptide moiety attached to the transcripts/cDNA, for example attached to a base used during synthesis or amplification.
A signal may be achieved by the introduction of a label before, during or after the hybridization step. Alternatively, the presence of hybridizing transcripts may be identified by other physical properties, such as their absorbance, and in which case the signalling means is the complex itself.
The amount of signal associated with each oligonucleotide probe is then "assessed. The assessment may be quantitative or qualitative and may be based on binding of a single transcript species (or related cDNA or other products) to each probe, or binding of multiple transcript species to multiple copies of each unique probe. It will be appreciated that quantitative results will provide further information for the transcript fingerprint of the cancer which is compiled. This data may be expressed as absolute values (in the case of macroarrays) or may be determined relative to a particular standard or reference e.g. a normal control sample.
Furthermore it will be appreciated that the standard diagnostic gene pattern transcript may be prepared using one or more disease samples (and normal samples if used) to perform the hybridization step to obtain patterns not biased towards a particular individual's variations in gene expression.
The use of the probes to prepare standard patterns and the standard diagnostic gene transcript patterns thus produced for the purpose of identification or diagnosis or monitoring of a particular cancer or stage thereof in a particular organism forms a further aspect of the invention.
Once a standard diagnostic fingerprint or pattern has been determined for a particular cancer or stage thereof using the selected oligonucleotide probes, this information can be used to identify the presence, absence or extent or stage of that cancer in a different test organism or individual.
To examine the gene expression pattern of a test sample, a test sample of tissue, body fluid or body waste containing cells, corresponding to the sample used for the preparation of the standard pattern, is obtained from a patient or the organism to be studied. A test gene transcript pattern is then prepared as described hereinbefore as for the standard pattern.
In a further aspect therefore, the present invention provides a method of preparing a test gene transcript pattern comprising at least the steps of: a) isolating mRNA from the cells of a sample of said test organism, which may optionally be reverse transcribed to cDNA; b) hybridizing the mRNA or cDNA of step (2) to a set of oligonucleotides or a kit as described hereinbefore specific for a cancer or stage thereof in an organism and sample thereof corresponding to the organism and sample thereof under investigation, and ¢) assessing the amount of mRNA or cDNA hybridizing to each of said probes to produce said pattern reflecting the level of gene expression of genes to which said oligonucleotides bind, in said test sample.
This test pattern may then be compared to one or more standard pattems to assess whether the sample contains cells having the cancer or stage thereof.
Thus viewed from a further aspect the present invention provides a method of diagnosing or identifying or monitoring a cancer or stage thereof in an organism, comprising the steps of: a) isolating mRNA from the cells of a sample of said organism, which may optionally be reverse transcribed to cDNA; b) hybridizing the mRNA or cDNA of step (a) to a set of oligonucleotides or a kit as described hereinbefore specific for said cancer or stage thereof in an organism and sample thereof corresponding to the organism and sample thereof under investigation; c) assessing the amount of mRNA or cDNA hybridizing to each of said probes to produce a characteristic pattern reflecting the level of gene expression of genes to which said oligonucleotides bind, in said sample; and d) comparing said pattern to a standard diagnostic pattern prepared according to the method of the invention using a sample from an organism corresponding to the organism and sample under investigation to determine the presence of said cancer or a stage thereof in the organism under investigation.
The method up to and including step c) is the preparation of a test pattern as described above.
As referred to herein, "diagnosis" refers to determination of the presence or existence of a cancer or stage thereof in an organism. "Monitoring" refers to establishing the extent of a cancer, particularly when an individual is known to be suffering from cancer, for example to monitor the effects of treatment or the development of a cancer, e.g. to determine the suitability of a treatment or provide a prognosis.
The presence of the cancer or stage thereof may be determined by determining the degree of correlation between the standard and test samples’ patterns. This necessarily takes into account the range of values which are obtained for normal and diseased samples. Although this can be established by obtaining standard deviations for several representative samples binding to the probes to develop the standard, it will be appreciated that single samples may be sufficient to generate the standard pattern to identify a cancer if the test sample exhibits close enough correlation to that standard. Conveniently, the presence,
absence, or extent of a cancer or stage thereof in a test sample can be predicted by inserting the data relating to the expression level of informative probes in test : sample into the standard diagnostic probe pattern established according to equation 1.
Data generated using the above mentioned methods may be analysed using various techniques from the most basic visual representation (e.g. relating to intensity) to more complex data manipulation to identify underlying pattems which reflect the interrelationship of the level of expression of each gene to which the various probes bind, which may be quantified and expressed mathematically. Conveniently, the raw data thus generated may be manipulated by the data processing and statistical methods described hereinafter, particularly normalizing and standardizing the data and fitting the data to a classification model to determine whether said test data reflects the pattern of a particular cancer or stage thereof.
The methods described herein may be used to identify, monitor or diagnose a cancer or its stage or progression, for which the oligonucleotide probes are informative. "Informative" probes as described herein, are those which reflect genes which have altered expression in the cancer in question, or particular stages thereof. Probes of the invention may not be sufficiently informative for diagnostic purposes when used alone, but are informative when used as one of several probes to provide a characteristic pattern, e.g. in a set as described hereinbefore.
Preferably said probes correspond to genes which are systemically affected by said cancer or stage thereof. Especially preferably said genes, from which transcripts are derived which bind to probes of the invention, are moderately or highly expressed. The advantage of using probes directed to moderately or highly expressed genes is that smaller clinical samples are required for generating the necessary gene expression data set, e.g. less than 1ml blood samples.
Furthermore, it has been found that such genes which are already being actively transcribed tend to be more prone to being influenced, in a positive or negative way, by new stimuli. In addition, since transcripts are already being produced at levels which are generally detectable, small changes in those levels are readily detectable as for example, a certain detectable threshold does not need to be reached.
In preferred methods of the invention, the set of probes of the invention are informative for a variety of different cancers or stages thereof. A sub-set of the probes disclosed herein may be used for diagnosis, identification or monitoring a particular cancer or stage thereof. : Cancers for which the probes may be used for diagnosis, identification and monitoring include stomach, lung, breast, prostate gland, bowel, skin, colon and ovary cancer. Especially preferably the probes are used for breast cancer analysis.
The diagnostic method may be used alone as an alternative to other diagnostic techniques or in addition to such techniques. For example, methods of the invention may be used as an alternative or additive diagnostic measure to diagnosis using imaging techniques such as Magnetic Resonance Imagine (MRI), ultrasound imaging, puclear imaging or X-ray imaging, for example in the identification and/or diagnosis of tumours.
The methods of the invention may be performed on cells from prokaryotic or eukaryotic organisms which may be any eukaryotic organisms such as human beings, other mammals and animals, birds, insects, fish and plants, and any prokaryotic organism such as a bacteria.
Preferred non-human animals on which the methods of the invention may be conducted include, but are not limited to mammals, particularly primates, domestic animals, livestock and laboratory animals. Thus preferred animals for diagnosis include mice, rats, guinea pigs, cats, dogs, pigs, cOWs, goats, sheep, horses. Particularly preferably cancer of humans is diagnosed, identified or monitored.
As described above, the sample under study may be any convenient sample which may be obtained from an organism. Preferably however, as mentioned above, the sample is obtained from a site distant to the site of disease and the cells in such samples are not disease cells, have not been in contact with such cells and do not originate from the site of the disease. In such cases,
although preferably absent, the sample may contain cells which do not fulfil these criteria. However, since the probes of the invention are concerned with transcripts whose expression is altered in cells which do satisfy these criteria, the probes are specifically directed to detecting changes in transcript levels in those cells even if in the presence of other, background cells.
It has been found that the cells from such samples show significant and informative variations in the gene expression of a large number of genes. Thus, the same probe (or several probes) may be found to be informative in determinations regarding two or more cancers, or stages thereof by virtue of the particular level of transcripts binding to that probe or the interrelationship of the extent of binding to that probe relative to other probes. As a consequence, it is possible to use a relatively small number of probes for screening for multiple cancers. This has consequences with regard to the selection of probes, but also for the use of a single set of probes for more than one diagnosis.
Thus, the present invention also provides sets of probes for diagnosing, identifying or monitoring two or more cancers or stages thereof, wherein at least one of said probes is suitable for said diagnosing, identifying or monitoring at least two of said cancers or stages thereof, and kits and methods of using the : same. Preferably at least 5 probes, e.g. from 5 to 15 probes, are used in at least two diagnoses.
Thus, in a further preferred aspect, the present invention provides a method of diagnosis or identification or monitoring as described hereinbefore for the diagnosis, identification or monitoring of two or more cancers or stages thereof in an organism, wherein said test pattem produced in step c) of the diagnostic method is compared in step d) to at least two standard diagnostic patterns prepared as described previously, wherein each standard diagnostic pattern is a pattem generated for a different cancer or stage thereof.
Whilst in a preferred aspect the methods of assessment concern the development of a gene transcript pattern from a test sample and comparison of the same to a standard pattern, the elevation or depression of expression of certain markers may also be examined by examining the products of expression :
and the level of those products. Thus a standard pattern in relation to the expressed product may be generated.
In such methods the levels of expression of a set of polypeptides encoded by the gene to which a primary oligonucleotide or a derived oligonucleotide, binds, are analysed.
Various diagnostic methods may be used to assess the amount of . polypeptides (or fragments thereof) which are present. The presence or concentration of polypeptides may be examined, for example by the use of a binding partner to said polypeptide (e.g. an antibody), which may be immobilized, to separate said polypeptide from the sample and the amount of polypeptide may then be determined. "Fragments" of the polypeptides refers to a domain or region of said polypeptide, e.g. an antigenic fragment, which is recognizable as being derived from said polypeptide to allow binding of a specific binding partner. Preferably such a fragment comprises a significant portion of said polypeptide and corresponds to a product of normal post-synthesis processing.
Thus in a further aspect thc present invention provides a method of preparing a standard gene transcript pattern characteristic of a cancer or stage thereof in an organism comprising at least the steps of: a) releasing target polypeptides froma sample of one or more organisms having the cancer or stage thereof; b) contacting said target polypeptides with one or more binding partners, wherein each binding partner is specific to a marker polypeptide (or a fragment thereof) encoded by the gene to which a primary oligonucleotide (or derived sequence) binds, to allow binding of said binding partners to said target polypeptides, wherein said marker polypeptides are specific for said cancer in an organism and sample thereof corresponding to the organism and sample thereof under investigation; and c) assessing the target polypeptide binding to said binding partners to produce a characteristic pattern reflecting the level of gene expression of genes which express said marker polypeptides, in the sample with the cancer or stage thereof.
WQ 2005/118851 PCT/GB2005/002180 .
As used herein "target polypeptides” refer to those polypeptides present in a sample which are to be detected and "marker polypeptides" are polypeptides which are encoded by the genes to which primary oligonucleotides or derived oligonucleotides bind, ie. genes within the gene families. The target and marker polypeptides are identical or at least have areas of high similarity, e.g. epitopic regions to allow recognition and binding of the binding partner. "Release" of the target polypeptides refers to appropriate treatment of a sample to provide the polypeptides in a form accessible for binding of the binding partners, e.g. by lysis of cells where these are present. The samples used in this case need not necessarily comprise cells as the target polypeptides may be released from cells into the surrounding tissue or fluid, and this tissue or fluid may be analysed, e.g. urine or blood. Preferably however the preferred samples as described herein are used. "Binding partners" comprise the separate entities which together make an affinity binding pair as described above, wherein one partner of the binding pair is the target or marker polypeptide and the other partner binds specifically to that polypeptide, e.g. an antibody.
Various arrangements may be cnvisaged for detecting the amount of binding pairs which form. In its simplest form, a sandwich type assay ¢.g. an immunoassay such as an ELISA, may be used in which an antibody specific to the polypeptide and carrying a label (as described elsewhere herein) may be bound to the binding pair (e.g. the first antibody:polypeptide pair) and the amount of label detected.
Other methods as described herein may be similarly modified for analysis of the protein product of expression rather than the gene transcript and related nucleic acid molecules.
Thus a further aspect of the invention provides amethod of preparing a test gene transcript pattern comprising at least the steps of: a) releasing target polypeptides from a sample of said test organism, b) contacting said target polypeptides with one or more binding partners, wherein each binding partner is specific to a marker polypeptide (or a fragment thereof) encoded by the gene to which a primary oligonucleotide (or derived sequence) binds, to allow binding of said binding partners to said target polypeptides, wherein said marker polypeptides are specific for said cancer in an organism and sample thereof corresponding to the organism and sample thereof under investigation; and c) assessing the target polypeptide binding to said binding partners to produce a characteristic pattern reflecting the level of gene expression of genes which express said marker polypeptides, in said test sample. .
A yet further aspect of the invention provides a method of diagnosing or identifying or monitoring a cancer or stage thereof in an organism comprising the steps of: a) releasing target polypeptides from a sampie of said organism, b) contacting said target polypeptides with one or more binding partners, wherein each binding partner is specific to a marker polypeptide (or a fragment thereof) encoded by the gene to which a primary oligonucleotide (or derived sequence) binds, to allow binding of said binding partners to said target polypeptides, wherein said marker polypeptides are specific for said cancer in an organism and sample thereof corresponding to the organism and sample thereof under investigation; and c) assessing the target polypeptide binding to said binding partners to produce a characteristic pattern reflecting the level of gene expression of genes which express said marker polypeptides in said sample; and d) comparing said pattern to 2 standard diagnostic pattern prepared as described hereinbefore using a sample from an organism corresponding to the organism and sample under investigation to determine the degree of correlation indicative of the presence of said cancer or a stage thereof in the organism under investigation.
The methods of generating standard and test patterns and diagnostic : techniques rely on the use of informative oligonucleotide probes to generate the gene expression data. In some cases it will be necessary to select these informative probes for a particular method, e.g. to diagnose a particular cancer, from a selection of available probes, e.g. the Table 2 and/or 3 oligonucleotides, the Table 2 and/or 3 derived oligonucleotides, their complementary sequences and functionally equivalent oligonucleotides and optionally the Table 4 oligonucleotides, their derived oligonucleotides, complementary sequences and functionally equivalent oligonucleotides. Said derived oligonucleotides include oligonucleotides derived from the genes corresponding to the sequences provided in those tables, e.g. the genes set forth in Tables 2, 5 or 6 (see the
Accession numbers), or the complementary sequences thereof. The following methodology describes a convenient method for identifying such informative probes, or more particularly how to select a suitable sub-set of probes from the probes described herein.
Probes for the analysis of a particular cancer or stage thereof, may be identified in a number of ways known in the prior art, including by differential expression or by library subtraction (see for example W098/49342). As described in PCT/GB03/005102 and as described hereinafter, in view of the high information content of most transcripts, as a starting point one may also simply analyse a random sub-set of mRNA or cDNA species corresponding to the family of sequence described herein and pick the most informative probes from that sub-set. The following method describes the use of immobilized oligonucleotide probes (e.g. the probes of the invention) to which mRNA (or related molecules) from different samples are bound to identify which probes are the most informative to identify a particular type of cancer, e.g. 2 disease sample.
The immobilized probes can be derived from various unrelated or related organisms; the only requirement is that the immobilized probes should bind specifically to their homologous counterparts in test organisms. Probes can also be derived from commercially available or public databases and immobilized on solid supports. The selected probes necessarily correspond to one of the genes in the gene sequence families described herein, but the probes of interest may be randomly selected from within that entire group of families.
The length of the probes immobilised on the solid support should be long enough to allow for specific binding to the target sequences. The immobilised probes can be in the form of DNA, RNA or their modified products or PNAs (peptide nucleic acids). Preferably, the probes immobilised should bind specifically to their homologous counterparts representing highly and moderately expressed genes in test organisms. Conveniently the probes which are used are the probes described herein.
The gene expression pattern of cells in biological samples can be generated using prior art techniques such as microarray or macroarray as
S described below or using methods described herein. Several technologies have now been developed for monitoring the expression level of a large number of genes simultaneously in biological samples, such as, high-density oligoarrays (Lockhart et al., 1996, Nat. Biotech., 14, p1675 -1680), cDNA microarrays (Schena et al, 1995, Science, 270, p467 -470) and cDNA macroarrays (Maier E et al., 1994, Nucl. Acids Res., 22, p3423-3424; Bernard et al., 1996, Nucl. Acids
Res., 24, p1435-1442).
In high-density oligoarrays and cDNA microarrays, hundreds and thousands of probe oligonucleotides or cDNAs, are spotted onto glass slides or nylon membranes, or synthesized on biochips. The mRNA isolated from the test and reference samples are labelled by reverse transcription with a red or green fluorescent dye, mixed, and hybridised to the microarray. After washing, the bound fluorescent dyes are detected by a laser, producing two images, one for each dye. The resulting ratio of the red and green spots on the two images provides the information about the changes in expression levels of genes in the test and reference samples. Alternatively, single channel or multiple channel microarray studies can also be performed.
In cDNA macroarray, different cDNAs are spotted on a solid support such as nylon membranes in excess in relation to the amount of test mRNA that can hybridise to each spot. mRNA isolated from test samples is radio- labelled by reverse transcription and hybridised to the immobilised probe cDNA. After washing, the signals associated with labels bybridising specifically to immobilised probe cDNA are detected and quantified. The data obtained in macroarray contains information about the relative levels of transcripts present in the test samples. Whilst macroarrays are only suitable to monitor the expression of a limited number of genes, microarrays can be used to monitor the expression of several thousand genes simultaneously and is, therefore, a preferred choice for large-scale gene expression studies.
A macroarray technique for generating the gene expression data set has been used to illustrate the probe identification method described herein. For this . purpose, mRNA is isolated from samples of interest and used to prepare labelled target molecules, e.g. mRNA or cDNA as described above. The labelled target molecules are then hybridised to probes immobilised on the solid support.
Various solid supports can be used for the purpose, as described previously.
Following hybridization, unbound target molecules are removed and signals from target molecules hybridizing to immobilised probes quantified. If radio labelling is performed, Phospholmager can be used to generate an image file that can be used to generate araw data set. Depending on the nature of label chosen for labelling the target molecules, other instruments can also be used, for example, when fluorescence is used for labelling, a Fluorolmager can be used to generate an image file from the hybridised target molecules.
The raw data corresponding to mean intensity, median intensity, or volume of the signals in each spot can be acquired from the image file using commercially available software for image analysis. However, the acquired data needs to be corrected for background signals and normalized prior to analysis, since, several factors can affect the quality and quantity of the hybridising signals. For example, variations in the quality and quantity of mRNA isolated from sample to sample, subtle variations in the efficiency of labelling target molecules during each reaction, and variations in the amount of unspecific binding between different macroarrays can all contribute to noise in the acquired data set that must be corrected for prior to analysis.
Background correction can be performed in several ways. The lowest pixel intensity within a spot can be used for background subtraction or the mean or median of the line of pixels around the spots’ outline can be used for the purpose. One can also define an area representing the background intensity based on the signals generated from negative controls and use the average intensity of this area for background subtraction.
The background corrected data can then be transformed for stabilizing the variance in the data structure and normalized for the differences in probe intensity. Several transformation techniques have been described in the literature and a brief overview can be found in Cui, Kerr and Churchill http://www jax.org/research/ churchill/research/ expression/Cui-Transform. pdf).
Normalization can be performed by dividing the intensity of each spot with the collective intensity, average intensity or median intensity of all the spots ina macroarray or a group of spots in a macroarray in order to obtain the relative intensity of signals hybridising to immobilised probes in a macroarray. Several. methods have been described for normalizing gene expression data (Richmond and Somerville, 2000, Current Opin. Plant Biol., 3, p108-116; Finkelstein et al., 2001, In "Methods of Microarray Data Analysis. Papers from CAMDA, Eds.
Lin & Johnsom, Kluwer Academic, p57-68; Yang et al., 2001, In "Optical
Technologies and Informatics”, Bds. Bittner, Chen, Dorsel & Dougherty,
Proceedings of SPIE, 4266, p141-1 52; Dudoit et al, 2000, J. Am. Stat. Ass., 97, p77-87; Alter et al 2000, supra; Newton et al., 2001, J. Comp. Biol, 8, p37-52).
Generally, a scaling factor or function is first calculated to correct the intensity effect and then used for normalising the intensities. The use of external controls has also been suggested for improved normalization.
One other major challenge encountered in large-scale gene expression analysis is that of standardization of data collected from experiments performed at different times. We have observed that gene expression data for samples acquired in the same experiment can be efficiently compared following background correction and normalization. However, the data from samples acquired in experiments performed at different times requires further standardization prior to analysis. This is because subtle differences in experimental parameters between different experiments, for example, differences in the quality and quantity of mRNA extracted at different times, differences in time used for target molecule labelling, hybridization time or exposure time, can affect the measured values. Also, factors such as the nature of the sequence of transcripts under investigation (their GC content) and their amount in relation to the each other determines how they are affected by subtle variations in the experimental processes. They determine, for example, how efficiently first strand cDNAs, corresponding to a particular transcript, are transcribed and labelled during first strand synthesis, or how efficiently the corresponding labelled target molecules bind to their complementary sequences during hybridization. Batch to batch difference in the printing process is also a major factor for variation in the generated expression data.
Failure to properly address and rectify for these influences leads to situations where the differences between the experimental series may overshadow the main information of interest contained in the gene expression data set, i.e. the differences within the combined data from the different experimental series. Hence, when required the expression data should be batch- adjusted prior to data analysis.
Monitoring the expression of a large number of genes in several samples leads to the generation of a large amount of data that is too complex to be easily interpreted. Several unsupervised and supervised multivariate data analysis techniques have already been shown to be useful in extracting meaningful biological information from these large data sets. Cluster analysis is by far the most commonly used technique for gene expression analysis, and has been performed to identify genes that are regulated in a similar manner, and or identifying new/unknown tumour classes using gene expression profiles (Eisen et al., 1998, PNAS, 95, p14863-14868, Alizadeh et al. 2000, supra, Perou et al. 2000, Nature, 406, p747-752; Ross et al, 2000, Nature Genetics, 24(3), p227- 235; Herwig et al., 1999, Genome Res., 9, p1093-1105; Tamayo et al, 1999,
Science, PNAS, 96, p2907-2912). 1n the clustering method, genes are grouped into functional categoties
1999, Tech report Univ Stanford.) Plaid model (Lazzeroni, 2002, Stat. Sinica, 12, p61-86), and self-organizing maps (Tamayo et al. 1999, supra). Also, related methods of multivariate statistical analysis, such as those using the singular value decomposition (Alter et al., 2000, PNAS, 97(18), p10101-10106; Ross et al. 2000, supra) or multidimensional scaling can be effective at reducing the dimensions of the objects under study.
However, methods such as cluster analysis and singular value decomposition are purely exploratory and only provide a broad overview of the internal structure present in the data. They are unsupervised approaches in which the available information concerning the nature of the class under investigation is not used in the analysis. Often, the nature of the biological perturbation to which a particular sample has been subjected is known. For example, it is sometimes known whether the sample whose gene expression pattern is being analysed derives from a diseased or healthy individual. In such instances, discriminant analysis can be used for classifying samples into various groups based on their gene expression data.
In such an analysis one builds the classifier by training the data that is capable of discriminating between member and non-members of a given class.
The trained classifier can then be used to predict the class of unknown samples.
Examples of discrimination methods that have been described in the literature include Support Vector Machines (Brown et al, 2000, PNAS, 97, p262-267),
Nearest Neighbour (Dudoit et al., 2000, supra), Classification trees (Dudoit et al., 2000, supra), Voted classification (Dudoit et al., 2000, supra), Weighted
Gene voting (Golub et al. 1999, supra), and Bayesian classification (Keller et al. 2000, Tec report Univ of Washington). Also a technique in which PLS (Partial
Least Square) regression analysis is first used to reduce the dimensions in the gene expression data set followed by classification using logistic discriminant analysis and quadratic discriminant analysis (LD and QDA) has recently been described (Nguyen & Rocke, 2002, Bioinformatics, 18, p39-50 and 1216-1226).
A challenge that gene expression data poses to classical discriminatory methods is that the number of genes whose expression are being analysed is very large compared to the number of samples being analysed. However in most cases only a small fraction of these genes are informative in discriminant analysis problems. Moreover, there is a danger that the noise from irrelevant genes can mask or distort the information from the informative genes. Several methods have been suggested in literature to identify and select genes that are informative in microarray studies, for example, t-statistics (Dudoit et al, 2002, J.
Am. Stat. Ass., 97, p77-87), analysis of variance (Kerr et al., 2000, PNAS, 98, p8961-8965), Neighbourhood analysis (Golub et al, 1999, supra), Ratio of between groups to within groups sum of squares (Dudoit et al., 2002, supra),
Non parametric scoring (Park et al., 2002, Pacific Symposium on Biocomputing, p52-63) and Likelihood selection (Keller et al., 2000, supra).
In the methods described herein the gene expression data that has been normalized and standardized is analysed by using Partial Least Squares
Regression (PLSR). Although PLSR is primarily a method used for regression analysis of continuous data (see Appendix A), it can also be utilized as a method for model building and discriminant analysis using a dummy response matrix based on a binary coding. The class assignment is based on a simple dichotomous distinction such as breast cancer (class 1) / healthy (class 2),o0ra multiple distinction based on multiple disease diagnosis such as breast cancer (class 1) / ovarian cancer (class 2) / healthy (class 3). The list of diseases for classification can be increased depending upon the samples available corresponding to other cancers or stages thereof.
PLSR applied as a classification method is referred to as PLS-DA (DA standing for Discriminant analysis). PLS-DA is an extension of the PLSR algorithm in which the Y-matrix is a dummy matrix containing zn YOws (corresponding to the number of samples) and K columas (corresponding to the number of classes). The Y-matrix is constructed by inserting 1 in the kth column and -1 in all the other columns if the corresponding ith object of X belongs to class k. By regressing Y onto X, classification of a new sample is achieved by selecting the group corresponding to the largest component of the fitted, x)= (0 1(%), § 2X);--» F(x). Thus, in a-1/1 response matrix, a prediction value below 0 means that the sample belongs to the class designated as -1, while a prediction value above 0 implies that the sample belongs to the class designated as 1.
An advantage of PLSR-DA is that the results obtained can be easily represented in the form of two different plots, the score and loading plots. Score plots represent a projection of the samples onto the principal components and shows the distribution of the samples in the classification model and their relationship to one another. Loading plots display correlations between the variables present in the data set.
It is usually recommended to use PLS-DA as a starting point for the classification problem due to its ability to handle collinear data, and the property of PLSR as a dimension reduction technique. Once this purpose has been satisfied, it is possible to use other methods such as Linear discriminant analysis,
LDA, that has been shown to be effective in extracting further information,
Indahl et al. (1999, Chem. and Intell. Lab. Syst., 49, p19-31). This approach is based on first decomposing the data using PLS-DA, and then using the scores vectors (instead of the original variables) as input to LDA. Further details on
LDA can be found in Duda and Hart (Classification and Scene Analysis, 1973,
Wiley, USA).
The next step following model building is of model validation. This step is considered to be amongst the most important aspects of multivariate analysis, and tests the "goodness" of the calibration model which has been built. In this work, a cross validation approach has been used for validation. In this approach, one or a few samples are kept out in each segment while the model is built using a full cross-validation on the basis of the remaining data. The samples left out are then used for prediction/classification. Repeating the simple cross-validation process several times holding different samples out for each cross-validation leads to a so-called double cross-validation procedure. This approach has been shown to work well with a limited amount of data, as is the case in some of the
Examples described here. Also, since the cross validation step is repeated several times the dangers of model bias and overfitting are reduced.
Once a calibration model has been built and validated, genes exhibiting an expression pattem that is most relevant for describing the desired information in the model can be selected by techniques described in the prior art for variable selection, as mentioned elsewhere. Variable selection will help in reducing the final model complexity, provide a parsimonious model, and thus lead to a reliable model that can be used for prediction. Moreover, use of fewer genes for the purpose of providing diagnosis will reduce the cost of the diagnostic product.
In this way informative probes which would bind to the genes of relevance may- be identified.
We have found that after a calibration model has been built, statistical techniques like Jackknife (Effron, 1982, The Jackknife, the Bootstrap and other resampling plans. Society for Industrial and Applied mathematics, Philadelphia,
USA), based on resampling methodology, can be efficiently used to select or confirm significant variables (informative probes). The approximate uncertainty variance of the PLS regression coefficients B can be estimated by:
M
S?B = % (B-Bn)2)’ m=1 where
S2B = estimated uncertainty variance of B;
B = the regression coefficient at the cross validated rank A using all the N objects;
B,, = the regression coefficient at the rank A using all objects except the object(s) left out in cross validation segment m; and g = scaling coefficient (here: g=1).
In our approach, Jackknife has been implemented together with cross-validation. For each variable the difference between the B-coefficients B; in a cross-validated sub-model and Biot for the total model is first calculated.
The sum of the squares of the differences is then calculated in all sub-models to obtain an expression of the variance of the B; estimate for a variable. The significance of the estimate of By is calculated using the t-test. Thus, the resulting regression coefficients can be presented with uncertainty limits that correspond to 2 Standard Deviations, and from that significant variables are detected.
No further details as to the implementation or use of this step are provided here since this has been implemented in commercially available software, The Unscrambler, CAMO ASA, Norway. Also, details on variable selection using Jackknife can be found in Westad & Martens (2000, J. Near Inf.
Spectr., 8, p117-124).
The following approach can be used to select informative probes from a gene expression data set: a) keep out one unique sample (including its repetitions if present in the data set) per cross validation segment, b) build a calibration model (cross validated segment) on the remaining samples using PLSR-DA; c) select the significant genes for the model in step b) using the
Jackknife criterion; d) repeat the above 3 steps until all the unique samples in the data . set are kept out once (as described in step a). For example, if 75 unique samples are present in the data set, 75 different calibration models are built resulting in a collection of 75 different sets of significant probes; e) select the most significant variables using the frequency of occurrence criterion in the generated sets of significant probes in step d). For example, a set of probes appearing in all sets (100%) are more informative than probes appearing in only 50% of the generated sets in step d).
Once the informative probes for a disease have been selected, a final model is made and validated. The two most commonly used ways of validating the model are cross-validation (CV) and test set validation. In cross-validation, the data is divided into k subsets. The model is then trained k times, each time leaving out one of the subsets from training, but using only the omitted subset to compute error criterion, RMSEP (Root Mean Square Error of Prediction). Ifk equals the sample size, this is called "leave-one-out" cross-validation. The idea of leaving one or a few samples out per validation segment is valid only in cases where the covariance between the various experiments is zero. Thus, one sample at-a-time approach can not be justified in situations containing replicates since keeping only one of the replicates out will introduce a systematic bias in our analysis. The correct approach in this case will be to leave out all replicates of the same samples at a time since that would satisfy assumptions of zero covariance between the CV-segments.
The second approach for model validation is to use a separate test-set for validating the calibration model. This requires running a separate set of experiments to be used as a test set. This is the preferred approach given that real test data are available.
The final model is then used to identify a cancer or stage thereof in test samples. For this purpose, expression data of selected informative genes is generated from test samples and then the final model is used to determine whether a sample belongs to a diseased or non-diseased class or has a cancer or stage thereof.
Preferably a model for classification purposes is generated by using the data relating to the probes identified according to the above described method.
Preferably the sample is as described previously. Preferably the oligonucleotides which are ;mmobilized in step (a) are randomly selected from within the family described hereinbefore, but alternatively may be selected to represent the different families, e.g. by selecting one or more of the oligonucleotides corresponding to genes encoding proteins with common functions in different families. Bspecially preferably, said selection is made to encompass oligonucleotides derived from genes of family (i) and (ii). Such oligonucleotides may be of considerable length, e.g. if using CDNA (which is encompassed within the scope of the term "oligonucleotide"). The identification of such cDNA molecules as useful probes allows the development of shorter oligonucleotides which reflect the specificity of the cDNA molecules but are easier to manufacture and manipulate.
The above described model may then be used to generate and analyse data of test samples and thus may be used for the diagnostic methods of the invention. In such methods the data generated from the test sample provides the gene expression data set and this is normalized and standardized as described above. This is then fitted to the calibration model described above to provide classification.
The method described herein can also be used to simultaneously select informative probes for several cancers. Depending upon which cancers have been included in the calibration or training set, informative probes can be selected for the said cancers. The informative probes selected for one cancer may or may not be similar to the informative probes selected for another cancer of interest. It is the pattern with which the selected genes are expressed in relation to each other during a cancer or stage thereof, that determines whether or not they are informative for the cancer or stage thereof.
In other words, informative genes are selected based on how their expression correlates with the expression of other selected informative genes under the influence of responses generated by the cancer or stage thereof under investigation.
For the purpose of isolating informative probes or identifying several cancers and stages thereof simultaneously, the gene expression data set must contain the information on how genes are expressed when the subject has a particular cancer or stage thereof under investigation. The data set is generated from a set of healthy or diseased samples, where a particular sample may contain the information of only one cancer or stage thereof or may also contain information about multiple cancers or stages thereof. Hence, the method also teaches an efficient experimental design to reduce the number of samples required for isolating informative probes by selecting samples representing more than one cancer or stage thereof.
As mentioned previously, in view of the high information content of most transcripts, the identification and selection of informative probes for use in diagnosing, monitoring or identifying a particular cancer or stage thereof may be dramatically simplified. Thus the pool of genes from which a selection may be made to identify informative probes may be radically reduced.
Unlike, in prior art technologies where informative probes are selected from a population of thousands of genes that are being expressed in a cell, like in microarray, in the method described herein, the informative probes are selected from 2 limited number of genes as described in the gene sequence families described hereinbefore. From within these families, probes of interest may be randomly selected.
Thus in a preferred aspect, said set of oligonucleotides are randomly 3 selected from the primary oligonucleotides as described hereinbefore.
As referred to herein "random" refers to selection which is not biased based on the extent of information carried by the transcripts in relation to the cancer or organism under study, ie. without bias towards their likely utility as informative probes. Whilst a random selection may be made from a pool of transcripts (or related products) which have been biased, e.g. to highly or moderately expressed transcripts, preferably random selection is made from a pool of transcripts not biased or selected by a sequence-based criterion. The larger set may therefore contain oligonucleotides corresponding to highly and moderately expressed genes, or alternatively, may be enriched for those corresponding to the highly and moderately expressed genes.
Random selection from highly and moderately expressed genes can be achieved in a wide variety of ways. For example, by randomly picking a significant number of cDNA clones from a cDNA library constructed from a biological specimen under investigation containing clones corresponding to the gene sequence families described hereinbefore. Since, in a cDNA library, the cDNA clones corresponding to transcripts present in high or moderate amount are more frequently present than transcripts corresponding to cDNA present in low amount, the former will tend to be picked up more frequently than the latter.
A pool of cDNA enriched for those corresponding to highly and moderately expressed genes can be isolated by this approach.
To identify genes that are expressed in high or moderate amount among the isolated population for use in methods of the invention, the information about the relative level of their transcripts in samples of interest can be generated using several prior art techniques. Both non-sequence based methods, such as differential display or RNA fingerprinting, and sequence-based methods such as microarrays or acroarrays can be used for the purpose. Alternatively, specific primer sequences for highly and moderately expressed genes can be designed and methods such as quantitative RT-PCR can be used to determine the levels of highly and moderately expressed genes. Hence, a skilled practitioner may use a variety of techniques which are known in the art for determining the relative level of mRNA in a biological sample.
Especially preferably the sample for the isolation of mRNA in the above described method is as described previously and is preferably not from the site . of disease and the cells in said sample are not disease cells and have not contacted disease cells, for example the use of a peripheral blood sample for detection of non-haematopoietic cancer, €.8. breast cancer.
The following examples are given by way of illustration only in which the Figures referred to are as follows:
Figure 1 shows the possible interplay of various factors responsible for changes in expression in an individual with breast cancer,
Figure 2 shows the projection of 102 normal (including benign) and breast cancer samples onto a classification model generated by PLSR-DA using the data of 35 informative genes, in which PC is the principal components and N and C are normal and breast cancer samples, respectively;
Figure 3 shows a prediction plot based on 3 principal components using the data of 35 cDNAs; and
Figure 4 shows the mean level of expression of the 35 genes used for prediction of breast cancer.
Example 1: Diagnosis of Breast Cancer
Methods
Blood samples
Blood samples were collected from donors with their informed consent under an approval from Regional Ethical Committee of Norway. All donors were treated anonymously during analysis. Blood was drawn from females with a suspect initial mammogram, which included both females with breast cancer and fernales with abnormal mammograms, prior to any knowledge of whether the
~ WO 2005/118851 PCT/GB2005/002180 abnormality observed during first screening was benign or malignant. In all cases, the blood samples were drawn between 8 am. and 4 p.m. From each female, 10 ml blood was drawn by skilled personnel either in vacutainer tubes containing EDTA as anticoagulant (Becton Dickinson, Baltimore, USA) or directly in PAXgene ™ tubes (PreAnalytiX, Hombrechtikon, Switzerland).
Blood collected in EDTA tubes was immediately stored at - 80°C, while PAX tubes were left overnight and then stored at - 80°C until used.
Preparation of cDNA arrays 1435 cDNA clones were randomly picked froma plasmid library constructed from whole blood of 550 healthy individuals (Clontech, Palo Alto, USA). About 20% of the randomly picked clones were redundant. For amplification of inserts, bacterial clones were grown in microtiter plates containing 150 wl LB with 50 pg/ml carbenicillin, and incubated overnight with agitation at 37°C. To lyse the cells, 5 pl of each culture were diluted with 50 pl H,O and incubated for 12 min. at 95°C. Of this mixture, 2 pl were subjected to a PCR reaction using 40 ppmol of 5'- and 3'- sequencing primer in the presence of 1.5 mM MgCl. PCR reactions were performed with the following cycling protocol: 4 min. at 95°C, followed by 25 cycles of 1 min. at 94°C, 1 min. at 60°C and 3 min. at 72°C either in a RoboCycler® Temperature Cycler (Stratagene, La Jolla, USA) or
DNA Engine Dyad Peltier Thermal Cycler (MJ Research Inc., Waltham, USA).
The amplified products were denatured with NaOH (0.2 M, final concentration) for 30 min and spotted onto Hybond-N" membranes (Amersham Pharmacia
Biotech, Little Chalfont, UK), using a MicroGrid II workstation according to the manufacturer’s instructions (BioRobotics Ltd, Cambridge England). The immobilized cDNAs were fixed using aUV cross-linker (Hoefer Scientific
Instruments, San Francisco, USA).
In addition to the 1435 cDNAs, the printed arrays also contained controls for assessing background level, consistency and sensitivity of the assay. These were spotted at multiple positions and included controls such as PCR mix (without any insert), controls of SpotReport™ 10 array validation system
(Stratagene, La Jolla, USA) and cDNAs corresponding to constitutively expressed genes such as B-actin, y-actin, GAPDH, HOD and cyclophilin.
RNA extraction, probe synthesis and hybridization
Blood collected in EDTA tubes was thawed at 37°C and transferred to PAX tubes, and total RNA was purified according to the supplier’s instructions (PreAnalytiX, Hombrechtikon, Switzerland). From blood collected directly in
PAX tubes total RNA was extracted in the tubes as above without any transfer to new tubes. Contaminating DNA was removed from the isolated RNA by
DNAase I treatment using DNA-free kit (Ambion, Inc. Austin, USA). RNA quality was determined visually by inspecting the integrity of 28S and 18S ribosomal bands following agarose gel electrophoresis. Only samples from which good quality RNA was extracted were used in this study. In our experience, blood collected in EDTA tubes often resulted in poor quality RNA, while that collected in PAX tubes almost always gave good quality RNA. The concentration and purity of extracted RNA was determined by measuring the absorbance at 260 nm and 280 nm. From the total RNA, mRNA was isolated using Dynabeads according to the supplier’s instructions (Dynal AS , Oslo,
Norway).
Labelling and hybridization experiments were performed in 16 batches.
The number of samples assayed in each batch varied from six to nine. To minimize the noise due to batch-to-batch variation in printing, only the arrays manufactured during the same print run were used in each batch. When samples were assayed more than once (replicates), aliquots from the same mRNA pool were used for probe synthesis. For probe synthesis, aliquots of mRNA corresponding to 4-5 pg of total RNA were mixed together with oligodTzsnv (0.5 pg/pl) and mRNA spikes of SpotReport™ 10 array validation system (10 pe; Spike 2, 1 pg), heated to 70°C, and then chilled on ice. Probes were prepared in 35 pl reaction mixes by reverse transcription in the presence of 50 pCi [oP] 320 dATP, 3.5 uM dATP, 0.6 mM each of dCTP, dTTP, dGTP, 200 units of
SuperScript reverse transcriptase (Invitrogen, LifeTechnologies) and 0.1 M DTT labelling for 1.5 br at 42°C. Following synthesis, the enzyme was deactivated for
10 min. at 70°C and mRNA removed by incubating the reaction mix for 20 min. at 37°C in 4 units of Ribo H (Promega, Madison USA). Unincorporated nucleotides were removed using ProbeQuant G 50 Columns (Amersham
Biosciences, Piscataway, USA).
The membranes were equilibrated in 4 x SSC for 2 hr at room temperature and prehybridized overnight at 65°C in 10 ml prehybridization solution (4 x SSC,0.1 M NaH;PO4 1 mM EDTA, 8% dextran sulphate, 10 x
Denhardt's solution, 1% SDS). Freshly prepared probes were added to 5 ml of the same prehybridization solution, and hybridization continued overnight at 65°C. The membranes were washed at 65°C with increasing stringency (2 x 30 min. each in 2 x SSC, 0.1% SDS; 1x SSC, 0.1% SDS; 0.1 x SSC, 0.1% SDS).
Quantification of hybridization signals
The hybridized membranes were exposed to Phosphoscreen (super resolution) for two days and an image file generated using Phospholmager (Cyclone,
Packard, Meriden, USA). The identification and quantification of the hybridization signals, as well as subtraction of local background values was performed using Phoretix software (Non Linear Dynamics, UK). For background subtraction, the median of the line of pixels around each spot outline was subtracted from the intensity of the signals assessed in each spot.
Data analysis
From the 1435 background-subtracted expression data, signals of 67 genes were removed from each membrane to exclude genes expressed with a high degree of variance. These included removal of 1.25% of the lowest and highest signals from each membrane. For normalization, the value of each spot was first divided by the mean of signals in each array followed by a cube root transformation of all the spots. The normalized data was then batch adjusted using a one-way analysis of variance (ANOVA).
The pre-processed data was then used to isolate the informative probes by:
a) buildinga crossvalidated PLSR model, where one unique sample (including all repetitions of the selected sample) was kept out per cross- validation segment. ‘5 b) selecting the set of significant genes for the model in step a) using the
Jackknife criterion. ¢) buildinga crossvalidated PLSR-DA model as in step a) using the gene selected in step b). . d) selecting again the set of most significant genes for the model in step c) using the Jackknife criterion.
Step b) resulted in 125 genes.
Step d) resulted in selection of 35 significant genes. Based on these genes a final classification model was constructed.
The selected informative probes based on occurrence criterion were used to construct a classification model. The result of the classification model based on 35 probes is shown in Figure 2 in which it is seen that the expression pattern of these genes was able to classify most women with breast cancer and women with no breast cancer into distinct groups. In this figure PC1 and PC2 indicate the two principal components statistically derived from the data which best define the systemic variability present in the data. This allows each sample, and the data from each of the informative probes to which the sample's labelled first strand cDNA was bound, to be represented on the classification model as a single point which is a projection of the sample onto the principal components - the score plot.
Figure 3 shows the prediction plot using the 35 significant genes. In the prediction plot shown, the cancer samples appear on the x axis at +1 and the non-cancer samples appear at -1. The y axis represents the predicted class membership. During prediction, if the prediction is correct, cancer samples should fall above zero and non-cancer samples should fall below zero. In each case almost all samples are correctly predicted. For cross-validation 102 experimental samples were divided into 60 cross-validation segments where each segment represented one unique sample and included its replicates if present. :
Correct prediction of most breast cancer cells was achieved. 19 out of 22 cancer patients were correctly predicted as were 34/35 normal patients. Full details of the individuals examined and the accuracy of prediction are shown in Table 1.
Table 2 provides details of the 35 informative genes, the genes in public databases to which they show sequence similarity and their putative biological function. Their sequences follow these examples.
Figure 4 shows the expression level of the 35 genes and it will be seen that some are over-expressed and others under-expressed relative to expression in normal : patients.
Example 2: Identification of further informative probes and use in diagnosis of breast cancer
Methods
The methods of identification and analysis used were essentially as described in
Example 1, except that instead of preparing a cDNA array, samples were analysed using a commercially available platform for large-scale gene expression analysis (Agilent 22K chip).
A larger number of samples comprising 122 in total (78 control and 44 with breast cancer) were analysed. The data was analysed using PLSR as described previously. The genes of interest were selected by a 10-fold cross validation approach. Thus, the data from 122 samples was divided into 10 sets, each set containing 12-13 samples. A calibration model was built on 9 sets leaving out one set. Significant genes were identified by the Jackknife technique on the built-in model. These steps were repeated for all 10 sets, in which each set was kept out at least once. Informative genes were then identified based on the frequency of occurrence criterion. 109 genes were found to informative in all 10 calibration models.
Results
The above described 109 genes and 3 further genes were used to predict the classification of 122 of the samples used. The results are shown in the table below. predicted predicted
The 109 informative genes may be divided into three categories, namely those falling into families (i) and (ii) as described herein and other genes. Table 3 provides details of the informative probes whose corresponding genes fall into families (i) and (ii) and provides the number assigned by Agilent to that probe.
Table 4 similarly provides details of the informative probes whose corresponding genes do not appear to fall within families (i) and (ii). Tables 5 and 6 provide details of the genes to which the probes in Tables 3 and 4, respectively, show sequence similarity, their putative biological function where known and the accession numbers for those genes.
Appendix A
Partial Least Squares regression (PLSR)
Let a multivariate regression model be defined as:
Y=XB+F where .
X a NxP matrix with N predictor variables (genes);
Y (NxJ) being the J predicted variables. In our case Y represents a matrix containing dummy variables;
B is a matrix of regression coefficients; and
F is a NxJ matrix of residuals. i5
The structure of the PLSR model can be written as:
X = TP" + Ba, and
Y= TQ" + Fa, where where
T (Nx4) is a matrix of score vectors which are linear combinations of the x-variables;
P (PxA) is a matrix with the x-loading vectors pa as columns;
Q (JxA) is a matrix with the y-loading vectors Ge as columns;
E, (NxP) is the matrix for X after A factors; and
F, (NJ) is the matrix for Y after A factors.
The criterion in PLSR is to maximize the explained covariance of [X,Y]. This is achieved by the loading weights vector Was, which is the first eigenvector of
E."F.Fa'E, (B. 2nd Fa are the deflated X and Y after a factors or PLS components).
The regression coefficients are given by:
B — wETw)'Qt
A PLSR model with full rank, i.e. maximum number of components, is
S equivalent to the MLR solutions. Further details on PLSR can be found in
Marteus & Naes, 1989, Multivariate Calibration, John Wiley & Sons, Inc., USA and Kowalski & Seasholtz, 1991, supra.
Nucleotide Sequences of 34/35 genes selected by Jacklanife
Clone Ids and their sequences 1-30
CTTTTCCTCCCGCT GTCOCCCACGGAGGGGACTGCICT CCCCOGCTGCATCCIT
TCTGTGAGGTACCTT. ACCCACCT CAGCACCTGAGAGGGTGAAATAGAATTCT AA
CCT CGACATTCGGGAAGTGTITTTTGAGAAGTCT OGGTCGGTAAGGGAAGTCTTC
CAAGTCCGTGCAGCACT AACGTATTGGCACCTGCCTCCICIT CGGCCACCCCCC
AGATGAGGCAGCTGTGACT GTGTCAAGGGAAGCCACGACTCT GACCATAGTCTT
CTCTCAGCTTCCACTGCCGTCT CCACAGGAAACCCAGAAGTTCT GTGAACAAGT
CCATGCTGCCATCAAGGCAT TTATTGCAGTGTACTATTTGCT TCCAAAGGATCA
GGCCCTGAGAACAATGACCT TATTTCCTACAACAGTGTCT GGGTTGCGTGCCAG
CAGATGCCT CAGATACCAAGAGATAACAAAGCT GCAGCTCTTTTGATGCTGACC
AAGAATGTGGATTTTGTGAAGGATGCACATGAAGAAATGGAGCAGGCT GTGGAA
GAATGTGACCCTTACTCTGGCCT CTTGAATGATACTGAGGAGAACAACT CTGAC
AACCACAATCATGAGGATGATGTGTTGGGGTTTCCCAGCAATCAGGACTTGTAT
TGGTCAGAGGACGATCAAGAGCTCATAATCCCATGCCTT GCGCTGGTGAGAGCA
TCCAAAGCCTGCCTGAAGAAAATT CGGATGTT AGTGGCAGAGAATGGGAAGAAG
GATCAGGTGGCACAGCT GGATGACATTIGTGGATATTTCT GATGAAATCAGCCCT
AGTGTGGATGATTTGGCTCT GAGCATATATCCACCT ATGTGTCACCTGACCGTG
CGAATCAATTCTGCGAAACTT GTATCTGTTTTAAAGAAGGCACTT GAAATTACA
AAAGCAAGTCATGTGACCCCTICAG CCAGAAGATAGTTGGATCCCT TTACITATT
AATGCCATTGATCATTGCATGAATAGAATCAAGGAGCT CACTCAGAGTGAACTT
GAATTATGACTTTTCAGGCT CATTTGTACTCTCTTCCCCTCT CATCGTCATGGT
CAGGCTCTGATACCTGCTTTTAAAATGGAGCT AGAATGCTTGCTGGATTGAAAG
GGAGTGCCTATCT ATATTTAGCAAGAGACACT ATTACCAAAGATTGTTGGTTAG
GCCAGATTGACACCT ATTTATAAACCATATGCGTATATTTTTCTGTGCT ATATA
TGAAAAATAATTGCATGATTTCTCATTCCT GAGTCATTTCTCAGAGATTCCTAG
GAAAGCTGCCTTATTCICTT TTTGCAGTAAAGTATGTTGTTTTCATTGTAAAGA
TGTTGATGGTCTCAATAAAATGCTAACTT GCCAGTGAAAAAAAAAAAAAA
111-02
AGGATCTAAGACCAGCCT GGCAGCCACCAGATGGTGATTCTAGTCCT GGCTCAG
TCAGTAATAGGTCACT GACCCCAGAGAAATCAATTCAGCCTCCCCAGGTCCTTG
GATTTCTTTCT GTGAAAATGAAAGCATAGGTAGGAATTTCCCATGGAACAGCTA
GCAGAGGAGAAATATTAAAAGT CAGGAGACTCATGCTATAGTTT TCATACTTCA
TTACAACAATGTTGTTTAGGACAAGTGAGTTAACCT GTTAGCTTCCICTATATA
AAATGGAAAGTCATTAAAAACCT. ACATAGCAGGGTTCT TGTGAAGATCAAGTGA
TAATGTAGGAAGCATGTACAAATGTCACATT CTGCCGTCACGTAATGGTCCT CA
CAGCTTGAGGT AGCATTTAGCATGTGTCATGATTTAGT ACAAGGGTTGGCAAAC
S TGTTGCTCTTGGATTAAGTCTGGCT CATTGCCT GTTTTTCAAAGAAAAAAATTG
TATATGTGTGTATATATGTTATATATAGGTACACACACATATGTGCT ATATATA
GCATATATACACACATAATATAT AAACATGTACATATATAGCATTATATATATA
CGTGTATAATATCTCCAGTCCT CATGACCAGCCATGCTT GTTCATTTACATTTG
CATACTCTATGATTGCTTTCAT GCAACAATGGCAGAGTTGAGTGATTGTTITGC
AACAGAGACTGTATGGCCCACT. AAACCT AAAATATTTAGTCTCTGACCCTGAAA
TGTAAGATTGAT AGCCCAGGACCAGGCGTGGTGGCT CACACTTGTAATCCTAGC
ACT TTGGCAGGCCAAGGAGGGTGGATCACCT GAGGTCAGGAGTTAAAGACCAGC
CTGGCCAACATGGTGAAACCCTGACTCT ACTAAAAATACAGAAATTAGCTGGGC
GTGGTAATGGGTGCCTGCAATCCAAGCT ACTCTGGAGGCTGAGGCAGGAGAATC
ACTT! GAACCCAGGAGGCAGAAGTTACAGTGAGCT GAGATGGTGCCACTGCACTC
CAGCCTGGACGACAGAGTGAGACT CCATCTCAAAAA m-27
CCATTCTCCTGCCTCAGCCTCTCAAGTAGCT GGGACTACAGGCGCCCACAACCA
CGCCOGGCTAATGTTTTTGGTATTTIT CGTAGAGACGGGGTTTCACCTTGTTAG
CCAGGATGGTCTTGATCTCCTGACCTCGTGATCT GCCTGCCTCGGCCTCCCAAA
GTGTTGGGATTACAGGCACATTTTTCACAATTTTTTAACACTT AAGAATGACTT
AACTGAATCATGCCTTTAGAAGAAACTTTCT GTTTAAAAAAAAAAAAAAA
11-60
CTGCOGCOGCCCCCAGCTCCCCOGCCT CGGGGAGGGCACCAGGTCACTGCAGCC
AGAGGGGT CCAGAAGAGAGAGGAGGCACTGCCT CCACTACAGCAACTGCACCCA
CGATGCAGAGCATCAAGT GCGTGGTGGTGGGTGATGGGGCT GTGGGCAAGACGT
GCCTGCT CATCTGCTACACAACTAACGCTTTCCCCAAAGAGTACATCCCCACCG
TGTTCGACAATTACAGCGCGCAGAGCGCAGTTGACGGGCGCACAGTGAACCT GA
ACCT GTGGGACACTGCGGGCCAGGAGGAGTATGACCGCCT CCGTACACTCTCCY
ACCCTCAGACCAACGTTTTCGTCATCTGTTTCT! CCATTGCCAGTCOGCCGTCCT
ATGAGAACGTGCGGCACAAGTGGCAT CCAGAGGTGTGCCACCACTGCCCTGATG
TGCCCATCCTGCT GGTGGGCACCAAGAAGGACCT GAGAGCCCAGCCTGACACCC
TACGGCGCOCTCAAGGAGCAGGGCCAGGCGCCCATCACACCGCAGCAGGGCCAGE
CACTGGCCAAGCAGATCCACGCTGTGCGCTACCT CGAATGCTCAGCCCTGCAAC
AGGATGGTGTCAAGGAAGTGTTCGCCGAGGCT GTCCGGGCTGTGCTCAACCCCA
CG CCGATCAAGOGTGGGCGGTCCT GCATCCTCTTGT GACCCTGGCACTTGGCTT
GGAGGCTGCCCCIGCCCT CCCCCCACCAGTTGTGCCTTG GTGCCTTGTCCGCCT
CAGCTGTGCCTTAAGGACT AATTCTGGCACCCCTTT CCAGGGGGTTCCCTGAAT
GCCTTTTTCTICTGAGTGCCT TTTTCTCCTTAAGGAGGCCT GCAGAGAAAGGGGC
TTTGGGCTCIGCCCCCCTCTGCTT GGGAACACTGGGTATTCT CATGAGCTCATC
CAAGCCAAGGTTGGACCCCT CCCCAAGAGGCCAACCCAGTG CCCCCTCCCATTT
TCCGTACTGACCAGTTCATCCAGCTT TCCACACAGTTGTTGCTGCCT. ATTGTGG
TGCCGCCTCAGGTTAGGGGCICT CAGCCATCTCTAACCTCTGCCCT CGCTGCTC
TTGGAATTGCGCCCCCAAGATGCTCT CTCCCTTCT CCAATGAGGGAGCCACAGA
ATCCTGAGAAGGTGAATGTGCCCT AACCTGCTCCTICT GTGCCTAGGCCTTACGC
ATTTGCTGACTGACT CAGCCCCCATGCTTCTGGGGACCTT TCCTACCCCCATCA
GCATCAATAAAACCTCCTGTCT CCAGTGA
IvV-26
CAGCCCTCCGTCACCT CTTCACCGCACCCTCGGACT GCCOCCAAGGCCCCCGCOG
CCGCT CCAGOGCCGOGCAGCCACCGCCGCCGCCGCCGCCT CTCCTTAGTCOGCCG
CCATGACGACCGCGTCCACCT CGCAGGTGCGCCAGAACTACCACCAGGACT CAG
: AGGCCGCCATCAACOGCCAGATCAACCTGGAGCT CTACGCCTCCTACGTTTACC
TGTCCATGTCTTACTACT TTGACCGCGATGATGTGGCTTTGAAGAACTT TGCCA
AATACTTTCITCACCAATCT CATGAGGAGAGGGAACATGCT! GAGAAACTGATGA
AGCTGCAGAACCAACGAGGTGGCCGAATCTT CCTTCAGGATATCAAGAAACCAG
ACTGTGATGACTGGGAGAGCGGGCTGAATGCAATGGAGTGTGCATTACATTIGG
AAAAAAATGTGAATCAGTCACTACT GGAACTGCACAAACTGGCCACTGACAAAA
ATGACCCCCATTTGTGTGACTT CATTGAGACACATTACCT GAATGAGCAGGTGA
AAGCCATCAAAGAATTGGGTGACCACGTGACCAACT TGCGCAAGATGGGAGCGC
COGAATCTGGCTTGGCGGAATATCTCT TTGACAAGCACACCCTGGGAGACAGTG
ATAATGAAAGCTAAGCCTCGGGCT AATTTCCOCATAGCCOGTGGGGTGACTTCCC
TGGTCACCAAGGCAGTGCATGCAT GTTGGGGTTTCCTTTACCTTITCTATAAGT
TGTACCAAAACATCCACTTAAGTTCT TTGATTTGTACCATTCCTTCAAATAAAG
AAATTTGGTACCCAAAAAAAA v-41
GCCATTTCTAAGACCTACAGCT ACCTGACCCCOGACCTCTGGAAGGAGACT GTA
TTCACCAAGTCTCCCTAT CAGGAGTTCACTGACCACCT CGTCAAGACCCACACC
AGAGTCTCCGTGCAGCGGACTCAGGCT CCAGCTGTGGCTACAACATAGGGTTTT
TATACAAGAAAAATAAAGTG AATTAAGCGTGAAAA v-s1
ATTICT GTGGATACAGTGCCCACOGCCCTCCT CCACTTGGAAACGGTATCCTCC
CTGCCCATCCGTCT GTCTGTCGCCCTTCTCCCGGCCCT CACTAAGCCCCGGCAC
TTCTAGTGGTCTCACCT GGAGGCAAGAGGGAGGGGACAGAGG CCCTGCCACGTC
CCGCTGCCTCCTGCTCTCT! GGAGGTACT GAGACAGGGTGCT GATGGGAAGGAGG
GGAGCCTTTGGGGGGCCACCCGGGGCCTGGACCT ATGCAGGGAGGCCACGTCCC
ACCCCACCTCT TGTTTCTGGGTCCCTGCTCCCCT TTGGGGGTGTGTGTGTGTGT
TTTAATTTTCT TTATGGAAAAATTGACAAAAAAAAATAGAGAGAGAGGTATTTA
ACTGCAATAAACTGGCCCCATGTGGCCCCCGCCTTGTCAAAAAAAAAA
V-09
TGGATTCCCGTCGTAACTT AAAGGGAAACTT TCACAATGTCCGGAGCCCTTGAT
GTCCTGCAAATGAAGGAGGAGGATGTCCT TAAGTTCCTTGCAGCAGGAACCCAC
TTAGGTGGCACCAATCTTGACTT CCAGATGGAACAGTACATCTAT AAAAGGAAA
AGTGATGGCATCTATAT CATAAATCTCAAGAGGACCTGGGAGAAGCT TCTGCTG
GCAGCTCGTGCAATTGTTGCCATIGAAAACCCTGCTGATGTCAGTGTTATATCC
TCCAGGAATACTGGCCAGAGGGCTGTGCT GAAGTTTGCTGCTGCCACTGGAGCC
ACTCCAATTGCTGGCCGCTTCACTCCT GGAACCTTCACTAACCAGATCCAGGCA
GCCTTCCGGGAGCCACGGCTTCTTGTGGTTACT GACCCCAGGGCTGACCACCAG
CCTCTCACGGAGGCATCTTATGTTAACCTACCT ACCATTGOGCTGTGTAACACA
GATTCTCCTCTGOGCTATGTGGACATT GCCATCCCATGCAACAACAAGGGAGCT
CACTCAGTGGGTTTAATGTGGTGGATGCTGGCT CGGGAAGTTCTGCGCATGCGT
GGCACCATTTCCCGTGAACACCCATGGGAGGTCATGCCT GATCTGTACTTCTAC
AGAGATCCTGAAGAGAT TGAAAAAGAAGAGCAGGCTGCIGCT GAGAAGGCAGTG
ACCAAGGAGGAATTTCAGGGTGAATGGACTGCT CCCGCTCCTGAGTTCACTGCT
ACTCAGCCTGAGGTTGCAGACT GGTCTGAAGGTGTACAGGTGCCCTCTGTGCCT
ATTCAGCAATTCCCTACTGAAGACTGGAGCGCT CAGCCTGCCACGGAAGACTGG
TCTGCAGCTCCCACTGCT CAGGCCACTGAATGGGTAGGAGCAACCACT GACTGG
TCTTAAGCTGTTCTTGCATAGGCTCTTAAGCAGCAT GGAAAAATGGTTGATGGA
AAATAAACATCAGTTTCT
20
V-38
GTTTAAATTTGACAAACTAAAGCTAATTACT GCTATAAGAGTAATAACTGCTCA
TTTTCCATAACT CATTCTTAAAGTTTTAGTAATGTAAAAGTTATTTTITIGCAG
TAAGTTATAATGATAGAAGCTTACATGT TTTTTCATGCCTCATCTGTTTCCCCT
TAAAACTATAATTATCAGTAAAGTCCT GTGGTATTTTTCAATTTGTAAGAAACT
AGGCTATATATACATTGGGAAAAACAGCCTT CATTTGTCAATGCACTAGTGTTC
CAAAGGTTTCTGGTAATTGTGTGCTATTGCTTT TTGTTGACTTGCAAAAAAAAA
AAAAAAAAAATTACTATGACT TGTGGTAGCCCTGCAACCTT CGGAAGTGCTTAG
CCCAGTCT GACCATACATTTATATTTAGAATGCTT AGGTAAATAAATAATATGC
CTAAACCCAATGCT ATAAGATACTATATAATATCT CATAATTTTAAAAATCACT
GTTTTGTATAAT AATAAAACAAGGCAGGCAAGCT GTTCTACAATGACTGTTGGT
AAGGGTGCTGAGGAAGAAAAACAAACAATCTT GATTCAGGGATAGTGAATAGAC
AAAAAATGTCCT AATCAATGAAGCTGTGTGATGATT CTGATTGACAGAGAGTGC
TGCCACAAGATTCTT AGGCTACACTCAAATCAGCAGAAAAAGTGCT ACAATAAA
TTAGAAGTGACT ATTACAGGTGCAGATGAGGGTTGGTAGTACCT GTTTGCCATT
TCTCTTCTAATCTTATATTTICT GACCCTCCT ACTGTAAGTOGOGCGGAGGOGG
AGGCTTGGGTGOGTTCAAGAT TCAACT TCACCOGTAACCCACCGCCATGGCCGA
GGAAGGCATTGCTGCT GGAGGTGTAATGGACGTTAATACTGCT TTACAAGAGGT
TCT GAAGACTGCCCTCATCCACGATGGCCT AGCACGTGGAATTCGCGAAGCTGC
CAAAGCCTT AGACAAGCGCCAAGCCCATCTTT GTGTGCTTGCATCCAACTGTGA
TGAGCCT ATGTATGTCAAGTTGGTGGAGGCCCTT TGTGCTGAACACCAAATCAA
CCTAATT AAGGTTGATGACAACAAGAAACTAGGAGAATGGGTAGGCCTTT GTAA
AATTGACAGAGAGGGGAAACCCCGTAAAGTGGTTGGTTGCAGTTGTGTAGTAGT
TAAGGACTATGGCAAGGAGTCT CAGGCCAAGGATGTCATTGAAGAGTATTTCAA
ATGCAAGAAATGAAGAAATAAATCTTT GGCTCACAAA
VI-44
GAGAATGGCITGAACCCAGTAGGCAGAGGTTGTAGTGAGCCGAGATTGGGCCAC
TGCACTTTAGCCT GGGTGACAGAGTGAGACTCTIGTCT CAAAAAAAAAAAAAAAA
AATTTAAATAAAATAAAAAACCTTTACTT ATTTTTAAATTGGGTTGTCTITTTTG
GTATTGAGTTGTTAAAGTTCTTTATATATTTTAGGTACAAATCCCTT ATGAGAT
ACGTGATTTGAAAATATTTTCTCCCATTCT GTGGGTTGCTTTTTCACTTTCTTG
GTTGTATCCT TTGAAGCACAGAAGTTTTAAATTTTGATGAAGTCCAGTITATTT
ATTTTTTTGCT GTTGTTTCTGCTCATACTTTTGAGGTCATGTCTGAGAAACCAT
TGTCAAATCCAAGGTOGTGATGACTT ACCCCTGTGTTTTCTTCTAAGAGTTTTA
AAGGCATCTGAAGCTTAATGTGCACT. AGATGGATTCTAAATATCATCTCATCCA
AAACCTGCTATATATACTACCTTCCTCATCTCAGTT GAAGGCAAGTCCATTGTT
, TCAATTGCCTGGGCAAAAAATATTCT AAATAATTCATAATTTTTCCTCAACTCC
ACATCTATTGGTAAATCCTGTGGGTTCTCCT TTTAAAACATATCCAAAATAGAA
TCATTTCTCACT ATCATTCCACTGCAGGCACCAAGTCT CAATAGTCTCCTAGCA
GATAATCATGTCT ACATTTATTCTCAATGTAGCAGCTAGAGAGCT TITTTG
VI1-49
GCGGTCGTAAGGGCTGAG GATTTTTGGTCOGCACGCTCCTGCTCCTGACT CACC
GCTGTTCGCTCTCGCOGAGGAACAAGTCGGTCAGGAAGCCOGOGOGCAACAGCC
ATGGCITTTAAGGATACCGGAAAAACACCCGTGGAGCCGGAGGTGGCAATTCAC
CG AATTCGAATCACCCTAACAAGCCGCAACGTAAAATCCTTGGAAAAGGTGTGT
GCTGACTTGATAAGAGGCGCAAAAGAAAAGAATCTCAAAGTGAAAGGACCAGTT
0G AATGOCTACCAAGACTTTGAGAATCACTACAAGAAAAACTCCTTGTGGTGAA
GGTTCTAAGACGTGGGATCGTTTCCAGATGAGAATTCACAAGCGACTCATTGAC
TTGCACAGTCCITCIGAGATTGTTAAGCAGATTACTTCCATCAGTATTGAGCCA
GGAGTTGAGGTGGAAGTCACCATTGCAGATGCTTAAGTCAACTATTITAATAAA
TTGATGACCAGTTGTTAAAAAAAAAAAAAAA
VI1-52
GAAAAGGGNTNGCNCOCAANGGGCAGAGGTTGGGCTGATGCCGATATTGGGCCN
CTGCNCTNCANACCTGGGTGACATGAATGAAACTCTGTCTCACATAAAAACCCA
AAAAANCTAAATGAAATAAAAGACCTTTGCTTATTNCTAANTTGGGTACGC
VII-15
CCCATCOCCTCGACCGCTCGOGTOGCATTTGGCOGCCTCCCTACCGCTCCAAGE
CCAGCCCTCAGCCATGGCATGCCCOCTGGATCAGGCCATTGGCCTCCTCRTGGE
CATCTTCCACAAGTACTCCGGCAGGGAGGGTGACAAGCACACCCTGAGCAAGAA
GGAGCTGAAGGAGCTGATCCAGAAGGAGCTCACCATTGGCTOGAAGCTGCAGGA
TGCTGAAATTGCAAGGCTGATGGAAGACTTGGACOGGAACAAGGACCAGGAGGT
GAACTTCCAGGAGTATGTCACCTTCCTGGGGGCCTTGGCTTTGAT
VII-32
AATTAGAGAGGTGAGGATCTGGTATTTCCIGGACTAAATTCCCCTTGGGGAAGA
CGAAGGGATGCTGCAGTTCCAAAAGAGAAGGACTCTTCCAGAGTCATCTACCTG
AGTCCCAAAGCTCCCTGTCCTGAAAGCCACAGACAATATGGTCCCAAATGACTG
ACTGCACCTTCTGTGCCTCAGCCGTTYTTGACATCAAGAATCTTCTGTTCCACA
TCCACACAGCCAATACAATTAGTCAAACCACTGTTATTAACAGATGTAGCAACA
TGAGAAACGCTTATGTTACAGGTTACATGAGAGCAATCATGTAAGTCTATATGA
CTTCAGAAATGTTAAAATAGACTAACCTCTAACAACAAATTAAAAGTGATTGTT
TCAAGGTGATGCAATTATTGATGACCTATTTTATT TTTCTATAATGATCATATA
TACCTTTGTAATAAAACATTATAACCAAAAAAAAAAAAAAAAAAAAAAAAASAA
AAAA
VI-48
CTTAAGTATGCCCT GACAGGAGATGAAGTAAAGAAGATTTGCATGCAGCGGTTC
ATTAAAATCOGATGGCAAGGTCCGAACT GATATAACCTACCCTGCTGGATTCATG
GATGTCATCAGCATTGACAAGACGGGAGAGAATTTCCGTCT GATCTATGACACC
AAGGGTOGCTTTGCT GTACATCGTATTACACCT! GAGGAGGCCAAGTACAAGTTG
TGCAAAGTGAGAAAGATCTTT GTGGGCACAAAAGGAATCCCT CATCTGGTGACT
CATGATGCCCGCACCATCCGCT. ACOCCGATCCCCTCATCAAGGTGAATGATACC
ATTCAGATTGATTTAGAGACTGGCAAGATTACT GATTTCATCAAGTTCGACACT
GGTAACCTGTGTATGGTGACT GGAGGTGCTAACCT AGGAAGAATTGGTGTGATC
ACCAACAGAGAGAGGCACCCTGGATCTTTT GACGTGGTTCACGTGAAAGATGCC
AATGGCAACAGCTTTGCCACT CGACTTTCCAACATTTTTGTTATIGGCAAGGGC
AACAAACCATGGATTTCTCT TCCCOGAGGAAAGGGTATCCGCCT CACCATTGCT
GAAGAGAGAGACAAAAGACT GGOGGCCAAACAGAGCAGTGGGTGAAATGGGTCC
CTGGGTGACATGTCAGATCTTTGT ACGTAATTAAAAATATTGTGGCAGGATTAA
TAGCC
VII-76
AGACACACGAGCATATTTCACCTCOGCT ACCATAATCATOGCTATCCCCACCGG
CGTCAAAGTATTTAGCTGACT CGCCACACTCCACGGAAGCAATATGAAAT GATC
TGCTGCAGTGCTCT GAGCCCTAGGATTCATCTT TCT TTTCACCGTAGGTGGCCT
GACTGGCATTGTATTAGCAAACT CATCACTAG ACATCGTACTACACGACACGTA
CTACGTTGTAGCCCACTTCCACT ATGTCCTATCAATAGGAGCTGTATTTGCCAT
CATAGGAGGCT TCATTCACTGATTTCCCCTATTCTCAGGCT ACACCCTAGACCA
AACCTACGCCAAAATCCATTTCACT ATCATATTCATCGGCOGTAAATCTAACTTT
CTTCCCACAACACTTTCTCGGCCT ATCCGGAATGCCCOGACGTTACTCGGACT A
CCCCGATGCATACACCACATGAAACATCCT ATCATCTGTAGGCT CATTCATTTC
TCTAACAGCAGTAATATTAATAATTTTCAT GAT TTGAGAAGCCTTCGCTT CGAA
GCGAAAAGTCCT AATAGTAGAAGAACCCT CCATAAACCTGGAGTGACTATATGG
ATGCCCCCCACCCT ACCACACATTCGAAGAACCCGTATACAT
© IX-24
AGAGTG CAAGACGATGACTTGCAAAATGTCGCAGCT GGAACGCAACATAGAGAC
CATCATCAACACCT TCCACCAATACTCTGTGAAGCT GGGGCACCCAGACACCCT
GAACCAGGGGGAATTCAAAGAGCT GGTGCGAAAAGATCTGCAAAATTTTCT CAA
GAAGGAGAAT AAGAATGAAAAGGTCAT AGAACACATCATGGAGGACCTGGACAC
AAATGCAGACAAGCAGCTGAGCTT OGAGGAGTTCATCATGCTGATGGCGAGGCT
AACCTGGGCCT COCACGAGAAGATGCACGAGGGTGACGAGGGCCCTGGCCACCA
CCATAAGCCAGGCCT CGGGGAGGGCACCCCCT. AAGACCACAGTGGCCAAGATCA
CAGTGGCCACGGCCACGGCCACAGTCATGETGGCCACGGCCACAGOCACT AATC
AGGAGGCCAGGCCACCCTGCCT CT ACCCAACCAGGGCCCCGGGGCCT GTTATGT
CAAACTGTCTTGGCT GTGGGGCTAGGGGCTGGGGCCAAATAAAGTCTCTT CCTC
CAAAAAAAA
IX-39
CTTGGCTCCTGTGGAGGCCT GCTGGGAACGGGACT TCTAAAAGGAACTATGTCT
GGAAGGCTGTGGTCCAAGG CCATTTTTGCTGGCT ATAAGCGGGGTCTCCGGAAC
CAAAGGGAGCACACAGCTCTT CTTAAAATTGAAGGTGTTTACGCCCGAGATGAA
ACAGAATTCT ATTTGGGCAAGAGATGCGCTT ATGTATATAAAGCAAAGAACAAC
ACAGTCACTCCT! GGCGGCAAACCAAACAAAACCAGAGTCATCTGGGGAAAAGTA
ACT (GGGCCCATGGAAACAGTGGCATGGTTCGTGCCAAATTCCGAAGCAATCT T
CCTGCT AAGGCCATTGGACACAGAATCCGAGTGATGCTGTACCCCT CAAGGATT
TAAACT AACGAAAAATCAATAAATAAATGTGGATTTGTGCT CTTGTA
1X46
ACGOGAGATGGCAGTGCAAAT ATCCAAGAAGAGGAAGTTTGTCGCT GATGGCAT
CIT CAAAGCTGAACTGAATGAGTTTCTTACT OGGGAGCTGGCTGAAGATGGCTA
CTCTGGAGTTGAGGTGOGAGTTACACCAACCAGGACAGAAATCATTATCT TAGC
CACCAGAACACAGAATGTTCT TGGTGAGAAGGGCCGGOGGATTOGGGAACTGAC
TGCIGTAGTTCAGAAGAGGTTTGGCTTTCCAGAGGGCAGTGTAGAGCTTE ATGC
TGAAAAGGTGGCCACTAGAGGTCT GTGTGOCATTGCCCAGGCAGAGTCTCTGCG
TTACAAACICCTAGGAGGGCTTGCT! GTGCGGAGGGCCTGCTATGGTGTGCTGCG
GTTCATCATGGAGAGTGGGGCCAAAGGCTGCGAGGTTIGTGGTGTCT GGGAAACT
CCGAGGACAGAGGGCT AAATCCATGAAGTTTGTGGATGGCCTGATGATCCACAG
CGGAGACCCTGTTAACT ACTACGTTGACACTGCT GTGCGCCACGTGTTGCTCAG
ACAGGGTGTGCTGGGCAT CAAGGTGAAGATCATGCTGCCCT GGGACCCAACTGG
TAAGATTGGCCCTAAGAAGCCCCTGCCT GACCACGTGAGCATTGTGGAACCCAA
AGATGAGATACTGCCCACCACCCCCATCT CAGAACAGAAGGGTGGGAAGCCAGA
© GCOGCCTGCCATGCCCCAGCCAGT CCCCACAGCATAACAGGGTCTCCTTGGCAG
CTGTATTCTGGAGTCTGGATGTTGCTCT CTAAAGACCTTTAATAAAATTTTGT
X-50
GTCCATCCTGCAGGCCACAAGCTCT GGATGAGGAACTTGAGGCAAGTCACCAGC
CCCTGATCATTT OGCCTAAAAGAGCAAGGACTAGAGTTCCT GACCTCCAGGCCA
GTCCCTGATCCCTGACCT. AATGTTATCGCGGAATGATGATATATGTATCT ACGG
CGGCCTGGGGCTGGGOGGACTCCIGCTTCTGGCAGTGGTCCTTCTGTCOGCCTG
CCTGTGTTGGCTGCATCGAAGAGTAAAGAGGCTGGAGAGGAGCTGOGCCCAGEE
CTCCICAGAGCAGGAACTCCACTATGCATCTCTGCAGAGGCTGCCAGTGCOCAS
CAGTGAGGGACCIGACCTCAGGGGCAGAGACAAGAGAGGCACCAAGGAGGATCC
AAGAGCTGACTATGCCTGCATTGCTGAGAACAAACCCACCTGAGCACCCCAGAC
ACCTTCCTCAACCCAGGCGGGTGGACAGGGTCCOCCTGTGGTCCAGOCAGTAAL
AAGCATGGTCCCCCCACTICTGTGTCTCAGTCCTCTCAGTCCATCICGAGCCTC
OGTT CAAAATGATCATCATCAAAACTTATGTGGCTTITTGACCTTTGAATAGGG
AATTTITTAAATITITTAAAAATTAAAATAAAAAAAACACATGGCTCACCCTTC
CACCCAAAAAAAAAA
X-77 (CTCCOGGGCTCTTAAGCCCCTCTCTTTCTCTAACAGAAAAAGOGGATGGTGGT
TCCTGCTGCCCTCAAGGTCGTGOGTCTGAAGCCTACAAGAAAGTTTGCCTATCT
GGGGCGCCTGGCTCACGAGGTTGGCTGGAAGTACCAGGCAGTGACAGCCACCLT
GGAGGAGAAGAGGAAAGAGAAAGCCAAGATCCACTACCGGAAGAAGAAACAGCT
CATGAGGCTACGGAAACAGGCOGAGAAGAACGTGGAGAAGAAAATTGACAAATA
CACAGAGGTCCTCAAGACCCACGGACTCCTGGTCTGAGCCCAATAAAGACTGTT
AATTCCTCATGOGTTGCCTGCCCTTCCTCCATTGTTGCCCTGGAATGTACGGGA
CCCAGGGGCAGCAGCAGTCCAGGTGCCACAGGCAGCCCTEGGACATAGGAAGCT
GGGAGCAAGGAAAGGGTCTTAGTCACTGOCTCCCGAAGTTGCTTGAAAGCACTC
GGAGAATTGTGCAGGTGTCATTTATCTATGACCAATAGGAAGAGCAACCAGTTA
CTATGAGTGAAAGGGAGCCAGAAGACTGATTGGAGGGCCCTATCTTGTGAGTGG
GGCATCTGTTGGACTTICCACCTGGTCATATACTCTGCAGCTGTTAGAATGTGC
AAGCACTTGGGGACAGCATGAGCTTGCTGTTGTACACAGGGTATT
X1-13 .
CTGCCAACATGGTGTTCAGGCGCTTCGTGGAGGTIGGCCGGGTGGCCTATGTCT
CCTTTGGACCTCATGCOGGAAAATTGGTCGCGATTGTAGATGTTATIGATCAGA
ACAGGGCTTTGGTCGATGGACCTTGCACTCAAGTGAGGAGACAGGCCATGCCTT
© CAAGTGCATGCAGCTCACTGATTTCATCCTCAAGTTTCCGCACAGTGCCCACC
AGAAGTATGTCCGACAAGCCTGGCAGAAGGCAGACATCAATACAAAATGGGCAG
CCACACGATGGGCCAAGAAGATTGAAGCCAGAGAAAGGAAAGCCAAGATGACAG
ATTTIGATCGTTTTAAAGTTATGAAGGCAAAGAAAATGAGGAACAGAATAATCA
AGAATGAAGTTAAGAAGCTTCAAAAGGCAGCTCTCCTGAAAGCTTCTCCCAAAA
AAGCACCTGGTACTAAGGGTACTGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTG
CTGCTGCTGCTGCTGCTAAAGTTCCAGCAAAAAAGATCACCGCOGCGAGTAAAA
AGGCTCCAGCCCAGAAGGTTCCTGCCCAGAAAGCCACAGG CCAGAAAGCAGCGC
CTGOTCCAAAAGCTCAGAAGGGTCAAAAAGCTCCAGCCCAGAAAGCACCTGETC
CAAAGGCATCTGGCAAGAAAGCATAAGTGGCAATCATAAAAAGT AATARAGGTT
CTTTTTGACCTGTTAAAAAA
X1-49
GATCAACCTGGAGCT CTACGCCTCCT ACGTTTACCT GTCCATGTCTTACTACTT
TGACCGOGATGATGTGGCTTIGAAGAACTTTGCCAAATACTTTCTT CACCAATC
+ CATGAGGAGAGGGAACATGCTGAGAAACTGATGAAGCTGCAGAACCAACGAGG
TGGOCGAATCTTCCTTCAGGATATCAAGAAACCAGACTGTGATGACTGGGAGAG
CGGGCTGAATGCAATGGAGTGTGCATTACATTTGGAAAAAAATGTGAATCAGTC
ACT ACTGGAACTGCACAAACT GGCCACTGACAAAAATGACCCCCATTTGTGTGA
CTTCATTGAGACACATTACCTGAATGAGCAGGTGAAAGCCATCAAAGAATTGGG
TG ACCACGTGACCAACTTGCGCAAGATGGGAGCGCCCGAATCTGOCTTGGCGGA
ATATCTCTTTGACAAGCACACCCT GGGAGACAGTGATAATGAAAGCT AAGCCTC
GGGCTAATTTCCCCATAGCCGTGGGGTGACT TCCCT GGTCACCAAGGCAGTGCA
TGCATGTTGGGGTTTCCITTACCTTTTCTATAAGTTGTACCAAAACATCCACTT
AAGTTCTTTGATTTGTACCATTCCTTCAAATAAAGAAATTTGGTACCC
X1-81
AGAGCAGCAGCCATGGCCCT ACGCTACCCT ATGGCCGTGGGCCT CAACAAGGGC
CACAAAGTGACCAAGAACGTGAGCAAGCCCAGGCACAGOOGACGCCGOGGGOGT
CTGACCAAACACACCAAGTTCGTGCGGGACATGATTCGGGAGGTGTGTGGCTTT
GCCCOGTACGAGCGGOGCGCCATGGAGTTACTGAAGGTCTCCAAGGACAAACGG
GCCCTCAAATTTATCAAGAAAAGGGTGGGGACGCACATCCGOGCCAAGAGGAAG
CGGGAGGAGCTGAGCAACGTACTGGCCGCCATGAGGAAAGCTGCTGCCAAGAAA
GACTGAGCCCCTCCCCTGCOCTCTCCCTGAAATAAA
X11-35
CTCTCCTGTCAACAGCGGCCAGOCTCCCAACTACGAGATGCTCAAGGAGGAGCA
GGAAGTGGCTATGCTGGGGGCGOCCCACAACCCTGCTCOCCCGACGTCCACCST
GATCCACATCCGCAGCGAGACCTCOGTGCCOGACCATGTCGTCTGGTCCCTGTT
CAACACCCICTT CATGAACACCTGCT GCCTGGGCTT CATAGCATTCGCCTACTC
CGTGAAGTCTAGGGACAGGAAGATGGTTGGCGACGTGACCGGGGCCCAGGCCTA
[GCCTCCACCGCCAAGTGCCTGAACATCTGGGCCCTGATTTTGGGCATCTTCAT
GACCATTCT! GCTCGTCATCATCCCAGTGTTGGTCGT CCAGGCCCAGCGATAGAT
CAGGAGGCATCATTGAGGCCAGGAGCTCTGCCCGTGACCTGTATCCCACGTACT
CTATCTTCCATTCCT CGCOCTGCCCCCAGAGGCCAGGAGCT CTGCCCTTGACCT
GTATTCCACTTACT CCACCTTCCATTCCTCGCCCT! GTCCCCACAGCCGAGTCCT
GCATCAGCCCTTTATCCTCACACGCTT TTCTACAATGGCATTCAATAAAGTGTA
TATGTTTCTGGTGCTGCTGTGACIT CAA
X11-77
GTAAGAAAGCCCTT AAATAAAGAAGGTAAGAAACCT AGGACCAAAGCACCCAAG
ATTCAGCGTCTTGTTACT CCACGTGTCCTGCAGCACAAACGGCGGCGTATTGCT
CTGAAGAAGCAGCGTACCAAGAAAAATAAAGAAGAGGCT GCAGAATATGCTAAA
CTTTTGGCCAAGAGAATGAAGGAGGCT AAGGAGAAGCGCCAGGAACAAATTGCG
AAGAGACGCAGACTTTCCTCTCTGOGAGCTT CTACTTCTAAGTCTGAATCCAGT
X1-29
CTCGCTCACGCAGCACTOGTGGCAGTCCCT GAAGGACCGCTACCTCAAGCACCT
GOGGGGCCAGGAGCATAAGTACCTIGCT GGGGGACGCGCCGGTGAGCCCCTCCTC
CCAGAAGCTCAAGCGGAAGGCGGAGGAGGACCOGGAGGCCGOGGATAGCGGGGA
ACCACAGAATAAGAGAACTCCAGATTTGCCT GAAGAAGAGTATGTGAAGGAAGA
AATCCAGGAGAATGAAGAAGCAGTCAAAAAGATGCTTGTGGAAGCCACCCGGGA
GTTTGAGGAGGTTGTGGTGGATGAGAGCCCTCCT GATTTTGAAATACATATAAC
TATGTGTGAT GATGATCCACCCACACCTGAGGAAGACT CAGAAACACAGCCTGA
TGAGGAGGAAGAAGAAGAAGAAGAAAAAGTTTCT CAACCAGAGGTGGGAGCTGC
CATTAAGATCATTCGGCAGTTAATGGAGAAGTTTAACT TGGATCTATCAACAGT
TACACAGGCCITCCY. AAAAAATAGTGGTGAGCTGGAGGCTACTT COGCCITCTT
AGCGTCTGGTCAGAGAGCTGATGGATATCCCATT TGGTCCCGACAAGATGACAT
AG ATTTGCAAAAAGATGATGAGGATACCAGAGAGG CATTGGTCAAAAAATTTGG
TGCTCAGAATGTAGCT CGGAGGATTGAATTTCGAAAGAAATAATTGGCAAGATA
ATGAGAAAAGAAAAAAGTCATGGTAGGTGAGGTGGTTAAAAAAAATTGTGACCA
ATGAACTTTAGAGAGTTCTT! GCATTGGAACTGGCACTTATTTTCTGACCATCGC © TGCTGTTGCTCIGTGAGTCCTAGATT
X1n-84
ATTATCCTCAGTTCCCAAGAGCAATCATACTTT TCCACACATACOGTGTGTCTC
ATGTTAGGTAAATGTATTTTTACAATGAGCACCACTTCT GTGGAAAAAGTTCCC
TGCACGGGGAG GTCCAGCTTCCAGACTGCTCCATCGCATAAGGACT TCCCCATT
CCCCTAAATGCTGCTCTGTCAGAACCT GCCCAGGTAATGGTAATGACCCTAGAG
AGATGATTTCT GAACCGCAATTTTGAGCCCATTAGAAGGTGTGTGGTGGGCATT
TATTTCATCCTGATGCTCT GGTGAGAATCTTTGCAGACGCACT AGATCCAGAAG
CTGTTAATCTTGGTGCATTTATTTTCCT ACCTAAAAGAACCAAGCAGCTCAGAG
GCAGTGACTGTACAGGATGCAGTGTTTATAATAATGCT GAGCTTGCTGGTCIGG
AACCCCACACTT CAGCAATCCCAGCATTGTTCCTGTTTATGAAGTTGACAAAGT
GACCAGGGCAAGGGGGT ATTATCATTAAATACACTCT AGGAGAGGCAGAACACA
TGAGGGCAATGTTTTTCAGAGGTCTTTAGGCCACCGCATCAGATICT CCTGGAG
CATAAAGCAAATGCTT TATGAGTCCAGGGCCCCT GCAGACCTACTGTATACTAG
TATACAGCTCCCTCTTAGTGGATCT CAAGCTTGTTTCCAAAAAGTCATTACACT
COTTACCAAAGCCCATGACACATTCATACAGATTCAT CCAGACATAACCCACTG
CATGGTCCAGTGCATGCTTGTGTGCTT AACTTATTATAGATCAAGTGTTATTTA
AGTCCAACATATTAAACGTGACT GAATATT
XV-49
AAGTCTGCCCAGAAAGCTCAGAAGGCT AAATGAATATTATCCCTAATACCTGCC
ACCCCACTCTT AATCAGTGGTGGAAGAACGGTCT CAGAACTGTTTGTTITCAATT
GGCCATTTAAGTTTAGTAGTAAAAGACTGGTTAATGATAACAATGCATCGTAAA :
ACCT TCAGAAGGAAAGGAGAATGTTTTGTGGACCACTTT GGTTTTCTTTTTTGC
GTGTGGCAGTTTTAAGTTATTAGTT TTTAAAATCAGTACTTTTTAATGGAAACA
ACTT GACCAAAAATTTGTCACAGAATTTTGAGACCCATTAAAAAAGTTAAATGAG
XV-54
AAGAGCAGGTCTCTGGAGGCT! GAGTTGCATGGGGCCTAGTAACACCAAGCCAGT
GAGCCTCTAATGCTACTGCGCCCTGGGGGCT CCCAGGGCCTGGGCAACTTAGCT
GCAACT GGCAAAGGAGAAGGGTAGTTTGAGGTGTGACACCAGTTTGCTCCAGAA
AGTTTAAGGGGTCIGTTTCTCATCT CCATGGACATCTTCAACAGCTTCACCTGA
CAACGACTGTTCCTATGAAGAAGCCACT TGTGTTTTAAGCAGAGGCAACCTCTIC
TCTTCT CCTCTGTITCGTGAAGGCAGGGGACACAGATGGGAGAGATTGAGCCAA
GTCAGCCTTCIGTT GGTTAATATGGTATAATGCATGGCTTT GTGCACAGCCCAG
TGTGGGATTACAGCTTTGGGATGACCGCT TACAAAGTTCIGTTTGGTTAGTATT
} GGCATAGTTTTTCTATATAGCCATAAATGOGTATATATACCCATAGGGCTAGAT
CT GTATCTTAGTGTAGCGATGTATACATATACACATCCACCT ACATGTTGAAGG
GCCTAACCAGCCTTGGGAGTATTGACT GGTCCCTTACCTCTTATGGCTAAGTCT
TTGACTGTGTTCAT TTACCAAGTTGACCCAGTTTGTCTIT TAGGTTAAGTAAGA
CTCGAGAGTAAAGGCAAGGAGGGGGGCCAGCCT CTGAATGCGGCCACGGATGCC
TTGCTGCTGCAACCCTTTCCCCAGCTGTCCACT GAAACGTGAAGTCCTGTTTTG
AATGCCAAACCCACCATTCACTGGTGCTGACT ACATAGAATGGGGTTGAGAGAA
GATCAGTTTGGGCTTCACAGTGTCATTTGAAAACGTTTTTIGTTTTGTTTTGTA
ATTATTGTGGAAAACTTICAAGTGAACAGAAGGATGGTGTCCTACTGTGGATGA
GGGATGAACAAGGGGATGGCTTIGATCCAATOGAGCCTGGGAGGTGTGCCCAGA
AAGCTTGTCTGTAGCGGGTTTTGTGAGAGTGAACACITTCCACT TT TIGACACC
TTATCCTGATGTATGGTTCCAGGATTTGGATTT TGATTTTCCAAATGTAGCTTG
AAATTICAATAAACTTTGCTCTGTTTTTCTAAAAATAAAAAAAAAAAAAAAAAA
AAAAAAAA
XV-75
A GCAGATGACCCTTCGTGGCACCCTCAAGGGCCACAACGGCTGGGTAACCCAGA
TOGCTACTACCCCGCAGTTCCCGGACATGATCCTCTCOGCCTCTCGAGATAAGA
COATCATCATGTGGAAACTGACCAGGGATGAGACCAACTATGGAATTCCACAGC
GTGCTCTGOGGGGTCACTCCCACTTTGTTAGTGATGTGGTTATCTCCTCAGATG
GOCAGTTIGOCCTCTCAGGCTCCTGGGATGGAACCCTGOGCCTCTGGGATCTCA
CAACGGGCACCACCACGAGGCGATTTGTGGGCCATACCAAGGATGTGCTGAGTG
TGGCCTTCTOCTCTGACAACCGGCAGATTGTCTCTGGATCTCGAGATAAAACCA
TCAAGCTATGGAATACCCTGGGTGTGTGCAAATACACTGTCCAGGATGAGAGCC
ACTCAGAGTGGGTGTCITGTGTCOGCTTCTCGCCCAACAGCAGCAACCCTATCA :
TCGTCTCCTGTGGCTGGGACAAGCTGGTCAAGGTATGGAACCTGGCTAACTGCA
AGCTGAAGACCAACCACATIGGOCACACAGGCTATCTGAACACGGTGACTGTCT
CTCCAGATGGATCOCTCTGTGCTTCTGGAGGCAAGGATGGCCAGGCCATGTTAT
GGGATCTCAACGAAGGCAAACACCTTTACACGCTAGATGGTGGGGACATCATCA
ACGCCCIGTGCTTCAGCCCTAACOGCTACTGGCTGTGTGCTGCCACAGGOCCCA
GCATCAAGATCTGGGATTTAGAGGGAAAGATCATTGTAGATGAACTGAAGCAAG
AAGTTATCAGTACCAGCAGCAAGGCAGAACCACCCCAGTGCACCTCCCTGGCCT
GGTCTGCTGATGGCCAGACTCTGTTTGCTGGCTACACGGACAACCTGGTGCGAG
TGTGGCAGGTGACCATTGGCACACGCTAGAAGTTTATGGCAGAGCTTTACAAAT
AAAAAAAAAACTGGCTTTTCTGACAAAAAAAAAA
XV-86
GCAAAATGTCGCAGCTGGAACGCAACATAGAGACCATCATCAACACCTTCCACC
‘ AATACTCTGTGAAGCTGGGGCACCCAGACACCCTGAACCAGGGGGAATTCAAAG
AGCTGGTGOGAAAAGATCTGCAAAATTTTCTCAAGAAGGAGAATAAGAATGAAA
AGGTCATAGAACACATCATGGAGGACCTGGACACAAATGCAGACAAGCAGCTGA
GCTTCGAGGAGTTCATCATGCTGATGGCGAGGCTAACCTGGGCCTCCCACGAGA
AGATGCACGAGGGTGACGAGGGCCCTGGCCACCACCATAAGCCAGGCCTOGGGE
AGGGCACCOCCTAAGACCACAGTGGCCAAGATCACAGTGGCCACGGCCACGGCE
ACAGTCATGGTGGCCACGGCCACAGCCACT AATCAGGAGGCCAGGCCACCCT GCCT
CT ACCCAACCAGGGCCCCGGGGCCTGTTATGTCAAACTGTCTTGGCT GTGGG
GCTAGGGGCT GGGGCCAAATAAAGTCTCTT CCT CCAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAA
XV1-74
CG CCGCCGOGCOGCCGTCGCT CTCCAACGCCAGCGCOGCCICT COGCTCGCCGAG
CT CCAGCCGAAGGAGAAGGGGGGTAAGTAAGGAGGT CTCTGTACCATGGCTCGT
ACAAAGCAGACT! GCCCGCAAATCGACCGGTGGTAAAGCACCCAGGAAGCAACT G
GCTACAAAAGCOGCT CGCAAGAGTGCGCCCT CTACTGGAGGGGTGAAGAAACCT
CATCGTTACAGGCCTGGTACT GTGGCGCT! COGTGAAATTAGACGTTATCAGAAG
TCCACTGAACT TCTGATTCGCAAACTTCCCTTCCAG CGTCTGGTGCGAGAAATT
GCTCAGGACTTTAAAACAGATCT GOGCTTCCAGAGCGCAGCT ATCGGTGCTTITG
CAGGAGGCAAGTGAGGCCTATCT GGTTGGCCT TTTTGAAGACACCAACCTGTGT
GCTAT CCATGCCAAACGTGTAACAATTATGCCAAAAGACATCCAGCT AGCACGC
CGCATACGTGGAGAACGTGCTT. AAGAATCCACT ATGATGGGAAACATTTCATTC
TCAAAAAAAAAAAAAAAAAATTTCTCTT CTTCCTGTTATTGGTAGTTCIGAACG
TTAGATATTTTTTTTCCATGGGGTCAAAAGGTACCT AAGTATATGATTGCGAGT
GGAAAAATAGGGGACAGAAATCAGGTATTGGCAGTTITTCCATITTICATTTGTG
TGTGAATTTTTAATATAAATGCGGAGACGTAAAGCATTAATGCAAGTTAAAATG
TTTCAGTGAACAAGTTTCAGCGGTTCAACTT TATAATAATTATAAATAAACCTG
TTAAATTTTTCT GGACAATGOCAGCATTTGGATTTTTTTAAAACAAGTAAATTT
CTTATTGATGGCAACT AAATGGTGTTTGTAGCATTTTTATCATACAGTAGATTC
CATCCATTCACTATACTTTTCTAACT GAGTTGTCCTACATGCAAGTACATGTTT
TTAATGTTGTCTGTCTTCTGTGCTGTTCCT GTAAGTTTGCTATTAAAATACATT
AAACTATAAAAAAAAAAAAAAAAAAA
XVII-77
CAGACACCCT GAACCAGGGGGAATTCAAAGAGCT GGTGCGAAAAGATCTGCAAA
ATTTTCT CAAGAAGGAGAATAAGAATGAAAAGGTCATAGAACACATCATGGAGG
ACCTGGACACAAATGCAGACAAGCAGCT GAGCTTCGAGGAGTTCATCATGCTGA
TGGCGAGGCTAACCT GGGCCTCCCACGAGAAGATGCACGAGGGTGACGAGGGCC
CTGGCCACCACCATAAGCCAGGCCT CGGGGAGGGCACCCCCTAAGACCACAGTG
GCCAAGATCACAGTGGCCACGGCCACGG CCACAGTCATGGTGGCCACGGCCACA
GCCACTAATCAGGAGGCCAGGCCACCCT GCCTCTACCCAACCAGGGCCCCGGGG
CCTGTTATGTCAAACTGTCTTGGCT GTGGGGCTAGGGGCTGGGGCCAAATAAAG
TCTCTTCCTCCAAAAAAA
XII-78 no sequence available
Table 1. Sample detail. Stage 0, in situ carcinoma; Stage I, invasive carcinoma with tumour size < 20 mm; Stage II, invasive carcinoma with tumour size > 20-50 mm; Stage III, invasive carcinoma with tumour size >50 mm. Stage IV, cancer spread to distant parts. IDC, invasive ductal carcinoma, DCIS, ductal carcinoma in situ; ILC, invasive lobular carcinoma. n.a.,-not available. ND, non-decision. +, Blood samples taken at five consecutive weeks from the same female 2 PY IPO Pp I == EE
ID Age | Suge Histology Grade Size (mny) Nodes presen/comments assayed | prediction
EE eee we : | 8 we am | we [ola [
SEN Er RS I ITTV AN I EE EE
Be Hi fn rs MEL eT we ew oo [2] ND
Sr Era—— EE ETE - 2 | aw e Tso lu] we [2 | 0 - + 9 IE NE ATR NN I I i AY No fr HY EW EP BR + bupre and linfraclavicular, 11 65 Iv nodes -
I + is rir me — Ts [eo | - [a =~
Tw lo | wu | o |." lz | + te Termes ew [0 1s +
STEEP IY ES A ENS NN A : 17 7 IE = NY AE RW EN EE + wml we | a | ue | 0 | >
Te we [we Ie [| +
EE i sis + 2 | 4 Tw la ws | o [| Tt] + te Tw we ow | oe Lb +
Ta we oe oe 1 [] +
Subgroup A2: Women with abnormal first mammography
Other disease if Times rami re [a | pepe | oa [x
ET Er RE Rr EE
3 [| eimmminions | Spettciom initia | 1 + micas | Coon pinion | 1 | 0
Eo IT IE reer IP NS EEE 3 SF rere I RE RS 31 BS Ere A ECU Ra mem | Geetimon | 3 |W eT hem | pepe | 5
OT Somme | | 2 | + 3 A mimi] Cowes | 3 | +
SEI PS TR vee ER EE
SN rT Yeeros ES RT RS
SE I ETT ES RN NEE aT ree swmepeoms | 1 Lt
Te begaesy | pupwes | 2 | + 2 Benign density I NY
ET mew | wren | 1] +
Subgroup A3: Women with no breast abnormality a] oe [
ID Age Comments assayed | Predictiom w || - | 2 | +
Hi-a 45 34 months 3 +
SLE
27 months + os | wed 2 | + wed] 1 | +] wed 1 [+ |]
Weel] 1 | +
Lm 48 months
Tag | 30 | Btpeding | 2 | +
ENE I EET
EE AE Bra
Ea m——— 3 ENE I [54 | 34 | Brestfting | 3 | + s 1-1 [1 + infection in udddition to chronic EBV 56 51 infection 1 + i 71
Table 2 Details of the 35 significant genes selected by Jackknife. Their position in the amay, clone ID is shown as well as the accession number of sequences in public databases that match them, and their known or putative cellular function.
UPREGULATED GENES
Position ID oo] omen | Putative biological function 6A ET I NE
Chromosome 3 clone RP11- 321A23; From sequence no. nr-27 10M AC096970 135183-135446
Signal transduction, Kinase
Ras homolog geas family, member inhibitor? (RboH has been 1-60 14AC NM_001665 te G (rho G) described as a kinase inhibétor)
Iron storage; defence against
IV-26 BC016857 Ferritin, heavy polypeptide 1 ROS
Upstream transcription factor 2, .
USF2 iv-51 102 BC042655 — * Transciptional regulator
Chromosome 11, From sequence
VI-44 14X AC087441 no. 116068-116692 on as wee scien $100 calcium binding protein A6
BC001431 (calcyclin) Defence; inhibition of caseine vis | om kinase II
Human low-affinity IgG Fe
ViI-32 31M M28697 receptor (alpha- Fo-gamma-RII) Immune response
S100 calcium binding protein A9 | Defence; inhibition of caseine
X-24 31) BC047681 (calgranulin B) kinase II _IX:50 72 momar |] Defeace-related
Ferritin, heavy polypeptide 1, Iron storage; defence against
XI-49 3AB BC016857 mRNA ROS
Interferon induced transmembrane
X-35 12Q BC0096%6 protein 2 Immune response
ETE EE ES A RA
From sequence number 75875-
X11-84 16AP A1391903 76710
Delta sleep inducing peptide,
XV-54 24AA BC018148 immunoreactol Immune response?
S100 calcium binding protein A9 | Defence; inhibition of oaseine
XV-86 24AQ BC047681 (calgranulin B) kinase II
Xvi74 | sak HS histone, family 38 (3.38)
S100 calcium binding protein A9 | Defence; inhibition of caseine
KVI-77 20AN BC047681 (calgranulin B kinase II
Table 2 -«~™* } Downregulated genes
Clone | Position | Accession Putative biolog/ cal
ID ID number Gene similarity function
Cyclin D-type binding E2F-mediated i-30 210 | BC00%683 protein transcription
Ribosomal protein
WV-41 2V BC010165 S2 Ribosome production
V-09 | 2G | BC053370 Ribosomal protein SA Ribosome production
Ribosomal protein .
V-38 22S | NM_001018 S12 Ribosome production
Ribosomal protein
VI-49 2AB |NM_001023 $20 (RPS20)
Vikas] 31U_| M22146 [Ribosomal protein S4jRibosome production
Cytochrome C oxidase subunit, COX Mitochondrial electron
Vil-76 | 15AK | AY495316 1 transport chain
Ribosomal protein
IX-39 | 27R BC001037 L35a Ribosome production
Ribosomal protein 1X-46 23V BC034149 S3 Ribosome production
Ribosomal protein 419AM [| BCO000514 L13a Ribosome production
Ribosomal protein
D87735 L14 Ribosome production 60S Ribosomal
X|-81 3AR AF077043 protein L36 Ribosome production
Ribosomal protein
Xil-77| 20AK | BC035447 S6 Ribosome production
Telomeric repeat binding factor 2, Telomere length
Xli-29| 20N BC004465 | interacting protein regulation
Eukaryotic translation elongation factor 1 alpha 1,
XV-49] 4AA BCO018641 (EEF1A) Protein translation
Guanine nucleotide binding protein, beta polypeptide 2-like ;
RACKSs (for 'recepto for activated C-
XV-75] 12AM | BC019093 kinase) Protein translation
Hp
Olo ol |. Bla GI2 < {= < CRIRlo0[s BIS ® 59 SERGIBEE RISD
Qsl=lo Sea
Oo SIs SEREREeREs QEIS
Elo BIR lo l[3 (2 G3 SRERER ology al
SRB IRISIBI3I2 ada 251815 Eh el islicfolnd SBERREbRE 10 |< [© oko
Q oFlolRliB2L ole SEEEBERSS lols y ole SEE S8lBER sib Solel o <|9 Rell nl
EE RRRERE SIBBIBBIEI clBBIBERE IRL —~ Ian Slo 2S = EBL ee 2 Hasse gas
Ny 3elol2i3 lo CloiEiEieE EoBlecgiElR ~ SEE REBERS BIBER aibI2BIEleRielE x3 Slol=l® olFRIBIS RIE Eel dichs lolol sg 288 BERERERE2 QRE<I2IE blo «© a
Riel O ee io) ~ a SiFloaizid 0 wri 2EiElGlE a ~ dc faces An OI=18 2595
SRIRBEIRCES ER O
> SB BERE ERE CeREisIsIeS II1215 — ZRII OO oy i Si a=! Slee eRiERER SE £ a co! 202 to Hg OEI21Q
ERBERRERE A ea i 21313 eles O 2EelBle O= ERiskE ek oO 2656
H FSR <EEbE clelelolis|s SEE
Q Slol3ecERS zee QE pd 1 ol35I8 EI Slo EE Ra o Plea <= 2z8loE SE A500 ERle © Ola2 ol 05 IE SIREOlE0I8 12 FICEBIE [= v CEEBRBEEE aE k= oR SE Qo ad oe Slzlel2iE oF ERE 03 oslo Cole = |® 7 SERS lo POEL 2031210/0lE < 3 18 a Seance x 22 BRERE 210s IS C2 RPE RERS
Q y 30 REERRS << elle <i Ola 0) RRS = bE “ ] |G] << |< oO oliolalS 2 =10|5 oO =| 4
S18 pe ER 2 Reis § lle Sia ae SER RBEGE v8 < 310 < 0 ¢ 19 hn 2 Ig 2 FE de © o az S10lE12l3|% 3518812 2 |B |r lo log 0
Hola SEB RBRERRE © 53 |8SIN
SI EA olfoBlefE Z2iEEEERIER - ERR BcEE Eee
[0] —~ < Ww nee % 1° Ce Rie alee o - Be
Gq een
Ela Ee 0 i QRS RIS «jC 2 o fo. jaa ESE 5S <|< o~ a § SRE RIE RIRRISIRE pO hed ad RO
LE NJ \ || yin | + - cll<c'l << QAI TD .e jd <C [Vo] [ad fo Bsa) ™ |& Sb ind Dl ed Ph
AE] bard BR Br ab pi 3 ls
A Bikol
PCT/GB2005/002180
O} < ol le 0 < g . Bl = _ 0
Sel ERE al EEL EE BREE © 10 5 25 E Se
Ole AlOiZ|O olzS(ol ake:
ERR ESREBERECEBERER 81212 22BBEE LEER SBRBER <\<iald alolsSI2I8I3I8 EI! SlElblelw
Eola SIBlecIRIZ(SIQIe IEOe 2
Solz|k|0l812[CIE |S ZED oo I<
Eloi =l{Sizloolo < alls 2 2
Lok @hiLio ERIRI0IOE =|2 Eis 2 elRRISICERBIBRBICIER GiElgle
BREE EERE RRELEELR 3REI=IS
RoE CBRbRER olFIQI3 0 oI |G O
S5IZBIZIR0 EIS geielelcloRleeRlE 3
QO ERR E2RIBE BIS :
ClooI2IkEs IZ SiEBleBBIec8S
LER RE BREESE Olin lolG Ho 008100 FIol IF 2101618
SRIBEIEIEEIRG SERRE R20
QlaiIleRIZIEIG Elalzlalolsis 0 |< °
Q QlEISIZE O20 |6IdI0 250%
OR oIKBIS io =o ZR 121016 3ClolEIg
QOH IO [SAC] ol [SE o LER
REE ERE 5 as < [< 2601 i<IEIG10(819 Eigie SIRS
SolbEREBRGIRIRIZISIE SPRREGEE 2eEloEZBIEBGL 251ml) SEE <Q lz 612 (@ EXPE Reo REER =O Olin iolQ Fle L586] SERER gE EEESBIRERRES << O <lo|om > S Sis
El2SeBIEIS SioleElzikEiIB 0 skis
QEEIRIKIEIQIR Ci800213/5 12 oO E Bie RRERIO 28
Olio ESRB REE ERERE
SER ERRRS o|Gln|2|E <|0
Ol ElO]O ERIGIRBIBEE 29131210 =o BERERR JElleR8E SRB
Qi=i2io|O SEEIo|=|Z[o pS oO =I]
Seine SBE QERERERISR
GRIBBLE SoBERbEELERBIKD
QO |= re) SES 512188 EES Res od 4 alfa olRBEER2BI0 E800
EBERLE SEED He
EE BIRR IKIES ESR 313 se ERERRES =O SEER 5o olRl5zI8leBla <IH OPS |O
QB FICE 3 = OIF |Q=
Q < [O (@] oO 8 & 3 < o =O
SEBREERER gaged ele
SRIEEBES 2 L200 olEISIP PIER I5IBIE 2B RIEIoE |S 228 = =< |o SERS RISE feat 3281890 SLE REEEIEE 2EEEISISEBE [(e} boll E22 Ue 2 had 8 ~~ 250 B R815 vg |8
BEBieioicBiz Be RBR BNE ERNE 82 ZC HE BERLE IRIE RBBIZE dS RS
SS dd da dd x oo aR |e ojala ja lof Cb Bo i i ot i i pi ole elellelollolleloleo lols! QUIRK ON [NIN [ON 1S,
R[F SR RII RIIRE RGR GIRS 1 cl]
QO Hn
Lal's] [nd (=d ==] [ad ARRARARRNZES
02180
PCT/GB
WO 2005/1185] sd LE —
Ql 15 O 0} =lo i . Se 00 =< LEE Sl<|® <0 (O1TS) OQ
SoBaEeep BREE he n oO oO Q 10] ciZlelels lo Q < lols ISIE ERS : a
O23 = Q < Aly 5) Q oO 2191210 cEREBRE cRREBeh oO of | mg
SERRE blelalcls gio SEREER —_ SIElelE EE 3lol3la Ae a 5) oO EIS BIOBIEE HOD SEBERRER - Sehise oe oO 2 OF |< O o na) 251515 E510 LiL in LER RBE O |=
Oh OS 2ElolF ioc 0] [3
EE < SEBIEEIS 1s) 2ZREG 52 bo] SiR [0](0] SR BIBIBIEIEIS g = gls23 BE BIRD 0) ISIS IS oll Q = SlRielzI2(313 SRIRIE Q QO LloBlERIS 5
RIRI0IEC 21319 seks QE 1 heap 513 gEiclBiols — 28015 =O << o ShEeERie - ZIQ|0 8250 olzle Saad Te ol cise 31813151513 SBIR sl=lolZ(S
BRIG I162 ES SRE: olof58IE |S o. Soiziols8 OlEiE2IE o < E EISRIBIS512 — bd Hea Q olhie3Ie Ll a O10 a £ oP RESIS zizioR Ol <= (®) EBle 00 =] q518 Q om Ilo EE Q O P41 QSL it gaa SPREE EE ! SER=I8 Q il FER & Slo|=|5|e BBE sbiRoRs SIFle Fydic EBS 8 Go| o 32 |Sl5 Ee
SREERR PERe opp shi Ho , 3 = ho) © olSIERg oRIESIRIBIER Oo
SRIREEER Q Plo E 91213 ESO Qe i 5a EERERES SBI8IS E EEE2E 0) 2RBIRICE El® 0l8EhBRBIER SE OSISe 0 Slo reel 2k 2 a Sleek elas 3 © SSIS ols BP n 4s) 0 aq rod iat SIGE 23 Oloals Q Sl<l5|2
SLR IeERE OE EISI2 SERS 21512 o FER ERREkS: SIRI 33380 Soll 0 Sa Pe 23205 ZEB pi OCF (0 01215 SBRBIRRRES rd ® aan SeREenssin 2zle ht 51215 IS Sizim o < oO OQ ol% 5) oO Ole iO [2 3 =i (&] (U}
Q a geiBhogees " 2 135 x Sold 0 ERE Als < lol IRIE his olF 0 <= Q (CHOI < [+ wi 2 2 SlEElaxiel SEEEBER 0 OlF|Q|3i o 2 ERE SRIeB E21 ERE rol REID oO OOF o FREER
A Fil R ERR Ee ° an ee > ee lel=lzlle ] Who |= SINS LUIS 0.10.0, [UKIKIR llc n py iodine lelalzlzls hell g . ee
ERIS 125 3 ] QK]U]K] \ < o . Hie pp SE |r
Yq a, IRIS Il ec'|< retry rd 8 5 @ dk bi lec] <c pod pod bag Rt . )] << |<C pd Dod Bk] ped cg ol 4 0 ” <H pa] pa] NTP 4 1S
NSS
= le i AR
0 5/00218
PCT/GB200 76
WO 2005/118851 < =loRBE po ol 2lale
Q < ol Cee E aaah t < <| lo Qa |g SSIS Ol01m 21 Is ee 3 2 ARIES ZISlokle <io|E Flak © ois lzlo@lele CebBEREE SIEeRBle
Siete Qld SE SIS 58615
OQ |<C aceiEldatls oO plosBeLialsis O AEE
CRE ls3(e SaoEEEc sR 9253 501 QZ |= QUES CRbRBERREE EE gle
Zolol=ih 681312 o[2[3BIEIBI8I0 101 olBISI3 al
Slolo Solo Q ClolElo 200 RIS oO 1210S oo a5loEB3Ie QoS? ® fred Oo irate 210 Olz E = =O Qld 2ICI015 =I SRE 2259 < Ol= |< os ZL |= OO
EE EeRBEoeEe ZBRISIE o< LICE QR
SlEie EES ERE O|G|o EREBGISE E 25 lelClo
SR|LERS Sle lElRlo|ai) Eel o5IE = 5i5CleEie siRlReRBEl dele gets L218 ol28/0 8120100 oO 28d eas BBE CERES
SEEERREE Elelals CEChRRERES <lE=[R|5
EERE EEE Ol® BR BIEIE(S SER RERE
SsELEPERs: Shek ERE RRleE zEQ BEER OlQl0 013 0la|3 QlElolal@ Q < O=1519 5181610 = 2B2[=IE ziSlalzls aed HAE Qlz|o|@ Piezo
Oi £10315 cRBEr EERE Ee of 1 SE
EEREReR(s 225012218 2S SPE 351285
Solio Ele < ERBERERELRS =~ 3215121815 52 RE SEES ClogISI9 SRIB3 BBE
Qe =o olo oO 28 Oo Olo|2 2lolzERIQ Llololg ele
O EBRhEBEERS SEES Ql<IElo Clos
SEE ER ERE OIRIBIC[B|E 0 SBELECELS
Hea sess 35121818 Slo|Q ee (OI Sa
ZEEE SEEGER 5EBERBR Sols
Fl2o2e ie] mh Ei2blalE|e Pogo io 2/310 cloBhelekle oO SERGE 2S0lS Salk ol = SREB SRELSEER Ole lE=lG S ©
SEREBERk SEER EER SQ E — Q =105 [U] QOLl=12 <€ be) = 0] [0]
SRE SEIS 3) LI25I05IS|E Ql Ll<|2(2
O SEIS ORs Qlo Flo oO 10)
HE 330 3310/2 R98 ILle -
Qlzlz < 3 QQ Old eds O ole
SRR: La 5|0|0|S 0 Ele SEE TE = O 309 E(S
EES ClElzlEl E Qo ehiZRELIGS 31513 <|Cls S12 o blolzl=Eg ) ERIZIE EoiclolBis he JEainpiaEiEn Sts Sis le 2
EEREREREL slolelzle 33 <|1Z1219 Plolelole
Elo al2lo|E FREER alslelg Flo 2231621 | S g Sloi251 pS Sloe <|El0lgle Solel IN
Flos elo iZis E O olojo re loo |&
SEE o EERE Fog RaERil
EERIE o BEEBE IR RBRE
ERIBIoF en ‘ [52] NILITIB [5] IFRS [sW[s IR QUT] l=3 [3] [Yel ™® hong hg hg ojo. [IR [23 N I
HEE HE IST ISS <<
HH QUIRK IRR << a i SLL << Ep hog Pa a, oO lela zlgls cl sieigleiels [IRIRBRERIRISIR alala
RN, lll NM hl ~|co (02%) <C wn co [co . {ON [+0] [ag ad EA beg PY
FRIEIRIR
I te A
WO [\o
GB2005/00 /
PCT
77 851 /118 2005,
WO
(8) ol8l 23192 3[31%15 el 26319 lol
BH
$2806 2lo =i 5
EisiZi
Ehl3E 21818 on oO
Jat: oz @ loo
SE 23
Sl 3 0] Sia
BER a2
Sle <
SBS ee
EGER
Sod
ERE
Sol6 52212 (&]
ERR
Ee
Els 22BIE
Ziel
EiRiElz
Llale|g
EiEie)2 gee leis oR
Eee
SANS
© 0 "ale
QIQIRIR
~
N hele << ; [= i(3] pt —! —
75 PCT/GB2005/002180
H 9) = L—3 S a 1] he] © £ + c bay Sc [] I 5 o Eel og , E ° R2] s
P= o [72]
Be (2 5 ® 3 2.2 Q o 9
Ye QC = 3 QQ & 2 [= RP =3 5 |= o |c QO — 2 DS © o on) = oO wn S R= ® 2 p] > |§¢ cE |g TO so |B eg Cc 2 a © 2 ol hi = Qo «Nn «C gE
S 5 |® |8 olds 29 Q . 22 |= 18 [§|S% 25 15 (8x (34 o a3 a |E [elk 35 | 3 © & [= c & [= — c © Qs 2 |B o [] = 8 > |[@ 0 |2|@ vw |5 c= =
E185 |2 [© |g88 <8 2 IgE |58 8 |=€ (4 > |Ss2 rR 5 Bs |e c ~ a, <u [3+] © DN 3 + 2
Q El-2Z tc? [ok 8 lug |o E ir. ac | loa |&6%
Iv) © = [= - HER 5 © Sols a op 58 Se|l |& c c2> 230 |E0 a 5 |28|g BIT 3g SE [2888
E28 [E28 [328 || £5 [BglEE 5
BS = Sle |PISE Le ° 8 |» 0) Ss lex |Z: PSE 20 |co|d@ =
Q 8% |1ZBlc (Sl€ 0 s2 |g Be a x 2S Eales |2B2 os |1&8|Ics 58 he) ® SE 1g es = = (2) © [] ge — [3] = = o 8 © oO 2 gs (SEE [5/3 23 [2588 B=
CLE [52 |zgle |B|ES £2 |EB|8s 12k — 0 4 (a) galt ? co I< To [BOX >= -— Pf ' 25 RAR cles Zz © © SS 3a 50 4B |g |35 S512 |5|88 [| (28 S228 |S® _ a [=] © o = © d wn 3 clo |gi8 £ go IsLI= CE
Eo i] 2 BLE gle Ble 8 |e 25g 5s © 9 22k © I=2€l8 us |Z€0C
SI < |28 |Z SB O ad |E £2 9
ES 5 182 [85S [8]5= Z| 28 |S8EE £3 — A o ~~ © SE ale c o@ = c £5 = 9 j= o Lo Jp EIEN = k= EBle wn © 3 | [8 |58 |E588EE 5 Eo (2518p 38% 0 I= — cE 8 ©
SBS [28 [33e BEES 5 [28 |55/28 |2Et 2 J 2 ZC BI8|E F4RY] 2 Q I) c . - © © LE glalgeQ = = [= R=] oo) 2, | IR 28 sscoEsecc| 26 22852582
EE | |x IZE §olEE2|E8 hE [85 52828525 u- » 2 on 2l¢ 2/5/82 5/8 25 ole ES Eee] © <Q < o 32 ol5le 226 ££ 55 0c8§c®
Cs lhe [ve (EEC C>OEn|SE EES ev BEE 8 IST (us £e63ls|sieas ecg see gts yu 2 SE E25 E clE|E SI>|6 §|2ElS E5855
Hole E18 Elst 8888 EEE SBRBEGIS E238 o 8 |§|€ EES 2EL5 25ST e888 58854 —- == 5 Elo rH AEE REE EEE
S|» [2S cSc|EEE8|c|EE [2g EEE ESI 2 Sle ele 52a 25 SlSScltlR|ZsBlEn|® alta 2 |8l€ 8128S ele EE 53(5I5 ZEEg|iEQ=2 iS} [H E gE 3~ = £10 23 SI3= a|838 = ~~ 0 I] $= (= Slo eis -® 2 = —-— ~ c 5 13 HEH ST EE EH 0 siBc|2=ci8E|Eglz0c8 5 ElgSim alls 88 2 15 [Bree g|gk|E 20c2l5E aga Ll2ehes
[2] = Q | = Zio 6 ElE| 5x 2 Elal® 5s WME Da CNT oa be EEE EE Ee HE [$3 < SlaE 3le BxXKO|L cE O | |= So EIT E oO g eggs tL 28 aD SIBES!S £2 Elo -
R= |S BlojE c gE 5180552825
Hoo|o 2x s|S 82 8 g|5iR 2 cE olERIEES = = o Sf ol 82 E Sic [ES QEE|EeN|EZ 0 s alr ZEISS ERzER [= & 8 ~ he @ 2 r~ 3 = 2 ¥ © 0 |g 2 al B 82 8 8 of 8 8
OU © <t el Nn [=] wn «— het N ri © 0 ml @ Ro QQ © © S
HE o S| 2 Ox =] 2 9 3 S [2 << 4 = a = 0 © [=] d > < al = 2 Q ; 5 &
O d oN Te) o < a © qld > al 3 o Se gl 2 © gH laa 8 2 1) 2 IS 8 a oO b= = [=3 IT Q ed od ° 1 o [=] o Sle o o = . . | [ =} oS Q
Z| Z Zz SiS = = = ul z Zz — [] pe)
IE 10
Hoja ~N Ln o ™ oN | pr} a ™ 0 in ~
PCT/GB2005/002180 4 hd °
La - - ue 13 ® os - 2 = c @ © Fa - 2 & pS
FO |80O es | c z (@ 2 lag |5% |o £ 8c 135 | 3 lz. 18 | EE | 8 18 82 Ig | |e o ac —_ Id ry = " & = © => _2 22 5S , & 3 > 0 2 89 D Ss r= >
ES ; TE 86° lon | . |g |eg = = = x = © = ee [2] 2E 25 (BE |E€ |? |v |u 2 18 |c8 |g I
E < 2 > 2 cs > = -
Biles lec Se lr EER lsh li Be © 23 198 |® c |[87® es (g [<8 g - § a |gE Pa Ee ce 2 |§c a led |e 2 |Z 52 (82 |2 |8° |B g 29 |B 4
TE |T8 |2€ (25 [8 |8x |& |@E [8 (8% [2
I £ 12a c =G 5 o g a © [J bod fa] Q f= [8] = f= [=% cc [82 129 |E¢ [> |Ew |3 |28 2 cS |£ a dD 2s «© .S [le] «x > Sg = = 5 = c eg ES |g8% |Ex 2 Ex 1g |g E |Ed |B | |8 > «5 c BB I=) -— - 33 B21 = © [+V] = © 290 « 20 = = % 2 a el
Ee |5€ |58 |ES |E |B2 [So|%g |E (82 |= | |e
Sw |Z og 82 s 18s |o B |E = = 55 ITE |2< |82 |€ |3% (28pm [2 (88 [8 | |8 [4 k — = 3 2 | 2 Su | ac 3 58 |S
SE 2c |2% [38 |2.l25 |88|238 |B |BE (3 > |=8 (2E |52 |B8|B% 285% |, _|EZ | | 2 - a 2
EE |82 |%¢ ($s 52132 |Ealed |2clsE |g > o 2] ] £ |28|8%8 |S8lse [8S|€as |B c 22 |&aE 28 5 [28/52 8855 (B8l85 (3 Ci — > Q us Len |= oO oO -« @ a: c © oO = z= R=3 - Dl 2 = dc (Ec |8S |Be |2§|28 |83|52 [FEBS IE 3
ES |g |-E |28 |<2|E2 |Ex|8¢ [s8I8E: |S g
Sg 8 |= = Zz E TIS = © = =z S 83 |8> |£z (8% |[p@es £3|5% EEE S ! o —1 b—3
SE £Z |t% (Ec [85/28 |28|82 (E8IgE |Z 8
Eq 03 58 |88 [2T|EEB |2el8e2 |85|98 |£ S cs hell 72} oa BZ S28 acl|le 5 nalel © 2 28 158 (2 |S ESBS sic ¥ Sled |E E 3 & co 29 Sg = 8s «© 218 = Oo o 82 |52 [32 [8% x98 |So|8 o|SE |o 2 = — Q == —~ 28 |e5 3% (EZ |2E|ES |Se|235./82%s |B | |2 £E =§ |°5 £o 8232€5 8 us58a8 le =
D = - O Sh c Ele 8% oS bi |= . m0 — = — ® =
ES.9% 58 [BS |25/RcE|882 52888252 ols 88388 558. |9a 0 QE22S S| SEE GE 5 =a co28? |8R2lcg £8|lec7|2F|anrlles|25K|8 |S 8Ea52 B2Else E=cligciosZEaccsS [EI ; == Sle = - uO ££ 8|2e EgE[c8,|5s(c88 88 cla Ele Ss |5 [+] — pay 1 - = = =| ~~ 2228 pt er EERE REE = |E
S25|E2 |585(885|25|E50|65B28So|8EX|s BIE
SEEIES hat REAR IR RUE R= k= 1 R=] FST) ol
QE ww > 8a oc EO BES o|tW RQ >alE ElED 2 [8s
BEE2E |EBe|SES|a2EREEREEoEnIERZIE [82 pt - © = Q 2 RIT law DO >™g [=a k=] £|E
ESD If SGE|8SElWBI-=8|SE|E 2 glen |E E
SSE|ZLE|E XE ERGs s8lcEln ES Elune 58 |8|8 = [3] += © Ta Cc “= $3I5EIB 3 2 BEE LIE |= cchiclyg|Plold=0caic @ 2 nis oy a cle |@ 8325558ls-8|S8s|85Ensle|SeS|s EE 8|SSR |S ga 2 o5SB|SEE slp ogle8lalals PE ole BIE |B vaolEEPlamo|gs 822288 833|g Eo |S REl2 cools l ES |523|Eg/8e3 IN al8cE el83uis 58 > cle ££. = =
Zo3BLEIERE|55 B2EI6 EBS 88 E58 3|20FLE0 2 2 o «| w| I= gl 8 § § 3 2 of I Bl 2 hs : pe gl 3 2 8 adi» 3 wl 5 ~~ ~— «= ©
N N 8 = al © Rl © Y=] 0% 2 BK 8% 85 88 < < <C ol = of ol wl |o 3 Bl % S| Bl Bl 8 3 3 gg @ ® N i 88 § 35 8 8g gg 8 3 @ & 8 o “| 7 © © © | © v7 © { ! sl sls! sls = = = = =| = p= 2 zl 2 2
Z z p4 2 Z Z Z Z [ew on ~ on af I 0 ~{ ha ~ ~ a ™ wl on “| = |~ o . 3 3 = £ o b=] nO £ ® = oo 8&8] =
DO ° ~ + ° [72} ca > 3 3] wr a « |E LL Ee) ez | 12 | 8 cB, BE | Ele 22
GE g gE leg Ig (38 | | 5 2 159 gE LIE ic IE 1B ISE IB 12] ~ | |E [EE "TE 2 @ DD o uo 3 3 |a I ce 28 £3 = a |ea |e (g@ [5 (3 > cs leg
Sw Xe) 2 |3s |& oc & a |? = % 2®
TH Net c oc 1S] c a baad = 0 2G = E DB |le= | 5% 2] 8 2 |§2 =5 : = © c.L2l< Hw c |B o = jo5) =% ao @ | a | |8 D XQ oy «<< - Q c O 5B B8IES k = br x My od 1 © ois 2 =) = a o <8 = Eg 2 |g [23888 |8 [Esl.5 |68 oC Q [= = R Ee) pr=] PRA) by = 5 £8 @ 2 lac |Sgjag |2 |EB52 8 [RE =X 8 5 182 1258 |x |2828 2 led £2 @ E [238 S Peg |o ZEE|E 2 Is 23 S £ |§2 |e€|lg2 iO |§& =. = $3 @ = So Sle 8 © S8|2 7 2 =E 2] = 2 |=3 [BE|I8g €o0|=8 = x8 [Ta] 3 2 8< SEIS] 2 25|8 SS c Ow z = E ~ c ow ; ) oe |8=|€® |g |=2=2 Eo 2E 4 | |B [EE 233g (E 1222s | |B |B pt 5 & {6 Scige § |ES|ES 0 2a i = c £ 2 L9le@ [od EZ Hi] 58 = (0 = a= bd e bs c o [7] © c= 7) I) UU O a. SO
TE =] > |Ez |2Q|EB | (83|856 g EE o S |85 |EE|B? |T |2282 © Ed
SE | low (8 |e [Feiss IB 2ERE | 5 |E 3c | leg |E [55 |5S|ee |S |S5[3E | 18 82 = p= -
JEBe | 125 18 2% |=3l55 (5 |ERes B 083 <5 | Eg |§ [58 (14SEC |B |88|2E | 8852 a = 0 - i= - ©
Ziggy 2 1B |S |28|an |E [2423 THES
Ele & Sel |& |8E |2elad [2 253 EBIGE
Slee oc | |B8 S28 wn E [081+ 5H £€ leo
TIE ® 23 T<IEE |2BB 4 |T 832 SZleg ®|a cc o +2 [Vs c a 2 hid pO8c ®lm @
Fis 2 QZ o 2 oo [J] [4] =e 2 =| ® oP |Ex|ge &£OlaB®yle SCIP E oxica —{Z Ec] (8 og E|l.a& cle BSE |oE|Z2 Ee a °(8 nix oo tS EIS |ZE|F2ss [BEldEE 7 = po
TI eSsE|a 9 Cla le 2215 Ie SEES
ZT|>GolE|aa ee Ondo ox £300 85125 —lEselsic8ix=zslBa |S alEce EE §x2 80 |25|8e
SlSEEIE|IcETEE gg |88 ok 2E EEL IIERIR
PISEEfE|soS25(E8 |EE(a0dls E522 sREE Ss
So 38 [8 Bis = £2 |le2t #8; SER © 2 = [BQIEIPR|e te) Cc ale EA =} [<) xcl>-lew alc ce X= x|8 cls © elas S 2l- Q 82 SIEElEE [585288 FEEER EQ c 22 le eiEBIEE Q 0S = cElEECIEa Plat
Slc 8% c & [7] © 2 ct o|lE EQE2c © ©
Sl= HIER @ c a Se - So = Slo E = 212 87S z 55° Ki=S 8 4 b=} » Ho EGE Sie p44 had [WV] Q-9le = E 22 2 =| 2 313 28a ElE2 = [ad =F = [2 3E= [=
SS|R|EEI8laiaE E2ag2 Sole8sidagis 2|c8281£3 80 85 cs SE EARS Sele a°|lg 3 oo 0.28 EE RBleR|a al ols om Q nix .
LT EBs 53 228% Ele B=0 gl= £2|S BD w|i —=|oE al¥S ois |e glRlen|o 8 &(T C0 gls [£9 8 Elg|>E
Toc 28 5 OlEjal|e B20 c|BR Es $o|E23ie|ZElgg lad sla cElc|2E|lS gle SIRT 5|8 cle Sle OEEIESIES 3882 8<Is|21|225 2518251 EE|25|25 SEER
C2 ogieZ|0 TB|RE|ST BARK EIER rd sles clRlegCigRIxe ela cic 65a 8XIRI2BBL
Ilo ao EX CERISE ESEIES ESSE EES ole Sl € 0 I © © ~ w| Ww IN
QQ ~ 0 ~ ~ [= ND «- @ © - - kl = ro EE © wl ow gl 8 | xl al 3 & 5g 3 gg 8
Q =] =] oo eS] © ol 8 = Q = 2
Q x 4 > Z| «© 2 gl ¢ 3) = ® 4 E = Z| 5 al = @ Ql =o © oN © ol w «| wv ol o » © N ol g > © 2 ©» 0 & o - ab <= gg 3 8 5 § ¢ gl 5 ol 2 NE S| 5 bt, = = = 2 = 21 2 sl = pa pa p< Zz Zz Z Z Zz pa o — o ot wv ~ aN © © wn iv) © ~ ™ wl Ln ~| =| wn| wn ~ n
05/002
PCT/GB20 81 = 0 2005/11885
Ye o ES = Ze [=] c x
EE; 32 |~8 £ z 8 £3 : © ° |& . 8 [ he us 0 ste 2g 55 53 i E 5% 95 =A @ 8 Fol = n 2 c 0 5 : : 1 o® > 5 © QC SE 3 : 5 c =e [= LC = a £9 52 £5
SE iE 85 S38 23 2 & (2c > |SE 4c 28 38 0 it £ es |5g g Se £5 128 SE [as 1 -—r K~
S B|5E IE |3E Z9 Isls 25 33 5% a @ oe > cc = co Sle = 8 Ow © + § cg E c= i CC =) ° 8a £ Eo 3) Se 3 Ea 2 = S (eg 8 |835 5E |2] <5 = 2: a r= [= 3 << EQ Bo Sls 8 > g £ ‘© Own 8 |58 cc 512 €3 2X 53 : 2¢ g br—1
S S 183 5 Se 3] Ble £8 =5 |23 ~ F |8e & = DE co 5 £2 : : [5h 18% | Sa £8 |8¢ Sg - |. = =o ?lo £ ge 5 [4H] wn Be B® QQ» p= fan] IOS D = Ea 8,
R= = = @ EE 2 : :
S| SS |3 sc |88 SIE 23 £3 x 2 2 |&¢ a 58 |E= 2s [53 5 1 o a JE = S S| ji =e i e 2 82 0 $8 2 SEER i 4 . c 8 £8 E : : 3 245% 2 52 58 xlEgEe iat
B S2EL |T £8 = alggl2s 2 8 £8 eH : 3 eB HAE £8 ng [=% QD aig o9Q ca yt 4 gs : [288% 25|88 |ES elZfes la 23 : +h £882 58 Si5 3a Ex 2
Slaw = CE = o = 3 E o =
Ec 555¢ EBlES gE E585 EE 23 © > 2S ® B= SIEaEZ 8 gt 2 2 wis c 2 : il
HEHE ® HEE EE = oo
BolS|EEE 2 23252 2o%|Z|g5|a 2. Ce
EEE 25|555|25¢ Tle ®@ CRS E= 2BE(NERT LH ERR ie eas INET JH EEIHEE $8 (52 : sages ele olcz sl a KI E “©
SET 193s Clg o >8 88; EG 2 i 5826528 REE HH
Sia le ~ -— = Els 8 E : li J SETHE EE 55 145 > © Y— 3 QO = : : 5
SEclEEEE HEA i 552585 sEasegicsices 88 550 [«] Ql - SEIZS 3 © og 2
Ss 10 8 P=] [=] = ole 215 3le 38
SSIS IE DF Gl E co 8 2 5: 85 +H. Hi S3|G¢E8 EZlsiZ gis 8 22 |c5% i 222 RE £|8/°3|& SIeh EZ
SEED iE das eax idiegies acc
FEE HH pl858 522322
B<|5 ieee
LoS |T iS 8S Els 1} B|=|5 gle E 5:55 2 «=r iz BoE oa | [3] 5S BES cE3 52
Ge op Ea aa slic EEE Scio x fo 228 28s oEsacaiaecs 8 ole HEE HEN] © ola Jdelss< 2 25
NES 86 2EEl08 R55
D Ele SESE
Mm aid ~ old £ 3 2 oN a8 5 : : 2] le) Q © |S 8 3
Te) 3 @ S 3 SS : < Rd 2 8 : ; : 2 5 3 2 <<< a) S HIE : g wg 8 : : - lol © 3 ; NEAR: n
Sl 2 N 8 Slo] « &
S lf : 5 315 S,
J 8 EE : > < l= = = - 3 sz ZZ ~ © s = = S i oN 3. n . *
S =) aN
PCT/GB2005/002180 he) - gc DT 2 SQ Eo = 5 8c wn £ 55 cig E T= 82 BSE | [05] 8 cd ols so 2
QO also = QO -~ oa Tio SE =o 528 | [BE B® 82 88 2k &
SE |ElE 5 a 8 > ge [DOB EE
DC wl oc >> 2 oT =lc’L Pd £ 8 [SY E=] gL 2 ols © G I= @8 Ee 5 = © 3 w-— E33 © on oc =D b=} cL == = 7 =4 =
Gg © |Slec gs S ac SHE» ol 5 ny oQ © 5 © 82 AEE @ go 38 (Slew Sal 2
Qe Ble > C2 @®
E28 |g 8 £28| E <8 gi8e ER 8
Ea ln > ~<g| 8 29 nog ox 5 £ Tal 38 [= [=] 3] © a= |-|§54a =~ 2 g oo = El = m 3 = Q ” oY = =(53 Vile O c “~ =12a9 UB QT 5 220 |3€o EP 8
Sieg |faoa Sc| 3
SES 128g | [E58 ©
SIS =e .E mg — - fe} = o glee o|a 28 £
S|§E |zlo> SE F 21Sg |E|oF Eo ©
Aw = © oc c
Cc | a | ~~ —~ (0 =
SIE |8lod [EleB® =
EIB& [ae glx|pE
S|2c |ElsEEE|lgEl 2 2 | © nlsSie 2X CRESS 5
AIL alsigc =] wy oalitsw 5 9 [S
Sleg8glc|l§EBIS|23! E olos 258 = ke) pe
Ses 838s Slai2 a Oo 2 |x a. o|©9 € ld [= ~ 73
S|E28i5|= «82 >
A EEEENNEGR
SZ o=c|3|O0I SI |2E 8 3|P2S|E(JE EEE o
IC ERIEIEZEIE|IL S $4 me olla = dls Ql 2 = @ pe © ~ [= 2 hod IS 2 2
SE 8 | 8 8 a
NO OQ x ~ =|m << a cS. 0 ~~ > [23 hom ~ -— 0 « oon <x oO olo =i
N= oN oS
OQ Ye) oc ss! s| 28
ZZ Z £3
Ge 8 [eo] ~~ ba [andl Ind < < =
83 PCT/GB2005/002180 = [sY — pa Fo >
[1] ro 2
IE lez 3] 82 £12 q @ a |§ |kEE £ cal % |a¢
Z BIE ® as |g [7] 28 € £T g 12 | [82 la 2E3I5E Ec € 2 le pat D2D(8 —® pt =] gs 2 Aah - =| |e IE cS |e Sle Sle 8 DS
Sli 2 oa |%a |§ 2i=gl=3 a2
Sil 12 IT |8F IE 2=Blag |28 lle 19 = |&@& S(8 58 88 .
Oo g |5 [EZ E c=|0 ¥
OlE|8 b= D eles £2 [LI olEe lc 1B 138 2 | | zzZ |29 dif 3 2 [38 [2 2EI8E |ot - —
Pee 2 [2 55 5 | [Elgg 238 =] 3I12|5 cs |2 |&X = E gi ct w |0 =ER a [E> |° 2|® 2a g 5
Q | Elo|2 |e 8 85 |= cloB5E = 8 523 [2 |S |58 3 Sleglgs |e oO SEE ~= L = ~N sl? 25 2 a2 i: 25s =k [3 “3 12 | [slg3|ES 3 a EEE |= 5 28 |§ SZEI2E vg
Z Ia 322 8 |§ 53 (2 2/2882 |53 0 SlEle S |g [ER a gg alg 8 ga od S5le 13 = QD (si E|EE|Z = = g | ®|s Q | gs Eg a= Ss © |O gi8(8 I) > - cle ols 7 Eo
C 8315 15 IE [22 2% [E552 [22 a) a £[g|8 = S |E2 £2 Sgsl5 8 £8 a HEE 2 82 gal [EIS 8E (5% > {0 2|&|S 2 I= (88 =U S|gglE 3 ES = 1h a= 8 IE [Es =¥| (5/88/23 EY = ZI81E|8 |T 3 182 |Esgie als 5|5 3 @ © oS Zl8lele c 2 |e§ E5282 E|E g 22 woos ElaIBlE cS |S |28 C EE Plaid 2 =
A 1 ME ga |S 25 (28 eles Ec od i E 5 |= |2 22818 SlE T= g |= alzlzle |= | 28 IEE EE |EB g |g LE|Bi8 |8 <E |2 2 (SIE 28% S > a Elsi |B Ss |82 2 NIS 22 gs 2 © 5 egg 18 |g gs [EZ SeElze 2
CR EER [S |2E c8Bl5Es5E Bd
HID £ glx <a |B ae |= s8|§ AEP KR] 5 fa REEERER 5 |58 25095 EIES 58 g |H 21212|210 ojo f= 83 S88 NESTS kA u © £15218 @ S © B 2 50 |a RIE IPC 5 14 |Slealsle age (52 gas ERE o 2 Cc . R= of © -
N T2SEElREls 2 EN FEES a SHEER ££ THEE 5 5 J HA eH o L 38s E|DEgE gE|s aR |& 5 cSMES = oo % [3] a 18 foisted ie dee oo (Oo HEE Sea 212 le|5 E|§ Elcs®
O Gi |T|Ti|n Fe AE ® | SESS 25
LY slo 50ES ais ElE223EER
So|l=z cla eo ax?
AS 9 £3a>E [*] [1] [{e le] o 0 nic K ~~ & |e © B13 8B § 3 al g 0 - Iiywl I 35 81 ¥ =] o po ov |v o sya 3 8 3 3 gl Ig 8 a Q
EA I] Sleld| 8 & € 81 8 Q 3 oH m|<<|D | XxX I=} oN [re] » ©
[1] Q © © |a J] = iH Oo
E 13 o a a In NE ml o ja L1381% S02 [o} [=] p= [3] 3 ~ gle. BliE gl 8 2 IS 5 2 ger i2 | = sis pe hy
Go |= 22 sg i 1S S 2
Zl 2 = = 1 Sl @ =z Zz = = — a z Zz ad [8g 0
I po wn bl 2 <= 0) in ©
2 (gE % |5 | |23 3g 35 . 8 le 23 £ & l=
E3 (82 | se || 2 || 18% | 18 |Z sx |g 23 5 |E =] 95 = & > oa [ a Q 2 7)
SE |8 a5 ee |§ 2 Bg @ 80 c [=] go I= E o |e wn cD cz So 1B nw 2 gs e 5 g8 85 ° on |® az |& |g > i 24 .- (0 eu [i] -— BE s c OK
Qo © ~ Jo} 2 [] © 2 8 Sw r=] Sa
Eo |ad ody] sg 2 5E 18 |8 E §8 £8 0 o = ® 2 |E c ds 185 es |0Z 89 ga § s8 |= Ew ¢ IES xg |€C 5 2 <2 |S 3c Bale S lew x8 oa <3 | aR] ir] 2 ® go = 2 |Z 23 lst g 58
IRN Eg go EZ Is £ o SB ae > |@ ae EE 3 oS = o Ed S82 an = ES 8& |28 >ll12a |2 | |28 (293s | IE |28 © |8xg Ee se |B 68 |Og|E IT = @3 |Bg S e >a |ZEIES 8 Ee 28 |25 | 25 | |2z |2 | (Es |23|E8 | |= |=°
Oc (BF eg «led |£ 28 aT|E = led 8 om vg [ZlE8 TE Flak @ EE 23 |§2 gs ®ES | 32 (83x : 38
Ss 20 Xo |E|lzS ] 28 S |o8
CET 152 | 182 58% |2 | |B8 82|E§ |3|2 |ED - - |= o £ - 5S |Ea Se WES [Z| 1B3 [52|2% [Bs (5% €3 |88 SE olg% 3E |S0|88 [EI |BS oc 3 ee £2 |B S88 12355 2 |8 8
ES |o 85 |X|= B Ze [ggg a [EE £ 2 > £ oa Ee |e =e Fo] £
Eo |B9 w 3 VE 5 > xO |e = |=le a 8 aq = ca [~Mw oe a 9 > ot 0x I] £4 28 co (2/EE |o ss [25123 1818 |58 5% |35 | |B [&35 [= | [5% [25st Eg [SE ¢= 122 | [25 12182 |= | 35 [28135 [2 [5d oO 2% > clo ® 17] ce |SeigE a |2 gE |5 £8 Sige |B 83 [BSINE [Siz IES a5 |58 25 [28 |& sg |®3l8c [Bl5 |238
EE rR] = El E Q 2s S=lE® 14 £8
St [sz | (85 [E82 [Bal |Bg |oE|28 518 |23 £5 |EE 22 18182 (23% |=8 38g sg (ES gE 15¢ 2e 18128 NE] |2% E3183 SEa8E
Sg |og og |B188 |FEl [22 |2E|xE BlEala® & 1cB Sa |glEo 5 5 [251% Dia §|= 3
Ev [83 2 > [S82 [a 50 ne|l8E Sls? 2d |B g 1228 52 SE [328 [23888 » 5 S5 SEogta Te £2 YELE Hlo3 85 8 w Ld + > =| [= Ce sScef | [Reds|izq ES [22 £52385 S883 ® c 2178 89 I] k GRAS < Oo ££ c 5 8 ola E La oO © BAS N= 2332625 |385|a coiss Sg Solxg§l8E IER
SR 2cEs (8 2el=68|8 8 ge |N§|l2Z SIE GS
I0E|Es dl [85285285588 c SOX E|2EESD
SELES: 2eel8s3|738 CExlsolag Beagles sE2Cl59g8s|SeEl@ 8g ITER . = R[EF|L ccl®|oF|>6
Esc52zelglss gd SITE3 |v EC|GRZTES|a 0&9 ¢ SIcESIz oe SRL 5 8|55|8 cS 5lad ToIRB BOE
EEE EE ERE EINER pe cESiccg|8ofieizlcER|gan(aEqsgEEREE = = nN 0 Q [3] © =
SSolaBcle|s gee 2S|I83a8|E6=|S8|AREEES|SE © [=z] [52d [eo] wm 0 - ~ Ye] I] 2 © ™ [324 o © & 8 elk 8 « Sl 9 gl 2 1 I. gel g| 8 gl 8 HIN 3) Q [9 8 o 3 9 o| 8] £ @ @ S| S| & a el 2 2 om <| X <C -— -— [7e) fo ®m| «© ~~ © oo ~ © [{e} N ~
S < ~3 3 2 8 & ~ 2 a [ om hoy [Ty] [(o] he of [apd [Vel 8 3 gis 8 2 $8 gl | g § i | iS i | 2 = = Z2i2 = 2 =| = = 2 2 4 2 pd 4 Z p=4 z Z pd p4 4 [e)} [9] oo — [=3 [a] oOo [wo Ln] \o [ea¥ [ad] mn eo} t ~N —! — oN Ty] [ange] [e6]
0 /00218
PCT/GB2005 85 a 18851 - = 2005/1 c 5
WO > 3 a] [3] o o £ E ° [o] ® 0B k=] o 3 Sc 3 = 8 |3 £3 Bx |B 3% 23 0 FE < g 5a c [TC 2 Oo © 2a 23 =
NDE 3) [0] =
[7] [= CL o 1 © [= E85 ge = E
ERE: cf |e 3 |53 8 sg | |§
E 12 55 IE £8 15g tT 28 gg |o¢ O = 9 | = 2c © © Q
S | JT ao B °S 123 aw 2 © [) [OI] e Zz [Bg @ ED
So% %: 8 [28 k 2 [32 |5% [53 g |g
Qa 0g |= >< Eo 8 |&E B® |EE c |Z
Ege £g 3 |e 232 Sled 52 le ER e9lx f= = 2 © a3 D = 3 £ o j= uw = = [=] = Q pd [=] - 2d ga £® ec EE g 22 29 ge a 1
EE alO Ex |e 25 |& " 7 lag 5 & Ga © iS 282 a8 |S £8 (Bg o € 8 2) » |o c3 + = [3] 3 a © £ B32 ® 2 8 Of we [2] -— @ = a, = w £8 £E 3 |-E $c 3 §= (22 EE | 3 x hhh hs lh a 5 5 ge Ec |2 y= ° | oO 2 288, Ee = 83g % 8% 2 ES zg Zc 3 2 feleind [8 oF =Slel 7,58 =8 2% | |2 5
HERE <8 sas coc 55 18% | 2 i:
S2leE pid p= > |2 8g Eig = £2 Q g5 Sg |g (82 gElel |5 E82 |S ® >a
HEE 2 E gS |8 3
Tt I 5% Rik Zgi8s gs 84d HH pgs se Sf leo s Be 258 S |Z ££ £8|g 38 2 gd a 9g aly 8 2 |25 88 =2|3 5 2 Zz o . a a Ea & £2 2.9 g oi = 28 =< -— 53 aes SHS oF - |=
Tle = Q OFle BD © B= 8 E
CIES Be 2] ge £8 gS(ZE © F == 55 doce |g 8< [BEE £352 (23 85a nEiE 582382 [5 iE (232 EEE |5% 252 [293
Elo 8 | = 5 olg E€lg§ = °o8%§ = Po
SE|% Eos g Pol a fe 5 (88 (3 Zo
Blo S = -oiEl ET 2g 82 [E85 2531 MEE 5385 58 [C83 8812 7 c |g a Talg 28|55 @ 22% 28ig
SenSisE a 8g 3K] S828 Ee [a8 523
FEE 2 CP ® 3|2 8 8|E 2 ze SER 2x5 eilEslis iE 13 HEE HH 25
EEE ol EB 5 ele 23(cE5odalnsE Qa 12828 5 2 RSE HEE S380 (38
EZc® 2H 82 h-] LE 0S|2E 8 20 Zo oh] 3158 I8 = 35/2 S cEiEigias O Ela 2clgd|58 = 0 = 23g SES EEEREES 515
S315 58S 3 qd 3gie SSI280R Een £ of e D2 £29 c 2 51852 SE§lzEz 8 Sia
SE 3|® . 2:0 2355 £E gle
BEESIET old 38 HEEL ELE SE EE
Ss221282(8 Be HOHE EEE SI& ES 335 ‘Q © a Sle rr ‘D Gla 0, Qc E28EES 3 og
S22 gesisl |B Ee|g|8 ZR glew WiC 5) 28 Sec ple 2 2228 3 38E\E SLL 2 Los 228° gElelz | =2iEl5 (28) EERE ol
EE lef fpr SIZ go 251388815
Hi HEE es8Igeles 52355555 gS EIZ] Sib ls Bo E|SE5R 5S coleElcc® < 3 81% _ QQ BIB Slo <
HERE <&& . 3, 2gleglsee 3 218 1 Alo RD S gle < [=] [Ye Q
QQ fs) [Te] mm xg 1 PY ~ [Te] > <C 1 2 58 a ec <
FAREN (3 gl Ii
Pe} [3] ~ o rei] > ol
B Py Ys] 5 © <Q SIR gg 8g 3 2 8 8 2 o o = 28 | 8 3 ¢ 3 | 3% pS ®e - © S, == ho e © © © S, = 4
S| Bl 88 © = Zz 2 3 18 8 S19 s 5 © ! z 2 2 20 HN Shs z
S o 8 < sis Zz 0 s ZZ ~ bY = _ S z| =z o a
La tn + in “ ™ SI N [=
PAL I
£ . ©
S|Ee If
Soe o 212g |2 > olB> |8 D
ERE |2 =
RE |£ nlc © QQ slas 18 .S ggg |& 8
ESE [na o 2|% 2 <8 2 2 5
Ele |o 81 ix 0 [4
SiS 5 SE ol 18 =| |e pe} 08 @ TS gaia PlolE 4 c|l8e |83% dls 5 fe 2
SIZE 28 218m clio o
T% [83 <|alale Elo 3
Els £ aL Zl? lS 2! |2[0|3 7]
Glee 123] | RiELR IE (Bg |B 2lsa has 218 23 @©
SiZE [4% ~lo18]8] |= SIE 8
Sa |Ec MEER © 2 | ELE als|s
Bla 8 2.8 JIS|E|E| BiZOlB|e|sS|E | IL oT a 53 LIS IBIS LlZ|s|8lB|2 {|< Zz
S|®w 8 olg SIs |ls EEE BIEIEIE SIE £25 88 | 3|2|8|820|ElRiE|5|E(E|5 || SERIE
S|& ce a 5 olsls|sXlci<a|Bl5| ill 0RI<El= lose |8E Sle(Bl8E[SIEl el EleiRlS RIS SES g £5 = |8|8|S EE 2 8] 5l8 3 BIR (BIS IER 5183 |& HE REE EE EE En 28g oO 21212130 S1Q12(2]215 SEGRE
S28 122 | EI5[eleleBSigsleRRleBIRI8RIS 28 125 | BlacEhllege scene
S[Z22 |Se sl2 121812228 E RIE eB la SES
El-2 (28 AE FP RF Ek RE ET eg sls a2 cB |EiElolgl=|E]|S sISIER BIg gle 82 | |12)312l8|8(aSIREIRS RB se BS IRIN ed ber = [+p]
S132 155 | [BSEERIBRERERRERRBRR cla 82 313[21213 al2|228e5 83 |a3Ie 26g |7. 3olslsld cE se 2RE 2 SRE |= g|l5E 28 E2122 Ble 23 2 sis 2 is|s ls is |S 82 lug SlalalBISI8|al0141212|0(212|212|2
EL | P ize 2 5815/2181 28/8 |2B181BIBIB 58> jot ESS 6]8 ol ad pt A FSA pl ESE RA A RA RA 5|e2 |3<| |5|BE|els|zlzlnlEle es a Ie! |E|E : Sot IRIBIEIEIBISIEICIBIEIEEILIBIEIL|S|S]: 535 |28| [25021225 25 Ele EEE Ele EE 5
Elgg [8s SlclziBiBls|glss8|2 8 812 8 £15 5 o =!
Ble eg 5ls|5|3|5|5(5| 2 z| 25 |S |S SIE 5 1S(5(2(212
S508 aiS|2l0(|8le|2|ElR|E|g|e|gEisiglE LIE Iss 78 28|2 zl2 || iE 2lE |2leleizIE 2 Ll 22 8 8 8 2 2 >& 58 BSI HHHBHEEEBEEREE EL:
TIE L2E ERISInig 2 clataralclsSlaaisiaalaiaia 2i% 3g 4/5[0|S ‘55 ala e[c(6|o|n| ola nlals|a = ® ad clalzlelele|alalc|lelc|a| eee |@(@|@
S| o- Ble 3|2lslela|Be|e 3|8[3 I alg|e
Efren ER 2 REE BEE REE EEE Els
SIE 5 eid SER Ecc |B Bele |x 22 |e |E (=| EET wn 2 0 f~ mmm =[~1=18 © = 2 pg han TRIER I2BBRIRISBIBIBS CBSE = < o p:3 [o: = Ns [2 = ]ed [ND [3 163 [= |e [NON 1 2 8 8 Ck| BEERBLEREECBE RRS & =) a| a [<< Sle ER%IBERRRRRx]REE XB & © 8 oN <r p=4 Qo eid QR TINO ISIS a |N |W) |D [ON © [0 8 un 0 ~~ NIN [|< 107 |= CN <t [xt |0O [LO JON so) [7] bond hag Dad band bl ond Dll [nd [= RLY 2 [SY An 3 [nd un N= [I~ IN WD D ES pl ol ol IRIRCIRIRISISEIISISEIT] [TiTiSISiTt |= = = S| st sisiEiEiziElziEEiEEEE =S=EEE = E
Zl. Z| Z| [ZEEIZIZZEEZEZZIZIZ] 221222] 2 2 c o 3 wlio |H|MN 8 [ad atin im|ojmo|m r= 0 (~ OO [rd || a [oo] [HANH inn [on a ied <C
Claims (48)
1. A method of preparing a standard gene transcript pattern characteristic of a cancer or stage thereof in an organism comprising at least the steps of: a) isolating mRNA from the cells of a sample of one or more organisms having the cancer or stage thereof, which may optionally be reverse transcribed: to cDNA, b) hybridizing the mRNA or cDNA of step (a) to a set of oligonucleotide probes specific for said cancer or stage thereof in an organism and sample thereof corresponding to the organism and sample thereof under investigation, wherein said set comprises at least 10 oligonucleotides, wherein each oligonucleotide is selected from an oligonucleotide corresponding to a gene sequence from: family (i) genes encoding proteins involved in protein synthesis and/or stability; or family (ii) genes encoding proteins involved in the regulation of defence and/or chromatin remodelling; or derived from such a sequence, or an oligonucleotide with a complementary sequence, or a functionally equivalent oligonucleotide; and c) assessing the amount of mRNA or cDNA hybridizing to each of said probes to produce a characteristic pattern reflecting the level of gene expression of genes to which said oligonucleotides bind, in the sample with the cancer or stage thereof.
2. A method of preparing a test gene transcript pattern comprising at least the steps of: a) isolating mRNA from the cells of a sample of said test organism, which may optionally be reverse transcribed to cDNA; b) hybridizing the mRNA or cDNA of step (a) to aset of oligonucleotides as defined in claim 1 specific for a cancer or stage thereof in an organism and sample thereof corresponding to the organism and sample thereof under investigation; and
¢) assessing the amount of mRNA or cDNA hybridizing to each of sald probes to produce said pattern reflecting the level of gene expression of genes to which said oligonucleotides bind, in said test sample.
3. A method of diagnosing or identifying or monitoring a cancer or stagc thercof in an organism, comprising the steps of: a) isolating mRNA from the cells of a sample of said organism, which may optionally be reverse transcribed to cDNA; b) hybridizing the mRNA or cDNA of step (a) to a sct of oligonucleotides as defined in claim 1 specific for said cancer or stage thereof in an organism and sample thereof corresponding to the organism and sample thercof under investigation; c) assessing the amount of mRNA or cDNA hybridizing to each of said probes to produce a characteristic pattern reflecting the level of gene expression of genes to which said oligonucleotides bind, in said sample; and d) comparing said pattern to a standard diagnostic pattern preparcd according to the method of claim 1 using a sample from an organism corresponding to the organism and sample under investigation to determine the presence of said cancer or a stage thereof in the organism under investigation.
4. A method as claimed in any one of claims 1 to 3 wherein said family (1) genes encoding proteins involved in protein synthesis and/or stability are selected from the group consisting of: (a) genes encoding ribosomal proteins and ribosomal activation proteins; (b) genes encoding translation inhibition and initiation factors; and (c) genes encoding other modulators of transcription or translation.
5. A method as claimed in claim 4 wherein said ribosomal proteins and ribosomal activation proteins are selected from the group consisting of ribosomal proteins L1-L56, L7A, L10A, L13A, L18A, L23A, L27A, L35A, L36A, L37A, PO, P1, P2, S2-§29, S31, §33-S836, S3A, S15A, S18A, S18B, S18C, S27A, 63, 115 (and pseudogenes), ribosomal protein kinases, ribonucleases, putative S1 RNA binding domain protein, eukaryotic translation initiation factors and guanine nucleotide binding protein G. Amended sheet: 9" June 2008
0. A method as claimed in claim 4 or 5 wherein said translation inhibition and initiation factors are selected from the group consisting of eukaryotic translation elongation factors, tRNA synthetases, RNA binding proteins, polyadenylation element binding proteins, tyrosine phosphatases, eukaryotic translation initiation factors, and RNA polymerase I, III transcription factors.
7. A method as claimed in any one of claims 4 to 6 wherein said other modulators of transcription or translation are sclected from cyclin D-type binding protein and guanine nucleotide binding protein.
8. A method as claimed in any onc of claims 1 to 7 wherein said family (i1) genes encoding proteins involved in the regulation of defence and/or chromatin remodelling family are selected from the group consisting of: (a) genes encoding immune response related proteins; (b) genes encoding TNF-induced proteins; (c) genes encoding hypoxia-induced proteins; (d) genes encoding oxidative stress proteins; and (e) genes encoding proteins involved in chromatin remodelling.
9. A method as claimed in claim 8 wherein said immune response related proteins are selected from the group consisting of T-cell receptor and associated components, various cytokines, interferon regulatory factors, oncostatin M, Leukemia inhibitory factor, chemokine ligand and receptor family, complement components, interferon stimulated factors, MHC class or II (or related components), adhesion proteins, nuclear factor of kappa polypeptide gene enhancer in B-cells, myelin basic protein, cathepsin, toll- like receptor, protcosome subunits, ferritin, protein kinases or phosphatases as well as their activators and inhibitors, leukocyte immunoglobulin-like receptor, immunoglobulin components, defensin, oxytocin, S100 calcium binding protein, lectin and its receptor and superfamily, leptin, phospholipase and growth factors.
10. A method as claimed in claim 8 or 9 wherein said TNF-induced proteins arc selected from the group consisting of TNF alpha-induced protein Amended sheet: 9" June 2008
8, integrin, inhibitor of kappa light polypeptide gene enhancer in B-cells, TNF-associated factor 2, 5, nuclear factor of kappa light polypeptide gene cnhancer in B-cells, MAP kinases, protein kinase C, ubiquitous kinase, cadherin, caspase, cyclin D1, superoxide dismutase and interleukins.
11. A method as claimed in any one of claims 8 to 10 wherein said hypoxia-induced proteins are sclected from the group consisting of sestrin, E1A binding protein p300, endothelin, ataxia telangiectasia and Rad3 related protein, hexokinase 2, TEK tyrosine kinase, DNA fragmentation factor, caspase, plasminogen activator, hypoxia-inducible factor 1 and glucose phosphate isomerase.
12. A method as claimed in any one of claims 8 to 11 wherein said oxidative stress proteins arc selected from the group consisting of superoxide dismutase, glutathione synthetase, catalase, lactopcroxidase, thyroid peroxidase, myeloperoxidase, eosinophil peroxidase, oxidation resistance 1, peroxiredoxin, cytochrome P450, scavenger receptor, paraoxonase, glutathione reductase, NAD(P)H dehydrogenase, glutathione S-transferase, catenin, glutaredoxin, heat shock proteins, mitogen-activated protein kinases, enolase, thioredoxin reductase and peroxiredoxin.
13. A method as claimed in any one of claims 8 to 12 wherein said proteins involved in chromatin remodelling are histone replacement proteins.
14. A mcthod as claimed in any one of claims 8 — 13, wherein: (1) said cytokines arc the interleukins or their receptors or tumour necrosis factor or its receptor or its superfamily; and/or (11) said adhesion proteins are CD1A, CDI1C, CDI1D, CD3Z, 6,8, 11, 14, 18, 24,27, 28, 29, 40, 44, 50, 54, 59, 74, 79B, 80, 81, 83, 86, 96 or ICAM; and/or (in) said immunoglobulin components are heavy chain or Fc fragments; and/or (iv) said growth factors are endothelial cell growth factor or erythropoictin. Amended sheet: 9" June 2008
1S. A method as claimed in claim 14 wherein said interleukins are selected from the group consisting of IL-1, 2, 3,4, 5,6, 7, 8,9, 10, 11, 12, 13, 15,17, 18, 20,22 and 24.
16. A method as claimed in claim 14 or 15 wherein said tumour necrosis factor superfamily is selected from the group consisting of TNF superfamily members 2,3,4,5,6,7,8,9,11,12,13, 14 and 15.
17. A method as claimed in any one of claims 14 to 16 wherein said Fc fragments are sclected from IgG, IgE or IgA or their superfamily.
18. A method as claimed in any one of claims 1 to 17 wherein said immunc response proteins encoded by family (ii) genes are adhesion proteins, interleukins, their receptors or superfamily, TNT its receptor or superfamily, immunoglobulin components or erythropoietin.
19. A method as claimed in any one of claims 1 to 18 wherein the genes encoding the (i) families are down-regulated in cancer versus normal patients and in the case of (ii) families the encoding gencs are up-regulated.
20. A method as claimed in any one of claims 1 to 19 wherein said probes correspond to genes which are systemically affected by said cancer or stage thereof.
21. A method as claimed in any onc of claims 1 to 20 wherein said genes are constitutively moderately or highly expressed.
22. A method as claimed in any one of claims 1 to 20 wherein said set comprises a combination of oligonucleotides from family (i) and family (ii).
23. A method as claimed in any onc of claims 4 to 22 wherein said set of oligonucleotides includes oligonucleotides from family (i)a, (i1)a and (ii)e. Amended sheet: 9" June 2008
24. A method as claimed in any one of claims 1 to 23 wherein said set includes oligonucleotides from genes encoding one or more ribosomal proteins and optionally one or more histones and optionally ferritin.
25. A method as claimed in any one of claims 1 to 24 wherein each oligonucleotide probe is selected from the oligonucleotides listed in Table 2 or 3, or derived from a sequence described in Table 2 or 3, or a complementary scquence thereof.
26. A method as claimed in claim 25 wherein each oligonucleotide probe is selected from the oligonucleotides listed in Table 2, or derived from a scquence described in Table 2, or a complementary sequence thercof.
27. A method as claimed in claim 25 wherein each oligonucleotide probe is selected from the oligonucleotides listed in Table 3, or derived from a scquence described in Table 3, or a complementary sequence thereof.
28. A method as claimed in any one of claims 25 to 27 wherein said set additionally comprises one or more oligonucleotide probes selected from the oligonucleotides listed in Table 4, or derived from a scquence described in Table 4, or a complementary sequence thereof.
29. A mcthod as claimed in any one of claims 25 to 28 wherein said derived oligonucleotide of Table 2, 3 or 4 is a part of a gene described by its Accession number in Table 2, 5 or 6, respectively, or a complementary sequence thereof.
30. A method as claimed in any one of claims 1 to 29 wherein said set consists of from 10 to 500 probes.
31. A method as claimed in any one of claims 1 to 30 wherein said set of probcs are immobilized on one or more solid supports. Amended sheet: 9" Junc 2008
32. A method as claimed in any one of claims 1 to 31 wherein said cells are not disease cells, have not contacted disease cells and do not originate from the site of disease.
33. A method as claimed in any one of claims 1 to 32 wherein said sample has been obtained from a site distant to the site of disease.
34. A method as claimed in any one of claims 1 to 33 wherein said samplc 1s tissue, body fluid or body waste.
35. A method as claimed in claim 34 wherein said sample is peripheral blood.
36. A method as claimed in any onc of claims 1 to 35 wherein said cancer is stomach, lung, breast, prostate gland, bowel, skin, colon or ovary cancer.
37. A method as claimed in claim 35 wherein said cancer in breast cancer.
38. A method as claimed in any one of claims 1 to 37 wherein said organism is a mammal.
39. A method as claimed in claim 38 wherein said mammal is human.
40. A method as claimed in any one of claims 1 to 39 wherein at least one of said probes in said set is suitable for diagnosing, identifying or monitoring at lcast two of said cancers or stages thereof.
41. A method of diagnosis or identification or monitoring as claimed in any one of claims 3 to 40 for the diagnosis, identification or monitoring of two or more cancers or stages thereof in an organism, wherein said test pattern produced in step c) of the diagnostic method is compared in step d) to at least two standard diagnostic patterns prepared as defined in any onc of claims 1 or 4 to 40, wherein each standard diagnostic pattern is a pattern gencrated for a different cancer or stage thereof. Amended sheet: 9™ June 2008
42. A set of less than 500 oligonucleotide probes as defined in any one of claims 1 to 41.
43. Ait for performing a method as claimed in any one of claims 1 to 41 comprising a set of oligonucleotide probes as defined in claim 42 immobilized on one or more solid supports.
44. A kit as claimed in claim 43 additionally comprising a package insert detailing how the method should be performed.
45. The use of a sct of oligonucleotide probes or a kit as defined in any one of claims 42 to 44 to determine the genc expression pattern of a cell which pattern reflects the level of gene expression of genes to which said oligonucleotide probes bind, comprising at least the steps of: a) isolating mRNA from said cell, which may optionally be reverse transcribed to cDNA; b) hybridizing the mRNA or cDNA of step (a) to a set of oligonucleotide probes or a kit as defined in any one of claims 42 to 44; and ¢) assessing the amount of mRNA or cDNA hybridizing to each of said probes to produce said pattern.
46. A method as claimed in claim 1 substantially as herein described with reference to cither example 1 or 2.
47. A method as claimed in claim 2 substantially as herein described with reference to cither example 1 or 2.
48. A method as claimed in claim 3 substantially as herein described with reference to either example 1 or 2. Amended sheet: 9" June 2008
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB0412301.4A GB0412301D0 (en) | 2004-06-02 | 2004-06-02 | Product and method |
Publications (1)
Publication Number | Publication Date |
---|---|
ZA200610644B true ZA200610644B (en) | 2008-07-30 |
Family
ID=32696533
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
ZA200610644A ZA200610644B (en) | 2004-06-02 | 2005-06-02 | Oligonucleotides for cancer diagnosis |
Country Status (17)
Country | Link |
---|---|
US (2) | US8105773B2 (en) |
EP (1) | EP1766056B1 (en) |
JP (1) | JP5060945B2 (en) |
CN (1) | CN101027410A (en) |
AP (1) | AP2006003862A0 (en) |
AT (1) | ATE493511T1 (en) |
AU (1) | AU2005250219B2 (en) |
CA (1) | CA2568889A1 (en) |
DE (1) | DE602005025633D1 (en) |
DK (1) | DK1766056T3 (en) |
ES (1) | ES2358910T3 (en) |
GB (1) | GB0412301D0 (en) |
NO (1) | NO20065939L (en) |
NZ (1) | NZ551797A (en) |
SI (1) | SI1766056T1 (en) |
WO (1) | WO2005118851A1 (en) |
ZA (1) | ZA200610644B (en) |
Families Citing this family (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2006128082A2 (en) * | 2005-05-26 | 2006-11-30 | The Johns Hopkins University | Biomarkers for breast cancer |
JP2007275027A (en) * | 2006-04-12 | 2007-10-25 | Fujifilm Corp | Method for diagnosing cancer using cancer-related deletion gene marker |
JP5208477B2 (en) * | 2007-10-12 | 2013-06-12 | 生化学工業株式会社 | Carrier for gene subtraction and use thereof |
WO2011063232A1 (en) * | 2009-11-20 | 2011-05-26 | University Of Louisville Research Foundation, Inc. | Biomarkers of cancer |
GB201000688D0 (en) * | 2010-01-15 | 2010-03-03 | Diagenic Asa | Product and method |
CN110444287B (en) * | 2011-04-29 | 2024-02-06 | 癌症预防和治疗有限公司 | Methods for identifying and diagnosing pulmonary diseases using a classification system and kits therefor |
KR20140099874A (en) * | 2011-11-02 | 2014-08-13 | 나노프리시젼 프로덕츠 인코포레이션 | Castellated optical fiber cable retention structure |
JP6486826B2 (en) | 2012-10-01 | 2019-03-20 | ミレニアム ファーマシューティカルズ, インコーポレイテッドMillennium Pharmaceuticals, Inc. | Biomarkers and methods for predicting response to inhibitors and uses thereof |
EP2922971B1 (en) * | 2012-11-20 | 2018-10-17 | Lund, Tore Eiliv | Gene expression profile in diagnostics |
JP6498210B2 (en) | 2013-12-29 | 2019-04-10 | チュアラボ オンコロジー, インコーポレーテッドCurelab Oncology, Inc. | Methods and compositions relating to P62 / SQSTM1 for the treatment and prevention of inflammation-related diseases |
KR102523245B1 (en) * | 2014-06-18 | 2023-04-20 | 도레이 카부시키가이샤 | Esophageal cancer detection kit or device, and detection method |
FR3029210A1 (en) | 2014-11-27 | 2016-06-03 | Centre Nat De La Rech Scient (Cnrs) | METHOD OF PROGNOSIS OF HEMOPATHIES |
US10140708B2 (en) * | 2016-01-21 | 2018-11-27 | Riverside Research Institute | Method for gestational age estimation and embryonic mutant detection |
GB201611738D0 (en) * | 2016-07-05 | 2016-08-17 | Cambridge Entpr Ltd | Biomarkers for inflammatory bowel disease |
WO2019036388A1 (en) * | 2017-08-14 | 2019-02-21 | Raytheon Company | Subtraction algorithm for detection of tumors |
CN109529018B (en) * | 2019-01-10 | 2021-11-30 | 南方医科大学南方医院 | Application of ATXN2L in preparation of drug for reversing oxaliplatin resistance of gastric cancer |
CN109596611A (en) * | 2019-01-21 | 2019-04-09 | 湖北紫荆生物技术有限公司 | A kind of water body calcium content test agent and water body calcium content measurement method |
US11475558B2 (en) | 2019-11-13 | 2022-10-18 | Raytheon Company | Organ isolation in scan data |
US11282209B2 (en) | 2020-01-10 | 2022-03-22 | Raytheon Company | System and method for generating contours |
US11562512B2 (en) | 2020-12-09 | 2023-01-24 | Raytheon Company | System and method for generating and displaying contours |
US11893745B2 (en) | 2020-12-09 | 2024-02-06 | Raytheon Company | System and method for generating and displaying contours |
CN113913436B (en) * | 2021-11-18 | 2023-09-29 | 天津市肿瘤医院(天津医科大学肿瘤医院) | Reverse transcription transposable gene L1-ATP8B1, application thereof as lung squamous carcinoma marker and medicine for treating lung squamous carcinoma |
CN115976212B (en) * | 2022-12-21 | 2024-06-11 | 四川大学华西医院 | Probe library and kit for detecting bone tumor diagnosis related genes |
Family Cites Families (77)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4981783A (en) | 1986-04-16 | 1991-01-01 | Montefiore Medical Center | Method for detecting pathological conditions |
US6040138A (en) | 1995-09-15 | 2000-03-21 | Affymetrix, Inc. | Expression monitoring by hybridization to high density oligonucleotide arrays |
US5871928A (en) | 1989-06-07 | 1999-02-16 | Fodor; Stephen P. A. | Methods for nucleic acid analysis |
US5800992A (en) | 1989-06-07 | 1998-09-01 | Fodor; Stephen P.A. | Method of detecting nucleic acids |
US5925525A (en) | 1989-06-07 | 1999-07-20 | Affymetrix, Inc. | Method of identifying nucleotide differences |
JP2802125B2 (en) | 1989-06-23 | 1998-09-24 | キヤノン株式会社 | Nucleic acid detection method |
US5474796A (en) | 1991-09-04 | 1995-12-12 | Protogene Laboratories, Inc. | Method and apparatus for conducting an array of chemical reactions on a support surface |
DE69214243T2 (en) | 1991-09-23 | 1997-02-06 | Pfizer | Methods for the detection of specific mRNA and DNA in cells |
GB2260811B (en) | 1991-10-23 | 1995-07-05 | Yorkshire Cancer Research Camp | Detection of malignant tumours |
AU691469B2 (en) | 1992-03-04 | 1998-05-21 | Synaptic Pharmaceutical Corporation | DNA encoding taurine and gaba transporters and uses thereof |
US6114114A (en) | 1992-07-17 | 2000-09-05 | Incyte Pharmaceuticals, Inc. | Comparative gene transcript analysis |
US5633137A (en) | 1992-12-01 | 1997-05-27 | The University Of South Florida | Method for measuring specific gene expression: transcriptional activity per gene dose |
US5677125A (en) | 1994-01-14 | 1997-10-14 | Vanderbilt University | Method of detection and diagnosis of pre-invasive cancer |
US5830645A (en) | 1994-12-09 | 1998-11-03 | The Regents Of The University Of California | Comparative fluorescence hybridization to nucleic acid arrays |
US5545531A (en) | 1995-06-07 | 1996-08-13 | Affymax Technologies N.V. | Methods for making a device for concurrently processing multiple biological chip assays |
US6156502A (en) | 1995-12-21 | 2000-12-05 | Beattie; Kenneth Loren | Arbitrary sequence oligonucleotide fingerprinting |
AU2253397A (en) | 1996-01-23 | 1997-08-20 | Affymetrix, Inc. | Nucleic acid analysis techniques |
WO1997029212A1 (en) | 1996-02-08 | 1997-08-14 | Affymetrix, Inc. | Chip-based speciation and phenotypic characterization of microorganisms |
WO1998008083A1 (en) | 1996-08-20 | 1998-02-26 | Motorola Inc. | Method and apparatus for detecting predetermined molecular structures in a sample |
US6235474B1 (en) | 1996-12-30 | 2001-05-22 | The Johns Hopkins University | Methods and kits for diagnosing and determination of the predisposition for diseases |
US6190857B1 (en) | 1997-03-24 | 2001-02-20 | Urocor, Inc. | Diagnosis of disease state using MRNA profiles in peripheral leukocytes |
NO972006D0 (en) * | 1997-04-30 | 1997-04-30 | Forskningsparken I Aas As | New method for diagnosis of diseases |
US5994076A (en) | 1997-05-21 | 1999-11-30 | Clontech Laboratories, Inc. | Methods of assaying differential expression |
WO1999006831A2 (en) | 1997-07-29 | 1999-02-11 | Dorssers Lambert C J | Breast cancer resistance genes, methods for their detection and uses thereof |
US6607879B1 (en) | 1998-02-09 | 2003-08-19 | Incyte Corporation | Compositions for the detection of blood cell and immunological response gene expression |
US6174680B1 (en) * | 1998-12-30 | 2001-01-16 | Dana-Farber Cancer Institute, Inc. | Method for identifying mismatch repair glycosylase reactive sites, compounds and uses thereof |
CA2318789C (en) | 1998-02-25 | 2011-05-10 | The United States Of America As Represented By The Secretary Department Of Health And Human Services | Cellular arrays for rapid molecular profiling |
US6004755A (en) | 1998-04-07 | 1999-12-21 | Incyte Pharmaceuticals, Inc. | Quantitative microarray hybridizaton assays |
CN1313908A (en) | 1998-08-21 | 2001-09-19 | 纳克斯科尔公司 | Assays using crosslinkable immobilized nucleic acids |
US20090233276A1 (en) * | 1998-09-22 | 2009-09-17 | Oncomedx, Inc. | Method Enabling the Use of Extracellular Ribonucleic Acid (RNA) Extracted from Plasma or Serum to Detect, Monitor or Evaluate Cancer or Premalignant Conditions |
US6593084B2 (en) | 1998-10-13 | 2003-07-15 | Robert E. Bird | Carcinogen assay |
WO2000022172A1 (en) | 1998-10-13 | 2000-04-20 | Clontech Laboratories, Inc. | Nucleic acid arrays |
JP5213202B2 (en) | 1998-10-28 | 2013-06-19 | バイシス・インコーポレーテツド | Detection and use of cell arrays and markers of genetic disorders |
WO2000026412A1 (en) | 1998-11-02 | 2000-05-11 | Kenneth Loren Beattie | Nucleic acid analysis using sequence-targeted tandem hybridization |
US20040241726A1 (en) | 1999-01-06 | 2004-12-02 | Chondrogene Limited | Method for the detection of allergies related gene transcripts in blood |
US20050042630A1 (en) | 1999-01-06 | 2005-02-24 | Chondrogene Limited | Method for the detection of asthma related gene transcripts in blood |
US20040248169A1 (en) | 1999-01-06 | 2004-12-09 | Chondrogene Limited | Method for the detection of obesity related gene transcripts in blood |
US20040265868A1 (en) | 1999-01-06 | 2004-12-30 | Chondrogene Limited | Method for the detection of depression related gene transcripts in blood |
US20050123938A1 (en) | 1999-01-06 | 2005-06-09 | Chondrogene Limited | Method for the detection of osteoarthritis related gene transcripts in blood |
WO2000040749A2 (en) | 1999-01-06 | 2000-07-13 | Genenews Inc. | Method for the detection of gene transcripts in blood and uses thereof |
US20040241728A1 (en) | 1999-01-06 | 2004-12-02 | Chondrogene Limited | Method for the detection of lung disease related gene transcripts in blood |
US20040265869A1 (en) | 1999-01-06 | 2004-12-30 | Chondrogene Limited | Method for the detection of type II diabetes related gene transcripts in blood |
US20050003394A1 (en) | 1999-01-06 | 2005-01-06 | Chondrogene Limited | Method for the detection of rheumatoid arthritis related gene transcripts in blood |
US20040248170A1 (en) | 1999-01-06 | 2004-12-09 | Chondrogene Limited | Method for the detection of hyperlipidemia related gene transcripts in blood |
US7473528B2 (en) | 1999-01-06 | 2009-01-06 | Genenews Inc. | Method for the detection of Chagas disease related gene transcripts in blood |
US20040241727A1 (en) | 1999-01-06 | 2004-12-02 | Chondrogene Limited | Method for the detection of schizophrenia related gene transcripts in blood |
DE19937187A1 (en) * | 1999-08-06 | 2001-03-01 | Qiagen Gmbh | Automated protein purification in multiwell format through vacuum filtration |
AU2001278076A1 (en) * | 2000-07-26 | 2002-02-05 | Applied Genomics, Inc. | Bstp-5 proteins and related reagents and methods of use thereof |
US7108969B1 (en) * | 2000-09-08 | 2006-09-19 | Affymetrix, Inc. | Methods for detecting and diagnosing oral cancer |
US6955876B2 (en) * | 2000-11-01 | 2005-10-18 | Kane Michael D | Compositions and systems for identifying and comparing expressed genes (mRNAs) in eukaryotic organisms |
WO2002059367A2 (en) | 2000-11-30 | 2002-08-01 | Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical College | Diagnostic microarray for inflammatory bowel disease, crohn's disease and ulcerative colitis |
EP1350114A2 (en) * | 2001-01-12 | 2003-10-08 | Yale University | Detection of survivin in the biological fluids of cancer patients |
US20020132274A1 (en) * | 2001-01-17 | 2002-09-19 | Nevalainen Marja T. | Diagnostic and monitorings methods for cancer |
AU2002253878A1 (en) | 2001-01-25 | 2002-08-06 | Gene Logic, Inc. | Gene expression profiles in breast tissue |
AU2002237124B2 (en) | 2001-02-28 | 2008-03-06 | Chondrogene Inc. | Compositions and methods relating to osteoarthritis |
WO2002088750A2 (en) * | 2001-05-02 | 2002-11-07 | Oxford Glycosciences (Uk) Ltd | Proteins, genes and their use for diagnosis and treatment of breast cancer |
US20020169560A1 (en) | 2001-05-12 | 2002-11-14 | X-Mine | Analysis mechanism for genetic data |
US7335467B2 (en) * | 2001-05-15 | 2008-02-26 | Ludwig Institute For Cancer Research | Breast cancer antigens |
US20030036077A1 (en) * | 2001-05-31 | 2003-02-20 | Chronix Biomedical | Methods for generating an mRNA expression profile from an ecellular mRNA containing blood sample and using the same to identify functional state markers |
WO2003001985A2 (en) * | 2001-06-28 | 2003-01-09 | Dermtech International | Method for detection of melanoma |
WO2003008963A1 (en) * | 2001-07-18 | 2003-01-30 | University Of Louisville Research Foundation, Inc. | E-genechip online web service for data mining bioinformatics |
US7229774B2 (en) * | 2001-08-02 | 2007-06-12 | Regents Of The University Of Michigan | Expression profile of prostate cancer |
US7622260B2 (en) * | 2001-09-05 | 2009-11-24 | The Brigham And Women's Hospital, Inc. | Diagnostic and prognostic tests |
US7371736B2 (en) * | 2001-11-07 | 2008-05-13 | The Board Of Trustees Of The University Of Arkansas | Gene expression profiling based identification of DKK1 as a potential therapeutic targets for controlling bone loss |
US7668659B2 (en) * | 2001-11-07 | 2010-02-23 | The Board Of Trustees Of The University Of Arkansas | Diagnosis and classification of multiple myeloma |
US20030232350A1 (en) * | 2001-11-13 | 2003-12-18 | Eos Biotechnology, Inc. | Methods of diagnosis of cancer, compositions and methods of screening for modulators of cancer |
WO2003041562A2 (en) * | 2001-11-14 | 2003-05-22 | Whitehead Institute For Biomedical Research | Molecular cancer diagnosis using tumor gene expression signature |
AU2002361908A1 (en) * | 2001-12-31 | 2003-07-24 | Quark Biotech, Inc. | Methods for identifying marker genes for cancer |
US20030195431A1 (en) * | 2002-04-10 | 2003-10-16 | Sukhatme Vikas P. | Observation of effects of body fluids on indicator cells for disease diagnosis |
WO2003100000A2 (en) * | 2002-05-24 | 2003-12-04 | Tularik Inc. | Amplification and overexpression of oncogenes |
CN100478689C (en) * | 2002-08-30 | 2009-04-15 | 肿瘤疗法科学股份有限公司 | Polypeptide and use thereof |
EP1581652A4 (en) | 2002-09-12 | 2008-02-13 | Genenews Corp | Identification of sequences particularly useful for the diagnosis and identification of therapeutic targets for osteoarthritis |
US20060024692A1 (en) * | 2002-09-30 | 2006-02-02 | Oncotherapy Science, Inc. | Method for diagnosing non-small cell lung cancers |
GB0227238D0 (en) * | 2002-11-21 | 2002-12-31 | Diagenic As | Product and method |
US7910550B2 (en) * | 2004-04-16 | 2011-03-22 | Stc.Unm | Human kunitz-type inhibitor with enhanced antifibrinolytic activity |
EP1809765A2 (en) * | 2004-11-04 | 2007-07-25 | Roche Diagnostics GmbH | Classification of acute myeloid leukemia |
US20070178503A1 (en) * | 2005-12-19 | 2007-08-02 | Feng Jiang | In-situ genomic DNA chip for detection of cancer |
-
2004
- 2004-06-02 GB GBGB0412301.4A patent/GB0412301D0/en not_active Ceased
-
2005
- 2005-06-02 ZA ZA200610644A patent/ZA200610644B/en unknown
- 2005-06-02 CA CA002568889A patent/CA2568889A1/en not_active Abandoned
- 2005-06-02 CN CN200580026265.8A patent/CN101027410A/en active Pending
- 2005-06-02 NZ NZ551797A patent/NZ551797A/en not_active IP Right Cessation
- 2005-06-02 DE DE602005025633T patent/DE602005025633D1/de active Active
- 2005-06-02 AP AP2006003862A patent/AP2006003862A0/en unknown
- 2005-06-02 SI SI200531251T patent/SI1766056T1/en unknown
- 2005-06-02 JP JP2007514130A patent/JP5060945B2/en not_active Expired - Fee Related
- 2005-06-02 AU AU2005250219A patent/AU2005250219B2/en not_active Ceased
- 2005-06-02 WO PCT/GB2005/002180 patent/WO2005118851A1/en active Application Filing
- 2005-06-02 US US11/628,300 patent/US8105773B2/en not_active Expired - Fee Related
- 2005-06-02 AT AT05747218T patent/ATE493511T1/en not_active IP Right Cessation
- 2005-06-02 DK DK05747218.5T patent/DK1766056T3/en active
- 2005-06-02 ES ES05747218T patent/ES2358910T3/en active Active
- 2005-06-02 EP EP05747218A patent/EP1766056B1/en active Active
-
2006
- 2006-12-20 NO NO20065939A patent/NO20065939L/en not_active Application Discontinuation
-
2012
- 2012-01-05 US US13/344,088 patent/US20120295800A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
NZ551797A (en) | 2010-03-26 |
JP2008501332A (en) | 2008-01-24 |
AU2005250219A1 (en) | 2005-12-15 |
US20120295800A1 (en) | 2012-11-22 |
AU2005250219B2 (en) | 2011-01-20 |
SI1766056T1 (en) | 2011-05-31 |
EP1766056B1 (en) | 2010-12-29 |
ATE493511T1 (en) | 2011-01-15 |
CA2568889A1 (en) | 2005-12-15 |
US8105773B2 (en) | 2012-01-31 |
DE602005025633D1 (en) | 2011-02-10 |
JP5060945B2 (en) | 2012-10-31 |
DK1766056T3 (en) | 2011-03-14 |
GB0412301D0 (en) | 2004-07-07 |
US20080026385A1 (en) | 2008-01-31 |
WO2005118851A1 (en) | 2005-12-15 |
CN101027410A (en) | 2007-08-29 |
AP2006003862A0 (en) | 2006-12-31 |
EP1766056A1 (en) | 2007-03-28 |
ES2358910T3 (en) | 2011-05-16 |
NO20065939L (en) | 2007-02-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ZA200610644B (en) | Oligonucleotides for cancer diagnosis | |
Hodge et al. | Conserved cell types with divergent features in human versus mouse cortex | |
US10697975B2 (en) | Methods for identifying, diagnosing, and predicting survival of lymphomas | |
CN105917008B (en) | Gene expression panels for prognosis of prostate cancer recurrence | |
US20020187480A1 (en) | Diagnostic method for assessing a condition of a performance animal | |
KR20180123020A (en) | Use of DNA fragment size to determine copy number variation | |
US20170096713A1 (en) | Applications of single molecule sequencing | |
US20060057608A1 (en) | Producing, cataloging and classifying sequence tags | |
CN107771221A (en) | The abrupt climatic change analyzed for screening for cancer and fetus | |
EP2121988B1 (en) | Prostate cancer survival and recurrence | |
WO2011086174A2 (en) | Diagnostic gene expression platform | |
CN109679957B (en) | IncRNA L NC-004208 and application of detection reagent thereof in preparation of brain glioma prognosis reagent | |
Kim et al. | Single-cell transcriptomics: a novel precision medicine technique in nephrology | |
Liehr | Cytogenetics and molecular cytogenetics | |
US20180051342A1 (en) | Prostate cancer survival and recurrence | |
Chang et al. | Microarray analysis of stem cells and differentiation | |
ZA200503797B (en) | Product and method | |
AU2007277142B2 (en) | Methods for identifying, diagnosing, and predicting survival of lymphomas | |
Cattelan | Development of a NGS workflow for diagnostic applications in oncology | |
CN114517232A (en) | Method, model and marker for judging endometrial receptivity in noninvasive mode |