WO2000077257A1 - Reseaux specifiques de genes et leur utilisation - Google Patents

Reseaux specifiques de genes et leur utilisation Download PDF

Info

Publication number
WO2000077257A1
WO2000077257A1 PCT/US2000/015850 US0015850W WO0077257A1 WO 2000077257 A1 WO2000077257 A1 WO 2000077257A1 US 0015850 W US0015850 W US 0015850W WO 0077257 A1 WO0077257 A1 WO 0077257A1
Authority
WO
WIPO (PCT)
Prior art keywords
array
hybridization
gene
gene transcripts
polynucleotide
Prior art date
Application number
PCT/US2000/015850
Other languages
English (en)
Inventor
Narayan Baidya
Yii-Der Ida Chen
Julie Holding
Yie-Teh Yu
Original Assignee
Clingenix, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Clingenix, Inc. filed Critical Clingenix, Inc.
Priority to EP00939721A priority Critical patent/EP1185701A1/fr
Priority to AU54766/00A priority patent/AU5476600A/en
Priority to CA002375220A priority patent/CA2375220A1/fr
Publication of WO2000077257A1 publication Critical patent/WO2000077257A1/fr

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6834Enzymatic or biochemical coupling of nucleic acids to a solid phase
    • C12Q1/6837Enzymatic or biochemical coupling of nucleic acids to a solid phase using probe arrays or probe chips
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/30Microarray design
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/10Gene or protein expression profiling; Expression-ratio estimation or normalisation
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression

Definitions

  • TECHNICAL FIELD This invention is in the field of genetic analysis. Specifically, the invention relates to the generation of an array of polynucleotide probes comprising sequences complementary to the 3' untranslated region of a gene transcript, whose chromosomal location has been defined.
  • the compositions and methods embodied in the present invention are particularly useful for high throughput screening of differential gene expression patterns among multiple subjects.
  • the structure and biological behavior of a cell is determined by the pattern of gene expression within that cell.
  • Each human cell contains approximately three billion base pairs encoding between 50, 000 to 100, 000 genes (Schuler et al. (1996) Science 274:540- 546; Guyer et al. (1995) Proc. Natl. Acad. Scie. USA 92:10841-10848; Rowen et al. (1997)
  • the array-based technology involves hybridization of a pool of target polynucleotides corresponding to gene transcripts of a test subject to an array of tens and thousands of probe sequences immobilized on the array substrate.
  • the technique allows simultaneous detection of multiple gene transcripts and yields quantitative information on the relative abundance of each gene transcript expressed in a test subject.
  • By comparing the hybridization patterns generated by hybridizing different pools of target polynucleotides to the arrays one can readily obtain the relative transcript abundance in two pools of target samples.
  • the analysis can be extended to detecting differential expression of genes between diseased and normal tissues, among different types of tissues and cells, amongst cells at different cell-cycle points or at different developmental stages, and amongst cells that are subjected to various environmental stimuli or lead drugs.
  • WO 97/10365 describes an oligonucleotide array made of synthetically generated oligonucleotides of 20-500 nucleotides in length; each of the following references WO 98/53103, Duggan et al. (1999) Nature Genetics Supplement 21: 10-14, Wang et al. (1999) Gene 229: 101-108, Khan et al. (1999) Electrophoresis 20: 223- 229, and Chen et al. (1998) Genomics 51: 313-324, describes a DNA microarray for monitoring changes in gene expression profile of one or multiple test subjects. Neither of these references discloses arrays necessarily contains probes having minimum secondary
  • a principal aspect of the present invention is the design of arrays of polynucleotide probes having reduced secondary structures. Such arrays are highly specific for simultaneous detection of differential expression of multiple genes. Accordingly, the present invention provides an array comprising a plurality of polynucleotide probes immobilized on a solid support, which exhibits the following characteristics: (a) the plurality of polynucleotide probes corresponds to a multiplicity of gene transcripts; (b) each polynucleotide probe of the plurality is localized to a predetermined region on a solid support; (c) each polynucleotide probe is from about 50 to about 500 nucleotides in length; and (d) each polynucleotide probe is complementary to 3' untranslated sequence of a gene transcript, said untranslated sequence having a defined chromosomal location.
  • the arrays of the present invention further comprise control probes which can be normalization control probes, expression level control probes, and/or mismatch control probes.
  • the arrays comprise target polynucleotides corresponding to gene transcripts expressed in a subject, wherein the target polynucleotides are bound to the polynucleotide probes in form of stable target-probe complexes.
  • the plurality of polynucleotide probes immobilized on the arrays may comprise at least about 10 polynucleotides, each being complementary to a distinct gene transcript.
  • the plurality of polynucleotide probes comprises at least about 100 polynucleotides.
  • an array comprises a plurality of sequence-tagged site (STS) tags.
  • the predetermined region of the invention array comprises at least 10 single-stranded polynucleotides that are complementary to the same
  • the predetermined region may also comprise at least 100 single-stranded polynucleotides that are complementary to the same gene transcript. In a preferred embodiment, the predetermined region comprises single-stranded polynucleotides of identical sequences.
  • the solid support on which the probes are arrayed can be flexible or rigid.
  • the solid support is made of one or more substances selected from the group consisting of nitrocellulose, nylon, polypropylene, glass, and silicon.
  • the present invention also provides a method of preparing and using an array having the above-mentioned characteristics.
  • the present invention provides a method of simultaneously detecting expression of a multiplicity of gene transcripts of a subject.
  • the method comprises the steps of: (a) contacting more than one labeled target polynucleotides corresponding to gene transcripts of said subject with an array of polynucleotide probes as disclosed herein under the conditions sufficient to produce stable target-probe complexes; and (b) detecting the formation of the stable target-probe complexes, thereby detecting expression of a multiplicity of gene transcripts.
  • the invention provides a method of detecting differential expression of a multiplicity of gene transcripts of at least two subjects.
  • the method involves (a) contacting more than one labeled target polynucleotides corresponding to gene transcripts of a first subject with an invention array, under the conditions sufficient to produce stable target-probe complexes that form a first hybridization pattern; (b) contacting more than one labeled target polynucleotides corresponding to gene transcripts of a second subject with an invention array, under the conditions sufficient to produce stable target- probe complexes that form a second hybridization pattern; and (c) comparing the hybridization patterns, thereby detecting the differential expression of a multiplicity of gene transcripts of the subjects.
  • the hybridization patterns are generated on the same array. In another aspect of the embodiment, the hybridization patterns are generated on different arrays.
  • the target polynucleotides can be DNA or RNA molecules, and preferably cDNAs.
  • the present invention also includes a computer readable medium having recorded thereon arrays of polynucleotide probes as disclosed herein. Featured computer media
  • H include magnetic storage medium, optical storage medium, electrical storage medium, hybrid storage medium of any of these categories.
  • the invention further provides a computer-based system for detecting differential expression of a multiplicity of gene transcripts indicated by a difference in hybridization patterns on an array of polynucleotide probes.
  • the computer-based system comprises: a) a data storage device comprising a reference hybridization pattern and a test hybridization pattern, wherein the reference hybridization pattern is generated by hybridizing an array of polynucleotide probes having the above-described characteristics, with more than one labeled target polynucleotides corresponding to gene transcripts expressed in a control; and wherein the test hybridization pattern is generated by hybridizing an invention array of polynucleotide probes with more than one labeled target polynucleotides corresponding to gene transcripts expressed in a test subject; b) a search device for comparing the test hybridization pattern to the reference hybridization pattern of the data storage device of step (a) to detect the differences in hybridization patterns; and c) a retrieval device for obtaining said differences in hybridization patterns of step (
  • Also embodied in the invention is a method of determining differential expression of a multiplicity of gene transcripts of at least two subjects using a computer.
  • kits containing the invention arrays for simultaneous detection of expression of multiple gene transcripts are provided by the invention.
  • polynucleotide refers to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three-dimensional structure, and may perform any function, known or unknown.
  • polynucleotides coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers.
  • a polynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs.
  • modifications to the nucleotide structure may be imparted before or after assembly of the polymer.
  • the sequence of nucleotides may be interrupted by non-nucleotide components.
  • a polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.
  • a “polynucleotide probe” refers to a polynucleotide used for detecting or identifying its corresponding target polynucleotide in a hybridization reaction.
  • a “gene” refers to a polynucleotide containing at least one open reading frame that is capable of encoding a particular protein after being transcribed and translated.
  • the phrase "3" untranslated sequences" as applied to a gene transcript refers to the 3' end sequences located immediately outside the open reading frame of the gene transcript. The part of 3' untranslated sequences that has a defined chromosomal location excludes the poly- A tail located at the very end of the 3' untranslated region.
  • Genes of a specific developmental origin refer to genes expressed at certain but not all developmental stages. For instance, a gene may be of embryonic or adult origin depending on the stage during which the gene is expressed.
  • a disease-associated gene refers to any gene which is yielding transcription or translation products at an abnormal level or in an abnormal form in cells derived from a disease-affected tissues compared with tissues or cells of a control. It may be a gene that becomes expressed at an abnormally high level; it may be a gene that becomes expressed at an abnormally low level, where the altered expression correlates with the occurrence and/or progression of the disease.
  • a disease-associated gene also refers to gene possessing mutation(s) or genetic variation that is directly responsible or is in linkage disequilibrium with gene(s) that is responsible for the etiology of a disease. The transcribed or translated products may be known or unknown, and may be at normal or abnormal level.
  • RNA also known as gene transcript
  • cDNA corresponds to the RNA from which it has been produced, such as by a reverse transcription reaction, or by chemical synthesis of a DNA based upon knowledge of the RNA sequence.
  • cDNA also corresponds to the gene that encodes the RNA. Polynucleotides may be said to correspond even when one of the pair is derived from only a portion of the other.
  • a gene "database” denotes a set of stored data which represent a collection of sequences including nucleotide and peptide sequences, which in turn represent a collection of biological reference materials.
  • expression refers to the process by which a polynucleotide is transcribed into mRNA and/or the process by which the transcribed mRNA (also referred to as “transcript”) is subsequently being translated into peptides, polypeptides, or proteins.
  • the transcripts and the encoded polypeptides are collectedly referred to as gene product. If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in an eukaryotic cell.
  • a "Differential expression” refers to alterations in the abundance or the expression pattern of a gene product. An alteration in “expression pattern” may be indicated by a change in sub-tissue distribution, or a change in hybridization pattern reviewed on an array of the present invention.
  • a “primer” is a short polynucleotide, generally with a free 3' -OH group, that binds to a target or "template” potentially present in a sample of interest by hybridizing with the target, and thereafter promoting polymerization of a polynucleotide complementary to the target.
  • hybridize refers to the ability of the polynucleotide to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues in a hybridization reaction.
  • the hydrogen bonding may occur by Watson-Crick base pairing, Hoogstein binding, or in any other sequence-specific manner.
  • the complex may comprise two strands forming a duplex structure, three or more strands forming a multi-stranded complex, a single self-hybridizing strand, or any combination of these.
  • the hybridization reaction may constitute a step in a more extensive process, such as the initiation of a PCR reaction, or the enzymatic cleavage of a polynucleotide by a ribozyme.
  • Hybridization can be performed under conditions of different "stringency". Relevant conditions include temperature, ionic strength, time of incubation, the presence of additional solutes in the reaction mixture such as formamide, and the washing procedure.
  • Higher stringency conditions are those conditions, such as higher temperature and lower sodium ion concentration, which require higher minimum complementarity between hybridizing elements for a stable hybridization complex to form.
  • a low stringency hybridization reaction is carried out at about 40 C in 10 x SSC or a solution of equivalent ionic strength/temperature.
  • a moderate stringency hybridization is typically performed at about 50 °C in 6 x SSC, and a high stringency hybridization reaction is generally performed at about 60 °C in 1 x SSC.
  • a double-stranded polynucleotide can be “complementary” or “homologous” to another polynucleotide, if hybridization can occur between one of the strands of the first polynucleotide and the second.
  • “Complementarity” or “homology” is quantifiable in terms of the proportion of bases in opposing strands that are expected to form hydrogen bonding with each other, according to generally accepted base-pairing rules.
  • Luminescence is the term commonly used to refer to the emission of light from a substance for any reason other than a rise in its temperature.
  • atoms or molecules emit photons of electromagnetic energy (e.g., light) when then move from an "excited state” to a lower energy state (usually the ground state); this process is often referred to as "radiative decay”.
  • electromagnetic energy e.g., light
  • photoluminescence the luminescence process is referred to as "photoluminescence”.
  • the exciting cause is an electron, the luminescence process is referred to as "electroluminescence”.
  • electroluminescence results from the direct injection and removal of electrons to form an electron-hole pair, and subsequent recombination of the electron-hole pair to emit a photon.
  • Luminescence which results from a chemical reaction is usually referred to as “chemiluminescence”.
  • Luminescence produced by a living organism is usually referred to as “bioluminescence”.
  • photoluminescence is the result of a spin-allowed transition (e.g., a single-singlet transition, triplet-triplet transition)
  • the photoluminescence process is usually referred to as "fluorescence”.
  • fluorescence emissions do not persist after the exciting cause is removed as a result of short-lived excited states which may rapidly relax through such spin-allowed transitions.
  • photoluminescence is the result of a spin-forbidden transition (e.g. , a triplet-singlet transition)
  • the photoluminescence process is usually referred to as "phosphorescence”.
  • phosphorescence emissions persist long after the exciting cause is removed as a result of long-lived excited states which may relax only through such spin-forbidden transitions.
  • a "luminescent label" of the present invention may have any one of the above- described properties.
  • predefined region refers to a localized area on the surface of a solid support, which is intended for registration or tracking the identify of the polynucleotide probes that are immobilized onto the predefined region.
  • a “subject” as used herein refers to a biological entity containing expressed genetic materials.
  • the biological entity is preferably a vertebrate, preferably a mammal, more preferably a human. Tissues, cells and their progeny of a biological entity obtained in vivo or cultured in vitro are also encompassed.
  • a "control” is an alternative subject or sample used in an experiment for comparison purpose. A control can be "positive” or "negative”.
  • a positive control a subject or a sample from a subject, exhibiting such differential expression and syndromes characteristic of that disease
  • a negative control a subject or a sample from a subject lacking the differential expression and clinical syndrome of that disease
  • a central aspect of the present invention is the design of an array of polynucleotide probes applicable for detecting differential expression of a multiplicity of genes.
  • the subject arrays Distinguished from the previously described cDNA or oligonucleotide microarrays, the subject arrays have the following unique characteristics: (a) the plurality of polynucleotide probes constituting the array corresponds to a multiplicity of gene transcripts; (b) each polynucleotide probe of the plurality is localized to a predetermined region on a solid support; (c) each polynucleotide probe is from about 50 to about 500 nucleotides in length; (d) each polynucleotide probe is complementary to 3' untranslated sequence of a gene transcript, said untranslated sequence having a defined chromosomal location.
  • a preferred array comprises sequence-tagged site (STS) probes whose chromosomal locations have been identified.
  • a selected probe is specific to an expressed gene transcript, and unique to the entire expressed genome. Such unique probe lacks substantial sequence homology with any other existing gene transcripts when optimally aligned, and thus having a low probability of cross-hybridizing with sequences found in any other genes.
  • the 3' untranslated sequence of a gene transcript is highly specific; it typically exhibits little sequence similarity to other expressed genes.
  • ⁇ io examples of these programs are Blast , Fasta , DNA Star, MegAlign, and GeneJocky. Any sequence databases that contains DNA sequences corresponding to a gene or a segment thereof can be used for sequence analysis. Commonly employed databases include but are not limited to GenBank, EMBL, DDBJ, PDB, SWISS-PROT, EST, STS, GSS, and HTGS. Sequence similarity can be discerned by aligning the probe sequence against a DNA sequence database. Common parameters for determining the extent of homology set forth by one or more of the aforementioned alignment programs include p value and percent sequence identity. P value is the probability that the alignment is produced by chance. For a single alignment, the p value can be calculated according to Karlin et al.
  • Percent sequence identify is defined by the ratio of the number of nucleotide matches between the query sequence and the known sequence when the two are optimally aligned.
  • a probe sequence is considered to have no substantial homology when the region of alignment exhibits less than 20% of sequence identity, more preferably less than 10% identity, even more preferably less than
  • a second consideration of designing the subject array is to select probes which have minimal secondary structures and internal sequence homology. Extensive homology within the probe due to e.g., inverted repeats, promotes self-hybridization, and thus interfering the binding of the probe to the target sequences.
  • probes having similar thermal profiles and internal stability are chosen by selecting probes with comparable length and G/C content.
  • probes have 50 to 60% G+C composition.
  • probes to be arrayed have a minimal length of about 50 nucleotides, more preferably about 100 nucleotide, and even more preferably about 150.
  • probes of the subject arrays have a maximal length of about 500 nucleotides, more preferably about 400 nucleotides, and even more preferably about 300 nucleotides.
  • the probes are generated by amplifying genomic DNA using primers complementary to the 3' untranslated regions of genes of particular interest.
  • the types of arrays embodied in the present invention may differ in the nature of the polynucleotide probes immobilized thereon, and specifically the types of genes to which the probes correspond.
  • the types of genes may be characterized based on one or more of the following features: species origin, developmental origin, primary structural similarity, involvement in a particular biological process, association with a particular disease or disease stage, tissue, sub-tissue or cell-specific expression pattern, and subcellular location of the expressed gene product.
  • the arrays of the present invention comprise probes corresponding to gene transcripts expressed in a eukaryote cell, such as a cell derived from a vertebrate, preferably a mammal, more preferably a primate, even more preferably a human being.
  • the arrays contain probes capable of hybridizing to genes of a specific developmental origin, such as those expressed in an embryo or an adult, during ectoderm, endoderm or mesoderm formation in a multi-cellular organism.
  • the invention arrays comprise probes binding to a family of gene transcripts, or a sub-family of gene transcripts that share primary structural similarities. Structural similarities can be discerned with the aid of computer software described above.
  • Non-limiting examples of gene families include those encoding cell surface receptors, protein kinases (e.g. tyrosine, serine/threonine or histidine kinases), trimeric G-proteins, cytokines, SH2-, SH3-, PH-, PDZ-domain containing proteins, and any of those gene families published by Human
  • the arrays present probes recognizing gene transcripts involved in a specific biological process, including but not limited to cell cycle regulation, cell differentiation, apoptosis, chemotaxsis, cell motility and cytoskeletal rearrangement.
  • the invention arrays contain probes hybridizing to gene transcripts that are associated with a particular disease or with a specific disease stage.
  • genes include but are not limited to those associated with autoimmune diseases, obesity, hypertension, diabetes, neuronal and/or muscular degenerative diseases, cardiac diseases, endocrine disorders, any combinations thereof.
  • the arrays of the present invention comprise probes hybridizing to gene transcripts with restricted expression patterns.
  • Non-limiting exemplary gene transcripts of this class include those that are not ubiquitously expressed, but rather are differentially expressed in one or more of the body tissues including heart, liver, prostate, lung, kidney, bone marrow, blood, skin, bladder, brain, muscles, nerves, and selected tissues that are affected by various types of cancer (malignant or non-metastatic), affected by cystic fibrosis or polycystic kidney disease.
  • non-ubiquitously expressed genes are those whose gene products are localized to certain subcellular locations: extracellular matrix, nucleus, cytoplasm, cytoskeleton, plasma and/or intracellular membranous structures which include but are not limited to coated pits, Golgi apparatus, endoplasmic reticulum, endosome, lysosome, and mitochondria.
  • a preferred array comprises 3' untranslated sequences of gene transcripts listed in Table 1.
  • the arrays of the present invention comprise control probes, positive or negative, for comparison purpose.
  • the selection of an appropriate control probe is dependent on the sample probe initially selected and its expression pattern which is under investigation.
  • the control probes may also be classified into the following three categories: (a) normalization controls; (b) expression level control; and (c) mismatch controls.
  • Normalization controls serve to generate signals during hybridization as a control for variations in hybridization conditions, label intensity, "reading" efficiency or any other factors that may cause the signal of a specific hybridization to vary between arrays and among different regions of the same arrays.
  • signals e.g., fluorescence intensity
  • read from all other probes in the array are divided by the signal
  • the normalization controls comprises sequences that are perfectly complementary to their respective target sequences. Virtually any probe may serve as a normalization control. However, it is recognized that hybridization efficiency varies with base composition and probe length. Preferred normalization probes are selected to reflect the average length of the other probes present in the array. However, they can be selected to cover a range of lengths. The normalization control(s) can also be selected to reflect the base composition of the other probes in the array.
  • Expression level controls are probes that hybridize specifically with constitutively expressed genes in the biological sample. Expression level controls are designed to control for the overall health and metabolic activity of a cell. Examination of the covariance of an expression level control with the expression level of the target nucleic acid indicates whether measured changes or variations in expression level of a gene is due to changes in transcription rate of that gene or to general variations in health of the cell. Thus, for example, when a cell is in poor health or lacking a critical metabolite the expression levels of both an active target gene and a constitutively expressed gene are expected to decrease. The converse is also true.
  • the change may be attributed to changes in the metabolic activity of the cell as a whole, not to differential expression of the target gene in question.
  • the expression levels of this target gene and the expression level control do not covary, the variation in the expression level of the target gene is attributed to differences in regulation of that gene and not to overall variations in the metabolic activity of the cell.
  • any constitutively expressed gene provides a suitable candidate for expression level control probes.
  • expression level control probes have sequences complementary to constitutively expressed "housekeeping genes," which include, but are not limited to the ⁇ -actin gene, the transferrin receptor gene, the GAPDH gene, and the like.
  • Mismatch probes provide a control for non-specific binding or cross-hybridization to a nucleic acid in the sample other than the target to which the probe is directed. Mismatch probes thus indicate whether a hybridization is specific or not. For example, if the target is present the perfect match probes should be consistently brighter than the mismatch probes.
  • mismatch controls are polynucleotide probes identical to their corresponding target polynucleotide except for the presence of one or more mismatched bases. Mismatches are selected such that under appropriate hybridization conditions (e.g., stringent conditions) the test or control probe would be expected to hybridize with its target sequence, but the mismatch probe would not hybridize (or would hybridize to a significantly lesser extent). In general, as much as 20% base-pair mismatch (when optimally aligned) can be tolerated.
  • Control probes of any kind can be localized at any position in the array or at multiple positions throughout the array to control for spatial variation, overall expression level, or non-specific binding in hybridization assays.
  • the polynucleotide probes embodied in this invention can be obtained by chemical synthesis, recombinant cloning, e.g. PCR, or any combination thereof. Methods of chemical polynucleotide synthesis are well known in the art and need not be described in
  • Selected probes are immobilized onto predetermined regions of a solid support by any suitable techniques that effect in stable association of the probes with the surface of a solid support.
  • stably associated is meant that the polynucleotides remain localized to the predetermined region under hybridization and washing conditions.
  • the polynucleotides can be covalently associated with or non-covalently attached to the support surface. Examples of non-covalent association include binding as a result of non-specific adsorption, ionic, hydrophobic, or hydrogen bonding interactions.
  • Covalent association involves formation of chemical bond between the polynucleotides and a functional group present on the surface of a support. The functional may be naturally occurring or introduced as a linker.
  • Non-limiting functional groups include but are not limited to hydroxyl, amine, thiol and amide.
  • Exemplary techniques applicable for covalent immobilization of polynucleotide probes include, but are not limited to, UV cross-linking or other light-directed chemical coupling, and mechanically directed coupling (see, e.g. U.S. Patent No. 5,837,832, 5,143,854, 5800992, WO 92/10092, WO 93/09668, and WO
  • a preferred method is to link one of the termini of a polynucleotide probe to the support surface via a single covalent bond.
  • each array is generated by depositing a plurality of probe samples either manually or more commonly using an automated device, which spots samples onto a number of predefined regions in a serial operation.
  • automated spotting devices are commonly employed for production of polynucleotide arrays. Such devices include piezo or ink-jet devices, automated micro-pipetters and any of those devices that are commercially available (e.g. Beckman Biomek 2000).
  • the total number of probe samples spotted on the support will vary depending on the number of different polynucleotide probes one wish to display on the surface, as well as the number of control probes, which
  • the array comprises at least about 20 distinct polynucleotides, usually at least about 100 polynucleotides, preferably about 1000 polynucleotides, more preferably about 10,000 polynucleotides, but will usually not exceed 100,000 polynucleotides, wherein each polynucleotide is complementary to a distinct gene transcript.
  • the polynucleotide spots may take a variety of configurations ranging from simple to complex, depending on the intended use of the array.
  • the probes may be spotted in any convenient pattern across or over the surface of the array so as to from a grid, a circular, ellipsoid, oval or some other analogously curved shape.
  • a predetermined region comprises at least 2, preferably at least 100 single-stranded polynucleotides, more preferably about 1000 single-stranded polynucleotides, and will usually not exceed 10,000 polynucleotide probes, that are complementary to the same gene transcript.
  • a predetermined region is spotted with at least 2, usually at least 100 single-stranded polynucleotides of identical sequences.
  • the predetermined region generally has an average size ranging from about 0.01 cm 2 to about 1 cm 2 .
  • the substrates of the subject arrays may be manufactured from a variety of materials.
  • the materials with which the support is fabricated exhibit a low level of non-specific binding during hybridization assay.
  • a preferred solid support is made from one or more of the following types of materials: nitrocellulose, nylon, polypropylene, glass, and silicon.
  • the materials may be flexible or rigid.
  • a flexible substrate is capable of being bent, folded, twisted or similarly manipulated, without breaking.
  • a rigid substrate is one that is stiff or inflexible and prone to breakage. As such, the rigid substrates of the subject arrays are sufficient to provide physical support and structure to the polymeric targets present thereon under the assay conditions in which the arrays are employed, particularly under high throughput assay conditions.
  • Exemplary materials suitable for fabricating flexible support include a diversity of membranous materials, such as nitrocellulose, nylon or derivatives thereof, and plastics (e.g. polytetrafluoroethylene, polypropylene, polystyrene, polycarbonate, and blends thereof).
  • plastics e.g. polytetrafluoroethylene, polypropylene, polystyrene, polycarbonate, and blends thereof.
  • materials suitable for making rigid support include but are not limited to glass, semi-conductors such as silicon and germanium, metals such as platinum and gold.
  • the solid support on which arrays of polynucleotide probes are attached comprises at least one surface, which may be smooth or substantially planar, or with irregularities such as depressions or elevations.
  • the surface on which the pattern of probes is deposited may be modified with one or more different layers of compounds that serve to modify the properties of the surface in a desirable manner.
  • Modification layers coated on the solid support may comprise inorganic layers made of, e.g. metals, metal oxides, or organic layers composed of polymers or small organic molecules and the like.
  • Polymeric layers of interest include layers of peptides, proteins, polysaccharides, lipids, phospholipids, polyurethanes, polyesters, polycarbonates, polyureas, polyamides, polyethyleneamines, polyarylene sulfates, polysiloxanes, polyimides, polyacetates and the like, where the polymers may be hetero- or homopolymeric, and may or may not be conjugated to functional moieties.
  • the arrays of polynucleotide probes provide an effective means of detecting or monitoring expression of a multiplicity of genes.
  • the expression detecting methods of this invention may be used in a wide variety of circumstances including detection of disease, identification and quantification of differential gene expression between at least two samples, linking the differentially expressed genes to a specific chromosomal location, and/or screening for compositions that upregulate or downregulate the expression or alter the pattern of expression of particular genes.
  • this invention provides a method of simultaneously detecting expression of a multiplicity of gene transcripts of a subject.
  • the method comprises the steps of: (a) contacting more than one labeled target polynucleotides corresponding to gene transcripts of the test subject with an array of polynucleotide probes of the invention under
  • T the conditions sufficient to produce stable target-probe complexes; and (b) detecting the formation of the stable target-probe complexes, thereby detecting expression of a multiplicity of gene transcripts.
  • the invention provides a method for detecting differential expression of a multiplicity of gene transcripts of at least two subjects.
  • the method involves (a) contacting more than one labeled target polynucleotides corresponding to gene transcripts of a first subject with an array of polynucleotide probes of the invention, under the conditions sufficient to produce stable target-probe complexes that form a first hybridization pattern; (b) contacting more than one labeled target polynucleotides corresponding to gene transcripts of a second subject with an invention array, under the conditions sufficient to produce stable target-probe complexes that form a second hybridization pattern; and (c) comparing the hybridization patterns, thereby detecting the differential expression of a multiplicity of gene transcripts of the subjects.
  • the test subject used for this invention can be body fluid, solid tissue samples, tissue cultures or cells derived therefrom and the progeny thereof, and sections or smears prepared from any of these sources, or any other samples that contain nucleic acids.
  • polynucleotides corresponding to gene transcripts refer to nucleic acids for whose synthesis, the mRNA transcript or subsequences thereof have ultimately served as a template.
  • a cDNA reverse transcribed from a mRNA, an RNA molecule transcribed from that cDNA, a DNA molecule amplified from the cDNA, an RNA transcribed from the amplified DNA and etc. are all corresponding to a gene transcript.
  • target polynucleotides representing cellular mRNA pools of a subject are generated by reverse transcription using an oligo-dT primer. This has the virtue of producing a product from the 3' end of the gene transcript, directly complementary to immobilized probes on the arrays.
  • a variation of this approach is to employ total RNA pools rather than mRNAs selected by oligo-dT, to
  • the resulting transcribed nucleic acids may be amplified prior to hybridization.
  • amplification method if a quantitative result is desired, caution must be taken to use a method that maintains or controls for the relative copies of the amplified nucleic acids.
  • Methods of "quantitative" amplification are well known to those of skill in the art. For example, quantitative PCR involves simultaneously co-amplifying a known quantity of a control sequence using the same primers. This provides an internal standard that may be used to calibrate the PCR reaction.
  • the subject array may also include probes specific to the internal standard for quantification of the amplified nucleic acid.
  • One preferred internal standard is a synthetic AW 106 cRNA.
  • the AW 106 cRNA is combined with RNA isolated from the sample according to standard techniques known to those of skill in the art.
  • the RNA is then reverse transcribed using a reverse transcriptase to provide cDNA.
  • the cDNA sequences are then amplified (e.g., by PCR) using labeled primers.
  • the amplification products are separated, typically by electrophoresis, and the amount of radioactivity (proportional to the amount of amplified product) is determined.
  • the amount of mRNA in the sample is then calculated by comparison with the signal produced by the known AW 106 RNA standard.
  • Detailed protocols for quantitative PCR are provided in PCR Protocols, A Guide to Methods and Applications, Innis et al. ,
  • target polynucleotides may involve cloning the sequences into suitable vectors for replication and storage purpose.
  • suitable vectors A vast number of vectors are available in the art and thus are not detailed herein.
  • the target polynucleotides may also be modified prior to hybridization to the probe arrays in order to reduce sample complexity thereby decreasing background signal and improving sensitivity of the measurement using any techniques known in the art. See, for example, the procedures disclosed in WO 97/10365.
  • target polynucleotides are allowed to form stable complexes with probes on the aforementioned arrays in a hybridization reaction. It will be appreciated by one of skill in the art that where antisense RNA is used as the target nucleic acid, the polynucleotide probes provided in the array are
  • V3 chosen to be complementary to sequences of the antisense nucleic acids.
  • the polynucleotide probes are selected to be complementary to sequences of the sense nucleic acids.
  • the probes may be of either sense and/or antisense as the target nucleic acids include both sense and antisense strands.
  • Suitable hybridization conditions for the practice of the present invention are such that the recognition interaction between the probe and target is both sufficiently specific and sufficiently stable.
  • hybridization reactions can be performed under conditions of different "stringency". Relevant conditions include temperature, ionic strength, time of incubation, the presence of additional solutes in the reaction mixture such as formamide, and the washing procedure. Higher stringency conditions are those conditions, such as higher temperature and lower sodium ion concentration, which require higher minimum complementarity between hybridizing elements for a stable hybridization complex to form. Conditions that increase the stringency of a hybridization reaction are widely known and published in the art. See, for example, (Sambrook, et al., (1989), supra).
  • the conditions may often be selected to be universally equally stable independent of the specific sequences involved. This typically will make use of a reagent such as an alkylammonium buffer. See, Wood et al. (1985) "Base Composition-independent Hybridization in Tetramethylammonium Chloride: A Method for Oligonucleotide Screening of Highly Complex Gene Libraries," Proc. Natl. Acad. Sci. USA, 82:1585-1588; and Krupov et al. (1989) "An Oligonucleotide Hybridization Approach to DNA Sequencing," FEBS Letters, 256:1 18-122; each of which is hereby incorporated herein by reference.
  • alkylammonium buffer tends to-minimize differences in hybridization rate and stability due to GC content. By virtue of the fact that sequences then hybridize with approximately equal affinity and stability, there is relatively little bias in strength or kinetics of binding for particular sequences. Temperature and salt conditions along with other buffer parameters may also be selected such that the kinetics of renaturation should be essentially independent of the specific target subsequence or polynucleotide probe involved. In order to ensure this, the hybridization reactions will usually be performed in a single incubation of all arrays to be tested together exposed to the identical same target probe solution under the same conditions.
  • various arrays may be individually treated differently. Different probes may be produced, each having reagents which bind to target subsequences with substantially identical stability and kinetics of hybridization. For example, all of the high GC content probes could be synthesized on a single array which is treated accordingly. In this embodiment, the arylammonium buffers could be unnecessary. Each array is then treated in a manner such that the collection of arrays show essentially uniform binding and the hybridization data of target binding to the individual array is combined with the data from other arrays to derive the necessary subsequence binding information. Preferably, control hybridization is included to determine the stringency and kinetics of each hybridization reaction.
  • hybridization specificity stringency
  • signal intensity there is a tradeoff between hybridization specificity (stringency) and signal intensity.
  • washing the hybridized array prior to detecting the target-probe complexes is performed to enhance the noise-signal ratio.
  • the hybridized array is washed at successively higher stringency solutions and signals are read between each wash. Analysis of the data sets thus produced will reveal a wash stringency above which the hybridization pattern is not appreciably altered and which provides adequate signal for the particular polynucleotide probes of interest. Parameters governing the wash stringency are generally the same as those of hybridization stringency.
  • Other measures such as inclusion of blocking reagents (e.g. sperm DNA, detergent or other organic or inorganic substances) during hybridization can also reduce non-specific binding.
  • the target polynucleotides are conjugated to a detectable label.
  • Detectable labels suitable for use in the present invention include any composition detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means.
  • a wide variety of appropriate detectable labels are known in the art, which include luminescent labels, radioactive isotope labels, enzymatic or other ligands.
  • the labels may be incorporated by any of a number of means well known to those of skill in the art.
  • the label is simultaneously incorporated during the amplification step in the preparation of the target polynucleotides.
  • the label is simultaneously incorporated during the amplification step in the preparation of the target polynucleotides.
  • PCR 1X polymerase chain reaction
  • labeled primers or labeled nucleotides can provide a labeled amplification product.
  • transcription reaction as described above, using a labeled nucleotide (e.g. fluorescein-labeled UTP and/or CTP) or a labeled primer, incorporates a detectable label into the transcribed nucleic acids.
  • a label may be added directly to the original nucleic acid sample
  • label e.g., mRNA, polyA, mRNA, cDNA, etc.
  • Means of attaching labels to nucleic acids include, for example nick translation or end-labeling (e.g. with a labeled RNA) by kinasing of the nucleic acid and subsequent attachment (ligation) of a nucleic acid linker joining the sample nucleic acid to a label (e.g., a fluorophore).
  • radiolabels may be detected using photographic film or phosphoimager (for detecting and quantifying 32 P incorporation).
  • Fluorescent markers may be detected and quantified using a photodetector to detect emitted light (see U.S. Patent No.
  • Enzymatic labels are typically detected by providing the enzyme with a substrate and measuring the reaction product produced by the action of the enzyme on the substrate; and finally colorimetric labels are detected by simply visualizing the colored label.
  • the detection method provides a positional localization of the region where hybridization has taken place. The position of the hybridized region correlates to the specific sequence of the probe, and hence the identify of the gene transcript expressed in the test subject.
  • the detection methods also yield quantitative measurement of the level of hybridization intensity at each hybridized region, and thus a direct measurement of the level of expression of a given gene transcript.
  • hybridization pattern A collection of the data indicating the regions of hybridization present on an array and their respective intensities constitutes a "hybridization pattern" that is representative of a multiplicity of expressed gene transcripts of a subject. Any discrepancies detected in the hybridization patterns generated by hybridizing target polynucleotides derived from different subjects are indicative of differential expression of a multiplicity of gene transcripts of these subjects.
  • hybridization signals will vary in strength with efficiency of hybridization, the amount of label on the target nucleic acid and the amount of particular target nucleic acid in the sample.
  • target nucleic acids present at very low levels e.g., ⁇ lpmol
  • a threshold intensity value may be selected below which a signal is not counted as being essentially indistinguishable from background.
  • the provision of appropriate controls permits a more detailed analysis that controls for variations in hybridization conditions, cell health, non-specific binding and the like.
  • the hybridization patterns to be compared can be generated on the same array. In such case, different patterns are distinguished by the distinct types of detectable labels. In a separate aspect, the hybridization patterns employed for the comparison are generated on different arrays, where discrepancies are indicative of a differential expression of a particular gene in the subjects being compared.
  • the subjects employed for the comparative hybridization analysis may be (a) cells from different organisms of the same species (e.g. cells derived from different humans); (b) cells derived from the same organism but from different tissue types including normal or disease tissues, embryonic or adult tissues; (c) cells at different points in the cell-cycle; (d) cells treated with or without external or internal stimuli.
  • the comparative hybridization analysis using the arrays of the present invention can be employed to monitor gene expression in a wide variety of contexts. Such analysis may be extended to detecting differential expression of genes between diseased and normal tissues, among different types of tissues and cells, amongst cells at different cell-cycle points or at different developmental stages, and amongst cells that are subjected to various environmental stimuli or lead drugs.
  • a "computer readable medium” refers to any medium which can be read and accessed directly by a computer. Such media include, but are not limited to magnetic storage media, such as floppy discs, hard disc storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these categories, such as magnetic/optical storage media.
  • the term "recorded” refers to a process for storing information on computer readable medium.
  • a skilled artisan can readily adopt any of the presently know methods for recording information on computer readable medium to generate manufactures comprising the arrays of the present invention.
  • a variety of data storage structures are available to a skilled artisan for creating a computer readable medium having recorded thereon an array of polynucleotide probes of this invention. The choice of the data storage structure will generally be based on the means chosen to access the stored information.
  • a variety of data processor programs and formats can be used to store the array information of the present invention on a computer readable medium.
  • the array information can be represented in a word processing file including but not limited to doc, txt, wpf, and formatted in commercially-available software such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, Oracle, Informix, SQL or the like.
  • the array information can also be represented in comma delimited file, tab delimited file, space delimited file, data interchange format (DIF), quatro pro file, SAS file, SPSS file, flat file,
  • Dbase file all adobe acrobat files: pdf, PDF file, document template file, filemaker pro fp3 file, or the like.
  • a skilled artisan can readily adapt any number of data-processor structuring formats (e.g., text flex or database) in order to obtain computer readable medium having recorded thereon the probe array information of the present invention.
  • the computer readable medium can be incorporated as part of the computer-based system of the present invention, and can be employed for a computer-based analysis as described below.
  • the computer-based system of the present invention is designed to detect differential expression of a multiplicity of gene transcripts indicated by a difference in hybridization patterns on an array of polynucleotide probes.
  • Such system comprises: (a) a data storage device comprising a reference hybridization pattern and a test hybridization pattern, wherein the reference hybridization pattern is generated by hybridizing an array of polynucleotide probes as disclosed herein, with more than one labeled target polynucleotides corresponding to gene transcripts expressed in a control; and wherein the test hybridization pattern is generated by hybridizing an array of polynucleotide probes as described above with more than one labeled target polynucleotides corresponding to gene transcripts expressed in a test subject; (b) a search device for comparing the test hybridization pattern to the reference hybridization pattern of the data storage device of step (a) to detect the differences in hybridization patterns; and (c) a retrieval device for obtaining said differences in hybridization patterns of step (b).
  • a computer-based system includes hardware and software.
  • the "data storage device” as part of the system refers to memory which can store reference hybridization pattern(s) and test hybridization pattern(s), which are generated by hybridizing one or more arrays of polynucleotide probes as disclosed herein, with target polynucleotides corresponding to gene transcripts expressed in distinct subjects.
  • the datastorage device may also include a memory access device which can access manufactures having recorded thereon the array information of the present invention.
  • Non-limiting exemplary data storage devices are media storage, floppy drive, super floppy, tape drive, zip drive, syquest syjet drive, hard drive, CD Rom recordable (R), CD Rom rewritable (RW), M.D. drives, optical media, and punch cards/tape.
  • the "search device” as part of the computer-based system encompasses one or more programs which are implemented on the system to compare the test hybridization pattern to the reference hybridization pattern in order to detect the differences in these hybridization patterns.
  • a variety of known algorithms are disclosed publicly and a variety of commercially available software useful for pattern recognition can be used in computer- based systems of the present invention. Examples of array analysis software include Biodiscovery, HP, and any of those applicable for image analyses.
  • Some currently employed search devices include those embodied in "Gene Array Scanner (Hewlett Packard)", “General Scanning”, “reader Hitachi system”, “Genomics Solutions” and "GeneChip work station”.
  • the retrieval device includes program(s) which are implemented on the system to retrieve the differences in hybridization patterns detected by the search device. Hardware necessary for displaying the detected device may also form part of the retrieval device.
  • the storage, search, retrieval devices may be assemble as a PC, Mac, Apollo workstation (Cray), SGI machine, Sun machine, UNIX or LINUX based Workstations, Be OS systems, laptop computer, palmtop computer, and palm pilot system, or the like.
  • the computer-based method comprises the following steps: (a) providing a database comprising hybridization patterns that represent expression patterns of multiple genes for a plurality of subjects, wherein each hybridization pattern is generated by hybridizing an array of polynucleotide probes disclosed herein, with more than one labeled target polynucleotides corresponding to gene transcripts expressed in a distinct subject, wherein said hybridizing step yields detectable target-probe complexes with different levels of hybridization intensities; (b) receiving two or more of hybridization patterns for comparison; (c) determining differences in the selected hybridization patterns; and (d) displaying the results of said determination.
  • the determining step includes the step of calculating the differences between the hybridization intensities of target-probe complexes localized in predetermined regions on the solid support.
  • kits containing the polynucleotide probe arrays of this invention include those that allow simultaneous detection of the expression and/or quantification of the level of expression of multiple gene transcripts of a subject. Further embodied by the invention are kits useful for detecting differential expression of a multiplicity of gene transcripts of a test subject in comparison to a control.
  • kits necessarily comprises the reagents which render the hybridization procedure possible: an array of polynucleotide probes of the invention used for detecting target polynucleotides; hybridization reagents that allow formation of stable target-probe complexes during a hybridization reaction.
  • the kits may also contain reagents useful for generating labeled target polynucleotides corresponding to gene transcripts of a test subject.
  • the arrays contained in the kits may be pre-hybridized with polynucleotides corresponding to gene transcripts of the control to which the test subject is compare.
  • Each reagent can be supplied in a solid form or dissolved/suspended in a liquid buffer suitable for inventory storage, and later for exchange or addition into the reaction
  • kits can optionally provide additional components that are useful in the procedure. These optional components include, but are not limited to, buffers, capture reagents, developing reagents, labels, reacting surfaces, means for detection, control samples, instructions, and interpretive information.
  • the kits can be employed to test a variety of biological samples, including body fluid, solid tissue samples, tissue cultures or cells derived therefrom and the progeny thereof, and sections or smears prepared from any of these sources. Diagnostic or prognostic procedures using the kits of this invention can be performed by clinical laboratories, experimental laboratories, practitioners, or private individuals.
  • Sequence-tagged site (STS) probes are generated by amplifying human genomic DNA using selected primer pairs.
  • the selected primer pairs yield amplified sequences corresponding to the 3' untranslated region of gene transcripts of particular interest.
  • a list of exemplary primer pairs and the resultant gene sequences are summarized in Table 1. Additional primer pairs may be obtained from worldwide web at http://www.ncbi.nlm.nih.gov/dbSTS/index.html or related web sites.
  • Each PCR reaction contains approximately 100 pmoles of each primer, 50 ng human genomic DNA, and other reagents included in Advantage Genomic PCR kit (Clontech). The PCR reaction is carried out according to manufacturer's instructions which yields approximately 5 ug of each STS tag.
  • the resultant STS tags are analyzed, sequenced, purified, and concentrated by lyophilization (Savant) to approximately 2 ug/ul. Samples of concentrated STS tags are aliquoted and stored at low temperature for future use.
  • each STS tag is printed using a robot (Molecular Devices or Genomic Micro Systems) onto positively charged nylon membrane (Hybond-N+ from Amersham Pharmacia Biotech). Approximately 5000 STS tags are spotted on each membrane (5cm x 7cm). Each spot of sample has a diameter of about 0.1 mm, and is spaced about 0.15 mm apart. The array optionally is spotted with control samples comprising human genomic DNAs or cDNAs of house-keeping genes. The spotted STS tags and control probes are then denatured in about 0.2 M NaOH. After neutralization, the denatured probes are cross-linked to the nylon membrane by UV irradiation.
  • cDNA probes are generated by reverse transcription of mRNAs extracted from about lOxlO 7 cells.
  • the cells are eukaryotic cells, more preferably they are mammalian cells, and even more preferably they are human cells.
  • RNA molecules are isolated using NucleoSpin RNA kit (Clontech), and polyA+- RNA molecules are extracted from total RNA using mRNA Separator kit (Clontech). Labeling the target sequences is carried out during reverse transcription of the isolated RNA molecules using kits provided by Life Technology /BRL according to the following experimental procedures. The target sequences are labeled with biotin-
  • RNA molecules Approximately 200 ng of mRNA molecules are suspended in 16 ul water and mixed with 2 ul Oligo-dT primer (1 ug/ul 10-20 mer mixture). The reaction mixture is then denatured at about 70 °C for approximately 10 min followed by rapid chilling on ice for about 3 min. Appropriate amount of buffers containing reagents necessary for first strand synthesis (first strand buffer provided by BRL/Life Tech Cat# 18064- 014) and suitable amount of reverse transcriptase are added to the reaction mixture.
  • first strand buffer provided by BRL/Life Tech Cat# 18064- 014
  • suitable amount of reverse transcriptase are added to the reaction mixture.
  • the first stand buffer contains lul DTT (0.1M) , 1.5 ul dNTPs mixture (dATP, dCTP, dGTP at 20 mM, Pharmacia Cat# 27-2035-02), 1.5 ul 0.8 mM dTTP and 0.8 mM biotin-16-dUTP. Typically, 1.5 ul reverse transcriptase (200 units/ul, Superscript II RT from BRL/Life Tech Cat # 18064-014) is used. The reaction mixture is then incubated at about 37 °C for approximately 90 min. The labeled target sequences are purified by passing through a Bio-Spin Chromatography Column (Bio-Rad Cat. #732-60020.
  • the target sequences Prior to hybridization with the probes immobilized on an array, the target sequences are denatured by heating at 100 °C for about 3 min followed by rapid chilling to about 4 °C.
  • 200 ng of mRNA is also reverse-transcribed in a similar labeling reaction mixture as described above which contains digoxigenin-11-dUTP instead of biotin-16-dUTP.
  • the array containing immobilized probes may be pre-hybridized for at least about 2 hours at 42 C in a hybridization buffer (MicroHyb solution, Research Genetics, Cat # HYB125.GF) that contains 0.5 ug/ml poly- dA (Research Genetics Cat.# POLYA.GF) and 0.5 ug/ml human Cotl DNA (BRL/Life Tech Cat # 15279-
  • a hybridization buffer MacroHyb solution, Research Genetics, Cat # HYB125.GF
  • Poly- dA Research Genetics Cat.# POLYA.GF
  • human Cotl DNA BRL/Life Tech Cat # 15279-
  • the labeled target sequences as described above are then added to the prehybridization mixture, and incubated at about 42 C for approximately 12-18 hours. Unbound target sequences are washed off from the array according to the following procedures: two times at 50 C in 2X SSC, 1% SDS for 20 min and three times at room temperature in 0.2XSSC, 0.1% SDS for 15 min each. The array is then blocked in IX BM blocking reagent (BRL/Life Tech, Cat # ) for about 30 min at room temperature.
  • anti-DIG-alkaline phosphatase conjugates are first diluted 15,000 fold in blocking buffer (0.1 M maleic acid, 0.15 M NaCl, and 0.3 % Tween 20 at pH 7.5) containing 0.5X BM blocking reagent and incubated with the membranes at room temperature for 45 min. The membrane is then washed with blocking buffer thrice, 10 min each time. The membrane is then blocked with 1% BM blocking reagent containing 2% dextran sulphate at room temperature for 1 hour and then rinsed with IX TBS buffer solution (10 mM Tris-HCl, pH 7.4 and 150 mM NaCl) containing 0.3 % BSA. To detect the hybridized target-probe complexes on
  • the array streptavidin b-galactosidase enzyme conjugate and anti- digoxigenin/alkaline phosphatase antibody-enzyme conjugate are used.
  • the detection can also be carried out in single color mode. In this case, either one of the antibody/enzyme conjugates is used.
  • the array is typically incubated with a mixture containing 700X diluted streptavidin/b-galactosidase (GIBCO-BRL),
  • X-gal solution 1.2 mM X-gal, 1 mM MgC12, 3 mM K3Fe(CN)6, 3 mM K4Fe(CN)6 in 1 X TBS buffer
  • the membrane is then rinsed briefly with deionized water and stained with Fast
  • Red TR/Naphthol AS-MX substrate (Sigma), an alkaline phosphatase substrate.
  • the color development reactions is then stopped by IX PBS containing 20 mM EDTA.
  • Target sequences labeled with biotin reacts with Strep-GAL and yields "blue” chromogen.
  • Target sequences labeled with digoxigenin reacts with anti-Dig/AP and fast red to yield "red” chromogen.
  • a digital camera (DCS-420, Kodak) attached to a stereomicroscope (Zeiss, Stemi 2000C) is employed to scan the color images of the array.
  • the data recorded by the digital camera can be further processed by a computer using appropriate software.
  • a purple spot on the array indicates the presence of a gene commonly expressed in two separate mRNA samples derived from two separate sources.
  • a spot exhibiting blue or red color above the average stain intensity indicates the presence of a preferentially expressed gene.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Immunology (AREA)
  • Microbiology (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

La présente invention concerne des réseaux comportant un groupe de plusieurs sondes polynucléotidiques présentant des séquences complémentaires de la région 3' non traduite d'un produit de transcription génétique, dont la position chromosomique a été définie. Ces réseaux sont particulièrement utilisés dans les analyses d'expression génétique comparative. En outre, cette invention concerne un procédé de préparation de ces réseaux et des procédés variés d'utilisation de ces réseaux pour détecter l'expression différentielle de produits de transcription génétiques multiples, entre autres nombreux sujets. Par ailleurs, cette invention concerne un support lisible par ordinateur sur lequel est enregistré un réseau de sondes polynucléotidiques tel qu'il a été spécifié plus haut, un système informatisé, et des kits de détection de l'expression différentielle d'une multitude de produits de transcription génétique.
PCT/US2000/015850 1999-06-11 2000-06-09 Reseaux specifiques de genes et leur utilisation WO2000077257A1 (fr)

Priority Applications (3)

Application Number Priority Date Filing Date Title
EP00939721A EP1185701A1 (fr) 1999-06-11 2000-06-09 Reseaux specifiques de genes et leur utilisation
AU54766/00A AU5476600A (en) 1999-06-11 2000-06-09 Gene specific arrays and the use thereof
CA002375220A CA2375220A1 (fr) 1999-06-11 2000-06-09 Reseaux specifiques de genes et leur utilisation

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13869099P 1999-06-11 1999-06-11
US60/138,690 1999-06-11

Publications (1)

Publication Number Publication Date
WO2000077257A1 true WO2000077257A1 (fr) 2000-12-21

Family

ID=22483187

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/015850 WO2000077257A1 (fr) 1999-06-11 2000-06-09 Reseaux specifiques de genes et leur utilisation

Country Status (4)

Country Link
EP (1) EP1185701A1 (fr)
AU (1) AU5476600A (fr)
CA (1) CA2375220A1 (fr)
WO (1) WO2000077257A1 (fr)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002072789A2 (fr) * 2001-03-12 2002-09-19 Irm, Llc. Essais biologiques cellulaires grande vitesse fondes sur la genomique, elaboration de ces essais, et collections de rapporteurs cellulaires
US20160194707A1 (en) * 2003-06-03 2016-07-07 Life Technologies Corporation 3' biased detection of nucleic acids
WO2019037828A1 (fr) * 2017-08-25 2019-02-28 Zoetis Services Llc Sonde d'acide nucléique, procédé d'immobilisation de l'acide nucléique sur un support solide à l'aide de lumière uv, support solide comprenant une sonde d'acide nucléique immobilisée et dispositif de test comprenant un support solide

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997010365A1 (fr) * 1995-09-15 1997-03-20 Affymax Technologies N.V. Mesure de l'expression par l'hybridation avec des systemes tres denses d'oligonucleotides
WO1998053103A1 (fr) * 1997-05-21 1998-11-26 Clontech Laboratories, Inc. Ensembles d'acide nucleique
WO1999005323A1 (fr) * 1997-07-25 1999-02-04 Affymetrix, Inc. Systeme d'expression et d'evaluation de genes

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997010365A1 (fr) * 1995-09-15 1997-03-20 Affymax Technologies N.V. Mesure de l'expression par l'hybridation avec des systemes tres denses d'oligonucleotides
WO1998053103A1 (fr) * 1997-05-21 1998-11-26 Clontech Laboratories, Inc. Ensembles d'acide nucleique
WO1999005323A1 (fr) * 1997-07-25 1999-02-04 Affymetrix, Inc. Systeme d'expression et d'evaluation de genes

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
BASETT D E ET AL: "GENE EXPRESSION INFORMATICS - IT'S ALL IN YOUR MINE", NATURE GENETICS,US,NEW YORK, NY, vol. 21, no. SUPPL, January 1999 (1999-01-01), pages 51 - 55, XP000865988, ISSN: 1061-4036 *
BOWTELL D D L: "OPTIONS AVAILABLE - FROM START TO FINISH - FOR OBTAINING EXPRESSION DATA BY MICROARRAY", NATURE GENETICS,US,NEW YORK, NY, vol. 21, no. SUPPL, January 1999 (1999-01-01), pages 25 - 32, XP000865983, ISSN: 1061-4036 *
DUGGAN D J ET AL: "EXPRESSION PROFILING USING CDNA MICROARRAYS", NATURE GENETICS,US,NEW YORK, NY, vol. 21, no. SUPPL, January 1999 (1999-01-01), pages 10 - 14, XP000865980, ISSN: 1061-4036 *
ERMOLAEVA O ET AL: "DATA MANAGEMENT AND ANALYSIS FOR GENE EXPRESSION ARRAYS", NATURE GENETICS,US,NEW YORK, NY, vol. 20, 20 September 1998 (1998-09-20), pages 19 - 23, XP002916034, ISSN: 1061-4036 *
HELLER R A ET AL: "Discovery and analysis of inflammatory disease-related genes using cDNA microarrays", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF USA, XP002076789 *
SCHENA M ET AL: "QUANTITATIVE MONITORING OF GENE EXPRESSION PATTERNS WITH A COMPLEMENTARY DNA MICROARRAY", SCIENCE,US,AMERICAN ASSOCIATION FOR THE ADVANCEMENT OF SCIENCE,, vol. 270, no. 5235, 20 October 1995 (1995-10-20), pages 467 - 470, XP000644675, ISSN: 0036-8075 *
WANG K ET AL: "Monitoring gene expression profile changes in ovarian carcinomas using cDNA microarray", GENE,NL,ELSEVIER BIOMEDICAL PRESS. AMSTERDAM, vol. 229, no. 1-2, 18 March 1999 (1999-03-18), pages 101 - 108, XP004161163, ISSN: 0378-1119 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002072789A2 (fr) * 2001-03-12 2002-09-19 Irm, Llc. Essais biologiques cellulaires grande vitesse fondes sur la genomique, elaboration de ces essais, et collections de rapporteurs cellulaires
WO2002072789A3 (fr) * 2001-03-12 2003-07-03 Irm Llc Essais biologiques cellulaires grande vitesse fondes sur la genomique, elaboration de ces essais, et collections de rapporteurs cellulaires
US20160194707A1 (en) * 2003-06-03 2016-07-07 Life Technologies Corporation 3' biased detection of nucleic acids
WO2019037828A1 (fr) * 2017-08-25 2019-02-28 Zoetis Services Llc Sonde d'acide nucléique, procédé d'immobilisation de l'acide nucléique sur un support solide à l'aide de lumière uv, support solide comprenant une sonde d'acide nucléique immobilisée et dispositif de test comprenant un support solide
US11248257B2 (en) 2017-08-25 2022-02-15 Zoetis Services Llc Method of immobilizing a nucleic acid probe to a solid support

Also Published As

Publication number Publication date
AU5476600A (en) 2001-01-02
EP1185701A1 (fr) 2002-03-13
CA2375220A1 (fr) 2000-12-21

Similar Documents

Publication Publication Date Title
Barrett et al. Microarrays: the use of oligonucleotides and cDNA for the analysis of gene expression
EP0799897B1 (fr) Kits et méthodes pour la détection des acides nucléiques cibles à l'aide des acides nucléiques marqueurs
JP5171037B2 (ja) マイクロアレイを用いた発現プロファイリング
US6703228B1 (en) Methods and products related to genotyping and DNA analysis
WO2017161251A1 (fr) Procédés de détection et d'identification d'acides nucléiques génomiques
EP2476760A1 (fr) Procédé d'analyse d'une mutation d'acide nucléique utilisant la technique de la puce d'hybridation génomique comparative
US20060084105A1 (en) Arrays for detection of products of mRNA splicing
US20060141506A1 (en) Methods for monitoring the expression of alternatively spliced genes
JP2003530116A (ja) アレー上での相同ヌクレオチド配列の検出による生物学的(微)生物の同定
JP2004535814A (ja) ニック形成エンドヌクレアーゼを使用する指数関数的核酸増幅
EP1552010A4 (fr) Strategies pour l'analyse de l'expression genique
US6692915B1 (en) Sequencing a polynucleotide on a generic chip
US7312035B2 (en) Methods of genetic analysis of yeast
US20060281126A1 (en) Methods for monitoring the expression of alternatively spliced genes
JP2001054400A (ja) 遺伝子型決定二対立遺伝子マーカー
US9689867B2 (en) Assays for affinity profiling of nucleic acid binding proteins
US6716579B1 (en) Gene specific arrays, preparation and use
WO2000077257A1 (fr) Reseaux specifiques de genes et leur utilisation
US20070281863A1 (en) Dual polarity analysis of nucleic acids
US20070148636A1 (en) Method, compositions and kits for preparation of nucleic acids
CA2525179A1 (fr) Equation genetique utilisee pour diagnostiquer une polyarthrite rhumatoide
CA2493791A1 (fr) Reseau de grandes molecules sens circulaires
CA2572176A1 (fr) Methodes d'optimisation de sondes
WO2007097876A2 (fr) Sondes de standardisation pour des réseaux d'hybridation génomique comparative
JP4873812B2 (ja) マイクロアレイ上のrna:dnaハイブリッドの免疫検出

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
ENP Entry into the national phase

Ref document number: 2375220

Country of ref document: CA

Ref country code: CA

Ref document number: 2375220

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 2000939721

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2000939721

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Ref document number: 2000939721

Country of ref document: EP