EP1451355A1 - Procedes et systemes pour identifier des produits de transcription antisens naturels et procedes, kits et jeux ordonnes d'echantillons qui les comprennent - Google Patents

Procedes et systemes pour identifier des produits de transcription antisens naturels et procedes, kits et jeux ordonnes d'echantillons qui les comprennent

Info

Publication number
EP1451355A1
EP1451355A1 EP02788472A EP02788472A EP1451355A1 EP 1451355 A1 EP1451355 A1 EP 1451355A1 EP 02788472 A EP02788472 A EP 02788472A EP 02788472 A EP02788472 A EP 02788472A EP 1451355 A1 EP1451355 A1 EP 1451355A1
Authority
EP
European Patent Office
Prior art keywords
sequence
oligonucleotide
sequences
naturally occurring
mrna
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP02788472A
Other languages
German (de)
English (en)
Inventor
Erez Levanon
Sarah Pollock
Sergey Nemzer
Avi Shoshan
Rami Khosravi
Shira Walach
Zurit Levine
Jeanne Bernstein
Dvir Dahari
Alon Wasserman
Galit Rotman
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Compugen Ltd
Original Assignee
Compugen Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Compugen Ltd filed Critical Compugen Ltd
Publication of EP1451355A1 publication Critical patent/EP1451355A1/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6834Enzymatic or biochemical coupling of nucleic acids to a solid phase
    • C12Q1/6837Enzymatic or biochemical coupling of nucleic acids to a solid phase using probe arrays or probe chips
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/10Gene or protein expression profiling; Expression-ratio estimation or normalisation
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids

Definitions

  • the present invention relates to the field of naturally occurring, antisense transcripts. More particularly, the present invention relates to methods of identifying naturally occurring antisense transcripts, databases storing polynucleotide sequences encoding identified naturally occurring antisense transcripts, oligonucleotides derived therefrom and methods and kits utilizing same.
  • Naturally occurring antisense RNA transcripts are endogenous transcripts, which exhibit complementarity to sense transcripts of which are typically of a known function. It has been established that these endogenous antisense transcripts play an important role in regulating prokaryotic gene expression and are increasingly implicated as involved in eukaryotic gene regulation.
  • Cw-encoded antisense transcripts are encoded by the same locus as the sense transcripts and are transcribed from strand of DNA opposite to that encoding the sense transcript; as such, cis encoded antisense transcripts are typically completely complementary with a portion of the sense transcript. Traw-encoded antisense transcripts are by contrast, transcripts, which are encoded on a different locus and as such, may display only partial complementarity with a sense transcript. Natural antisense RNAs were first described in prokaryote studies, which suggested that such transcripts play a role in gene expression regulation.
  • Prokaryotic antisense transcripts are widely distributed and are involved in the control of numerous biological functions including transposition, plasmid replication, incompatibility and conjugation. In prokaryotes, antisense transcripts are typically involved in down-regulation of sense transcript expression, although involvement in positive regulation was also suggested [reviewed in Wagner EG. and Simons RW. (1994) Annu. Rev. Microbiol. 48:713-742].
  • RNAs are conserved between species suggesting that these antisense RNAs are not fortuitous but rather play an important role in gene expression regulation [Kidny MS. et al. (1987) Mol. Cell Biol. 7:2857-2862, Nepveu A. and Marcu KB. (1986) EMBO J. 5:2859-2865 and Bentley DL. et al. (1986) Nature 321 :702-706].
  • Antisense transcripts can also encode proteins. Examples for protein encoding antisense transcripts include rev-ErbAx [Lazar MA. (1989) Mol. Cell. Biol. 9:1128-1136], gfg [Kimelman D. et al. (1989) Cell 59:687-696] and n-cym [A ⁇ nstrong BC. et al. (1992) Cell Growth Differ. 3:385-390]. Such antisense transcripts typically include a distinct open reading frame (ORF) and polyadenylation signal for cytoplasm transportation.
  • ORF open reading frame
  • Natural antisense regulation of gene expression can be effected via one of several mechanisms.
  • 30 bp may initiate DNA-methylation, a well-established phenomenon in a number of organisms [Sharp A. (2001) Genes Dev. 15:485-490]. Methylation can be directed to different portions of an encoding region of the gene or to the promoter region. DNA methylation results in complete suppression of transcription probably by recruitment of histone deacetylases.
  • Transcriptional regulation in which case antisense transcription hampers sense transcription. Such interference may involve the collision of two transcription complexes. Alternatively, interference may result from competition on an essential rate limiting transcription factor resulting in premature termination or in reduced elongation of transcription, the transcripts with the highest rate of transcription being predominant.
  • Post-transcriptional nuclear regulation involves antisense intervention of either maturation and/or transport of the sense transcript to the cytoplasm.
  • antisense transcripts displaying similar structural features to sense transcripts can bind proteins expected to interact with their sense counterparts, thereby depriving sense messengers from proteins necessary for their function.
  • Messenger stability -double stranded RNA may affect messenger stability via "RNA interference", which involves short segments of double stranded RNA (dsRNA) homologous in sequence to the silenced gene. These undersized segments, which are generated by a ribonuclease III cleavage of longer dsRNAs, can guide a single stranded target mRNA, via base pairing, to a multisubunit complex which participates in the degradation of the target mRNA.
  • messenger stability may be affected by RNA degradation, which is mediated by double stranded RNA-directed Rnases.
  • a method of identifying putative naturally occurring antisense transcripts comprising: (a) computationally aligning a first database including sense-oriented polynucleotide sequences with a second database including expressed polynucleotide sequences; and (b) identifying expressed polynucleotide sequences from the second database being capable of forming a duplex with at least one sense-oriented polynucleotide sequence of the first database, thereby identifying putative naturally occurring antisense transcripts.
  • kits for quantifying at least one mRNA transcript of interest comprising at least one oligonucleotide being designed and configured so as to be complementary to a sequence region of the mRNA transcript of interest, the sequence region not being complementary with a naturally occurring antisense transcript.
  • kits for quantifying at least one mRNA transcript of interest comprising at least one pair of oligonucleotides including a first oligonucleotide capable of binding the at least one mRNA transcript of interest and a second oligonucleotide being capable of binding a naturally occurring antisense transcript complementary to the mRNA of interest.
  • a method of designing artificial antisense transcripts comprising: (a) providing a database of naturally occurring antisense transcripts; (b) extracting from the database criteria governing structure and/or function of the naturally occurring antisense transcripts; and (c) designing the artificial antisense transcripts according to the criteria.
  • the criteria governing structure and/or function of the naturally occurring antisense transcripts are selected from the group consisting of antisense length, complementarity length, complementarity position, intron molecules, alternative splicing sites, tissue specificity, pathological abundance, chromosomal mapping, open reading frames, promoters, hairpin structures, helix structures, stem and loops, pseudoknots and tertiary interactions, guanidine and/or cytosine content, guanidine tandems, adenosine content, thermodynamic criteria, RNA duplex melting point, RNA modifications, protein-binding motifs, palindromic sequence and predicted single stranded and double stranded regions.
  • a computer readable storage medium comprising a database including a plurality of sequences, wherein each sequence is of a naturally occurring antisense transcript.
  • the database further includes information pertaining to each sequence of the naturally occurring antisense transcripts, the information is selected from the group consisting of related sense gene, antisense length, complementarity length, complementarity position, intron molecules, alternative splicing sites, tissue specificity, pathological abundance, chromosomal mapping, open reading frames, promoters, hairpin structures, helix structures, stem and loops, pseudoknots and tertiary interactions, guanidine and/or cytosine content, guanidine tandems, adenosine content, thermodynamic criteria, RNA duplex melting point, RNA modifications, protein-binding motifs, palindromic sequence and predicted single stranded and double stranded regions.
  • the database further includes information pertaining to generation of the database and potential uses of the database.
  • a method of generating a database of naturally occurring antisense transcripts comprising: (a) computationally aligning a first database including sense-oriented polynucleotide sequences with a second database including expressed polynucleotide sequences; (b) identifying expressed polynucleotide sequences from the second database being capable of forming a duplex with at least one sense-oriented polynucleotide sequence of the first database so as to identify putative naturally occurring antisense transcripts; and (c) storing sequence information of the identified naturally occurring antisense transcripts, thereby generating the database of the naturally occurring antisense transcripts.
  • a system for generating a database of a plurality of putative naturally occurring antisense transcripts comprising a processing unit, the processing unit executing a software application configured for: (a) computationally aligning a first database including sense-oriented polynucleotide sequences with a second database including expressed polynucleotide sequences; and (b) identifying expressed polynucleotide sequences from the second database being capable of forming a duplex with at least one sense-oriented polynucleotide sequence of the first database.
  • a method of identifying putative naturally occurring antisense transcripts comprising screening a database of expressed polynucleotides sequences according to at least one sequence criterion, the at least one sequence criterion being selected to identify putative naturally occurring antisense transcripts.
  • a method of quantifying at least one mRNA of interest in a biological sample comprising: (a) contacting the biological sample with at least one oligonucleotide capable of binding with the at least one mRNA of interest, wherein the at least one oligonucleotide is designed and configured so as to be complementary to a sequence region of the mRNA transcript of interest, the sequence region not being complementary with a naturally occurring antisense transcript; and (b) detecting a level of binding between the at least one mRNA of interest and the at least one oligonucleotide to thereby quantify the at least one mRNA of interest in the biological sample.
  • a method of quantifying the expression potential of at least one mRNA of interest in a biological sample comprising: (a) contacting the biological sample with at least one pair of oligonucleotides including a first oligonucleotide capable of binding the at least one mRNA of interest and a second oligonucleotide being capable of binding a naturally occurring antisense transcript complementary to the mRNA of interest; and (b) detecting a level of binding between the at least one mRNA of interest and the first oligonucleotide and a level of binding between the naturally occurring antisense transcript complementary to the mRNA of interest and the second oligonucleotide to thereby quantify the expression potential of the at least one mRNA of interest in the biological sample.
  • a method of quantifying at least one naturally occurring antisense transcript of interest in a biological sample comprising: (a) contacting the biological sample with at least one oligonucleotide capable of binding with the at least one naturally occurring antisense transcript of interest, wherein the at least one oligonucleotide is designed and configured so as to be complementary to a sequence region of the naturally occurring antisense transcript of interest, the sequence region not being complementary with a naturally occurring mRNA transcript; and (b) detecting a level of binding between the at least one naturally occurring antisense transcript of interest and the at least one oligonucleotide to thereby quantify the at least one naturally occurring antisense transcript of interest in the biological sample.
  • the first database includes sequences of a type selected from the group consisting of genomic sequences, expressed sequence tags, contigs, intron sequences, complementary DNA (cDNA) sequences, pre-messenger RNA (mRNA) sequences and mRNA sequences.
  • sequences of a type selected from the group consisting of genomic sequences, expressed sequence tags, contigs, intron sequences, complementary DNA (cDNA) sequences, pre-messenger RNA (mRNA) sequences and mRNA sequences.
  • the second database includes sequences of a type selected from the group consisting of expressed sequence tags, contigs, complementary DNA (cDNA) sequences, pre-messenger RNA (mRNA) sequences and mRNA sequences.
  • an average sequence length of the expressed polynucleotide sequences of the second database is selected from a range of 0.02 to 0.8 Kb.
  • the second database is generated by: (i) providing a library of expressed polynucleotides; (ii) obtaining sequence information of the expressed polynucleotides; (iii) computationally selecting at least a portion of the expressed polynucleotides according to at least one sequence criterion; and (iv) storing the sequence information of the at least a portion of the expressed polynucleotides thereby generating the second database.
  • the at least one sequence criterion for computationally selecting the at least a portion of the expressed polynucleotide is selected from the group consisting of sequence length, sequence annotation, sequence information, intron splice consensus site, intron sharing, sequence overlap, rare restriction site , poly(T) head, poly(A) tail, and ⁇ oly(A) signal.
  • the step of testing the putative naturally occurring antisense transcripts for an ability to form the duplex with the at least one sense oriented polynucleotide sequence under physiological conditions is performed.
  • the method further comprising the step of computationally testing the putative naturally occurring antisense transcripts according to at least one criterion selected from the group consisting of sequence annotation, sequence information, intron splice consensus site, intron sharing, sequence overlap, rare restriction site , poly(T) head, poly(A) tail, and poly(A) signal.
  • a length of the at least one oligonucleotide is selected from a range of 15-200 nucleotides.
  • the at least one oligonucleotide is a single stranded oligonucleotide.
  • the at least one oligonucleotide is a double stranded oligonucleotide. According to still further features in the described preferred embodiments a guanidine and cytosine content of the at least one oligonucleotide is at least 25 %.
  • the at least one oligonucleotide is labeled. According to still further features in the described preferred embodiments the at least one oligonucleotide is attached to a solid substrate.
  • the solid substrate is configured as a microarray and whereas the at least one oligonucleotide includes a plurality of oligonucleotides each attached to the microarray in a regio-specific manner.
  • a length of each of the first and second oligonucleotides is selected from a range of 15-200 nucleotides.
  • the first and second oligonucleotides are single stranded oligonucleotides.
  • the first and second oligonucleotides are double stranded oligonucleotide. According to still further features in the described preferred embodiments a guanidine and cytosine content of each of the first and second oligonucleotides is at least 25 %.
  • first and second oligonucleotides are labeled.
  • the first and second oligonucleotides are attached to a solid substrate.
  • the solid substrate is configured as a microarray and whereas each of the first and second oligonucleotides includes a plurality of oligonucleotides each attached to the microarray in a regio-specific manner.
  • a method of identifying a novel drug target comprising: (a) determining expression level of at least one naturally occurring antisense transcript of interest in cells characterized by an abnormal phenotype; and (b) comparing the expression level of the at least one naturally occurring antisense transcript of interest in the cells characterized by an abnormal phenotype to an expression level of the at least one naturally occurring antisense transcript of interest in cells characterized by a normal phenotype, to thereby identify the novel drug target.
  • the abnormal phenotype of the cells is selected from the group consisting of biochemical phenotype, morphological phenotype and nutritional phenotype.
  • determining expression level of at least one naturally occurring antisense transcript of interest is effected by at least one oligonucleotide designed and configured so as to be complementary to a sequence region of the at least one naturally occurring antisense transcript of interest, the sequence region not being complementary with a naturally occurring mRNA transcript.
  • a method of treating or preventing a disease, condition or syndrome associated with an upregulation of a naturally occurring antisense transcript complementary to a naturally occurring mRNA transcript comprising administering a therapeutically effective amount of an agent for regulating expression of the naturally occurring antisense transcript.
  • the agent for regulating expression of the naturally occurring antisense transcript is at least one oligonucleotide designed and configured so as to hybridize to a sequence region of the at least one naturally occurring antisense transcript.
  • the at least one oligonucleotide is a ribozyme.
  • the at least one oligonucleotide is a sense transcript.
  • a method of diagnosing a disease, condition or syndrome associated with a substandard expression ratio of an mRNA of interest over a naturally occurring antisense transcript complementary to the mRNA of interest comprising: (a) quantifying expression level of the mRNA of interest and the naturally occurring antisense transcript complementary to the mRNA of interest; (b) calculating the expression ratio of the mRNA of interest over the naturally occurring antisense transcript complementary to the mRNA of interest, thereby diagnosing the disease, condition or syndrome.
  • the present invention successfully addresses the shortcomings of the presently known configurations by providing a novel approach for identifying naturally occurring antisense transcripts, methods of designing artificial antisense transcripts according to information derived therefrom and methods and kits using naturally occurring and synthetic antisense transcripts.
  • FIG. 1 illustrates EST alignment along genomic DNA, generated according to the teachings of the present invention. Alignment results identify two strand groups of transcripts i.e., sense transcripts and antisense transcripts with an indicated sequence overlap.
  • FIG. 2 illustrates a system designed and configured for generating a database of naturally occurring antisense sequences generated according to the teachings of the present invention.
  • FIG. 3 illustrates a remote configuration of the system described in Figure 2.
  • FIGs. 4a-k are sequence alignments of overlapping regions of selected naturally occurring antisense and sense sequence pairs identified according to the teachings of the present invention.
  • FIGs. 5a-g are sequence alignments of overlapping regions of selected naturally occurring antisense and sense sequence pairs identified according to the teachings of the present invention.
  • FIG. 6 schematically illustrates two transcription products of 53BP1 gene (red and green) and their corresponding partial complementary antisense transcripts of the 76p gene (blue). Numbers in parenthesis indicate length of sequence complementation. Schematic location of strand-specific RNA probes used for northern blotting of sense (53BP1, Riboprobe#l) and antisense (76p, Riboprobe#2) transcripts is shown.
  • FIG. 7 is an autoradiogram of a northern blot analysis depicting cellular distribution and expression levels of 53BP1 transcripts. Arrows on the right indicate the molecular weight of the identified 53BP1 transcripts relative to the migration of 28S and 18S ribosomal RNA subunits.
  • FIG. 8 is an autoradiogram of a northern blot analysis depicting cellular distribution and expression levels of 76p transcripts. Arrows on the right indicate the molecular weight of the identified 76p transcripts relative to the migration of 28S and 18S ribosomal RNA subunits.
  • FIG. 9 is an autoradiogram of a northern blot analysis depicting tissue distribution and expression levels of 76p transcripts. Arrows on the right indicate the molecular weight of the identified 76p transcripts. Numbers on the left denote the migration of molecular weight marker in Kb.
  • FIG. 10 illustrates the genomic organization of the 53BP1 gene and 76p gene, as elucidated from the RT-PCR analysis presented in the Examples section hereinbelow.
  • Black arrows indicate the location of the primers used for RT-PCR analysis.
  • Asterisks denote stop codons.
  • FIG. 11 schematically illustrates two transcription products of CIDE-B gene and their corresponding partial complementary antisense transcript of the BLTR2 gene.
  • Schematic location of the strand-specific 430 nucleotide RNA probe used for northern analysis of sense (CIDE-B) and antisense (BLTR2) transcripts is shown. Dashed rectangles indicate the predicted coding sequence of the transcripts.
  • FIG. 12 is an autoradiogram of a northern blot analysis depicting cellular distribution and expression levels of BLTR2 transcripts. Arrows on the right indicate the molecular weight of the identified BLTR2 transcripts relative to the migration of 28S and 18S ribosomal RNA subunits. Numbers on the left denote the size of molecular weight markers in Kb.
  • FIG. 13 shows autoradiogram of a northern blot analysis depicting cellular distribution and expression levels of CIDE-B transcripts. Arrows on the right indicate the molecular weight of the identified CIDE-B transcripts relatively to the migration of 28S and 18S ribosomal RNA subunits. Numbers on the left denote the migration size of molecular weight markers in Kb.
  • FIG. 14 schematically illustrates a transcription product of APAF-1 gene and its corresponding partial complementary antisense transcripts of the EB-1 gene.
  • Schematic location of the strand-specific 366 nucleotide RNA probe used for northern analysis of sense (APAF-1) and antisense (EB-1) transcripts is shown. Asterisks indicate the predicted coding sequence borders of the transcripts.
  • FIGs. 15a-b are autoradiograms of northern blot analyses depicting cellular distribution and expression levels of EB-1 ( Figure 15 a) and APAF-1 transcripts ( Figure 15b). Numbers on the left denote the size of molecular weight marker in Kb.
  • FIG. 16 schematically illustrates a transcription product of the MINK-2 gene and its corresponding partial complementary antisense transcript of the AchR- ⁇ gene. Schematic location of the strand-specific 280 nucleotide RNA probe used for northern analysis of sense (Mink-2) and antisense (AchR- ⁇ ) transcripts is shown.
  • FIGs. 17a-b are autoradiograms of northern blot analyses depicting cellular distribution and expression levels of AchR- ⁇ antisense transcripts (Figure 17a) and the sense complementary transcript of Mink-2 ( Figure 17b). Arrows on the right denote the migration of molecular weight markers in Kb.
  • FIG. 18 schematically illustrates a transcription product of Cyclin-E2 gene and its corresponding partial complementary antisense transcript. Schematic location of strand-specific RNA probes used for northern blotting of sense (Riboprobe#l) and antisense (Riboprobe#2) transcripts is shown.
  • FIGs. 19a-b are autoradiograms of northern blot analyses depicting cellular distribution and expression levels of Cyclin E2 antisense transcript (Figure 19a) and the sense complementary transcript ( Figure 19b). Arrows on the left denote the migration of molecular weight markers in Kb.
  • FIG. 20 illustrates results from RT-PCR analysis of the expression patterns of CIDE-B transcript and its complementary naturally occurring antisense transcript following concentration dependent induction of apoptosis.
  • Lanes (1) 50 ⁇ M etoposide; (2) 100 ⁇ M etoposide; (3) 250 ⁇ M etoposide; (4) 500 ⁇ M etoposide; (5) 10 nM staurosporine; (6) 100 nM staurosporine; (7) 250 nM staurosporine; (8) 1000 nM staurosporine; (9) untreated cells (UT).
  • FIGs. 21a-c are results of RT-PCR analyses depicting expression patterns of AchR ⁇ and its naturally occurring antisense transcript following time-dependent induction of differentiation.
  • Figure 21a illustrates the position of riboprobes used for reverse transcription reaction.
  • Figure 21b shows the reciprocal expression pattern of sense and antisense transcripts (indicated by arrows).
  • Figure 21c shows the expression pattern of the antisense transcript alone.
  • FIGs. 22a-j illustrate results of northern blot analysis of sense/antisense clusters revealing positive signals for sense/antisense genes in the microarray analysis.
  • Diagrams describing genomic organization of the relevant region for each of the sense/antisense clusters are included above the autoradiograms, and regions of overlap (including GenBank accession number) from which the strand-specific riboprobes were derived are included.
  • Sense-antisense pair numbers are as they appear in the microarray (as depicted in Table S2 on the attached CD-ROM3 and in conversion Table 6).
  • Figure 22a reveals expression patterns of randomly selected sequence pair number 235, denoted as Rand_235 in Table 6.
  • Figure 22b corresponds to pair number 173, Figure 22c to pair number 248, Figure 22d to pair number 6, Figure 22e to pair number 216, Figure 22f to pair number 239, Figure 22g to pair number 202, Figure 22h to pair number 114, Figure 22i to pair number 188, and Figure 22j to pair number 223.
  • FIG. 23 is a Table depicting expression patterns in various cell lines and tissues as probed with a subset of 264 pairs from the putative sense/antisense dataset of the present invention.
  • the pairs are denoted by the pair number and described in Table SI of CD-ROM3.
  • C and “AC” denote the two counterpart probes.
  • Expression was also verified for positive controls, including the ubiquitously expressed genes gapdh, actin, hsp70 and gnbHl in various concentrations, and 11 previously documented sense/antisense pairs. Expression thresholds were verified and indicated as "+”, if the probe passed the threshold in at least one cell line or tissue or "-”, if the probe did not pass the threshold in all experiments.
  • the present invention is of methods of identifying naturally occurring antisense transcripts, which can be used in kits and methods for quantifying gene expression levels.
  • the antisense molecules and related oligonucleotides generated according to information derived therefrom of the present invention can be used to detect, quantify, or specifically regulate antisense and respective sense transcripts thereby enabling detection and treatment of a wide range of disorders.
  • oligonucleotide refers to a single stranded or double stranded oligomer or polymer of ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) or mimetics thereof.
  • oligonucleotides composed of naturally-occurring bases, sugars and covalent internucleoside linkages (e.g., backbone) as well as oligonucleotides having non-naturally-occurring portions, which function similarly. Such modified or substituted oligonucleotides are often preferred over native forms because of desirable properties such as, for example, enhanced cellular uptake, enhanced affinity for nucleic acid target and increased stability in the presence of nucleases.
  • antisense refers to a complementary strand of an mRNA transcript e.g., antisense RNA.
  • naturally occurring antisense transcripts refers to RNA transcripts encoded from an antisense strand of the DNA. These endogenous transcript exhibit at least partial complementarity to mRNA transcripts transcribed from the sense strand of a DNA, also termed sense transcripts, cis- encoded naturally occurring antisense transcripts are transcribed from the same locus as the sense transcripts. tr ⁇ / ⁇ -encoded antisense transcripts are transcribed from a different locus than the respective sense transcripts.
  • antisense strand or "anticoding strand” refers to a strand of
  • DNA which serves as a template for mRNA transcription and as such is complementary to the mRNA transcript formed.
  • sense strand or "coding strand” refers to the strand of
  • cDNA complementary DNA
  • ense oriented polynucleotides refers to polynucleotide sequences of a complementary or genomic DNA. Such polynucleotide sequences can be from exon regions, in which case they can encode mRNAs or portions thereof, or from intron regions, in which case they typically do not encode mRNA or portions thereof.
  • the term "contig” refers to a series of overlapping sequences with sufficient identity to create a longer contiguous sequence.
  • the term “cluster” refers to a plurality of contigs all derived, with a high degree of probability, from a single gene. Clusters are generally formed based upon a specified degree of homology and overlap (e.g., a stringency). The different contigs in a cluster do not typically represent the entire sequence of the gene, rather the gene may comprise one or more unknown intervening sequences between the defined contigs.
  • the phrase "open reading frame” (ORF) refers to a nucleotide sequence, which could potentially be translated into a polypeptide. Such a stretch of sequence is uninterrupted by a stop codon.
  • An ORF that represents the coding sequence for a full protein begins with an ATG "start” codon and terminates with one of the three “stop” codons.
  • an ORF may be any part of a coding sequence, with or without start and/or stop codons.
  • a minimum size requirement is often set, for example, a stretch of DNA that would code for a protein of 50 amino acids or more.
  • ORF is not usually considered an equivalent to a gene or locus until a phenotype is associated with a mutation in the ORF, an mRNA transcript for a gene product generated from the ORF's DNA has been detected, and/or the ORF's protein product has been identified.
  • annotation refers to a functional or structural description of a sequence, which may include identifying attributes such as locus name, poly(A)/poly(T) tail and/or signal, key words, Medline references and orientation cloning data.
  • Naturally occurring antisense molecules can play a role in sense transcription stability and function (e.g. translation). To date, most, if not all of the information relating to naturally occurring antisense transcripts was obtained by either low efficiency computational approaches (described hereinabove) or by approaches utilizing RNase protection assays, northern blot analysis, strand-specific RT PCR, subtractive hybridization, differential plaque hybridization, affinity chromatography, electrospray mass spectrometry and the like. These methods, though highly reliable, are extremely laborious, time consuming and are directed at individual target transcripts. As such, current approaches for uncovering antisense transcripts can be used to detect a negligible portion of the number of naturally occurring antisense molecules thought to exist. As described hereinunder and in the Examples section, which follows, the present invention provides a novel approach for systematically identifying naturally occurring antisense molecules.
  • the present method can be used to identify naturally occurring antisense molecules even in cases where the antisense transcriptional unit is localized to an intron of an expressed gene or to a different locus than the complementary sense encoding gene (e.g., trans- encoded antisense), or in cases where the antisense molecule lacks an open reading frame or appreciable complementarity to known sense molecules.
  • Antisense transcripts uncovered according to the teachings of the present invention can be used for detecting and accurately quantifying respective sense counterparts as well as for sensibly designing artificial antisense molecules suitable for down-regulation of sense counterparts.
  • the method according to this aspect of the present invention is effected by the following steps.
  • sense-oriented polynucleotide sequences of a first database are computationally aligned with expressed polynucleotide sequences of a second database.
  • expressed polynucleotide sequences are analyzed according to one or more criteria for their ability to hybridize or form a duplex or partial complementation with the sense-oriented polynucleotide sequences (further detailed hereinbelow and in the Examples section which follows).
  • Expressed polynucleotide sequences which are capable of forming a duplex with sense oriented sequences are considered as putative naturally occurring antisense molecules and as such can be stored in a database which can be generated by a suitable computing platform.
  • Final confirmation of computationally obtained putative naturally occurring antisense molecules can be effected either computationally or preferably by using suitable laboratorial methodologies, based on nucleotide hybridization including RNase protection assay, subtractive hybridization, differential plaque hybridization, affinity chromatography, electrospray mass spectrometry, northern analysis, RT-PCR and the like (for further details see the Examples section).
  • Expressed polynucleotide sequences used as a potential source for identifying naturally occurring antisense transcripts according to this aspect of the present invention are preferably libraries of expressed messenger RNA [i.e., expressed sequence tags (EST), cDNA clones, contigs, pre-mRNA, etc.] obtained from tissue or cell-line preparations which can include genomic and/or cDNA sequence.
  • expressed messenger RNA i.e., expressed sequence tags (EST), cDNA clones, contigs, pre-mRNA, etc.
  • Expressed polynucleotide sequences can be retrieved from pre-existing publicly available databases (i.e., GenBank database maintained by the National Center for Biotechnology Information (NCBI), part of the National Library of Medicine, and the TIGR database maintained by The Institute for Genomic Research) or private databases (i.e., the LifeSeq.TM and PathoSeq.TM databases available from Incyte Pharmaceuticals, Inc. of Palo Alto, CA).
  • GenBank database maintained by the National Center for Biotechnology Information (NCBI), part of the National Library of Medicine, and the TIGR database maintained by The Institute for Genomic Research
  • private databases i.e., the LifeSeq.TM and PathoSeq.TM databases available from Incyte Pharmaceuticals, Inc. of Palo Alto, CA.
  • sequence database of the expressed polynucleotide sequences utilized by the present invention can be generated from sequence libraries (e.g., cDNA libraries, EST libraries, mRNA libraries and others). cDNA libraries are suitable sources for expressed sequence information.
  • Generating a sequence database in such a case is typically effected by tissue or cell sample preparation, RNA isolation, cDNA library construction and sequencing.
  • cDNA libraries can be constructed from RNA isolated from whole organisms, tissues, tissue sections, or cell populations. Libraries can also be constructed from tissue reflecting a particular pathological or physiological state. Of particular interest are libraries constructed from sources associated with certain disease states, including malignant, neoplastic, hyperplastic tissues and the like.
  • sequences are selected and preferably annotated before stored in a database. Selection proceeds according to one or more sequence criterion, which will be further detailed hereinunder.
  • the editing, annotation and selection process is divided into two stages of processing. One stage comprises removal of repetitive, redundant or non- informative and contaminant sequences. The second stage involves selection of suitable candidates of putative naturally occurring antisense sequences.
  • Vector contamination - "chops" vector elements and linker motifs used for the process of cloning from desired expressed nucleotide sequences can be effected by screening manually updated databases of sequences included in commonly used expression or cloning vectors.
  • Contaminating sequences - includes sequences which are derived from an undesired source. Such sequences can be recognized by their nucleotide distribution and/or by homology searches such as alignment searches using any sequence alignment algorithm such as BLAST (Basic Local Alignment Search Tool, available through www.ncbi.nlm.nih.gov/BLAST) or the Smith- Waterman algorithm.
  • Other contaminating sequences may include sequences exhibiting high occurrence of di-nucleotide distribution mostly related to sequencing artifacts and ribosomal RNA sequences.
  • Repetitive elements and low complexity sequences eliminates or masks expressed sequences comprising known repetitive elements (ALU, LI etc.) and low complexity sequences (i.e., a di- or tri-nucleotide repeat). Such elimination is preferably effected by comparison with database of known repetitive elements. It will be appreciated that this type of selection is mostly species specific.
  • Masking of low complexity sequences can be effected by substituting an N (i.e., an inert character) for the actual nucleotide (i.e., G, A, T, or C).
  • N i.e., an inert character
  • Sequence length - preferred expressed sequences are of a length between 20-2000, preferably 20-1000, more preferably 20-500, most preferably 20-300 base pairs.
  • Sequence annotation - expressed sequences retrieved from external databases, i.e., GenBank oftentimes include an annotation which indicates direction of the sequencing of the insert clone (i.e., 5' or 3' direction).
  • Sequence annotation though "noisy" by nature due to multiple entries from various sources; artifacts taking place during directional cloning and incidence of palindromic eight-cutter restriction sites located at the end of the sequence, can serve as an important tool for deducing strand identity using dedicated computer software which are further discussed hereinunder
  • Intron splice site consensus sequence intron splice site sharing- intron sequences nearly always begin with a di-nucleotide sequence of GT ("splice donor") and end with an AG ("splice acceptor") preceded by a pyrimidine-rich tract.
  • This consensus sequence is part of the signal for splicing.
  • Intron splice site consensus sequence on the complementary strand e.g., antisense strand
  • expressed sequences having a GT...AG can be considered as sense-oriented sequences, while a CT...AC pattern is considered as an antisense oriented sequence. This selection criterion is very stringent since only negligible portions of introns have a CT...AC pattern.
  • Sequences that share a similar splicing pattern, as deduced by alignment to genomic data, may be considered as having the same sense orientation, also termed herein as "intron sharing". It will be appreciated by one skilled in the art that using these selection criteria requires a careful and accurate alignment of expressed sequences to genomic sequence.
  • Poly(A) tails and Poly(T) heads - most eukaryotic mRNA molecules contain a poly-adenylation [poly(A)] tail at their 3' end. This poly(A) tail is not encoded by DNA. Therefore an expressed sequence which has a poly(A) tail can be considered as sense oriented.
  • poly(T) heads which are not encoded from a genomic sequence indicate that a sequence is of the opposite direction, namely antisense oriented.
  • genomically encoded Poly(A) tails and poly(T) heads provide no information as to the sequence orientation.
  • Poly (A) signal - some mature mRNA transcripts contain internal
  • AAUAAA sequence This internal sequence is part of an endonuclease cleavage signal. Following cleavage by the endonuclease, a poly(A) polymerase adds about 250 A residues to the 3' end of the transcript. Hence, expressed sequences containing a poly(A) signal can be considered as sense oriented.
  • any endonuclease capable of digesting a palindromic sequence i.e., Xhol, Sail, Pad etc.
  • strand orientation is preferably effected using other parameters as well.
  • Sequence overlap - sequences that completely overlap are considered to have the same strand orientation.
  • the above described parameters are used individually or in combination to analyze the expressed polynucleotide sequences so as to select anti-sense oriented sequences.
  • Selection can be effected on the basis of a single criterion or several criteria considered individually or in combination.
  • a scoring system e.g., a scoring matrix, is preferably used.
  • Such a scoring matrix can list the various expressed polynucleotide sequences across the X-axis of the matrix while each criterion can be listed on the Y-axis of the matrix. Criteria include both a predetermined range of values from which a single value is selected from each sequence, and a weight. Each sequence is scored at each criterion according to its value and the weight of the criterion.
  • Genomic DNA or a portion thereof is preferably used as sense-oriented sequence data according to this aspect of the present invention. It is conceivable that the present invention can determine sense orientation and antisense orientation of a database of expressed sequences simply by computationally aligning the sequences of the expressed database onto the genome, and finding whether two complementary expressed sequences hybridize to the genome (e.g., virtually generate a double stranded portion thereof). Such two overlapping sequences constitute sense and naturally occurring antisense transcripts.
  • genomic DNA is preferred for the following reasons: (i) identifying trans-encoded antisense transcripts; (ii) analyzing intron splice consensus site and intron sharing; (iii) omitting genomically encoded poly(A) and poly(T) sequences; and (iv) analyzing sequences encompassing eight-cutter restriction sites.
  • Computational alignment of expressed polynucleotide sequences to the sense-oriented polynucleotide sequences can be effected using any commercially available alignment software, including sequence alignment tools utilizing algorithm such as BLAST (Basic Local Alignment Search Tool, available through www.ncbi.nlm.nih.gov/BLAST) or Smith- Waterman.
  • BLAST Basic Local Alignment Search Tool, available through www.ncbi.nlm.nih.gov/BLAST
  • Smith- Waterman Smith- Waterman.
  • Assembly software is preferably used according to this aspect of the present invention. Such software is of high value when complete genomic information is unavailable or when handling large amounts of expressed sequence data.
  • a number of commonly used computer software fragment read assemblers capable of forming clusters of expressed sequences are now available. These packages include but are not limited to, The TIGR Assembler [Sutton G. et al. (1995) Genome Science and Technology 1 :9-19], GAP [Bonf ⁇ eld JK. et al. (1995) Nucleic Acids Res. 23:4992-4999], CAP2 [Huang X. et al. (1996) Genomics 33:21-31], The Genome Construction Manager [Laurence CB. Et al.
  • Computer assembly and alignment programs can be modified to incorporate sequence criteria for determining sense or antisense orientation of expressed nucleotide sequences, as described hereinabove. Thereby, avoiding deliberate inversion of sequences during the assembly process, while ignoring the natural orientation of the sequences (i.e., sense or antisense orientation).
  • Figure 1 illustrates results of expressed sequence assembly against genomic data and final distinction between sense oriented transcripts and antisense oriented transcripts of a single gene.
  • duplexes are identified.
  • the term "duplex" is used herein to indicate that a sequence identified according to this aspect of the present invention is complementary to a sense-oriented polynucleotide sequence. Complementation may be to a portion of the sense sequence, i.e., a region thereof, or alternatively, to two or more non-contiguous regions, which may be separated by one or more nucleotides on the sense strand.
  • sense-antisense duplexes does not require 100 % complementation nor does it require participation of the entire sense/antisense transcript sequence.
  • the sense or antisense transcripts can have a secondary structure (e.g., stem and loop) generated by intra-sequence hybridization which can prevent specific sequence regions in the sense or antisense transcripts from participating in duplex formation.
  • the antisense of the sequence identified, according to this aspect of the present invention can be complementary to its sense counterparts in several regions, which are not necessarily close to each other when the sense transcript is in linear form.
  • the method of uncovering putative antisense transcripts of the present invention is preferably carried out using a dedicated computational system.
  • system 10 for generating a database of putative naturally occurring antisense sequences which system is referred to hereinunder as system 10.
  • System 10 includes a processing unit 12, which executes a software application designed and configured for aligning sense oriented polynucleotide sequences with expressed polynucleotide sequences and identifying expressed polynucleotide sequences which are capable of forming a duplex with the sense oriented polynucleotide sequences, thereby recognizing putative naturally occurring antisense transcripts.
  • System 10 may also include a user input interface 14 (e.g., a keyboard and/or a mouse) for inputting database or database related information, and a user output interface 16 (e.g., a monitor) for providing database information to a user.
  • a user input interface 14 e.g., a keyboard and/or a mouse
  • a user output interface 16 e.g., a monitor
  • System 10 preferably stores sequence information of the putative antisense transcripts identified thereby on a computer readable media such as a magnetic, optico-magnetic or optical disk to thereby generate a database of putative antisense transcript sequences.
  • a database further includes information pertaining to database generation (e.g., source library), parameters used for selecting polynucleotide sequences, putative uses of the stored sequences, and various other annotations and references which relate to the stored sequences or respective sense transcripts.
  • System 10 of the present invention may be used by a user to query the stored database of sequences, to retrieve nucleotide sequences stored therein or to generate polynucleotide sequences from user inputted sequences.
  • System 10 can be any computing platform known in the art including but not limited to, a personal computer, a work station, a mainframe and the like.
  • the database generated and stored by system 10 can be accessed by an on-site user of system 10, or by a remote user communicating with system 10.
  • Communication network 20 can be any private or public communication network including, but not limited to, a standard or cellular telephony network, a computer network such as the Internet or intranet, a satellite network or any combination thereof.
  • communication network 20 includes one or more communication servers 22 (one shown in Figure 3) which serves for communicating data pertaining to the polypeptide of interest between remote user 18 and processing unit 12.
  • any expressed polynucleotide sequence of interest can be "uploaded" by user 18 onto a Web site maintained by a database server 28.
  • database server 28 which serves as processing unit 12 can be instructed by the user to processes the polynucleotide as is described hereinabove.
  • nucleic acid sequence results can be displayed at the web site maintained by database server 28 and/or communicated back to site 24, via for example, e-mail communication.
  • a remote configuration of system 10 can provide polynucleotide sequence analysis services to a plurality of sites 24 (one shown in Figure 3).
  • system 10 of the present invention is especially advantageous in cases where polypeptide analysis can not be effected on-site.
  • laboratories which lack the equipment necessary for executing the analysis or lack the necessary skills to operate it.
  • data extracted from the database of naturally occurring antisense transcripts of the present invention is of high value for designing oligonucleotides suitable for transcript detection and quantification and for sensibly designing artificial antisense oligonucleotides for down-regulation and elimination of a transcript of interest or changing the balance between sense and complementary antisense transcripts.
  • the possibility of up-regulating a transcript of interest using naturally occurring antisense based-oligonucleotides generated according to the teachings of the present invention is also realized.
  • antisense technology is based upon the pairing of an artificially designed antisense oligonucleotide, with a target nucleic acid.
  • the use of antisense technology requires a complementarity of the antisense nucleotide sequence to a target zone of an mRNA target sequence that will effect inhibition of gene expression [reviewed in Stein CA. and Cohen JS. (1988) Cancer Res. 48:2659-68].
  • antisense technology relies on: (i) cellular uptake; (ii) stability of artificial antisense molecules under physiological conditions (i.e., cellular pH, endonucleases etc.); (iii) complementation between the oligonucleotide and a single stranded target sequence (i.e., tertiary structure of target RNA will not form a good target); (iv) binding specificity of antisense oligonucleotide so as not to compete with other RNA binders (e.g. proteins) to thereby maintain an effective antisense concentration.
  • RNA binders e.g. proteins
  • the method according to this aspect of the present invention is effected by the following steps.
  • First, structural and/or functional parameters pertaining to naturally occurring antisense transcripts are extracted/deduced from a database such as the one described hereinabove. These parameters may be generally deduced from all sequences stored in the database, or extracted from specific antisense sequences or preferably groups of antisense sequences.
  • Second, artificial antisense molecules of interest are designed according to the extracted parameters.
  • Topographical parameters (i) position of sequence overlap on the sense transcript (i.e., coding region, 5'UTR, 3'UTR); (ii) position of the sequence overlap on the antisense transcript (end overlap, middle overlap, full overlap), (iii) length of overall sequence overlap; (iv) continuity or discontinuity of sequence overlap.
  • Structural parameters - pertains to both sense and antisense transcripts
  • tertiary structure i.e., hairpin, helix, stem and loop, pseudoknot, and the like
  • tertiary structure i.e., hairpin, helix, stem and loop, pseudoknot, and the like
  • single stranded versus double stranded regions iii
  • GC content i.e., GC content
  • tandem Gs i.e., tandem Gs
  • adenosine/inosine content adenosine/inosine content
  • thermodynamic stability of tertiary structures i.e., duplex melting point; (viii) methylations and other RNA modifications; (ix) RNA-protein interactions ; and (x) transcript length.
  • each parameter is preferably weighted according to its importance. Due to the multi-factorial design of artificial antisense transcripts according to this aspect of the present invention, employing a scoring system (described hereinabove) is preferably used to simplify and increase the accuracy of the process.
  • Synthetic antisense oligonucleotides designed according to the teachings of the present invention can be generated according to any oligonucleotide synthesis method known in the art such as enzymatic synthesis or solid phase synthesis. Equipment and reagents for executing solid-phase synthesis are commercially available from, for example, Applied Biosystems. Any other means for such synthesis may also be employed; the actual synthesis of the oligonucleotides is well within the capabilities of one skilled in the art.
  • Oligonucleotides used according to this aspect of the present invention are those having a length selected from a range of 10 to about 200 bases preferably 15-150 bases, more preferably 20-100 bases, most preferably 20-50 bases.
  • the oligonucleotides of the present invention may comprise heterocylic nucleosides consisting of purines and the pyrimidines bases, bonded in a 3' to 5' phosphodiester linkage.
  • Preferably used oligonucleotides are those modified in either backbone, internucleoside linkages or bases, as is broadly described hereinunder. Such modifications can oftentimes facilitate oligonucleotide uptake and resistance to intracellular conditions.
  • oligonucleotides useful according to this aspect of the present invention include oligonucleotides containing modified backbones or non-natural internucleoside linkages. Oligonucleotides having modified backbones include those that retain a phosphorus atom in the backbone, as disclosed in U.S. Pat.
  • Preferred modified oligonucleotide backbones include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkyl phosphotriesters, methyl and other alkyl phosphonates including 3'-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3'-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having normal 3 '-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3'-5' to 5'-3' or 2'-5' to 5'-2'.
  • Various salts, mixed salts and free acid forms can also be used.
  • modified oligonucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages.
  • morpholino linkages formed in part from the sugar portion of a nucleoside
  • siloxane backbones sulfide, sulfoxide and sulfone backbones
  • formacetyl and thioformacetyl backbones methylene formacetyl and thioformacetyl backbones
  • alkene containing backbones sulfamate backbones
  • sulfonate and sulfonamide backbones amide backbones; and others having mixed N, O, S and CH2 component parts, as disclosed in U.S. Pat. Nos.
  • oligonucleotides which can be used according to the present invention, are those modified in both sugar and the internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with novel groups.
  • the base units are maintained for complementation with the appropriate polynucleotide target.
  • An example for such an oligonucleotide mimetic includes peptide nucleic acid (PNA).
  • PNA peptide nucleic acid
  • a PNA oligonucleotide refers to an oligonucleotide where the sugar-backbone is replaced with an amide containing backbone, in particular an aminoethylglycine backbone.
  • the bases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone.
  • Modified bases include but are not limited to other synthetic and natural bases such as 5-methylcytosine (5-me- C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6- methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2- thiocytosine, 5-halouraciI and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5 -uracil (pseudouracil), 4-thiouracil, 8-halo, 8- amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted
  • 5-substituted pyrimidines include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O- 6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5- propynylcytosine.
  • 5-methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2 °C. [Sanghvi YS et al. (1993) Antisense Research and Applications, CRC Press, Boca Raton 276-278] and are presently preferred base substitutions, even more particularly when combined with 2'-0- methoxyethyl sugar modifications.
  • oligonucleotides of the invention involves chemically linking to the oligonucleotide one or more moieties or conjugates, which enhance the activity, cellular distribution or cellular uptake of the oligonucleotide.
  • moieties include but are not limited to lipid moieties such as a cholesterol moiety, cholic acid, a thioether, e.g., hexyl-S-tritylthiol, a thiocholesterol, an aliphatic chain, e.g., dodecandiol or undecyl residues, a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethylammonium 1,2-di-O- hexadecyl-rac-glycero-3-H-phosphonate, a polyamine or a polyethylene glycol chain, or adamantane acetic acid, a palmityl mo
  • the present invention also includes antisense molecules, which are chimeric molecules.
  • "Chimeric” antisense molecules are oligonucleotides, which contain two or more chemically distinct regions, each made up of at least one nucleotide. These oligonucleotides typically contain at least one region wherein the oligonucleotide is modified so as to confer upon the oligonucleotide increased resistance to nuclease degradation, increased cellular uptake, and/or increased binding affinity for the target polynucleotide. An additional region of the oligonucleotide may serve as a substrate for enzymes capable of cleaving RNA:DNA or RNA:RNA hybrids.
  • RNase H which is a cellular endonuclease which cleaves the RNA strand of an RNA:DNA duplex.
  • Activation of RNase H therefore, results in cleavage of the RNA target, thereby greatly enhancing the efficiency of oligonucleotide inhibition of gene expression. Consequently, comparable results can often be obtained with shorter oligonucleotides when chimeric oligonucleotides are used, compared to phosphorothioate deoxyoligonucleotides hybridizing to the same target region.
  • RNA target can be routinely detected by gel electrophoresis and, if necessary, associated nucleic acid hybridization techniques known in the art.
  • Chimeric antisense molecules of the present invention may be formed as composite structures of two or more oligonucleotides, modified oligonucleotides, as described above.
  • Representative U.S. patents that teach the preparation of such hybrid structures include, but are not limited to, U.S. Pat. Nos. 5,013,830; 5,149,797; 5,220,007; 5,256,775; 5,366,878; 5,403,711; 5,491,133; 5,565,350; 5,623,065; 5,652,355; 5,652,356; and 5,700,922, each of which is herein fully incorporated by reference.
  • chimeric oligonucleotides of the present invention can comprise a ribozyme sequence.
  • Ribozymes are being increasingly used for the sequence- specific inhibition of gene expression by the cleavage of mRNAs.
  • Several ribozyme sequences can be fused to the oligonucleotides of the present invention. These sequences include but are not limited ANGIOZYME specifically inhibiting formation of the VEGF-R (Vascular Endothelial Growth Factor receptor), a key component in the angiogenesis pathway, and HEPTAZYME, a ribozyme designed to selectively destroy Hepatitis C Virus (HCV) RNA, (Ribozyme Pharmaceuticals, Incorporated - WEB home page).
  • HCV Hepatitis C Virus
  • oligonucleotides generated according to the teachings of the present invention can be used for both diagnostic and therapeutic purposes.
  • oligonucleotides of the present invention can be used to diagnose and treat a variety of diseases or pathological conditions associated with an abnormal expression (i.e., up-regulation or down-regulation) of at least one mRNA molecule of interest, including but not limited to diabetes, autoimmune diseases, Parkinson, Alzheimer' disease, HIV, malaria, cholera, influenza, rabies, diphtheria, breast cancer, colon cancer, cervical cancer, melanoma, lung cancer, ovarian cancer, pancreatic cancer, prostate cancer, lymphomas, leukemias and the like and any other diseases (see Example 8 of the Examples section) which are associated with aberrant expression of multiple mRNAs (i.e., sense and/or antisense) or with unregulated formation of endogenous double stranded RNA complexes.
  • oligonucleotide probes which are complementary to one or more regions of the mRNA to be quantitated. Such probes are designed while considering interspecies sequence variation, sequence length, GC content etc. However design of such prior art probes (i.e., riboprobes or deoxyriboprobes) does not take into consideration the presence of antisense transcripts which can effect probe binding efficiency.
  • Discounting antisense presence can lead to inaccurate diagnosis, which is oftentimes followed by an erroneous treatment protocol.
  • the present invention provides an mRNA-detection/quantification assay, which is devoid of this limitation.
  • a method of quantifying at least one mRNA of interest in a biological sample is provided.
  • biological sample refers to any sample derived from biological tissues or fluids, including blood (serum or plasma), sputum, pleural effusions, urine, biopsy specimens, isolated cells and/or cell membrane preparation. Methods of obtaining tissue biopsies and body fluids from mammals are well known in the art.
  • the method of this aspect of the present invention is effected by contacting mRNA from a cell type or within a cell with one or more oligonucleotides that hybridizes efficiently with a sequence region of an mRNA transcript which is not complementary with a naturally occurring antisense transcript.
  • prior art diagnostic/detection assays also fail to consider the effect of antisense transcription on the protein expression levels of a gene of interest. It stands to reason that presence of antisense transcripts in a biological sample can substantially reduce the resultant protein levels translated from a complementary sense transcript. Consistently, diseases which are associated with endogenous dsRNA complexes, are also very difficult to detect and moreover to treat, due to insufficient sequence data pertaining to duplex forming transcripts.
  • both the sense and antisense levels must be quantified and/or their respective expression ratio must be determined.
  • a biological sample By contacting a biological sample with one or more pairs of oligonucleotides, where one oligonucleotide is capable of hybridizing with the mRNA of interest and the second oligonucleotide is capable of hybridizing with a naturally occurring antisense transcript which is complementary with the mRNA of interest such accurate quantification can be effected.
  • Contacting the oligonucleotides of the present invention with the biological sample is effected by stringent, moderate or mild hybridization (as used in any polynucleotide hybridization assay such as northern blot, dot blot,
  • stringent hybridization is effected by a hybridization solution of 6 x SSC and 1 % SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6),
  • 0.5 % SDS at 1 - 1.5 °C below the Tm moderate hybridization is effected by a hybridization solution of 6 x SSC and 0.1 % SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 0.5 % SDS, 100 mg/ml denatured salmon sperm DNA and 0.1 % nonfat dried milk, hybridization temperature of 2 - 2.5 °C below the Tm, final wash solution of 3 M TMACI,
  • the oligonucleotides of the present invention can be attached to a solid substrate, which may consist of a particulate solid phase such as nylon filters, glass slides or silicon chips [Schena et al. (1995) Science 270:467-470].
  • a solid substrate which may consist of a particulate solid phase such as nylon filters, glass slides or silicon chips [Schena et al. (1995) Science 270:467-470].
  • oligonucleotides of the present invention can be attached to a solid substrate, which is designed as a microarray.
  • Microarrays are known in the art and consist of a surface to which probes that correspond in sequence to gene products (e.g., cDNAs, mRNAs, cRNAs, polypeptides, and fragments thereof), can be specifically hybridized or bound at a known position (regiospecificity).
  • cDNAs e.g., cDNAs, mRNAs, cRNAs, polypeptides, and fragments thereof
  • Several methods for attaching the oligonucleotides to a microarray are known in the art including but not limited to glass-printing, described generally by Schena et al., 1995, Science 270:467-47, photolithographic techniques
  • quantifying hybridization complexes is well known in the art and may be achieved by any one of several approaches. These approaches are generally based on the detection of a label or marker, such as any radioactive, fluorescent, biological or enzymatic tags or labels of standard use in the art.
  • a label can be applied on either the oligonucleotide probes or nucleic acids derived from the biological sample.
  • oligonucleotides of the present invention can be labeled subsequent to synthesis, by incorporating biotinylated dNTPs or rNTP, or some similar means (e.g., photo-cross-linking a psoralen derivative of biotin to RNAs), followed by addition of labeled streptavidin (e.g., phycoerythrin-conjugated streptavidin) or the equivalent.
  • biotinylated dNTPs or rNTP or some similar means (e.g., photo-cross-linking a psoralen derivative of biotin to RNAs)
  • streptavidin e.g., phycoerythrin-conjugated streptavidin
  • fluorescein, lissamine, phycoerythrin, rhodamine (Perkin Elmer Cetus), Cy2, Cy3, Cy3.5, Cy5, Cy5.5, Cy7, FluorX (Amersham) and others [ e.g., Kricka et al. (1992), Academic Press San Diego, Calif] can be attached to the oligonucleotides.
  • pairs of fluorophores are chosen when distinction between two emission spectra of two oligonucleotides is desired or optionally, a label other than a fluorescent label is used.
  • a radioactive label or a pair of radioactive labels with distinct emission spectra, can be used [Zhao et al. (1995) Gene 156:207].
  • fluorophores rather than radioisotopes is more preferred.
  • the intensity of signal produced in any of the detection methods described hereinabove may be analyzed manually or using a software application and hardware suited for such purposes.
  • mRNA quantification is preferably effected alongside a calibration curve so as to enable accurate mRNA determination.
  • quantifying transcript(s) originating from a biological sample is preferably effected by comparison to a normal sample, which sample is characterized by normal expression pattern of the examined transcript(s).
  • the detection method described above can also be used for quantifying at least one naturally occurring antisense transcript in a biological sample.
  • the oligonucleotide used for quantification is designed to hybridize with a sequence region of naturally occurring antisense transcript of interest, which is not complementary with a naturally occurring mRNA transcript.
  • the diagnostic assays described hereinabove can be used to accurately distinguish between absence, presence and excess expression of any transcripts of interest (e.g., sense, antisense), and to monitor their level during therapeutic intervention. These methods are also capable of diagnosing diseases associated with an improper balance or ratio between sense and antisense expression and diseases associated with endogenous dsRNA.
  • transcripts of interest e.g., sense, antisense
  • oligonucleotide-pair arrays are provided in Example 9 of the Examples section which follows.
  • oligonucleotides of the present invention can be also used for therapeutic purposes, such as treating diseases or conditions associated with aberrant expression levels of one or more sense and/or antisense transcripts and conditions, which are associated with endogenous dsRNA such as unregulated formation of double-strand RNA (i.e., up/down- regulation).
  • Accumulative knowledge shows strong correlation between a variety of human diseases and mutations, over-expression and function of the protein building blocks (i.e., protein kinases, phosphatsases) and their effectors and regulators, which constitute numerous intracellular signaling pathways.
  • protein building blocks i.e., protein kinases, phosphatsases
  • effectors and regulators which constitute numerous intracellular signaling pathways.
  • inactivation of both copies of ZAP-70 or Jak-3 causes severe combined immunodeficiency and mutation of the X-linked BTK gene results in agammaglobulinemia.
  • Many genetic disorders are also associated with mutations for example, in protein-serine kinases (PSKs) and phosphatases.
  • PSKs protein-serine kinases
  • the Coffin-Lowry syndrome results from inactivation of the X-linked Rsk2 gene, and myotonic dystrophy is due to decreased levels of expression of the myotonic dystrophy PSK.
  • over-expression of ErbB2 receptor tyrosine kinase is implicated in breast and ovarian carcinoma [reviewed by Hunter T. (2000) Cell 100:113-127] .
  • treating refers to alleviating or diminishing a symptom associated with the disease or the condition.
  • treating cures, e.g., substantially eliminates, and/or substantially decreases, the symptoms associated with the diseases or conditions of the present invention.
  • the treatment method according to the teachings of the present invention includes administering to an individual a therapeutically effective amount of the synthetic antisense oligonucleotides of the present invention.
  • Preferred individual subjects according to the present invention are mammals such as canines, felines, ovines, porcines, equines, bovines, humans and the like. 15
  • a therapeutically effective amount implies an amount of agent effective to prevent, alleviate or ameliorate symptoms of disease or prolong the survival of the individual being treated
  • the agent of the method of the present invention can be administered to an individual per se, or as part of a pharmaceutical composition where it is 0 mixed with a pharmaceutically acceptable carrier.
  • a "pharmaceutical composition” refers to a composition of one or more of the agents described hereinabove, or physiologically acceptable salts or prodrugs thereof, with other chemical components.
  • the purpose of a pharmaceutical composition is to facilitate administration of a 5 compound to an organism.
  • compositions of the present invention may be administered in a number of ways depending upon whether local or systemic treatment is desired and upon the area to be treated. Administration may be topical (including ophthalmic and to mucous membranes including vaginal and 0 rectal delivery), pulmonary, e.g., by inhalation or insufflation of powders or aerosols, including by nebulizer; intratracheal, intranasal, epidermal and transdermal), oral or parenteral. Parenteral administration includes intravenous, intraarterial, subcutaneous, intraperitoneal or intramuscular injection or infusion; or intracranial, e.g., intrathecal or intraventricular, administration. Oligonucleotides with at least one 2'-0-methoxyethyl modification are believed to be particularly useful for oral administration.
  • compositions and formulations for topical administration may include transdermal patches, ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and powders.
  • Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable.
  • Coated condoms, gloves and the like may also be useful.
  • compositions and formulations for oral administration include powders or granules, suspensions or solutions in water or non-aqueous media, capsules, sachets or tablets. Thickeners, flavoring agents, diluents, emulsifiers, dispersing aids or binders may be desirable.
  • compositions and formulations for parenteral, intrathecal or intraventricular administration may include sterile aqueous solutions which may also contain buffers, diluents and other suitable additives such as, but not limited to, penetration enhancers, carrier compounds and other pharmaceutically acceptable carriers or excipients.
  • compositions of the present invention include, but are not limited to, solutions, emulsions, and liposome-containing formulations. These compositions may be generated from a variety of components that include, but are not limited to, preformed liquids, self-emulsifying solids and self-emulsifying semisolids.
  • the pharmaceutical formulations of the present invention may be prepared according to conventional techniques well known in the pharmaceutical industry. Such techniques include the step of bringing into association the active ingredients with the pharmaceutical carrier(s) or excipient(s). In general the formulations are prepared by uniformly and intimately bringing into association the active ingredients with liquid carriers or finely divided solid carriers or both, and then, if necessary, shaping the product.
  • compositions of the present invention may be formulated into any of many possible dosage forms such as, but not limited to, tablets, capsules, liquid syrups, soft gels, suppositories, and enemas.
  • the compositions of the present invention may also be formulated as suspensions in aqueous, non-aqueous or mixed media.
  • Aqueous suspensions may further contain substances which increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran.
  • the suspension may also contain stabilizers.
  • the pharmaceutical compositions may be formulated and used as foams.
  • Pharmaceutical foams include formulations such as, but not limited to, emulsions, microemulsions, creams, jellies and liposomes. While basically similar in nature these formulations vary in the components and the consistency of the final product.
  • the preparation of such compositions and formulations is generally known to those skilled in the pharmaceutical and formulation arts and may be applied to the formulation of the compositions of the present invention.
  • the pharmaceutical compositions of the present invention may employ various penetration enhancers to effect the efficient delivery of nucleic acids, particularly oligonucleotides, to the skin of animals.
  • Penetration enhancers may be classified as belonging to one of five broad categories, i.e., surfactants, fatty acids, bile salts, chelating agents, and non-chelating non-surfactants [Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems (1991) 92] as disclosed in U.S. Pat. No: 6,300,132, 6,271,030, 6,277,633, 6,284,538, 6,287,860, 6,294,382, 6,277,640 and 6,258,601 each of which is herein fully inco ⁇ orated by reference.
  • cationic lipids such as lipofectin [U.S. Pat. No. 5,705,188], cationic glycerol derivatives, and polycationic molecules, such as polylysine [PCT Application. WO 97/30731] are also known, to enhance the cellular uptake of oligonucleotides.
  • reagents may be utilized to enhance the penetration of the administered nucleic acids, including glycols such as ethylene glycol and propylene glycol, pyrrols such as 2-pyrrol, azones, and te ⁇ enes such as limonene and menthone.
  • glycols such as ethylene glycol and propylene glycol
  • pyrrols such as 2-pyrrol
  • azones such as 2-pyrrol
  • te ⁇ enes such as limonene and menthone.
  • compositions of the present invention may also inco ⁇ orate carrier compounds.
  • carrier compound or “carrier” can refer to a nucleic acid, or analog thereof, which is inert (i.e., does not possess biological activity per se) but is recognized as a nucleic acid by in vivo processes that reduce the bioavailability of a nucleic acid having biological activity by, for example, degrading the biologically active nucleic acid or promoting its removal from circulation.
  • a nucleic acid and a carrier compound can result in a substantial reduction of the amount of nucleic acid recovered in the liver, kidney or other extracirculatory reservoirs, presumably due to competition between the carrier compound and the nucleic acid for a common receptor.
  • the recovery of a partially phosphorothioate oligonucleotide in hepatic tissue can be reduced when it is coadministered with polyinosinic acid, dextran sulfate, polycytidic acid or 4-acetamido-4' isothiocyano-stilbene-2,2'-disulfonic acid [Miyao et al., Antisense Res.
  • an "excipient” is a pharmaceutically acceptable solvent, suspending agent or any other pharmacologically inert vehicle for delivering one or more nucleic acids to an animal.
  • the excipient may be liquid or solid and is selected, with the planned manner of administration in mind, so as to provide for the desired bulk, consistency, etc., when combined with a nucleic acid and the other components of a given pharmaceutical composition.
  • Typical excipients include, but are not limited to, binding agents (e.g., pregelatinized maize starch, polyvinylpyrrolidone or hydroxypropyl methylcellulose, etc.); fillers (e.g., lactose and other sugars, microcrystalline cellulose, pectin, gelatin, calcium sulfate, ethyl cellulose, polyacrylates or calcium hydrogen phosphate, etc.); lubricants (e.g., magnesium stearate, talc, silica, colloidal silicon dioxide, stearic acid, metallic stearates, hydrogenated vegetable oils, corn starch, polyethylene glycols, sodium benzoate, sodium acetate, etc.); disintegrants (e.g., starch, sodium starch glycolate, etc.); and wetting agents (e.g., sodium lauryl sulphate, etc.).
  • binding agents e.g., pregelatinized maize starch, polyvinylpyrrolidone or
  • compositions of the present invention can also be used to formulate the compositions of the present invention.
  • suitable pharmaceutically acceptable carriers include, but are not limited to, water, salt solutions, alcohols, polyethylene glycols, gelatin, lactose, amylose, magnesium stearate, talc, silicic acid, viscous paraffin, hydroxymethylcellulose, polyvinylpyrrolidone and the like.
  • Formulations for topical administration of nucleic acids may include sterile and non-sterile aqueous solutions, non-aqueous solutions in common solvents such as alcohols, or solutions of the nucleic acids in liquid or solid oil bases.
  • the solutions may also contain buffers, diluents and other suitable additives.
  • Pharmaceutically acceptable organic or inorganic excipients suitable for non-parenteral administration, which do not deleteriously react with nucleic acids can be used.
  • Suitable pharmaceutically acceptable excipients include, but are not limited to, water, salt solutions, alcohol, polyethylene glycols, gelatin, lactose, amylose, magnesium stearate, talc, silicic acid, viscous paraffin, hydroxymethylcellulose, polyvinylpyrrolidone and the like.
  • compositions of the present invention may additionally contain other adjunct components conventionally found in pharmaceutical compositions, at their art-established usage levels.
  • the compositions may contain additional, compatible, pharmaceutically-active materials such as, for example, antipruritics, astringents, local anesthetics or anti-infla ⁇ imatory agents, or may contain additional materials useful in physically formulating various dosage forms of the compositions of the present invention, such as dyes, flavoring agents, preservatives, antioxidants, opacifiers, thickening agents and stabilizers.
  • additional materials useful in physically formulating various dosage forms of the compositions of the present invention such as dyes, flavoring agents, preservatives, antioxidants, opacifiers, thickening agents and stabilizers.
  • such materials when added, should not unduly interfere with the biological activities of the components of the compositions of the present invention.
  • the formulations can be sterilized and, if desired, mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, colorings, flavorings and/or aromatic substances and the like which do not deleteriously interact with the nucleic acid(s) of the formulation.
  • auxiliary agents e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, colorings, flavorings and/or aromatic substances and the like which do not deleteriously interact with the nucleic acid(s) of the formulation.
  • Aqueous suspensions may contain substances which increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran.
  • the suspension may also contain stabilizers.
  • compositions and their subsequent administration is believed to be within the skill of those in the art. Dosing is dependent on severity and responsiveness of the disease state to be treated, with the course of treatment lasting from several days to several months, or until a cure is effected or a diminution of the disease state is achieved. Optimal dosing schedules can be calculated from measurements of drug accumulation in the body of the patient. Persons of ordinary skill can easily determine optimum dosages, dosing methodologies and repetition rates. Optimum dosages may vary depending on the relative potency of individual oligonucleotides, and can generally be estimated based on EC50 found to be effective in in vitro and in vivo animal models.
  • the methods of the present invention have evident utility in the diagnosis and treatment of various diseases and conditions.
  • such methods can also be used in non-clinical applications, such as, for example, differential cloning, detection of rearrangements in DNA sequences as disclosed in U.S. Pat. No: 5,994,320, drug discovery and the like.
  • oligonucleotides generated according to the teachings of the present invention can be included in a diagnostic or therapeutic kit.
  • oligonucleotides sets pertaining to specific disease related transcripts can be packaged in a one or more containers with appropriate buffers and preservatives along with suitable instructions for use and used for diagnosis or for directing therapeutic treatment.
  • the containers include a label.
  • Suitable containers include, for example, bottles, vials, syringes, and test tubes.
  • the containers may be formed from a variety of materials such as glass or plastic.
  • additives such as stabilizers, buffers, blockers and the like may also be added.
  • Table 1 lists polynucleotide sequence pairs that were selected for the in-vitro expression validation assays described in examples 1-7.
  • RNA probes for northern analysis were generated by PCR amplification of a desired DNA fragment and cloning into Zero Blunt TOPO (Invitrogen Co ⁇ .) or pSPT 18/19 vectors (Roche Ltd.). Alternatively PCR products were ligated into T7 RNA polymerase promoter-containing adaptors using the Lignscribe kit (Ambion Europe Ltd.). Corresponding RNA transcripts were synthesized using T7 RNA polymerase (Roche Ltd.) and labeled with 32P-UTP according to manufacturer's instructions. RNA probes were purified on Mini Quick Spin RNA columns.
  • RNA per lane 2 ⁇ g RNA per lane
  • 2 ⁇ g of poly(A)-RNA prepared from various human cell- lines were electrophoretically separated on 1 % agarose gel, and electrotransferred to Nytran SuperCharge membrane (Schleicher & Schuell ) and subjected to fixing by UV radiation.
  • Membranes were stained with methylene blue to ensure quantitative RNA transfer.
  • Membranes were then prehybridized in a hybridization solution (UltraHyb solution Ambion Europe Ltd.) for 30 minutes at 68 °C in a rotating hybridization tube.
  • Hybridization solution was then supplemented with 106 cpm of labeled RNA probe per each ml of hybridization solution. Blots were hybridized for 16 hours at 68 °C in a rotating hybridization tube. Membranes were then washed twice with 2 x SSC, 0.1 % sodium dodecyl sulfate (SDS) and twice with 0.1 % SDS at 68 °C. RNA transcripts signals were detected using a phosphoimager (Molecular Dynamics, Sunnyvale CA). Microarray
  • Oligonucleotide design - oligonucleotide design tools (1) were applied to each pair of sense/antisense genes in order to select two complementary 60- mer oligonucleotides from the region where the two genes overlap.
  • the design criteria included the following: low cross-homology (up to 75%) to other expressed sequences in the human transcriptome; a continuous hit of no more than 17 bp to the sequence of another gene; balanced GC content (30-70%) without significant windows of local imbalance; no more than 2 palindromes with a length of 6 bp; a hit of no more than 15 bp to a repeat, vector or low- complexity region; and no long stretches of identical nucleotides.
  • Microarray preparation - 60-mer oligonucleotides were synthesized by Sigma-Genosys (The Woodlands, TX), resuspended at 40 ⁇ M in 3X SSC, and spotted in quadruplicates on poly-L-lysine coated glass slides as detailed in the online protocol of the National Human Genome Research Institute (http://www.nhgri.nih.gov/DIR/Microarray/Protocols.pdf). To avoid local differences in the hybridization conditions, the probes selected from the overlapping regions of each sense/antisense pair were spotted in the same block, next to each other.
  • Human cell lines The following cell lines utilized were purchased from ATCC (Manassas, VA): MCF7 (breast adenocarcinoma, Cat. No. HTB- 22,), HeLa (cervical adenocarcinoma, Cat. No. CCL-2) HEK-293 (embryonal kidney cells, Cat. No. CRL-1573), Jurkat (acute T-cell leukemia, Cat. No. TIB- 152), K-562 (chronic myelogenous leukemia, Cat. No. CCL-243), HepG2 (liver carcinoma, Cat. No. HB-8065), T24 (urinary bladder carcinoma, Cat. No.
  • HTB- 4 SK-N-DZ (neuroblastoma, Cat. No. CRL-2149), NK-92 (non-Hodgkin's lymphoma, Cat. No. CRL-2407), MG-63 (osteosarcoma, Cat. No. CRL-1427), DU 145 (prostatic carcinoma, Cat. No. HTB-81), G-361 (melanoma, Cat. No. CRL-1424), PANC-1 (pancreatic carcinoma, Cat. No. CRL-1469), ES-2 (ovary clear cell carcinoma, Cat. No. CRL-1978), Y79 (retinoblastoma, Cat. No. HTB- 18), HT-29 (colorectal adenocarcinoma, Cat.
  • HTB-38 H1299 (large cell lung carcinoma, Cat. No. CRL-5803), SNUl (gastric carcinoma, Cat. No. CRL-5971), NL564 (EBV-transformed human lymphoblasts) and MCF10 (benign rumor breast cells).
  • labeled cDNA was synthesized by reverse transcription of 0.5 ⁇ g of mRNA, in the presence of 100 pmol of random 9-mers, l ⁇ g of oligo(dT)20, IX RT buffer, 10 mM DTT, 3 nmol of Cy5- or Cy3 -conjugated dUTP, 0.5 mM of dATP, dGTP and dCTP, and 0.2 mM dTTP, in a final volume of 40 ⁇ l (Amersham). The reaction mixture was incubated for 5 minutes at 65 °C and cooled to 42 °C.
  • Hybridizations were performed in duplicate using fluorescent reversal of Cy3- and Cy5-labeled cDNA from test cell mRNAs and pooled mRNAs. Pairs of Cy5/Cy 3 -labeled cDNA samples were combined, and subsequently purified and concentrated to a final volume of 5-7 ⁇ l using a Microcon-30 (Millipore) concentrator.
  • Hybridization and washing conditions Microarray slides were prehybridized with 40 ⁇ l of 5X SSC, 0.1 % SDS and 1 % BSA for 30 min at 42 °C, washed for 2 minutes with double distilled water, then rinsed with isopropanol, and spun dried at 500 g for 3 minutes. Prior to hybridization, the labeled probe was combined with 10 ⁇ g of Cot- 1 DNA, 10 ⁇ g poly(dA)80, and 4 ⁇ g yeast tRNA, in a final volume of 15 ⁇ l. The mixture was denatured at 100 °C for 3 minutes and placed on ice.
  • Formamide final concentration 16 %), SSC (to 5X concentration) and 0.1 % SDS were added to a final volume of 30 ⁇ l.
  • the mixture was placed on the array under a glass cover slip in a tightly sealed hybridization chamber, and immersed in a water bath at 42 °C, for 16 hours.
  • Microarray slides were then washed for 4 minutes with 2X SSC, 0.1 % SDS; 4 minutes with IX SSC, 0.01 % SDS; 4 minutes with 0.2X SSC and 15 seconds with 0.05X SSC and spun dry by centrifugation for 3 minutes at 500g.
  • Image processing Following hybridization, arrays were scanned using a GenePix 4000B scanner (Axon Instruments, Union City, CA). Scanned array images were manually inspected and areas with visible artifacts or deformities were marked. Images were processed using GenePix Pro 3.0 (www.axon.com) software.
  • the intensity for each spot was calculated as its mean intensity minus the median background around the spot.
  • the signal for each oligo was calculated as the average of intensity values of the four redundant spots of each oligo. Normalization of the oligo signals was performed at several levels as is further described below.
  • Normalization of blocks was carried out in order to normalize the gradient of intensities within each slide.
  • an Ai parameter was calculated as the average of intensities of 56 positive control spots (oligonucleotide probes for the ubiquitously expressed housekeeping genes gapdh, actin, hsp70 and gnb211, in various probe concentrations).
  • An average A of all Ai averages was calculated.
  • the ratio Rjk for oligo j in cell-line k was calculated as the average of calculated ratios Rji between the two reciprocal experiments of the cell-line k. In cases where only one of the two reciprocal experiments showed an elevated or decreased ratio, while in the other the ratio was 1.0, the average Rjk was converted to 1.0.
  • the actual pool signal for each oligo was calculated to be the average of the normalized oligo signals in the pool channel of all experiments.
  • a virtual pool signal was calculated as the average of the normalized oligo signals in the cell-line channel of all experiments. The virtual pool signals were found to be very close to the actual pool signals, indicating consistency in the analysis.
  • Threshold determination To determine an expression threshold above, in which a normalized signal would be considered a 'positive' signal indicating expression, the distribution of all 16,512 normalized negative control signals and the standard deviation (neg-std-dev) were calculated. The neg-std-dev obtained was 38. An oligo j was considered 'present' in a cell-line k if Rjk x actual-pool-signalj > 4 x neg-std-dev.
  • the tumor suppressor p53 binding protein 1 (SEQ ID NO: 15) is one of the various p53 target proteins. It binds to the DNA-binding domain of p53 and enhances p53-mediated transcriptional activation.
  • 53BP1 is characterized by several structural motifs shared by several proteins involved in DNA repair and/or DNA damage-signaling pathways. 53BP1 becomes hype ⁇ hosphorylated and forms discrete nuclear foci in response to DNA damage induced by radiation and chemotherapy. Recent reports suggest that 53BP1 is an ataxia telangiectasia mutated (ATM) substrate that is involved early in the DNA damage-signaling pathways in mammalian cells, attributing a role to 53BP1 in the development of various mammalian pathologies.
  • ATM telangiectasia mutated
  • RNA-probes were generated. Schematic location of the probes used for sense and antisense validation (Riboprobe#l and Riboprobe#2, respectively SEQ ID NO: 17 and 18, respectively) is illustrated in Figure 6. These RNA probes were used to identify the corresponding full-length transcripts. As shown in Figure 7, Riboprobe#l detected two transcripts of approximately 6.3 Kb and 10.5 Kb, corresponding to the sense mRNA. The absolute levels of the short messenger were rather homogeneous in all cell-lines examined. The 10.5 Kb variant exhibited a more heterogenic pattern of cellular distribution, and was mostly expressed in K562, MG-63, 293 HEK and Hela cells. In general, the longer sense transcript which is an alternatively polyadenylated variant was markedly lower expressed in the various cell lines examined.
  • RT-PCR Reverse transcription amplification
  • CIDE-B Cell death inducing DFF45-like effector
  • BLTR2 was recently identified as a putative seven-transmembrane receptor with a high homology to the Leukotriene B (4) receptor [Tryselius Y. et al. (2000) Biochem. Biophys. Res. Commun. 274:377-82]. Although the mechanism of action of BLTR2 is poorly understood, it is conceivable that BLTR2 mRNA plays a role in the regulation of CIDE-B apoptotic effector and vice versa.
  • EXAMPLE 3 Identification of mRNA and complementary transcripts of the apoptosis inducing factor APAF-1 Background: A conserved series of events including cellular shrinkage, nuclear condensation, externalization of plasma membrane phosphatidyl serine, and oligonucleosomal DNA fragmentation characterizes apoptotic cell death. Regardless of the circumstance, induction and execution of apoptotic events require activation of caspases, a family of aspartate-specific cysteine proteinases. Caspase activation may be regulated by the mitochondrion and specifically by the apoptosome consisting of an oligomeric complex of apoptotic protease-activating factor- 1 (APAF-1), cytochrome C and dATP.
  • APAF-1 oligomeric complex of apoptotic protease-activating factor- 1
  • the apoptosome recruits and activates caspase-9, which in turn activates the executioner caspases, caspase-3 and -7.
  • the active executioners kill the cell by proteolysis of key cellular substrates [Zou H. et al. (1999) J. Biol. Chem. 274:11549-11556].
  • Evasion or inactivation of the mitochondrial apoptosis pathway may contribute to oncogenesis by allowing cell proliferation.
  • unregulated cell proliferation may occur by inactivation of APAF-1, which has been suggested to occur via genetic loss or inhibition by HSP-70 and HSP-90.
  • RNA transcript has been previously described for APAF-1 [ Zou H. et al. (1999) J. Biol. Chem. 274:11549-11556] (SEQ ID NO: 10) (SEQ ID NO: 24).
  • Computational search for natural antisense transcripts has revealed two complementary transcripts for APAF-1 messenger RNA (SEQ ID NOs: 25 and 26). These antisense transcripts include an open reading frame encoding the EB-1 gene (GenBank accession numbers AF145204; AF164792). The overlap between the APAF-1 messenger RNA and the longer antisense transcript is of at least 300 nucleotides.
  • RNA-probes of 366 ribonucleotides were generated (sense and antisense strands, respectively). Schematic location of the probes used for sense and antisense validation (Riboprobe#l and Riboprobe#2, SEQ ID NOs: 27 and 28, respectively) is illustrated in Figure 14.
  • the sense RNA probe directed at visualizing the antisense transcripts identified a clear band of 3 Kb corresponding to the long computationally retrieved antisense transcript as well as other transcripts sizing from 1 Kb to 8 Kb ( Figure 15a).
  • Transcripts were essentially found in all cell lines but especially in 293 HEK and LN-Cap lines.
  • the muscle nicotinic Acetylcholine Receptor ⁇ subunit encodes for one of five subunits of a ligand gated ion channel receptor located at the neuromuscular synapse.
  • AChR ⁇ is up-regulated in the postnatal period when it replaces ⁇ subunit of the receptor [Witzamann, V. et al., (1987) FEBS Lett. 223, 104-112]. It is also up-regulated in synapse development, specifically by the trophic factor neuregulin [Martinou J. C. (1991) Pro. Natl. Acad. Sci. USA 88, 7669-7673].
  • computational screen for AChR ⁇ K complementary transcript was carried out.
  • RNA probes corresponding to the overlap region in either antisense or sense orientation (SEQ ID NOs: 31 and 32, respectively).
  • an AChR ⁇ transcript was expressed as a predominant 4 Kb band and had the highest expression in the heart, kidney and brain while su ⁇ risingly only a limited expression was observed in the skeletal muscle.
  • AChR ⁇ and Mink genes are antisense each to one another with a significant overlap, and the fact that the two genes are co- expressed in some tissues (eg., brain) suggest the possibility that one of them may regulate the other under certain conditions.
  • the human cyclin E2 gene encodes a 404-amino-acid protein that is most closely related to cyclin E.
  • Cyclin E2 associates with Cdk2 in a functional kinase complex that is inhibited by both p27(Ki ⁇ l) and p21(Cipl).
  • the catalytic activity associated with cyclin E2 complexes is cell cycle regulated and peaks at the Gl/S transition.
  • Overexpression of cyclin E2 in mammalian cells accelerates cell-cycle progression. Unlike cyclin El, cyclin E2 levels are low to undetectable in nontransformed cells and increase significantly in tumor- derived cells suggesting specific mechanism of regulation. Results:
  • RNA transcript was found for cyclin E2 (SEQ ID NO: 33.
  • Computational search for natural antisense transcripts has revealed one complementary transcript for cyclin E2 messenger RNA (SEQ ID NO: 34).
  • the overlap between the cyclin E2 sense RNA and the antisense transcript is of at least 72 nucleotides.
  • Riboprobe#l detected two transcripts of approximately 3 Kb and 4.3 Kb.
  • the absolute levels of the transcripts were quite heterogenic in all cell-lines examined. Both transcripts were completely absent from the Ln Cap cell line, while significantly high expression was observed in MCF-7 and DLD-1 lines, especially of the short transcript.
  • CIDE-B 5 apoptosis inducing factor, CIDE-B (see Example 2 hereinabove), suggested that the latter may be regulated by its complementary transcript, thereby establishing a novel mechanism of regulation.
  • differential expression analysis of CIDE-B expression and its endogenous antisense transcript expression was performed following induction of apoptosis.
  • Monolayers of 293 cells were either left untreated (UT) or incubated with increasing concentrations of etoposide or staurosporine (Sigma IL).
  • RNA was extracted as 5 decribed hereinabove. Purified RNA was further treated with DNasel. A reverse transcription reaction were carried out with equivalent amounts of RNA in a final volume of 20 ⁇ l containing 100 pmol of the oligo(dT) primer, 250 ng of total RNA, 0.5 mM each of four deoxynucleoside triphosphates and 5 units of reverse transcriptase. The reaction mixture was incubated at 65 °C for 5 min, 0 42 °C for 50 min and 70 °C for 15 min.
  • PCR was carried out in a final volume of 25 ⁇ l containing 12.5 pmol each of the oligonucleotide primers derived of exons 3 and 7 of CIDE-B (SEQ ID NOs: 39 and 40), 1 ⁇ l of RT solution and 1.75 units of Taq polymerase.
  • Amplification was carried out by an initial denaturation step at 94 °C for 5 min followed by 35 cycles of [94 °C for 30 s, 68 5 °C for 30 s, and 68 °C for 130 min].
  • products were analyzed on agarose gels stained with ethidium bromide and visualized with UV light.
  • Results Amplification reaction yielded two major PCR products of 740 bp and 0 2285 bp ( Figure 20).
  • an increase of sense transcript, concomitant with a decrease of antisense transcript was observed following treatment with etoposide (lanes 1-4) as compared to untreated cells (lane 9), while no change was detected following staurosporine treatment (lanes 5-8).
  • the mouse nicotinic acetylcholine receptor, epsilon (mAchR ⁇ ) subunit (SEQ ID NO: 35) has a critical function in a variety of differentiation processes.
  • AchR ⁇ nicotinic acetylcholine receptor, epsilon subunit
  • Amplification reaction showed a gradual increase in AchR ⁇ transcript expression, concomitant with the differentiation state of the cells.
  • a second amplification product which corresponded to an unspliced transcript was seen in untreated cells and disappeared following induction of differentiation.
  • This fragment corresponds to a putative antisense transcript of the AchR ⁇ , and may represent an alternative 3' UTR of the Mink gene , of which the known transcript terminates 400 bp downstream to AchR ⁇ (see Example 4).
  • Another PCR reaction was carried out using antisense specific riboprobes F4 and R4 (SEQ ID NO: 43). Reverse transcription products of this amlification reaction showed a single band which corresponded to a naturally occurring antisense transcript of the AchR ⁇ .
  • EXAMPLE 8 A polynucleotide database of sequences corresponding to the naturally occurring antisense transcripts identified by the present invention and their complementary sense sequences
  • CD-ROM1 includes a "seq” text file which contains the actual polynucleotide sequences, and a “table” file which contains summarized data pertaining to each sense- antisense sequence pair.
  • CD-ROM2 includes an "augments” file which contains sequence alignments of sense and antisense overlapping regions.
  • CD-ROM3 contains Excel files: "Table SI” and “Table S2", further described in Example 9.
  • Table 3 below exemplifies the format of the Table provided in CD- ROM 1. Each row represents a pair of transcripts. The columns of Table 3 represent (from the left): the serial number of the pair, the name of the first transcript, its length in nucleotides, the name of the second transcript, its length in nucleotides, the number of base pairs that overlap between the two transcripts, offsets of overlap beginning at the first transcript, offsets of overlap beginning at the second transcript.
  • Transcript names are arbitrary designataions.
  • antisense transcripts identified by the present invention and disclosed in the enclosed CD-ROMs can be used to detect and/or treat a variety of diseases, disorders or conditions, examples of which are listed hereinunder.
  • antisense transcripts or sequence information derived therefrom can be used to construct microarray kits (described in details in the preferred embodiments section) dedicated to diagnosing specific diseases, disorders or conditions.
  • Proteins involved in the immune and complement systems including antigens and autoantigens, immunoglobulins, MHC and HLA proteins and their associated proteins can be used to diagnose/treat diseases involving the immunological system including inflammation, autoimmune diseases, infectious diseases, as well as cancerous processes.
  • Nucleotide binding proteins including antigens and autoantigens, immunoglobulins, MHC and HLA proteins and their associated proteins.
  • nucleic acid binding proteins can be used to diagnose/treat diseases involving dysregulated expression, activity or localization of nucleotide binding proteins.
  • RNA polymerase II transcription factors such as transcription factors, RNA and DNA binding proteins, zinc fingers, helicase, isomerase, histones, nucleases, ribonucleoproteins, transcription and translation factors and others can be used to diagnose/treat diseases involving DNA or RNA binding proteins such as: helicases, isomerases, histones and nucleases, for example diseases where there is non-normal replication or transcription of DNA and RNA respectively.
  • RNA binding proteins such as specific and non-specific RNA polymerase II transcription factors, enhancer binding, ligand-regulated transcription factor and general RNA polymerase II transcription factors can be used to diagnose/treat diseases involving RNA polymerase II transcription factors, for example disorders involving abnormal transcription of RNA.
  • RNA binding proteins such as specific and non-specific RNA polymerase II transcription factors, enhancer binding, ligand-regulated transcription factor and general RNA polymerase II transcription factors can be used to diagnose/treat diseases involving RNA polymerase II transcription factors, for example disorders involving abnormal transcription of RNA.
  • RNA binding proteins involved in splicing and translation regulation such as tRNA binding proteins, RNA helicases, double- stranded RNA and single-stranded RNA binding proteins, mRNA binding proteins, snRNA cap binding proteins, 5S RNA and 7S RNA binding proteins, poly-pyrimidine tract binding proteins, snRNA binding proteins, and AU- specific RNA binding proteins can be used to diagnose/treat diseases involving transcription and translation factors such as: helicases, isomerases, histones and nucleases, for example diseases where there is non-normal transcription, splicing, post-transcriptional processing, translation or stability of the RNA. Chaperones
  • Information derived from proteins such as ribosomal chaperone, peptidylprolyl isomerase, lectin-binding chaperone, nucleosome assembly chaperone, chaperonin ATPase, cochaperone, heat shock protein, HSP70 HSP90 organizing protein, fimbrial chaperone, metallochaperone, rubulin folding, HSC70-interacting protein can be used to diagnose/treat diseases involving pathological conditions, which are associated with non- normal protein activity or structure. Binding of the products of the proteins of this family, or antibodies reactive therewith, can modulate a plurality of protein activities as well as change protein structure.
  • actin binding proteins such as actin cross- linking, actin bundling, F-actin capping, actin monomer binding, actin lateral binding, actin depolymerizing, actin monomer sequestering, actin filament severing, actin modulating, membrane associated actin binding, actin thin filament length regulation and actin polymerizing proteins can be used to diagnose/treat diseases involving cytoskeletal malformations, aberrant cellular mo ⁇ hology affecting extracellular interactions and dysregulated intracellular signaling.
  • kinases which phosphorilate serine/threonine residues, mainly involved in signal transduction, such as transmembrane receptor protein serine/threonine kinase, 3-phosphoinositide-dependent protein kinase, DNA-dependent protein kinase, G-protein-coupled receptor phosphorylating protein kinase, SNFlA/AMP-activated protein kinase, casein kinase, calmodulin regulated protein kinase, cyclic-nucleotide dependent protein kinase, cyclin-dependent protein kinase, eukaryotic translation initiation factor 2alpha kinase, galactosyltransferase-associated kinase, glycogen synthase kinase 3, protein kinase C, receptor signaling protein serine/threonine kinase, ribosomal protein S6 kinase and Ik
  • Enzyme inhibitors Information derived from inhibitors and suppressors of other proteins and enzymes, such as inhibitors of Kinases, phosphatases, chaperones, guanylate cyclase, DNA gyrase, ribonuclease, proteasome inhibitors, diazepam- binding inhibitor, ornithine decarboxylase inhibitor GTPase inhibitors, dUTP pyrophosphatase inhibitor, phospholipase inhibitor, proteinase inhibitor, protein biosynthesis inhibitors, alpha-amylase inhibitors can be used to treat diseases in which beneficial effect may be achieved by modulating the activity of inhibitors and suppressors of proteins and enzymes.
  • inhibitors of Kinases such as inhibitors of Kinases, phosphatases, chaperones, guanylate cyclase, DNA gyrase, ribonuclease, proteasome inhibitors, diazepam- binding inhibitor, ornithine decarboxylase inhibitor
  • Signal transducers Information derived from various signal transducers can be used to diagnose/treat diseases involving abnormal signal-transduction, either as a cause, or as a result of the disease.
  • Receptors such as activin inhibitors, receptor-associated proteins alpha-2 macroglobulin receptors, mo ⁇ hogens, quorum sensing signal generators, quorum sensing response regulators, receptor signaling proteins, ligands, receptors, two-component sensor molecules, two-component response regulators can be used to diagnose/treat diseases involving abnormal signal-transduction, either as a cause, or as a result of the disease.
  • Receptors such as activin inhibitors, receptor-associated proteins alpha-2 macroglobulin receptors, mo ⁇ hogens, quorum sensing signal generators, quorum sensing response regulators, receptor signaling proteins, ligands, receptors, two-component sensor molecules, two-component response regulators can be used to diagnose/treat diseases involving abnormal signal-transduction, either as a
  • Information derived from various receptors can be used to diagnose/treat diseases involving various receptors, including receptors to neurotransmitters, hormones and various other effectors and ligands.
  • Receptor signaling proteins Information derived from receptor proteins involved in signal transduction, such as receptor signaling protein serine/threonine kinase, receptor signaling protein tyrosine kinase, receptor signaling protein tyrosine phosphatase, aryl hydrocarbon receptor nuclear translocator, hematopoeitin/interferon-class , (D200-domain) cytokine receptor signal transducer, transmembrane receptor protein tyrosine kinase signaling protein, transmembrane receptor protein serine/threonine kinase signaling protein, receptor signaling protein serine/threonine kinase signaling protein, receptor signaling protein serine/threonine phosphatase signaling protein, small GTPase regulatory/interacting protein, receptor signaling protein tyrosine kinase signaling protein, and receptor signaling protein serine/threonine phosphatase can be used to diagnose/treat diseases involving non-normal
  • Small GTPase regulatory/interacting proteins Information derived from small GTPase regulatory proteins, such as RAB escort protein, guanyl-nucleotide exchange factor, guanyl-nucleotide exchange factor adaptor, GDP-dissociation inhibitor, GTPase inhibitor, GTPase activator, guanyl-nucleotide releasing factor , GDP-dissociation stimulator, regulator of G-protein signaling, RAS interactor, RHO interactor, RAB interactor, and RAL interactor can be used to diagnose/treat diseases involving signal-transduction, typically involving G-proteases is non-normal, either as a cause, or as a result of the disease.
  • ligands such as opioid peptides, baboon receptor ligand, branchless receptor ligand, breathless receptor ligand, ephrin, frizzled receptor ligand, frizzled-2 receptor ligand, heartless receptor ligand, Notch receptor ligand, patched receptor ligand, punt receptor ligand, Ror receptor ligand, saxophone receptor ligand, SE20 receptor ligand, sevenless receptor ligand, smooth receptor ligand, thickveins receptor ligand, Toll receptor ligand, Torso receptor ligand, death receptor ligand, scavenger receptor ligand, neuroligin, integrin ligand, hormones, pheromones, growth factors and sulfonylurea receptor ligand can be used to diagnose/treat:
  • Information derived from proteins that serve as adhesion molecules between adjoining cells can be used to diagnose/treat diseases where adhesion between adjoining cells is involved, typically conditions in which the adhesion is non-normal. Typical examples of such conditions are cancer conditions in which non-normal adhesion may cause and enhance the process of metastasis. Other examples of such conditions include conditions of non- normal growth and development of various tissues in which modulation adhesion among adjoining cells can improve the condition. Structural proteins
  • Proteins involved in cell structure such as ribosomal proteins, cell wall proteins, cytoskeletal proteins, extracellular matrix proteins, extracellular matrix glycoproteins, amyloid proteins, plasma proteins, eye lens proteins, chorion proteins (sensu Insecta), cuticle proteins (sensu Insecta), puparial glue protein (sensu Diptera), bone proteins, yolk proteins, muscle proteins, vitelline membrane proteins (sensu Insecta), peritrophic membrane proteins (sensu Insecta), and nuclear pore proteins can be used to diagnose/treat diseases involving abnormalities in cytoskeleton, including cancerous cells, and diseased cells including those which do not propagate, grow or function normally. Diseases involving non-normal sub-cellular proteins such as non-normal ribozymal proteins. Transporter proteins
  • Information derived from proteins such as amine/polyamine transporter, lipid transporter, neurotransmitter transporter, organic acid transporter, oxygen transporter, water transporter, carriers, intracellular transportes, protein transporters, ion transporters, carbohydrate transporter, polyol transporter, amino acid transporters, vitamin/cofactor transporters, siderophore transporter, drug transporter, channel/pore class transporter, group translocator, auxiliary transport proteins, Permeases, murein transporter, organic alcohol transporter, nucleobase, nucleoside and nucleotide and nucleic acid transporters can be used to diagnose/treat diseases in which abnormal transport of molecules and macromolecules such as neurotransmitters, hormones, sugar etc. leads to various pathologies.
  • proteins such as amine/polyamine transporter, lipid transporter, neurotransmitter transporter, organic acid transporter, oxygen transporter, water transporter, carriers, intracellular transportes, protein transporters, ion transporters, carbohydrate transporter, polyol
  • Intracellular transporters Information derived from proteins that mediate the transport of molecules and macromoleules inside the cell, such as intracellular nucleoside transporter, vacuolar assembly proteins, vesicle transporters, vesicle fusion proteins, and type II protein secretors can be used to diagnose/treat diseases in which abnormal transport of molecules and macromolecules leads to various pathologies.
  • Electron transporters Information derived from various proteins, involved in diverse biological functions, such as pyridoxal phosphate binding, carbohydrate binding, magnesium binding, amino acid binding, cyclosporin A binding, nickel binding, chlorophyll binding, biotin binding, penicillin binding, selenium binding, tocopherol binding, lipid binding, drug binding, oxygen transporter, electron transporter, steroid binding, juvenile hormone binding, retinoid binding, heavy metal binding, calcium binding, protein binding, glycosaminoglycan binding, folate binding, odorant binding, lipopolysaccharide binding, and nucleotide binding can be used to diagnose/treat diseases involving improper intracellular or extracellular accumulation or removal of small molecules such as calcium ions, improper inco ⁇ oration of metals and modified amino acids (i.e., seleno-cystein), dysregulated signaling effected by improper steroid titration etc.
  • Electron transporters such as calcium ions, improper inco ⁇ oration of metals and modified amino acids
  • Information derived from ligand binding proteins or carrier proteins involved in electron transport such as flavin-containing electron transporter, cytochromes, electron donors, electron acceptors, electron carriers and cytochrome-c oxidases can be used to diagnose/treat diseases involving dysregulated mitochondrial activity.
  • Information derived from calcium binding proteins, ligand binding proteins or carriers such as diacylglycerol kinase, Calpain, calcium-dependent protein serine/threonine phosphatase, calcium sensing proteins and calcium storage proteins can be used to diagnose/treat diseases in which intracellular or extracellular calcium storage or release is improper.
  • RNA and DNA binding Information derived from proteins involved in transcription factors binding, RNA and DNA binding, such as transcription factors, RNA and DNA binding proteins, zinc fingers, helicase, isomerase, histones, and nucleases can be used to diagnose/treat diseases involving transcription factors binding proteins, for example diseases where there is abnormal replication or transcription of DNA and RNA respectively.
  • Information derived from enzyme regulators such as activators of kinases, phosphatases, sphingolipids, chaperones, guanylate cyclase, tryptophan hydroxylase, proteases, phospholipases, caspases, proprotein convertase 2 activator, cyclin-dependent protein kinase 5 activator, superoxide- generating NADPH oxidase activator, sphingomyelin phosphodiesterase activator, monophenol monooxygenase activator, proteasome activator, and GTPase activator can be used to diagnose/treat diseases in which beneficial effect may be achieved by modulating the activity of activators of proteins and enzymes.
  • enzyme regulators such as activators of kinases, phosphatases, sphingolipids, chaperones, guanylate cyclase, tryptophan hydroxylase, proteases, phospholipases, caspases, proprotein convertase 2 activator
  • Information derived from proteins involved in any biological process required for cell survival, growth and maintenance including proteins involved in cell organization and biogenesis, cell growth, cell proliferation, metabolism, cell cycle, budding, cell shape and cell size control, sporulation (sensu Saccharomyces), transport, ion homeostasis, autophagy, cell motility, chemi- mechanical coupling, membrane fusion, cell-cell fusion and stress response can be used to diagnose/treat diseases involving premature death of cells, such as degenerative diseases, for example neurodegenerative diseases or conditions associated with aging, or alternatively, diseases in which cell apoptosis is not turned on, such as cancerous diseases.
  • Metabolic proteins derived from proteins involved in any biological process required for cell survival, growth and maintenance including proteins involved in cell organization and biogenesis, cell growth, cell proliferation, metabolism, cell cycle, budding, cell shape and cell size control, sporulation (sensu Saccharomyces), transport, ion homeostasis, autophagy, cell motility, chemi- mechanical
  • Info ⁇ nation derived from proteins that mediate the transport of molecules and macromoleules across membranes, such as alpha-type channels, porins and pore-forming toxins can be used to diagnose/treat diseases in which the transport of molecules and macromolecules such as neurotransmitters, hormones, sugar etc. is non-normal leading to various pathologies.
  • RNA products of the genes of this family, or antibodies reactive therewith can modulate a plurality of tubulin activities as well as change microtubulin structure.
  • kinases such as 2-amino-4-hydroxy-6- hydroxymethyldihydropteridine pyrophosphokinase, NAD(+) kinase, acetylglutamate kinase, adenosine kinase, adenylate kinase, adenylsulfate kinase, arginine kinase, aspartate kinase, choline kinase, creatine kinase, cytidylate kinase, deoxyadenosine kinase, deoxycytidine kinase, deoxyguanosine kinase, dephospho-CoA kinase, diacylglycerol kinase, dolichol kinase, ethanolamine kinase, galactokinase, glucokinase, glutamate 5-kinase, g
  • Information derived from enzymes that catalyze an oxidation-reduction reaction including oxidoreductases acting on CH-OH, CH-CH, CH-NH2, CH- NH, NADH or NADPH, nitrogenous compounds, sulfur group of donors, heme group, hydrogen group, diphenols and related substances as donors, oxidoreductases acting on peroxide as acceptor, superoxide radicals as acceptor, oxidizing metal ions, CH2 groups, reduced ferredoxin donor, reduced flavodoxin donor, and aldehyde or oxo group of donors can be used to diagnose/treat diseases involving non-normal activity of oxidoreductases.
  • Information derived from enzymes that catalyze the transfer of a chemical group, such as a phosphate or amine, from one molecule to another including transferases, transferring one-carbon groups, aldehyde or ketonic groups, acyl groups, glycosyl groups, alkyl or aryl (other than methyl) groups, nitrogenous, phosphorus-containing groups, sulfur-containing groups and lipoyltransferase, deoxycytidyl transferases can be used to diagnose/treat diseases in which the transfer of a chemical group from one molecule to another is abnormal and a beneficial effect may be achieved by modulation of such abnormal reactions.
  • a chemical group such as a phosphate or amine
  • Information derived from enzymes that catalyze the transfer of a single carbon from one molecule to another including methyltransferase, amidinotransferase, hydroxymethyl-, formyl- and related transferase, carboxyl- and carbamoyltransferase can be used to diagnose/treat diseases in which the transfer of a one-carbon chemical group from one molecule to another is abnormal and a beneficial effect may be achieved by modulation of such an abnormal reaction.
  • Transferases - glycosyl groups including methyltransferase, amidinotransferase, hydroxymethyl-, formyl- and related transferase, carboxyl- and carbamoyltransferase
  • Information derived from enzymes that catalyze the transfer of a glycosyl from one molecule to another including murein lytic endotransglycosylase E and sialyltransferase can be used to diagnose/treat diseases in which the transfer of a glycosyl chemical group from one molecule to another is abnormal and a beneficial effect may be achieved by modulation of such an abnormal reaction.
  • Information derived from enzymes that catalyze the transfer of phosphate from one molecule to another can be used to diagnose/treat diseases in which the transfer of a phosphate group to a modulated moiety is abnormal and a beneficial effect may be achieved by modulation of such abnormal transfer.
  • Hydrolases Information derived from hydrolytic enzymes acting on ester bonds, glycosyl bonds, ether bonds, carbon-nitrogen (but not peptide) bonds, acid anhydrides, acid carbon-carbon bonds, acid halide bonds, acid phosphorus- nitrogen bonds, acid sulfur-nitrogen bonds, acid carbon-phosphorus bonds and acid sulfur-sulfur bonds can be used to diagnose/treat diseases in which the hydrolytic cleavage of a covalent bond with accompanying addition of water, - H being added to one product of the cleavage and -OH to the other, is abnormal and a beneficial effect may be achieved by modulation of such an abnormal reaction.
  • Hydrolases acting on ester bonds
  • Information derived from hydrolytic enzymes, acting on ester bonds, such as nucleases, sulfuric ester hydrolase, carboxylic ester hydrolases, thiolester hydrolase, phosphoric monoester hydrolase, phosphoric diester hydrolase, triphosphoric monoester hydrolase, diphosphoric monoester hydrolase and phosphoric triester hydrolase can be used to diagnose/treat diseases in which the hydrolytic cleavage of a covalent bond with accompanying addition of water, -H being added to one product of the cleavage and -OH to the other, is abnormal and a beneficial effect may be achieved by modulation of such an abnormalreaction.
  • Carboxylic ester hydrolases Information derived from hydrolytic enzymes, acting on carboxylic ester bonds, such as N-acetylglucosaminylphosphatidylinositol deacetylase, 2-acetyl- 1-alkylglycerophosphocholine esterase, aminoacyl-tRNA hydrolase, arylesterase, carboxylesterase, cholinesterase, gluconolactonase, sterol esterase, acetylesterase, carboxymethylenebutenolidase, protein-glutamate methylesterase, and lipase, 6-phosphogluconolactonase can be used to diagnose/treat diseases which the hydrolytic cleavage of a covalent bond with accompanying addition of water, -H being added to one product of the cleavage and -OH to the other, is abnormal and a beneficial effect may be achieved by modulation of such an abnormal reaction.
  • Phosphoric monoester hydrolases such as N-
  • hydrolytic enzymes acting on ester bonds such as nuclease, sulfuric ester hydrolase, carboxylic ester hydrolase, thiolester hydrolase, phosphoric monoester hydrolase, phosphoric diester hydrolase, triphosphoric monoester hydrolase, diphosphoric monoester hydrolase and phosphoric triester hydrolase can be used to diagnose/treat diseases in which the hydrolytic cleavage of a covalent bond with accompanying addition of water, -H being added to one product of the cleavage and -OH to the other, is abnormal and a beneficial effect may be achieved by modulation of such an abnormal reaction.
  • Hydrolases acting on glycosyl bonds such as nuclease, sulfuric ester hydrolase, carboxylic ester hydrolase, thiolester hydrolase, phosphoric monoester hydrolase, phosphoric diester hydrolase, triphosphoric monoester hydrolase, diphosphoric monoester hydrolase and phosphoric triester hydrolase can be used to diagnose/treat diseases in which the hydro
  • O-glycosyl compounds can be used to diagnose/treat diseases in which the hydrolase-related activities are abnormal.
  • Hydrolases acting on acid anhydrides such as hydrolases hydrolyzing N-gfycosyl compounds and S-glycosyl compounds
  • Information derived from hydrolytic enzymes which act on acid anhydrides such as phosphorus-containing anhydrides, sulfonyl-containing anhydrides, and hydrolases catalysing transmembrane movement of substances, and involved in cellular and subcellular movement can be used to diagnose/treat diseases in which the hydrolase-related activities are abnormal.
  • Information derived from enzymes that catalyze the formation of double bonds by removing chemical groups from a substrate without hydrolysis or catalyze the addition of chemical groups to double bonds including carbon- carbon lyases, carbon-oxygen lyases, carbon-nitrogen lyases, carbon-sulfur lyases, carbon-halide lyases, phosphorus-oxygen lyases, and other lyases can be used to diagnose/treat diseases in which lyase activity, expression or localization is abnormal.
  • Information derived from enzymes that catalyze the linkage of two molecules, generally utilizing ATP as the energy donor can be used to diagnose/treat diseases in which the joining together of two molecules in an energy-dependent process is abnormal and a beneficial effect may be achieved by modulation of such an abnormal reaction.
  • Information derived from enzymes that catalyze the linkage between carbon and oxygen can be used to diagnose/treat diseases in which the linkage between carbon and oxygen in an energy-dependent process is abnormal and a beneficial effect may be achieved by modulation of such an abnormal reaction.
  • Information derived from enzymes such as plasma membrane cation- transporting ATPase, ATP-binding cassette (ABC) transporter, magnesium- ATPase, hydrogen-/sodium-translocating ATPase, arsenite-transporting ATPase, protein-transporting ATPase, DNA translocase, and P-type ATPase can be used to diagnose/treat diseases associated with abnormal activity of an ATP hydrolyzing enzyme.
  • ATP-binding cassette (ABC) transporter such as plasma membrane cation- transporting ATPase, ATP-binding cassette (ABC) transporter, magnesium- ATPase, hydrogen-/sodium-translocating ATPase, arsenite-transporting ATPase, protein-transporting ATPase, DNA translocase, and P-type ATPase
  • enzymes such as plasma membrane cation- transporting ATPase, ATP-binding cassette (ABC) transporter
  • inflamatory diseases include, but are not limited to, chronic inflammatory diseases and acute inflammatory diseases. Inflammatory diseases associated with hypersensitivity
  • hypersensitivity examples include, but are not limited to, Types I-IV hypersensitivity, immediate hypersensitivity, antibody mediated hypersensitivity, immune complex mediated hypersensitivity, T lymphocyte mediated hypersensitivity and DTH.
  • type I or immediate hypersensitivity is asthma.
  • type II hypersensitivity examples include, but are not limited to, rheumatoid diseases, rheumatoid autoimmune diseases, rheumatoid arthritis (Krenn V.
  • myasthenic diseases myasthenic diseases, Lambert-Eaton myasthenic syndrome (Takamori M. Am J Med Sci. 2000 Apr;319 (4):204), paraneoplastic neurological diseases, cerebellar atrophy, paraneoplastic cerebellar atrophy, non-paraneoplastic stiff man syndrome, cerebellar atrophies, progressive cerebellar atrophies, encephalitis, Rasmussen's encephalitis, amyotr ⁇ phic lateral sclerosis, Sydeham chorea, Gilles de la Tourette syndrome, polyendocrinopathies, autoimmune polyendocrinopathies (Antoine JC. and Honnorat J.
  • vasculitises necrotizing small vessel vasculitises, microscopic polyangiitis, Churg and Strauss syndrome, glomerulonephritis, pauci-immune focal necrotizing glomerulonephritis, crescentic glomerulonephritis (Noel LH. Ann Med Interne (Paris). 2000 May;151 (3):178); antiphospholipid syndrome (Flamholz R. et al, J Clin Apheresis 1999; 14 (4): 171); heart failure, agonist-like beta-adrenoceptor antibodies in heart failure (Wallukat G. et al, Am J Cardiol.
  • Type IV or T cell mediated hypersensitivity examples include, but are not limited to, rheumatoid diseases, rheumatoid arthritis (Tisch R, McDevitt HO. Proc Natl Acad Sci U S A 1994 Jan 18;91 (2):437), systemic diseases, systemic autoimmune diseases, systemic lupus erythematosus (Datta SK., Lupus 1998;7 (9):591), glandular diseases, glandular autoimmune diseases, pancreatic diseases, pancreatic autoimmune diseases, Type 1 diabetes (Castano L. and Eisenbarth GS. Ann. Rev. Immunol. 8:647); thyroid diseases, autoimmune thyroid diseases, Graves' disease (Sakata S.
  • Examples of delayed type hypersensitivity include, but are not limited to, contact dermatitis and drug eruption.
  • Autoimmune diseases examples include, but are not limited to, cardiovascular diseases, rheumatoid diseases, glandular diseases, gastrointestinal diseases, cutaneous diseases, hepatic diseases, neurological diseases, muscular diseases, nephric diseases, diseases related to reproduction, connective tissue diseases and systemic diseases.
  • autoimmune cardiovascular diseases include, but are not limited to atherosclerosis (Matsuura E. et al, Lupus. 1998;7 Suppl 2:S135), myocardial infarction (Vaarala O. Lupus. 1998;7 Suppl 2:S132), thrombosis (Tincani A.
  • autoimmune rheumatoid diseases include, but are not limited to rheumatoid arthritis (Krenn V. et al, Histol Histopathol 2000 Jul;15 (3):791; Tisch R, McDevitt HO.
  • autoimmune glandular diseases include, but are not limited to, pancreatic disease, Type I diabetes, thyroid disease, Graves' disease, thyroiditis, spontaneous autoimmune thyroiditis, Hashimoto's thyroiditis, idiopathic myxedema, ovarian autoimmunity, autoimmune anti-sperm infertility, autoimmune prostatitis and Type I autoimmune polyglandular syndrome, diseases include, but are not limited to autoimmune diseases of the pancreas, Type 1 diabetes (Castano L. and Eisenbarth GS. Ann. Rev. Immunol.
  • autoimmune gastrointestinal diseases include, but are not limited to, chronic inflammatory intestinal diseases (Garcia Herola A. et al,
  • autoimmune cutaneous diseases include, but are not limited to, autoimmune bullous skin diseases, such as, but are not limited to, pemphigus vulgaris, bullous pemphigoid and pemphigus foliaceus.
  • autoimmune hepatic diseases include, but are not limited to, hepatitis, autoimmune chronic active hepatitis (Franco A. et al, Clin Immunol
  • autoimmune neurological diseases include, but are not limited to, multiple sclerosis (Cross AH. et al, J Neuroimmunol 2001 Jan 1;112 (1-2).T), Alzheimer's disease (Oron L. et al, J Neural Transm Suppl. 1997;49:77), myasthenia gravis (Infante AJ. And Kraig E, Int Rev Immunol 1999;18 (l-2):83; Oshima M. et al, Eur J Immunol 1990 Dec;20 (12):2563), neuropathies, motor neuropathies (Komberg AJ. J Clin Neurosci. 2000 May;7 (3):191); Guillain-Barre syndrome and autoimmune neuropathies (Kusunoki S.
  • autoimmune muscular diseases include, but are not limited to, myositis, autoimmune myositis and primary Sjogren's syndrome (Feist E. et al, Int Arch Allergy Immunol 2000 Sep;123 (1):92) and smooth muscle autoimmune disease (Zauli D. et al, Biomed Pharmacother 1999 Jun;53 (5- 6):234).
  • autoimmune nephric diseases include, but are not limited to, nephritis and autoimmune interstitial nephritis (Kelly CJ. J Am Soc Nephrol 1990 Aug;l (2):140).
  • autoimmune diseases related to reproduction include, but are not limited to, repeated fetal loss (Tincani A. et al, Lupus 1998;7 Suppl 2:S 107-9).
  • autoimmune connective tissue diseases include, but are not limited to, ear diseases, autoimmune ear diseases (Yoo TJ. et al, Cell Immunol
  • autoimmune systemic diseases include, but are not limited to, systemic lupus erythematosus (Erikson J. et al, Immunol Res 1998;17 (1-
  • infectious diseases include, but are not limited to, chronic infectious diseases, subacute infectious diseases, acute infectious diseases, viral diseases, bacterial diseases, protozoan diseases, parasitic diseases, fungal diseases, mycoplasma diseases and prion diseases.
  • Graft rejection diseases Examples of diseases associated with transplantation of a graft include, but are not limited to, graft rejection, chronic graft rejection, subacute graft rejection, hyperacute graft rejection, acute graft rejection and graft versus host disease.
  • Allergic diseases include, but are not limited to, asthma, hives, urticaria, pollen allergy, dust mite allergy, venom allergy, cosmetics allergy, latex allergy, chemical allergy, drug allergy, insect bite allergy, animal dander allergy, stinging plant allergy, poison ivy allergy and food allergy.
  • Cancerous diseases Examples of cancer include but are not limited to carcinoma, lymphoma, blastoma, sarcoma, and leukemia. Particular examples of cancerous diseases but are not limited to: Myeloid leukemia such as Chronic myelogenous leukemia. Acute myelogenous leukemia with maturation.
  • Acute promyelocytic leukemia Acute nonlymphocytic leukemia with increased basophils, Acute monocytic leukemia.
  • Acute myelomonocytic leukemia with eosinophilia Malignant lymphoma, such as Birkitt's Non-Hodgkin's
  • Lymphoctyic leukemia such as Acute lumphoblastic leukemia.
  • Chronic lymphocytic leukemia Myeloproliferative diseases, such as Solid tumors Benign Meningioma, Mixed tumors of salivary gland, Colonic adenomas; Adenocarcinomas, such as Small cell lung cancer, Kidney, Uterus, Prostate, Bladder, Ovary, Colon, Sarcomas, Liposarcoma, myxoid, Synovial sarcoma, Rhabdomyosarcoma (alveolar), Extraskeletel myxoid chonodrosarcoma, Ewing's tumor; other include Testicular and ovarian dysgerminoma, Retinoblastoma, Wilms' tumor, Neuroblastoma, Malignant melanoma, Mesothelioma, breast, skin, prostate, and ovarian.
  • Adenocarcinomas such as Small cell lung cancer, Kidney, Uterus, Prostate, Bladder, Ovary, Colon, Sarcomas, Lipos
  • Single 60-mer oligonucleotides were previously shown to offer reliability and sensitivity for detecting specific transcripts (T. R. Hughes, et al., Nature Biotech. 19, 342 (2001).).Initially only pairs of clusters with an overlap greater than 60 bases (2,464 pairs agree with this restriction) were selected for array construction.
  • each antisense pair was then verified for the presence of 60-mer oligonucleotides that matched a set of standards, such as minimal sequence similarity elsewhere in the human genome, uniform GC-content and Tm, and absence of palindromic sequences, in order to maximize the hybridization specificity.
  • Oligonucleotide probes meeting the criteria set forth were identified for 1,211 sense/antisense pairs and a random sample of 264 pairs, which constitutes roughly one-tenth of the original dataset of 2667 sense/antisense cluster pairs, was selected for analysis by Microarrays (Table SI on CD-ROM3, an exce ⁇ t of which is shown in Table 5 below). In this sample, the proportion of each of the nine subgroups depicted in Table 4 is similar to that of the original dataset, indicating a good representation of the various subgroups. Table 4
  • Table represents the proportion of sense/antisense clusters in the dataset of 2667 that contain: 1) a known mRNA and 2) expressed sequences spanning at least one intron, in one of the two clusters, in both clusters or in none of the clusters.
  • Table 5 below is an exce ⁇ t of Table SI provided on CD-ROM3;
  • the first column provides the pair number.
  • the next two columns provide the accession numbers of representative expressed sequences from the overlapping region of the sense and the antisense genes, respectively.
  • the two columns identified by the "RNA" header provide the accession numbers of known mRNAs in the sense and antisense clusters (if available), and the last two columns provide the GenBank descriptions of these mRNAs.
  • Microarrays were constructed by spotting each of the 264 pairs of oligonucleotide probes onto treated glass slides in quadruplicates. The two counte ⁇ art oligonucleotide probes of each pair were spotted next to each other to ensure similar hybridization conditions.
  • each of the blocks contained oligonucleotides spotted at various concentrations for four ubiquitously expressed housekeeping genes: guanine nucleotide binding protein beta polypeptide 2-like 1 (gnb211, HUMMHBA123, NM )06098), heat shock 70kD protein 10 (hsp70, HSHSC70CDS0, NM_006597), beta actin (actin, ACTB, NM_001101), and glyceraldehyde-3 -phosphate dehydrogenase (gapdh, NM_002046).
  • the Microarrays were hybridized with poly(A)+ RNAs obtained from 19 human cell lines representing a variety of tissues and four normal human tis ' sues (see General Materials and Methods section above). Each poly(A)+ RNA was reverse transcribed by priming with oligo(dT) and random nonamers, and engineered to inco ⁇ orate a fluorescent marker. A pool containing an equal mix of the RNAs from all cell lines was also transcribed and used as a reference target. The resulting fluorescently-labeled cDNAs were combined and hybridized to the oligonucleotide Microarrays.
  • the raw data was normalized at several levels; within each slide, between reciprocal slides, and globally between slides (see General Materials and Methods section above). Non-specific levels of hybridization were estimated from the negative controls. The threshold for significant positive signals resulting from authentic hybridization was set at 4 standard deviations of the mean normalized signals for the negative controls. Processed data was presented as normalized signal intensity and as normalized signal ratios (Table S2 on CD-ROM3).
  • Figures 22a-j illustrate results of such northern blot analysis.
  • Figure 22a reveals expression patterns of randomly selected sequence pair number 235, denoted as Rand_235 in Table 6.
  • Figure 22b corresponds to pair number 173, Figure 22c to pair number 248, Figure 22d to pair number 6, Figure 22e to pair number 216, Figure 22f to pair number 239, Figure 22g to pair number 202, Figure 22h to pair number 114, Figure 22i to pair number 188, and Figure 22j to pair number 223.
  • Figure 23 represents an exce ⁇ t of Table S2 (provided in CD-ROM3) which summarizes the results obtained utilizing the array generated according to the teachings of the present invention. Expression thresholds were verified and indicated and normalization for microarray signals was conducted as described above. Rji ratios were obtained for each cell line/tissue assessed.
  • the sensitivity of the experimental approach utilized i.e. the ability to detect a given transcript, stems from a combination of the stringency used in the microarray analysis and the level of expression and tissue specificity of the RNA. This can be estimated from the positive signals obtained for 65% of the oligos representing known RefSeq mRNAs on the Microarrays. This level of detection is comparable to that obtained in other studies, such as the 58% of known exons verified using microarray analysis (D. D. Shoemaker, et al., Nature 409, 922; 2001).
  • CD-ROM Content The following CD-ROMs are attached herewith:
  • CD-ROM1 CD-ROM1 :
  • CD-ROM2 CD-ROM2
  • CD-ROM3 l. Table Sl 79.5kb 10/07/2002 Microsoft Windows Microsoft Excel Worksheet

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Chemical & Material Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • Medical Informatics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Analytical Chemistry (AREA)
  • Molecular Biology (AREA)
  • Organic Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Immunology (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

La présente invention concerne un procédé pour identifier des produits de transcription antisens naturels putatifs. Le procédé comprend les étapes suivantes: (a) alignement informatique d'une première base de données comprenant des séquences polynucléotidiques sens avec une seconde base de données comprenant des séquences polynucléotidiques exprimées; et (b) identification des séquences polynucléotidiques exprimées de la seconde base de données qui sont capables de former un duplexe avec au moins une séquence polynucléotidique sens de la première base de données, ce qui permet l'identification de produits de transcription antisens naturels putatifs.
EP02788472A 2001-11-26 2002-11-11 Procedes et systemes pour identifier des produits de transcription antisens naturels et procedes, kits et jeux ordonnes d'echantillons qui les comprennent Withdrawn EP1451355A1 (fr)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US99339801A 2001-11-26 2001-11-26
US993398 2001-11-26
US20160502A 2002-07-24 2002-07-24
US201605 2002-07-24
PCT/IL2002/000904 WO2003046220A1 (fr) 2001-11-26 2002-11-11 Procedes et systemes pour identifier des produits de transcription antisens naturels et procedes, kits et jeux ordonnes d'echantillons qui les comprennent

Publications (1)

Publication Number Publication Date
EP1451355A1 true EP1451355A1 (fr) 2004-09-01

Family

ID=26896941

Family Applications (1)

Application Number Title Priority Date Filing Date
EP02788472A Withdrawn EP1451355A1 (fr) 2001-11-26 2002-11-11 Procedes et systemes pour identifier des produits de transcription antisens naturels et procedes, kits et jeux ordonnes d'echantillons qui les comprennent

Country Status (4)

Country Link
EP (1) EP1451355A1 (fr)
AU (1) AU2002353447A1 (fr)
IL (1) IL162149A0 (fr)
WO (1) WO2003046220A1 (fr)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPWO2005054468A1 (ja) * 2003-12-01 2008-03-13 株式会社リバース・プロテオミクス研究所 抗糖尿病剤の標的タンパク質及び対応する新規抗糖尿病剤「インスフル」
US9618474B2 (en) 2014-12-18 2017-04-11 Edico Genome, Inc. Graphene FET devices, systems, and methods of using the same for sequencing nucleic acids
US9857328B2 (en) 2014-12-18 2018-01-02 Agilome, Inc. Chemically-sensitive field effect transistors, systems and methods for manufacturing and using the same
US9859394B2 (en) 2014-12-18 2018-01-02 Agilome, Inc. Graphene FET devices, systems, and methods of using the same for sequencing nucleic acids
US10006910B2 (en) 2014-12-18 2018-06-26 Agilome, Inc. Chemically-sensitive field effect transistors, systems, and methods for manufacturing and using the same
US10020300B2 (en) 2014-12-18 2018-07-10 Agilome, Inc. Graphene FET devices, systems, and methods of using the same for sequencing nucleic acids
WO2016100049A1 (fr) 2014-12-18 2016-06-23 Edico Genome Corporation Transistor à effet de champ chimiquement sensible
EP3459115A4 (fr) 2016-05-16 2020-04-08 Agilome, Inc. Dispositifs à fet au graphène, systèmes et leurs méthodes d'utilisation pour le séquençage d'acides nucléiques

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO03046220A1 *

Also Published As

Publication number Publication date
AU2002353447A1 (en) 2003-06-10
WO2003046220A1 (fr) 2003-06-05
IL162149A0 (en) 2005-11-20

Similar Documents

Publication Publication Date Title
WO2004104161A2 (fr) Procedes et systemes permettant d'identifier des transcriptions antisens ayant lieu naturellement et procedes, kits et essais mettant en oeuvre ceux-ci
US7618814B2 (en) Microrna-related nucleic acids and uses thereof
Liu et al. Prediction of functional microRNA targets by integrative modeling of microRNA binding and target expression data
EP1716227A2 (fr) Procede d'identification de produits genetiques putatifs par comparaison de sequences inter-especes et de sequences de biologie moleculaire exposees par celles-ci
US7655785B1 (en) Bioinformatically detectable group of novel regulatory oligonucleotides and uses thereof
Shendure et al. Computational discovery of sense-antisense transcription in the human and mouse genomes
US20110183924A1 (en) Methods and systems for annotating biomolecular sequences
US20060068405A1 (en) Methods and systems for annotating biomolecular sequences
Mazur et al. Structure and expression of the TREX1 and TREX2 3′→ 5′ exonuclease genes
US20070042381A1 (en) Bioinformatically detectable group of novel regulatory viral and viral associated oligonucleotides and uses thereof
US20060257851A1 (en) Bioinformatically detectable group of novel viral regulatory genes and uses thereof
WO1999058719A1 (fr) Identification de sites d'interaction moleculaire dans l'arn pour la recherche de nouveaux medicaments
US20080318210A1 (en) Bioinformatically detectable group of novel regulatory viral and viral associated oligonucleotides and uses thereof
Sanchez et al. Six related nucleoside/nucleobase transporters Fromtrypanosoma brucei exhibit distinct biochemical functions
US20150203920A1 (en) Compositions and methods for using transfer rna fragments as biomarkers for cancer
Quarles et al. Ensemble analysis of primary microRNA structure reveals an extensive capacity to deform near the Drosha cleavage site
Ansari-Lari et al. Large-scale sequencing in human chromosome 12p13: experimental and computational gene structure determination.
Alderson et al. One origin for metallo-β-lactamase activity, or two? An investigation assessing a diverse set of reconstructed ancestral sequences based on a sample of phylogenetic trees
Parker et al. The expansion segments of 28S Ribosomal RNA extensively match human messenger RNAs
WO2003046220A1 (fr) Procedes et systemes pour identifier des produits de transcription antisens naturels et procedes, kits et jeux ordonnes d'echantillons qui les comprennent
Tian et al. Comparative analyses of bat genomes identify distinct evolution of immunity in Old World fruit bats
US7842800B2 (en) Bioinformatically detectable group of novel regulatory bacterial and bacterial associated oligonucleotides and uses thereof
WO2005087949A1 (fr) Mappage systematique de sites d'edition de l'adenosine a l'inosine dans le transcriptome humain
Warring et al. Small RNAs are implicated in regulation of gene and transposable element expression in the protist Trichomonas vaginalis
Magner et al. A structural potential of rare trinucleotide repeat tracts in RNA

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20040610

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LI LU MC NL PT SE SK TR

AX Request for extension of the european patent

Extension state: AL LT LV MK RO SI

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20060601