WO2000077214A2 - Characterization of the yeast transcriptome - Google Patents

Characterization of the yeast transcriptome Download PDF

Info

Publication number
WO2000077214A2
WO2000077214A2 PCT/US2000/016223 US0016223W WO0077214A2 WO 2000077214 A2 WO2000077214 A2 WO 2000077214A2 US 0016223 W US0016223 W US 0016223W WO 0077214 A2 WO0077214 A2 WO 0077214A2
Authority
WO
WIPO (PCT)
Prior art keywords
cont
norf
gene
expression
phase
Prior art date
Application number
PCT/US2000/016223
Other languages
French (fr)
Other versions
WO2000077214A3 (en
Inventor
Victor Velculescu
Bert Vogelstein
Kenneth Kinzler
Original Assignee
The John Hopkins University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by The John Hopkins University filed Critical The John Hopkins University
Priority to CA002378512A priority Critical patent/CA2378512A1/en
Priority to EP00939837A priority patent/EP1190044A2/en
Priority to JP2001503657A priority patent/JP2003516716A/en
Priority to AU54856/00A priority patent/AU5485600A/en
Publication of WO2000077214A2 publication Critical patent/WO2000077214A2/en
Publication of WO2000077214A3 publication Critical patent/WO2000077214A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • C07K14/4701Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
    • C07K14/4738Cell cycle regulated proteins, e.g. cyclin, CDC, INK-CCR
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/37Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi
    • C07K14/39Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi from yeasts
    • C07K14/395Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi from yeasts from Saccharomyces

Definitions

  • This invention is related to the characterization of the expressed genes of the yeast genome. More particularly, it is related to the identification and use of previously unrecognized genes.
  • transcriptome conveying the identity of each expressed gene and its level of expression for a defined population of cells.
  • the transcriptome can be modulated by both. external and internal factors. The transcriptome thereby serves as a dynamic link between an organism's genome and its physical characteristics.
  • transcriptome as defined above has not been characterized in any eukaryotic or prokaryotic organism, largely because of technological limitations. However, some general features of gene expression patterns were elucidated two decades ago through RNA-DNA hybridization measurements (Bishop et al, 1974; Hereford and Rosbash,
  • an isolated DNA molecule comprises a coding sequence of a yeast gene selected from the group consisting of NORF genes comprising a SAGE tag as shown in SEQ ID NOS.67-811.
  • a method of using NORF genes is provided. The method is for affecting the cell cycle of a cell. The method comprises the step of administering to a cell an isolated DNA molecule comprising a coding sequence of a NORF gene whose expression varies by at least 10% between any two phases of the cell cycle selected from the group consisting of log phase, S phase, and G2/M.
  • a method for screening candidate antifi ⁇ ngal drugs comprises the steps of contacting a test substance with a yeast cell and monitoring expression of a NORF gene whose expression varies by at least 10% between any two phases of the cell cycle selected from the group consisting of log phase, S phase, and G2/M, wherein a test substance which modifies the expression of the yeast gene is a candidate antifungal drug.
  • a method for identifying human genes which are involved in cell cycle progression comprises the step of contacting human DNA with a probe which comprises at least 14 contiguous nucleotides of a NORF gene whose expression varies by at least 10% between any two phases of the cell cycle selected from, the group consisting of log phase, S phase, and G2/M.
  • a human DNA sequence which hybridizes to the probe is identified as a sequence of a candidate human gene which is involved in cell cycle progression.
  • the present invention provides probes which comprise at least 14 contiguous nucleotides of a NORF gene comprising a SAGE tag as shown in SEQ ID NOS-67-
  • the invention also provides an array of probes on a solid support. At least one probe in the array comprises at least 14 contiguous nucleotides of a NORF gene comprising a SAGE tag as shown in SEQ ID NOS:67-811.
  • Still another embodiment of the invention is a method of identifying a candidate drug as a member of a class of drugs having a characteristic effect on gene expression in a yeast cell.
  • a yeast cell is contacted with a candidate drug.
  • Expression of at least one NORF gene whose expression is affected by the class of drugs is monitored in the yeast cell. Detection of a difference in expression of the at least one NORF gene relative to expression in the absence of the candidate drug identifies the candidate drug as a member of the class of drugs.
  • Gray arrows indicate all potential SAGE tags (Nlalll sites) and black arrows indicate 3' most SAGE tags. The total number of tags observed for each potential tag is indicated above (+ strand) or below (- strand) the tag. As expected, the observed SAGE tags were associated with the 3' end of expressed genes.
  • Figure 2 Sampling of Yeast Gene Expression. Analysis of increasing amounts of ascertained tags reveals a plateau in the number of unique expressed genes. Triangles represent genes with known functions, squares represent genes predicted on the basis of sequence information, and circles represent total genes.
  • FIG. 3 Virtual Rot.
  • A Abundance Classes in the Yeast Transcriptome. The transcript abundance is plotted in reverse order on the abscissa, whereas the fraction of total transcripts with at least that abundance is plotted on the ordinate. The dotted lines identify the three components of the curve, 1, 2, and 3. This is analogous to a Rot curve derived from reassociation kinetics where the product of initial RNA concentration and time is plotted on the abscissa, and the percent of labeled cDNA that hybridizes to excess mRNA is plotted on the ordinate.
  • B Comparison of Virtual Rot and Rot Components. Transitions and data from virtual Rot components were calculated from the data in Figure 3 A, while data for Rot components were obtained from Hereford and Rosbash, 1977.
  • FIG. 4 Chromosomal Expression Map for S. cerevisiae. Individual yeast genes were positioned on each chromosome according to their open reading frame (ORF) start coordinates. Abundance levels of tags corresponding to each gene are displayed on the vertical axis, with transcription from the + strand indicated above the abscissa and that from the - strand indicated below. Yellow bands at ends of the expanded chromosome represent telomeric regions that are undertranscribed (see text for details).
  • TDH2/3 , TEF 112 and NORF 1 are expressed relatively equally in all three states (lane 1, G2/M arrested; lane 2, S phase arrested; lane 3, log phase), while RNR4, RNR2 , and NORF5 are highly expressed in S-phase arrested cells.
  • Tag represents the 10 bp SAGE tag adjacent to the Nlalll site; Gene represents the gene or genes corresponding to a particular tag
  • Table 2 Expression of Putative Coding Sequences. Table column headings are the same as for Table 1.
  • Table 3 Expression of the most abundant NORF genes. SAGE Tag, Locus, and Copies/cell are the same as for Table 1 ; Chr and Tag Pos denote the chromosome and position of each tag; ORF Size denotes the size of the ORF corresponding to the indicated tag. In each case, the tag was located within or less than 250 bp 3' of the NORF.
  • Positive expression level indicates the tag is on the + strand of the chromosome; Negative expression level indicates the tag is on the - strand.
  • the NORFs exist and are expressed in yeast. These genes, as well as other previously identified and previously postulated genes, can be used to study, monitor, and affect phases of cell cycle.
  • the present invention identifies which genes are differentially expressed during the cell cycle. Differentially expressed genes can be used as markers of phases of the cell cycle. They can also be used to affect a change in the phase of the cell cycle. In addition, they can be used to screen for drugs which affect the cell cycle, by affecting expression of the genes. Human homologs of these eukaryotic genes are also presumed to exist, and can be identified using the yeast genes as probes or primers to identify the human homologs. New genes termed NORFs (not previously assigned open reading frames) have been found. They are uniquely identified by their SAGE tags.
  • Differentially expressed yeast genes are those whose expression varies by a statistically significant difference (to greater than 95% confidence level) within different growth phases, particularly log phase, S phase, and G2/M. Preferably the difference is at least 10%, 25%, 50%, or 100%. In some cases, differentially expressed genes are not expressed at detectable levels in one or more cell cycle phases as determined by SAGE analysis.
  • Genes which have been found to have differential expression characteristics include: NORF N a 1, 2, 4, 5, 6, 17, 25, 27, TEF1/TEF2, EN02, ADH1, ADH2, PGK1, CUP1A/CUP1B, PYKl, YKL056C, YMR1 16C, YEL033W, YOR182C, YCR013C, ribonucleotide reductase 2 and 4, and YJR085C.
  • Differential expression can be detected by any means known in the art, such as hybridization to specific probes or immunological assays.
  • Isolated DNA molecules according to the invention contain less than a whole chromosome and can be genomic or cDNA, i.e., lacking introns.
  • Isolated DNA molecules can comprise a yeast gene or a coding sequence of a yeast gene involved in cell cycle progression, such as NORF genes which comprise SAGE tags as shown in SEQ ID NOS-67-811.
  • Isolated DNA molecules which comprise yeast genes or coding sequences of yeast genes comprising SAGE tags as shown in SEQ ID NOS.37-
  • Isolated DNA molecules can also consist of a yeast gene or a coding sequence of a yeast gene which comprises a SAGE tag as shown in SEQ ID NOS:37-12,203 or 67-811.
  • any technique for obtaining a DNA of known sequence may be used to obtain isolated DNA molecules of the invention.
  • they are isolated free of other cellular components such as membrane components, proteins, and lipids. They can be made by a cell and isolated, or synthesized using PCR or an automatic synthesizer. Methods for purifying and isolating DNA are routine and are known in the art.
  • any DNA delivery techniques known in the art may be used, without limitation. These include liposomes, transfection, mating, transduction, transformation, viral infection, electroporation. Vectors for particular purposes and characteristics can be selected by the skilled artisan for their known properties.
  • Cells which can be used as gene recipients are yeast and other fungi, mammalian cells, including humans, and bacterial cells. Antifungal drugs can be identified using yeast cells as described herein.
  • a differentially expressed NORF gene can be monitored by any means known in the art.
  • a test substance modifies the expression of such a differentially expressed gene, for example by increasing or decreasing its expression, it is a candidate drug for affecting the growth properties of fungi and may be useful as an antifungal agent.
  • Expression of more than one NORF gene can be monitored.
  • expression of 2, 3, 4, 5, 10, 15, 20, 30, 40, 50, 60, 75, 100, 150, 250, 300, 350, 400, 450, or 500 or more NORF genes can be monitored in single or multiple assays.
  • differentially expressed genes are likely to be involved in cell cycle progression, it is likely that these genes are conserved among species.
  • the differentially expressed NORF genes identified by the present invention can be used to identify homologs in humans and other mammals by contacting DNA from these mammals with a probe which comprises at least 10 contiguous nucleotides of a differentially expressed NORF gene.
  • the DNA can be genomic or cDNA, as is known in the art. Means for identifying homologous genes among different species are well known in the art. Briefly, stringency of hybridization can be reduced so that imperfectly matching sequences hybridize. This can be in the context of inter alia Southern blots, Northern blots, colony hybridization or PCR. Any hybridization technique which is known in the art can be used.
  • a DNA sequence which hybridizes to the probe is identified as a sequence of a candidate gene which is involved in cell cycle expression.
  • Probes according to the present invention are isolated DNA molecules which have at least 10, and preferably at least 12, 14, 16, 18, 20, or 25 contiguous nucleotides of a particular NORF gene or other differentially expressed gene.
  • the probes may or may not be labeled. They may be used, for example, as primers for PCR assays, or for detection of gene expression for Southern or Northern blots or in situ hybridization.
  • the probes are immobilized on a solid support.
  • the solid support can be any surface to which a probe can be attached. Suitable solid supports include, but are not limited to, glass or plastic slides, tissue culture plates, microtiter wells, tubes, or particles such as beads, including but not limited to latex, polystyrene, or glass beads. Any method known in the art can be used to attach the a probe to the solid support, including use of covalent and non-covalent linkages, passive absorption, or pairs of binding moieties attached respectively to the probe and the solid support.
  • probes are present on an array so that multiple probes can simultaneously hybridize to a single biological sample.
  • the probes can be spotted onto the array or synthesized in situ on the array. See Lockhart et. al., Nature Biotechnology, Vol. 14, December 1996, "Expression monitoring by hybridization to high-density oligonucleotide arrays.”
  • a single array contains at least one NORF probe, but can contain more than 100, 500 or even 1,000 different probes in discrete locations.
  • one or more NORF probe(s) present on the array can be nucleotide sequences from a NORF gene which is differentially expressed during the cell cycle.
  • Genes identified by the present invention which are differentially expressed during the cell cycle can also be used to obtain gene expression profiles characteristic of the response of yeast genes of a yeast cell to a particular drug or class of drugs.
  • Classes of drugs of particular interest for which gene expression profiles can be generated include those drugs which affect cell cycle or other cell processes, such as chemotherapeutic agents.
  • gene expression profiles characteristic of more than one drug of a particular class can be generated and used to make a composite gene expression profile.
  • microtubule poison drugs such as vinblastin, taxol, vincristine, and taxotere can be used to generate gene expression profiles characteristic of microtubule poisons.
  • yeast cell is contacted with a particular drug or a member of a particular class of drugs.
  • Expression of at least one yeast gene is monitored, either before and after contacting or in the contacted cell and in another yeast cell Which has not been contacted with the drug.
  • Genes which are monitored can be any yeast gene, including NORFS.
  • these genes are differentially expressed during the cell cycle.
  • yeast genes can be selected from genes comprising the SAGE tags shown in Tables 3, 4, 5, and 6 (SEQ ID NOS:67-12,203). If desired, genes such as NORF N a 1, 2, 4, 5, 6, 17, 25, or 27, TEF1ATEF2, EN02, ADH1, ADH2, PGK1,
  • CUP1A/CUP1B, PYK1, YKL056C, YMR116C, YEL033W, YOR182C, YCR013C, ribonucleotide reductase 2 and 4, and YJR085C can be used for monitoring alterations in gene expression.
  • any number of these genes such as 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250, 500, 1000, 2000, 3000, 4000, 5000, or 5,500 genes, can be measured. It is particularly convenient to monitor expression of the differentially expressed genes using nucleic acids which are immobilized on a solid support or in an array, such as the gene arrays described above.
  • genes particularly cell cycle genes, are likely to be conserved between yeast and mammals, including humans.
  • gene expression profiles characteristic of a drug or class of drugs can be used to predict the effects of candidate drugs on human cells, by identifying the candidate drug as a member of a class of drugs whose characteristic gene expression profile is known.
  • the candidate drugs can be pharmacologic agents already known in the art or can be compounds previously unknown to have any pharmacological activity.
  • the candidate drugs can be naturally occurring or designed in the laboratory. They can be isolated from microorganisms, animals, or plants, and can be produced recombinantly or synthesized by chemical methods known in the art.
  • a gene expression profile obtained using the candidate drug which is similar to a gene expression profile for a particular drug or class of drugs identifies the candidate drug as a member of that class of drugs.
  • transcriptome The set of genes expressed from the yeast genome, herein called the transcriptome, using serial analysis of gene expression (SAGE). Analysis of 60,633 transcripts revealed 4,665 genes, with expression levels ranging from 0.3 to over 200 transcripts per cell. Of these genes, 1,981 had known functions, while 2,684 were previously uncharacterized. Integration of positional information with gene expression data allowed the generation of chromosomal expression maps, identifying physical regions of transcriptional activity, and identified genes that had not been predicted by sequence information alone. These studies provide insight into global patterns of gene expression in yeast and demonstrate the feasibility of genome- wide expression studies in eukaryotes.
  • SAGE serial analysis of gene expression
  • a short sequence tag (9-11 bp) contains sufficient information to uniquely identify a transcript, provided that it is derived from a defined location within that transcript.
  • SAGE libraries were generated from yeast cells in three states: log phase,
  • the number of SAGE tags required to define a yeast transcriptome depends on the confidence level desired for detecting low abundance mRNA molecules. Assuming the previously derived estimate of 15,000 mRNA molecules per cell
  • SUP44/RPS4 was measured by hybridization at 75 +/- 10 copies/cell (Iyer and Struhl, 1996), in good accord with the SAGE data of 63 copies/cell, suggesting that the estimate of 15,000 mRNA molecules per cell was reasonably accurate.
  • Analysis of SAGE tags from S phase arrested and G2/M phase arrested cells revealed similar expression levels for this gene (range 52 to 55 copies/cell), as well as for the vast majority of expressed genes. As less than 1% of the genes were expressed at dramatically different levels among these three states (see below), SAGE tags obtained from all libraries were combined and used to analyze global patterns of gene expression.
  • the 56,291 tags that precisely matched the yeast genome represented 4,665 different genes. This number is in agreement with the estimate of 3,000 to 4,000 expressed genes obtained by RNA-DNA reassociation kinetics (Hereford and Rosbash, 1977). These expressed genes included 85% of the genes with characterized functions (1,981 of 2,340), and 76% of the total genes predicted from analysis of the yeast genome (4,665 of 6,121). These numbers are consistent with a relatively complete sampling of the yeast transcriptome given the limited number of physiological states examined and the large number of genes predicted solely on the basis of genomic sequence analysis.
  • the SAGE expression data could be integrated with existing positional information to generate chromosomal expression maps ( Figure 4). These maps were generated using the sequence of the yeast genome and the position coordinates of ORFs obtained from the Stanford Yeast Genome Database. Although there were a few genes that were noted to be physically proximal and have similarly high levels of expression, there did not appear to be any clusters of particularly high or low expression on any chromosome. Genes like histones H3 and H4, which are known to have coregulated divergent promoters and are immediately adjacent on chromosome 14 (Smith and Murray, 1983), had very similar expression levels (5 and 6 copies per cell, respectively).
  • regions within 10 kb of telomeres appeared to be uniformly undertranscribed, containing on average 3.2 tags per gene as compared with 12.4 tags per gene for non-telomeric regions ( Figure 4). This is consistent with the previously described observations of "telomeric silencing" in yeast (Gottschling et al, 1990). Recent studies have reported telomeric position effects as far as 4 kb from telomere ends (Renauld et al, 1993).
  • Table 1 lists the 30 most highly expressed genes, all of which are expressed at greater than 60 mRNA copies per cell. As expected, these genes mostly correspond to well characterized enzymes involved in energy metabolism and protein synthesis and were expressed at similar levels in all three growth states (Examples in Figure 5). Some of these genes, including EN02 (McAlister and Holland, 1982), PDC1 (Schmitt et al, 1983), PGK1 (Chambers et al, 1989), PYK1 (Nishizawa et al, 1989), and
  • ADHl (Denis et al, 1983), are known to be dramatically induced in the glucose-rich growth conditions used in this study.
  • glucose repressible genes such as the GAL1/GAL7/GAL10 cluster (St John and Davis, 1979), and GAL3 (Bajwa et al, 1988) were observed to be expressed at very low levels (0.3 or fewer copies per cell).
  • mating type a specific genes such as the a factor genes (MFA1, MFA2) (Michaelis and Herskowitz, 1988), and alpha factor receptor (STE2) (Burkholder and Hartwell, 1985) were all observed to be expressed at significant levels (range 2 to 10 copies per cell), while mating type alpha specific genes (MFal, MFa2, STE3) (Hagen et al, 1986; Kurjan and Herskowitz, 1982; Singh et al, 1983) were observed to be expressed at very low levels ( ⁇ 0.3 copies/cell).
  • MFA1, MFA2 Mating type alpha specific genes
  • STE2 alpha factor receptor
  • NORF5 one of the NORF genes was only expressed in S phase arrested cells and corresponded to the transcript whose abundance varied the most in the three states analyzed (> 49 fold, Figure 5).
  • Comparison of S phase arrested cells to the other states also identified greater than 9 fold elevation of the RNR2 and RNR4 transcripts ( Figure 5). Induction of these ribonucleoside reductase genes is likely to be due to the hydroxyurea treatment used to arrest cells in S phase (Elledge and Davis, 1989).
  • G2/M arrested cells identified elevation of RBL2 and dynein light chain, both microtubule associated proteins (Archer et al, 1995 ; Dick et al, 1996).
  • Yeast genome intergenic regions were defined as regions outside annotated ORFs or the 500bp region downstream of annotated ORFs (yeast genome sequence and tables of annotated ORFs were obtained from SGD at http://genome-www.stanford.edu/Saccharomyces/). Based on sequence analysis a total of 9524 putative ORFs of 25-99 amino acids were present in the intergenic regions; 510 of these ORFs contain or are adjacent to observed SAGE tags (Table 6). Of the 60,633 SAGE tags analyzed, there were 302 unique SAGE tags either within or adjacent to intergenic ORFs (lOObp upstream or 500bp downstream of the ORF)
  • the expression level for each NORF shown in Table 6 corresponds to the number of mRNA transcript copies per cell. If the expression level is positive it means that the tag is on the + strand of the chromosome; if negative, the tag is on the
  • Comparison of gene expression patterns from altered physiologic states can provide insight into genes that are important in a variety of processes. Comparison of transcriptomes from a variety of physiologic states should provide a minimum set of genes whose expression is required for normal vegetative growth, and another set composed of genes that will be expressed only in response to specific environmental stimuli, or during specialized processes. For example, recent work has defined a minimal set of 250 genes required for prokaryotic cellular life (Mushegian and Koonin, 1996). Examination of the yeast genome readily identified homologous genes for 196 of these, over 90% of which were observed to be expressed in the SAGE analysis. Detailed analyses of yeast transcriptomes, as well as transcriptomes from other organisms, should ultimately allow the generation of a minimal set of genes required for eukaryotic life.
  • SAGE analysis of yeast transcriptomes has several potential limitations. First, a small number of transcripts would be expected to lack an Nlalll site and therefore would not be detected by our analysis. Second, our analysis was limited to transcripts found at least as frequently as 0.3 copies per cell.
  • transcriptomes from a variety of organisms, including human.
  • the data recorded here suggest that a reasonably complete picture of a human cell transcriptome will require only about 10 - 20 fold more tags than evaluated here, a number well within the practical realm achievable with a small number of automated sequencers.
  • the analysis of global expression patterns in higher eukaryotes is expected, in general, to be similar to those reported here for S. cerevisiae.
  • the analysis of the transcriptome in different cells and from different individuals should yield a wealth of information regarding gene function in normal, developmental, and disease states.
  • the source of transcripts for all experiments was S. cerevisiae strain YPH499 (MATa ura3-52 lys2-801 ade2-101 Ieu2- ⁇ l his3- ⁇ 200 trpl- ⁇ 63) (Sikorski and Hieter, 1989).
  • Logarithmically growing cells were obtained by growing yeast cells to early log phase (3 x 10 6 cells/ml) in YPD (Rose et al, 1990) rich medium (YPD supplemented with 6 mM uracil, 4.8 mM adenine and 24 mM tryptophan) at 30°C.
  • hydroxyurea 0.1 M was added to early log phase cells, and the culture was incubated an additional 3.5 hours at 30°C.
  • nocodazole 15 ⁇ g/ml was added to early log phase cells and the culture was incubated for an additional 100 minutes at 30°C.
  • Harvested cells were washed once with water prior to freezing at -70 °C. The growth states of the harvested cells were confirmed by microscopic and flow cytometric analyses (Basrai et al, 1996). SAGE protocol
  • the cDNA was cleaved with Nlalll (Anchoring Enzyme). As Nlalll sites were observed to occur once every 309 base pairs in three arbitrarily chosen yeast chromosomes (1, 5, 10), 95% of yeast transcripts were predicted to be detectable with a Nlalll-based SAGE approach. After capture of the 3' cDNA fragments on streptavidin coated magnetic beads (Dynal), the bound cDNA was divided into two pools, and one of the following linkers containing recognition sites for BsmFI was l i g at e d to each poo l : L i nker 1 , 5 ' -
  • TTTCTGCTCGAATTCAAGCTTCTAACGATGTACGGGGACATG-3' (SEQ ID NO:3) , 5'-TCCCCGTACATCG AGAAGCTTGAATTCGAGCAG[amino mod. C7]-3' (SEQ ID NO:4).
  • PCR product containing two tags ligated tail to tail was excised.
  • the PCR product was then cleaved with Nlalll, and the band containing the ditags was excised and self-ligated. After ligation, the concatenated products were separated by PAGE and products between 500 bp and 2 kb were excised. These products were cloned into the Sphl site of pZero (Invitrogen). Colonies were screened for inserts by PCR with
  • Each successful sequencing reaction identified an average of 26 tags; given a 90% sequencing reaction success rate, this corresponded to an average of about 850 tags per sequencing gel.
  • Sequence files were analyzed by means of the SAGE program group (Velculescu etal, 1995), which identifies the anchoring enzyme site with the proper spacing and extracts the two intervening tags and records them in a database.
  • SAGE program group Velculescu etal, 1995
  • 68,691 tags obtained contained 62,965 tags from unique ditags and 5,726 tags from repeated ditags. The latter were counted only once to eliminate potential PCR bias of the quantitation, as described (Velculescu et al, 1995). Of 62,965 tags, 2,332 tags corresponded to linker sequences, and were excluded from further analysis. Of the remaining tags, 4,342 tags could not be assigned, and were likely due to sequencing errors (in the tags or in the yeast genomic sequence). If all of these were due to tag sequencing errors, this corresponds to a sequencing error rate of about 0.7% per base pair (for a lObp tag), not far from what we would have expected under our automated sequencing conditions.
  • yeast genome sequence obtained from the Stanford yeast genome ftp site (genome-ftp.stanford.edu) on August 7, 1996).
  • SAGE tags can be derived from 3' untranslated regions of genes, a SAGE tag was considered to correspond to a particular gene if it matched the ORF or the region 500 bp 3' of the ORF (locus names, gene names and ORF chromosomal coordinates were obtained from Stanford yeast genome ftp site, and ORF descriptions were obtained from MIPS www site (http://www.mips.biochem.mpg.de/) on August 14, 1996).
  • ORFs were considered genes with known functions if they were associated with a three letter gene name, while ORFs without such designations were considered uncharacterized.
  • SAGE tags matched transcribed portions of the genome in a highly non-random fashion, with 88% matching ORFs or their adjacent 3' regions in the correct orientation (chi-squared P value ⁇ 10 '30 ).
  • the abundance was calculated to be the sum of the matched tags.
  • Tags that matched ORFs in the incorrect orientation were not used in abundance calculations.
  • a tag matched more than one region of the genome for example an ORF and non-ORF region
  • only the matched ORF was considered.
  • the 15th base of the tag could also be used to resolve ambiguities.
  • TACCACTCCT 9 EN02 YHR17 W 229 2-ph ⁇ 6-phoglycerat ⁇ d ⁇ tiydratase
  • TTGCCAGTCT 11 PDC1 YIR044C 207 pyruvat ⁇ decarbaxylase Isozyme 1 ⁇ . GGTGAAAACG 12 ADH1. ADH2 Y0L086C/YMR303C 182 alcohol dehy rooenas ⁇ I / II
  • TTGAACTACC 37 YKL056C 58 strong similarity to human IgE-dependent histamine-releasing factor (21 K tumor protein)
  • TTCGGGTCAC 38 YDR276C 56 strong similarity to Hordeum vulgare blt101 protein
  • CCAGATATGA 39 YIL093C 41 hypothetical protein ⁇ . TTTAAAATGG 40 YMR116C 38 similarity to ⁇ .crassa CPC2 protein

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Medicinal Chemistry (AREA)
  • Mycology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Molecular Biology (AREA)
  • Toxicology (AREA)
  • Cell Biology (AREA)
  • Microbiology (AREA)
  • Zoology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)

Abstract

Yeast genes which are differentially expressed during the cell cycle are described. They can be used to study, affect, and monitor the cell cycle of a eukaryotic cell. They can be used to obtain human homologs involved in cell cycle regulation. They can be used to identify antifungal agents and other classes of drugs. They can be formed into arrays on solid supports for interrogation of a cell's transcriptome under various conditions.

Description

CHARACTERIZATION OF THE YEAST TRANSCRIPTOME
This application is a continuation-in-part of co-pending application Serial No.
09/012,031 filed January 22, 1998, the disclosure of which is incoφorated by reference herein. This invention was made with government support under CA57345 awarded by the National Institutes of Health. The government has certain rights in the invention.
TECHNICAL FIELD OF THE INVENTION
This invention is related to the characterization of the expressed genes of the yeast genome. More particularly, it is related to the identification and use of previously unrecognized genes.
BACKGROUND OF THE INVENTION
It is by now axiomatic that the phenotype of an organism is largely determined by the genes expressed within it. These expressed genes can be represented by a "transcriptome," conveying the identity of each expressed gene and its level of expression for a defined population of cells. Unlike the genome, which is essentially a static entity, the transcriptome can be modulated by both. external and internal factors. The transcriptome thereby serves as a dynamic link between an organism's genome and its physical characteristics.
The transcriptome as defined above has not been characterized in any eukaryotic or prokaryotic organism, largely because of technological limitations. However, some general features of gene expression patterns were elucidated two decades ago through RNA-DNA hybridization measurements (Bishop et al, 1974; Hereford and Rosbash,
1977). In many organisms, it was thus found that at least three classes of transcripts could be identified, with either high, medium, or low levels of expression, and the number of transcripts per cell were estimated (Lewin, 1980). These data of course provided little information about the specific genes that were members of each class.
Data on the expression levels of individual genes have accumulated as new genes were discovered. However, in only a few instances have the absolute levels of expression of particular genes been measured and compared to other genes in the same cell type.
Description of any cell's transcriptome would therefore provide new information useful for understanding numerous aspects of cell biology and biochemistry.
SUMMARY OF THE INVENTION
It is an object of the present invention to provide isolated DNA molecules and methods of using such molecules to affect the cell cycle and identify candidate drugs. These and other objects of the invention are achieved by providing the art with one or more of the embodiments described below.
According to one embodiment of the invention an isolated DNA molecule is provided. It comprises a coding sequence of a yeast gene selected from the group consisting of NORF genes comprising a SAGE tag as shown in SEQ ID NOS.67-811. According to another embodiment of the invention a method of using NORF genes is provided. The method is for affecting the cell cycle of a cell. The method comprises the step of administering to a cell an isolated DNA molecule comprising a coding sequence of a NORF gene whose expression varies by at least 10% between any two phases of the cell cycle selected from the group consisting of log phase, S phase, and G2/M.
In yet another embodiment of the invention a method for screening candidate antifiαngal drugs is provided. The method comprises the steps of contacting a test substance with a yeast cell and monitoring expression of a NORF gene whose expression varies by at least 10% between any two phases of the cell cycle selected from the group consisting of log phase, S phase, and G2/M, wherein a test substance which modifies the expression of the yeast gene is a candidate antifungal drug.
In still another embodiment of the invention a method for identifying human genes which are involved in cell cycle progression is provided. The method comprises the step of contacting human DNA with a probe which comprises at least 14 contiguous nucleotides of a NORF gene whose expression varies by at least 10% between any two phases of the cell cycle selected from, the group consisting of log phase, S phase, and G2/M. A human DNA sequence which hybridizes to the probe is identified as a sequence of a candidate human gene which is involved in cell cycle progression. The present invention provides probes which comprise at least 14 contiguous nucleotides of a NORF gene comprising a SAGE tag as shown in SEQ ID NOS-67-
81 1.
The invention also provides an array of probes on a solid support. At least one probe in the array comprises at least 14 contiguous nucleotides of a NORF gene comprising a SAGE tag as shown in SEQ ID NOS:67-811.
Still another embodiment of the invention is a method of identifying a candidate drug as a member of a class of drugs having a characteristic effect on gene expression in a yeast cell. A yeast cell is contacted with a candidate drug. Expression of at least one NORF gene whose expression is affected by the class of drugs is monitored in the yeast cell. Detection of a difference in expression of the at least one NORF gene relative to expression in the absence of the candidate drug identifies the candidate drug as a member of the class of drugs.
These and other embodiments of the invention which will be apparent to those of skill in the art upon reading the detailed disclosure provided below, make available to the art hitherto unrecognized genes, and information about the expression of genes globally at the organismal level.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1. Schematic of SAGE Method and Genome Analysis. In applying
SAGE to the analysis of yeast gene expression patterns, the 3' most Nlalll site was used to define a unique position in each transcript and to provide a site for ligation of a linker with a BsmFI site. The type IIs enzyme BsmFI, which cleaves a defined distance from its non-palindromic recognition site, was then used to generate a 15bp SAGE tag (designated by the black arrows), which includes the Nlalll site. Automated sequencing of concatenated SAGE tags allowed the routine identification of about a thousand tags per 36-lane sequencing gel. Once sequenced, the abundance of each SAGE tag was calculated, and each tag was used to search the entire yeast genome to identify its corresponding gene. The lower panel shows a small region of Chromosome 15. Gray arrows indicate all potential SAGE tags (Nlalll sites) and black arrows indicate 3' most SAGE tags. The total number of tags observed for each potential tag is indicated above (+ strand) or below (- strand) the tag. As expected, the observed SAGE tags were associated with the 3' end of expressed genes.
Figure 2. Sampling of Yeast Gene Expression. Analysis of increasing amounts of ascertained tags reveals a plateau in the number of unique expressed genes. Triangles represent genes with known functions, squares represent genes predicted on the basis of sequence information, and circles represent total genes.
Figure 3. Virtual Rot. (A) Abundance Classes in the Yeast Transcriptome. The transcript abundance is plotted in reverse order on the abscissa, whereas the fraction of total transcripts with at least that abundance is plotted on the ordinate. The dotted lines identify the three components of the curve, 1, 2, and 3. This is analogous to a Rot curve derived from reassociation kinetics where the product of initial RNA concentration and time is plotted on the abscissa, and the percent of labeled cDNA that hybridizes to excess mRNA is plotted on the ordinate. (B) Comparison of Virtual Rot and Rot Components. Transitions and data from virtual Rot components were calculated from the data in Figure 3 A, while data for Rot components were obtained from Hereford and Rosbash, 1977.
Figure 4. Chromosomal Expression Map for S. cerevisiae. Individual yeast genes were positioned on each chromosome according to their open reading frame (ORF) start coordinates. Abundance levels of tags corresponding to each gene are displayed on the vertical axis, with transcription from the + strand indicated above the abscissa and that from the - strand indicated below. Yellow bands at ends of the expanded chromosome represent telomeric regions that are undertranscribed (see text for details).
Figure 5. Northern Blot Analysis of Representative Genes. TDH2/3 , TEF 112 and NORF 1, are expressed relatively equally in all three states (lane 1, G2/M arrested; lane 2, S phase arrested; lane 3, log phase), while RNR4, RNR2 , and NORF5 are highly expressed in S-phase arrested cells. The expression level observed by SAGE
(number of tags) is noted below each lane and was highly correlated with quantitation of the Northern blot by Phosphorlmager analysis (1^=0.97).
TABLE LEGENDS
Table 1. Highly Expressed Genes. Tag represents the 10 bp SAGE tag adjacent to the Nlalll site; Gene represents the gene or genes corresponding to a particular tag
(multiple genes that match unique tags are from related families, with an average identity of 93%); Locus and Description denote the locus name and functional description of each ORF, respectively; Copies/cell represents the abundance of each transcript in the SAGE library, assuming 15,000 total transcripts per cell and 60,633 ascertained transcripts.
Table 2. Expression of Putative Coding Sequences. Table column headings are the same as for Table 1.
Table 3. Expression of the most abundant NORF genes. SAGE Tag, Locus, and Copies/cell are the same as for Table 1 ; Chr and Tag Pos denote the chromosome and position of each tag; ORF Size denotes the size of the ORF corresponding to the indicated tag. In each case, the tag was located within or less than 250 bp 3' of the NORF.
Table 4. Expression of NORF genes. SAGE tag and Copies/cell are the same as for Table 1. Chr and Tag Pos denote the chromosome and position of each tag. Table 5. Gene expression changes in different cell cycle phases. L denotes log phase; S denotes synthesis phase; G2/M denotes the mitotic phase. Tag Sequence represents the 10 bp SAGE tag adjacent to the Nlalll site; "ratio L to S" denotes the ratio of expression in log phase to expression in synthesis phase; "ratio S to G2/M" denotes the ratio of expression in synthesis phase to expression in G2/M phase; "ratio G2/M to L" denotes the ratio of expression in G2/M to log phase. #DIV/0! indicates an increase in expression from 0; a value of 0 indicates a decrease in expression to 0; a value of 1 indicates no change; a value less than 1 indicates a decrease in expression; and a value greater than 1 indicates an increase in expression.
Table 6. Intergenic open reading frames that contain or are adjacent to observed SAGE tags. Copies/cell represents abundance of each mRNA transcript as in Table
1. Positive expression level indicates the tag is on the + strand of the chromosome; Negative expression level indicates the tag is on the - strand.
DETAILED DESCRIPTION It is a discovery of the present invention that certain hitherto unknown genes
(the NORFs) exist and are expressed in yeast. These genes, as well as other previously identified and previously postulated genes, can be used to study, monitor, and affect phases of cell cycle. The present invention identifies which genes are differentially expressed during the cell cycle. Differentially expressed genes can be used as markers of phases of the cell cycle. They can also be used to affect a change in the phase of the cell cycle. In addition, they can be used to screen for drugs which affect the cell cycle, by affecting expression of the genes. Human homologs of these eukaryotic genes are also presumed to exist, and can be identified using the yeast genes as probes or primers to identify the human homologs. New genes termed NORFs (not previously assigned open reading frames) have been found. They are uniquely identified by their SAGE tags. In addition their entire nucleotide sequences are known and publicly available. In general, these were not previously identified as genes due to their small size. However, they have now been found to be expressed. Differentially expressed yeast genes are those whose expression varies by a statistically significant difference (to greater than 95% confidence level) within different growth phases, particularly log phase, S phase, and G2/M. Preferably the difference is at least 10%, 25%, 50%, or 100%. In some cases, differentially expressed genes are not expressed at detectable levels in one or more cell cycle phases as determined by SAGE analysis. Genes which have been found to have differential expression characteristics include: NORF Na 1, 2, 4, 5, 6, 17, 25, 27, TEF1/TEF2, EN02, ADH1, ADH2, PGK1, CUP1A/CUP1B, PYKl, YKL056C, YMR1 16C, YEL033W, YOR182C, YCR013C, ribonucleotide reductase 2 and 4, and YJR085C. Differential expression can be detected by any means known in the art, such as hybridization to specific probes or immunological assays. Isolated DNA molecules according to the invention contain less than a whole chromosome and can be genomic or cDNA, i.e., lacking introns. Isolated DNA molecules can comprise a yeast gene or a coding sequence of a yeast gene involved in cell cycle progression, such as NORF genes which comprise SAGE tags as shown in SEQ ID NOS-67-811. Isolated DNA molecules which comprise yeast genes or coding sequences of yeast genes comprising SAGE tags as shown in SEQ ID NOS.37-
12,203 are also isolated DNA molecules of the invention. Isolated DNA molecules can also consist of a yeast gene or a coding sequence of a yeast gene which comprises a SAGE tag as shown in SEQ ID NOS:37-12,203 or 67-811.
Any technique for obtaining a DNA of known sequence may be used to obtain isolated DNA molecules of the invention. Preferably they are isolated free of other cellular components such as membrane components, proteins, and lipids. They can be made by a cell and isolated, or synthesized using PCR or an automatic synthesizer. Methods for purifying and isolating DNA are routine and are known in the art.
To administer yeast genes to cells, any DNA delivery techniques known in the art may be used, without limitation. These include liposomes, transfection, mating, transduction, transformation, viral infection, electroporation. Vectors for particular purposes and characteristics can be selected by the skilled artisan for their known properties. Cells which can be used as gene recipients are yeast and other fungi, mammalian cells, including humans, and bacterial cells. Antifungal drugs can be identified using yeast cells as described herein.
Expression of a differentially expressed NORF gene can be monitored by any means known in the art. When a test substance modifies the expression of such a differentially expressed gene, for example by increasing or decreasing its expression, it is a candidate drug for affecting the growth properties of fungi and may be useful as an antifungal agent. Expression of more than one NORF gene can be monitored.
For example, expression of 2, 3, 4, 5, 10, 15, 20, 30, 40, 50, 60, 75, 100, 150, 250, 300, 350, 400, 450, or 500 or more NORF genes can be monitored in single or multiple assays.
Because differentially expressed genes are likely to be involved in cell cycle progression, it is likely that these genes are conserved among species. The differentially expressed NORF genes identified by the present invention can be used to identify homologs in humans and other mammals by contacting DNA from these mammals with a probe which comprises at least 10 contiguous nucleotides of a differentially expressed NORF gene. The DNA can be genomic or cDNA, as is known in the art. Means for identifying homologous genes among different species are well known in the art. Briefly, stringency of hybridization can be reduced so that imperfectly matching sequences hybridize. This can be in the context of inter alia Southern blots, Northern blots, colony hybridization or PCR. Any hybridization technique which is known in the art can be used. A DNA sequence which hybridizes to the probe is identified as a sequence of a candidate gene which is involved in cell cycle expression.
Probes according to the present invention are isolated DNA molecules which have at least 10, and preferably at least 12, 14, 16, 18, 20, or 25 contiguous nucleotides of a particular NORF gene or other differentially expressed gene. The probes may or may not be labeled. They may be used, for example, as primers for PCR assays, or for detection of gene expression for Southern or Northern blots or in situ hybridization. Preferably the probes are immobilized on a solid support. The solid support can be any surface to which a probe can be attached. Suitable solid supports include, but are not limited to, glass or plastic slides, tissue culture plates, microtiter wells, tubes, or particles such as beads, including but not limited to latex, polystyrene, or glass beads. Any method known in the art can be used to attach the a probe to the solid support, including use of covalent and non-covalent linkages, passive absorption, or pairs of binding moieties attached respectively to the probe and the solid support.
More preferably, probes are present on an array so that multiple probes can simultaneously hybridize to a single biological sample. The probes can be spotted onto the array or synthesized in situ on the array. See Lockhart et. al., Nature Biotechnology, Vol. 14, December 1996, "Expression monitoring by hybridization to high-density oligonucleotide arrays." A single array contains at least one NORF probe, but can contain more than 100, 500 or even 1,000 different probes in discrete locations. If desired, one or more NORF probe(s) present on the array can be nucleotide sequences from a NORF gene which is differentially expressed during the cell cycle.
Genes identified by the present invention which are differentially expressed during the cell cycle can also be used to obtain gene expression profiles characteristic of the response of yeast genes of a yeast cell to a particular drug or class of drugs. Classes of drugs of particular interest for which gene expression profiles can be generated include those drugs which affect cell cycle or other cell processes, such as chemotherapeutic agents. If desired, gene expression profiles characteristic of more than one drug of a particular class can be generated and used to make a composite gene expression profile. For example, microtubule poison drugs such as vinblastin, taxol, vincristine, and taxotere can be used to generate gene expression profiles characteristic of microtubule poisons.
To generate a gene expression profile characteristic of a particular drug or class of drugs, a yeast cell is contacted with a particular drug or a member of a particular class of drugs. Expression of at least one yeast gene is monitored, either before and after contacting or in the contacted cell and in another yeast cell Which has not been contacted with the drug. Genes which are monitored can be any yeast gene, including NORFS. Preferably, these genes are differentially expressed during the cell cycle. For example, yeast genes can be selected from genes comprising the SAGE tags shown in Tables 3, 4, 5, and 6 (SEQ ID NOS:67-12,203). If desired, genes such as NORF Na 1, 2, 4, 5, 6, 17, 25, or 27, TEF1ATEF2, EN02, ADH1, ADH2, PGK1,
CUP1A/CUP1B, PYK1, YKL056C, YMR116C, YEL033W, YOR182C, YCR013C, ribonucleotide reductase 2 and 4, and YJR085C, can be used for monitoring alterations in gene expression.
The expression of any number of these genes, such as 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250, 500, 1000, 2000, 3000, 4000, 5000, or 5,500 genes, can be measured. It is particularly convenient to monitor expression of the differentially expressed genes using nucleic acids which are immobilized on a solid support or in an array, such as the gene arrays described above.
Many genes, particularly cell cycle genes, are likely to be conserved between yeast and mammals, including humans. Thus, gene expression profiles characteristic of a drug or class of drugs can be used to predict the effects of candidate drugs on human cells, by identifying the candidate drug as a member of a class of drugs whose characteristic gene expression profile is known. The candidate drugs can be pharmacologic agents already known in the art or can be compounds previously unknown to have any pharmacological activity. The candidate drugs can be naturally occurring or designed in the laboratory. They can be isolated from microorganisms, animals, or plants, and can be produced recombinantly or synthesized by chemical methods known in the art.
The effect of a candidate drug on expression of at least one gene whose expression is affected by the class of drugs is monitored. A gene expression profile obtained using the candidate drug which is similar to a gene expression profile for a particular drug or class of drugs identifies the candidate drug as a member of that class of drugs.
The effect of modifying particular substituents of a known drug or of a candidate drug can be similarly tested. Such methods are useful for determining whether alterations intended, for example, to increase solubility or absorption of a particular drug will have an unintended and possibly deleterious effect on genes which are differentially expressed during the cell cycle.
The above disclosure generally describes the present invention. A more complete understanding can be obtained by reference to the following specific examples which are provided herein for purposes of illustration only, and are not intended to limit the scope of the invention.
EXAMPLE Summary
We have analyzed the set of genes expressed from the yeast genome, herein called the transcriptome, using serial analysis of gene expression (SAGE). Analysis of 60,633 transcripts revealed 4,665 genes, with expression levels ranging from 0.3 to over 200 transcripts per cell. Of these genes, 1,981 had known functions, while 2,684 were previously uncharacterized. Integration of positional information with gene expression data allowed the generation of chromosomal expression maps, identifying physical regions of transcriptional activity, and identified genes that had not been predicted by sequence information alone. These studies provide insight into global patterns of gene expression in yeast and demonstrate the feasibility of genome- wide expression studies in eukaryotes.
Results Characteristics and Rationale of SAGE Approach
Several methods have recently been described for the high throughput evaluation of gene expression (Nguyen et al, 1995; Schena et al, 1995; Velculescu et al, 1995). We used SAGE (Serial Analysis of Gene Expression) because it can provide quantitative gene expression data without the prerequisite of a hybridization probe for each transcript. The SAGE technology is based on two basic principles
(Figure 1). First, a short sequence tag (9-11 bp) contains sufficient information to uniquely identify a transcript, provided that it is derived from a defined location within that transcript. Second, many transcript tags can be concatenated into a single molecule and then sequenced, revealing the identity of multiple tags simultaneously. The expression pattern of any population of transcripts can be quantitatively evaluated by determining the abundance of individual tags and identifying the gene corresponding to each tag.
Genome-wide expression
In order to maximize representation of genes involved in normal growth and cell-cycle progression, SAGE libraries were generated from yeast cells in three states: log phase,
S phase arrested and G2/M phase arrested. In total, SAGE tags corresponding to
60,633 total transcripts were identified (including 20,184 from log phase, 20,034 from
S phase arrested, and 20,415 from G2/M phase arrested cells). Of these tags, 56,291 tags (93%) precisely matched the yeast genome, 88 tags matched the mitochondrial genome, and 91 tags matched the 2 micron plasmid.
The number of SAGE tags required to define a yeast transcriptome depends on the confidence level desired for detecting low abundance mRNA molecules. Assuming the previously derived estimate of 15,000 mRNA molecules per cell
(Hereford and Rosbash, 1977), 20,000 tags would represent a 1.3 fold coverage even for mRNA molecules present at a single copy per cell, and would provide a 72% probability of detecting such transcripts (as determined by Monte Carlo simulations). Analysis of 20,184 tags from log phase cells identified 3,298 unique genes. As an independent confirmation of mRNA copy number per cell, we compared the expression level of SUP44/RPS4, one of the few genes whose absolute mRNA levels have been reliably determined by quantitative hybridization experiments (Iyer and Struhl, 1996), with expression levels determined by SAGE. SUP44/RPS4 was measured by hybridization at 75 +/- 10 copies/cell (Iyer and Struhl, 1996), in good accord with the SAGE data of 63 copies/cell, suggesting that the estimate of 15,000 mRNA molecules per cell was reasonably accurate. Analysis of SAGE tags from S phase arrested and G2/M phase arrested cells revealed similar expression levels for this gene (range 52 to 55 copies/cell), as well as for the vast majority of expressed genes. As less than 1% of the genes were expressed at dramatically different levels among these three states (see below), SAGE tags obtained from all libraries were combined and used to analyze global patterns of gene expression.
Analysis of ascertained tags at increasing increments revealed that the number of unique transcripts plateaued at -60,000 tags (Figure 2). This suggested that generation of further SAGE tags would yield few additional genes, consistent with the fact that sixty thousand transcripts represented a four-fold redundancy for genes expressed as low as 1 transcript per cell. Likewise, Monte Carlo simulations indicated that analysis of 60,000 tags would identify at least one tag for a given transcript 97% of the time if its expression level was one copy per cell.
The 56,291 tags that precisely matched the yeast genome represented 4,665 different genes. This number is in agreement with the estimate of 3,000 to 4,000 expressed genes obtained by RNA-DNA reassociation kinetics (Hereford and Rosbash, 1977). These expressed genes included 85% of the genes with characterized functions (1,981 of 2,340), and 76% of the total genes predicted from analysis of the yeast genome (4,665 of 6,121). These numbers are consistent with a relatively complete sampling of the yeast transcriptome given the limited number of physiological states examined and the large number of genes predicted solely on the basis of genomic sequence analysis.
The transcript expression per gene was observed to vary from 0.3 to over 200 copies per cell. Analysis of the distribution of gene expression levels revealed several abundance classes that were similar to those observed in previous studies using reassociation kinetics. A "virtual Rot" of the genes observed by SAGE (Figure 3 A) identified three main components of the transcriptome with abundances ranging over three orders of magnitude. A Rot curve derived from RNA-cDNA reassociation kinetics also contained three main components distributed over a similar range of abundances (Hereford and Rosbash, 1977). Although the kinetics of reassociation of a particular class of RNA and cDNA may be affected by numerous experimental variables, there were striking similarities between Rot and virtual Rot analyses (Figure 3B). Because Rot analysis may not detect all transcripts of low abundance (Lewin, 1980), it is not surprising that SAGE revealed both a larger total number of expressed genes and a higher fraction of the transcriptome belonging to the low abundance transcript class.
Integration of Expression Information with the Genomic Map
The SAGE expression data could be integrated with existing positional information to generate chromosomal expression maps (Figure 4). These maps were generated using the sequence of the yeast genome and the position coordinates of ORFs obtained from the Stanford Yeast Genome Database. Although there were a few genes that were noted to be physically proximal and have similarly high levels of expression, there did not appear to be any clusters of particularly high or low expression on any chromosome. Genes like histones H3 and H4, which are known to have coregulated divergent promoters and are immediately adjacent on chromosome 14 (Smith and Murray, 1983), had very similar expression levels (5 and 6 copies per cell, respectively). The distribution of transcripts among the chromosomes suggested that overall transcription was evenly dispersed, with total transcript levels being roughly linearly related to chromosome size (r2 =0.85, data not shown). However, regions within 10 kb of telomeres appeared to be uniformly undertranscribed, containing on average 3.2 tags per gene as compared with 12.4 tags per gene for non-telomeric regions (Figure 4). This is consistent with the previously described observations of "telomeric silencing" in yeast (Gottschling et al, 1990). Recent studies have reported telomeric position effects as far as 4 kb from telomere ends (Renauld et al, 1993).
Gene Expression Patterns Table 1 lists the 30 most highly expressed genes, all of which are expressed at greater than 60 mRNA copies per cell. As expected, these genes mostly correspond to well characterized enzymes involved in energy metabolism and protein synthesis and were expressed at similar levels in all three growth states (Examples in Figure 5). Some of these genes, including EN02 (McAlister and Holland, 1982), PDC1 (Schmitt et al, 1983), PGK1 (Chambers et al, 1989), PYK1 (Nishizawa et al, 1989), and
ADHl (Denis et al, 1983), are known to be dramatically induced in the glucose-rich growth conditions used in this study. In contrast, glucose repressible genes such as the GAL1/GAL7/GAL10 cluster (St John and Davis, 1979), and GAL3 (Bajwa et al, 1988) were observed to be expressed at very low levels (0.3 or fewer copies per cell). As expected for the yeast strain used in this study, mating type a specific genes, such as the a factor genes (MFA1, MFA2) (Michaelis and Herskowitz, 1988), and alpha factor receptor (STE2) (Burkholder and Hartwell, 1985) were all observed to be expressed at significant levels (range 2 to 10 copies per cell), while mating type alpha specific genes (MFal, MFa2, STE3) (Hagen et al, 1986; Kurjan and Herskowitz, 1982; Singh et al, 1983) were observed to be expressed at very low levels (<0.3 copies/cell).
Three of the highly expressed genes in Table 1 had not been previously characterized. One contained an ORF with predicted ribosomal function, previously identified only by genomic sequence analysis. Analyses of all SAGE data suggested that there were 2,684 such genes corresponding to uncharacterized ORFs which were transcribed at detectable levels. The 30 most abundant of these transcripts were observed more than 30 times, corresponding to at least 8 transcripts per cell (Table 2). The other two highly expressed uncharacterized genes corresponded to ORFs not predicted by analysis of the yeast genome sequence (NORF = Nonannotated ORF). Analyses of SAGE data suggested that there were at least 160 NORF genes transcribed at detectable levels. The 30 most abundant of these transcripts were observed at least 9 times (Table 3 and examples in Figure 5).
Interestingly, one of the NORF genes (NORF5) was only expressed in S phase arrested cells and corresponded to the transcript whose abundance varied the most in the three states analyzed (> 49 fold, Figure 5). Comparison of S phase arrested cells to the other states also identified greater than 9 fold elevation of the RNR2 and RNR4 transcripts (Figure 5). Induction of these ribonucleoside reductase genes is likely to be due to the hydroxyurea treatment used to arrest cells in S phase (Elledge and Davis, 1989). Likewise, comparison of G2/M arrested cells identified elevation of RBL2 and dynein light chain, both microtubule associated proteins (Archer et al, 1995 ; Dick et al, 1996). As with the RΝR inductions, these elevated levels seem likely to be related to the nocodazole treatment used to arrest cells in the G2/M phase. While there were many relatively small differences between the states (for example, NORF1, Figure 5), overall comparison of the three states revealed surprisingly few dramatic differences; there were only 29 transcripts whose abundance varied more than 10 fold among the three different states analyzed (Tables 4 and 5).
A comprehensive analysis for ΝORF genes was performed using the SAGE data. Yeast genome intergenic regions were defined as regions outside annotated ORFs or the 500bp region downstream of annotated ORFs (yeast genome sequence and tables of annotated ORFs were obtained from SGD at http://genome-www.stanford.edu/Saccharomyces/). Based on sequence analysis a total of 9524 putative ORFs of 25-99 amino acids were present in the intergenic regions; 510 of these ORFs contain or are adjacent to observed SAGE tags (Table 6). Of the 60,633 SAGE tags analyzed, there were 302 unique SAGE tags either within or adjacent to intergenic ORFs (lOObp upstream or 500bp downstream of the ORF)
(Table 6). Note that in some cases, more than one NORF contains or is adjacent to the SAGE tag. These tags matched the genome uniquely, were in the correct orientation, and were expressed at levels greater than 0.3 transcript copies per cell.
The expression level for each NORF shown in Table 6 corresponds to the number of mRNA transcript copies per cell. If the expression level is positive it means that the tag is on the + strand of the chromosome; if negative, the tag is on the
- strand of the chromosome.
Discussion
Analysis of a yeast transcriptome affords a unique view of the RNA components defining cellular life. Comparison of gene expression patterns from altered physiologic states can provide insight into genes that are important in a variety of processes. Comparison of transcriptomes from a variety of physiologic states should provide a minimum set of genes whose expression is required for normal vegetative growth, and another set composed of genes that will be expressed only in response to specific environmental stimuli, or during specialized processes. For example, recent work has defined a minimal set of 250 genes required for prokaryotic cellular life (Mushegian and Koonin, 1996). Examination of the yeast genome readily identified homologous genes for 196 of these, over 90% of which were observed to be expressed in the SAGE analysis. Detailed analyses of yeast transcriptomes, as well as transcriptomes from other organisms, should ultimately allow the generation of a minimal set of genes required for eukaryotic life.
Like other genome-wide analyses, SAGE analysis of yeast transcriptomes has several potential limitations. First, a small number of transcripts would be expected to lack an Nlalll site and therefore would not be detected by our analysis. Second, our analysis was limited to transcripts found at least as frequently as 0.3 copies per cell.
Transcripts expressed in only a minute fraction of the cell cycle, or transcripts expressed in only a fraction of the cell population, would not be reliably detected by our analysis. Finally, mRNA sequence data are practically unavailable for yeast, and consequently, some SAGE tags cannot be unambiguously matched to corresponding genes. Tags which were derived from overlapping genes, or genes which have unusually long 3' untranslated regions may be misassigned. Increased availability of 3' UTR sequences in yeast mRNA molecules should help to resolve the ambiguities. Despite these potential limitations, it is clear that the analyses described here furnish both global and local pictures of gene expression, precisely defined at the nucleotide level. These data, like the sequence of the yeast genome itself, provide simple, basic information integral to the interpretation of many experiments in the future. The availability of mRNA sequence information from EST sequencing as well as various genome projects, will soon allow definition of transcriptomes from a variety of organisms, including human. The data recorded here suggest that a reasonably complete picture of a human cell transcriptome will require only about 10 - 20 fold more tags than evaluated here, a number well within the practical realm achievable with a small number of automated sequencers. The analysis of global expression patterns in higher eukaryotes is expected, in general, to be similar to those reported here for S. cerevisiae. However, the analysis of the transcriptome in different cells and from different individuals should yield a wealth of information regarding gene function in normal, developmental, and disease states.
Experimental Procedures Yeast cell culture
The source of transcripts for all experiments was S. cerevisiae strain YPH499 (MATa ura3-52 lys2-801 ade2-101 Ieu2-Δl his3-Δ200 trpl-Δ63) (Sikorski and Hieter, 1989). Logarithmically growing cells were obtained by growing yeast cells to early log phase (3 x 106 cells/ml) in YPD (Rose et al, 1990) rich medium (YPD supplemented with 6 mM uracil, 4.8 mM adenine and 24 mM tryptophan) at 30°C. For arrest in the Gl/S phase of the cell cycle, hydroxyurea (0.1 M) was added to early log phase cells, and the culture was incubated an additional 3.5 hours at 30°C. For arrest in the G2/M phase of the cell cycle, nocodazole (15 μg/ml) was added to early log phase cells and the culture was incubated for an additional 100 minutes at 30°C. Harvested cells were washed once with water prior to freezing at -70 °C. The growth states of the harvested cells were confirmed by microscopic and flow cytometric analyses (Basrai et al, 1996). SAGE protocol
The SAGE method was performed as previously described (Velculescu et al. , 1995; K-inzler et al, U.S. Patents 5,866,330 and 5,695,937), with exceptions noted below. PolyA RNA was converted to double-stranded cDNA with a BRL synthesis kit using the manufacturer's protocol except for the inclusion of primer biotin-5'-T18-
3'. The cDNA was cleaved with Nlalll (Anchoring Enzyme). As Nlalll sites were observed to occur once every 309 base pairs in three arbitrarily chosen yeast chromosomes (1, 5, 10), 95% of yeast transcripts were predicted to be detectable with a Nlalll-based SAGE approach. After capture of the 3' cDNA fragments on streptavidin coated magnetic beads (Dynal), the bound cDNA was divided into two pools, and one of the following linkers containing recognition sites for BsmFI was l i g at e d to each poo l : L i nker 1 , 5 ' -
TTTGGATTTGCTGGTGCAGTACAACTAGGCTTAATAGGGACATG-3' (SEQ ID NO: 1).5'-TCCCTATTAAGCCTAGTTGTACTGCACCAGCAAATCC [amino mod. C7]-3'(SEQ ID NO:2).; Linker 2,5'-
TTTCTGCTCGAATTCAAGCTTCTAACGATGTACGGGGACATG-3' (SEQ ID NO:3) , 5'-TCCCCGTACATCG AGAAGCTTGAATTCGAGCAG[amino mod. C7]-3' (SEQ ID NO:4).
As BsmFI (Tagging Enzyme) cleaves 14 bp away from its recognition site, and the Nlalll site overlaps the BsmFI site by 1 bp, a 15 bp SAGE tag was released with
BsmFI. SAGE tag overhangs were filled-in with lenow, and tags from the two pools were combined and ligated to each other. The ligation product was diluted and then amplified with PCR for 28 cycles with 5,-GGATTTGCTGGTGCAGTACA-3* (SEQ ID NO:5) and 5'-CTGCTCGAATTCAAGCTTCT-3' (SEQ ID NO:6), as primers. The PCR product was analyzed by polyacrylamide gel electrophoresis (PAGE), and the
PCR product containing two tags ligated tail to tail (ditag) was excised. The PCR product was then cleaved with Nlalll, and the band containing the ditags was excised and self-ligated. After ligation, the concatenated products were separated by PAGE and products between 500 bp and 2 kb were excised. These products were cloned into the Sphl site of pZero (Invitrogen). Colonies were screened for inserts by PCR with
Ml 3 forward and M13 reverse sequences located outside the cloning site as primers. PCR products from selected clones were sequenced with the TaqFS DyePrimer kits (Perkin Elmer) and analyzed using a 377 ABI automated sequencer (Perkin
Elmer), following the manufacturer's protocol. Each successful sequencing reaction identified an average of 26 tags; given a 90% sequencing reaction success rate, this corresponded to an average of about 850 tags per sequencing gel.
SAGE data analysis
Sequence files were analyzed by means of the SAGE program group (Velculescu etal, 1995), which identifies the anchoring enzyme site with the proper spacing and extracts the two intervening tags and records them in a database. The
68,691 tags obtained contained 62,965 tags from unique ditags and 5,726 tags from repeated ditags. The latter were counted only once to eliminate potential PCR bias of the quantitation, as described (Velculescu et al, 1995). Of 62,965 tags, 2,332 tags corresponded to linker sequences, and were excluded from further analysis. Of the remaining tags, 4,342 tags could not be assigned, and were likely due to sequencing errors (in the tags or in the yeast genomic sequence). If all of these were due to tag sequencing errors, this corresponds to a sequencing error rate of about 0.7% per base pair (for a lObp tag), not far from what we would have expected under our automated sequencing conditions. However, some unassigned tags had a much higher than expected frequency of A's as the last five base pairs of the tag (5 of the 52 most abundant unassigned tags), suggesting that these tags were derived from transcripts containing anchoring enzyme sites within several base pairs from their polyA tails. Given the frequency of Nlalll sites in the genome (one in 309 base pairs), approximately 3% of transcripts were predicted to contain Nlalll sites within 10 bp of their polyA tails.
As very sparse data are available for yeast mRNA sequences and efforts to date have not been able to identify a highly conserved polyadenylation signal (Irniger and Braus, 1994; Zaret and Sherman, 1982), we used 14 bp of SAGE tags (i.e. the Nlalll site plus the adjacent 10 bp) to search the yeast genome directly (yeast genome sequence obtained from the Stanford yeast genome ftp site (genome-ftp.stanford.edu) on August 7, 1996). Because only coding regions are annotated in the yeast genome, and SAGE tags can be derived from 3' untranslated regions of genes, a SAGE tag was considered to correspond to a particular gene if it matched the ORF or the region 500 bp 3' of the ORF (locus names, gene names and ORF chromosomal coordinates were obtained from Stanford yeast genome ftp site, and ORF descriptions were obtained from MIPS www site (http://www.mips.biochem.mpg.de/) on August 14, 1996).
ORFs were considered genes with known functions if they were associated with a three letter gene name, while ORFs without such designations were considered uncharacterized.
As expected, SAGE tags matched transcribed portions of the genome in a highly non-random fashion, with 88% matching ORFs or their adjacent 3' regions in the correct orientation (chi-squared P value <10'30). In instances when more than one tag matched a particular ORF in the correct orientation, the abundance was calculated to be the sum of the matched tags. Tags that matched ORFs in the incorrect orientation were not used in abundance calculations. In instances when a tag matched more than one region of the genome (for example an ORF and non-ORF region) only the matched ORF was considered. In some cases the 15th base of the tag could also be used to resolve ambiguities.
For the identification of NORF genes, only tags were considered that matched portions of the genome that were further than 500 bp 3' of a previously identified ORF and were observed at least two times in the SAGE libraries.
Table 1. Highly expressed genes
C5 -M -M
Tag Seq. ID No. Gene Locus Copies/cell Description
© GGTGTTAACG 7 TDH--TDH3 YJR00ΘC/YGR192C 425 ςlycar-itJθhy β-3-phosp atθ dehydrogenase 2 & 3 o f-Λ AGACAAACTG 8 TEF1/TEF2 YPR080W YBR118W 248 cytoβc-lc elongation factor βEF-1 aip a-A chain
TACCACTCCT 9 EN02 YHR17 W 229 2-phθ6-phoglyceratβ dβtiydratase
H GGTTTCGGTT 10 PLA1. A2. A3, 10E YOLC-31C/YOL039W/YDL130W/YLR----OW 207 acidic riboβomal protein a1 / P2 beta / L44pπme / L10
TTGCCAGTCT 11 PDC1 YIR044C 207 pyruvatθ decarbaxylase Isozyme 1 α. GGTGAAAACG 12 ADH1. ADH2 Y0L086C/YMR303C 182 alcohol dehy rooenasβ I / II
ATCGCCGCTC 13 GP 1 YKL152C 168 phosphoglycθratθ mutase
GGTGCTAAGA 14 FBA1 YKLOβOC 166 fructo6-»4)ispho-sphate aJdc-ase II
TTAGTTTCTA 15 RPL47A YDL184C 143 ribosomal protein
TCTCTACTGG 16 PGK1 VCR012W 139 phos hoglycerate kinase
GGTTrrGGTT 17 RPLA4 YDR382 138 addle ribo-somal protein L45
GGTCCAGCTT 18 SSM1A/ SSM1B VPL220W-ΥGL135W 128 ribosomal protein
AATCCAGTTG 19 RPL5A / RPL5B Y1L018W-ΥFR031AC 102 ribosomal protein
TTCGTTCACT 20 N0RF1 94 nonar-notated ORF
AACAGACCAG 21 RPL16A / RPL16B YPR102C -ΥGR085C 83 ribosomal protein
CTGCTCTGGG 22 CUP1A CUP1B YHR053C/YHR055C 75 m-staUothio-nein
GC-AATACTAC 23 VOR2 3W -ΥMR230W 73 riboGomal protein S101 similarly to ribosomal protein S10
GCTCTCCCCC 24 N0RF2 73 non-annotated ORF
AAAGACAGAG 25 RPS31A YGR027C 72 ribosomal protein
TGTCGTGGTG 26 PU2A/ RPL2B YBR031W/YDR012W 70 ribosomal protein
CCAAGGGTAT 27 RP-S28A YGR118W 69 ribosomal protein
TCTCCAGAAG 28 RPL35B YDR500C 69 ribo-somal protein
GI Π I ICTTT 29 PYK1 YA1-038W 69 pyruvatθ kinase
ATCACTGGTG 30 RPL9A / RPL9B YG---147C /YNL0β7W 68 ribαβomal protein 1-9
ATGAAGGTTC 31 RPL27A YHR010W 68 ribo-somal protein 1-27
GTAGAGCCGG 32 RPS21 Y0L0 0C 67 ribosomal protein
GGTACTGATG 33 RPL43A YDL075W 67 ribosomal protein 1-31
CCAGAT7TGT 34 NAB1A/ NAB1B YGR214W/YLR0---8W 67 40S ribosomal protein p40 homoiog A
GTGCCGTCCA 35 URP1A YBR1B1 62 riboβomal protein 1-21
CAAMCCCAA 36 RPS18EB YM 026C 60 ribo-somal protein S18
-M r~- r~~
© o
Table 2
SAGE Tag Seq. ID No.
© Locus Copies Cell N> . \ t Description
TTGAACTACC 37 YKL056C 58 strong similarity to human IgE-dependent histamine-releasing factor (21 K tumor protein)
TTCGGGTCAC 38 YDR276C 56 strong similarity to Hordeum vulgare blt101 protein
H
CCAGATATGA 39 YIL093C 41 hypothetical protein α. TTTAAAATGG 40 YMR116C 38 similarity to Ν.crassa CPC2 protein
GGTGTCGTTG 41 YBR078W 34 strong similarity to sporulation specific Sps2p
TACTCTTCGC 42 YEL033W 33 hypothetical protein
TGTAATTAAA 43 YOR182C 26 homology to human ub-quitin-like protein/ribosomal protein S30
GGAGATCTTG 44 YCR013C 24 weak similarity to M.lepra B1496_F1_41 protein
TCAAGAAGTT 45 YER056AC 20 strong similarity to ribosomal protein L34
AAAAACTTTG 46 YIL051C 18 strong similarity to YER057c
AAGTTGAACA 47 YPR043W 17 ribosomal protein L37
GGGTGCGGGT 48 YDR032C 16 strong similarity to YCR004c and S.pombe obrl
TGACTCTTTG 49 YLR390W 14 hypothetical protein
GGTCAATGGC 50 YJR105W 11 hypothetical protein
TAAGAATTCT 51 YJL158C 11 member of the Pir1p/Hsp150p/Pir3p family
TCAATTATGT 52 YDR033W 11 strong similarity to putative heat shock protein YR02
ACGGCCAAGA 53 YBR162C 10 similarity to YJL171p
TTGGGCTAGT 54 YJL171C 10 similarity to YBR162c
CCTTCCAGGT 55 YJR085C 10 hypothetical protein
CCTCTCTTGT 56 YOR310C 10 homology to SIK1 protein
CCCAAAACTT 57 YEL018W 9 weak similarity to Rad50p
AACAAGTACT 58 YGL037C 9 similarity to E.coli hypothetical 23K protein
AACAATAAAA 59 YER072W 8 similarity to YFL004w
CAAAAGACCG 60 Y L056C 8 homology to human IMP dehydrogenase I
GG 1 I I I I GAT 61 YOR182C 8 homology to human ubiqu-tin-like protein/ribosomal protein S30
CAATCCATTT 62 YBR106W 8 hypothetical protein
1 1 1 1 GGGTCT 63 YMR318C 8 putative alcoho Jehydrogenase
AACTGTCCAT 64 YDR429C 8 similarity to nuclear RΝA binding proteins
CCAAGGTTAA 65 YAR002AC 8 strong similarity to YGL002w
GGTTTTTGAA 66 YOR273C 8 putative resistance protein r~~
© ©
O
Table 3. NORF genes
SAGE Tag Seq. ID No. Locus Copies/Cell Chr Tag Pos ORF Size (bp)
TTCGTTCACT 67 NORF1 94 4 1489450 198
GCTCTCCCCC 68 NORF2 73 16 75633 243
H U TGTACGCATT 69 NORF3 16 15 301251 189 α.
1 1 1 lATTATC 70 NORF4 15 6 223182 177
CTTCTC I I M 71 NORF5 12 13 158973 204
TTTCCTATAA 72 NORF6 11 13 511754 252
TCTAGTCGCC 73 NORF7 10 12 669659 192
ATCGI 1 1 IAT 74 NORF8 8 15 877140 174
GGCCAATGGT 75 NORF9 8 4 1202289 267
ACCCTGTCAT 76 NORF10 7 2 418633 255
AAAAGATCAT 77 NORF11 7 4 1489453 87
CAGAAAATGG 78 NORF12 6 8 115655 279
TGACATTCTT 79 NORF13 6 16 883669 183
TAGACATCTA 80 NORF14 6 2 491117 141
TGCCCTGGCC 81 NORF15 5 5 166452 216 ro
GGI 1 1 I GGCG 82 NORF16 4 3 24169 291
CCATACAGGT 83 NORF17 4 12 673851 114
CCAAATCAAA 84 NORF18 3 4 229494 258
AAGCGGTACT 85 NORF19 3 9 47889 399
AACGC I 1 1 IC 86 NORF20 3 2 351456 198
GAGGATAGAG 87 NORF21 3 2 356201 240
CAATGAACCG 88 NORF22 3 16 75541 243
TCTTTATATA 89 NORF23 3 1 73363 90
CGCCTCCAGT 90 NORF24 3 7 485774 108
TACGTAAGTT 91 NORF25 3 10 156139 81
GATTTAAACT 92 NORF26 3 15 254749 93
GCGCCTCCAA 93 NORF27 2 5 42622 222
CAATGGCCCA I 94 NORF28 2 13 511751 78
TTGAGGAACG 95 NORF29 2 3 154681 264 l-N r~- GCTAAGAACC 96 NORF30 2 4 302607 204 r~-
© ©
Additional NORFs
SAGE Tag Seq. ID No. Chr Tag Pos Copies/cell
GGCGCAATTT 97 4 1108395 2
TAAGTGATGA 98 7 593382 2
TTGTTGAATT 99 10 608373 2
GAAGCAGTAA 100 3 155607 2
ACATATGTTA 101 4 916112 2
CCCTACACGG 102 6 223289 2
GTAATTGGAC 103 10 392099 2
ATCAGACAAA 104 14 687272 2
TTATGAAAGA 105 15 81263 2
ATTCGTTCTA 106 15 841970 2
AGCAGGAGTT 107 16 188350 2
TTCTATTAGG 108 2 418749 2
TGGATTTCAG 109 4 1224930 2
CAGATATAAT 110 5 52488 2
CTG I 1 1 I GGG 111 11 374761 2
CAI 1 1 1 I AGT 112 11 508212 2
TTGAAAAGAT 113 13 104160 2
TAAGCCCATC 114 13 251273 2
AGCGTCCTCA 115 15 832420 2
TTTAGTTAAT 116 2 477623 2
ATGGTAGCCA 117 3 56961 2
AATTAGACTA 118 3 162589 2
AGTGACTCTT 119 4 1490879 2
GGACTATAAG 120 5 251266 2
ACI 1 1 1 I CAG 121 10 159213 2
GTCATATAGT 122 13 158765 2
CAACAAAGTG 123 13 171166 2
GTGGGAAAGG 124 13 804600 2
TACTTTATAT 125 16 366449 2
AATACCAGCG 126 3 175540
GCCTTGTATA 127 4 372624
GGTACATTCA 128 5 67152
GATTTCTCTG 129 5 187462
TAGTTGCTCC 130 7 317108
GTAAGAAATC 131 7 836202
CTTGGGCTAT 132 8 107992
AAATGGTGAT 133 11 558686
ATCATTTGGG 134 12 199358
CTGAACTTTA 135 12 283720
CCAGAAGGAG 136 13 652873
CCGGTTACTA 137 15 803663 ■j
CGATGAGAAG 138 15 1004369
AAACCGTCCC 139 16 199141
TCATTCATAC 140 2 164728
TATC I 1 1 1 I G 141 4 169784
TTAGAATAAT 142 4 603508
GTACGCTGTG 143 5 118089
TATATTAATT 144 6 64228 GTTCTTGCCT 145 7 939579
ATATAGCTGC 146 10 181144
CCAAAAAAAA 147 11 91785
GAACTCCACA 148 11 94125
CCTTCACTGC 149 11 374172
CACATCATAA 150 11 625896
GAAGTATTGA 151 12 603999
TGCGCGTATA - 152 13 206410
GGGTAGTACT 153 13 671730
TAGTTTTGTC 154 15 33475
CAATTCCTAC 155 1 172182 0.8
TTTGATTTGA 156 2 46431 0.8
GGCTCTGGTT 157 2 414510 0.8
CAGAAATAGC 158 2 565130 0.8
CTGTTAI 1 IT 159 2 616054 0.8
CGAAGTCAAA 160 2 680605 0.8
CTCTAGATAA 161 3 171584 0.8
AGTCAAAATG 162 4 192750 0.8
GCGAGTTTAG 163 4 691301 0.8
GCTCCAATAG 164 4 1131020 0.8
TTTATTTGAG 165 4 1237501 0.8
GTTATATTGA 166 4 1401803 0.8
TGGGTTGAAG 167 5 251266 0.8
Al 1 1 I ATTTG 168 5 447729 0.8
ATCATAAAAA 169 5 548612 0.8
TTATATAAAA 170 6 223182 0.8
CTACTTCTGC 171 8 34653 0.8
ATAAGACAGT 172 10 227802 0.8
TTCATAAGTT 173 10 471894 0.8
TAAATCTGAG 174 11 145617 0.8
CTGGTAGAAA 175 11 151174 0.8
CACGTACACA 176 11 403208 0.8
CCAAGATCAA 177 11 425882 0.8
AGCTTGTTCC 178 12 234966 0.8
CACATTCGTT 179 12 759953 0.8
CTTACATATA 180 12 789781 0.8
TCTATAGCAA 181 13 228936 0.8
CCTTTCTGAA 182 13 297985 0.8
CCTTTAGAAT 183 13 777999 0.8
AATTAACACC 184 13 842122 0.8
GCGCAGGGGC 185 14 440984 0.8
TGTTTATAAA 186 14 661710 0.8
AAAAGTCATT 187 15 32081 0.8
TTCGTAAACT 188 15 680625 0.8.
1 1 1 1 1 GGAGT 189 15 888343 0.8
AGGCATCTTG 190 16 250284 0.8
AAATCAAAAC 191 16 453890 0.8
AATTGACGAA 192 16 560169 0.8
TTGATGATTT 193 16 582360 0.8
CCTG M I N G 194 16 643476 0.8
1 1 1 1 I AAAAA 195 1 101436 0.5 AAGTTTGATC 196 1 199848 0.5
AGCACCTATG 197 2 46913 0.5
TGATTTATCC 198 2 418946 0.5
ACTGCATCTG 199 2 680860 0.5
CAAGTTAGGA 200 2 744770 0.5
ATACCCAATT 201 3 29939 0.5
AACTTTGTAT 202 3 30056 0.5
GCGGCGGGTG 203 3 41645 0.5
AAAATTGTTC 204 3 57108 0.5
TCAAGTACTC 205 3 157855 . 0.5
AACTGTATGC 206 3 223882 0.5
CTATCGGCCA 207 3 278840 0.5
ACAAGCCCAA 208 3 289917 0.5
GTACAGGGCT 209 4 93873 0.5
AAGATCATCG 210 4 254851 0.5
GAACTCCTGG 211 4 340891 0.5
GAACGAGAAG 212 4 371850 0.5
1 1 1 1 l AATAC 213 4 372058 0.5
TCTCCAGTTG 214 4 381712 0.5
AATACGTTAC 215 4 471791 0.5
ACGATTGGCT 216 4 509158 0.5
TGTTTATAAG 217 4 521709 0.5
CGTTTTCGTC 218 4 538839 0.5
TCGAACCTCT 219 4 578702 0.5
TCCACACACA 220 4 930972 0.5
CCGTGCGTGC 221 4 1324367 0.5
TTTCTTCAAC 222 5 116099 0.5
CCAAGTCTCG 223 5 159320 0.5
AGAGCGAATT 224 5 207517 0.5
TGTAGATTAT 225 5 280465 0.5
AAAAGTAGTT 226 5 286387 0.5
ACTTGGTATG 227 5 422942 0.5
TTAATGTTAT 228 5 544523 0.5
TACACGCGCG 229 5 544555 0.5
GGTCACTCCT 230 6 62983 0.5
AAGTGATGAA 231 6 76141 0.5
TTTATCTTGT 232 6 130327 0.5
AGTGATTGTT 233 6 256223 0.5
GCTTTGTTGT 234 7 72577 0.5
TCATTGATTC 235 7 110590 0.5
TTCACCGGAA 236 7 323655 0.5
ACTATTCTGT 237 7 423957 0.5
GGGCCAACCC 238 7 433787 0.5
AAAATATCTT 239 7 559397 0.5
TAGTAGTAAC 240 7 622201 0.5
AAGCGCACAA 241 7 735909 0.5
TCGCTG 1 1 1 1 242 7 800300 0.5
TGTA I 1 1 1 I G 243 7 836202 0.5
CTAAACAAAG 244 7 836587 0.5
TAGGAAGAAA 245 7 905046 0.5
GGAAAAATTA 246 7 958839 0-5 TTTGGATAGT 247 7 974754 0.5
CGTTTGTGTA 248 8 202655 0.5
AGAAAAAAAC 249 8 386651 0.5
TAAAGTCCAG 250 8 518998 0.5
TAAGCAGATT 251 8 529129 0.5
ATGAGCATTT 252 9 97114 0.5
AGGTGCAAAA 253 9 229077 0.5
TAACAAAGAG" 254 10 628227 0.5
CAATTGGCAA 255 10 721781 0.5
ACTCCCTGTA 256 11 93528 0.5
CTCTATTGAT 257 11 144281 0.5
GCTTTCCTTT 258 11 146665 0.5
ACCGCAAAGA 259 11 231872 0.5
CTTGTTCAAA 260 12 230972 0.5
AATGTGCTGT 261 12 320426 0.5
GCAGATAGCG 262 12 341324 0.5
TCTGACTTAG 263 12 368780 0.5
CCCGGATGTT - 264 12 433912 0.5
GTAACGATTG 265 12 449917 0.5
GAATAACGAA 266 12 673851 0.5
ACTGCTATTT 267 12 712476 0.5
GTTCTCTAGC 268 12 712712 0.5
CATCACCATC 269 12 794710 0.5
TTGCACTTCT 270 12 806833 0.5
ACTGTTTATG 271 12 867350 0.5
TTGCTATATA 272 12 1017911 0.5
TACATTCTAA 273 13 95707 0.5
CTCTTAGTTG 274 13 158970 0.5
ACGAACACTT 275 13 278341 0.5
TGCGCAAGTC 276 13 283795 0.5
1 1 1 1 1 CTTAA 277 13 363037 0.5
CAAATGCATT 278 13 390802 0.5
CAAATTGTGT 279 13 395599 0.5
GCAATACTAT 280 13 826521 0.5
AGTGACGATG 281 14 60143 0.5
TACTGGTTTA 282 14 118854 0.5
GTTTGACCTA 283 14 335512 0.5
AGCGTTTGAT 284 14 478481 0.5
CTCTGTTGCG 285 14 728251 0.5
AAATTCAAAA 286 15 35952 0.5
TTTGCTTGGT 287 15 242742 0.5
AG 1 1 1 1 CCTG 288 15 304813 0.5
TTTAAAGATA 289 15 331453 0.5
AAGGAGACAC 290 15 448624 0.5.
CTATATATCA 291 15 544530 0.5
GATGGAATAG 292 15 571210 0.5
TCGAGTCGAA 293 15 758202 0.5
AAAAAAGAAA 294 15 882567 0.5
TTTCCAGAAT 295 15 969884 0.5
TGGACAATGT 296 15 970607 0 5
GGAATTAAGA 297 15 979894 0-5 ACTATATGTT 298 16 582230 0.5 GATATATCAT 299 16 589647 0.5 AGAATTGATT 300 16 744406 0.5 CACTGTCTCC 301 16 824649 0.5
Table 5. Gene expression changes in different cell cycle phases
Figure imgf000031_0001
Table 5, cont.
Figure imgf000032_0001
Table 5, cont.
Figure imgf000033_0001
Table 5, cont.
Figure imgf000034_0001
Table 5, cont.
Figure imgf000035_0001
Table 5, cont.
Figure imgf000036_0001
Table 5, cont.
Figure imgf000037_0001
Table 5, cont.
Or
Figure imgf000038_0001
Table 5, cont.
Figure imgf000039_0001
Table 5, cont.
oe
Figure imgf000040_0001
Table 5, cont.
Figure imgf000041_0001
Table 5, cont.
Figure imgf000042_0001
Table 5, cont.
Figure imgf000043_0001
Table 5, cont.
Figure imgf000044_0002
Figure imgf000044_0001
Table 5, cont.
Figure imgf000045_0001
Table 5, cont.
Figure imgf000046_0001
Table 5, cont.
Figure imgf000047_0001
Table 5, cont.
Figure imgf000048_0001
Table 5, cont.
4- -4
Figure imgf000049_0001
Table 5, cont.
Figure imgf000050_0001
Table 5, cont.
Figure imgf000051_0001
Table 5, cont
Figure imgf000052_0001
Table 5, cont.
Figure imgf000053_0001
Table 5, cont.
Figure imgf000054_0001
Table 5, cont
Figure imgf000055_0001
Table 5, cont.
Figure imgf000056_0001
Table 5, cont.
Figure imgf000057_0001
Table 5, cont.
Or
Figure imgf000058_0001
Table 5, cont.
Figure imgf000059_0001
Table 5, cont.
Figure imgf000060_0001
Table 5, cont.
Figure imgf000061_0001
Table 5, cont.
Figure imgf000062_0001
Table 5, cont
Figure imgf000063_0001
Table 5, cont.
Or trJ
Figure imgf000064_0001
Table 5, cont.
Figure imgf000065_0001
Table 5, cont.
Figure imgf000066_0001
Table 5, cont.
Figure imgf000067_0001
Table 5, cont.
Or Or
Figure imgf000068_0001
Table 5, cont.
Figure imgf000069_0001
Table 5, cont.
Or oe
Figure imgf000070_0001
Table 5, cont.
Or 0
Figure imgf000071_0001
Table 5, cont.
-4
©
Figure imgf000072_0001
Table 5, cont.
Figure imgf000073_0001
Table 5, cont.
-4
Figure imgf000074_0001
Table 5, cont.
Figure imgf000075_0001
Table 5, cont.
Figure imgf000076_0001
Table 5, cont.
Figure imgf000077_0001
Table 5, cont.
-4
Or
Figure imgf000078_0001
Table 5, cont.
-4 -4
Figure imgf000079_0001
Table 5, cont.
-4 oe
Figure imgf000080_0001
Table 5, cont.
Figure imgf000081_0001
Table 5, cont.
oe ©
Figure imgf000082_0001
Table 5, cont.
Figure imgf000083_0001
Table 5, cont.
Figure imgf000084_0001
Table 5, cont.
Figure imgf000085_0001
Table 5, cont.
Figure imgf000086_0001
Table 5, cont.
Figure imgf000087_0001
Table 5, cont.
oe
Or
Figure imgf000088_0001
Table 5, cont.
oe
-4
Figure imgf000089_0001
Table 5, cont.
oe oe
Figure imgf000090_0001
Table 5, cont.
Figure imgf000091_0001
Table 5, cont.
Figure imgf000092_0001
Table 5, cont.
Figure imgf000093_0001
Table 5, cont.
Figure imgf000094_0001
Table 5, cont.
Figure imgf000095_0001
Table 5, cont.
Figure imgf000096_0001
Table 5, cont.
Figure imgf000097_0001
Table 5, cont.
Figure imgf000098_0001
Table 5, cont.
Figure imgf000099_0001
Table 5, cont.
oe
Figure imgf000100_0001
Table 5, cont.
Figure imgf000101_0001
Table 5, cont.
© ©
Figure imgf000102_0001
Table 5, cont.
Figure imgf000103_0001
Table 5, cont.
©
Is)
Figure imgf000104_0001
Table 5, cont.
Figure imgf000105_0001
Table 5, cont.
Figure imgf000106_0001
Table 5, cont.
Figure imgf000107_0001
Table 5, cont.
©
Or
Figure imgf000108_0001
Table 5, cont.
©
~4
Figure imgf000109_0001
Table 5, cont.
© oe
Figure imgf000110_0001
Table 5, cont.
Figure imgf000111_0001
Table 5, cont.
Figure imgf000112_0001
Table 5, cont.
Figure imgf000113_0001
Table 5, cont.
Figure imgf000114_0001
Table 5, cont.
Figure imgf000115_0001
Table 5, cont.
Figure imgf000116_0001
Table 5, cont.
Figure imgf000117_0001
Table 5, cont.
Figure imgf000118_0001
Table 5, cont.
Figure imgf000119_0001
Table 5, cont.
Figure imgf000120_0001
Table 5, cont.
Figure imgf000121_0001
Table 5, cont.
Is)
©
Figure imgf000122_0001
Table 5, cont.
Figure imgf000123_0001
Table 5, cont.
) )
Figure imgf000124_0001
Table 5, cont.
)
UI
Figure imgf000125_0001
Table 5, cont.
)
4-
Figure imgf000126_0001
Table 5, cont.
)
Figure imgf000127_0001
Table 5, cont.
)
Or
Figure imgf000128_0001
Table 5, cont.
Is) -4
Figure imgf000129_0001
Table 5, cont.
Figure imgf000130_0001
Table 5, cont.
Is)
Figure imgf000131_0001
Table 5, cont.
Figure imgf000132_0001
Table 5, cont.
Figure imgf000133_0001
Table 5, cont.
Figure imgf000134_0001
Table 5, cont
Figure imgf000135_0001
Table 5, cont.
I -
Figure imgf000136_0001
Table 5, cont.
Figure imgf000137_0001
Table 5, cont.
Figure imgf000138_0001
Table 5, cont.
Figure imgf000139_0001
Table 5, cont.
UI oe
Figure imgf000140_0001
Table 5, cont.
Figure imgf000141_0001
Table 5, cont.
4-
©
Figure imgf000142_0001
Table 5, cont.
Figure imgf000143_0001
Table 5, cont.
4- )
Figure imgf000144_0001
Table 5, cont.
4- Ul
Figure imgf000145_0001
Table 5, cont.
Figure imgf000146_0001
Table 5, cont.
Figure imgf000147_0001
Table 5, cont.
- \
Figure imgf000148_0001
Table 5, cont.
4- -4
Figure imgf000149_0001
Table 5, cont.
Figure imgf000150_0001
Table 5, cont.
Figure imgf000151_0001
Table 5, cont.
©
Figure imgf000152_0001
Table 5, cont
Figure imgf000153_0001
*0
© © f-Λ
H U α.
1M t~ t~
Figure imgf000154_0002
© o
Figure imgf000154_0001
Table 5, cont.
Figure imgf000155_0001
Table 5, cont
Figure imgf000156_0001
Table 5, cont.
Figure imgf000157_0001
Table 5, cont.
Figure imgf000158_0001
Table 5, cont.
Figure imgf000159_0001
Table 5, cont.
Figure imgf000160_0001
Table 5, cont.
Figure imgf000161_0001
Table 5, cont.
Or ©
Figure imgf000162_0001
Table 5, cont.
Figure imgf000163_0001
Table 5, cont.
Or Is)
Figure imgf000164_0001
Table 5, cont
Or UI
Figure imgf000165_0001
Table 5, cont.
Figure imgf000166_0001
Table 5, cont.
Or
Figure imgf000167_0001
Table 5, cont.
Or Or
Figure imgf000168_0001
Table 5, cont.
Or -4
Figure imgf000169_0001
Table 5, cont.
Os oe
Figure imgf000170_0001
Table 5, cont.
Figure imgf000171_0001
Table 5, cont.
-4
©
Figure imgf000172_0001
Table 5, cont.
Figure imgf000173_0001
Table 5, cont.
Figure imgf000174_0001
Table 5, cont.
Figure imgf000175_0001
Table 5, cont.
Figure imgf000176_0002
Figure imgf000176_0001
Table 5, cont.
-4
Figure imgf000177_0001
Table 5, cont.
-4
Or
Figure imgf000178_0001
Table 5, cont.
-4 -4
Figure imgf000179_0001
Table 5, cont.
-4 oe
Figure imgf000180_0001
Table 5, cont.
-4
Figure imgf000181_0001
Table 5, cont.
oe ©
Figure imgf000182_0001
Table 5, cont.
Figure imgf000183_0001
Table 5, cont.
Figure imgf000184_0001
Table 5, cont.
Figure imgf000185_0001
Table 5, cont.
Figure imgf000186_0001
Table 5, cont.
oe
Figure imgf000187_0001
Table 5, cont.
Figure imgf000188_0001
Table 5, cont.
Figure imgf000189_0001
Table 5, cont.
Figure imgf000190_0001
Table 5, cont.
Figure imgf000191_0001
Table 5, cont.
Figure imgf000192_0001
Table 5, cont.
Figure imgf000193_0001
l-N l-N
*0
© © f-Λ
H U α.
l-N l~- l~-
Figure imgf000194_0002
©
Figure imgf000194_0001
Table 5, cont.
Figure imgf000195_0001
Table 5, cont.
Figure imgf000196_0001
Table 5, cont.
Figure imgf000197_0001
Figure imgf000198_0001
Table 5, cont.
Figure imgf000199_0001
Figure imgf000199_0002
© © f-Λ
H U α.
l-N l~- l~-
©
Figure imgf000200_0001
O μjoo 'g θiqεj.
Table 5, cont.
Figure imgf000201_0001
Table 5, cont.
Is)
© ©
Figure imgf000202_0001
Table 5, cont.
Figure imgf000203_0001
Table 5, cont.
Is)
© Is)
Figure imgf000204_0001
Table 5, cont.
Is)
© UI
Figure imgf000205_0001
Figure imgf000206_0001
Table 5, cont.
Figure imgf000207_0001
Table 5, cont.
Is)
©
Or
Figure imgf000208_0001
Table 5, cont.
Is) © -4
Figure imgf000209_0001
Table 5, cont.
Is)
© 00
Figure imgf000210_0001
Table 5, cont.
Figure imgf000211_0001
Table 5, cont.
Is)
©
Figure imgf000212_0001
Table 5, cont.
Figure imgf000213_0001
Table 5, cont.
Is) Is)
Figure imgf000214_0001
Table 5, cont.
Figure imgf000215_0001
Table 5, cont.
Figure imgf000216_0001
Figure imgf000217_0001
Table 5, cont
Table 5, cont.
Is) -4
Figure imgf000219_0001
Table 5, cont.
Figure imgf000220_0001
Figure imgf000221_0001
Table 5, cont.
Is) Is)
©
Figure imgf000222_0001
Table 5, cont.
Is) Is)
Figure imgf000223_0001
Table 5, cont.
Is) Is) Is)
Figure imgf000224_0001
Table 5, cont.
Is) Is)
UI
Figure imgf000225_0002
Figure imgf000225_0001
Table 5, cont.
Is) Is)
Figure imgf000226_0001
c o o jϋ --α co
I-
Figure imgf000227_0001
O 00/77214
Figure imgf000228_0001
Table 5, cont.
Is) Is) -4
Figure imgf000229_0001
Table 5, cont.
Is) Is) 00
Figure imgf000230_0001
Table 5, cont.
Is) Is)
Figure imgf000231_0001
Table 5, cont.
Is)
UI
©
Figure imgf000232_0001
Table 5, cont.
Is)
UI
Figure imgf000233_0001
Table 5, cont.
Is)
UI Is)
Figure imgf000234_0001
Table 5, cont.
Is)
UI UI
Figure imgf000235_0001
Table 5, cont.
Is)
UI
Figure imgf000236_0001
Table 5, cont.
Figure imgf000237_0001
Table 5, cont.
Figure imgf000238_0001
Table 5, cont.
Is)
UI -4
Figure imgf000239_0001
Table 5, cont.
Figure imgf000240_0001
o o <o
-Q CO
I-
Figure imgf000241_0001
Table 5, cont.
Is)
4-
©
Figure imgf000242_0001
Table 5, cont.
Figure imgf000243_0001
Table 5, cont.
Is)
4- Is)
Figure imgf000244_0001
Table 5, cont.
Is)
4- Ul
Figure imgf000245_0001
Table 5, cont.
Is)
4- 4-
Figure imgf000246_0001
Table 5, cont.
Is)
4-
-n
Figure imgf000247_0001
Figure imgf000248_0001
Table 5, cont.
Is) 4- -4
Figure imgf000249_0001
Table 5, cont.
Is)
4-
00
Figure imgf000250_0001
Table 5, cont.
Is)
4-
Figure imgf000251_0001
Table 5, cont.
Figure imgf000252_0001
Table 5, cont.
Figure imgf000253_0001
Table 5, cont.
Is)
-n
Is)
Figure imgf000254_0001
Table 5, cont.
Is)
-n ui
Figure imgf000255_0001
Table 5, cont.
Figure imgf000256_0001
Table 5, cont.
Is)
-n
Figure imgf000257_0001
Figure imgf000258_0001
Table 5, cont.
Is)
-n
~4
Figure imgf000259_0001
Table 5, cont.
Is)
-n
00
Figure imgf000260_0001
Table 5, cont.
Is)
-n
Figure imgf000261_0001
Table 5, cont.
Is)
Or ©
Figure imgf000262_0001
Table 5, cont.
Is)
Or
Figure imgf000263_0001
Table 5, cont.
Is)
Or Is)
Figure imgf000264_0001
Table 5, cont.
Is)
Or UI
Figure imgf000265_0001
Table 5, cont.
Is)
Or
Figure imgf000266_0001
Table 5, cont.
Is)
Or
-n
Figure imgf000267_0002
Figure imgf000267_0001
Table 5, cont.
Is)
Or Or
Figure imgf000268_0001
Figure imgf000269_0001
Table 5, cont.
Is)
Or 00
Figure imgf000270_0001
Table 5, cont.
Is)
Or 0
Figure imgf000271_0001
Table 5, cont.
Is) ~4
©
Figure imgf000272_0001
Table 5, cont.
Is* ~4
Figure imgf000273_0001
Table 5, cont.
Is) -4 Is)
Figure imgf000274_0001
Table 5, cont.
Is) ~4
UI
Figure imgf000275_0001
© © f-Λ
H U α.
©
Figure imgf000276_0001
O
;uoo 'g Θiqεi
Table 5, cont.
Is) ~4
-n
Figure imgf000277_0001
Table 5, cont.
Is) ~4
Or
Figure imgf000278_0001
Table 5, cont.
Is) ~4 ~4
Figure imgf000279_0001
Table 5, cont.
Is) -4
00
Figure imgf000280_0001
Table 5, cont.
Is) ~4
Figure imgf000281_0001
Table 5, cont.
Is)
00
©
Figure imgf000282_0001
Table 5, cont.
Is)
00
Figure imgf000283_0001
Table 5, cont.
Is)
00 Is)
Figure imgf000284_0001
Table 5, cont.
Is)
00 UI
Figure imgf000285_0001
Table 5, cont.
Is)
00 4-
Figure imgf000286_0001
Table 5, cont.
Is)
00
-n
Figure imgf000287_0001
Table 5, cont.
Is)
00
Or
Figure imgf000288_0001
Table 5, cont.
Is)
00 ~4
Figure imgf000289_0001
Table 5, cont.
Is)
00 00
Figure imgf000290_0001
Table 5, cont.
Is)
00
Figure imgf000291_0001
Table 5, cont.
Figure imgf000292_0001
Table 5, cont.
Figure imgf000293_0001
c o o ω
-Q co
Figure imgf000294_0001
Table 5, cont.
Figure imgf000295_0001
Figure imgf000296_0002
Figure imgf000296_0001
Table 5, cont.
Is*
Figure imgf000297_0001
Table 5, cont.
Is)
Or
Figure imgf000298_0001
Table 5, cont.
Figure imgf000299_0001
Table 5, cont.
Figure imgf000300_0001
Table 5, cont.
Is)
Figure imgf000301_0001
Figure imgf000302_0001
Table 5, cont.
Figure imgf000303_0001
Table 5, cont.
UI
©
Is)
Figure imgf000304_0001
Table 5, cont.
UI
©
UI
Figure imgf000305_0001
Table 5, cont.
UI
©
4-
Figure imgf000306_0001
Table 5, cont.
Figure imgf000307_0001
Table 5, cont.
I
©
Or
Figure imgf000308_0001
Table 5, cont.
UI
©
~4
Figure imgf000309_0001
Table 5, cont.
Figure imgf000310_0001
Table 5, cont.
Figure imgf000311_0001
Table 5, cont.
Figure imgf000312_0001
Table 5, cont.
Figure imgf000313_0001
Table 5. cont.
Figure imgf000314_0001
Table 5, cont.
Figure imgf000315_0001
Table 5, cont.
Figure imgf000316_0001
Table 5, cont.
Figure imgf000317_0001
Table 5, cont.
UI
Or
Figure imgf000318_0001
c o υ to co
I-
Figure imgf000319_0001
Table 5, cont.
Figure imgf000320_0001
Table 5, cont.
Figure imgf000321_0001
Table 5, cont.
u*
Is)
©
Figure imgf000322_0001
Table 5, cont.
Figure imgf000323_0001
H U α.
l-N l~~ l~~
Figure imgf000324_0002
©
Figure imgf000324_0001
Table 5, cont.
UI Is)
UI
Figure imgf000325_0001
Table 5, cont.
Figure imgf000326_0001
Table 5, cont.
UI Is)
-n
Figure imgf000327_0001
Table 5, cont.
I Is)
Or
Figure imgf000328_0001
c o ϋ to
-Q co
I-
Figure imgf000329_0001
Table 5, cont.
Figure imgf000330_0001
Table 5, cont.
Figure imgf000331_0001
Figure imgf000331_0002
Table 5, cont.
Figure imgf000332_0001
Table 5, cont.
Figure imgf000333_0001
Table 5, cont.
Figure imgf000334_0001
Table 5, cont.
Figure imgf000335_0001
Table 5, cont.
Figure imgf000336_0001
Table 5, cont.
UI UI
-n
Figure imgf000337_0001
Table 5, cont.
Figure imgf000338_0001
Table 5, cont.
Figure imgf000339_0001
Table 5, cont.
Figure imgf000340_0001
Table 5, cont.
Figure imgf000341_0001
Table 5, cont.
UI 4-
©
Figure imgf000342_0001
Table 5, cont.
Figure imgf000343_0001
Table 5, cont.
UI 4- Is)
Figure imgf000344_0001
Table 5, cont.
UI 4- Ul
Figure imgf000345_0001
Figure imgf000346_0001
Table 5, cont.
UI 4-
-n
Figure imgf000347_0001
Table 5, cont.
UI 4-
Or
Figure imgf000348_0001
Table 5, cont.
Ul 4- ~4
Figure imgf000349_0001
c o o _ω
-Q o
I-
Figure imgf000350_0001
Table 5, cont.
Figure imgf000351_0001
Figure imgf000352_0001
Table 5, cont.
Figure imgf000353_0001
Table 5, cont.
UI
-n
Is)
Figure imgf000354_0001
Table 5, cont.
Figure imgf000355_0001
Table 5, cont.
Figure imgf000356_0001
Table 5, cont.
Figure imgf000357_0001
Table 5, cont.
Figure imgf000358_0001
Table 5, cont.
UI
-n
~4
Figure imgf000359_0001
Table 5, cont.
UI
-n
00
Figure imgf000360_0001
Table 5, cont.
Figure imgf000361_0001
Table 5, cont.
UI
Or ©
Figure imgf000362_0001
Table 5, cont.
Figure imgf000363_0001
Table 5, cont.
UI Or Is)
Figure imgf000364_0001
Table 5, cont.
Figure imgf000365_0001
Table 5, cont.
Figure imgf000366_0001
Table 5, cont.
UI
Or
-n
Figure imgf000367_0001
Table 5, cont.
UI
Or Or
Figure imgf000368_0001
Table 5, cont.
UI Or ~4
Figure imgf000369_0001
Table 5, cont.
UI
Or 00
Figure imgf000370_0001
Table 5, cont.
Figure imgf000371_0001
c o υ to
--Q c
Figure imgf000372_0001
Table 5, cont.
Figure imgf000373_0001
Table 5, cont.
UI ~4 Is)
Figure imgf000374_0001
Table 5, cont.
Figure imgf000375_0001
Table 5, cont.
Figure imgf000376_0001
Table 5, cont.
UI -4
-n
Figure imgf000377_0001
Table 5, cont.
UI ~4
Or
Figure imgf000378_0001
Table 5, cont.
UI ~4 ~4
Figure imgf000379_0001
Table 5, cont.
Figure imgf000380_0001
Table 5, cont.
Figure imgf000381_0001
o o ω
-Q o
Figure imgf000382_0001
Table 5, cont.
Figure imgf000383_0001
Table 5, cont.
UI 00 Is)
Figure imgf000384_0001
Table 5, cont.
Figure imgf000385_0001
Table 5, cont.
Figure imgf000386_0001
Table 5, cont.
UI
00
-n
Figure imgf000387_0001
Table 6. Analysis of NORFs in intergenic regions
UI
00
Or
Figure imgf000388_0001
Table 6, cont.
00 ~4
Figure imgf000389_0001
Table 6, cont.
UI
00 00
Figure imgf000390_0001
Table 6, cont.
Ul
00 vo
Figure imgf000391_0001
Table 6, cont.
Ul o ©
Figure imgf000392_0001
Table 6, cont.
UI vo
Figure imgf000393_0001
Table 6, cont.
UI vo
Is)
Figure imgf000394_0001
Table 6, cont.
UI vo
UI
Figure imgf000395_0001
Figure imgf000395_0002
Table 6, cont.
UI vo
Figure imgf000396_0002
Figure imgf000396_0001
Table 6, cont.
UI vo -n
Figure imgf000397_0001
Table 6, cont.
UI vo
Or
Figure imgf000398_0001
Table 6, cont.
UI vo
~4
Figure imgf000399_0001
Table 6, cont.
Figure imgf000400_0001
Table 6, cont.
Ul vo vo
Figure imgf000401_0001
Table 6, cont.
© ©
Figure imgf000402_0001
Table 6, cont.
Figure imgf000403_0001
Table 6, cont.
4- © Is)
Figure imgf000404_0001
Table 6, cont.
©
Ul
Figure imgf000405_0001
Table 6, cont.
Figure imgf000406_0001
References
Archer, J. E., Vega, L. R., and Solomon, F. (1995). Rbl2p, a yeast protein that binds to beta-tubulin and participates in microtubule function in vivo. Cell 82, 425- 434. Bajwa, W., Torchia, T. E., and Hopper, J. E. (1988). Yeast regulatory gene
GAL3: carbon regulation; UASGal elements in common with GAL1, GAL2, GAL7, GAL 10, GAL80, and MEL1; encoded protein strikingly similar to yeast and Escherichia coli galactokinases. Mol Cell Biol 8, 3439-3447.
Basrai, M. A., Kingsbury, J., Koshland, D., Spencer, F., and Hieter, P. (1996). Faithful chromosome transmission requires Spt4p, a putative regulator of chromatin structure in Saccharomyces cerevisiae. Mol Cell Biol 16, 2838-2847.
Bishop, J. O., Morton, J. G., Rosbash, M., and Richardson, M. (1974). Three abundance classes in HeLa cell messenger RNA. Nature 250, 199-204.
Burkholder, A. C, and Hartwell, L. H. (1985). The yeast alpha- factor receptor: structural properties deduced from the sequence of the STE2 gene. Nucleic Acids Res
13, 8463-8475.
Chambers, A., Tsang, J. S., Stanway, C, Kingsman, A. J., and Kingsman, S. M. (1989). Transcriptional control of the Saccharomyces cerevisiae PGK gene by RAP1. Mol Cell Biol 9, 5516-5524. Denis, C. L., Ferguson, J., and Young, E. T. (1983). mRNA levels for the fermentative alcohol dehydrogenase of Saccharomyces cerevisiae decrease upon growth on a nonfermentable carbon source. J Biol Chem 258, 1 165-1 171.
Dick, T., Surana, U., and Chia, W. (1996). Molecular and genetic characterization of SLCl, a putative Saccharomyces cerevisiae homolog of the metazoan cytoplasmic dynein light chainl . Mol Gen Genet 251, 38-43.
El-Deiry, W. S., Tokino, T., Velculescu, V. E., Levy, D. B., Parsons, R., Trent, J. M., Lin, D., Mercer, W. E., Kinzler, K. W., and Vogelstein, B. (1993). WAFl, a potential mediator of p53 tumor suppression. Cell 75, 817-825.
Elledge, S. J., and Davis, R. W. (1989). DNA damage induction of ribonucleotide reductase. Mol Cell Biol 9, 4932-4940.
Goffeau, A., Barrell, B.G., Bussey, H., Davis, R.W., Dujon, B., Feldmann, H., Galibert, F., Hoheisel, J.D., Jacq, C, Johnston, M., Louis, E.J., Mewes, H.W., Murakami, Y., Philippsen, P., Tettelin, H., and Oliver, S.G. (1996). Life with 6000 genes. Science 274, 546-567.
Gottschling, D. E., Aparicio, O. M., Billington, B. L., and Zakian, V. A. (1990). Position effect at S. cerevisiae telomeres: reversible repression of Pol II transcription.
Cell 63, 151-162.
Hagen, D. C, McCaffrey, G., and Sprague, G. F., Jr. (1986). Evidence the yeast STE3 gene encodes a receptor for the peptide pheromone a factor: gene sequence and implications for the structure of the presumed receptor. Proc Natl Acad Sci U S A 83, 1418-1422.
Hereford, L. M., and Rosbash, M. (1977). Number and distribution of polyadenylated RNA sequences in yeast. Cell 10, 453-462.
Imiger, S., and Braus, G. H. (1994). Saturation mutagenesis of a polyadenylation signal reveals a hexanucleotide element essential for mRNA 3' end formation in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A 91, 257-261.
Iyer, V., and Struhl, K. (1996). Absolute mRNA levels and transcriptional initiation rates in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A 93, 5208-5212.
Kurjan, J., and Herskowitz, I. (1982). Structure of a yeast pheromone gene (MF alpha): a putative alpha-factor precursor contains four tandem copies of mature alpha- factor. Cell 30, 933-943.
Leeds, P., Peltz, S. W., Jacobson, A., and Culbertson, M. R. (1991). The product of the yeast UPFl gene is required for rapid turnover of mRNAs containing a premature translational termination codon. Genes Dev 5, 230 3-2314.
Lewin, B. (1980). Gene Expression 2, (New York, New York: John Wiley and Sons), pp. 694-727.
McAlister, L., and Holland, M. J. (1982). Targeted deletion of a yeast enolase structural gene. Identification and isolation of yeast enolase isozymes. J Biol Chem 257, 7181-7188.
Michaelis, S., and Herskowitz, I. (1988). The a-factor pheromone of Saccharomyces cerevisiae is essential for mating. Mol Cell Biol 8, 1309-1318.
Mushegian, A. R., and Koonin, E. V. (1996). A minimal gene set for cellular life derived by comparison of complete bacterial genomes. Proc. Natl. Acad. Sci. USA 93, 10268-10273.
Nguyen, C, Rocha, D., Granjeaud, S., Baldit, M., Bernard, K., Naquet, P., and Jordan, B. R. (1995). Differential gene expression in the murine thymus assayed by quantitative hybridization of arrayed cDNA clones. Genomics 29, 207-216.
Nishizawa, M., Araki, R., and Teranishi, Y. (1989). Identification of an upstream activating sequence and an upstream repressible sequence of the pyruvate kinase gene of the yeast Saccharomyces cerevisiae. Mol Cell Biol 9, 442-451.
Renauld, H., Aparicio, O. M., Zierath, P. D., Billington, B. L., Chhablani, S. K., and Gottschling, D. E. (1993). Silent domains are assembled continuously from the telomere and are defined by promoter distance and strength, and by SIR3 dosage. Genes Dev 7, 1 133-1 145.
Rose, M. D., Winston, F., and P. Hieter. (1990). Methods in Yeast Genetics. (Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press), pp. 177. Schena, M., Shalon, D., Davis, R. W., and Brown, P. O. (1995). Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science 270, 467-470.
Schmitt, H. D., Ciriacy, M., and Zimmermann, F. K. (1983). The synthesis of yeast pyruvate decarboxylase is regulated by large variations in the messenger RNA level. Mol Gen Genet 192, 247-252.
Sikorski, R. S., and Hieter, P. (1989). A system of shuttle vectors and yeast host strains designed for efficient manipulation of DNA in Saccharomyces cerevisiae. Genetics 122, 19-27.
Singh, A., Chen, E. Y., Lugovoy, J. M., Chang, C. N., Hitzeman, R. A., and Seeburg, P. H. (1983). Saccharomyces cerevisiae contains two discrete genes coding for the alpha- factor pheromone. Nucleic Acids Res //, 4049-4063.
Smith, M. M., and Murray, K. (1983). Yeast H3 and H4 histone messenger RNAs are transcribed from two non-allelic gene sets. J Mol Biol 169, 641-661.
St John, T. P., and Davis, R. W. (1979). Isolation of galactose-inducible DNA sequences from Saccharomyces cerevisiae by differential plaque filter hybridization.
Cell 16, 443-452. Velculescu, V. E., Zhang, L., Vogelstein, B., and Kinzler, K. W. (1995). Serial analysis of gene expression. Science 270, 484-487.
Zaret, K. S., and Sherman, F. (1982). DNA sequence required for efficient transcription termination in yeast. Cell 28, 563-573.

Claims

I . An isolated DNA molecule comprising a coding sequence of a yeast gene selected from the group of NORF genes comprising a SAGE tag as shown in SEQ ID NOS:67-81 1.
2. The isolated DNA molecule of claim 1 which is involved in cell cycle progression.
3. The isolated DNA molecule of claim 2 wherein expression of the NORF gene varies by at least 10% between any two phases of the cell cycle selected from the group consisting of: log phase, S phase, and G2/M.
4. The isolated DNA molecule of claim 2 wherein expression of the NORF gene varies by at least 25% between any two phases of the cell cycle selected from the group consisting of: log phase, S phase, and G2 M.
5. The isolated DNA molecule of claim 2 wherein expression of the NORF gene varies by at least 50% between any two phases of the cell cycle selected from the group consisting of: log phase, S phase, and G2/M.
6. The isolated DNA molecule of claim 2 wherein expression of the NORF gene varies by at least 100% between any two phases of the cell cycle selected from the group consisting of: log phase, S phase, and G2/M.
7. The isolated DNA molecule of claim 2 wherein expression of the NORF gene varies by a statistically significant difference (greater than 95% confidence level) between any two phases of the cell cycle selected from the group consisting of: log phase, S phase, and G2/M.
8. The isolated DNA molecule of claim 7 wherein the NORF gene is selected from the group consisting of NORF Nfl 1, 2, 4, 5, 6, 17, 25, and 27.
9. The isolated DNA molecule of claim 2 wherein the NORF gene is not expressed in at least one phase of the cell cycle selected from the group consisting of: log phase, S phase, and G2/M.
10. The isolated DNA molecule of claim 1 which is genomic.
I I . The isolated DNA molecule of claim 1 which is cDNA.
12. A method of using NORF genes to affect the cell cycle, comprising the step of: administering to a cell an isolated DNA molecule comprising a coding sequence of a NORF gene whose expression varies by at least 10% between any two phases of the cell cycle selected from the group consisting of log phase, S phase, and G2/M.
13. The method of claim 12 wherein the cell is a yeast cell.
14. The method of claim 12 wherein the cell is a fungal cell.
15. The method of claim 12 wherein the cell is a mammalian cell.
16. The method of claim 12 wherein the NORF gene is selected from the group consisting of NORF Na 1, 2, 4, 5, 6, 17, 25, and 27.
17. A method for screening candidate antifungal drugs, comprising the steps of: contacting a test substance with a yeast cell; monitoring expression of a NORF gene whose expression varies by at least 10% between any two phases of the cell cycle selected from the group consisting of log "phase, S phase, and G2/M, wherein a test substance which modifies the expression of the yeast gene is a candidate antifungal drug.
18. The method of claim 17 wherein the NORF gene is selected from the group consisting of NORF Na 1, 2, 4, 5, 6, 17, 25, and 27.
19. A method for identifying human genes which are involved in cell cycle progression, comprising the steps of: contacting human DNA with a probe which comprises at least 10 contiguous nucleotides of a NORF gene whose expression varies by at least 10% between any two phases of the cell cycle selected from the group consisting of log phase, S phase, and G2 M phase, wherein a human DNA sequence which hybridizes to the probe is identified as a sequence of a candidate human gene which is involved in cell cycle progression.
20. The method of claim 19 wherein the NORF gene is selected from the group consisting of NORF Na 1, 2, 4, 5, 6, 17, 25, and 27.
21. A probe comprising at least 14 contiguous nucleotides of a NORF gene comprising a SAGE tag as shown in SEQ ID NOS:67-81 1.
22. The probe of claim 21 wherein expression of the NORF gene varies by at least 10% between any two phases of a cell cycle selected from the group consisting of: log phase, S phase, and G2 M.
23. The probe of claim 22 wherein expression of the NORF gene varies by at least 25% between any two phases of the cell cycle selected from the group consisting of: log phase, S phase, and G2/M.
24. The probe of claim 22 wherein expression of the NORF gene varies by at least 50% between any two phases of the cell cycle selected from the group consisting of: log phase, S phase, and G2/M.
25. The probe of claim 22 wherein expression of the NORF gene varies by at least 100% between any two phases of the cell cycle selected from the group consisting of: log phase, S phase, and G2/M.
26. The probe of claim 22 wherein the NORF gene is not expressed in at least one phase of the cell cycle selected from the group consisting of: log phase, S phase, and G2/M.
27. The probe of claim 22 wherein expression of the NORF gene varies by a statistically significant difference (greater than 95% confidence level) between any two phases of the cell cycle selected from the group consisting of: log phase, S phase, and G2/M.
28. The probe of claim 22 wherein the gene is selected from the group consisting of NORF Nfl 1, 2, 4, 5, 6, 17, 25, and 27.
29. The method of claim 17 wherein said step of monitoring expression is performed using nucleic acid molecules which are immobilized on a solid support.
30. The method of claim 29 wherein the nucleic acid molecules are in on array.
31. The method of claim 19 wherein a probe which comprises a portion of the NORF gene is in an array on a solid support.
32. An array of probes on a solid support wherein at least one probe comprises at least 14 contiguous nucleotides of a NORF gene comprising a SAGE tag as shown in SEQ ID NOS:67-81 1.
33. The array of claim 32 wherein the at least one NORF gene is involved in cell cycle progression.
34. The array of claim 32 wherein the NORF gene is selected from the group consisting of NORF No. 1, 2, 4, 5, 6, 17, 25, and 27.
35. The array of claim 32 which comprises at least 100 probes of distinct sequence.
36. The array of claim 32 which comprises at least 500 probes of distinct sequence.
37. The array of claim 32 which comprises at least 1,000 probes of distinct sequence.
38. A method of identifying a candidate drug as a member of a class of drugs having a characteristic effect on gene expression in a yeast cell, comprising the steps of: contacting a yeast cell with a candidate drug; and monitoring expression in the yeast cell of at least one NORF gene whose expression is affected by the class of drugs, wherein detection of a difference in expression of the at least one NORF gene in the yeast cell relative to expression in the absence of the candidate drug identifies the candidate drug as a member of the class of drugs.
39. The method of claim 38 wherein the step of monitoring expression is performed using nucleic acid molecules which are immobilized on a solid support.
40. The method of claim 39 wherein the nucleic acid molecules are in an array.
41. The method of claim 38 wherein expression of two or more NORF genes is monitored.
42. The probe of claim 21 which is immobilized on a solid support.
PCT/US2000/016223 1999-06-16 2000-06-14 Characterization of the yeast transcriptome WO2000077214A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
CA002378512A CA2378512A1 (en) 1999-06-16 2000-06-14 Characterization of the yeast transcriptome
EP00939837A EP1190044A2 (en) 1999-06-16 2000-06-14 Characterization of the yeast transcriptome
JP2001503657A JP2003516716A (en) 1999-06-16 2000-06-14 Characterization of the yeast transcriptome
AU54856/00A AU5485600A (en) 1999-06-16 2000-06-14 Characterization of the yeast transcriptome

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US33503299A 1999-06-16 1999-06-16
US09/335,032 1999-06-16

Publications (2)

Publication Number Publication Date
WO2000077214A2 true WO2000077214A2 (en) 2000-12-21
WO2000077214A3 WO2000077214A3 (en) 2001-05-03

Family

ID=23309958

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/016223 WO2000077214A2 (en) 1999-06-16 2000-06-14 Characterization of the yeast transcriptome

Country Status (5)

Country Link
EP (1) EP1190044A2 (en)
JP (1) JP2003516716A (en)
AU (1) AU5485600A (en)
CA (1) CA2378512A1 (en)
WO (1) WO2000077214A2 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002272472A (en) * 2001-03-22 2002-09-24 Japan Science & Technology Corp GENE ENCODING SULFATE ION TRANSPORTER Sultr1;3
FR2837492A1 (en) * 2002-03-21 2003-09-26 Commissariat Energie Atomique New human apical iodide transporter useful for screening of compounds able to modulate apical iodide transport in cells for treatment, prevention or diagnosis of dysfunctional iodide transport
US6852536B2 (en) * 2001-12-18 2005-02-08 Isis Pharmaceuticals, Inc. Antisense modulation of CD36L 1 expression
WO2008047128A2 (en) * 2006-10-18 2008-04-24 Nazneen Rahman Materials and methods for determining susceptibility to cancer
EP1365032B1 (en) * 2002-05-21 2008-11-19 MTM Laboratories AG Marker molecules associated with lung tumors
EP2270233A1 (en) * 2005-09-05 2011-01-05 Bio-Rad Pasteur Use of both RD9 and IS6110 as nucleic acid targets for the diagnosis of tuberculosis, and provision of multiplex-compliant IS6110 and RD9 targets.
WO2011003984A1 (en) 2009-07-08 2011-01-13 Clause Means and methods for detecting and identifying a novel bacterium responsible for phytosanitary disorders in plants (zucchini) and novel resistant plants
US8188255B2 (en) 2006-10-20 2012-05-29 Exiqon A/S Human microRNAs associated with cancer
US8283158B2 (en) 2002-11-25 2012-10-09 The United States Of America, As Represented By The Secretary, Department Of Health And Human Services Method and apparatus for performing multiple simultaneous manipulations of biomolecules in a two dimensional array
WO2019054445A1 (en) * 2017-09-13 2019-03-21 大学共同利用機関法人情報・システム研究機構 Transgenic plant, method for producing transgenic plant, polynucleotide, polynucleotide cluster, vector, and kit

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110042099A (en) 2010-03-24 2019-07-23 菲奥医药公司 RNA in skin and fibrotic conditions is interfered
WO2011119871A1 (en) 2010-03-24 2011-09-29 Rxi Phrmaceuticals Corporation Rna interference in ocular indications

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998032847A2 (en) * 1997-01-23 1998-07-30 The Johns Hopkins University School Of Medicine Characterization of the yeast transcriptome

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998032847A2 (en) * 1997-01-23 1998-07-30 The Johns Hopkins University School Of Medicine Characterization of the yeast transcriptome

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
See also references of EP1190044A2 *
V.E.VCELCULESCU ET AL.: "Characterization of the yeast transcriptome" CELL, vol. 88, 24 January 1997 (1997-01-24), pages 243-251, XP002070903 CELL PRESS,CAMBRIDGE,MA,US; *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002272472A (en) * 2001-03-22 2002-09-24 Japan Science & Technology Corp GENE ENCODING SULFATE ION TRANSPORTER Sultr1;3
US6852536B2 (en) * 2001-12-18 2005-02-08 Isis Pharmaceuticals, Inc. Antisense modulation of CD36L 1 expression
FR2837492A1 (en) * 2002-03-21 2003-09-26 Commissariat Energie Atomique New human apical iodide transporter useful for screening of compounds able to modulate apical iodide transport in cells for treatment, prevention or diagnosis of dysfunctional iodide transport
EP1365032B1 (en) * 2002-05-21 2008-11-19 MTM Laboratories AG Marker molecules associated with lung tumors
US8283158B2 (en) 2002-11-25 2012-10-09 The United States Of America, As Represented By The Secretary, Department Of Health And Human Services Method and apparatus for performing multiple simultaneous manipulations of biomolecules in a two dimensional array
AU2005336287B2 (en) * 2005-09-05 2012-02-02 Bio-Rad Innovations Use of both RD9 and IS6110 as nucleic acid targets for the diagnosis of tuberculosis, and provision of multiplex-compliant IS6110 and RD9 targets
EP2270233A1 (en) * 2005-09-05 2011-01-05 Bio-Rad Pasteur Use of both RD9 and IS6110 as nucleic acid targets for the diagnosis of tuberculosis, and provision of multiplex-compliant IS6110 and RD9 targets.
EP1922415B1 (en) * 2005-09-05 2013-12-25 Bio-Rad Innovations Use of both rd9 and is6110 as nucleic acid targets for the diagnosis of tuberculosis, and provision of multiplex-compliant is6110 and rd9 targets
WO2008047128A3 (en) * 2006-10-18 2008-10-23 Nazneen Rahman Materials and methods for determining susceptibility to cancer
WO2008047128A2 (en) * 2006-10-18 2008-04-24 Nazneen Rahman Materials and methods for determining susceptibility to cancer
US8188255B2 (en) 2006-10-20 2012-05-29 Exiqon A/S Human microRNAs associated with cancer
WO2011003984A1 (en) 2009-07-08 2011-01-13 Clause Means and methods for detecting and identifying a novel bacterium responsible for phytosanitary disorders in plants (zucchini) and novel resistant plants
FR2947838A1 (en) * 2009-07-08 2011-01-14 Clause MEANS AND METHODS FOR DETECTION AND IDENTIFICATION OF A NEW BACTERIUM RESPONSIBLE FOR PHYTOSANITARY DISEASES ON PLANTS (COURGETTES) AND NEW RESISTANT PLANTS
WO2019054445A1 (en) * 2017-09-13 2019-03-21 大学共同利用機関法人情報・システム研究機構 Transgenic plant, method for producing transgenic plant, polynucleotide, polynucleotide cluster, vector, and kit

Also Published As

Publication number Publication date
AU5485600A (en) 2001-01-02
EP1190044A2 (en) 2002-03-27
JP2003516716A (en) 2003-05-20
WO2000077214A3 (en) 2001-05-03
CA2378512A1 (en) 2000-12-21

Similar Documents

Publication Publication Date Title
Velculescu et al. Characterization of the yeast transcriptome
US7504493B2 (en) Characterization of the yeast transcriptome
Brewster et al. An osmosensing signal transduction pathway in yeast
Inada et al. One-step affinity purification of the yeast ribosome and its associated proteins and mRNAs
Sakumoto et al. A series of protein phosphatase gene disruptants in Saccharomyces cerevisiae
Garcia-Barrio et al. GCD10, a translational repressor of GCN4, is the RNA-binding subunit of eukaryotic translation initiation factor-3.
Tolkacheva et al. Cloning of a Cryptococcus neoformans gene, GPA1, encoding a G-protein alpha-subunit homolog
EP1190044A2 (en) Characterization of the yeast transcriptome
Nugent et al. Gene expression during Ustilago maydis diploid filamentous growth: EST library creation and analyses
Niedenthal et al. Systematic analysis of S. cerevisiae chromosome VIII genes
AU749606C (en) Characterization of the yeast transcriptome
Porter et al. U1 small nuclear RNA from Schizosaccharomyces pombe has unique and conserved features and is encoded by an essential single-copy gene
Planta et al. Transcript analysis of 250 novel yeast genes from chromosome XIV
Schlecht et al. Genome-wide expression profiling, in vivo DNA binding analysis, and probabilistic motif prediction reveal novel Abf1 target genes during fermentation, respiration, and sporulation in yeast
Vollmer et al. [15] High expression cloning, purification, and assay of Ypt—GTPase-activating proteins
CA2158749A1 (en) Materials and methods relating to proteins that interact with casein kinase i
JP2003516149A (en) Array of functional genes in yeast
Naitou et al. Expression profiles of transcripts from 126 open reading frames in the entire chromosome VI of Saccharomyces cerevisiae by systematic northern analyses
Kessler et al. The use of direct cDNA selection to rapidly and effectively identify genes in the fungus Aspergillus fumigatus
Rempola et al. Fcf1p and Fcf2p are novel nucleolar Saccharomyces cerevisiae proteins involved in pre-rRNA processing
Gromadka et al. The KRR1 gene encodes a protein required for 18S rRNA synthesis and 40S ribosomal subunit assembly in Saccharomyces cerevisiae.
Boles Yeast as a model system for studying glucose transport
Karkusiewicz et al. Functional and physical interactions of Faf1p, a Saccharomyces cerevisiae nucleolar protein
JP2002525073A (en) C. Albicans-derived essential gene and method for screening antifungal substance using the gene
JP2003512007A (en) Drug targets in Candida albicans

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

WWE Wipo information: entry into national phase

Ref document number: 2378512

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 54856/00

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 2000939837

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2000939837

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWW Wipo information: withdrawn in national office

Ref document number: 2000939837

Country of ref document: EP