EP1592811A2 - Methods for monitoring drug activities in vivo - Google Patents

Methods for monitoring drug activities in vivo

Info

Publication number
EP1592811A2
EP1592811A2 EP04710266A EP04710266A EP1592811A2 EP 1592811 A2 EP1592811 A2 EP 1592811A2 EP 04710266 A EP04710266 A EP 04710266A EP 04710266 A EP04710266 A EP 04710266A EP 1592811 A2 EP1592811 A2 EP 1592811A2
Authority
EP
European Patent Office
Prior art keywords
gene
encodes
cps
locusid
chromosome
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP04710266A
Other languages
German (de)
French (fr)
Inventor
Michael Burczynski
Natalie Twine
Andrew J. Dorner
William L. Trepicchio
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wyeth LLC
Original Assignee
Wyeth LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wyeth LLC filed Critical Wyeth LLC
Publication of EP1592811A2 publication Critical patent/EP1592811A2/en
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/106Pharmacogenomics, i.e. genetic variability in individual responses to drugs and drug metabolism
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers

Definitions

  • this application incorporates by reference all materials recorded in compact discs labeled "Copy 1 - Sequence Listing Part,” “Copy 2 - Sequence Listing Part” and “Copy 3 - Sequence Listing Part,” each of which includes “Sequence Listing.ST25.txt” (3,647 KB, created February 9, 2004). Furthermore, this application incorporates by reference all materials recorded in compact discs labeled “Copy 1 - Tables Part,” “Copy 2 - Tables Part” and “Copy 3 - Tables Part,” each of which includes “Table 1 (the Qualifier Table).txt” (210 KB, created February 9, 2004).
  • This invention relates to methods, systems and equipment useful for monitoring in vivo activities of CCI-779 or other drugs.
  • CCI-779 is an ester analog of the immunosuppressant rapamycin and as such is a potent, selective inhibitor of the mammalian target of rapamycin.
  • the mammalian target of rapamycin mTOR
  • mTOR mammalian target of rapamycin
  • CCI-779 may function as a cytostatic and immunosuppressive agent.
  • CCI-779 has been used as an anticancer drug and is currently being evaluated for indications in clinical trials for treating various oncology and inflammatory diseases. These diseases include, but are not limited to, renal cell carcinoma (RCC), prostate cancer, gliomas, lung cancer, and non-Hodgkin's lymphoma.
  • One of the main objectives of clinical pharmacogenomic studies is to identify suitable markers for monitoring in vivo activities of CCI-779 or other drugs.
  • the present invention employs easily-obtained tissues, such as peripheral blood, as surrogate tissues for the detection of in vivo activities of CCI-779 or other drugs.
  • the present invention provides methods that are useful for detecting in vivo activities of CCI-779 or other drugs.
  • the methods include comparing an expression profile of at least one drug activity gene in a peripheral blood sample of a patient of interest to a reference expression profile of the gene, where the gene is differentially expressed in peripheral blood mononuclear cells (PBMCs) of patients who have a non- blood disease and are subject to a drug therapy as compared to PBMCs isolated from the patients prior to the drug therapy.
  • PBMCs peripheral blood mononuclear cells
  • the patient of interest has the non- blood disease and is being treated by the drug therapy.
  • the drug therapy is an anti-cancer therapy, such as CCI-
  • the non-blood disease is a solid tumor, such as RCC, prostate cancer, or head/neck cancer.
  • the drug activity genes of the present invention are selected from Table 5.
  • the peripheral blood samples used in the present invention can be, without limitation, whole blood samples or samples comprising enriched PBMCs. Other peripheral blood samples that include PBMCs can also be employed in the present invention.
  • the expression profiles of the drug activity genes can be determined by various means, such as quantitative RT-PCR, Northern Blot, in situ hybridization, slot- blotting, nuclease protection assay, nucleic acid arrays, enzyme-linked immunosorbent assays (ELISAs), radioimmunoassay (RIAs), fluorescence-activated cell sorters (FACSs), or Western Blots.
  • the reference expression profile and the expression profile being compared are prepared using the same or comparable methodology.
  • the reference expression profile is an average baseline expression profile of drug activity genes in peripheral blood samples isolated from patient or patients prior to a drug treatment.
  • the reference expression profile is an expression profile of drug activity genes in peripheral blood samples of the patient of interest. The reference expression profile and the expression profile being compared can be obtained from peripheral blood samples isolated at different time points in a drug treatment.
  • the drug activity genes used in the present invention are over-expressed (or under-expressed) in PBMCs of patient who have a non-blood disease as compared to PBMCs of humans who do not have the non-blood disease.
  • the drug therapy being investigated can down-regulate (or up-regulate) the expression of the drug activity genes in PBMCs of patients who have the non-blood disease.
  • the expression of the drug activity genes in PBMCs can be stimulated (or suppressed) by phytohemagglutinin (PHA).
  • PHA phytohemagglutinin
  • the drug therapy being investigated can down-regulate (or up-regulate) the expression of these genes in PHA- treated PBMCs.
  • the present invention provides other methods that are useful for detecting in vivo activities of CCI-779 or other drugs.
  • the methods include comprises comparing an expression profile of at least one drug activity gene in a peripheral blood sample of a patient of interest to a reference expression profile of the gene, where the RNA transcript(s) of the drug activity gene can hybridize under stringent or nucleic acid array hybridization conditions to one or more qualifiers selected from the Qualifier Table.
  • the present invention provides methods useful for identifying drug activity genes. The methods include detecting the gene expression profile in peripheral blood samples of patients who have a non-blood disease and are subject to a drug therapy, and comparing the gene expression profile to a baseline gene expression profile in peripheral blood samples isolated before the drug therapy. Drug activity genes whose expression levels in peripheral blood samples can be modulated by the drug therapy can therefore be identified.
  • kits useful for detecting in vivo activities of CCI-779 or other drugs include a plurality of polynucleotides, and each polynucleotide can hybridize under stringent or nucleic acid array hybridization conditions to an RNA transcript, or the complement thereof, of a different respective drug activity gene.
  • the kits include a plurality of antibodies, and each antibody can bind to a polypeptide encoded by a different respective drug activity gene.
  • the drug activity genes can be selected, without limitation, from Table 5.
  • the present invention provides nucleic acid arrays useful for detecting in vivo activities of CCI-779 or other drugs. A substantial portion of all polypeptide probes on the nucleic acid array can hybridize under stringent or nucleic acid array hybridization conditions to RNA transcripts, or the complements thereof, of drug activity genes.
  • the present invention provides methods useful for the detection of in vivo activities of CCI-779 or other drugs.
  • Numerous drug activity genes can be identified by the present invention.
  • the expression profiles of these genes in PBMCs can be modulated by CCI-779 or other drugs. Accordingly, these genes can be used as surrogate markers for monitoring drug activities in vivo.
  • the methods of the present invention include comparing the expression profile of at least one drug activity gene in a peripheral blood sample of a patient of interest to a reference expression profile of the same drag activity gene.
  • the patient of interest has a non-blood disease, such as RCC, prostate cancer, or another solid tumor, and is being treated by a drug therapy.
  • a change in the peripheral blood expression profile of the drug activity gene is indicative of the in vivo activity of the drug therapy.
  • the reference expression. profile can be determined by using baseline peripheral blood samples isolated from patients prior to the drug therapy.
  • Peripheral blood samples amenable to the present invention include, but are not limited to, whole blood samples or samples comprising enriched PBMCs.
  • Expression profiles of drug activity genes can be detected using a variety of methods, such as quantitative RT-PCT, Northern Blot, in situ hybridization, nucleic acid arrays, enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), FACS (fluorescence-activated cell sorter), or Western Blot.
  • the drug activity genes of the present invention may also be used for assessing the efficiency of a drug therapy.
  • the present invention employs the systematic gene expression analysis technique to identify genes whose expression in peripheral blood can be modulated by a therapeutic agent such as CCI- 779. These genes are herein referred to as “drug activity genes.”
  • CCI-779 activity genes The genes whose expression levels in peripheral blood can be modified by CCI-779 are refereed to as "CCI- 779 activity genes.”
  • Drug activity genes can be identified by comparing peripheral blood gene expression profiles before and after a drug treatment. Numerous methods are available for detecting and comparing gene expression profiles.
  • gene expression profiles can be detected by measuring the levels of RNA transcripts in peripheral blood samples.
  • total RNAs or polyA RNAs are isolated from peripheral blood samples using conventional means.
  • the isolated RNAs can be amplified to produce cDNAs or cRNAs.
  • Peripheral blood gene expression profiles can be determined by measuring the amount of the amplified cDNAs or cRNAs.
  • Peripheral blood gene expression profiles can also be determined by measuring the levels of polypeptides in peripheral blood samples.
  • the amounts of polypeptides in peripheral samples can be detected using various methods, such as ELISAs, RIAs, FACSs, Western Blots or other immunoassays.
  • two-dimensional gel electrophoresis/mass spectrometry or other high-throughput protein sequencing and identification methods can be used.
  • nucleic acid arrays are used for detecting or comparing gene expression profiles in peripheral blood samples isolated at different stages of a therapeutic treatment. Nucleic acid arrays allow for quantitative detection of the expression levels of a large number of genes at one time. Examples of nucleic acid arrays include, but are not limited to, Genechip ® microarrays from Affymetrix (Santa Clara, CA), cDNA microarrays from Agilent Technologies (Palo Alto, CA), and bead arrays described in U.S. Patent Nos. 6,288,220 and 6,391,562.
  • the polynucleotides to be hybridized to nucleic acid arrays can be labeled with one or more labeling moieties to allow for detection of hybridized polynucleotide complexes.
  • the labeling moieties can include compositions that are detectable by spectroscopic, photochemical, biochemical, bioelectronic, immunochemical, electrical, optical or chemical means.
  • Exemplary labeling moieties include radioisotopes, chemiluminescent compounds, labeled binding proteins, heavy metal atoms, spectroscopic markers such as fluorescent markers and dyes, magnetic labels, linked enzymes, mass spectrometry tags, spin labels, electron transfer donors and acceptors, and the like.
  • Unlabeled polynucleotides can also be employed.
  • the polynucleotides can be DNA, RNA, or a modified form thereof.
  • Hybridization reactions can be performed in absolute or differential hybridization formats.
  • absolute hybridization format polynucleotides derived from one sample, such as a peripheral blood sample isolated from a cancer patient at a particular treatment stage, are hybridized to the probes in a nucleic acid array. Signals detected after the formation of hybridization complexes correlate to the polynucleotide levels in the sample.
  • differential hybridization format polynucleotides derived from two biological samples, such as one isolated from a cancer patient at a first stage of treatment and the other isolated from the same patient but at a second stage of treatment, are labeled with different labeling moieties.
  • a mixture of these differently labeled polynucleotides is added to a nucleic acid array.
  • the nucleic acid array is then examined under conditions in which the emissions from the two different labels are individually detectable.
  • the fluorophores Cy3 and Cy5 are used as the labeling moieties for the differential hybridization format.
  • Signals gathered from nucleic acid arrays can be analyzed using commercially available software, such as those provide by Affymetrix or Agilent Technologies. Controls, such as for scan sensitivity, probe labeling and cDNA/cRNA quantitation, can be included in the hybridization experiments.
  • the nucleic acid array expression signals are scaled or normalized before being subject to further analysis.
  • the expression signals for each gene can be normalized to take into account variations in hybridization intensities when more than one array is used under similar test conditions. Signals for individual polynucleotide complex hybridization can also be normalized using the intensities derived from internal normalization controls contained on each array.
  • genes with relatively consistent expression levels across the samples can be used to normalize the expression levels of other genes.
  • the expression levels of the genes are normalized across the samples such that the mean is zero and the standard deviation is one.
  • the expression data detected by nucleic acid arrays are subject to a variation filter which excludes genes showing minimal or insignificant variation across all samples.
  • peripheral blood samples can be used in the present invention.
  • the peripheral blood samples are whole blood samples.
  • the peripheral blood samples comprise enriched PBMCs.
  • enriched it means that the percentage of PBMCs in the sample is higher than that in whole blood.
  • the PBMC percentage in an enriched sample is at least 1, 2, 3, 4, 5 or more times higher than that in whole blood.
  • the PBMC percentage in an enriched sample is at least 90%, 95%, 98%, 99%, 99.5%, or more.
  • Blood samples containing enriched PBMCs can be prepared using any method known in the art, such as Ficoll gradients centrifugation or CPTs (cell purification tubes).
  • Peripheral blood samples used in the present can be isolated at any stage of a drug treatment (including baseline samples isolated before the drug treatment). For instance, the samples can be isolated from patients at 1 day, 2 days, 3 days, 4 days, 5 days, 6 days, 1 week, 2 weeks, 3 weeks, 4 weeks, 8 weeks, or 16 weeks after initiation of a drug treatment. Other time points can also be used for isolating blood samples for monitoring or assessing in vivo drug activities.
  • the patients being treated by a drug therapy of interest have a non-blood disease, such as a solid tumor.
  • Solid tumors amenable to the present invention include, but are not limited, RCC, prostate cancer, head/neck cancer, ovarian cancer, testicular cancer, brain tumor, breast cancer, lung cancer, colon cancer, pancreas cancer, stomach cancer, bladder cancer, skin cancer, cervical cancer, uterine cancer, and liver cancer.
  • the solid tumors have the following characteristics: (1) a mass of hyperproliferating cells of clonal origin, and (2) acquisition of an aggressively invasive phenotype, where cancer cells leave the tissue of origin and establish new tumor metastases at distant sites.
  • the patients have RCC.
  • Any cancer or disease treatment can be evaluated by the present invention.
  • Exemplary cancer treatments include the use of cytokines, such as mterferon or interleukin 2.
  • chemotherapy drugs can be used, either individually or in combination with other drugs, cytokines or therapies.
  • Suitable chemotherapy drags include, but are not limited to, CCI-779, AN-238, vinblastine, floxuridine, 5-fluorouracil, and tamoxifen.
  • AN238 is a cytotoxic agent which has 2-pyrrolinodoxorubicin linked to a somatostatin (SST) carrier octapeptide.
  • SST somatostatin
  • AN238 can be targeted to SST receptors on the surface of RCC tumor cells.
  • monoclonal antibodies, antiangiogenesis drags, and anti-growth factor drugs can be employed to treat cancers.
  • the gene expression profile in peripheral blood samples isolated at one stage of a drag treatment can be compared to that at another stage of the drag treatment.
  • Drug activity genes that are differentially expressed in PBMCs at one stage of the treatment relative to another stage of treatment can therefore be identified.
  • the PBMC expression level of a drag activity gene is substantially higher at one stage than at another stage.
  • an average PBMC expression level of a drag activity gene at one stage can be at least 1.5, 2, 3, 4, 5, 10, 20, or more times of that at another stage.
  • the PBMC expression level of a drag activity gene is substantially lower at one stage than at another stage.
  • an average PBMC expression level of a drag activity gene at one stage can be no greater than 0.67, 0,5, 0.33, 0.25, 0.1, 0.05, or less times of that at another stage.
  • drug activity genes can be identified using clustering algorithms based on the nucleic acid array gene expression data.
  • unsupervised cluster analyses can be used to analyze and categorize genes which have differential expression patterns between different drug treatment stages.
  • Algorithms for unsupervised cluster analysis include, but are not limited to, self-organized maps (SOMs), principle component analysis, average linkage clustering, and hierarchical clustering.
  • Supervised cluster analysis can also be used to organize and identify drag activity genes. Algorithms for supervised cluster analysis include, but are not limited to, nearest neighbors test, support vector machines, and SPLASH. Either two-class or multi- class correlation metrics can be used.
  • HG-U95Av2 gene chips (manufactured by Affymetrix) were used for detecting and comparing the levels of RNA transcripts in PBMC-enriched peripheral blood samples.
  • Peripheral blood samples were isolated from RCC patients at different stages of CCI-779 treatment (See Example 1). The CCI-779 treatment included intravenous administration of 25, 75 or 250 mg of CCI-779 once weekly. Peripheral blood samples were isolated from RCC patients immediately before the initial admi stration of CCI-779, and then 8 weeks and 16 weeks thereafter.
  • cRNA were prepared from the isolated PBMC samples and hybridized to
  • HG-U95Av2 genechips Hybridization signals were collected for each oligonucleotide probe on the genechips. Signals from the oligonucleotide probes of the same qualifier were averaged. Qualifiers that produce different hybridization signals in samples isolated at different treatment time points were identified. Examples of these identified qualifiers are illustrated in Table 1 ("the Qualifier Table").
  • each qualifier in the Qualifier Table corresponds to at least one
  • RNA transcripts of the gene can hybridize under stringent or nucleic acid array hybridization conditions to the qualifier.
  • hybridize to a qualifier means to hybridize to at least one oligonucleotide probe of the qualifier.
  • the RNA transcripts of a CCI-779 activity gene can hybridize under stringent or nucleic acid array hybridization conditions to at least 2, 4, 6, 8, 10, 12, 14 or 16 oligonucleotide probes of the corresponding qualifier.
  • Table 2 lists the expression profiles of some qualifiers that produced different hybridization signals for PBMC samples isolated at different treatment stages.
  • Each expression profile in Table 2 (“Baseline,” “8wks Average,” or “16wks Average”) was an average of PBMC samples of 110 RCC patients.
  • Each expression profile under “Baseline,” “8wks Average,” and “16wks Average” represented the hybridization signals on the respective qualifier for PBMC samples isolated immediately before the initial CCI-779 administration, 8 weeks after the initial administration, and 16 weeks after the initial administration, respectively.
  • the p-value of an ANOVA analysis was provided for each qualifier. The p-value suggests the statistical significance of the difference observed between the expression profiles obtained at different CCI-779 treatment stages. Lesser p- values indicate more statistical significance for the differences observed between different treatment stages.
  • Each qualifier has a corresponsding CCI-779 activity gene probe sequence
  • each CPS can be derived from a corresponding SEQ ID NO depicted in Table 3.
  • each CPS consists of an unambiguous fragment, or the complement thereof, of the corresponding SEQ ID NO.
  • each CPS also comprises at least one oligonucleotide probe of the corresponding qualifier.
  • Each SEQ ID NO in Table 3 is a cDNA or genomic sequence, or the complement thereof, of a CCI-779 activity gene represented by the qualifier that corresponds to the SEQ ID NO. Accordingly, each SEQ ID NO or its corresponding CPS can hybridize 'under stringent or nucleic acid array hybridization conditions to the RNA transcripts, or the complements thereof, of the represented CCI-779 activity gene.
  • Each SEQ ID NO may have an Entrez nucleotide sequence database accession number (see Table 4).
  • the Entrez nucleotide sequence database is maintained by the National Center of Biotechnology Information (NCBI), National Library of Medicine, Washington, DC. The database collects sequences from several sources, including GenBank, RefSeq, and PDB.
  • any ambiguous residue ("n") in a SEQ ID NO can be determined by numerous methods.
  • the ambiguous residues in a SEQ ID NO are determined by aligning the SEQ ID NO to a corresponding genomic sequence obtained from a human genome sequence database.
  • the ambiguous residues in a SEQ ID NO are determined based on the sequence of the corresponding Entrez accession number.
  • the ambiguous residues are determined by re-sequencing the SEQ ID NO.
  • each "n" position in a SEQ ID NO represents at least one nucleotide selected from a, t, g, and c, or contains no nucleotide residue.
  • IL1B Human interleukin 1-beta
  • MKP-2 Human MAP kinase phosphatase
  • CCI-779 activity genes represented by each qualifier in Table 2 can be identified based on the HG-U95Av2 gene chip annotation provided by Affymetrix. Genes thus identified are illustrated in Table 5. CCI-779 activity genes can also be determined based on the corresponding Entrez accession numbers. In addition, CCI-779 activity genes can be determined by BLAST searching the corresponding CPSs, or the unambiguous segments of the corresponding SEQ ID NOs, against a human genome sequence database. Suitable human genome sequence databases for this pmpose include, but are not limited to, the NCBI human genome database. The NCBI also provides BLAST programs, such as "blastn," for searching its sequence databases.
  • the BLAST search of the NCBI human genome database is conducted by using CPSs. Gene(s) that aligns to a given CPS with at least 95% sequence identity can be identified. In many cases, the identified gene(s) has at least 96%, 97%, 98%, 99%, or more sequence identity with the CPS. The results of the BLAST search are detailed below.
  • CPS 1 corresponds to DKFZP564E1962 (TXNDC) which encodes thioredoxin domain-containing.
  • TXNDC DKFZP564E1962
  • This gene has LocusID: 81542, and is located on chromosome 14 with reported cytogenetic location 14q21.3.
  • the gene resides in genomic locus NT_025892 (NCBI Genome Annotation).
  • the gene product is a member of the thioredoxin family.
  • CPS 1 also has 86-90% sequence identity with an intron sequence of GK003 which encodes GK003 protein.
  • GK003 has LocusID: 57002, and is located on chromosome 7 with reported cytogenetic location 7pl5.2.
  • fragments of CPS 1 align with a chromosomal region on chromosome 13 and an intron sequence of OATPRP4 with 87-95% sequence identity.
  • the chromosomal region on chromosome 13 is located near INGl which encodes inhibitor of growth family, member 1, and has LocusID: 3621.
  • OATPRP4 encodes organic anion transporter polypeptide-related protein 4, and has LocusID: 81796 with reported cytogenetic location 8ql3.1.
  • CPS 2 corresponds to ETR101 which encodes immediate early protein. This gene has LocusID: 9592, and is located on chromosome 19 with reported cytogenetic location 19pl3.13. The gene resides in genomic locus NT_031915 (NCBI Genome Annotation). Expression of immediate early protein can be induced by TPA stimulation in promyelocytic leukemia cell line HL-60 and in other leukemia cell lines.
  • a fragment of CPS 2 (nucleotides! 309 to 492 of CPS 2) shows 98% sequence identity with an intron sequence of LOCI 69782.
  • LOCI 69782 encodes a protein similar to transcription factor IIIA (Factor A) (TFIIIA), and has reported cytogenetic location 9p24.1.
  • CPS 3 corresponds to PFN1 which encodes profilin 1. This gene has
  • LocusID 5216, and is located on chromosome 17 with reported cytogenetic location 17pl3.3. The gene resides in genomic locus NT_033299 (NCBI Genome Annotation).
  • Profilin 1 is a ubiquitous actin monomer-binding protein belonging to the profilin family. It is thought to regulate actin polymerization in response to extracellular signals. Deletion of PFN1 gene is associated with Miller-Dieker syndrome.
  • Nucleotides 7 to 406 of CPS 3 align with various regions in the human genome with 86-93% sequence identity. These regions include LOC163511, COAS3, a region near LOG 149010, a region near LOC200030, an intron sequence of DKFZp434D177, FLJ20719, LOC199970, and LOC206456.
  • LOC163511 encodes a protein similar to profilin I, and has reported cytogenetic location lq23.2.
  • COAS3 encodes chromosome 1 amplified sequence 3, and has LocusID: 200025 with reported cytogenetic location lql2.
  • LOC149010 encodes a protein similar to hypothetical protein DKFZp434D177, and has reported cytogenetic location lql2.
  • LOC200030 encodes a protein similar to hypothetical protein DJ328E19.C1.1, and has reported cytogenetic location lql2.
  • DKFZp434D177 encodes hypothetical protein DKFZp434D177, and has LocusID: 84224 with reported cytogenetic location lp36.12.
  • FLJ20719 encodes hypothetical protein FLJ20719, and has LocusID: 55672 with reported cytogenetic location lp31.
  • LOG 199970 has reported cytogenetic location lpl l.l.
  • LOC206456 encodes a protein similar to chain P, structure of bovine beta-actin-profilin complex with actin bound Atp phosphates solvent accessible. LOC206456 is located on chromosome 6.
  • CPS 4 corresponds to APOBEC1L (APOBEC3C) which encodes a protein similar to APOBEC1 protein.
  • APOBEC1L gene has LocusID: 27350, and is located on chromosome 22 with reported cytogenetic location 22ql3.1-ql3.2.
  • APOBEC1L gene product is similar to phorbolin (DJ742C19.2), and may catalyze hydrolytic deamination of cytidine nucleotides.
  • CPS 4 also has 91% sequence identity with LOC200316 which encodes a protein similar to phorbolin 3 (APOBECl-like). LOC200316 has LocusID: 200316 with reported cytogenetic location 22ql3.1. In addition, CPS 4 aligns with two other regions on chromosome 22 with 89-92% sequence identity.
  • CPS 5 corresponds to CKAPl which encodes cytoskeleton-associated protein 1.
  • This gene has LocusID: 1155, and is located on chromosome 19 with reported cytogenetic location 19ql3.11-ql3.12. The gene resides in genomic locus NT_011296 (NCBI Genome Annotation). Cytoskeleton-associated protein 1 associates with microtubules. It contains a glycine domain which plays a role in association with microtubules.
  • Affymetrix annotation suggests that CPS 6 corresponds to KIAA0220 which has LocusID: 23117 and is located at chromosome 16pl2.1.
  • Blast search of the Entrez human genome sequence database shows that CPS 6 has 99% sequence identity to a region on chromosome 16. This region is near LOC255565, and resides in genomic locus NT_035368 (NCBI Genome Annotation).
  • fragments of CPS 6 align with various chromosomal regions with at least 90% sequence identity ⁇ These regions include LOC220555, LOC124302, a chromosomal region near LOC220567, a chromosomal region near LOC254081, and a chromosomal region near LOC197363.
  • LOC220555 encodes a protein similar to nuclear pore complex interacting protein, and has reported cytogenetic location 16pl 1.2.
  • LOG 124302 encodes a protein similar to nuclear pore complex interacting protein, and has reported cytogenetic location l ⁇ pl l.l.
  • LOC220567 encodes a protein similar to apolipoprotein B48 receptor, and is located at chromosome 16ql3.
  • LOC254081 encodes a protein similar to group X secretory phospholipase A2 precursor (phosphatidylcholine 2- acylhydrolase GX) (GX sPLA2) (sPLA2-X), and is located on chromosome 16.
  • LOC197363 encodes a protein similar to ataxin 2 related protein (isoform 1), and is located on chromosome 16.
  • CPS 7 corresponds to CORO1A which encodes coronin, actin binding protein, 1A.
  • This gene has LocusID: 11151, and is located on chromosome 16 with reported cytogenetic location 16ql3.
  • the gene resides in genomic locus NT_033291 (NCBI Genome Annotation).
  • the gene product (coronin 1 A) binds to actin, and may be involved in mitosis, cell motility, formation of phagocytic vacuoles and phagocytosis. Coronin 1 A has at least five WD domains.
  • CPS 8 corresponds to GNL1 which encodes guanine nucleotide binding protein-like 1.
  • This gene has LocusID: 2794, and is 'located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation).
  • the GNL1 gene identified in the human major histocompatibility complex class I region, shows a high degree of similarity with its mouse counterpart.
  • the GNL1 gene is located less than 2 kb centromeric to HLA-E, in the same transcriptional orientation.
  • GNL1 is telomeric to HLA-B and HLA-C.
  • CPS 9 corresponds to AKR1A1 which encodes aldo-keto reductase family 1, member Al (aldehyde reductase).
  • This gene has LocusID: 10327, and is located on chromosome 1 with reported cytogenetic location Ip33-p32. The gene resides in genomic locus NT_032972 (NCBI Genome Annotation).
  • Aldehyde reductase aldo-keto reductase family 1, member Al
  • CPS 10 corresponds to ID2 which encodes inhibitor of DNA binding 2, dominant negative helix-loop-helix protein. This gene has LocusID: 3398, and is located on chromosome 2 with reported cytogenetic location 2p25. The gene resides in genomic locus NT_005334 (NCBI Genome Annotation). The gene product is a member of the Id helix- loop-helix family of proteins, and may negatively regulate cell differentiation. [0058] CPS 10 also has 95% sequence identity with an intron sequence of PTPRG.
  • PTPRG encodes protein tyrosine phosphatase, receptor type, G, and has LocusID: 5793 with reported cytogenetic location 3p21-pl4.
  • the protein encoded by PTPRG gene is a member of the protein tyrosine phosphatase (PTP) family.
  • PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation.
  • the PTP encoded by PTPRG gene possesses an extracellular region, a single transmembrane region, and two tandem intracytoplasmic catalytic domains, and thus represents a receptor-type PTP.
  • PTPRG gene is located in a chromosomal region that is frequently deleted in renal cell carcinoma and lung carcinoma, and thus is thought to be a candidate tumor suppressor gene.
  • CPS 10 aligns with a chromosomal region near LOG 140282 with
  • LOCI 40282 encodes a protein similar to translationally controlled tumor protein (TCTP) (p23) (Histamine-releasing factor) (HRF), and has reported cytogenetic location 21q21.3.
  • TCTP translationally controlled tumor protein
  • HRF Heistamine-releasing factor
  • CPS 11 corresponds to PSG7 which encodes pregnancy specific beta- 1 -gly coprotein 7 and has LocusID: 5676.
  • PSG7 is located at chromosome 19ql3.2.
  • PSG1 encodes pregnancy specific beta-1-glycoprotein, and has LocusID: 5669 with reported cytogenetic location 19ql3.2.
  • the gene product is a member of the pregnancy-specific glycoprotein
  • CPS 11 has 92-95% sequence identity with various regions on chromosome 19. They include PSG2, PSG9, a region near PSG3, and a region near PSG7.
  • PSG2 encodes pregnancy specific beta- 1 -glycoprotein 2, and has LocusID: 5670 with reported cytogenetic location 19ql3.1-ql3.2.
  • PSG9 encodes pregnancy specific beta-1- glycoprotein 9, and has LocusID: 5678 with reported cytogenetic location 19ql3.2.
  • PSG3 encodes pregnancy specific beta- 1 -glycoprotein 3, and has LocusID: 5671 with reported cytogenetic location 19ql3.2.
  • PSG7 encodes pregnancy specific beta- 1 -glycoprotein 7, and has LocusID: 5676 with reported cytogenetic location 19ql3.2.
  • CPS 13 corresponds to IDH2 which encodes isocitrate dehydrogenase 2
  • NADP+ mitochondrial. This gene has LocusID: 3418, and is located on chromosome 15 with reported cytogenetic location 15q26.1. The gene resides in genomic locus NT__033276
  • Mitochondrial NADP(+)-specific isocitrate dehydrogenase 2 can decarboxylate isocitrate into alpha-ketoglutarate.
  • CPS 14 corresponds to RER1 which encodes a protein similar to S. cerevisiae RERl. This gene has LocusID: 11079, and is located on chromosome 1 with reported cytogenetic location lpter-q24. The gene resides in genomic locus NT_004350
  • CPS 15 corresponds to TUBB which encodes tubulin, beta polypeptide. This gene has LocusID: 7280, and is located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_034880 (NCBI Genome
  • Beta-tubulin polymerizes to form microtubules, and is a member of a family of structural proteins.
  • CPS 16 corresponds to HADH2 which encodes hydroxyacyl-Coenzyme A dehydrogenase, type II.
  • This gene has LocusID: 3028, and is located on chromosome X with reported cytogenetic location Xpl 1.2. The gene resides in genomic locus NT_011799
  • the gene product can bind to amyloid-beta peptide.
  • CPS 17 corresponds to NDUFB5 which encodes NADH dehydrogenase
  • ubiquinone 1 beta subcomplex, 5 (16kD, SGDH).
  • This gene has LocusID: 4711, and is located on chromosome 3 with reported cytogenetic location 3q27.1.
  • the gene resides in genomic locus NT_022396 (NCBI Genome Annotation).
  • the gene product is a subunit of
  • NADH-ubiquinone oxidoreductase (complex I). It can transport electrons from NADH to ubiquinone.
  • CPS 17 also shows 77% sequence identity with a chromosomal region near
  • LOC205833 has reported cytogenetic location 4q34.3.
  • CPS 18 corresponds to APRT which encodes adenine phosphoribosyltransferase.
  • This gene has LocusID: 353, and is located on chromosome 16 with reported cytogenetic location 16q24. The gene resides in genomic locus NT_010404
  • Adenine phosphoribosyltransferase belongs to the purine/pyrimidine phosphoribosyltransferase family. This enzyme catalyzes the formation of AMP and inorganic pyrophosphate from adenine and 5-phosphoribosyl-l-pyrophosphate
  • PRPP polyamine biosynthesis pathway
  • CPS 19 corresponds to PSME2 wliich encodes proteasome (prosome, macropain) activator subunit 2 (PA28 beta).
  • This gene has LocusID: 5721, and is located on chromosome 14 with reported cytogenetic location 14qll.2. The gene resides in genomic locus NT_025892 (NCBI Genome Annotation).
  • CPS 19 also aligns with LOC220462 with 97% sequence identity.
  • LOC220462 encodes a protein similar to proteasome activator complex subunit 2
  • LOC220462 has reported cytogenetic location 13ql4.11.
  • CPS 19 has about 90-96% sequence identity with LOC257093, LOG 166868, and an intron sequence of LOC152940.
  • LOC257093 encodes a protein similar to proteasome activator complex subunit 2 (Proteasome activator 28-beta subunit), and is located on chromosome 5.
  • LOG 166868 encodes a protein similar to PA28beta, and has reported cytogenetic location 4pl4.
  • LOC220220 encodes a protein similar to PA28beta, and has reported cytogenetic location 1 Op 11.23.
  • LOC206704 also encodes a protein similar to PA28beta, and is located on chromosome 8p21.1.
  • CPS 20 corresponds to MTCP1 which encodes mature T-cell proliferation 1.
  • This gene has LocusID: 4515, and is located on chromosome X with reported cytogenetic location Xq28.
  • the gene resides in genomic locus NT_025965 (NCBI Genome Annotation).
  • the gene product may be involved in the leukemogenic process of mature T cell proliferation.
  • CPS 21 corresponds to NDUFB3 which encodes NADH dehydrogenase
  • ubiquinone 1 beta subcomplex, 3 (12kD, B12).
  • This gene has LocusID: 4709, and is located on chromosome 2 with reported cytogenetic location 2q31.3.
  • the gene resides in genomic locus NT_005370 (NCBI Genome Annotation).
  • the multisubunit NADH:ubiquinone oxidoreductase (complex I) is an enzyme complex in the electron transport chain of mitochondria.
  • NDUFB3 gene product is a subunit of NADH-ubiquinone oxidoreductase (complex I), and can transport electrons from NADH to ubiquinone.
  • CPS 21 also has about 96% sequence identity with NDUFB3P4.
  • NDUFB3P4 encodes NADH dehydrogenase (ubiquinone) 1 beta subcomplex, 3 (12kD, B12) pseudogene 4. It has LocusID: 93995, and is located on chromosome 14. The gene is located in an intron of WARS which encodes tryptophanyl-tRNA synthetase and has LocusID: 7453 with reported cytogenetic location 14q32.31.
  • CPS 21 has about 86-91% sequence identity with NDUFB3P3, and intron sequence of KIAA0893, and an intron sequence of PPP6C.
  • NDUFB3P3 encodes NADH dehydrogenase (ubiquinone) 1 beta subcomplex, 3 (12kD, B12) pseudogene 3. It has LocusID: 93996, and is located on chromosome 14.
  • KIAA0893 encodes KIAA0893 protein, and has LocusID: 22911 with reported cytogenetic location lpl3.2.
  • PPP6C encodes protein phosphatase 6, catalytic subunit, and has LocusID: 5537 with reported cytogenetic location 9q34.11.
  • CPS 23 corresponds to CTSL which encodes cathepsin L. This gene has
  • LocusID 1514, and is located on chromosome 9 with reported cytogenetic location 9q21- q22.
  • the gene resides in genomic locus NT_023935 (NCBI Genome Annotation).
  • the protein encoded by CTSL gene is a lysosomal cysteine proteinase that plays a role in intracellular protein catabolism. Its substrates include collagen and elastin, as well as alpha-
  • protease inhibitor a major controlling element of neutrophil elastase activity.
  • the encoded protein has been implicated in several pathologic processes, including myofibril necrosis in myopathies and in myocardial ischemia, and in the renal tubular response to proteinuria. This protein is a member of the peptidase Cl family. At least two transcript variants encoding the same protein have been found for CTSL gene.
  • Fragments of CPS 23 have 84-88% sequence identity with LOCI 18945,
  • LOCI 19215, and LOC219343 encode proteins similar to cathepsin L precursor (Major excreted protein) (MEP), and are located at chromosome 10q23.32, 10q21.1 and 10q23.2, respectively.
  • cathepsin L precursor Major excreted protein
  • CPS 24 corresponds to FGL2 which encodes fibrinogen-like 2 and has LocusID: 10875.
  • FGL2 has the reported cytogenetic location at chromosome 7ql 1.23.
  • Blast search of the Entrez human genome database shows that CPS 24 aligns with the non-protein-coding strand of KIAA1505 with at least 98% sequence identity.
  • KIAA1505 encodes KIAA1505 protein, and has LocusID: 57639 with reported cytogenetic location 7 ⁇ l2.3. KIAA1505 resides in genomic locus NT_007933 (NCBI Genome
  • LRP10 encodes low density lipoprotein receptor-related protein 10, and has LocusID: 26020. LRP10 is located at chromosome
  • nucleotides 164-283 and 359-589 have 95% sequence identity with an intron sequence of LOC121292.
  • LOC121292 is located at chromosome 12q24.31, and resides in genomic locus NT_035241 (NCBI Genome Annotation).
  • Nucleotides 253-323 and 359-589 of CPS 26 align with a region near LOC138924 with at least 95% sequence identity.
  • LOC138924 encodes a protein similar to peptidyl-Pro cis trans isomerase, and is located at chromosome 9q21.31 and in genomic locus NT_008580 (NCBI Genome Annotation).
  • Nucleotides 374-588 of CPS 26 have 93% sequence identity with a region between DIM1 and FLJ21172.
  • DIMl encodes a protein similar to S. pombe diml+, and has LocusID: 10907 with reported cytogenetic location 18q23.
  • FLJ21172 encodes hypothetical protein FLJ21172, and has LocusID: 79863 with reported cytogenetic location 18q23.
  • Nucleotides 634-1116 of CPS 26 have 86% sequence identity with an intron sequence of MYOID.
  • MYOID encodes myosin ID, and has LocusID: 4642 and reported cytogenetic location 17ql l-ql2.
  • CPS 27 corresponds to SEMA3F which encodes sema domain, immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin) 3F.
  • This gene has LocusID: 6405, and is located on chromosome 3 with reported cytogenetic location 3p21.3. The gene resides in genomic locus NT_006014 (NCBI Genome Annotation).
  • the semaphorins are a family of proteins that are involved in signaling. A typical family member has a secretion signal, a 500-amino acid sema domain, and 16 conserved cysteine residues. Sequence comparisons have grouped the secreted semaphorins into 3 general classes.
  • CPS 28 corresponds to BST2 which encodes bone marrow stromal cell antigen 2.
  • This gene has LocusID: 684, and is located on chromosome 19 with reported cytogenetic location 19pl3.2. The gene resides in genomic locus NT_011295 (NCBI Genome Annotation). Bone marrow stromal cells are involved in the growth and development of B-cells. Bone marrow stromal cell antigen 2 may play a role in pre-B-cell growth and in rheumatoid arthritis.
  • SLC7A1 encodes solute carrier family 7 (cationic amino acid transporter, y+ system), member 1.
  • SLC7A1 has LocusID: 6541, and is located on chromosome 13 with reported cytogenetic location 13ql2-ql4.
  • KIAA0774 encodes KIAA0774 protein, and has LocusID: 23281.
  • KIAA0774 is located on chromosome 13 with reported cytogenetic location 13ql2.2.
  • Both SLC7A1 and KIAA0774 have genomic locus NT_009799 (NCBI Genome Annotation).
  • CPS 29 is located 3' to the protein-coding regions of both genes.
  • CPS 29 matches with the protein-coding strand of SLC7A1.
  • SLC7A1 gene product can has strong similarity to murine Rec-1 (Atrcl), and can transport arginine, lysine and ornithine across the plasma membrane.
  • CPS 30 corresponds to UBE2L6 which encodes ubiquitin-conjugating enzyme E2L 6.
  • This gene has LocusID: 9246, and is located on chromosome 11 with reported cytogenetic location 1 lql2. The gene resides in genomic locus NT_033903 (NCBI Genome Annotation).
  • the gene product is a member of the ubiquitin-conjugating enzyme family, and can ubiquitinate cellular proteins and mark them for degradation. The gene product can also bind to Hect domains of E3 proteins.
  • CPS 31 corresponds to SLC11A1 which encodes solute carrier family 11
  • CPS 32 corresponds to CDKN1C which encodes cyclin-dependent kinase inhibitor IC (p57, Kip2). This gene has LocusID: 1028, and is located on chromosome 11 with reported cytogenetic location 1 lpl5.5.
  • CPS 32 also has 98% sequence identity with a chromosomal region near
  • LOC256784 encodes a protein similar to cyclin-dependent kinase inhibitor IC (Cyclin-dependent kinase inhibitor P57) (P57KIP2), and is located on chromosome 11 with genomic locus NT_009368 (NCBI Genome Annotation).
  • IC Cyclin-dependent kinase inhibitor P57
  • NT_009368 NCBI Genome Annotation
  • CPS 33 corresponds to GAIP (RGS 19) which encodes regulator of G-protein signaling 19.
  • This gene has LocusID: 10287, and is located on chromosome 20 with reported cytogenetic location 20ql3.3. The gene resides in genomic locus NT_011333 (NCBI Genome Annotation). G proteins mediate a number of cellular processes. The protein encoded by this gene belongs to the RGS (regulators of G-protein signaling) family and can interact with G protein, GAI3.
  • G-protein signaling 19 is a guanosine triphosphatase-activating protein that may function to down-regulate Galpha i/Galpha q- linked signaling.
  • CPS 34 corresponds to TACC2 which encodes transforming, acidic coiled- coil containing protein 2.
  • This gene has LocusID: 10579, and is located on chromosome 10 with reported cytogenetic location 10q26. The gene resides in genomic locus NT_030764 (NCBI Genome Annotation).
  • Transforming acidic coiled-coil proteins are a conserved family of centrosome- and microtubule-interacting proteins that are implicated in cancer.
  • the member encoded by TACC2 gene concentrates at centrosomes throughout the cell cycle, and it is a candidate breast tumor suppressor and biomarker for tumor progression.
  • CPS 35 corresponds to CLNS1A which encodes chloride channel, nucleotide-sensitive, 1A.
  • This gene has LocusID: 1207, and is located on chromosome 11 with reported cytogenetic location I lql3.5-ql4.
  • the gene resides in genomic locus NT_033927 (NCBI Genome Annotation).
  • the gene product is associated with a swelling- induced chloride channel, and may be involved in aqueous humor formation in the eye
  • CPS 35 also has 88% sequence identity with an intron sequence of TDRKH.
  • TDRKH encodes tudor and KH domain-containing protein, and has LocusID: 11022. It is located at chromosome lq21.
  • nucleotides 66-459 of CPS 35 have 89% sequence identity with a region near LOC221349.
  • LOC221349 encodes a protein similar to chloride conductance regulatory protein ICln ⁇ I(Cln)) (Chloride channel, nucleotide sensitive 1A) (Chloride ion current inducer protein) (C1CI) (Reticulocyte PICIn), and has reported cytogenetic location 6pl l.2. Nucleotides 1-66 of CPS 35 aligns with LOC152922 with 98% sequence identity.
  • LOCI 52922 encodes a protein similar to chloride conductance regulatory protein ICln (I(Cln)) (Chloride channel, nucleotide sensitive 1A) (Chloride ion current inducer protein) (C1CI) (Reticulocyte PICIn), and is located at chromosome 4q32.3.
  • CPS 36 corresponds to DPYSL2 which encodes dihydropyrimidinase-like 2.
  • CPS 37 corresponds to FCGR3A which encodes Fc fragment of IgG, low affinity Ilia, receptor for (CD 16).
  • This gene has LocusID: 2214, and is located on chromosome 1 with reported cytogenetic location lq23.
  • the gene resides in genomic locus NT_004668 (NCBI Genome Annotation).
  • the gene product is type III Fc gamma receptor, and can associate with zeta chain of the T-cell receptor complex (CD3Z).
  • CD3Z zeta chain of the T-cell receptor complex
  • CPS 38 corresponds to UQCRC1 which encodes ubiquinol-cytochrome c reductase core protein I. This gene has LocusID: 7384, and is located on chromosome 3 with reported cytogenetic location 3p21.3. The gene resides in genomic locus NT_005990 (NCBI Genome Annotation). The gene product, core I protein, is a subunit of the ubiquinol-cytochrome c oxidoreductase in the mitochondrial respiratory chain.
  • CPS 39 corresponds to GZMA which encodes granzyme A (granzyme 1, cytotoxic T-lymphocyte-associated serine esterase 3).
  • This gene has LocusID: 3001, and is located on chromosome 5 with reported cytogenetic location 5ql l-ql2. The gene resides in genomic locus NT_006431 (NCBI Genome Annotation). Cytolytic T lymphocytes (CTL) and natural killer (NK) cells share the ability to recognize, bind, and lyse specific target cells. They are thought to protect their host by lysing cells bearing on their surface "nonself antigens, usually peptides or proteins resulting from infection by intracellular pathogens. GZMA gene product is a T cell- and natural killer cell-specific serine protease that may function as a common component necessary for lysis of target cells by cytotoxic T lymphocytes and natural killer cells.
  • CPS 41 corresponds to PSMB10 which encodes proteasome (prosome, macropain) subunit, beta type, 10.
  • This gene has LocusID: 5699, and is located on chromosome 16 with reported cytogenetic location 16q22.1.
  • the gene resides in genomic locus NT_010478 (NCBI Genome Annotation).
  • the gene can replace beta subunit PSMB7 when cells are stimulated by interferon g.
  • CPS 42 corresponds to IDS which encodes iduronate 2-sulfatase (Hunter syndrome). This gene has LocusID: 3423, and is located on chromosome X with reported cytogenetic location Xq28. The gene resides in genomic locus NT_019686 (NCBI Genome Annotation). Iduronate-2-sulfatase is involved in the lysosomal degradation of heparan sulfate and dermatan sulfate. Mutations in this X-chromosome gene that result in enzymatic deficiency may lead to the sex-linked Mucopolysaccharidosis Type II, also known as Hunter Syndrome.
  • Iduronate-2-sulfatase has a sequence homology with human arylsulfatases A, B, and C, and human glucosamine-6-sulfatase. A splice variant of this gene has been described. [0099] Affymetrix annotation suggests that CPS 43 corresponds to AHNAK which encodes AHNAK nucleoprotein (desmoyokin) and has LocusID: 195. The gene is located at chromosome 11 ql 2-ql 3.
  • CPS 43 aligns with a region 3' to LOC221087 with 100% sequence identity.
  • LOC221087 is hypothetical gene supported by M80899. The gene is located on chromosome l lql3.1, and resides in genomic locus NT_033241 (NCBI Genome Annotation).
  • CPS 44 corresponds to PTPN7 which encodes protein tyrosine phosphatase, non-receptor type 7. This gene has LocusID: 5778, and is located on chromosome 1 with reported cytogenetic location lq32.1. The gene resides in genomic locus NT_034408 (NCBI Genome Annotation).
  • the gene product is a member of the protein tyrosine phosphatase (PTP) family.
  • PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation.
  • PTPN7 gene is expressed in a variety of hematopoietic cells, and is an early response gene in lymphokine stimulated cells.
  • the noncatalytic N-terminus of PTPN7 gene product can interact with MAP kinases and suppress the MAP kinase activities.
  • the gene product is also shown to be involved in the regulation of T cell antigen receptor (TCR) signaling, which is thought to function through dephosphorylating the molecules related to MAP kinase pathway.
  • TCR T cell antigen receptor
  • CPS 45 corresponds to DUSP2 wliich encodes dual specificity
  • This gene has LocusID: 1844, and is located on chromosome 2 with reported cytogenetic location 2qll. The gene resides in genomic locus NT_026970 (NCBI Genome Annotation).
  • the protein encoded by DUSP2 is a member of the dual specificity protein phosphatase subfamily. These phosphatases can inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and differentiation.
  • MAP mitogen-activated protein
  • CPS 46 corresponds to PSMB9 which encodes proteasome (prosome, macropain) subunit, beta type, 9 (large multifunctional protease 2).
  • PSMB9 has LocusID: 5698, and is located at chromosome 6p21.3.
  • Blast search of the Entrez human genome sequence database indicates that
  • CPS 46 shows 100% sequence identity with three regions near PSMB8. These regions are within genomic locus NT_007592 (NCBI Genome Annotation). PSMB8 encodes proteasome (prosome, macropain) subunit, beta type, 8 (large multifunctional protease 7). It has LocusID: 5696 and reported cytogenetic location 6p21.3.
  • HANI 1 encodes WD-repeat protein, and has LocusID: 10238 with reported cytogenetic location 17q21.33.
  • HAN11 resides in genomic locus NT_035428 (NCBI Genome Annotation).
  • HAN11 gene product has WD- repeats, and is similar to plant anl 1.
  • LILRBl encodes leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 1.
  • LILRBl has LocusID: 10859, and is located on chromosome 19 with reported cytogenetic location 19ql3.4.
  • LILRBl resides in genomic locus NT_011225 (NCBI Genome Annotation).
  • the gene product (leukocyte immunoglobulin-like receptor Bl) contains immunoreceptor tyrosine- based inhibitory motifs, and can bind to cellular and viral MHC class I antigens.
  • CPS 48 also aligns with LILRB2 with 91% sequence identity.
  • LILRB2 encodes leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 2.
  • the gene has LocusID: 10288 and reported cytogenetic location 19ql3.4.
  • CPS 49 corresponds to SDR1 which encodes short-chain dehydrogenase/reductase 1.
  • This gene has LocusID: 9249, and is located on chromosome 1 with reported cytogenetic location lp36.1. The gene resides in genomic locus NT_022041 (NCBI Genome Annotation). Short-chain dehydrogenase/reductase 1 can reduce all-trans- retinal during bleached visual pigment regeneration.
  • CPS 50 corresponds to POLR2L which encodes polymerase (RNA) II (DNA directed) polypeptide L, 7.6kDa.
  • POLR2L has LocusID: 5441, and is located at chromosome 1 lpl5.
  • CPS 51 corresponds to SCYA2 which encodes small inducible cytokine A2
  • CPS 52 corresponds to PAI2 (SERPINB2) which encodes serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 2.
  • This gene has LocusID: 5055, and is located on chromosome 18 with reported cytogenetic location 18q21.3. The gene resides in genomic locus NT_033907 (NCBI Genome Annotation).
  • the gene product is also known as plasminogen activator inhibitor II, and may function as a serine protease inhibitor. It is considered a member of the serpin family of serine protease inhibitors.
  • Blast search of the Entrez human genome sequence database shows that CPS
  • LOCI 58972 has reported cytogenetic location Xq28, and is located in genomic locus NT_026504 (NCBI Genome Annotation).
  • CPS 53 also aligns with a chromosomal region near LOC93052 with 93% sequence identity.
  • LOC93052 has reported cytogenetic location Xq28.
  • KIAA1564 encodes KIAA1564 protein, and has LocusID: 57680 with reported cytogenetic location 14ql 1.1.
  • KIAA1564 resides in genomic locus NT_025892 (NCBI Genome Annotation).
  • CPS 54 matches with the non-protein- coding strand of KIAA1564.
  • Affymetrix annotation indicates that CPS 55 corresponds to JMJ.
  • JMJ encodes jumonji homolog (mouse), and has LocusID: 3720.
  • the gene is located on chromosome 6 with reported cytogenetic location 6p24-p23 and genomic locus NT_007592 (NCBI Genome Annotation).
  • JMJ is an ortholog of the mouse jumonji gene, which encodes a nuclear protein involved in mouse embryogenesis, including neural tube formation. Overexpression of mouse jumonji negatively regulates cell proliferation.
  • the jumonji proteins contain a DNA-binding domain, called an AT-rich interaction domain (ARID), and share regions of similarity with human retinoblastoma-binding protein-2 and the human SMCX protein. Nucleotides 1997 to 198161 of AL021938 match with the non- protein-coding strand of JMJ. [0116] Nucleotides 1 to 66 of CPS 55 have 100% sequence identity with a chromosomal region in genomic locus NT_010463 (NCBI Genome Annotation). This chromosomal region is located 3 ' to LOC222499 which has reported cytogenetic location
  • CPS 56 corresponds to UGP2 which encodes UDP-glucose pyrophosphorylase 2. This gene has LocusID: 7360, and is located on chromosome 2 with reported cytogenetic location 2pl4-pl3. The gene resides in genomic locus NT_005375
  • UGP2 (NCBI Genome Annotation).
  • the enzyme encoded by UGP2 is an intermediary in mammalian carbohydrate interconversions. It can transfer a glucose moiety from glucose-
  • LOC253415 encodes a protein similar to UDP-glucose pyrophosphorylase 2
  • UDP-glucose-1 -phosphate uridyltransferase UDP-glucose diphosphorylase
  • LOC253415 is located on chromosome 2.
  • CPS 57 corresponds to KIAA0410. This gene has LocusID: 9818, and is located on chromosome 13 with reported cytogenetic location 13ql2.12. The gene resides in genomic locus NT_009799 (NCBI Genome Annotation).
  • CPS 58 corresponds to NDUFS7 which encodes NADH dehydrogenase
  • LocusID 4727, and is located on chromosome 19 with reported cytogenetic location 19pl3.
  • the gene resides in genomic locus NT_011268 (NCBI Genome Annotation).
  • CPS 59 corresponds to KIAA0645.
  • This gene has LocusID: 9681, and is located on chromosome 22 with reported cytogenetic location 22ql2.3. The gene resides in genomic locus NT_011520 (NCBI Genome Annotation).
  • CPS 60 corresponds to GSTP1 which encodes glutathione S-transferase pi.
  • This gene has LocusID: 2950, and is located on chromosome 11 with reported cytogenetic location l lql3. The gene resides in genomic locus NT_033241 (NCBI Genome
  • Glutathione S-transferases are a family of enzymes that play a role in detoxification by catalyzing the conjugation of many hydrophobic and electrophilic compounds with reduced glutathione.
  • the soluble GSTs are categorized into 4 main classes: alpha, mu, pi, and theta.
  • the glutathione S-transferase pi gene (GSTP1) is a polymorphic gene encoding active, functionally different GSTP1 variant proteins that are thought to function in xenobiotic metabolism and play a role in susceptibility to cancer, and other diseases.
  • Nucleotides 180 to 558 have 86% sequence identity with a chromosomal region near DGKA.
  • DGK encodes diacylglycerol kinase, alpha (80kD), and is located at chromosome 12ql3.3 with LocusID: 1606.
  • CPS 61 corresponds to DECR1 which encodes 2,4-dienoyl CoA reductase 1, mitochondrial. This gene has LocusID: 1666, and is located on chromosome 8 with reported cytogenetic location 8q21.3. The gene resides in genomic locus NT_034898 (NCBI Genome Annotation). The gene product is an accessory enzyme which participates in the beta-oxidation and metabolism of unsaturated fatty enoyl-CoA esters.
  • CPS 62 corresponds to PLXNCl which encodes plexin Cl. This gene has
  • LocusID 10154, and is located on chromosome 12 with reported cytogenetic location 12q23.3. The gene resides in genomic locus NT_009575 (NCBI Genome Annotation). Plexin Cl can function as a receptor for virally-encoded semphorin. It is a member of the plexin family.
  • CPS 63 corresponds to TUBA2 which encodes tubulin, alpha 2.
  • This gene has LocusID: 7278, and is located on chromosome 13 with reported cytogenetic location 13ql l. The gene resides in genomic locus NT_009799 (NCBI Genome Annotation).
  • Microtubules of the eukaryotic cytoskeleton perform essential and diverse functions and are composed of a heterodimer of alpha and beta tubulin.
  • the genes encoding these microtubule constituents are part of the tubulin superfamily, which is composed of six distinct families. Genes from the alpha, beta and gamma tubulin families are found in all eukaryotes.
  • the alpha and beta tubulins represent the major components of microtubules, while gamma tubulin plays a critical role in the nucleation of microtubule assembly.
  • TUBA2 gene is an alpha tubulin gene that encodes a protein similar to the mouse testis-specific Tuba3 and Tuba7 gene products.
  • TUBA2 gene is located in the 13ql l region, which is associated with the genetic diseases Clouston hidrotic ectodermal dysplasia and Kabuki syndrome. Alternative splicing has been observed for this gene and at least two variants have been identified.
  • CPS 63 has 95-96% sequence identity with H2-ALPHA and LOCI 12714.
  • H2-ALPHA encodes alpha-tubulin isotype H2-alpha, and has LocusID: 113457 with reported cytogenetic location 2q22.1.
  • LOC112714 encodes a protein similar to alpha tubulin, and has LocusID: 112714 with reported cytogenetic location 2ql4.2.
  • CPS 63 shows 85-90% sequence identity with a chromosomal region near MGC 16703.
  • MGC 16703 encodes alpha tubulin-like, and is located at chromosome 22ql l.21 with LocusID: 113691. Fragments of CPS have 83-91% sequence identity with regions on chromosomes 1 and 22.
  • CPS 64 corresponds to NMEl which encodes non-metastatic cells 1, protein
  • NMEl nucleoside diphosphate kinase
  • SLC11A1 encodes solute carrier family 11 (proton-coupled divalent metal ion transporters), member 1, and has LocusID: 6556.
  • SLC11A1 gene product is similar to murine Beg (Nrampl), and may control antimicrobial activity of macrophages.
  • NLI-IF encodes nuclear LIM interactor-interacting factor, and has LocusID: 58190.
  • NLI-IF gene product is similar to a region of S. cerevisiae plasma membrane phosphatase Psr2p.
  • CPS 65 is located 3' to the protein-coding sequence of SLC11 Al and 5' to the protein-coding sequence of NLI-IF.
  • CPS 66 corresponds to RNAHP which encodes RNA helicase-related protein and has LocusID: 11325. The gene has reported cytogenetic location at chromosome 17q22.
  • Blast search of the Entrez human genome sequence database shows that CPS
  • CPS 66 aligns with a chromosomal region at chromosome 16ql3 with at least 95% sequence identity. This region resides in genomic locus NT_010498 (NCBI Genome Annotation), and is located near MT1G. MT1G encodes metallothionein 1G, and has LocusID: 4495. [0133] In addition, CPS 66 has 83-92% sequence identity with various regions on chromosomes 1, 4, 9, 16 and 20. These regions include MTIH, MT2P1, LOC255917, LOC127544, LOC149450, a region near MGC10731, and a region near MMP24.
  • MTIH encodes metallothionein 1H, and has LocusID: 4496 with reported cytogenetic location 16ql3.
  • MT2P1 encodes metallothionein 2 pseudogene 1 (processed), and is located at chromosome 4pl l-q21 (LocusID: 4503).
  • MGC10731 has LocusID: 79363, and is located at chromosome lp36.13.
  • LOC255917 encodes a protein similar to Metallothionein-IE (MT- 1E), and is located on chromosome 9.
  • LOCI 27544 encodes a protein similar to dJ1174N9.1 (novel protein with IBR domain), and is located at chromosome lp34.3. LOC149450 has reported cytogenetic location lq42.3. MMP24 encodes matrix metalloproteinase 24 (membrane-inserted), and is located at chromosome 20ql l.2 with LocusID: 10893.
  • CPS 67 corresponds to ZNF198 which encodes zinc finger protein 198.
  • This gene has LocusID: 7750, and is located on chromosome 13 with reported cytogenetic location 13ql l-ql2. The gene resides in genomic locus NT_009799 (NCBI Genome Annotation). Zinc-finger protein 198 contains zinc fingers.
  • Nucleotides 185-221 of CPS 67 have 100% sequence identity with an intron sequence of LOC205936.
  • LOC205936 is located at chromosome 4pl6.2, and resides in genomic locus NT_006051 (NCBI Genome Annotation).
  • CPS 68 corresponds to PRCP which encodes prolylcarboxypeptidase
  • angiotensinase C This gene has LocusID: 5547, and is located on chromosome 11 with reported cytogenetic location 1 lql4. The gene resides in genomic locus NT_033927 (NCBI Genome Annotation). Prolylcarboxypeptidase (angiotensinase C) is a serine carboxypeptidase and can remove residues linked to proline.
  • CPS 70 corresponds to DKFZP586A0522 which encodes DKFZP586A0522 protein.
  • This gene has LocusID: 25840, and is located on chromosome 12 with reported cytogenetic location 12qll. The gene resides in genomic locus NT_009782 (NCBI Genome Annotation). The gene product include a region with low sequence similarity to a region of S. cerevisiae Coq5p. DKFZP586A0522 overlaps with LOC196529 which is encodes a protein similar to DKFZP586A0522 protein.
  • CPS 71 corresponds to SFRS11 which encodes splicing factor, arginine/serine-rich 11.
  • This gene has LocusID: 9295, and is located on chromosome 1 with reported cytogenetic location Ip21-p34.
  • the gene resides in genomic locus NT 304464 (NCBI Genome Annotation).
  • the gene product contains arginine/serine-rich domain and an RRM domain, and may have a role in pre-mRNA splicing.
  • Nucleotides 1 to 234 of CPS 71 have 89% sequence identity with an intron sequence of PEPP2.
  • PEPP2 encodes phosphoinositol 3 -phosphate-binding protein-2, and has LocusID: 54477 with reported cytogenetic location 12pl2.
  • CPS 72 corresponds to RAC2 which encodes ras-related C3 botulinum toxin substrate 2 (rho family, small GTP binding protein, Rac2).
  • This gene has LocusID: 5880, and is located on chromosome 22 with reported cytogenetic location 22ql3.1.
  • the gene resides in genomic locus NT_011520 (NCBI Genome Annotation).
  • RAC2 is a GTPase which belongs to the RAS superfamily of small GTP-binding proteins.
  • the RAC2 gene product may be a target for ADP ribosylation by the C3 subunit of botulinum toxin.
  • STK17A encodes serine/threonine kinase 17a (apoptosis-inducing), and has
  • LocusID 9263 with reported cytogenetic location 7pl2-pl4.
  • CPS 73 corresponds to ZFP103 which encodes zinc finger protein 103 homolog (mouse).
  • This gene has LocusID: 7844, and is located on chromosome 2 with reported cytogenetic location 2pl 1.2. The gene resides in genomic locus NT 315805 NCBI
  • the gene product contains a zinc-finger domain, and may be associated with membranous protein sorting.
  • CPS 74 corresponds to LOC51580 which encodes H-2K binding factor-2.
  • This gene has LocusID: 51580, and is located on chromosome 9. The gene resides in genomic locus NT_006316 (NCBI Genome Annotation).
  • the gene product is a member of the recombination signal-sequence binding protein family. It is a transcription factor that binds to the NFkB site of MHC class I genes.
  • CPS 75 corresponds to NRDl which encodes nardilysin (N-arginine dibasic convertase). This gene has LocusID: 4898, and is located on chromosome 1 with reported cytogenetic location Ip32.2-p32.1. The gene resides in genomic locus NT_004424 (NCBI).
  • N-arginine dibasic convertase is a zinc-dependent endopeptidase. It is a member of the insulinase family. [0145] Blast search of the Entrez human genome sequence database shows that CPS
  • CPS 76 overlaps or includes the 3' untranslated region of FLJ20154.
  • FLJ20154 encodes hypothetical protein FLJ20154, and is located at chromosome 10q24.32 with LocusID: 54838.
  • FLJ20154 resides in genomic locus NT_030059 (NCBI Genome Annotation).
  • CPS 76 also has 85-88% sequence identity with an intron sequence of KIAA0103, a chromosome 13 region near LOCI 60822, and a chromosome 7 region near BAP29.
  • KIAA0103 has LocusID: 9694 and reported cytogenetic location 8q23.1.
  • LOCI 60822 encodes a protein similar to hypothetical protein FLJ12457, and is located at chromosome 13q31.1.
  • BAP29 encodes B-cell receptor-associated protein BAP29, and is located at chromosome 7q22.2 with LocusID: 55973.
  • fragments of CPS 76 align with various other regions on chromosome 1, 6, 15 and 19 with 89-90% sequence identity.
  • Affymetrix annotation suggests that CPS 77 corresponds to KIAA0906, also known as NUP210 which encodes nucleoporin 210.
  • the gene has LocusID: 23225, and is located at chromosome 3p25.1.
  • FLJ22389 has LocusID: 79985, and is located on chromosome 3 with reported cytogenetic location 3p25.1. The gene resides in genomic locus NT_005927 (NCBI Genome Annotation).
  • CPS 78 corresponds to MT1F which encodes metallothionein IF (functional) and has LocusID: 4494. The gene is located at
  • chromosomal region 78 has 100% sequence identity with a chromosomal region located 3' to the protein-coding sequence of MT1G.
  • This chromosomal region, as well as MT1G, are within genomic locus NT_010498 (NCBI Genome Annotation) on chromosome 16.
  • MT1G encodes metallothionein 1G, and has LocusID: 4495 with reported cytogenetic location 16ql3.
  • Nucleotides 1 to 67 of CPS 78 have 90-95% sequence identity to various regions in genomic locus NT_010498. These regions include MT2A, LOC221228, MT1G and MT1L.
  • MT2A encodes metallothionein 2 A, and has LocusID: 4502 with reported cytogenetic location 16ql3.
  • LOC221228 is a hypothetical gene supported by AF495759.
  • MT1G encodes metallothionein 1G.
  • MT1L encodes metallothionein IL, and has LocusID: 4500 with reported cytogenetic location 16ql3.
  • CPS 78, and certain fragments thereof have 89-93% sequence identity with MT2P1, LOC127544, LOC255917, a chromosomal region near MMP24, a chromosomal region near MGC10731, and a chromosomal region near LOC149450.
  • MT2P1 encodes metallothionein 2 pseudogene 1 (processed), and has LocusID: 4503 with reported cytogenetic location 4pl l-q21.
  • LOC127544 encodes a protein similar to dJ1174N9.1 (novel protein with IBR domain), and is located at chromosome lp34.3.
  • LOC255917 encodes a protein similar to metallothionein-IE (MT-1E), and is located on chromosome 9.
  • MMP24 encodes matrix metalloproteinase 24 (membrane-inserted), and is located at chromosome 20ql l.2 with LocusID: 10893.
  • MGC10731 encodes hypothetical protein MGC10731, and has LocusID: 79363 with reported cytogenetic location lp36.13.
  • LOC149450 is located at chromosome lq42.3.
  • CPS 79 corresponds to P85SPR (ARHGEF7) which encodes Rho guanine nucleotide exchange factor (GEF) 7.
  • GEF Rho guanine nucleotide exchange factor
  • This gene has LocusID: 8874, and is located on chromosome 13 with reported cytogenetic location 13q34. The gene resides in genomic locus NT_009952 (NCBI Genome Annotation).
  • Rho GTPases are involved in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors.
  • the protein encoded by P85SPR belongs to a family of cytoplasmic proteins that activate the Ras-like family of Rho proteins by exchanging bound GDP for GTP.
  • the encoded protein may form a complex with G proteins and stimulate Rho- dependent signals.
  • the protein can induce membrane ruffling.
  • the gene product is also involved in Pak recruitment to Cdc42- and Racl -driven focal complexes. Multiple alternatively splic
  • CPS 80 corresponds to UNK_N53547 (MGC5508) which encodes hypothetical protein MGC5508. This gene has LocusID: 79073, and is located on chromosome 11 with reported cytogenetic location l lql3.1. The gene resides in genomic locus NT_033903 (NCBI Genome Annotation).
  • CPS 81 corresponds to GPRK5 which encodes G protein-coupled receptor kinase 5.
  • This gene has LocusID: 2869, and is located on chromosome 10 with reported cytogenetic location 10q24-qter. The gene resides in genomic locus NT_008902 (NCBI Genome Annotation).
  • G protein-coupled receptor kinases (GRKs) play a role in phosphorylating and regulating the activity of a variety of G protein-coupled receptors.
  • G protein-coupled receptor kinase 5 can phosphorylate agonist-stimulated G protein-coupled receptors.
  • CPS 83 corresponds to SCYA7 which encodes small inducible cytokine A7
  • Monocyte chemotactic protein 3 (monocyte chemotactic protein 3). This gene has LocusID: 6354, and is located on chromosome 17 with reported cytogenetic location 17ql l.2-ql2. The gene resides in genomic locus NT_010799 (NCBI Genome Annotation). Monocyte chemotactic protein 3 is a secreted chemokine which attracts macrophages, during inflammation and metastasis. It is a member of the C-C subfamily of chemokines 'which are characterized by having two adjacent cysteine residues. The protein is an in vivo substrate of matrix metalloproteinase
  • SCYA7 gene is part of a cluster of C-C chemokine family members on chromosome 17q.
  • CPS 84 corresponds to TUBA3 which encodes tubulin, alpha 3. This gene has LocusID: 7846, and is located on chromosome 12 with reported cytogenetic location
  • the gene resides in genomic locus NT_009526 (NCBI Genome
  • TUBA3 encodes alpha tubulin and is similar to mouse and rat Tubal gene.
  • TUBA3 is one of three alpha-tubulin genes in a cluster on chromosome
  • CPS 84 also has 97% sequence identity with a chromosomal region located
  • LOC134262 encodes a protein similar to alphaTub84B gene product.
  • the gene is located at 5pl 1, and resides in genomic locus NT_023098.
  • CPS 85 corresponds to SCML2 which encodes sex comb on midleg-like 2
  • the gene product is similar to Drosophila Scm.
  • CPS 85 also aligns with SCML1 with 94% sequence identity.
  • SCML1 encodes sex comb on midleg-like 1 (Drosophila).
  • the gene has LocusID: 6322, and is located on chromosome X with reported cytogenetic location Xp22.2-p22.1.
  • the gene resides in genomic locus NT_011586.
  • CPS 87 corresponds to IL1R1 which encodes interleukin 1 receptor, type I.
  • This gene has LocusID: 3554, and is located on chromosome 2 with reported cytogenetic location 2ql2. The gene resides in genomic locus NT_022171 (NCBI Genome Annotation).
  • Type I interleukin- 1 receptor contains immniunoglobulin domains, and can bind to all three forms of interleukin- 1 (ILIA, IL1B, and IL1RN).
  • CPS 88 corresponds to UNK AL008729 (LOC221692) which encodes a protein similar to KIAA1733 protein. This gene is located on chromosome 6 with reported cytogenetic location 6p22.3. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation). LOC221692 is located within LOC51256 which has LocusID: 51256. CPS 88 matches with the non-protein-coding strand of LOC51256. [0163] CPS 89 corresponds to KIAA0191 which encodes KIAA0191 protein. This gene has LocusID: 23318, and is located on chromosome 1 with reported cytogenetic location lp32.3. The gene resides in genomic locus NT_004424 (NCBI Genome Annotation).
  • CPS 90 corresponds to EGFL5 which encodes EGF-like-domain, multiple 5.
  • This gene has LocusID: 1955, and is located on chromosome 9 with reported cytogenetic location 9q32-q33.3. The gene resides in genomic locus NT_017568 (NCBI Genome Annotation).
  • CPS 91 corresponds to DUSP1 which encodes dual specificity phosphatase
  • This gene has LocusID: 1843, and is located on chromosome 5 with reported cytogenetic location 5q34. The gene resides in genomic locus NT_023132 (NCBI Genome Annotation).
  • the expression of DUSP1 gene can be induced in human skin fibroblasts by oxidative/heat stress and growth factors.
  • the bacterially expressed and purified DUSP1 protein has intrinsic phosphatase activity, and can inactivate mitogen-activated protein (MAP) kinase in vitro by the concomitant dephosphorylation of both its phosphothreonine and phosphotyrosine residues.
  • MAP mitogen-activated protein
  • DUSP1 protein can also suppress the activation of MAP kinase by oncogenic ras in extracts of Xenopus oocytes.
  • DUSP1 may play a role in the human cellular response to environmental stress as well as in the negative regulation of cellular proliferation.
  • CPS 92 corresponds to FBP1 which encodes fructose- 1 ,6-bisphosphatase 1.
  • Fructose- 1,6-bisphosphatase 1 is a gluconeogenesis regulatory enzyme. It can catalyze the hydrolysis of fructose 1,6-bisphosphate to fructose 6-phosphate and inorganic phosphate. Fructose- 1,6-diphosphatase deficiency is associated with hypoglycemia and metabolic acidosis. [0167] CPS 93 corresponds to HRB which encodes HIV-1 Rev binding protein.
  • HIV-1 Rev binding protein can interact with the activation domains of the HIV-1 Rev protein, and may be related to nucleoporins, a class of proteins that mediate nucleocytoplasmic transport. HIV-1 Rev binding protein binds to the Rev activation domain when Rev is assembled onto its RNA target and can significantly enhance Rev activity when over-expressed.
  • the HRB gene can expressed as a major 2.8-kb and a minor 4.6-kb mRNA.
  • CPS 94 corresponds to NAGA which encodes N-acetylgalactosaminidase, alpha-.
  • This gene has LocusID: 4668, and is located on cliromosome 22 with reported cytogenetic location 22ql3-qter.
  • the gene resides in genomic locus NT_011520 (NCBI Genome Annotation).
  • the lysosomal enzyme alpha-N-acetylgalactosaminidase can cleave alpha-N-acetylgalactosaminyl moieties from glycoconjugates. Mutations in NAGA have been implicated as the cause of Schindler disease types I and II (type II also known as Kanzaki disease).
  • CPS 95 corresponds to GLRX which encodes glutaredoxin (thioltransferase).
  • This gene has LocusID: 2745, and is located on chromosome 5 with reported cytogenetic location 5ql4. The gene resides in genomic locus NT_023148 (NCBI Genome Annotation). Glutaredoxin can function as a glutathione-dependent hydrogen donor for ribonucleotide reductase.
  • Fragments of CPS 95 also align with AF358259, a chromosomal region near
  • GABRA6 and a chromosomal region near GLRXP with 86-90% sequence identity.
  • AF358259 encodes glutaredoxin pseudogene 2, and has LocusID: 171418 with reported cytogenetic location 14q32.13-q32.2.
  • GABRA6 encodes gamma-aminobutyric acid (GABA) A receptor, alpha 6. It has LocusID: 2559, and is located at chromosome 5q34.
  • GLRXP encodes glutaredoxin (thioltransferase) pseudogene, and is located at chromosome 20ql l.2 with LocusID: 170522.
  • nucleotides 1 to 29 of CPS 95 has 100% sequence identity with LOC257079.
  • LOC257079 encodes a protein similar to glutaredoxin (Thioltransferase) (TTase), and is located on chromosome 5.
  • CPS 96 corresponds to UNK .
  • W28281 (GABARAPL1) which encodes
  • GABA(A) receptor-associated protein like 1 This gene has LocusID: 23710, and is located on chromosome 12 with reported cytogenetic location 12pl3.1. The gene resides in genomic locus NT_035207 (NCBI Genome Annotation).
  • SEQ ID NO: 360 which can be used as a probe sequence for detecting the expression level of UNK . W28281, also aligns with GABARAPL3 with about 95% sequence identity.
  • GABARAPL3 encodes GABA(A) receptors associated protein like 3. It has LocusID: 23766, and is located on chromosome 15 with reported cytogenetic location 15q25.1.
  • CPS 97 corresponds to UNK_AL096740 (UBE3B) which encodes ubiquitin protein ligase.
  • UBE3B UBE3B which encodes ubiquitin protein ligase.
  • This gene has LocusID: 89910, and is located on chromosome 12 with reported cytogenetic location 12q24.11. The gene resides in genomic locus NT_009770 (NCBI Genome Annotation).
  • CPS 98 corresponds to TBXASl wliich encodes thromboxane A synthase 1
  • cytochrome P450 (platelet, cytochrome P450, subfamily V). This gene has LocusID: 6916, and is located on chromosome 7 with reported cytogenetic location 7q34-q35. The gene resides in genomic locus NT_007914 (NCBI Genome Annotation).
  • the gene product is a member of the cytochrome P450 superfamily of enzymes.
  • the cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids.
  • TBXASl gene product is considered a member of the cytochrome P450 superfamily on the basis of sequence similarity rather than functional similarity.
  • TBXASl gene product may play a role in several pathophysiological processes including hemostasis, cardiovascular disease, and stroke.
  • the gene expresses at least two transcript variants.
  • CPS 99 corresponds to DPYD which encodes dihydropyrimidine dehydrogenase.
  • This gene has LocusID: 1806, and is located on chromosome 1 with reported cytogenetic location lp22. The gene resides in genomic locus NT_034389 (NCBI Genome Annotation).
  • Dihydropyrimidine dehydrogenase is a pyrimidine catabolic enzyme which is involved in the initial and rate-limiting step in the pathway of uracil and thymidine catabolism and also in the pathway leading to the formation of beta-alanine.
  • the DPYD gene is a large gene of approximately 150 kb consisting of at least 23 exons encoding a protein of approximately 111-kDa. Genetic deficiency of DPYD enzyme results in an error in pyrimidine metabolism associated with thymine-uraciluria and an increased risk of toxicity in cancer patients receiving 5-flourouracil chemotherapy.
  • CPS 100 corresponds to ECGF1 which encodes endothelial cell growth factor 1 (platelet-derived). Nucleotides 4 to 120 of M63193 (SEQ ID NO: 94) align with ECGF1. This gene has LocusID: 1890, and is located on chromosome 22 with reported cytogenetic location 22ql3.33. The gene resides in genomic locus NT_011526 (NCBI Genome Annotation). ,
  • CPS 100 aligns with SC02 with at least 98% sequence identity.
  • SC02 encodes SCO cytochrome oxidase deficient homolog 2 (yeast). It has LocusID: 9997, and is located on chromosome 22 with reported cytogenetic location 22ql3.33. The gene resides in genomic locus NT_011526.
  • Mammalian cytochrome c oxidase (COX) catalyzes the transfer of reducing equivalents from cytochrome c to molecular oxygen and pumps protons across the inner mitochondrial membrane.
  • yeast two related COX assembly genes, yeast SCOl and SC02 (synthesis of cytochrome c oxidase), enable subunits 1 and 2 to be incorporated into the holoprotein.
  • SC02 is the human homolog of the yeast SC02 gene.
  • CPS 101 corresponds to TIMP1 which encodes tissue inhibitor of metalloproteinase 1 (erythroid potentiating activity, collagenase inhibitor).
  • the gene has LocusID: 7076, and is located at chromosome Xpl 1.3-pl 1.23.
  • Blast search of the Entrez human genome sequence database shows that CPS
  • SYN1 encodes synapsin I, and has LocusID: 6853 with reported cytogenetic location Xpl 1.23. The gene resides in genomic locus NT_011568. CPS 101 matches with the non-protein- coding strand of SYN1.
  • CPS 102 corresponds to GSN which encodes gelsolin (amyloidosis, Finnish type). This gene has LocusID: 2934, and is located on chromosome 9 with reported cytogenetic location 9q33. The gene resides in genomic locus NT_017568 (NCBI Genome Annotation). Gelsolin is a calcium-dependent protein which may function to sever and cap actin filaments.
  • CPS 103 corresponds to SECTM1 which encodes secreted and transmembrane 1.
  • This gene has LocusID: 6398, and is located on chromosome 17 with reported cytogenetic location 17q25. The gene resides in genomic locus NT_025911 (NCBI Genome Annotation).
  • the gene product is a transmembrane and secreted protein with characteristics of a type la transmembrane protein. It is found in a perinuclear Golgi-like pattern and thought to be involved in hematopoietic or immune system processes. The gene product may have a role in hematopoiesis or immune processes.
  • CPS 104 corresponds to OLR1 which encodes oxidised low density lipoprotein (lectin-like) receptor 1.
  • This gene has LocusID: 4973, and is located on chromosome 12 with reported cytogenetic location 12pl3.2-pl2.3. The gene resides in genomic locus NT_035207 (NCBI Genome Annotation).
  • Lectin-like oxidized low density lipoprotein receptor is a member of the C-type lectin receptor family, and may be involved in degradation of oxidized LDL by vascular endothelial cells.
  • CPS 105 corresponds to D6S49E (LST1) which encodes leukocyte specific transcript 1.
  • LST1 D6S49E
  • This gene has LocusID: 7940, and is located on chromosome 6 with reported cytogenetic location 6p21.3.
  • the gene resides in genomic locus NT_007592 (NCBI Genome Annotation).
  • the gene product is expressed in leukocytes and induced by IFN- gamma. It possibly functions in the immune response of monocytes and T cells.
  • CPS 106 corresponds to JUNB which encodes jun B proto-oncogene. This gene has LocusID: 3726, and is located on chromosome 19 with reported cytogenetic location 19pl3.2.
  • the gene resides in genomic locus NT_011176 (NCBI Genome Annotation).
  • the gene product may participate in AP-1 transcriptional activation.
  • CPS 107 corresponds to PFC which encodes properdin P factor, complement.
  • This gene has LocusID: 5199, and is located on chromosome X with reported cytogenetic location Xpl 1.3-pl 1.23.
  • the gene resides in genomic locus NT_011568 (NCBI Genome Annotation).
  • the gene product contains a related type-I repeat sequence, and may play a role in complement-mediated clearance.
  • CPS 108 corresponds to POLR2E which encodes polymerase (RNA) II
  • RNA directed polypeptide E 25kD
  • This gene has LocusID: 5434, and is located on chromosome 19 with reported cytogenetic location 19pl3.3. The gene resides in genomic locus NT_011277 (NCBI Genome Annotation).
  • This gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. The encoded subunit is shared by the other two DNA-directed RNA polymerases and is present in two-fold molar excess over the other polymerase subunits. An interaction between this subunit and a hepatitis virus transactivating protein has been demonstrated, suggesting that interaction between transcriptional activators and the polymerase can occur through this subunit.
  • CPS 109 corresponds to THBS1 which encodes thrombospondin 1.
  • This gene has LocusID: 7057, and is located on chromosome 15 with reported cytogenetic location 15ql5.
  • the gene resides in genomic locus NT_030828 (NCBI Genome Annotation).
  • Thrombospondin- 1 is a member of a family of adhesive molecules. It has a role in blood clotting and in angiogenesis.
  • CPS 110 corresponds to HK2 which encodes hexokinase 2. This gene has
  • LocusID 3099, and is located on chromosome 2 with reported cytogenetic location 2pl3.
  • the gene resides in genomic locus NT_022184 (NCBI Genome Annotation).
  • Hexokinases phosphorylate glucose to produce glucose-6-phosphate, thus committing glucose to the glycolytic pathway.
  • HK2 gene encodes hexokinase 2, the predominant form found in skeletal muscle.
  • the gene product localizes to the outer membrane of mitochondria. Expression of this gene is insulin-responsive, and studies in rat suggest that it is involved in the increased rate of glycolysis seen in rapidly growing cancer cells.
  • CPS 110 also aligns with a chromosomal region near LOG 139132 with about
  • I I SH3 domain-binding protein SNP70
  • Npw38-binding protein NpwBP NpwBP
  • CPS 111 corresponds to INSIG1 which encodes insulin induced gene 1.
  • the gene has LocusID: 3638 and is located at chromosome
  • LOCI 31742 has about 80-85% sequence identity with a chromosomal region near LOCI 31742. This chromosomal region and LOCI 31742 reside in genomic locus NT_022554. LOCI 31742 has reported cytogenetic location 3p 12.1 -pi 1.2.
  • CPS 112 corresponds to HCK which encodes hemopoietic cell kinase.
  • This gene has LocusID: 3055, and is located on chromosome 20 with reported cytogenetic location 20ql l-ql2.
  • the gene resides in genomic locus NT_028392 (NCBI Genome Annotation).
  • the gene product can function as a non-receptor protein tyrosine kinase.
  • Affymetrix annotation suggests that CPS 113 corresponds to HP10390, alos known as TMEM4 which encodes transmembrane protein 4.
  • the gene has LocusID: 10330, and is located at chromosome 12ql5.
  • Blast search of the Entrez human genome sequence database shows that CPS
  • TIMELESS aligns with two regions with 100% sequence identity.
  • the first region is located in an intron of TIMELESS.
  • the second region is located 5' to the polypeptide-coding sequence of TIMELESS.
  • TIMELESS encodes timeless homolog (Drosophila). It has LocusID: 8914, and is located on chromosome 12 with reported cytogenetic location 12ql2-ql3.
  • the gene resides in genomic locus NT_009458 (NCBI Genome Annotation).
  • the gene product may be involved in circadian oscillation autoregulation.
  • CPS 114 corresponds to UNK U51712 (LAGY) which encodes lung cancer- associated Y protein.
  • LAGY UNK U51712
  • This gene has LocusID: 84525, and is located on chromosome 4 with reported cytogenetic location 4ql l-ql2.
  • the gene resides in genomic locus NT_022853 (NCBI Genome Annotation). Multiple alternatively spliced transcript variants have been described for this gene.
  • CPS 115 corresponds to which encodes KIAA0217 wliich encodes
  • KIAA0217 protein This gene has LocusID: 23185, and is located on chromosome 10 with reported cytogenetic location 1 Op 15.3. The gene resides in genomic locus NT_024115 (NCBI Genome Annotation).
  • CPS 116 corresponds to NCF1 which encodes neutrophil cytosolic factor 1
  • NCF1 (47kD, chronic granulomatous disease, autosomal 1). This gene has LocusID: 4687, and is located on chromosome 7 with reported cytogenetic location 7qll.23. The gene resides in genomic locus NT_034886 (NCBI Genome Annotation). NCF1 gene product can produce a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
  • CPS 116 also aligns with LOC220830 and LOC256379 with 95-99% sequence identity. Both genes reside in genomic locus NT 307758 on chromosome 7. Both genes encode proteins similar to neutrophil cytosolic factor 1 (47kD, chronic granulomatous disease, autosomal 1).
  • CPS 117 corresponds to KIAA1113 (TRIM33) which encodes tripartite motif-containing 33.
  • TAM33 This gene has LocusID: 51592, and is located on chromosome 1 with reported cytogenetic location lpl3.1. The gene resides in genomic locus NT_019273 (NCBI Genome Annotation).
  • the protein encoded by this gene is thought to be a transcriptional corepressor.
  • the protein is a member of the tripartite motif family.
  • the tripartite motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. At least three alternatively spliced transcript variants for this gene have been described.
  • CPS 118 corresponds to DKFZP566H073 which encodes DKFZP566H073 protein.
  • This gene has LocusID: 26001, and is located on chromosome 17 with reported cytogenetic location 17pl3.3. The gene resides in genomic . locus NT_033299 (NCBI).
  • the gene product contains a C3HC4 type (RING) zinc finger, and can mediate protein-protein interactions. ,
  • CPS 119 corresponds to ATP2B1 which encodes ATPase, Ca++ transporting, plasma membrane 1.
  • This gene has LocusID: 490, and is located on chromosome 12 with reported cytogenetic location 12q21-q23. The gene resides in genomic locus NT_009729 (NCBI Genome Annotation).
  • FLJ14075 with 100% sequence identity.
  • FLJ14075 encodes hypothetical protein FLJ14075, and has LocusID: 79954 with reported cytogenetic location 2p25.1.
  • CPS 120 corresponds to IFITM2 which encodes interferon induced transmembrane protein 2 (1-8D).
  • This gene has LocusID: 10581, and is located on chromosome 11 with reported cytogenetic location l lpl5.5. The gene resides in genomic locus NT_009407 (NCBI Genome Annotation). The expression of this gene can be induced by interferon.
  • CPS 121 corresponds to MNDA which encodes myeloid cell nuclear differentiation antigen. This gene has LocusID: 4332 , and is located on chromosome 1 with reported cytogenetic location lq22. The gene resides in genomic locus NT_004982
  • the myeloid cell nuclear differentiation antigen can be detected in nuclei of cells of the granulocyte-monocyte lineage.
  • a 200-amino acid region of the protein is similar to a region in the proteins encoded by a family of interferon-inducible mouse genes, designated Ifi-201, Ifi-202, and Ifi-203.
  • the MNDA mRNA which contains an interferon-stimulated response element in the 5-prime untranslated region, can be upregulated in human monocytes exposed to interferon alpha. MNDA is located within
  • MNDA resembles IFI16, suggesting that these genes participate in blood cell-specific responses to interferons.
  • CPS 122 corresponds to BTN3A2 which encodes butyrophilin, subfamily 3, member A2. This gene has LocusID: 11118, and is located on chromosome 6 with reported cytogenetic location 6p22.1. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation).
  • BTN3A3 encodes butyrophilin, subfamily 3, member A3, and has LocusID: 10384.
  • BTN3A1 encodes butyrophilin, subfamily 3, member Al, and has LocusID: 11119. Both genes are located at chromosome 6p22.1.
  • CPS 123 corresponds to KIAA0776 which encodes KIAA0776 protein. This gene has LocusID: 23376, and is located on chromosome 6 with reported cytogenetic location 6ql6.3. The gene resides in genomic locus NT_019424 (NCBI Genome Annotation).
  • CPS 124 corresponds to D6S2245E (HSD17B8) which encodes hydroxysteroid (17-beta) dehydrogenase 8.
  • This gene has LocusID: 7923, and is located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation). The protein encoded by this gene is similar to mouse Ke6 and is a member of the short-chain dehydrogenase superfamily. An alternatively spliced transcript of this gene has been detected.
  • CPS 125 corresponds to KIAA0628. This gene has LocusID: 9831, and is located on chromosome 8 with reported cytogenetic location 8q24.3. The gene resides in genomic locus NT_023684 (NCBI Genome Annotation). CPS 125 is in the 3' UTR of the gene.
  • CPS 126 corresponds to SELL which encodes selectin L (lymphocyte adhesion molecule 1).
  • This gene has LocusID: 6402, and is located on chromosome 1 with reported cytogenetic location Iq23-q25. The gene resides in genomic locus NT_O34405 (NCBI Genome Annotation).
  • Selectin L is a cell surface component that is a member of a family of adhesion homing receptors which are involved in leukocyte-endothelial cell interactions.
  • Selectin L is composed of multiple domains including one domain homologous to lectins, one to epidermal growth factor, and two to the consensus repeat units found in C3/C4 binding proteins. The protein can attach lymphocytes to lymph node high endothelial venules.
  • CPS 127 corresponds to MPN17 which encodes MpN17 transgene, murine homolog, glomerulosclerosis. This gene has LocusID: 4358, and is located on chromosome 2 with reported cytogenetic location 2p23-p21. The gene resides in genomic locus NT_005204 (NCBI Genome Annotation). The gene product is similar to murine Mpvl7. It is a predicted membrane protein and may be associated with nephrotic syndrome.
  • CPS 128 corresponds to FABP5 which encodes fatty acid binding protein 5
  • Fatty acid binding proteins are a family of small, conserved, cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. It is thought that FABPs roles include fatty acid uptake, transport, and metabolism.
  • Nucleotides 1 to 260 of CPS 128 have 100% sequence identity with an intron sequence of STX3A.
  • STX3A encodes syntaxin 3A, and has LocusID: 6809 with reported cytogenetic location l lql2.3.
  • STX3A resides in genomic locus NT_033903. The alignment between CPS 128 and STX3A is in the non-protein-coding of the gene.
  • Nucleotides 1 to 260 of CPS 128, or fragments thereof also align to various regions on chromosomes 7, 13 and 15 with 95-97% sequence identity.
  • nucleotides 1-260 of CPS 128, or fragments thereof align to various region on chromosomes 2, 4, 13, 15 and 22 with sequence identity 88-93%.
  • CPS 129 corresponds to CASP1 which encodes caspase 1, apoptosis-related cysteine protease (interleukin 1, beta, convertase).
  • This gene has LocusID: 834, and is located on chromosome 11 with reported cytogenetic location llq23. The gene resides in genomic locus NT_009151 (NCBI Genome Annotation).
  • the gene product is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis.
  • Caspases exist as inactive proenzymes which undergo proteolytic processing at conserved aspartic residues to produce 2 subunits, large and small, that dimerize to form the active enzyme.
  • the CASP1 gene product can proteolytically cleave and activate the inactive precursor of interlukin-1, a cytokine involved in the processes such as inflammation, septic shock, and wound healing.
  • CASP1 gene has been shown to induce cell apoptosis and may function in various developmental stages. It may have a role in the pathogenesis of Huntington disease. Alternative splicing of this gene results in at least five transcript variants encoding distinct isoforms.
  • CPS 130 corresponds to PTPN6 which encodes protein tyrosine phosphatase, non-receptor type 6. This gene has LocusID: 5777,, and is located on chromosome 12 with reported cytogenetic location 12p 13.
  • the gene resides in genomic locus NT_035206 (NCBI Genome Annotation).
  • the protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family.
  • PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation.
  • N-terminal part of PTPN6 gene product contains two tandem Src homolog (SH2) domains, which act as protein phospho-tyrosine binding domains and mediate the interaction of the gene product with its substrates.
  • PTPN6 gene product can be expressed in hematopoietic cells, and may function as a regulator of multiple signaling pathways in hematopoietic cells.
  • the gene product has been shown to interact with, and dephosphorylate a wide spectrum of phospho-proteins involved in hematopoietic cell signaling. At least three alternatively spliced variants of this gene, which encode distinct isoforms, have been reported.
  • CPS 130 also aligns with a chromosomal region near REA with at least 98% sequence identity.
  • REA encodes repressor of estrogen receptor activity. It has LocusID: 11331 and reported cytogenetic location 12pl3.
  • CPS 131 corresponds to SKAP-HOM (SCAP2) which encodes src family associated phosphoprotein 2.
  • SCAP2 SKAP-HOM
  • This gene has LocusID: 8935, and is located on chromosome 7 with reported cytogenetic location 7p21-pl5. The gene resides in genomic locus NT_007819 (NCBI Genome Annotation).
  • the protein encoded by this gene belongs to the src family kinases.
  • the encoded protein is similar to the src family associated phosphoprotein 1 and may function as an adaptor protein.
  • the encoded protein has coiled- coil and SH3 domains, and is thought to play a role in the src signaling pathway in various cells.
  • the SCAP2 gene is implicated in the processes of myeloid differentiation and growth arrest.
  • CPS 133 corresponds to GR02 which encodes GR02 oncogene.
  • This gene has LocusID: 2920, and is located on chromosome 4 with reported cytogenetic location 4q21. The gene resides in genomic locus NT_006216 (NCBI Genome Annotation). GR02 is similar to GROl.
  • the gene product is a chemotactic agent for polymorphonuclear leukocytes.
  • CPS 133 also aligns with GROl with 85% sequence identity.
  • GROl encodes GROl oncogene (melanoma growth stimulating activity, alpha), and has LocusID:
  • CPS 134 corresponds to RRAS which encodes related RAS viral (r-ras) oncogene homolog.
  • This gene has LocusID: 6237, and is located on chromosome 19 with reported cytogenetic location 19ql3.3-qter. The gene resides in genomic locus NT_011109
  • the gene product is a member of the ras family of GTP binding proteins.
  • CPS 135 corresponds to KIAA0022 which encodes KIAA0022 gene product.
  • This gene has LocusID: 9936, and is located on chromosome 2 with reported cytogenetic location 2q24.2.
  • the gene resides in genomic locus NT_005151 (NCBI Genome
  • CPS 136 corresponds to FLU which encodes flightless I homolog
  • This gene has LocusID: 2314, and is located on chromosome 17 with reported cytogenetic location 17pl l.2. The gene resides in genomic locus NT_030843
  • the gene product is a homolog of Drosophila flightless I. It has a gelsolin-like actin-binding domain and a leucine-rich interaction domain.
  • CPS 137 corresponds to PRFl which encodes perform 1 (pore forming protein). This gene has LocusID: 5551, and is located on chromosome 10 with reported cytogenetic location 10q22. The gene resides in genomic locus NT_024037 (NCBI).
  • Genome Annotation is a cytolytic, channel-forming protein which plays a role in clearing virally infected host cells and tumor cells.
  • CPS 138 corresponds to KIAA0102 which encodes KIAA0102 gene product.
  • This gene has LocusID: 9789, and is located on chromosome 11 with reported cytogenetic location l lql3.3. The gene resides in genomic locus NT_033927 (NCBI Genome
  • CPS 138 also aligns with a chromosomal region near PTAFR with 99% sequence identity.
  • PTAFR encodes platelet-activating factor receptor, and has LocusID: 5724 and reported cytogenetic location Ip35-p34.3. It resides in genomic locus NT_028053. Platelet-activating factor receptor is a G protein-coupled receptor.
  • CPS 138 has 86-94% sequence identity with various regions in chromosomes 5, 10, 11 and 15. For instance, CPS 138 aligns with a chromosome 15 region with 94% sequence identity.
  • the chromosome 15 region includes LOC255320 which encodes a protein similar to microsomal signal peptidase 25 kDa subunit (SPase 25 kDa subunit) (SPC25). ,
  • CPS 139 corresponds to UNK AA176780 (HSA249128) which encodes
  • DIPB protein This gene has LocusID: 54765, and is located on chromosome 11 with reported cytogenetic location lip 11.2. The gene resides in genomic locus NT_009237 (NCBI Genome Annotation). The gene product is member of the B-box zinc finger family, and contains a region of low similarity to a region of murine Mid2. CPS 139 aligns with the region 3' to the protein-coding strand of UNK_AA176780.
  • CPS 140 corresponds to PYGL which encodes phosphorylase, glycogen; liver (Hers disease, glycogen storage disease type VI). This gene has LocusID: 5836, and is located on chromosome 14 with reported cytogenetic location 14q21-q22. The gene resides in genomic locus NT_025892 (NCBI Genome Annotation).
  • CPS 141 corresponds to DBP which encodes D site of albumin promoter
  • albumin D-box (albumin D-box) binding protein.
  • This gene has LocusID: 1628, and is located on chromosome 19 with reported cytogenetic location 19ql3.3. The gene resides in genomic locus NT_011109 (NCBI Genome Annotation).
  • Albumin D-site-binding protein is a transcription factor which may play a role in the diurnal regulation of liver-specific genes. It is a member of the PAR (proline and acidic amino acid-rich) b/ZIP family.
  • CPS 143 corresponds to RANBP2L1 which encodes RAN binding protein 2- like 1.
  • This gene has LocusID: 84220, and is located on chromosome 2 with reported cytogenetic location 2ql2.3.
  • RAN is a small GTP-binding protein of the RAS superfamily that is associated with the nuclear membrane and is thought to control a variety of cellular functions through its interactions with other proteins.
  • RANBP2L1 gene shares sequence similarity with RANBP2, a large RAN-binding protein localized at the cytoplasmic side of the nuclear pore complex. It is believed that this RANBP2 gene family member arose from a duplication event 3 Mb distal to RANBP2. Alternative splicing has been observed for this locus and two variants are described. Additional splicing is suggested. [0232] CPS 143 also aligns with LOC220692 with 100% sequence identity.
  • LOC220692 encodes protein similar to RAN-binding protein 2-like 1, isoform 1 (sperm membrane protein BS-63). The gene is located at chromosome 2ql2.3 and resides in genomic locus NT_022135 (NCBI Genome Annotation).
  • CPS 143 has 94-96% sequence identity with KIAA0336, a chromosomal region near LOC256197, and an intron of LOC150821.
  • KIAA0336 encodes
  • KIAA0336 gene product has LocusID: 9648 and reported cytogenetic location 2ql2.2.
  • LOC256197 encodes a protein similar to ribosomal protein L22, and is located on chromosome 2.
  • LOCI 50821 encodes a protein similar to KIAA0336 gene product, and is located at chromosome 2pl 1.1.
  • CPS 144 corresponds to TNFSF10 which encodes tumor necrosis factor
  • LocusID 8743, and is located on chromosome 3 with reported cytogenetic location 3q26. The gene resides in genomic locus
  • TNFSF10 Tumor necrosis factor family cytokines function as mediators. of immune regulation and the inflammatory response.
  • TNFSF10 gene product is a Type II glycoprotein of the tumor necrosis factor ligand superfamily, and can mediate cell death.
  • CPS 145 corresponds to KIAA0854 which encodes KIAA0854 protein.
  • This gene has LocusID: 22882, and is located on chromosome 8 with reported cytogenetic location 8q24.13. The gene resides in genomic locus NT_023663 (NCBI Genome
  • CPS 146 corresponds to SDS which encodes serine dehydratase. This gene has LocusID: 10993, and is located on chromosome 12 with reported cytogenetic location
  • the gene resides in genomic locus NT_009601 (NCBI Genome Annotation).
  • Serine dehydratase catalyzes the PLP-dependent alpha,beta-elimination of L-serine to pyruvate and ammonia. It is one of three enzymes that are regarded as metabolic exits of the serine-glycine pool. Serine dehydratase can be found in the liver.
  • CPS 147 corresponds to KIAA0576 (COASTER) which encodes coactivator for steroid receptors.
  • This gene has LocusID: 26036, and is located on chromosome 6 with reported cytogenetic location ⁇ pl l.l. The gene resides in genomic locus NT_007592
  • CPS 148 corresponds to FCGR3B which encodes Fc fragment of IgG, low affinity Illb, receptor for (GDI 6).
  • the gene has LocusID: 2215, and is located at chromosome lq23.
  • FCGR3A encodes Fc fragment of IgG, low affinity Ilia, receptor for (CD 16).
  • the gene has LocusID: 2214, and is located on chromosome 1 with reported cytogenetic location lq23.
  • the gene resides in genomic locus NT_004668 (NCBI Genome Annotation).
  • the gene product is a Type III Fc gamma receptor and a member of the immunoglobulin superfamily. It can associate with zeta chain of the T-cell receptor complex (CD3Z).
  • CPS 149 corresponds to FCER1A which encodes Fc fragment of IgE, high affinity I, receptor for; alpha polypeptide.
  • the gene product is the alpha subunit of the high affinity IgE receptor, and may be involved in triggering allergic responses.
  • This gene has LocusID: 2205, and is located on chromosome 1 with reported cytogenetic location lq23. The gene resides in genomic locus NT_004982 (NCBI Genome Annotation).
  • the IgE receptor contains 3 subunits: alpha, beta (MIM 147138 ), and gamma (MIM 147139 ).
  • the alpha subunit can be glycosylated.
  • CPS 150 corresponds to CD44 which encodes CD44 antigen (homing function and Indian blood group system). This gene has LocusID: 960, and is located on chromosome 11 with reported cytogenetic location l ip 13. The gene resides in genomic locus NT_009237 (NCBI Genome Annotation).
  • CPS 151 corresponds to ID1 which encodes inhibitor of DNA binding 1, dominant negative helix-loop-helix protein.
  • This gene has LocusID: 3397, and is located on chromosome 20 with reported cytogenetic location 20ql l.
  • the gene resides in genomic locus NT_028392 (NCBI Genome Annotation).
  • the gene product is a member of the Id helix-loop-helix family of proteins, and may negatively regulate cell differentiation.
  • Blast search of the Entrez human genome sequence database shows that CPS
  • KIAA0963 overlaps KIAA0963.
  • CPS 152 aligns with the non-protein-coding strand of KIAA0963.
  • KIAA0963 encodes KIAA0963 protein, and has LocusID: 22904 with reported cytogenetic location 19pl3.3.
  • KIAA0963 resides in genomic locus NT_011277 (NCBI Genome Annotation).
  • CPS 153 corresponds to ADTB2 (AP2B1) which encodes adaptor-related protein complex 2, beta 1 subunit.
  • ADTB2 ADTB2
  • This gene has LocusID: 163, and is located on chromosome 17 with reported cytogenetic location 17ql l.2-ql2.
  • the gene resides in genomic locus NT_010799 (NCBI Genome Annotation).
  • the beta adaptin subunit is part of the clathrin coat assembly complex which links clathrin to receptors in coated pits and vesicles. These vesicles are involved in endocytosis and Golgi processing.
  • the beta 1 subunit is one of the assembly proteins which binds to clathrin and initiates coat formation.
  • CPS 154 corresponds to MADH3 which encodes MAD, mothers against decapentaplegic homolog 3 (Drosophila).
  • This gene has LocusID: 4088, and is located on chromosome 15 with reported cytogenetic location 15q21-q22.
  • the gene resides in genomic locus NT_010265 (NCBI Genome Annotation).
  • the gene product is similar to murine Madh3. It is a member of the Smad family of proteins, and may affect transcription in response to TGF-beta signaling pathways.
  • CPS 155 corresponds to ARPC1B which encodes actin related protein 2/3 complex, subunit IB (41 kD). This gene has LocusID: 10095, and is located on chromosome 7 with reported cytogenetic location 7ql 1.21. The gene resides in genomic locus NT_007933 (NCBI Genome Annotation). The gene product is involved in assembly of the actin cytoskeleton, and may have a role in protrusion of lamellipodia. [0247] CPS 155 also aligns with two chromosome 12 regions with 99-100% sequence identity. These two regions reside in genomic locus NT_035283 (NCBI Genome Annotation).
  • the first region is located near LOG 196489 which encodes a protein similar to FLJ00209 protein.
  • the second region is located near LOCI 44584 which encodes a protein similar to vacuolar protein sorting 35 (yeast); maternal-embryonic 3. Both LOC196489 and LOC144584 have reported cytogenetic location 12ql3.12. [0248] Affymetrix annotation suggests that CPS 156 corresponds to TALDOl which encodes transaldolase 1 and has LocusID: 6888. The gene is located at chromosome Ilpl5.5-pl5.4.
  • CPS 157 corresponds to UNK_AL035079 (CAT) which encodes catalase.
  • CPS 158 corresponds to STXBP1 which encodes syntaxin binding protein 1.
  • CPS 159 corresponds to UNG2 which encodes uracil-DNA glycosylase 2.
  • CPS 160 corresponds to DUSP5 which encodes dual specificity phosphatase
  • This gene has LocusID: 1847, and is located on chromosome 10 with reported cytogenetic location 10q25. The gene resides in genomic locus NT_030059 (NCBI Genome Annotation).
  • the protein encoded by DUSP5 gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases can inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They can negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and differentiation.
  • MAP mitogen-activated protein
  • DUSP5 gene product can inactivate ERK1, and is expressed in a variety of tissues including pancreas and brain.
  • CPS 161 corresponds to PTPRO which encodes protein tyrosine phosphatase, receptor type, O.
  • This gene has LocusID: 5800, and is located on chromosome 12 with reported cytogenetic location 12pl3.3-pl3.2. The gene resides in genomic locus NT_009714 (NCBI Genome Annotation).
  • the gene product is an integral membrane protein containing a transmembrane domain and an intracellular catalytic domain with a characteristic signature motif.
  • Several alternatively spliced transcript variants, some of which encode different isoforms of the protein, have been described.
  • the gene product also contains fibronectin type Ill-like repeats and putative glycosylation sites in the extracellular domain.
  • CPS 162 corresponds to SLA which encodes Src-like-adaptor and has LocusID: 6503.
  • the gene has reported cytogenetic location at chromosome 8q24.
  • TG encodes thyroglobulin, and has LocusID: 7038 with reported cytogenetic location 8q24.2-q24.3.
  • the gene resides in genomic locus NT_008150 (NCBI Genome Annotation).
  • CPS 162 matches with the non-protein-coding strand of TG.
  • Thyroglobulin is a precursor of thyroid hormones.
  • CPS 163 corresponds to RFP2 which encodes ret finger protein 2. This gene has LocusID: 10206, and is located on chromosome 13 with reported cytogenetic location
  • the gene resides in genomic locus NT_033922 (NCBI Genome Annotation).
  • the protein encoded by RFP2 gene is a member of the tripartite motif (TRIM) family.
  • TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type
  • the RFP2 gene product localizes to cytoplasmic bodies near the nucleus.
  • the gene is located on chromosome 13 within the minimal deletion region for B- cell chronic lymphocytic leukemia. Alternative splicing of this gene has been described.
  • Nucleotides 3 to 255 of CPS 163 have 83% sequence identity with a chromosomal region near LOCI 62162.
  • LOCI 62162 encodes a protein similar to spinocerebellar ataxia type 1, and has reported cytogenetic location 16q23.1.
  • CPS 164 corresponds to ADFP which encodes adipose differentiation-related protein (adipophilin).
  • ADFP adipose differentiation-related protein
  • This gene has LocusID: 123, and is located on chromosome 9 with reported cytogenetic location 9p21.2. The gene resides in genomic locus NT_023974
  • Adipocyte differentiation-related protein is associated with the globule surface membrane material.
  • the protein is a major constituent of the globule surface. Increase in mRNA levels is one of the earliest indications of adipocyte differentiation.
  • the product is also a component of milk lipid globules.
  • CPS 164 aligns with a chromosome 1 region near LOC254424 with 94% sequence identity.
  • LOC254424 encodes a protein similar to adipophilin (adipose differentiation-related protein) (ADRP).
  • ADRP adipophilin (adipose differentiation-related protein)
  • CPS 165 corresponds to ADTD (AP3D1) which encodes adaptor-related protein complex 3, delta 1 subunit.
  • ADTD ADTD
  • This gene has LocusID: 8943, and is located on chromosome 19 with reported cytogenetic location 19pl3.3. The gene resides in genomic locus NT_011268 (NCBI Genome Annotation). Delta-adaptin is a component of the AP-3 complex which is involved in intracellular transport.
  • CPS 166 corresponds to ADTG which is also known as AP1G1.
  • the gene encodes adaptor-related protein complex 1, gamma 1 subunit.
  • the gene has LocusID: 164, and is located at chromosome 16q23.
  • LOC255980 aligns to LOC255980 with at least 99% sequence identity.
  • LOC255980 encodes a protein similar to hypothetical protein FLJ20151, and is located on chromosome 15.
  • LOC255980 resides in genomic locus NT_010265. It overlaps FLJ20151 which has LocusID: 54837 and reported cytogenetic location 15q21.3.
  • CPS 168 corresponds to UNK_X87344 (PSMB8) which encodes proteasome
  • CPS 170 corresponds to CAPG which encodes capping protein (actin filament), gelsolin-like.
  • This gene has LocusID: 822, and is located on chromosome 2 with reported cytogenetic location 2cen-q24.
  • the gene resides in genomic locus NT_015805 (NCBI Genome Annotation).
  • the gene product is a macrophage capping protein. It can reversibly block the barbed ends.
  • the gene product is a member of the gelsolin/villin protein family.
  • CPS 171 corresponds to ARL7 which encodes ADP-ribosylation factor-like
  • ADP-ribosylation factor-like 7 is a member of the ADP- ribosylation factor family of GTP-binding proteins.
  • ARL7 is similar to ARL4 and ARL6, and each has a nuclear localization signal and a high guanine nucleotide exchange rate.
  • CPS 172 corresponds to RCNl which encodes recoverin. This gene has
  • CPS 173 corresponds to GR03 which encodes GR03 oncogene. This gene has LocusID: 2921, and is located on chromosome 4 with reported cytogenetic location 4q21. The gene resides in genomic locus NT_006216 (NCBI Genome Annotation). GR03 is similar to human GROl, and may encode a mitogenic factor.
  • CPS 175 corresponds to UNK_AF070606 (ATP2B1) which encodes
  • This gene has LocusID: 490, and is located on chromosome 12 with reported cytogenetic location 12q21-q23. The gene resides in genomic locus NT_009729 (NCBI Genome Annotation). CPS 175 aligns with the 3' UTR of the protein-coding strand of the gene.
  • CPS 176 corresponds to AOAH which encodes acyloxyacyl hydrolase
  • CPS 177 corresponds to UNK_AJ224442 (WBSCR22) which encodes
  • Williams Beuren syndrome chromosome region 22 This gene has LocusID: 114049, and is located on chromosome 7. The gene resides in genomic locus NT_007758 (NCBI Genome Annotation).
  • CPS 178 corresponds to MSR1 which encodes macrophage scavenger receptor 1.
  • This gene has LocusID: 4481, and is located on chromosome 8 with reported cytogenetic location 8p22. The gene resides in genomic locus NT_015280 (NCBI Genome Annotation).
  • This gene encodes the class A macrophage scavenger receptors, which include at least three different types (1, 2, 3) generated by alternative splicing of this gene. These receptors or isoforms are macrophage-specific trimeric integral membrane glycoproteins and have been implicated in many macrophage-associated physiological and pathological processes including atherosclerosis, Alzheimer's disease, and host defense.
  • the isoforms type 1 and type 2 are functional receptors and are able to mediate the endocytosis of modified low density lipoproteins (LDLs).
  • the isoform type 3 may have an altered intracellular processing and can be trapped within the endoplasmic reticulum.
  • the isoform type 3 can inhibit the function of isoforms type 1 and type 2 when co-expressed, indicating a dominant negative effect and suggesting a mechanism for regulation of scavenger receptor activity in macrophages.
  • CPS 179 corresponds to XAP4 (C20orfl8) which encodes chromosome 20 open reading frame 18.
  • This gene has LocusID: 10616, and is located on chromosome 20 with reported cytogenetic location 20pl3.
  • the gene resides in genomic locus NT_011387 (NCBI Genome Annotation).
  • the gene contains at least 12 exons.
  • Several alternatively spliced transcript variants have been described.
  • the gene product is similar to mouse UIP28/UbcM4 interacting protein at the amino acid level, and contains a C3HC4 type (RING) zinc finger. It may interact with PKC and mediate protein-protein interactions.
  • CPS 180 corresponds to C1QR1 which encodes complement component 1, q subcomponent, receptor 1.
  • This gene has LocusID: 22918, and is located on chromosome 20 with reported cytogenetic location 20p 11.21. The gene resides in genomic locus NT_011387 (NCBI Genome Annotation).
  • the gene product is a type I membrane protein and can act as a receptor for complement protein Clq, mannose-binding lectin, and pulmonary surfactant protein A.
  • the gene product is a functional receptor involved in ligand-mediated enhancement of phagocytosis.
  • CPS 181 corresponds to STAT4 which encodes signal transducer and activator of transcription 4.
  • This gene has LocusID: 6775, and is located on chromosome 2 with reported cytogenetic location 2q32.2-q32.3. The gene resides in genomic locus NT_022197 (NCBI Genome Annotation).
  • the gene product is a member of the STAT family of transcription factors. In response to cytokines and growth factors, STAT family members may be phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators.
  • STAT4 gene product may be involved in mediating responses to IL12 in lymphocytes and regulating the differentiation of T helper cells.
  • CPS 182 corresponds to UNK_AL049309 (SFRS12) which encodes splicing factor, arginine/serine-rich 12.
  • SFRS12 splicing factor
  • This gene has LocusID: 140890, and is located on chromosome 5 with reported cytogenetic location 5ql2.3. The gene resides in genomic locus NT_006431 (NCBI Genome Annotation).
  • CPS 183 corresponds to RNASE2 which encodes ribonuclease, RNase A family, 2 (liver, eosinophil-derived neurotoxin). This gene has LocusID: 6036, and is located on chromosome 14 with reported cytogenetic location 14q24-q31. The gene resides in genomic locus NT_025892 (NCBI Genome Annotation). The gene product is a member of ribonuclease superfamily, and has neurotoxic and ribonuclease activities. [0277] CPS 183 also aligns with LOCI 22661 and RNASE3 with about 86-93% sequence identity.
  • LOG 122661 encodes a protein similar to nonsecretory ribonuclease precursor (Ribonuclease US) (Eosinophil-derived neurotoxin) (RNase UpI-2) (Ribonuclease 2) (RNase 2). It is located at chromosome 14ql l.l, and resides in genomic locus NT_025892.
  • RNASE3 encodes ribonuclease, RNase A family, 3 (eosinophil cationic protein). RNASE3 has LocusID: 6037 and reported cytogenetic location 14q24-q31. It also resides in genomic locus NT_025892.
  • RNASE3 gene product is a member of the ribonuclease superfamily, and has both neurotoxic and ribonuclease activities.
  • CPS 184 corresponds to NCF4 which encodes neutrophil cytosolic factor 4
  • This gene has LocusID: 4689, and is located on chromosome 22 with reported cytogenetic location 22ql3.1.
  • the gene resides in genomic locus NT_011520 (NCBI Genome Annotation).
  • the protein encoded by this gene is a cytosolic element of nicotinamide adenine dinucleotide phosphate-oxidase. Upon neutrophil stimulation, the encoded protein and other cytosolic elements are sent to the cell membrane from the cytosol to form a complex which produces phagocytic oxygen radicals.
  • Two motifs found in this protein, an SH3 domain and a PC motif, are significant to protein-protein interactions. Through interaction with the SH3 domain, NCF4 gene product is responsible for the downregulation of NADPH-oxidase. Alternative splicing has been observed in this gene.
  • CPS 184 aligns to the 3' UTR of the gene.
  • CPS 185 corresponds to ANXA4 which encodes annexin A4. This gene has
  • Annexin A4 belongs to the annexin family of calcium-dependent phospholipid binding proteins. Several members of the annexin family have been implicated in membrane-related events along exocytotic and endocytotic pathways. Annexin A4 may interact with ATP, have in vitro anticoagulant activity, and inhibit phospholipase A2 activity. Annexin A4 can be detected in epithelial cells.
  • CPS 186 corresponds to ME2 which encodes malic enzyme 2, NAD(+)- dependent, mitochondrial.
  • This gene has LocusID: 4200, and is located on chromosome 18 with reported cytogenetic location 18q21. The gene resides in genomic locus NT_033905 (NCBI Genome Annotation).
  • the gene product is a homotetrameric protein which can catalyze the oxidative decarboxylation of malate to pyruvate.
  • CPS 183 also has 91% sequence identity with a chromosome 9 region near
  • LOC 169570 has reported cytogenetic location 9p 13.1.
  • CPS 187 corresponds to IFI35 which encodes interferon-induced protein 35.
  • This gene has LocusID: 3430, and is located on chromosome 17 with reported cytogenetic location 17q21.
  • the gene resides in genomic locus NT_035490 (NCBI Genome Annotation).
  • Interferon-induced protein 35 associates with B-ATF transcription factor on interferon treatment. It contains a leucine-zipper motif.
  • CPS 188 corresponds to NAMP5 which encodes vesicle-associated membrane protein 5 (myobrevin).
  • This gene has LocusID: 10791, and is located on chromosome 2 with reported cytogenetic location 2pll.2.
  • the gene resides in genomic locus ⁇ T_015805 (NCBI Genome Annotation).
  • the gene product is a member of the synaptobrevin/VAMP family of proteins.
  • CPS 188 also has 87% sequence identity with an intron sequence of NSF.
  • NSF encodes N-ethylmaleimide-sensitive factor. It has LocusID: 4905 and reported cytogenetic location 17q21.
  • CPS 189 corresponds to IMPA2 which encodes inositol(myo)-l(or 4)- monophosphatase 2.
  • This gene has LocusID: 3613, and is located on chromosome 18 with reported cytogenetic location 18pl l.2. The gene resides in genomic locus NT_010859 (NCBI Genome Annotation).
  • CPS 190 corresponds to GZMK which encodes granzyme K (serine protease, granzyme 3; tryptase II).
  • This gene has LocusID: 3003, and is located on chromosome 5 with reported cytogenetic location 5ql l-ql2. The gene resides in genomic locus NT_006431 (NCBI Genome Annotation).
  • This gene product is a member of a group of related serine proteases from the cytoplasmic granules of cytotoxic lymphocytes. Cytolytic T lymphocytes (CTL) and natural killer (NK) cells can recognize, bind, and lyse specific target cells. They are thought to protect their host by lysing cells bearing on their surface "nonself ' antigens, usually peptides or proteins resulting from infection by intracellular pathogens.,
  • CPS 191 corresponds to BGN which encodes biglycan. This gene has
  • LocusID 633, and is located on chromosome X with reported cytogenetic location Xq28.
  • the gene resides in genomic locus NT_025965 (NCBI Genome Annotation).
  • the protein encoded by this gene is a small cellular or pericellular matrix proteoglycan that is related in structure to two other small proteoglycans, decorin and fibromodulin.
  • the encoded protein and decorin are thought to be the result of a gene duplication.
  • Decorin contains one attached glycosaminoglycan chain, while biglycan probably contains two chains. Biglycan is thought to function in connective tissue metabolism by binding to collagen fibrils and transfering growth factor-beta. It may promote neuronal survival.
  • This gene is a candidate gene for the Happle syndrome.
  • CPS 192 corresponds to AIF1 which encodes allograft inflammatory factor
  • This gene has LocusID: 199, and is located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI Genome
  • This gene is induced by cytokines and interferon. Its protein product is thought to be involved in negative regulation of growth of vascular smooth muscle cells, which contributes to the anti-inflammatory response to vessel wall trauma.
  • the gene expresses at least three transcripts.
  • CPS 193 corresponds to CBP2 (SERPINH2) which encodes serine (or cysteine) proteinase inhibitor, clade H (heat shock protein 47), member 2. This gene has
  • LocusID 872, and is located on chromosome 11 with reported cytogenetic location
  • the gene resides in genomic locus NT_033927 (NCBI Genome Annotation).
  • the gene product is also known as Colligin-2, which is a collagen-binding protein that acts as a heat shock protein.
  • CPS 193 also has 92% sequence identity with a chromosome 9 region near pshsp47.
  • Pshsp47 encodes serine (or cysteine) proteinase inhibitor, clade H (heat shock protein 47), member 2 pseudogene. It has LocusID: 158172 and reported cytogenetic location 9p 11.2.
  • CPS 195 corresponds to IQGAP1 which encodes IQ motif containing GTPase activating protein 1.
  • the gene has LocusID: 8826 with reported cytogenetic location 15q26.1.
  • IQGAP1 encodes IQ motif containing GTPase activating protein 1.
  • the gene has LocusID:
  • the gene 8826 is located on chromosome 15 with reported cytogenetic location 15q26.1.
  • the gene resides in genomic locus NT_033276.
  • the gene product contains a GTPase activating domain and multiple calmodulin binding domains, and can bind to actin cytoskeleton and inhibit GTPase activity of ras family of GTP binding proteins Cdc42Hs and rac.
  • CPS 196 corresponds to KIAA0823 (PPP1R16B) which encodes protein phosphatase 1, regulatory (inhibitor) subunit 16B.
  • This gene has LocusID: 26051, and is located on chromosome 20 with reported cytogenetic location 20ql 1.23. The gene resides in genomic locus NT_011362 (NCBI Genome Annotation).
  • CPS 197 corresponds to UNK AI018098 (MGC 15523) which encodes hypothetical protein MGC 15523. This gene has LocusID: 124565, and is located on chromosome 17 with reported cytogenetic location 17q25.3. The gene resides in genomic locus NT_010661 (NCBI Genome Annotation). [0295] Affymetrix annotation suggests that CPS 198 corresponds to CD 163.
  • CD163 encodes CD163 antigen, and has LocusID: 9332. The gene is located at chromosome 12pl3.3.
  • CPS 199 corresponds to HK3 which encodes hexokinase 3 (white cell).
  • This gene has LocusID: 3101, and is located on chromosome 5 with reported cytogenetic location 5q35.2. The gene resides in genomic locus NT_023132 (NCBI Genome Annotation).
  • Hexokinases phosphorylate glucose to produce glucose-6-phosphate, thus committing glucose to the glycolytic pathway.
  • HK3 gene encodes hexokinase 3 which is similar to hexokinases 1 and 2. Hexokinase 3 can be inhibited by its product glucose-6- phosphate.
  • CPS 200 corresponds to FOS which encodes v-fos FBJ murine osteosarcoma viral oncogene homolog.
  • This gene has LocusID: 2353, and is located on chromosome 14 with reported cytogenetic location 14q24.3. The gene resides in genomic locus NT_026437 (NCBI Genome Annotation).
  • the Fos gene family has 4 members: FOS, FOSB, FOSL1, and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation. In some cases, expression of the FOS gene has also been associated with apoptotic cell death. FOS gene product may also be involved in alteration of DNA methylation.
  • CPS 201 corresponds to DIFF48 (C6orf32) which encodes chromosome 6 open reading frame 32.
  • This gene has LocusID: 9750, and is located on chromosome 6 with reported cytogenetic location 6p22.3-p21.32.
  • the gene resides in genomic locus NT_007592 (NCBI Genome Annotation).
  • the protein encoded by DIFF48 can stimulate the formation of a non-mitotic multinucleate syncytium from proliferative cytotrophoblasts during trophoblast differentiation. An alternatively spliced transcript variant of this gene has been described.
  • CPS 202 corresponds to UNK_L23134 (GZMM) which encodes granzyme
  • M lymphocyte met-ase 1).
  • This gene has LocusID: 3004, and is located on chromosome 19 with reported cytogenetic location 19pl3.3. The gene resides in genomic locus NT_011227 (NCBI Genome Annotation).
  • Human natural killer (NK) cells and activated lymphocytes express and store a distinct subset of neutral serine proteases together with proteoglycans and other immune effector molecules in large cytoplasmic granules.
  • These serine proteases are collectively termed granzymes and include at least four distinct gene products: granzyme A, granzyme B, granzyme H, and Met-ase which is also known as granzyme M.
  • CPS 203 corresponds to LM02 which encodes LIM domain only 2
  • LM02 encodes a cysteine-rich, two LIM-domain protein that is involved in yolk sac erythropoiesis.
  • the LM02 protein has a role in hematopoietic development.
  • the LM02 transcription start site is located approximately 25 kb downstream from the l ip 13 T-cell translocation cluster (lip 13 ttc), where a number T-cell acute lymphoblastic leukemia-specific translocations occur.
  • the LM02 protein is a member of the rhombotin family.
  • CPS 204 corresponds to PDI2 (PADI2) which encodes peptidyl arginine deiminase, type II.
  • PADI2 PADI2
  • This gene has LocusID: 11240, and is located on chromosome 1 with reported cytogenetic location Ip35.2-p35.1.
  • the gene resides in genomic locus NT_034376 (NCBI Genome Annotation).
  • the gene product is similar to rat skeletal muscle peptidyl ! arginine deiminase, type II. It may convert arginine residues within proteins to citrulline residues.
  • CPS 205 corresponds to UNK AL031282 (FLJ13052) which encodes NAD kinase.
  • This gene has LocusID: 65220, and is located on chromosome 1 with reported cytogenetic location Ip36.33-p36.21. The gene resides in genomic locus NT_004350 (NCBI Genome Annotation).
  • CPS 205 also aligns with a chromosomal region near MMP23B with at least
  • MMP23B encodes matrix metalloproteinase 23B, and has LocusID: 8510 and reported cytogenetic location lp36.3. The gene resides in genomic locus NT_004350.
  • CPS 205 aligns with the non-protein-coding strand of MMP23B.
  • CPS 206 corresponds to GABRG2 wliich encodes gamma-aminobutyric acid
  • GABA GABA A receptor, gamma 2.
  • This gene has LocusID: 2566, and is located on chromosome 5 with reported cytogenetic location 5q31.1-q33.1.
  • the gene resides in genomic locus NT_030698 (NCBI Genome Annotation).
  • the gamma-aminobutyric acid (GABA) A receptor, gamma 2 is found as an inhibitory neurotransmitter receptor in the brain.
  • CPS 207 corresponds to KIAA0404 which encodes KIAA0404 protein.
  • This gene has LocusID: 23130, and is located on chromosome 11 with reported cytogenetic location l lql3.1.
  • the gene resides in genomic locus NT_033241 (NCBI Genome Annotation).
  • CPS 208 corresponds to UNK U12471 (THBS1) which encodes thrombospondin 1.
  • This gene has LocusID: 7057, and is located on chromosome 15 with reported cytogenetic location 15ql5. The gene resides in genomic locus NT_030828 (NCBI Genome Annotation).
  • Thrombospondin- 1 has a role in blood clotting and in angiogenesis. It is a member of a family of adhesive molecules.
  • CPS 209 corresponds to IGHA1 which encodes immunoglobulin heavy constant alpha 1.
  • the gene has LocusID: 3493 and reported cytogenetic location 14q32.33.
  • CPS 210 corresponds to ABCB1 which encodes ATP-binding cassette, subfamily B (MDR/TAP), member 1.
  • MDR/TAP subfamily B
  • This gene has LocusID: 5243, and is located on chromosome 7 with reported cytogenetic location 7q21.1. The gene resides in genomic locus NT_007933 (NCBI Genome Annotation).
  • the membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).
  • ABCB1 gene This protein encoded by ABCB1 gene is a member of the MDR/TAP subfamily. Members of the MDR/TAP subfamily are involved in multidrug resistance.
  • the ABCB gene product is an ATP-dependent drug efflux pump for xenobiotic compounds with broad substrate specificity. It is responsible for decreased drug accumulation in multidrug- resistant cells and often mediates the development of resistance to anticancer drugs. It can also function as a transporter in the blood-brain barrier.
  • the ABCB gene product is also known as P glycoprotein 1.
  • CPS 211 corresponds to NRIPl which encodes nuclear receptor interacting protein 1. This gene has LocusID: 8204, and is located on chromosome 21 with reported cytogenetic location 21ql l.2. The gene resides in genomic locus NT_011512 (NCBI Genome Annotation).
  • Nuclear receptor interacting protein 1 is a nuclear protein that can interact with the hormone-dependent activation domain AF2 of nuclear receptors. Also known as RIP 140, this protein modulates transcriptional activity of the estrogen receptor.
  • CPS 213 corresponds to GLB1 which encodes galactosidase, beta 1. This gene has LocusID: 2720, and is located on chromosome 3 with reported cytogenetic location 3p21.33.
  • CPS 214 corresponds to MSC which encodes musculin (activated B-cell factor-1). This gene has LocusID: 9242, and is located on chromosome 8 with reported cytogenetic location 8q21.
  • the gene resides in genomic locus NT_034895 (NCBI Genome Annotation).
  • the gene product contains a bHLH motif, a putative nuclear localization signal, a glycine-rich region, and a stretch of acidic residues.
  • the gene product is capable of binding an E-box element either as a homodimer or as a heterodimer with E2A in vitro, and forms heterodimers with E2A proteins in vivo. It also contains a transcriptional repression domain and is capable of inhibiting the transactivation capability of E47, an E2A protein, in mammalian cells. MSC is thought to be a downstream target of the B-cell receptor signal transduction pathway.
  • CPS 216 corresponds to UNK J95006 (MGC14480) which encodes hypothetical protein MGC 14480. This gene has LocusID: 201254, and is located on chromosome 17 with reported cytogenetic location 17q25.3. The gene resides in genomic locus NT_035479 (NCBI Genome Annotation).
  • Nucleotides 400 to 561 of CPS 216 also have 84% sequence identity with an intron sequence of EG1.
  • EG1 encodes endothelial-derived gene 1, and has LocusID: 80306 and reported cytogenetic location 4pl6.
  • CPS 217 corresponds to NK4 which encodes natural killer cell transcript 4.
  • This gene has LocusID: 9235, and is located on chromosome 16 with reported cytogenetic location 16pl3.3. The gene resides in genomic locus NT_010552 (NCBI Genome Annotation). The gene product may play a role in cell adhesion. It contains an RGD motif. [0315] CPS 217 also has 96% sequence identity with LOC 124213 which encodes a protein similar to natural killer cell transcript 4. LOC 124213 is located at chromosome 16p 13.11, and also resides in genomic locus NT_010552.
  • CPS 218 corresponds to UNK_Z24724 (FLJ20986) which encodes hypothetical protein FLJ20986.
  • This gene has LocusID: 79572, and is located on chromosome 3 with reported cytogenetic location 3q29. The gene resides in genomic locus NT_005535 (NCBI Genome Annotation).
  • the alignment between CPS 218 and UNK_Z24724 is within an intron of GP5 which encodes glycoprotein V (platelet) and has
  • CPS 219 corresponds to GRN which encodes granulin. This gene has
  • LocusID 2896, and is located on chromosome 17 with reported cytogenetic location
  • the gene resides in genomic locus NT_010755 (NCBI Genome Annotation).
  • Granulin is a putative growth factor. It is cysteine rich and contains multiple granulin repeats. ,
  • CPS 220 corresponds to EREG which encodes epiregulin. This gene has
  • LocusID 2069, and is located on chromosome 4 with reported cytogenetic location 4ql3.3.
  • Epiregulin is a member of the epidermal growth factor family. Epiregulin can function as a ligand of
  • EGFR epidermal growth factor receptor
  • ERBB v-erb-b2 oncogene homolog
  • Epiregulin may promote cell proliferation.
  • CPS 221 corresponds to KIAA0382 (ARHGEF12) which encodes Rho guanine nucleotide exchange factor (GEF) 12.
  • This gene has LocusID: 23365, and is located on chromosome 11 with reported cytogenetic location 1 lq23.3. The gene resides in genomic locus NTJ333899 (NCBI Genome Annotation).
  • Rho GTPases play a role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors.
  • Rho guanine nucleotide exchange factor (GEF) 12 may form a complex with G proteins and stimulate Rho-dependent signals. This protein is observed to form myeloid lymphoid fusion partner in acute myeloid leukemia.
  • CPS 222 corresponds to RNASE6 which encodes ribonuclease, RNase A family, k6.
  • This gene has LocusID: 6039, and is located on chromosome 14 with reported cytogenetic location 14ql 1.1. The gene resides in genomic locus NT_025892 (NCBI).
  • Ribonuclease k6 may function in host defense. It is a member of eosinophil-derived neurotoxin ribonuclease A superfamily.
  • CPS 223 corresponds to KIAA0442 which encodes autism-related protein 1.
  • This gene has LocusID: 26053, and is located on chromosome 7 with reported cytogenetic location 7pl3. The gene resides in genomic locus NT_007758 (NCBI Genome
  • Blast search of the Entrez human genome sequence database shows that nucleotides 35 to 1907of CPS 225 align with various chromosomal regions with 89-90% sequence identity. These regions include LOC255718, LOC140070, and a chromosome 2 region between GPR39 and LOC255521. LOC255718 is located on chromosome 2 and has genomic locus NT_034486 (NCBI Genome Annotation). LOC 140070 encodes a protein similar to Protein CDC27Hs (Cell division cycle protein 27 homolog) (H-NUC). It is located on chromosome Yql l.l and has genomic locus NT_011878.
  • GPR39 and LOC255521 reside in genomic locus NT_034487.
  • GPR39 encodes G protein-coupled receptor 39, and has LocusID: 2863 and reported cytogenetic location 2q21-q22.
  • Nucleotides 1317 to 1481 of CPS 225 have 98% sequence identity with a chromosome 12 region near LOC256746.
  • LOC256746 encodes phosphodiesterase 3 A, cGMP-inhibited.
  • Nucleotides 1381 to 1418 and 1461 to 1502 have 100% sequence identity with an intron sequence of LOC126500.
  • LOC126500 encodes a protein similar to hypothetical zinc finger protein KIAA1473. It is located on chromosome 19pl3.11, and resides in genomic locus NT_033317.
  • Other fragments of CPS 225 have 84-93% sequence identity with regions on chromosomes 7, 11, 14, 18 and 21.
  • ATP8A2 aligns to a region located in an intron of ATP8A.
  • CPS 226 aligns with the non-protein- coding strand of ATP8A2.
  • ATP8A2 encodes ATPase, aminophospholipid transporter-like, Class I, type 8 A, member 2.
  • ATP8A2 gene has LocusID: 51761, and is located on chromosome 13 with reported cytogenetic location 13ql2-13.
  • ATP8A2 resides in genomic locus NT_009799 (NCBI Genome Annotation).
  • ATP8A2 gene product may be a tumor suppressor. Loss of its function can convert cells to tumorigenic phenotype.
  • CPS 227 corresponds to ARHC which encodes ras homolog gene family, member C. This gene has LocusID: 389, and is located on chromosome 1 with reported cytogenetic location Ip21-pl3. The gene resides in genomic locus NT_019273 (NCBI Genome Annotation). The gene product can regulate reorganization of the actin cytoskeleton. CPS 227 aligns with the 3' UTR region of ARHC. The alignment is also 3' to MOV10 which encodes MovlO, Moloney leukemia virus 10, homolog (mouse). MOV10 has LocusID: 4343 and reported cytogenetic location lpl2. It resides in genomic locus NT_019273.
  • CPS 228 corresponds to CREM which encodes cAMP responsive element modulator.
  • This gene has LocusID: 1390, and is located on chromosome 10 with reported cytogenetic location 10pl2.1-pll.l. The gene resides in genomic locus NT_033896-(NCBI Genome Annotation). Cyclic AMP-responsive element modulator is a regulator of the transcription of cAMP-inducible genes.
  • CPS 229 corresponds to ISG15 which encodes interferon-stimulated protein
  • This gene has LocusID: 9636, and is located on chromosome 1 with reported cytogenetic location lp36.33. The gene resides in genomic locus NT_004350 (NCBI).
  • ISG15 can be induced by interferon.
  • CPS 230 corresponds to CCR7 which encodes chemokine (C-C motif) receptor 7.
  • This gene has LocusID: 1236, and is located on chromosome 17 with reported cytogenetic location 17ql2-q21.2 .
  • the gene resides in genomic locus NT_024901 (NCBI).
  • the gene product is a G protein-coupled receptor, and can bind to
  • CPS 231 corresponds to Fl l which encodes coagulation factor XI (plasma thromboplastin antecedent). This gene has LocusID: 2160, and is located on chromosome 4 with reported cytogenetic location 4q35. The gene resides in genomic locus NT_022792
  • transcripts encode the circulating plasma factor XI and an alternate transcript lacking exon 5 encodes the platelet factor XI.
  • the plasma factor XI is present in plasma as a zymogen.
  • the platelet factor XI is localized to platelets and megakaryocytes, and may play a role both in the maintenance of normal hemostasis and as a substitute for plasma factor XI.
  • LOC 166760 aligns with a chromosome 4 region near LOC 166760.
  • This chromosome 14 region resides in genomic locus NT_006344 (NCBI Genome Annotation).
  • LOC 166760 has reported cytogenetic location 4pl5.32.
  • CPS 233 corresponds to UNK AC002073 (MGC17330) which encodes hypothetical protein MGC17330.
  • MGC17330 This gene has LocusID: 113791, and is located on chromosome 22 with reported cytogenetic location 22qll.2-q22. The gene resides in genomic locus NT_011520 (NCBI Genome Annotation).
  • CPS 234 corresponds to STK17A which encodes serine/threonine kinase 17a
  • This gene has LocusID: 9263, and is located on chromosome 7 with reported cytogenetic location 7pl2-pl4. The gene resides in genomic locus NT_007819
  • CPS 234 and STK17A resides in an intron of FLJ10803.
  • FLJ10803 encodes hypothetical protein FLJ10803, and has LocusID: 55744 and reported cytogenetic location 7pl5.1.
  • CPS 235 corresponds to BLVRA which encodes biliverdin reductase A.
  • This gene has LocusID: 644, and is located on chromosome 7 with reported cytogenetic location 7pl4-cen.
  • the gene resides in genomic locus NT_007819 (NCBI Genome
  • CPS 236 corresponds to SPP1 which encodes secreted phosphoprotein 1
  • the gene 6696, and is located on chromosome 4 with reported cytogenetic location 4q21-q25.
  • the gene resides in genomic locus NT_006204 (NCBI Genome Annotation).
  • the gene product is a bone and blood vessel extracellular matrix protein involved in calcification and atherosclerosis.
  • CPS 237 corresponds to STX1A which encodes syntaxin 1A (brain).
  • This gene has LocusID: 6804, and is located on chromosome 7 with reported cytogenetic location 7ql l.23. The gene resides in genomic locus NT_007758 (NCBI Genome
  • Syntaxin 1A (brain) may be involved in intracellular transport and neurotransmitter release.
  • CPS 238 corresponds to TKTL1 which encodes transketolase-like 1.
  • This gene has LocusID: 8277, and is located on chromosome X with reported cytogenetic location Xq28. The gene resides in genomic locus NT_025965 (NCBI Genome
  • Transketolase-like 1 is a thiamine pyrophosphate-dependent enzyme of pentose phosphate pathway.
  • CPS 239 corresponds to IER3 which encodes immediate early response 3.
  • This gene has LocusID: 8870, and is located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI Genome
  • This gene functions in the protection of cells from Fas or tumor necrosis factor type alpha-induced apoptosis. Alternative splicing of this gene results in at least two transcript variants.
  • CPS 241 corresponds to PLAU which encodes plasminogen activator, urokinase. This gene has LocusID: 5328, and is located on chromosome 10 with reported cytogenetic location 10q24. The gene resides in genomic locus NT_033890 (NCBI).
  • Urokinase plasminogen activator is a serine protease that cleaves plasminogen to form plasmin.
  • CPS 242 corresponds to ASS which encodes argininosuccinate synthetase.
  • This gene has LocusID: 445, and is located on chromosome 9 with reported cytogenetic location 9q34.1.
  • the gene resides in genomic locus NT_008338 (NCBI Genome Annotation).
  • the protein encoded by this gene catalyzes the penultimate step of the arginine biosynthetic pathway.
  • There are approximately 10 to 14 copies of this gene including the pseudogenes scattered across the human genome, among which the one located on chromosome 9 appears to be the only, functional gene for argininosuccinate synthetase. Mutations in the chromosome 9 copy of ASS cause citrullinemia. At least two alternatively spliced transcript variants of this gene have been reported.
  • CPS 242 also has 87-94% sequence identity with various regions on chromosome 2, 4, 5, 6, 7, 9, 11, 12 and X. These regions include LOC 167449, LOC222906, ASSP2, ASSP3, ASSP4, ASSP5, LOC158452, LOC253843, LOC120341, LOC167519, and an intron of intron of LOC90957.
  • LOC167449 encodes a protein similar to argininosuccinate synthetase and resides in genomic locus NT_006431 with reported cytogenetic location 5ql l.2.
  • LOC222906 encodes a protein similar to argininosuccinate synthetase, and resides in genomic locus NT_007819 with reported cytogenetic location 7p21.3.
  • ASSP2 (LocusID: 447) encodes argininosuccinate synthetase pseudogene 2, and resides in genomic locus NT_007592 with reported cytogenetic location 6p22.1.
  • ASSP3 (LocusID: 448) encodes argininosuccinate synthetase pseudogene 3, and resides in genomic locus NT_033256 with reported cytogenetic location 9ql l-q22.
  • ASSP4 (LocusID: 449) encodes argininosuccinate synthetase pseudogene 4, and resides in genomic locus NT _025302 with reported cytogenetic location Xpter- ⁇ 22.
  • ASSP5 (LocusID: 450) encodes argininosuccinate synthetase pseudogene 5, and resides in genomic locus NT_028405 with reported cytogenetic location Xq22-q26.
  • LOC 158452 encodes a protein similar to argininosuccinate synthetase, and resides in genomic locus NT_023935 with reported cytogenetic location 9q21.31.
  • LOC253843 encodes a protein similar to argininosuccinate synthetase, and resides in genomic locus NT_006258 on chromosome 4.
  • LOC 120341 encodes a protein similar to argininosuccinate synthetase, and resides in genomic locus NT_009151 with reported cytogenetic location l lq23.1.
  • LOC 167519 encodes a protein similar to argininosuccinate synthetase, and resides in genomic locus NT_034778 with reported cytogenetic location 5q31.3.
  • LOC90957 (LocusID: 90957) encodes DEAH-box RNA/DNA helicase AAM73547, and resides in genomic locus NT__005367 with -reported cytogenetic location 2p22.3. [0341] Affymetrix annotation suggests that CPS 244 corresponds to DUX1 which encodes double homeobox, 1. The gene has LocusID: 26584.
  • Blast search of the Entrez human genome sequence database shows that nucleotides 1 to 689 of A JOO 1481 (SEQ ID NO: 227) align with two chromosome 4 regions with 85-88% sequence identity.
  • the first region is located 3' to LOC131308.
  • LOC131308 encodes a protein similar to FSHD Region Gene 2 protein. It has reported cytogenetic location 3pl4.1 and resides in genomic locus NT_022665.
  • the second region is located near TRAP95.
  • TRAP95 encodes thyroid hormone receptor-associated protein, 95-kD subunit.
  • TRAP95 has LocusID: 10025 and reported cytogenetic location 19pl3.3. It resides in genomic locus NT_011277.
  • TRAP95 gene product is a subunit of TRAP thyroid hormone receptor-associated protein complex, and functions as a coactivator for nuclear receptors.
  • CPS 245 corresponds to GPA33 which encodes glycoprotein A33
  • This gene has LocusID: 10223, and is located on chromosome 1 with reported cytogenetic location lq23.2. The gene resides in genomic locus NT 304668 (NCBI Genome Annotation).
  • the glycoprotein encoded by this gene is a cell surface antigen that is expressed in human colon cancers.
  • the sequence of the extracellular region of the encoded protein contains 2 domains characteristic of the CD2 subgroup of the immunoglobulin (Ig) superfamily.
  • the encoded protein may play a role in cell adhesion.
  • CPS 246 corresponds to PAH (SERPINEl) which encodes serine (or cysteine) proteinase inhibitor, clade E (nexin, plasminogen activator inhibitor type 1), member 1.
  • This gene has LocusID: 5054, and is located on chromosome 7 with reported cytogenetic location 7q21.3-q22. The gene resides in genomic locus NT_007933 (NCBI Genome Annotation). The gene product may regulate fibrinolysis. It is a member of the serpin family of serine protease inhibitors.
  • CPS 247 corresponds to CYP1B1 which encodes cytochrome P450, subfamily I (dioxin-inducible), polypeptide 1 (glaucoma 3, primary infantile).
  • This gene has LocusID: 1545, and is located on chromosome 2 with reported cytogenetic location 2p21. The gene resides in genomic locus NT_005367 (NCBI Genome Annotation). This gene product is a member of the cytochrome P450 superfamily of enzymes.
  • the cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids.
  • CYP1B1 The enzyme encoded by CYP1B1 gene localizes to the endoplasmic reticulum and metabolizes procarcmogens such as polycyclic aromatic hydrocarbons and 17beta-estradiol. Mutations in this gene have been associated with primary congenital glaucoma.
  • CPS 249 corresponds to HU-K5 (MGLL) which encodes monoglyceride lipase.
  • MGLL HU-K5
  • This gene has LocusID: 11343, and is located on chromosome 3 with reported cytogenetic location 3q21.3.
  • the gene resides in genomic locus NT_005588 (NCBI).
  • the gene product may function in regulating levels of lysophospholipids.
  • DORA is also known as IGSF6 which encodes immunoglobulin superfamily, member 6.
  • the gene has LocusID: 10261, and is located at chromosome 16pl2-pl3.
  • DREVl encodes CGI-81 protein.
  • DREVl has LocusID: 51108, and is located on chromosome 16 with reported cytogenetic location 16pl3-pl2. The gene resides in genomic locus
  • NT_010441 NCBI Genome Annotation
  • CPS 253 corresponds to MD-2 which encodes MD-2 protein. This gene has
  • LocusID 23643, and is located on chromosome 8 with reported cytogenetic location
  • the gene resides in genomic locus NT_008209 (NCBI Genome Annotation).
  • MD-2 protein appears to associate with toll-like receptor 4 on the cell surface and confer responsiveness to lipopolysaccyaride (LPS), thus providing a link between the receptor and
  • CPS 254 corresponds to UNK_U07563 (RRP4) which encodes homolog of yeast RRP4 (ribosomal RNA processing 4), 3'-5'-exoribonuclease. This gene has LocusID:
  • the gene product also known as ribosomal RNA processing 4, is similar to S. cerevisiae RRP4 which is a component of both the nuclear and cytoplasmic forms of the ribosomal RNA processing 3'-
  • UNKJU07563 or RRP4 gene product may function in ribosomal
  • CPS 255 corresponds to SPINK1 which encodes serine protease inhibitor, Kazal type 1.
  • the gene has LocusID: 6690 and reported cytogenetic location 5q32.
  • Blast search of the Entrez human genome sequence database shows that CPS
  • LOC256135 and KIAA0555 reside in genomic locus NT_006859
  • LOC256135 encodes a protein similar to N-formyl peptide receptor.
  • KIAA0555 encodes KIAA0555 gene product, and has LocusID: 9832 and reported cytogenetic location 5q32.
  • CPS 256 corresponds to UNK_S82470 (also known as BB1 or LENG4) which encodes leukocyte receptor cluster (LRC) member 4.
  • LRC leukocyte receptor cluster
  • This gene has LocusID: 79143, and is located on chromosome 19 with reported cytogenetic location 19ql3.4. The gene resides in genomic locus NT_011148 (NCBI Genome Annotation).
  • Affymetrix annotation suggests that CPS 257 corresponds to IGHG3.
  • the gene encodes immunoglobulin heavy constant gamma 3 (G3m marker), and has LocusID:
  • the gene is located at chromosome 14q32.33.
  • 257 has at least 96% sequence identity with a chromosome 14 region near LOC122595.
  • LOC122595 has reported cytogenetic location 14q32.33 and resides in genomic locus
  • NT_010168 NCBI Genome Annotation
  • CPS 260 corresponds to TNFRSF12 which encodes tumor necrosis factor receptor superfamily, member 12 (translocating chain-association membrane protein). This gene has LocusID: 8718, and is located on chromosome 1 with reported cytogenetic location lp36.2. The gene resides in genomic locus NT_028054 (NCBI Genome
  • the gene product contains a cytoplasmic death domain and transmembrane domains. It can induce apoptosis and activates NF-kappaB.
  • CPS 261 corresponds to UNK AL031685 (KIAA0939) which encodes
  • KIAA0939 protein This gene has LocusID: 23315, and is located on chromosome 20 with reported cytogenetic location 20ql3.13. The gene resides in genomic locus NT_011362
  • CPS 262 corresponds to UNK_AL049963 (LOC64116) which encodes a protein up-regulated by BCG-CWS.
  • This gene has LocusID: 64116, and is located on chromosome 4 with reported cytogenetic location 4q22-q24. The gene resides in genomic locus NT_006383 (NCBI Genome Annotation).
  • NT_006383 NCBI Genome Annotation.
  • Affymetrix annotation suggests that CPS 263 corresponds to APELIN which encodes apelin, a peptide ligand for APJ receptor.
  • the gene has LocusID: 8862, and is located at chromosome Xq25-26.3.
  • OCRL refers to oculocerebrorenal syndrome of Lowe.
  • This gene has LocusID: 4952, and is located on chromosome X with reported cytogenetic location Xq25-q26.1. The gene resides in genomic locus NT_011786 (NCBI Genome Annotation). Mutations in this gene are linked to the disease oculocerebrorenal syndrome of Lowe.
  • the encoded protein is a phospatidylinositol polyphosphate 5-phosphatase that can be found in golgi cisternae.
  • CPS 264 corresponds to VEGF which encodes vascular endothelial growth factor.
  • This gene has LocusID: 7422, and is located on chromosome 6 with reported cytogenetic location 6pl2. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation). Vascular endothelial growth factor can induce endothelial cell proliferation and vascular permeability.
  • CPS 265 corresponds to ELAVL2 which encodes ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 (Hu antigen B).
  • This gene has LocusID: 1993, and is located on chromosome 9 with reported cytogenetic location 9p21.
  • the gene resides in genomic locus NT_023974 (NCBI Genome Annotation).
  • the gene product can bind to, 3 - untranslated regions of mRNA.
  • CPS 266 corresponds to PPP2R2A which encodes protein phosphatase 2
  • This gene has LocusID: 5520, and is located on chromosome 8 with reported cytogenetic location 8p21.1. The gene resides in genomic locus NT_023666 (NCBI Genome Annotation).
  • Nucleotides 163 to 505 of CPS 266 have 96% sequence identity with an intron sequence of TSC22.
  • TSC22 encodes transforming growth factor beta-stimulated protein TSC-22. It has LocusID: 8848, and resides in genomic locus NT_033922 with reported cytogenetic location 13ql4.
  • CPS 266 aligns with the non-protein-coding strand of the gene.
  • CPS 267 corresponds to UNK_AF001862 (FYB) which encodes FYN binding protein (FYB-120/130).
  • FYB UNK_AF001862
  • This gene has LocusID: 2533, and is located on chromosome 5 with reported cytogenetic location 5pl3.1. The gene resides in genomic locus NT_023195 (NCBI Genome Annotation). FYN-binding protein can modulate interleukin 2 production.
  • CPS 268 corresponds to OAS1 which encodes 2',5'-oligoadenylate synthetase 1 (40-46 kD).
  • This gene has LocusID: 4938, and is located on chromosome 12 with reported cytogenetic location 12q24.1. The gene resides in genomic locus NT_009770 (NCBI Genome Annotation).
  • This gene product is a member of the 2', 5' oligoadenylate synthase family. It can be induced by interferons and catalyze the 2', 5' oligomers of adenosine in order to bind and activate RNase L.
  • This gene family plays a significant role in the inhibition of cellular protein synthesis and viral infection resistance.
  • Alternative splicing of OAS1 gene produces at least two isoforms.
  • CPS 269 corresponds to CYBB which encodes cytochrome b-245, beta polypeptide (chronic granulomatous disease).
  • This gene has LocusID: 1536, and is located on chromosome X with reported cytogenetic location Xp21.1. The gene resides in genomic locus NT_011657 (NCBI Genome Annotation).
  • Cytochrome b (-245) is composed of cytochrome b alpha (CYBA) and beta (CYBB) chain. It has been proposed as a primary component of the microbicidal oxidase system of phagocytes.
  • CYBB deficiency is one of five described biochemical defects associated with chronic granulomatous disease (CGD).
  • phagocyte NADPH oxidase In this disorder, there is decreased activity of phagocyte NADPH oxidase. Neutrophils are able to phagocytize bacteria but cannot kill them in the phagocytic vacuoles. The cause of the killing defect may be an inability to increase the cell's respiration and consequent failure to deliver activated oxygen into the phagocytic vacuole.
  • CPS 270 corresponds to GFPT2 which encodes glutamine-fructose-6- phosphate transaminase 2.
  • This gene has LocusID: 9945, and is located on chromosome 5 with reported cytogenetic location 5q34-q35.
  • the gene resides in genomic locus NT_006519 (NCBI Genome Annotation).
  • the encoded protein is an enzyme of the hexosamine biosynthetic pathway.
  • CPS 271 corresponds to BNIP3 which encodes BCL2/adenovirus E1B 19kDa interacting protein 3.
  • the gene has LocusID: 664, and is located at chromosome 14ql 1.2-ql2.
  • LOC159348 has 100% sequence identity with an intron sequence of LOC159348.
  • LOC159348 is located at chromosome 10q26.3 and has genomic locus NT_024040 (NCBI Genome Annotation).
  • Nucleotides 1 to 451 of CPS 270 have 97% sequence identity with a region 3' to the protein-coding sequence of LOC254590.
  • LOC254590 resides in genomic locus NT_025892 on chromosome 14.
  • nucleotides 4 to 450 align to a chromosome 15 region near CAPN3 with 81% sequence identity.
  • CAPN3 encodes calpain 3, (p94), and has LocusID: 825 and reported cytogenetic location 15ql5.1-q21.1.
  • CAPN3 resides in genomic locus NT_030828.
  • CPS 272 corresponds to BST1 which encodes bone marrow stromal cell antigen 1.
  • This gene has LocusID: 683, and is located on chromosome 4 with reported cytogenetic location 4pl5. The gene resides in genomic locus NT_006344 (NCBI Genome Annotation).
  • Bone marrow stromal cell antigen 1 is a stromal cell line-derived glycosylphosphatidylinositol-anchored molecule that facilitates pre-B-cell growth. BST1 expression can be enhanced in bone marrow stromal cell lines derived from patients with rheumatoid arthritis.
  • CPS 273 corresponds to UNK_U50535 (CG005) wliich encodes hypothetical protein from BCRA2 region.
  • This gene has LocusID: 10443, and is located on chromosome 13 with reported cytogenetic location 13ql2-ql3.
  • the gene resides in genomic locus NTJ309984 (NCBI Genome Annotation).
  • the gene product has a region of low similarity to a region of rat 2',3 '-cyclic nucleotide 3 '-phosphodiesterase
  • CPS 274 corresponds to CX3CR1 which encodes chemokine (CX3C) receptor 1.
  • This gene has LocusID: 1524, and is located on chromosome 3 with reported cytogenetic location 3p21.3. The gene resides in genomic locus NT_005498 (NCBI Genome Annotation).
  • CX3C chemokine receptor is a G protein-coupled receptor. It can mediate leukocyte migration and adhesion, bind the CX3C chemokine fractalkine and signal through a pertussis toxin sensitive G-protein.
  • CPS 275 corresponds to CKTSF1B1 which encodes cysteine knot superfamily 1, BMP antagonist 1.
  • This gene has LocusID: 26585, and is located on chromosome 15 with reported cytogenetic location 15ql3-ql5.
  • the gene resides in genomic locus NT_024680 (NCBI Genome Annotation).
  • the gene product is a homolog of Xenopus laevis Gremlin wliich is a secreted protein that blocks signaling of bone morphogenetic protein (BMP) by preventing access to the BMP receptor.
  • CPS 276 corresponds to VNN2 wliich encodes vanin 2. This gene has
  • LocusID 8875, and is located on chromosome 6 with reported cytogenetic location 6q23- q24.
  • the gene resides in genomic locus NT_025741 (NCBI Genome Annotation).
  • This gene product is a member of the Vanin family of proteins which share sequence similarity with each other, and also with biotinidase.
  • the family includes secreted and membrane- associated proteins, a few of which have been reported to participate in hematopoietic cell trafficking.
  • Members of this family possess pantetheinase activity, and may play a role in oxidative-stress response.
  • Vanin 2 is a GPI-anchored cell surface molecule that plays a role in transendothelial migration of neutrophils.
  • VNN2 gene lies in close proximity to, and in same transcriptional orientation as two other vanin genes on chromosome 6q23-q24. Two transcript variants encoding different isoforms have 'been described for this gene. [0376] CPS 277 corresponds to KIAA0064 (SNX17) which encodes sorting nexin
  • This gene has LocusID: 9784, and is located on chromosome 2 with reported cytogenetic location 2p23-p22. The gene resides in genomic locus NT_005204 (NCBI Genome Annotation).
  • CPS 278 corresponds to GROl which encodes GROl oncogene (melanoma growth stimulating activity, alpha). This gene has LocusID: 2919, and is located on chromosome 4 with reported cytogenetic location 4q21. The gene resides in genomic locus NT_006216 (NCBI Genome Annotation). The gene product has melanoma growth stimulating activity. It may be a mitogenic factor involved in inflammatory processes. [0378] CPS 278 has 88% sequence identity with GRO2 which encodes GRO2 oncogene. GRO2 has LocusID: 2920, and resides in genomic locus NT_00621-6 with reported cytogenetic location 4q21.
  • CPS 279 corresponds to KIAA0655 (HIP12) which encodes huntingtin interacting protein 12.
  • This gene has LocusID: 9026, and is located on chromosome 12 with reported cytogenetic location 12q24. The gene resides in genomic locus NT_009464 (NCBI Genome Annotation).
  • Nucleotides 13 to 332 of CPS 279 aligns to an intron of DNAH11 with 100% sequence identity.
  • DNAH11 encodes dynein, axonemal, heavy polypeptide 11. It has LocusID: 8701, and
  • CPS 280 corresponds to RBI which encodes retinoblastoma 1 (including osteosarcoma). This gene has LocusID: 5925, and is located on chromosome 13 with reported cytogenetic location 13ql4.2. The gene resides in genomic locus NT_033922 (NCBI Genome Annotation). Retinoblastoma protein 1 is a nuclear phosphoprotein with DNA binding activity. It can interact with histone deacetylase to repress transcription. [0383] CPS 281 corresponds to KIAA0073 which encodes KIAA0073 protein. This gene has LocusID: 23398, and is located on chromosome 5 with reported cytogenetic location 5ql2.3. The gene resides in genomic locus NT_006431 (NCBI Genome Annotation).
  • CPS 282 corresponds to CAPN4 (CAPNS1) which encodes calpain, small subunit 1.
  • This gene has LocusID: 826, and is located on chromosome 19 with reported cytogenetic location 19ql3.13. The gene resides in genomic locus NT_011296 (NCBI Genome Annotation). Calpains are a ubiquitous, well-conserved family of calcium- dependent, cysteine proteases. Calpain I and II are heterodimeric with distinct large subunits associated with common small subunits.
  • CAPN4 (CAPNS1) gene encodes a small subunit common to both calpain I and II and is associated with myotonic dystrophy.
  • CPS 284 corresponds to MGST 1 L 1 (PTGES) which encodes prostaglandin E synthase. This gene has LocusID: 9536, and is located on chromosome 9 with reported cytogenetic location 9q34.3. The gene resides in genomic locus NTJ329366 (NCBI Genome Annotation). Prostaglandin (PG) E synthase is involved in eicosanoid and glutathione metabolism. It is a member of superfamily of membrane associated proteins. [0386] CPS 285 corresponds to C1NH (SERPINGl) which encodes serine (or cysteine) proteinase inhibitor, clade G (Cl inhibitor), member 1, (angioedema, hereditary).
  • SERPINGl serine (or cysteine) proteinase inhibitor
  • clade G Cl inhibitor
  • member 1 angioedema, hereditary
  • This gene has LocusID: 710, and is located on chromosome 11 with reported cytogenetic location I lql2-ql3.1.
  • the gene resides in genomic locus NT_033903 (NCBI Genome Annotation).
  • the protein encoded by C1NH gene can inhibit activated Clr and Cis of the first complement component and thus regulate complement activation. Deficiency of the encoded protein may be associated with hereditary angioneurotic oedema (HANE).
  • the encoded protein is a member of the serine protease inhibitor (serpin) superfamily.
  • Affymetrix annotation suggests that CPS 286 corresponds to H1FX.
  • H1FX encodes HI histone family, member X. It has LocusID: 8971.
  • CPS 287 corresponds to IL1B which encodes interleukin 1, beta. This gene has LocusID: 3553, and is located on chromosome 2 with reported cytogenetic location 2ql4. The gene resides in genomic locus NT_022135 (NCBI Genome Annotation). Interleukin 1 beta may initiate and amplify the immune and inflammatory responses.
  • CPS 288 corresponds to F3 which encodes coagulation factor III
  • CPS 289 corresponds to DUSP4 which encodes dual specificity phosphatase
  • This gene has LocusID: 1846, and is located on chromosome 8 with reported cytogenetic location 8pl2-pl l. The gene resides in genomic locus ⁇ T_030743 (NCBI Genome Annotation).
  • the protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK JNK, p38), which are associated with cellular proliferation and differentiation.
  • MAP mitogen-activated protein
  • DUSP4 gene product can inactivate ERK1 , ERK2 and JNK, is expressed in a variety of tissues, and is localized in the nucleus. Two alternatively spliced transcript variants, encoding distinct isoforms, have been observed for this gene. In addition, multiple polyadenylation sites have been reported.
  • HUMRTNLH3 encodes endogenous retroviral protease.
  • the gene has LocusID: 51354.
  • CPS 291 corresponds to U ⁇ K_AL049250 (BANP) which encodes BTG3 associated nuclear protein.
  • BANP U ⁇ K_AL049250
  • This gene has LocusID: 54971, and resides in genomic locus NT_010859 on chromosome 18 (NCBI Genome Annotation).
  • the gene product can interact with CAF1, a component of the general transcription multisubunit complex. It is thought that BTG3 is involved in negative control of the cell cycle.
  • the protein encoded by BTG3 gene binds to BTG3. Studies with mouse homolog suggest that this protein may also interact with a specific nuclear matrix/scaffold-associated region (MAR).
  • MAR nuclear matrix/scaffold-associated region
  • LOC 124302 which encodes a protein similar to nuclear pore complex interacting protein.
  • LOC 124302 resides in genomic locus NT_010859 with reported cytogenetic location 18pl 1.1.
  • CPS 291 also aligns with other genes and chromosomal regions with at least
  • LOC92267 encodes a protein similar to hypothetical protein FLJ12363.
  • LOC92267 resides in NT_010441 with reported cytogenetic location 16pl2.1.
  • CPS 291 aligns with the non-protein-coding strand of LOC92267.
  • LOCI 46181 encodes a protein similar to Hypothetical protein KIAA0220, and resides in genomic locus NT_010441 with reported cytogenetic location 16pl2.2.
  • LOC146136 encodes a protein similar to nuclear pore complex interacting protein, and resides in genomic locus NT_010441 with reported cytogenetic location 16pl2.1.
  • KIAA0220 has LocusID: 23117, and resides in genomic locus NT_035362 with reported cytogenetic location 16pl2.1.
  • LOC253666 has similarity to rat kidney-specific (KS) gene. LOC253666 resides in genomic locus NT_010604 on chromosome 16.
  • LOC220565 encodes a protein similar to PI-3-kinase-related kinase SMG-1, isoform 1; lambda/iota protein kinase C-interacting protein; phosphatidylinositol 3-kinase-related protein kinase.
  • LOC220565 is located at chromosome 16ql3, and resides in genomic locus NT_033291.
  • LOC255019 encodes a protein similar to nuclear pore complex interacting protein, and resides in genomic locus NT_035360 on chromosome 16.
  • LOC83985 encodes spinster-like protein, and has LocusID: 83985.
  • LOC83985 resides in genomic locus NT_035372 with reported cytogenetic location 16ql3.
  • NPIP LocusID: 9284
  • EIF3S8 encodes eukaryotic translation initiation factor 3, subunit 8 (HOkD).
  • EIF3S8 has LocusID: 8663, and resides in genomic locus NT_010589 with reported cytogenetic location 16pll.2.
  • LOC146452 encodes a protein similar to KIAA0251 hypothetical protein. It resides in genomic locus NT_010478 with reported cytogenetic location 16q22.3.
  • LOCI 18735 encodes a protein similar to apoptosis response protein (prostate apoptosis response protein 4). LOCI 18735 resides in NT_030059 with reported cytogenetic location 10q24.2. [0395] CPS 292 corresponds to UNK . AB002308 (KIAA0310) wliich encodes
  • KIAA0310 gene product This gene has LocusID: 9919, and is located on chromosome 9 with reported cytogenetic location 9q34.3. The gene resides in genomic locus NT_033215
  • CPS 293 corresponds to EDN1 which encodes endothelin 1.
  • the gene has LocusID: 1906, and is located at chromosome 6p24.1.
  • CPS 293 aligns with IL1B with at least 99% sequence identity.
  • IL1B encodes interleukin 1, beta. '
  • CPS 295 corresponds to FRAT2 which encodes frequently rearranged in advanced T-cell lymphomas 2.
  • This gene has LocusID: 23401, and is located on chromosome 10 with reported cytogenetic location 10q23-q24.1.
  • the gene resides in genomic locus NT_03005 (NCBI Genome Annotation).
  • CPS 297 corresponds to IL8 which encodes interleukin 8. This gene has
  • LocusID 3576, and is located on chromosome 4 with reported cytogenetic location 4ql3- q21.
  • the gene resides in genomic locus NT_006216 (NCBI Genome Annotation).
  • Interleukin 8 is a cytokine that plays a role in chemoattraction and activation of neutrophils.
  • FSCN2 encodes fascin homolog 2, actin-bundling protein, retinal (Strongylocentrotus purpuratus).
  • the gene has LocusID: 25794, and is located at chromosome 17q25.
  • CPS 300 corresponds to CAMP which encodes cathelicidin antimicrobial peptide.
  • This gene has LocusID: 820, and is located on chromosome 3 with reported cytogenetic location 3p21.3. The gene resides in genomic locus NT_022567 (NCBI).
  • Cathelicidin antimicrobial peptide is a precursor of a peptide with antibacterial activity.
  • CPS 301 corresponds to FCGR1A which encodes Fc fragment of IgG, high affinity la, receptor for (CD64).
  • This gene has LocusID: 2209, and is located on chromosome 1 with reported cytogenetic location Iq21.2-q21.3.
  • the gene resides in genomic locus NT_032962 (NCBI Genome Annotation).
  • the gene product also known as
  • Fc gamma RI is a receptor for the Fc domain of IgG. It is a member of the immunoglobulin superfamily, and may have a role in immune response.
  • CPS 302 corresponds to MAT1A which encodes methionine adenosyltransferase I, alpha. This gene has LocusID: 4143, and is located on chromosome 10 with reported cytogenetic location 10q22. The gene resides in genomic locus
  • Methionine adenosyltransferase I (alpha isoform) catalyzes the formation of S-adenosylmethionine from methionine and ATP. Both the beta and alpha isoforms may be encoded by MATIA. Methionine adenosyltransferase deficiency is known to be caused by recessive as well as dominant mutations, the latter identified in autosomal dominant persistant hypermethioninemia.
  • CPS 303 corresponds to CYP4B1 which encodes cytochrome P450, subfamily IVB, polypeptide 1.
  • This gene has LocusID: 1580, and is located on chromosome 1 with reported cytogenetic location Ip34-pl2. The gene resides in genomic locus NT_004386 (NCBI Genome Annotation).
  • This gene product is a member of the cytochrome P450 heme-binding monooxygenase superfamily, and can metabolize steroids, fatty acids and xenobiotics.
  • MUC3 is also known as MUC3A, and encodes mucin 3A, intestinal.
  • the gene has LocusID: 4584 and reported cytogenetic location 7q22.
  • MUC3B aligns to regions 5' to the protein-coding strand of MUC3B with 80-81% sequence identity.
  • MUC3B encodes mucin 3B. It has LocusID: 57876 and reported cytogenetic location 7q22. The gene resides in genomic locus NT_007933.
  • CPS 305 corresponds to B7 which encodes B7 protein. This gene has
  • LocusID 10233, and is located on chromosome 12 with reported cytogenetic location
  • the gene resides in genomic locus NT_035206 (NCBI Genome Annotation).
  • the gene product has similarity to the regulatory subunit of protein phosphatases. It contains leucine rich repeats, and may mediate protein-protein interactions.
  • CPS 306 corresponds to SCYA20 which encodes small inducible cytokine subfamily A (Cys-Cys), member 20.
  • This gene has LocusID: 6364, and is located on chromosome 2 with reported cytogenetic location 2q33-q37.
  • the gene resides in genomic locus NT_005403 (NCBI Genome Annotation).
  • the protein encoded by this gene is a chemotactic factor for lymphocytes.
  • CPS 307 corresponds to ILIRN which encodes interleukin 1 receptor antagonist.
  • This gene has LocusID: 3557, and is located on chromosome 2 with reported cytogenetic location 2ql4.2. The gene resides in genomic locus NT_02213 (NCBI Genome Annotation).
  • Interleukin 1 receptor antagonist binds to and inhibits the IL-1 receptor. It is a member of the interleukin- 1 (IL-1) family.
  • CPS 308 corresponds to ZNF261 which encodes zinc finger protein 261.
  • This gene has LocusID: 9203, and is located on chromosome X with reported cytogenetic location Xql3.1.
  • the gene resides in genomic locus NT_019696 (NCBI Genome Annotation).
  • the gene product contains a putative zinc-binding motif (MYM).
  • CPS 309 corresponds to LGALS2 which encodes lectin, galactoside-binding, soluble, 2 (galectin 2).
  • This gene has LocusID: 3957, and is located on chromosome 22 with reported cytogenetic location 22ql3.1.
  • the gene resides in genomic locus NT_011520 (NCBI Genome Annotation).
  • Galectin 2 is an actose-binding lectin involved in cell growth regulation.
  • CPS 310 corresponds to TGN51 (TGOLN2) which encodes trans-golgi network protein 2.
  • This gene has LocusID: 10618, and is located on chromosome 2 with reported cytogenetic location 2pl l.2. The gene resides in genomic locus NT_015805 (NCBI Genome Annotation).
  • CCI-779 modulation of the expression levels of the CCI-779 activity genes have yet to be elucidated. Without being limited to any specific theory, the modulation may be attributed to the direct effect of CCI- 779 on PBMCs or other blood cells. It may also be caused by the effect of CCI-779 on renal cell carcinoma tumors, which in turn induce the change of the gene expression profile in the peripheral blood cells.
  • CCI-779 activity genes are RCC disease genes that are differentially expressed in PBMCs of RCC patients relative to tumor-free humans. These genes include, for example, TUBB, CTSL, SCYA2, PAI2, UNK_AI679353, SCYA7, IL1R1, THBS1, NCF1, ATP2B1, GRO2, PRF1, DBP, FCGR3B, ADFP, GRO3, C1QR, RNASE2, CBP2, HK3, FOS, PDI2, UNKJJ12471 (THBS1), EREG, SPP1, STX1A, TKTL1, DUX1, SPINK1, UNK AL049963 (LOC64116), BNIP3, GROl, IL1B, F3, UNK AL049250, EDN1, FCGR1A, MUC3, B7, SCYA20, IL1RN, and ZNF261. Over the course of CCI-779 therapy, at least one subset of these genes, which are elevated or suppressed in R
  • Table 6 shows examples of RCC disease genes whose expression profiles in
  • PBMCs can be modulated by CCI-779. Modulation of the expression profiles of these genes in ex vivo conditions is also indicated.
  • CCI-779 peripheral blood mononuclear cells isolated from tumor- free humans were treated with CCI-779 ex vivo. Comparison of the genes sensitive to CCI- 779 treatment ex vivo with the CCI-779 activity genes of the present invention revealed a common set of genes. These genes represent surrogate markers of CCI-779 drug activity in vivo as well as ex vivo.
  • the CCI-779 activity genes identified in the present invention can be used to monitor CCI-779 drug activities in a patient who is subject to a CCI-779 treatment.
  • Peripheral blood samples can be isolated at different stages of the CCI-779 treatment.
  • the expression profile of one or more CCI-779 activity genes in these peripheral blood samples can be determined and compared to a reference expression profile.
  • a change in the gene expression profile indicates in vivo activities of CCI-779.
  • the patent has RCC or another solid tumor.
  • the expression profiles of CCI-779 activity genes are determined by measuring the levels of RNA transcripts of these genes in peripheral blood samples. Suitable methods for this purpose include, but are not limited to, RT-PCT, Northern Blot, in situ hybridization, slot-blotting, nuclease protection assay, and nucleic acid arrays.
  • the peripheral blood samples can be, without limitation, whole blood samples or samples containing enriched PBMCs.
  • RNA isolated from peripheral blood samples can be first amplified to cDNA or cRNA.
  • the amplification can be specific or non-specific. Suitable amplification methods include, but are not limited to, reverse transcriptase PCR, isothermal amplification, ligase chain reaction, and Qbeta replicase.
  • the amplified nucleic acid products can be detected or quantitated, for instance, through hybridization to labeled probes.
  • Amplification primers and hybridization probes for a CCI-779 activity gene can be prepared from the gene sequence or its corresponding CPS using numerous methods. Gene sequences suitable for this purpose include, but are not limited to, exons, introns, or the 3' or 5' untranslated regions, or any combination thereof. In one embodiment, probes/primers are designed based on the sequence in or near the 3 ' protein-coding region of a CCI-779 activity gene. For instance, the nucleotide sequence encoding the last 100 to 300 amino acid residues in the C-terminus region of the CCI-779 activity gene product can be selected to design probes or primers.
  • a CCI-779 activity gene is a hypothetical or putative gene whose expression is supported only by EST or mRNA data, or where the genomic location(s) of a CCI-779 activity gene has not been determined or the gene may correspond to multiple genomic loci
  • the probes/primers for the gene can be designed based on the corresponding CPS, or the oligonucleotide probes of the corresponding qualifier. Table 5 lists example sequences that are useful for designing probes/primers for detecting the expression profiles of CCI-779 activity genes.
  • each probe can comprise at least 15, 20, 25, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 400 or more nucleotides.
  • each probe/primer has relatively high sequence complexity and does not have any ambiguous residue (undetermined "n" residues).
  • the probes/primers can hybridize to the target gene; such as its RNA transcripts or the complements thereof, under stringent or highly stringent conditions.
  • stringent conditions are at least as stringent as, for example, conditions G-L shown in Table 7.
  • “Highly stringent conditions” are at least as stringent as conditions A-F shown in Table 7. As used in Table 1, hybridization is carried out under the hybridization conditions (Hybridization Temperature and Buffer) for about four hours, followed by two 20-minute washes under the corresponding wash conditions (Wash Temp, and Buffer).
  • the hybrid length is that anticipated for the hybridized region(s) of the hybridizing polynucleotides.
  • the hybrid length is assumed to be that of the hybridizing polynucleotide.
  • the hybrid length can be determined by aligning the sequences of the polynucleotides and identifying the region or regions of optimal sequence complementarity.
  • SSPE 0.15M NaCl, lOmM NaH 2 PO 4 , and 1.25mM EDTA, pH 7.4
  • SSC 0.15M NaCl and 15mM sodium citrate
  • the probes/primers for a CCI-779 activity gene are selected from regions which significantly diverge from the sequences of other genes. Such regions can be determined by checking the probe/primer sequences against a human genome sequence database, such as the Entrez database at the NCBI.
  • a human genome sequence database such as the Entrez database at the NCBI.
  • One algorithm suitable for this purpose is the BLAST algorithm. This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold.
  • HSPs high scoring sequence pairs
  • the word hits are then extended in both directions along each sequence to increase the cumulative alignment score. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always ⁇ 0).
  • the BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. These parameters can be adjusted for different purposes, as appreciated by one of ordinary skill in the art.
  • quantitative RT-PCR (such as TaqMan, ABI) is used for detecting and comparing the peripheral blood expression profiles of CCI-779 activity genes.
  • Quantitative RT-PCR involves reverse transcription (RT) of RNA to cDNA followed by relative quantitative PCR (RT-PCR).
  • the concentration of the target DNA in the linear portion of the PCR is proportional to the starting concentration of the target before the PCR was begun.
  • concentration of the PCR products of the target DNA in PCR reactions that have completed the same number of cycles and are in their linear ranges, it is possible to determine the relative concentrations of the specific target sequence in the original DNA mixture. If the DNA mixtures are cDNAs synthesized from RNAs isolated from different tissues or cells, the relative abundances of the specific mRNA from which the target sequence was derived may be determined for the respective tissues or cells. This direct proportionality between the concentration of the PCR products and the relative mRNA abundances is true in the linear range portion of the PCR reaction.
  • the final concentration of the target DNA in the plateau portion of the curve is determined by the availability of reagents in the reaction mix and is independent of the original concentration of target DNA. Therefore, the sampling and quantifying of the amplified PCR products can be carried out when the PCR reactions are in the linear portion of their curves.
  • relative concentrations of the amplifiable cDNAs can be normalized to some independent standard, which may be based on either internally existing RNA species or externally introduced RNA species. The abundance of a particular mRNA species may also be determined relative to the average abundance of all mRNA species in the sample.
  • the PCR amplification utilizes internal PCR standards that are approximately as abundant as the target. This strategy is effective if the products of the PCR amplifications are sampled during their linear phases. If the products are sampled when the reactions are approaching the plateau phase, then the less abundant product may become relatively over-represented. Comparisons of relative abundances made for many different RNA samples, such as is the case when examining RNA samples for differential expression, may become distorted in such a way as to make differences in relative abundances of RNAs appear less than they actually are. This can be improved if the internal standard is much more abundant than the target. If the internal standard is more abundant than the target, then direct linear comparisons may be made between RNA samples.
  • RT-PCR is performed as a relative quantitative RT-PCR with an internal standard in which the internal standard is an amplifiable cDNA fragment that is larger than the target cDNA fragment and in which the abundance of the mRNA encoding the internal standard is roughly 5-100 times higher than the mRNA encoding the target.
  • This assay measures relative abundance, not absolute abundance of the respective mRNA species.
  • the relative quantitative RT-PCR uses an external standard protocol. Under this protocol, the PCR products are sampled in the linear portion of their amplification curves. The number of PCR cycles that are optimal for sampling can be empirically determined for each target cDNA fragment.
  • the reverse transcriptase products of each RNA population isolated from the various samples can be normalized for equal concentrations of amplifiable cDNAs. While empirical determination of the linear range of the amplification curve and normalization of cDNA preparations are tedious and time-consuming processes, the resulting RT-PCR assays may, in certain cases, be superior to those derived from a relative quantitative RT-PCR with an internal standard.
  • Nucleic acid arrays can also be used to detect and compare the expression patterns of CCI-779 activity genes in peripheral blood samples isolated at different CCI-779 treatment stages. Probes suitable for detecting the CCI-779 activity genes can be stably attached to known discrete regions on a support substrate. These probes maintain their positions relative to the respective discrete regions during hybridization and subsequent washes. Construction of nucleic acid arrays is well known in the art. Suitable substrates for making nucleic acid arrays include, but are not limited to, glasses, silica, ceramics, nylons, quartz wafers, gels, metals, papers, beads, tubes, fibers, films, membranes, column matrixes, or microtiter plate wells.
  • a nucleic acid array of the present invention can comprise at least 2, 5, 10,
  • probes for the same gene can be used on a single nucleic acid array. Examples of probes suitable for this invention are listed in the Qualifier Table. Probes for other drag activities genes can also be included in the nucleic acid arrays of this invention.
  • the probe density on the array can be in any range. For instance, the density may be 50, 100, 200, 300, 400, 500, or more probes/cm 2 .
  • a substantial portion of all polynucleotide probes on a nucleic acid array of the present invention are probes for CCI-779 or other drug activity genes. For instance, at least 10%, 20%, 30%, 40%, 50%, or more of all probes on the nucleic acid array can hybridize under stringent or nucleic acid array hybridization conditions to RNA transcripts, or the complements thereof, of drug activity genes.
  • nuclease protection assays are used to quantify
  • RNAs derived from the peripheral blood samples There are many different versions of nuclease protection assays. The common characteristic of these nuclease protection assays is that they involve hybridization of an antisense nucleic acid with the RNA to be quantified. The resulting hybrid double-stranded molecule is then treated with a nuclease which digests single-stranded nucleic acids more efficiently than double-stranded molecules. The amount of antisense nucleic acid that survives digestion is a measure of the amount of the target RNA species to be quantified.
  • An example of nuclease protection assays is the RNase protection assay manufactured by Ambion, Inc. (Austin, Texas).
  • the peripheral blood expression profiles of CCI-779 activity genes are determined by measuring the levels of polypeptides encoded by these genes.
  • Methods suitable for this purpose include, but are not limited to, immunoassays such as ELISA, RIA, FACS, dot blot, Western Blot, immunohistochemistry, and antibody-based radioimaging.
  • Other methods such as 2-dimensional SDS-polyacrylamide gel electrophoresis can also be used.
  • One exemplary method suitable for detecting the levels of target proteins in peripheral blood samples is ELISA.
  • ELISA antibodies capable of binding to the target proteins encoded by one or more CCI-779 activity genes are immobilized onto a selected surface exhibiting protein affinity, such as wells in a polystyrene or polyvinylchloride microtiter plate. Then, peripheral blood samples to be tested are added to the wells. After binding and washing to remove non-specif ⁇ cally bound immunocomplexes, the bound antigen(s) can be detected. Detection can be achieved by the addition of a second antibody which is specific for the target proteins and is linked to a detectable label.
  • Detection may also be achieved by the addition of a second antibody, followed by the addition of a third antibody that has binding affinity for the second antibody, with the third antibody being linked to a detectable label.
  • a second antibody followed by the addition of a third antibody that has binding affinity for the second antibody, with the third antibody being linked to a detectable label.
  • Befor-e being added to the microtiter plate, cells in the peripheral blood samples can be lysed using various methods known in the art. Proper extraction procedures can be used to separate the target proteins from potentially interfering substances.
  • the peripheral blood samples suspected of containing the target proteins are immobilized onto the well surface and then contacted with the antibodies of the invention. After binding and washing to remove non-specif ⁇ cally bound immunocomplexes, the bound antigen is detected. Where the initial antibodies are linked to a detectable label, the immunocomplexes can be detected directly. The immunocomplexes can also be detected using a second antibody that has binding affinity for the first antibody, with the second antibody being linked to a detectable label.
  • Another exemplary ELISA involves the use of antibody competition in the detection. In this ELISA, the target proteins are immobilized on the well surface.
  • the labeled antibodies are added to the well, allowed to bind to the target proteins, and detected by means of their labels.
  • the amount of the target proteins in an unknown sample is then determined by mixing the sample with the labeled antibodies before or during incubation with coated wells. The presence of the target proteins in the unknown sample acts to reduce the amount of antibody available for binding to the well and thus reduces the ultimate signal.
  • Different ELISA formats can have certain features in common, such as coating, incubating or binding, washing to remove non-specifically bound species, and detecting the bound immunocomplexes. For instance, in coating a plate with either antigen or antibody, the wells of the plate can be incubated with a solution of the antigen or antibody, either overnight or for a specified period of hours. The wells of the plate are then washed to remove incompletely adsorbed material. Any remaining available surfaces of the wells are then "coated” with a nonspecific protein that is antigenically neutral with regard to the test samples. Examples of these nonspecific proteins include bovine serum albumin (BSA), casein and solutions of milk powder.
  • BSA bovine serum albumin
  • the coating allows for blocking of nonspecific adsorption sites on the immobilizing surface and thus reduces the background caused by nonspecific binding of antisera onto the surface.
  • a secondary or tertiary detection means an also be used. After binding of a protein or antibody to the well, coating with a non-reactive material to reduce background, and washing to remove unbound material, the immobilizing surface is contacted with the control or clinical or biological sample to be tested under conditions effective to allow immunocomplex (antigen/antibody) formation. These conditions may include, for example, diluting the antigens and antibodies with solutions such as BSA, bovine garnma globulin (BGG) and phosphate buffered saline (PB$)/Tween and incubating the antibodies and antigens at room temperature for about 1 to 4 hours or at 4°C overnight. Detection of the immunocomplex then requires a labeled secondary binding ligand or antibody, or a secondary binding ligand or antibody in conjunction with a labeled tertiary antibody or third binding ligand.
  • BSA bovine garnma globulin
  • PB$ phosphate buffered saline
  • the contacted surface can be washed so as to remove non-complexed material.
  • the surface may be washed with a solution such as PBS/Tween, or borate buffer.
  • a solution such as PBS/Tween, or borate buffer.
  • the second or third antibody can have an associated label.
  • the label is an enzyme that generates color development upon incubating with an appropriate chromogenic substrate.
  • a urease glucose oxidase
  • alkaline phosphatase alkaline phosphatase
  • hydrogen peroxidase-conjugated antibody for a period of time and under conditions that favor the development of further immunocomplex formation (e.g., incubation for 2 hours at room temperature in a PBS-containing solution such as PBS-Tween).
  • the amount of label is quantified, e.g., by incubation with a chromogenic substrate such as urea and bromocresol purple or 2,2'-azido-di-(3-ethyl)- benzthiazoline-6-sulfonic acid (ABTS) and H 2 O 2 , in the case of peroxidase as the enzyme label.
  • Quantitation can be achieved by measuring the degree of color generation, e.g., using a spectrophotometer.
  • RIA radioimmunoassay
  • An exemplary RIA is based on the competition between radiolabeled-polypeptides and unlabeled polypeptides for binding to a limited quantity of antibodies.
  • Suitable radiolabels include, but are not limited to, I 125 .
  • a fixed concentration of I I25 -labeled polypeptide is incubated with a series of dilution of an antibody specific to the polypeptide.
  • the unlabeled polypeptide is added to the system, the amount of the I 125 -polypeptide that binds to the antibody is decreased.
  • a standard curve can therefore be constructed to represent the amount of antibody-bound I 125 -polypeptide as a function of the concentration of the unlabeled polypeptide. From this standard curve, the concentration of the polypeptide in unknown samples can be determined.
  • Suitable antibodies for this invention include, but are not limited to, polyclonal antibodies, monoclonal antibodies, chimeric antibodies, humanized antibodies, single chain antibodies, Fab fragments, or fragments produced by a Fab expression library. Methods for making these antibodies are well known in the art.
  • the antibodies of the present invention can bind to the corresponding CCI-779 activity gene products or other desired antigens with a binding affinity constant Ka of at least 10 4 M “1 , 10 5 M “1 , 10 6 M “1 , 10 7 M “1 , 10 8 M “1 , or more.
  • the antibodies of this invention can be labeled with one or more detectable moieties to allow for detection of antibody-antigen complexes.
  • the detectable moieties can include compositions detectable by spectroscopic, enzymatic, photochemical, biochemical, bioelectronic, immunochemical, electrical, optical or chemical means.
  • the detectable moieties include, but are not limited to, radioisotopes, chemiluminescent compounds, labeled binding proteins, heavy metal atoms, spectroscopic markers such as fluorescent markers and dyes, magnetic labels, linked enzymes, mass spectrometry tags, spin labels, electron transfer donors and acceptors, and the like.
  • the levels of polypeptides in peripheral blood samples can be determined by detecting the biological activities associated with the polypeptides. If a biological function/activity of a polypeptide is known, suitable in vitro bioassays can be designed to evaluate the biological function/activity, which in turn can be used to determine the amount of the polypeptide in the sample.
  • Comparison between the expression profile of a patient of interest and a reference expression profile can be conducted manually or electronically.
  • the reference expression profile can be a baseline expression profile representing gene expression in peripheral blood samples isolated prior to a drag treatment.
  • the reference profile can also be an expression profile in peripheral blood samples isolated after initiation of the drag treatment.
  • the reference expression profile can be determine using sample isolated from the patient of interest or other reference patient or patients. In many embodiments, the process or methodology that is used to determine the reference expression profile and the expression profile being compared is identical or comparable.
  • comparison is carried out by comparing each component in the expression profile of the patient of interest to the corresponding component in the reference expression profile(s).
  • the component can be the expression level of a drag activity gene, a ratio between the expression levels of two drag activity genes, or another measure capable of representing gene expression patterns.
  • the expression level of a gene can be an absolute level, or a normalized or relative level. The difference between two corresponding components can be assessed by fold changes, absolute differences, or other suitable means.
  • Comparison between expression profiles can also be conducted using pattern recognition or comparison programs.
  • serial analysis of gene expression SAGE
  • GEMTOOLS gene expression analysis program Incyte Pharmaceuticals
  • GeneCalling and Quantitative Expression Analysis technology Curagen
  • Other suitable methods, programs or systems can be used.
  • Multiple drag activity genes can be used in the comparison of expression profiles. For instance, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 30, 40, 50, or more drag activity genes can be used.
  • the drag activity gene(s) used in the comparison can be selected to have relatively small p- values or large differential expression ratios.
  • the drag activity genes used in the comparison have p-values (e.g., under an ANOVA analysis) of no greater than 0.05, 0.01, 0.001, 0.0005, 0.0001, or less.
  • the expression level after or during a drag treatment is increased or decreased by at least 2-fold, 3-fold, 4-fold, 5-fold, or more over the baseline expression level.
  • comparison of the expression profiles is performed electronically, such as by using a computer system.
  • the computer system includes a processor coupled to a memory or storage medium which stores data representing the expression profiles being compared.
  • the memory or storage medium is readable or rewritable.
  • the stored expression data can be changed, retrieved, or otherwise manipulated.
  • the memory can also store one or more programs capable of causing the processor to compare the expression profiles.
  • the processor is coupled to a nucleic acid array scanner which sends signals to the processor for analysis.
  • In vivo activities of other drags can be similarly monitored using the drug activities genes of the present invention, as appreciated by those skilled in the art.
  • At least a subset of the drag activity genes of the present invention may be used as disease regression indicators. This subset of genes may be responsive to the effect of CCI-779 or other drags on RCC or other diseases.
  • CCI-779 or another drug may cause regression of tumors, and the decrease in tumor size or activity may cause a decreased immune response which is reflected in gene expression changes in the peripheral blood cells. Consequently, the drag activity genes of the present invention can be used as molecular markers for monitoring the efficacy of CCI-779 or other drugs for treating RCC or other diseases. In many cases, diseases amendable to the present invention include those whose regression can produce changed immune responses.
  • PBMCs from the CCI-779 clinical trials were isolated from whole blood samples (8mL) and collected into CPT tubes according to the standard procedure. PBMCs were purified over Ficoll gradients, washed two times with PBS and counted. Total RNA was isolated from PBMC pellets using the RNeasy mini kit (Qiagen, Valencia, CA). Labeled target for oligonucleotide arrays was prepared using a modification of I the procedure described in Lockhart, et al, Nature Biotechnology 14: 1675-80 (1996). 2 ⁇ g total RNA was converted to cDNA by priming with an oligo-dT primer containing a T7 DNA polymerase promoter at the 5' end.
  • the cDNA was used as the template for in vitro transcription using a T7 DNA polymerase kit (Ambion, Woodlands, TX) and biotinylated CTP and UTP (Enzo). Labeled cRNA was fragmented in 40 mM Tris-acetate pH 8.0, 100 mM KOAc, 30 mM MgO Ac for 35 minutes at 94°C in a final volume of 40 ⁇ l.
  • Example 2 Hybridization to Affymetrix Microarrays and Detection of Fluorescence
  • Individual samples were hybridized to HgU95A genechip (Affymetrix). No samples were pooled. 110 RCC patients were involved in the study. Each patient received 25, 75, or 250 mg of CCI-779 once weekly through intravenous infusion. Blood samples were collected immediately before the first CCI-779 infusion, eight weeks after the first infusion, and sixteen weeks after the first infusion.
  • the sensitivity of detection of the arrays ranged between about 1:300,000 and 1:100,000 copies/million.
  • Labeled probes were denatured at 99°C for 5 minutes and then 45°C for 5 minutes, and hybridized to oligonucleotide arrays comprised of over 12,500 human gene probes (HgU95A, Affymetrix). Arrays were hybridized for 16 hours at 45°C.
  • the hybridization buffer was comprised of 100 mM MES, 1 M fNa + ], 20 mM EDTA, and 0.01% Tween 20. After hybridization, the cartridges were washed extensively with non-stringent wash buffer (6 x SSPET), such as three 10-minute washes at room temperature. These hybridization and wash conditions are herein collectively referred to as "nucleic acid array hybridization conditions.” The washed cartridges were then stained with phycoerythrin coupled to streptavidin.
  • 12 x MES stock contains 1.22 M MES and 0.89 M [Na + ].
  • the stock can be prepared by mixing 70.4 g MES free acid monohydrate, 193.3 g MES sodium salt and 800 ml of molecular biology grade water, and adjusting volume to 1000 ml. The pH should be between 6.5 and 6.7.
  • 2 x hybridization buffer can be prepared by mixing 8.3 mL of 12 x MES stock, 17.7 mL of 5 M NaCl, 4.0 mL of 0.5 M EDTA, 0.1 mL of 10% Tween 20 and 19.9 mL of water.
  • 6 x SSPET contains 0.9 M NaCl, 60 mM NaH 2 PO 4 , 6 mM EDTA, pH 7.4, and 0.005% Triton X-100.
  • the wash buffer can be replaced with a more stringent wash buffer.
  • 1000 ml stringent wash buffer can be prepared by mixing 83.3 mL of 12 x MES stock, 5.2 mL of 5 M NaCl, 1.0 mL of 10% Tween 2 ⁇ and 910.5 mL of water.
  • Example 3 Gene Expression Data Analysis
  • GENECHIP 3.2 software (Affymetrix). GeneChip 3.2 software uses an algorithm to calculate the likelihood as to whether a gene is "absent” or “present” as well as a specific hybridization intensity value or "average difference” for each transcript represented on the array. The algorithms used in these calculations are described in the Affymetrix GeneChip Analysis Suite User Guide (Affymetrix). The "average difference” for each transcript was normalized to "frequency” values according to the procedures of Hill et al, Science, 290: 809-812 (2000). This was accomplished by referring the average difference values on each chip to a calibration curve constructed from the average difference values for the 11 control transcripts with known abundance that were spiked into each hybridization solution. This process also served to normalize between arrays.
  • genes that were designated "absent" by the GENECHIP 3.2 software in all samples were excluded from the analysis.
  • Drug modulable transcripts in the surrogate tissue can provide early evidence of drag exposure in vivo.
  • PBMC ex vivo assays that mimic peripheral blood gene expression in disease states as well as the in vivo response to drag treatments can further validate the surrogate markers identified in in vivo studies.
  • ex vivo assays that overlay with disease-associated biomarkers may provide evidence for markers of drag-dependent disease amelioration.
  • PBMC seeding density a percentage of cells in culture media and other conditions.
  • vehicle control a measure of the amount of cells in culture media and other conditions.
  • dose response e.g., preincubation, coincubation, or postincubation.
  • dose schedule e.g., preincubation, coincubation, or postincubation.
  • phytohemagglutinin was used to stimulate gene expression in PBMCs.
  • the genes whose expression can be stimulated by PHA in PBMCs had a substantial overlap with disease-associated transcripts in RCC PBMCs that appear to result from T-cell activation.
  • PHA-stimulated PBMCs mimicked anti- CD3/anti-CD28 stimulated T-cell profiles in BioExpress.
  • CCI-779 inhibited certain PHA-inducible transcripts in a dose-dependent manner. The inhibition by CCI-779 appeared to be specific, rather than global.
  • CTSL, SCYA2, FABP5, SCYA7, ATP2B1, and IL1R1 genes were directly repressed by 300 nm CCI-779 in PHA- stimulated PBMCs in an ex vivo assay.
  • Databases of disease-associated profiles and ex vivo activation signatures can be created to identity most appropriate culture conditions for identification of drag modulated biomarkers in specific disease settings.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Pathology (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Physics & Mathematics (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Hospice & Palliative Care (AREA)
  • Biophysics (AREA)
  • Oncology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

Methods, systems and equipment useful for monitoring in vivo activities of CCI-779 or other drugs. Numerous drug activity genes can be identified by the present invention. The expression profiles of these genes in peripheral blood mononuclear cells can be modulated by CCI-779 or other drugs. Therefore, these genes can be used as surrogate markers for monitoring drug activities in vivo.

Description

METHODS FOR MONITORING DRUG ACTIVITIES IN VIVO
[0001] This application incorporates by reference the entire disclosures of U.S.
Provisional Application Serial No. 60/446,133, filed February 11, 2003 and entitled "Methods for Monitoring Drug Activities in Vivo," U.S. Provisional Application Serial No. 60/459,782, filed April 3, 2003 and entitled "Methods for Diagnosing RCC and/or Solid Tumors," and U.S. Provisional Application, filed January 23, 2004 and entitled "Methods for Prognosis and Treatment of Solid Tumors" (by Michael Burczynski, et al). In addition, this application incorporates by reference all materials recorded in compact discs labeled "Copy 1 - Sequence Listing Part," "Copy 2 - Sequence Listing Part" and "Copy 3 - Sequence Listing Part," each of which includes "Sequence Listing.ST25.txt" (3,647 KB, created February 9, 2004). Furthermore, this application incorporates by reference all materials recorded in compact discs labeled "Copy 1 - Tables Part," "Copy 2 - Tables Part" and "Copy 3 - Tables Part," each of which includes "Table 1 (the Qualifier Table).txt" (210 KB, created February 9, 2004).
TECHNICAL FIELD
[0002] This invention relates to methods, systems and equipment useful for monitoring in vivo activities of CCI-779 or other drugs.
BACKGROUND
[0003] CCI-779 is an ester analog of the immunosuppressant rapamycin and as such is a potent, selective inhibitor of the mammalian target of rapamycin. The mammalian target of rapamycin (mTOR) activates multiple signaling pathways, including phosphorylation of p70s6kinase, which results in increased translation of 5' TOP mRNAs encoding proteins involved in translation and entry into the GI phase of the cell cycle. By virtue of its inhibitory effects on mTOR and cell cycle control, CCI-779 may function as a cytostatic and immunosuppressive agent. CCI-779 has been used as an anticancer drug and is currently being evaluated for indications in clinical trials for treating various oncology and inflammatory diseases. These diseases include, but are not limited to, renal cell carcinoma (RCC), prostate cancer, gliomas, lung cancer, and non-Hodgkin's lymphoma.
SUMMARY OF THE INVENTION SUMMARY OF THE INVENTION
[0004] One of the main objectives of clinical pharmacogenomic studies is to identify suitable markers for monitoring in vivo activities of CCI-779 or other drugs. The present invention employs easily-obtained tissues, such as peripheral blood, as surrogate tissues for the detection of in vivo activities of CCI-779 or other drugs.
[0005] In one aspect, the present invention provides methods that are useful for detecting in vivo activities of CCI-779 or other drugs. The methods include comparing an expression profile of at least one drug activity gene in a peripheral blood sample of a patient of interest to a reference expression profile of the gene, where the gene is differentially expressed in peripheral blood mononuclear cells (PBMCs) of patients who have a non- blood disease and are subject to a drug therapy as compared to PBMCs isolated from the patients prior to the drug therapy. In many embodiments, the patient of interest has the non- blood disease and is being treated by the drug therapy.
[0006] In one embodiment, the drug therapy is an anti-cancer therapy, such as CCI-
779 therapy, and the non-blood disease is a solid tumor, such as RCC, prostate cancer, or head/neck cancer. In another embodiment, the drug activity genes of the present invention are selected from Table 5.
[0007] The peripheral blood samples used in the present invention can be, without limitation, whole blood samples or samples comprising enriched PBMCs. Other peripheral blood samples that include PBMCs can also be employed in the present invention. [0008] The expression profiles of the drug activity genes can be determined by various means, such as quantitative RT-PCR, Northern Blot, in situ hybridization, slot- blotting, nuclease protection assay, nucleic acid arrays, enzyme-linked immunosorbent assays (ELISAs), radioimmunoassay (RIAs), fluorescence-activated cell sorters (FACSs), or Western Blots. In addition, two-dimensional SDS-polyacrylamide gel electrophoresis and other high-through nucleic acid or protein detection techniques can also be used. [0009] In many embodiments, the reference expression profile and the expression profile being compared are prepared using the same or comparable methodology. In one example, the reference expression profile is an average baseline expression profile of drug activity genes in peripheral blood samples isolated from patient or patients prior to a drug treatment. In another example, the reference expression profile is an expression profile of drug activity genes in peripheral blood samples of the patient of interest. The reference expression profile and the expression profile being compared can be obtained from peripheral blood samples isolated at different time points in a drug treatment. [0010] In one embodiment, the drug activity genes used in the present invention are over-expressed (or under-expressed) in PBMCs of patient who have a non-blood disease as compared to PBMCs of humans who do not have the non-blood disease. The drug therapy being investigated can down-regulate (or up-regulate) the expression of the drug activity genes in PBMCs of patients who have the non-blood disease.
[0011] In another embodiment, the expression of the drug activity genes in PBMCs can be stimulated (or suppressed) by phytohemagglutinin (PHA). The drug therapy being investigated can down-regulate (or up-regulate) the expression of these genes in PHA- treated PBMCs.
[0012] In another aspect, the present invention provides other methods that are useful for detecting in vivo activities of CCI-779 or other drugs. The methods include comprises comparing an expression profile of at least one drug activity gene in a peripheral blood sample of a patient of interest to a reference expression profile of the gene, where the RNA transcript(s) of the drug activity gene can hybridize under stringent or nucleic acid array hybridization conditions to one or more qualifiers selected from the Qualifier Table. [0013] In yet another aspect, the present invention provides methods useful for identifying drug activity genes. The methods include detecting the gene expression profile in peripheral blood samples of patients who have a non-blood disease and are subject to a drug therapy, and comparing the gene expression profile to a baseline gene expression profile in peripheral blood samples isolated before the drug therapy. Drug activity genes whose expression levels in peripheral blood samples can be modulated by the drug therapy can therefore be identified.
[0014] In still another aspect, the present invention provides kits useful for detecting in vivo activities of CCI-779 or other drugs. In one embodiment, the kits include a plurality of polynucleotides, and each polynucleotide can hybridize under stringent or nucleic acid array hybridization conditions to an RNA transcript, or the complement thereof, of a different respective drug activity gene. In another embodiment, the kits include a plurality of antibodies, and each antibody can bind to a polypeptide encoded by a different respective drug activity gene. The drug activity genes can be selected, without limitation, from Table 5. [0015] In still yet another aspect, the present invention provides nucleic acid arrays useful for detecting in vivo activities of CCI-779 or other drugs. A substantial portion of all polypeptide probes on the nucleic acid array can hybridize under stringent or nucleic acid array hybridization conditions to RNA transcripts, or the complements thereof, of drug activity genes.
[0016] Other features, objects, and advantages of the present invention are apparent in the detailed description that follows. It should be understood, however, that the detailed description, while indicating preferred embodiments of the invention, are given by way of illustration only, not limitation. Various changes and modifications within the scope of the invention will become apparent to those skilled in the art from the detailed description.
DETAILED DESCRIPTION
[0017] The present invention provides methods useful for the detection of in vivo activities of CCI-779 or other drugs. Numerous drug activity genes can be identified by the present invention. The expression profiles of these genes in PBMCs can be modulated by CCI-779 or other drugs. Accordingly, these genes can be used as surrogate markers for monitoring drug activities in vivo. In one embodiment, the methods of the present invention include comparing the expression profile of at least one drug activity gene in a peripheral blood sample of a patient of interest to a reference expression profile of the same drag activity gene. The patient of interest has a non-blood disease, such as RCC, prostate cancer, or another solid tumor, and is being treated by a drug therapy. A change in the peripheral blood expression profile of the drug activity gene is indicative of the in vivo activity of the drug therapy. In many cases, the reference expression. profile can be determined by using baseline peripheral blood samples isolated from patients prior to the drug therapy. Peripheral blood samples amenable to the present invention include, but are not limited to, whole blood samples or samples comprising enriched PBMCs. Expression profiles of drug activity genes can be detected using a variety of methods, such as quantitative RT-PCT, Northern Blot, in situ hybridization, nucleic acid arrays, enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), FACS (fluorescence-activated cell sorter), or Western Blot. The drug activity genes of the present invention may also be used for assessing the efficiency of a drug therapy.
[0018] Various aspects of the invention are described in further detail in the following sections. The use of sections is not meant to limit the invention. Each section and subsection may apply to any aspect of the invention. In this application, the use of "or" means "and/or" unless stated otherwise. Also, the use of the singular includes the plural unless stated otherwise.
A. General Methods for Identifying Drug Activity Genes
[0019] The availability of the human genome sequence, together with new developments in technology, such as DNA microarrays, proteomics and computational biology, allows systemic gene expression studies for various diseases. The present invention employs the systematic gene expression analysis technique to identify genes whose expression in peripheral blood can be modulated by a therapeutic agent such as CCI- 779. These genes are herein referred to as "drug activity genes." The genes whose expression levels in peripheral blood can be modified by CCI-779 are refereed to as "CCI- 779 activity genes."
[0020] Drug activity genes can be identified by comparing peripheral blood gene expression profiles before and after a drug treatment. Numerous methods are available for detecting and comparing gene expression profiles.
[0021] For instance, gene expression profiles can be detected by measuring the levels of RNA transcripts in peripheral blood samples. In one embodiment, total RNAs or polyA RNAs are isolated from peripheral blood samples using conventional means. The isolated RNAs can be amplified to produce cDNAs or cRNAs. Peripheral blood gene expression profiles can be determined by measuring the amount of the amplified cDNAs or cRNAs.
[0022] Peripheral blood gene expression profiles can also be determined by measuring the levels of polypeptides in peripheral blood samples. The amounts of polypeptides in peripheral samples can be detected using various methods, such as ELISAs, RIAs, FACSs, Western Blots or other immunoassays. In addition, two-dimensional gel electrophoresis/mass spectrometry or other high-throughput protein sequencing and identification methods can be used.
[0023] In one embodiment, nucleic acid arrays are used for detecting or comparing gene expression profiles in peripheral blood samples isolated at different stages of a therapeutic treatment. Nucleic acid arrays allow for quantitative detection of the expression levels of a large number of genes at one time. Examples of nucleic acid arrays include, but are not limited to, Genechip® microarrays from Affymetrix (Santa Clara, CA), cDNA microarrays from Agilent Technologies (Palo Alto, CA), and bead arrays described in U.S. Patent Nos. 6,288,220 and 6,391,562.
[0024] The polynucleotides to be hybridized to nucleic acid arrays can be labeled with one or more labeling moieties to allow for detection of hybridized polynucleotide complexes. The labeling moieties can include compositions that are detectable by spectroscopic, photochemical, biochemical, bioelectronic, immunochemical, electrical, optical or chemical means. Exemplary labeling moieties include radioisotopes, chemiluminescent compounds, labeled binding proteins, heavy metal atoms, spectroscopic markers such as fluorescent markers and dyes, magnetic labels, linked enzymes, mass spectrometry tags, spin labels, electron transfer donors and acceptors, and the like. Unlabeled polynucleotides can also be employed. The polynucleotides can be DNA, RNA, or a modified form thereof.
[0025] Hybridization reactions can be performed in absolute or differential hybridization formats. In the absolute hybridization format, polynucleotides derived from one sample, such as a peripheral blood sample isolated from a cancer patient at a particular treatment stage, are hybridized to the probes in a nucleic acid array. Signals detected after the formation of hybridization complexes correlate to the polynucleotide levels in the sample. In the differential hybridization format, polynucleotides derived from two biological samples, such as one isolated from a cancer patient at a first stage of treatment and the other isolated from the same patient but at a second stage of treatment, are labeled with different labeling moieties. A mixture of these differently labeled polynucleotides is added to a nucleic acid array. The nucleic acid array is then examined under conditions in which the emissions from the two different labels are individually detectable. In one embodiment, the fluorophores Cy3 and Cy5 (Amersham Pharmacia Biotech, Piscataway N. J.) are used as the labeling moieties for the differential hybridization format. [0026] Signals gathered from nucleic acid arrays can be analyzed using commercially available software, such as those provide by Affymetrix or Agilent Technologies. Controls, such as for scan sensitivity, probe labeling and cDNA/cRNA quantitation, can be included in the hybridization experiments. In many embodiments, the nucleic acid array expression signals are scaled or normalized before being subject to further analysis. For instance, the expression signals for each gene can be normalized to take into account variations in hybridization intensities when more than one array is used under similar test conditions. Signals for individual polynucleotide complex hybridization can also be normalized using the intensities derived from internal normalization controls contained on each array. In addition, genes with relatively consistent expression levels across the samples can be used to normalize the expression levels of other genes. In one embodiment, the expression levels of the genes are normalized across the samples such that the mean is zero and the standard deviation is one. In another embodiment, the expression data detected by nucleic acid arrays are subject to a variation filter which excludes genes showing minimal or insignificant variation across all samples.
[0027] A variety of peripheral blood samples can be used in the present invention.
In one embodiment, the peripheral blood samples are whole blood samples. In another embodiment, the peripheral blood samples comprise enriched PBMCs. By "enriched," it means that the percentage of PBMCs in the sample is higher than that in whole blood. In many cases, the PBMC percentage in an enriched sample is at least 1, 2, 3, 4, 5 or more times higher than that in whole blood. In many other cases, the PBMC percentage in an enriched sample is at least 90%, 95%, 98%, 99%, 99.5%, or more. Blood samples containing enriched PBMCs can be prepared using any method known in the art, such as Ficoll gradients centrifugation or CPTs (cell purification tubes).
[0028] Peripheral blood samples used in the present can be isolated at any stage of a drug treatment (including baseline samples isolated before the drug treatment). For instance, the samples can be isolated from patients at 1 day, 2 days, 3 days, 4 days, 5 days, 6 days, 1 week, 2 weeks, 3 weeks, 4 weeks, 8 weeks, or 16 weeks after initiation of a drug treatment. Other time points can also be used for isolating blood samples for monitoring or assessing in vivo drug activities.
[0029] In many embodiments, the patients being treated by a drug therapy of interest have a non-blood disease, such as a solid tumor. Solid tumors amenable to the present invention include, but are not limited, RCC, prostate cancer, head/neck cancer, ovarian cancer, testicular cancer, brain tumor, breast cancer, lung cancer, colon cancer, pancreas cancer, stomach cancer, bladder cancer, skin cancer, cervical cancer, uterine cancer, and liver cancer. In one embodiment, the solid tumors have the following characteristics: (1) a mass of hyperproliferating cells of clonal origin, and (2) acquisition of an aggressively invasive phenotype, where cancer cells leave the tissue of origin and establish new tumor metastases at distant sites. In one example, the patients have RCC.
[0030] Any cancer or disease treatment can be evaluated by the present invention.
Exemplary cancer treatments include the use of cytokines, such as mterferon or interleukin 2. In addition, chemotherapy drugs can be used, either individually or in combination with other drugs, cytokines or therapies. Suitable chemotherapy drags include, but are not limited to, CCI-779, AN-238, vinblastine, floxuridine, 5-fluorouracil, and tamoxifen. AN238 is a cytotoxic agent which has 2-pyrrolinodoxorubicin linked to a somatostatin (SST) carrier octapeptide. AN238 can be targeted to SST receptors on the surface of RCC tumor cells. Moreover, monoclonal antibodies, antiangiogenesis drags, and anti-growth factor drugs can be employed to treat cancers. ,
[0031] The gene expression profile in peripheral blood samples isolated at one stage of a drag treatment can be compared to that at another stage of the drag treatment. Drug activity genes that are differentially expressed in PBMCs at one stage of the treatment relative to another stage of treatment can therefore be identified. In one embodiment, the PBMC expression level of a drag activity gene is substantially higher at one stage than at another stage. For instance, an average PBMC expression level of a drag activity gene at one stage can be at least 1.5, 2, 3, 4, 5, 10, 20, or more times of that at another stage. In another embodiment, the PBMC expression level of a drag activity gene is substantially lower at one stage than at another stage. For instance, an average PBMC expression level of a drag activity gene at one stage can be no greater than 0.67, 0,5, 0.33, 0.25, 0.1, 0.05, or less times of that at another stage.
[0032] In yet another embodiment, drug activity genes can be identified using clustering algorithms based on the nucleic acid array gene expression data. For instance, unsupervised cluster analyses can be used to analyze and categorize genes which have differential expression patterns between different drug treatment stages. Algorithms for unsupervised cluster analysis include, but are not limited to, self-organized maps (SOMs), principle component analysis, average linkage clustering, and hierarchical clustering. [0033] Supervised cluster analysis can also be used to organize and identify drag activity genes. Algorithms for supervised cluster analysis include, but are not limited to, nearest neighbors test, support vector machines, and SPLASH. Either two-class or multi- class correlation metrics can be used.
B. Identification of CCI-779 Activity Genes
[0034] In one embodiment, HG-U95Av2 gene chips (manufactured by Affymetrix) were used for detecting and comparing the levels of RNA transcripts in PBMC-enriched peripheral blood samples. Peripheral blood samples were isolated from RCC patients at different stages of CCI-779 treatment (See Example 1). The CCI-779 treatment included intravenous administration of 25, 75 or 250 mg of CCI-779 once weekly. Peripheral blood samples were isolated from RCC patients immediately before the initial admi stration of CCI-779, and then 8 weeks and 16 weeks thereafter.
[0035] cRNA were prepared from the isolated PBMC samples and hybridized to
HG-U95Av2 genechips. Hybridization signals were collected for each oligonucleotide probe on the genechips. Signals from the oligonucleotide probes of the same qualifier were averaged. Qualifiers that produce different hybridization signals in samples isolated at different treatment time points were identified. Examples of these identified qualifiers are illustrated in Table 1 ("the Qualifier Table").
[0036] In general, each qualifier in the Qualifier Table corresponds to at least one
CCI-779 activity gene, and the RNA transcripts of the gene can hybridize under stringent or nucleic acid array hybridization conditions to the qualifier. As used herein, "hybridize to a qualifier" means to hybridize to at least one oligonucleotide probe of the qualifier. In many embodiments, the RNA transcripts of a CCI-779 activity gene can hybridize under stringent or nucleic acid array hybridization conditions to at least 2, 4, 6, 8, 10, 12, 14 or 16 oligonucleotide probes of the corresponding qualifier.
[0037] Table 2 lists the expression profiles of some qualifiers that produced different hybridization signals for PBMC samples isolated at different treatment stages. Each expression profile in Table 2 ("Baseline," "8wks Average," or "16wks Average") was an average of PBMC samples of 110 RCC patients. Each expression profile under "Baseline," "8wks Average," and "16wks Average" represented the hybridization signals on the respective qualifier for PBMC samples isolated immediately before the initial CCI-779 administration, 8 weeks after the initial administration, and 16 weeks after the initial administration, respectively. The p-value of an ANOVA analysis was provided for each qualifier. The p-value suggests the statistical significance of the difference observed between the expression profiles obtained at different CCI-779 treatment stages. Lesser p- values indicate more statistical significance for the differences observed between different treatment stages.
Table 2. PBMC Gene Expression Profiles at Different CCI-779 Treatment Stages
[0038] Each qualifier has a corresponsding CCI-779 activity gene probe sequence
(CPS), and each CPS can be derived from a corresponding SEQ ID NO depicted in Table 3. In many cases, each CPS consists of an unambiguous fragment, or the complement thereof, of the corresponding SEQ ID NO. In many other cases, each CPS also comprises at least one oligonucleotide probe of the corresponding qualifier. Each SEQ ID NO in Table 3 is a cDNA or genomic sequence, or the complement thereof, of a CCI-779 activity gene represented by the qualifier that corresponds to the SEQ ID NO. Accordingly, each SEQ ID NO or its corresponding CPS can hybridize 'under stringent or nucleic acid array hybridization conditions to the RNA transcripts, or the complements thereof, of the represented CCI-779 activity gene.
[0039] Each SEQ ID NO may have an Entrez nucleotide sequence database accession number (see Table 4). The Entrez nucleotide sequence database is maintained by the National Center of Biotechnology Information (NCBI), National Library of Medicine, Washington, DC. The database collects sequences from several sources, including GenBank, RefSeq, and PDB.
[0040] Any ambiguous residue ("n") in a SEQ ID NO can be determined by numerous methods. In one embodiment, the ambiguous residues in a SEQ ID NO are determined by aligning the SEQ ID NO to a corresponding genomic sequence obtained from a human genome sequence database. In another embodiment, the ambiguous residues in a SEQ ID NO are determined based on the sequence of the corresponding Entrez accession number. In yet another embodiment, the ambiguous residues are determined by re-sequencing the SEQ ID NO. In general, each "n" position in a SEQ ID NO represents at least one nucleotide selected from a, t, g, and c, or contains no nucleotide residue.
Table 3. Qualifiers and Corresponding CPSs and SEQ ID NOs
Table 4. SEQ ID Nos and Corresponding Entrez Accession Nos
SEQ ID NO Entrez Accession No. Reported Source of Entrez Sequence
Homo sapiens c33.28 unnamed HERV-H
263 U92818 protein mRNA
264 AF010316 Homo sapiens Pigl2 (PIG12) mRNA
265 X54486 Human gene for Cl -inhibitor
266 D64142 Human mRNA for histone Hl
267 M15330 Human interleukin 1-beta (IL1B) mRNA
Human placental tissue factor (two forms)
268 J02931 mRNA
Human MAP kinase phosphatase (MKP-2)
269 U48807 mRNA
Homo sapiens human endogeneous
270 M27826 retrovirus RTVL-H neutral protease large subunit mRNA
Homo sapiens mRNA; cDNA
271 AL049250 DKFZp564D113 (from clone
DKFZp564D113)
Homo sapiens mRNA for KIAA0310
272 AB002308 protein
275 AI817548
276 M28130 Human interleukin 8 (IL8) gene
H.sapiens mRNA for FALL-39 peptide
278 Z38026 antibiotic
Human mRNA for S-adenosylmethionine
280 D49357 synthetase
Human lung cytochrome P450 (IV
281 J02871 subfamily) Bl protein
282 M55405 Homo sapiens mucin (MUC-3) mRNA
283 W27095
Homo sapiens chemokine exodus- 1
284 U64197 mRNA
H.sapiens mRNA for interleukin- 1
285 X52015 receptor antagonist
H.sapiens mRNA for protein encoded by a
286 X95808 candidate gene, DXS6673E, for mental retardation
Homo sapiens trans-golgi network
288 AF027516 glycoprotein 51 (TGN) mRNA
[0041] CCI-779 activity genes represented by each qualifier in Table 2 can be identified based on the HG-U95Av2 gene chip annotation provided by Affymetrix. Genes thus identified are illustrated in Table 5. CCI-779 activity genes can also be determined based on the corresponding Entrez accession numbers. In addition, CCI-779 activity genes can be determined by BLAST searching the corresponding CPSs, or the unambiguous segments of the corresponding SEQ ID NOs, against a human genome sequence database. Suitable human genome sequence databases for this pmpose include, but are not limited to, the NCBI human genome database. The NCBI also provides BLAST programs, such as "blastn," for searching its sequence databases.
Table 5. CCI-779 Activity Genes
[0042] In one embodiment, the BLAST search of the NCBI human genome database is conducted by using CPSs. Gene(s) that aligns to a given CPS with at least 95% sequence identity can be identified. In many cases, the identified gene(s) has at least 96%, 97%, 98%, 99%, or more sequence identity with the CPS. The results of the BLAST search are detailed below.
[0043] CPS 1 corresponds to DKFZP564E1962 (TXNDC) which encodes thioredoxin domain-containing. This gene has LocusID: 81542, and is located on chromosome 14 with reported cytogenetic location 14q21.3. The gene resides in genomic locus NT_025892 (NCBI Genome Annotation). The gene product is a member of the thioredoxin family.
[0044] CPS 1 also has 86-90% sequence identity with an intron sequence of GK003 which encodes GK003 protein. GK003 has LocusID: 57002, and is located on chromosome 7 with reported cytogenetic location 7pl5.2. In addition, fragments of CPS 1 align with a chromosomal region on chromosome 13 and an intron sequence of OATPRP4 with 87-95% sequence identity. The chromosomal region on chromosome 13 is located near INGl which encodes inhibitor of growth family, member 1, and has LocusID: 3621. OATPRP4 encodes organic anion transporter polypeptide-related protein 4, and has LocusID: 81796 with reported cytogenetic location 8ql3.1.
[0045] CPS 2 corresponds to ETR101 which encodes immediate early protein. This gene has LocusID: 9592, and is located on chromosome 19 with reported cytogenetic location 19pl3.13. The gene resides in genomic locus NT_031915 (NCBI Genome Annotation). Expression of immediate early protein can be induced by TPA stimulation in promyelocytic leukemia cell line HL-60 and in other leukemia cell lines. [0046] A fragment of CPS 2 (nucleotides! 309 to 492 of CPS 2) shows 98% sequence identity with an intron sequence of LOCI 69782. LOCI 69782 encodes a protein similar to transcription factor IIIA (Factor A) (TFIIIA), and has reported cytogenetic location 9p24.1.
[0047] CPS 3 corresponds to PFN1 which encodes profilin 1. This gene has
LocusID: 5216, and is located on chromosome 17 with reported cytogenetic location 17pl3.3. The gene resides in genomic locus NT_033299 (NCBI Genome Annotation). Profilin 1 is a ubiquitous actin monomer-binding protein belonging to the profilin family. It is thought to regulate actin polymerization in response to extracellular signals. Deletion of PFN1 gene is associated with Miller-Dieker syndrome.
[0048] Nucleotides 7 to 406 of CPS 3 align with various regions in the human genome with 86-93% sequence identity. These regions include LOC163511, COAS3, a region near LOG 149010, a region near LOC200030, an intron sequence of DKFZp434D177, FLJ20719, LOC199970, and LOC206456. LOC163511 encodes a protein similar to profilin I, and has reported cytogenetic location lq23.2. COAS3 encodes chromosome 1 amplified sequence 3, and has LocusID: 200025 with reported cytogenetic location lql2. LOC149010 encodes a protein similar to hypothetical protein DKFZp434D177, and has reported cytogenetic location lql2. LOC200030 encodes a protein similar to hypothetical protein DJ328E19.C1.1, and has reported cytogenetic location lql2. DKFZp434D177 encodes hypothetical protein DKFZp434D177, and has LocusID: 84224 with reported cytogenetic location lp36.12. FLJ20719 encodes hypothetical protein FLJ20719, and has LocusID: 55672 with reported cytogenetic location lp31. LOG 199970 has reported cytogenetic location lpl l.l. LOC206456 encodes a protein similar to chain P, structure of bovine beta-actin-profilin complex with actin bound Atp phosphates solvent accessible. LOC206456 is located on chromosome 6. [0049] CPS 4 corresponds to APOBEC1L (APOBEC3C) which encodes a protein similar to APOBEC1 protein. APOBEC1L gene has LocusID: 27350, and is located on chromosome 22 with reported cytogenetic location 22ql3.1-ql3.2. APOBEC1L gene product is similar to phorbolin (DJ742C19.2), and may catalyze hydrolytic deamination of cytidine nucleotides. The gene product contains a cytidine deaminase zinc-binding domain. [0050] CPS 4 also has 91% sequence identity with LOC200316 which encodes a protein similar to phorbolin 3 (APOBECl-like). LOC200316 has LocusID: 200316 with reported cytogenetic location 22ql3.1. In addition, CPS 4 aligns with two other regions on chromosome 22 with 89-92% sequence identity.
[0051] CPS 5 corresponds to CKAPl which encodes cytoskeleton-associated protein 1. This gene has LocusID: 1155, and is located on chromosome 19 with reported cytogenetic location 19ql3.11-ql3.12. The gene resides in genomic locus NT_011296 (NCBI Genome Annotation). Cytoskeleton-associated protein 1 associates with microtubules. It contains a glycine domain which plays a role in association with microtubules.
[0052] Affymetrix annotation suggests that CPS 6 corresponds to KIAA0220 which has LocusID: 23117 and is located at chromosome 16pl2.1. Blast search of the Entrez human genome sequence database shows that CPS 6 has 99% sequence identity to a region on chromosome 16. This region is near LOC255565, and resides in genomic locus NT_035368 (NCBI Genome Annotation).
[0053] In addition, fragments of CPS 6 align with various chromosomal regions with at least 90% sequence identity^ These regions include LOC220555, LOC124302, a chromosomal region near LOC220567, a chromosomal region near LOC254081, and a chromosomal region near LOC197363. LOC220555 encodes a protein similar to nuclear pore complex interacting protein, and has reported cytogenetic location 16pl 1.2. LOG 124302 encodes a protein similar to nuclear pore complex interacting protein, and has reported cytogenetic location lδpl l.l. LOC220567 encodes a protein similar to apolipoprotein B48 receptor, and is located at chromosome 16ql3. LOC254081 encodes a protein similar to group X secretory phospholipase A2 precursor (phosphatidylcholine 2- acylhydrolase GX) (GX sPLA2) (sPLA2-X), and is located on chromosome 16. LOC197363 encodes a protein similar to ataxin 2 related protein (isoform 1), and is located on chromosome 16. [0054] CPS 7 corresponds to CORO1A which encodes coronin, actin binding protein, 1A. This gene has LocusID: 11151, and is located on chromosome 16 with reported cytogenetic location 16ql3. The gene resides in genomic locus NT_033291 (NCBI Genome Annotation). The gene product (coronin 1 A) binds to actin, and may be involved in mitosis, cell motility, formation of phagocytic vacuoles and phagocytosis. Coronin 1 A has at least five WD domains.
[0055] CPS 8 corresponds to GNL1 which encodes guanine nucleotide binding protein-like 1. This gene has LocusID: 2794, and is 'located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation). The GNL1 gene, identified in the human major histocompatibility complex class I region, shows a high degree of similarity with its mouse counterpart. The GNL1 gene is located less than 2 kb centromeric to HLA-E, in the same transcriptional orientation. GNL1 is telomeric to HLA-B and HLA-C.
[0056] CPS 9 corresponds to AKR1A1 which encodes aldo-keto reductase family 1, member Al (aldehyde reductase). This gene has LocusID: 10327, and is located on chromosome 1 with reported cytogenetic location Ip33-p32. The gene resides in genomic locus NT_032972 (NCBI Genome Annotation). Aldehyde reductase (aldo-keto reductase family 1, member Al) reduces carbonyl-containing substrates, and may metabolize xenobiotics. It is a NADPH-dependent member of the aldo-keto reductase superfamily. [0057] CPS 10 corresponds to ID2 which encodes inhibitor of DNA binding 2, dominant negative helix-loop-helix protein. This gene has LocusID: 3398, and is located on chromosome 2 with reported cytogenetic location 2p25. The gene resides in genomic locus NT_005334 (NCBI Genome Annotation). The gene product is a member of the Id helix- loop-helix family of proteins, and may negatively regulate cell differentiation. [0058] CPS 10 also has 95% sequence identity with an intron sequence of PTPRG.
PTPRG encodes protein tyrosine phosphatase, receptor type, G, and has LocusID: 5793 with reported cytogenetic location 3p21-pl4. The protein encoded by PTPRG gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. The PTP encoded by PTPRG gene possesses an extracellular region, a single transmembrane region, and two tandem intracytoplasmic catalytic domains, and thus represents a receptor-type PTP. The extracellular region of this PTP contains a carbonic anhydrase-like (CAH) domain, which is also found in the extracellular region of PTPRBETA/ZETA. PTPRG gene is located in a chromosomal region that is frequently deleted in renal cell carcinoma and lung carcinoma, and thus is thought to be a candidate tumor suppressor gene.
[0059] In addition, CPS 10 aligns with a chromosomal region near LOG 140282 with
88% sequence identity. LOCI 40282 encodes a protein similar to translationally controlled tumor protein (TCTP) (p23) (Histamine-releasing factor) (HRF), and has reported cytogenetic location 21q21.3.
[0060] Affymetrix annotation suggests that CPS 11 corresponds to PSG7 which encodes pregnancy specific beta- 1 -gly coprotein 7 and has LocusID: 5676. PSG7 is located at chromosome 19ql3.2.
[0061] Blast search of the Entrez human genome sequence database shows that CPS
11 aligns with a region 3' to PSG1 with at least 98% sequence identity. PSG1 encodes pregnancy specific beta-1-glycoprotein, and has LocusID: 5669 with reported cytogenetic location 19ql3.2. The gene product is a member of the pregnancy-specific glycoprotein
(PSG) and CEA families.
[0062] Moreover, CPS 11 has 92-95% sequence identity with various regions on chromosome 19. They include PSG2, PSG9, a region near PSG3, and a region near PSG7.
PSG2 encodes pregnancy specific beta- 1 -glycoprotein 2, and has LocusID: 5670 with reported cytogenetic location 19ql3.1-ql3.2. PSG9 encodes pregnancy specific beta-1- glycoprotein 9, and has LocusID: 5678 with reported cytogenetic location 19ql3.2. PSG3 encodes pregnancy specific beta- 1 -glycoprotein 3, and has LocusID: 5671 with reported cytogenetic location 19ql3.2. PSG7 encodes pregnancy specific beta- 1 -glycoprotein 7, and has LocusID: 5676 with reported cytogenetic location 19ql3.2.
[0063] CPS 13 corresponds to IDH2 which encodes isocitrate dehydrogenase 2
(NADP+), mitochondrial. This gene has LocusID: 3418, and is located on chromosome 15 with reported cytogenetic location 15q26.1. The gene resides in genomic locus NT__033276
(NCBI Genome Annotation). Mitochondrial NADP(+)-specific isocitrate dehydrogenase 2 can decarboxylate isocitrate into alpha-ketoglutarate.
[0064] CPS 14 corresponds to RER1 which encodes a protein similar to S. cerevisiae RERl. This gene has LocusID: 11079, and is located on chromosome 1 with reported cytogenetic location lpter-q24. The gene resides in genomic locus NT_004350
(NCBI Genome Annotation). [0065] CPS 15 corresponds to TUBB which encodes tubulin, beta polypeptide. This gene has LocusID: 7280, and is located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_034880 (NCBI Genome
Annotation). Beta-tubulin polymerizes to form microtubules, and is a member of a family of structural proteins.
[0066] CPS 16 corresponds to HADH2 which encodes hydroxyacyl-Coenzyme A dehydrogenase, type II. This gene has LocusID: 3028, and is located on chromosome X with reported cytogenetic location Xpl 1.2. The gene resides in genomic locus NT_011799
(NCBI Genome Annotation). The gene product can bind to amyloid-beta peptide.
[0067] CPS 17 corresponds to NDUFB5 which encodes NADH dehydrogenase
(ubiquinone) 1 beta subcomplex, 5 (16kD, SGDH). This gene has LocusID: 4711, and is located on chromosome 3 with reported cytogenetic location 3q27.1. The gene resides in genomic locus NT_022396 (NCBI Genome Annotation). The gene product is a subunit of
NADH-ubiquinone oxidoreductase (complex I). It can transport electrons from NADH to ubiquinone.
[0068] CPS 17 also shows 77% sequence identity with a chromosomal region near
LOC205833. LOC205833 has reported cytogenetic location 4q34.3.
[0069] CPS 18 corresponds to APRT which encodes adenine phosphoribosyltransferase. This gene has LocusID: 353, and is located on chromosome 16 with reported cytogenetic location 16q24. The gene resides in genomic locus NT_010404
(NCBI Genome Annotation). Adenine phosphoribosyltransferase belongs to the purine/pyrimidine phosphoribosyltransferase family. This enzyme catalyzes the formation of AMP and inorganic pyrophosphate from adenine and 5-phosphoribosyl-l-pyrophosphate
(PRPP). It can also produce adenine as a by-product of the polyamine biosynthesis pathway. A homozygous deficiency in this enzyme may cause 2,8-dihydroxyadenine urolithiasis.
[0070] CPS 19 corresponds to PSME2 wliich encodes proteasome (prosome, macropain) activator subunit 2 (PA28 beta). This gene has LocusID: 5721, and is located on chromosome 14 with reported cytogenetic location 14qll.2. The gene resides in genomic locus NT_025892 (NCBI Genome Annotation).
[0071] CPS 19 also aligns with LOC220462 with 97% sequence identity.
LOC220462 encodes a protein similar to proteasome activator complex subunit 2
(Proteasome activator 28-beta subunit) (PA28beta) (PA28b) (Activator of multicatalytic protease subunit 2) (I IS regulator complex beta subunit) (REG-beta). LOC220462 has reported cytogenetic location 13ql4.11. In addition, CPS 19 has about 90-96% sequence identity with LOC257093, LOG 166868, and an intron sequence of LOC152940. LOC257093 encodes a protein similar to proteasome activator complex subunit 2 (Proteasome activator 28-beta subunit), and is located on chromosome 5. LOG 166868 encodes a protein similar to PA28beta, and has reported cytogenetic location 4pl4. LOCI 52940 has reported cytogenetic location 4q31.3-q32.1. Furthermore, fragments of CPS 17 show 86-97% sequence identity with LOC220220 and LOC206704. LOC220220 encodes a protein similar to PA28beta, and has reported cytogenetic location 1 Op 11.23. LOC206704 also encodes a protein similar to PA28beta, and is located on chromosome 8p21.1.
[0072] CPS 20 corresponds to MTCP1 which encodes mature T-cell proliferation 1.
This gene has LocusID: 4515, and is located on chromosome X with reported cytogenetic location Xq28. The gene resides in genomic locus NT_025965 (NCBI Genome Annotation). The gene product may be involved in the leukemogenic process of mature T cell proliferation.
[0073] CPS 21 corresponds to NDUFB3 which encodes NADH dehydrogenase
(ubiquinone) 1 beta subcomplex, 3 (12kD, B12). This gene has LocusID: 4709, and is located on chromosome 2 with reported cytogenetic location 2q31.3. The gene resides in genomic locus NT_005370 (NCBI Genome Annotation). The multisubunit NADH:ubiquinone oxidoreductase (complex I) is an enzyme complex in the electron transport chain of mitochondria. NDUFB3 gene product is a subunit of NADH-ubiquinone oxidoreductase (complex I), and can transport electrons from NADH to ubiquinone. [0074] CPS 21 also has about 96% sequence identity with NDUFB3P4.
NDUFB3P4 encodes NADH dehydrogenase (ubiquinone) 1 beta subcomplex, 3 (12kD, B12) pseudogene 4. It has LocusID: 93995, and is located on chromosome 14. The gene is located in an intron of WARS which encodes tryptophanyl-tRNA synthetase and has LocusID: 7453 with reported cytogenetic location 14q32.31.
[0075] Moreover, CPS 21 has about 86-91% sequence identity with NDUFB3P3, and intron sequence of KIAA0893, and an intron sequence of PPP6C. NDUFB3P3 encodes NADH dehydrogenase (ubiquinone) 1 beta subcomplex, 3 (12kD, B12) pseudogene 3. It has LocusID: 93996, and is located on chromosome 14. KIAA0893 encodes KIAA0893 protein, and has LocusID: 22911 with reported cytogenetic location lpl3.2. PPP6C encodes protein phosphatase 6, catalytic subunit, and has LocusID: 5537 with reported cytogenetic location 9q34.11.
[0076] CPS 23 corresponds to CTSL which encodes cathepsin L. This gene has
LocusID: 1514, and is located on chromosome 9 with reported cytogenetic location 9q21- q22. The gene resides in genomic locus NT_023935 (NCBI Genome Annotation). The protein encoded by CTSL gene is a lysosomal cysteine proteinase that plays a role in intracellular protein catabolism. Its substrates include collagen and elastin, as well as alpha-
1 protease inhibitor, a major controlling element of neutrophil elastase activity. The encoded protein has been implicated in several pathologic processes, including myofibril necrosis in myopathies and in myocardial ischemia, and in the renal tubular response to proteinuria. This protein is a member of the peptidase Cl family. At least two transcript variants encoding the same protein have been found for CTSL gene.
[0077] Fragments of CPS 23 have 84-88% sequence identity with LOCI 18945,
LOCI 19215, and LOC219343. LOCI 18945, LOCI 19215, and LOC219343 encode proteins similar to cathepsin L precursor (Major excreted protein) (MEP), and are located at chromosome 10q23.32, 10q21.1 and 10q23.2, respectively.
[0078] Affymetrix annotation suggests that CPS 24 corresponds to FGL2 which encodes fibrinogen-like 2 and has LocusID: 10875. FGL2 has the reported cytogenetic location at chromosome 7ql 1.23.
[0079] Blast search of the Entrez human genome database shows that CPS 24 aligns with the non-protein-coding strand of KIAA1505 with at least 98% sequence identity.
KIAA1505 encodes KIAA1505 protein, and has LocusID: 57639 with reported cytogenetic location 7ρl2.3. KIAA1505 resides in genomic locus NT_007933 (NCBI Genome
Annotation).
[0080] Affymetrix annotation suggests that CPS 25 corresponds to
DKFZP564C1940 which is also known as LRP10. LRP10 encodes low density lipoprotein receptor-related protein 10, and has LocusID: 26020. LRP10 is located at chromosome
14ql l.l.
[0081] Blast search of the Entrez human genome sequence database shows that fragments of CPS 26 have 84-98% sequence identity with various regions on chromosomes
1, 2, 4, 5, 7, 9, 10, 11, 12, 17, 18, X, or Y. For instance, nucleotides 164-283 and 359-589 have 95% sequence identity with an intron sequence of LOC121292. LOC121292 is located at chromosome 12q24.31, and resides in genomic locus NT_035241 (NCBI Genome Annotation). Nucleotides 253-323 and 359-589 of CPS 26 align with a region near LOC138924 with at least 95% sequence identity. LOC138924 encodes a protein similar to peptidyl-Pro cis trans isomerase, and is located at chromosome 9q21.31 and in genomic locus NT_008580 (NCBI Genome Annotation). Nucleotides 374-588 of CPS 26 have 93% sequence identity with a region between DIM1 and FLJ21172. DIMl encodes a protein similar to S. pombe diml+, and has LocusID: 10907 with reported cytogenetic location 18q23. FLJ21172 encodes hypothetical protein FLJ21172, and has LocusID: 79863 with reported cytogenetic location 18q23. Nucleotides 634-1116 of CPS 26 have 86% sequence identity with an intron sequence of MYOID. MYOID encodes myosin ID, and has LocusID: 4642 and reported cytogenetic location 17ql l-ql2.
[0082] CPS 27 corresponds to SEMA3F which encodes sema domain, immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin) 3F. This gene has LocusID: 6405, and is located on chromosome 3 with reported cytogenetic location 3p21.3. The gene resides in genomic locus NT_006014 (NCBI Genome Annotation). The semaphorins are a family of proteins that are involved in signaling. A typical family member has a secretion signal, a 500-amino acid sema domain, and 16 conserved cysteine residues. Sequence comparisons have grouped the secreted semaphorins into 3 general classes. Members of the semaphorin III family, including human semaphorin III, chicken collapsin, and mouse semaphorins A, D, and E, have a basic domain at the C terminus. SEMA3F gene product is a secreted member of the semaphorin III family. [0083] CPS 28 corresponds to BST2 which encodes bone marrow stromal cell antigen 2. This gene has LocusID: 684, and is located on chromosome 19 with reported cytogenetic location 19pl3.2. The gene resides in genomic locus NT_011295 (NCBI Genome Annotation). Bone marrow stromal cells are involved in the growth and development of B-cells. Bone marrow stromal cell antigen 2 may play a role in pre-B-cell growth and in rheumatoid arthritis.
[0084] Blast search of the Entrez human genome sequence database shows that CPS
29 overlaps or includes a chromosomal region between SLC7A1 and KIAA0774. SLC7A1 encodes solute carrier family 7 (cationic amino acid transporter, y+ system), member 1. SLC7A1 has LocusID: 6541, and is located on chromosome 13 with reported cytogenetic location 13ql2-ql4. KIAA0774 encodes KIAA0774 protein, and has LocusID: 23281. KIAA0774 is located on chromosome 13 with reported cytogenetic location 13ql2.2. Both SLC7A1 and KIAA0774 have genomic locus NT_009799 (NCBI Genome Annotation). CPS 29 is located 3' to the protein-coding regions of both genes. CPS 29 matches with the protein-coding strand of SLC7A1. SLC7A1 gene product can has strong similarity to murine Rec-1 (Atrcl), and can transport arginine, lysine and ornithine across the plasma membrane.
[0085] CPS 30 corresponds to UBE2L6 which encodes ubiquitin-conjugating enzyme E2L 6. This gene has LocusID: 9246, and is located on chromosome 11 with reported cytogenetic location 1 lql2. The gene resides in genomic locus NT_033903 (NCBI Genome Annotation). The gene product is a member of the ubiquitin-conjugating enzyme family, and can ubiquitinate cellular proteins and mark them for degradation. The gene product can also bind to Hect domains of E3 proteins.
[0086] CPS 31 corresponds to SLC11A1 which encodes solute carrier family 11
(proton-coupled divalent metal ion transporters), member 1. This gene has LocusID: 6556, and is located on chromosome 2 with reported cytogenetic location 2q35. The gene resides in genomic locus NT_005403 (NCBI Genome Annotation). The gene product is similar to murine Beg (Nrampl), and may control antimicrobial activity of macrophages. [0087] CPS 32 corresponds to CDKN1C which encodes cyclin-dependent kinase inhibitor IC (p57, Kip2). This gene has LocusID: 1028, and is located on chromosome 11 with reported cytogenetic location 1 lpl5.5. The gene resides in genomic locus NT_009368 (NCBI Genome Annotation). Cyclin-dependent kinase inhibitor IC is a tight-binding inhibitor of several GI cyclin/Cdk complexes and a negative regulator of cell proliferation. Mutations of CDKN1C are implicated in sporadic cancers and Beckwith-Wiedemann syndrome, suggesting that CDKN1C may be a tumor suppressor candidate. [0088] CPS 32 also has 98% sequence identity with a chromosomal region near
LOC256784. LOC256784 encodes a protein similar to cyclin-dependent kinase inhibitor IC (Cyclin-dependent kinase inhibitor P57) (P57KIP2), and is located on chromosome 11 with genomic locus NT_009368 (NCBI Genome Annotation).
[0089] CPS 33 corresponds to GAIP (RGS 19) which encodes regulator of G-protein signaling 19. This gene has LocusID: 10287, and is located on chromosome 20 with reported cytogenetic location 20ql3.3. The gene resides in genomic locus NT_011333 (NCBI Genome Annotation). G proteins mediate a number of cellular processes. The protein encoded by this gene belongs to the RGS (regulators of G-protein signaling) family and can interact with G protein, GAI3. G-protein signaling 19 is a guanosine triphosphatase-activating protein that may function to down-regulate Galpha i/Galpha q- linked signaling.
[0090] CPS 34 corresponds to TACC2 which encodes transforming, acidic coiled- coil containing protein 2. This gene has LocusID: 10579, and is located on chromosome 10 with reported cytogenetic location 10q26. The gene resides in genomic locus NT_030764 (NCBI Genome Annotation). Transforming acidic coiled-coil proteins are a conserved family of centrosome- and microtubule-interacting proteins that are implicated in cancer. The member encoded by TACC2 gene concentrates at centrosomes throughout the cell cycle, and it is a candidate breast tumor suppressor and biomarker for tumor progression. [0091] CPS 35 corresponds to CLNS1A which encodes chloride channel, nucleotide-sensitive, 1A. This gene has LocusID: 1207, and is located on chromosome 11 with reported cytogenetic location I lql3.5-ql4. The gene resides in genomic locus NT_033927 (NCBI Genome Annotation). The gene product is associated with a swelling- induced chloride channel, and may be involved in aqueous humor formation in the eye [0092] CPS 35 also has 88% sequence identity with an intron sequence of TDRKH.
TDRKH encodes tudor and KH domain-containing protein, and has LocusID: 11022. It is located at chromosome lq21. In addition, nucleotides 66-459 of CPS 35 have 89% sequence identity with a region near LOC221349. LOC221349 encodes a protein similar to chloride conductance regulatory protein ICln {I(Cln)) (Chloride channel, nucleotide sensitive 1A) (Chloride ion current inducer protein) (C1CI) (Reticulocyte PICIn), and has reported cytogenetic location 6pl l.2. Nucleotides 1-66 of CPS 35 aligns with LOC152922 with 98% sequence identity. LOCI 52922 encodes a protein similar to chloride conductance regulatory protein ICln (I(Cln)) (Chloride channel, nucleotide sensitive 1A) (Chloride ion current inducer protein) (C1CI) (Reticulocyte PICIn), and is located at chromosome 4q32.3. [0093] CPS 36 corresponds to DPYSL2 which encodes dihydropyrimidinase-like 2.
This gene has LocusID: 1808, and is located on chromosome 8 with reported cytogenetic location 8p22-p21. The gene resides in genomic locus NT_023666 (NCBI Genome Annotation). The gene product is a member of the dihydropyrimidinase family. [0094] CPS 37 corresponds to FCGR3A which encodes Fc fragment of IgG, low affinity Ilia, receptor for (CD 16). This gene has LocusID: 2214, and is located on chromosome 1 with reported cytogenetic location lq23. The gene resides in genomic locus NT_004668 (NCBI Genome Annotation). The gene product is type III Fc gamma receptor, and can associate with zeta chain of the T-cell receptor complex (CD3Z). The gene product is a member of the immunoglobulin superfamily
[0095] CPS 38 corresponds to UQCRC1 which encodes ubiquinol-cytochrome c reductase core protein I. This gene has LocusID: 7384, and is located on chromosome 3 with reported cytogenetic location 3p21.3. The gene resides in genomic locus NT_005990 (NCBI Genome Annotation). The gene product, core I protein, is a subunit of the ubiquinol-cytochrome c oxidoreductase in the mitochondrial respiratory chain. [0096] CPS 39 corresponds to GZMA which encodes granzyme A (granzyme 1, cytotoxic T-lymphocyte-associated serine esterase 3). This gene has LocusID: 3001, and is located on chromosome 5 with reported cytogenetic location 5ql l-ql2. The gene resides in genomic locus NT_006431 (NCBI Genome Annotation). Cytolytic T lymphocytes (CTL) and natural killer (NK) cells share the ability to recognize, bind, and lyse specific target cells. They are thought to protect their host by lysing cells bearing on their surface "nonself antigens, usually peptides or proteins resulting from infection by intracellular pathogens. GZMA gene product is a T cell- and natural killer cell-specific serine protease that may function as a common component necessary for lysis of target cells by cytotoxic T lymphocytes and natural killer cells.
[0097] CPS 41 corresponds to PSMB10 which encodes proteasome (prosome, macropain) subunit, beta type, 10. This gene has LocusID: 5699, and is located on chromosome 16 with reported cytogenetic location 16q22.1. The gene resides in genomic locus NT_010478 (NCBI Genome Annotation). The gene can replace beta subunit PSMB7 when cells are stimulated by interferon g.
[0098] CPS 42 corresponds to IDS which encodes iduronate 2-sulfatase (Hunter syndrome). This gene has LocusID: 3423, and is located on chromosome X with reported cytogenetic location Xq28. The gene resides in genomic locus NT_019686 (NCBI Genome Annotation). Iduronate-2-sulfatase is involved in the lysosomal degradation of heparan sulfate and dermatan sulfate. Mutations in this X-chromosome gene that result in enzymatic deficiency may lead to the sex-linked Mucopolysaccharidosis Type II, also known as Hunter Syndrome. Iduronate-2-sulfatase has a sequence homology with human arylsulfatases A, B, and C, and human glucosamine-6-sulfatase. A splice variant of this gene has been described. [0099] Affymetrix annotation suggests that CPS 43 corresponds to AHNAK which encodes AHNAK nucleoprotein (desmoyokin) and has LocusID: 195. The gene is located at chromosome 11 ql 2-ql 3.
[0100] Blast search of the Entrez human genome sequence database indicates that
CPS 43 aligns with a region 3' to LOC221087 with 100% sequence identity. LOC221087 is hypothetical gene supported by M80899. The gene is located on chromosome l lql3.1, and resides in genomic locus NT_033241 (NCBI Genome Annotation). [0101] CPS 44 corresponds to PTPN7 which encodes protein tyrosine phosphatase, non-receptor type 7. This gene has LocusID: 5778, and is located on chromosome 1 with reported cytogenetic location lq32.1. The gene resides in genomic locus NT_034408 (NCBI Genome Annotation). The gene product is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. PTPN7 gene is expressed in a variety of hematopoietic cells, and is an early response gene in lymphokine stimulated cells. The noncatalytic N-terminus of PTPN7 gene product can interact with MAP kinases and suppress the MAP kinase activities. The gene product is also shown to be involved in the regulation of T cell antigen receptor (TCR) signaling, which is thought to function through dephosphorylating the molecules related to MAP kinase pathway. At least three alternatively spliced transcript variants of this gene, which encode at least two distinct isoforms, have been reported. [0102] CPS 45 corresponds to DUSP2 wliich encodes dual specificity phosphatase
2. This gene has LocusID: 1844, and is located on chromosome 2 with reported cytogenetic location 2qll. The gene resides in genomic locus NT_026970 (NCBI Genome Annotation). The protein encoded by DUSP2 is a member of the dual specificity protein phosphatase subfamily. These phosphatases can inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and differentiation. Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. DUSP2 gene product can inactivate ERK1 and ERK2, is expressed in hematopoietic tissues, and is localized in the nucleus. [0103] Affymetrix annotation suggests that CPS 46 corresponds to PSMB9 which encodes proteasome (prosome, macropain) subunit, beta type, 9 (large multifunctional protease 2). PSMB9 has LocusID: 5698, and is located at chromosome 6p21.3. [0104] Blast search of the Entrez human genome sequence database indicates that
CPS 46 shows 100% sequence identity with three regions near PSMB8. These regions are within genomic locus NT_007592 (NCBI Genome Annotation). PSMB8 encodes proteasome (prosome, macropain) subunit, beta type, 8 (large multifunctional protease 7). It has LocusID: 5696 and reported cytogenetic location 6p21.3.
[0105] Blast search of the Entrez human genome sequence database shows that CPS
47 overlaps or includes an intron sequence of HANI 1. HANI 1 encodes WD-repeat protein, and has LocusID: 10238 with reported cytogenetic location 17q21.33. HAN11 resides in genomic locus NT_035428 (NCBI Genome Annotation). HAN11 gene product has WD- repeats, and is similar to plant anl 1.
[0106] Blast search of the Entrez human genome sequence database also shows that
CPS 48 overlaps or includes LILRBl. LILRBl encodes leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 1. LILRBl has LocusID: 10859, and is located on chromosome 19 with reported cytogenetic location 19ql3.4. LILRBl resides in genomic locus NT_011225 (NCBI Genome Annotation). The gene product (leukocyte immunoglobulin-like receptor Bl) contains immunoreceptor tyrosine- based inhibitory motifs, and can bind to cellular and viral MHC class I antigens. [0107] CPS 48 also aligns with LILRB2 with 91% sequence identity. LILRB2 encodes leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM domains), member 2. The gene has LocusID: 10288 and reported cytogenetic location 19ql3.4.
[0108] CPS 49 corresponds to SDR1 which encodes short-chain dehydrogenase/reductase 1. This gene has LocusID: 9249, and is located on chromosome 1 with reported cytogenetic location lp36.1. The gene resides in genomic locus NT_022041 (NCBI Genome Annotation). Short-chain dehydrogenase/reductase 1 can reduce all-trans- retinal during bleached visual pigment regeneration.
[0109] Affymetrix annotation suggests that CPS 50 corresponds to POLR2L which encodes polymerase (RNA) II (DNA directed) polypeptide L, 7.6kDa. POLR2L has LocusID: 5441, and is located at chromosome 1 lpl5. [0110] CPS 51 corresponds to SCYA2 which encodes small inducible cytokine A2
(monocyte chemotactic protein 1). This gene has LocusID: 6347, and is located on chromosome 17 with reported cytogenetic location 17ql l.2-q21.1. The gene resides in genomic locus NT_010799 (NCBI Genome Annotation). Cytokine A 2 is a chemotactic factor for monocytes.
[0111] CPS 52 corresponds to PAI2 (SERPINB2) which encodes serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 2. This gene has LocusID: 5055, and is located on chromosome 18 with reported cytogenetic location 18q21.3. The gene resides in genomic locus NT_033907 (NCBI Genome Annotation). The gene product is also known as plasminogen activator inhibitor II, and may function as a serine protease inhibitor. It is considered a member of the serpin family of serine protease inhibitors. [0112] Blast search of the Entrez human genome sequence database shows that CPS
53 overlaps or includes LOCI 58972. LOCI 58972 has reported cytogenetic location Xq28, and is located in genomic locus NT_026504 (NCBI Genome Annotation).
[0113] CPS 53 also aligns with a chromosomal region near LOC93052 with 93% sequence identity. LOC93052 has reported cytogenetic location Xq28.
[0114] Blast search of the Entrez human genome sequence database shows that CPS
54 overlaps or includes KIAA1564. KIAA1564 encodes KIAA1564 protein, and has LocusID: 57680 with reported cytogenetic location 14ql 1.1. KIAA1564 resides in genomic locus NT_025892 (NCBI Genome Annotation). CPS 54 matches with the non-protein- coding strand of KIAA1564.
[0115] Affymetrix annotation indicates that CPS 55 corresponds to JMJ.
Nucleotides 1997 to 198161 of AL021938 (SEQ ID NO:52) have 99% sequence identity with JMJ. JMJ encodes jumonji homolog (mouse), and has LocusID: 3720. The gene is located on chromosome 6 with reported cytogenetic location 6p24-p23 and genomic locus NT_007592 (NCBI Genome Annotation). JMJ is an ortholog of the mouse jumonji gene, which encodes a nuclear protein involved in mouse embryogenesis, including neural tube formation. Overexpression of mouse jumonji negatively regulates cell proliferation. The jumonji proteins contain a DNA-binding domain, called an AT-rich interaction domain (ARID), and share regions of similarity with human retinoblastoma-binding protein-2 and the human SMCX protein. Nucleotides 1997 to 198161 of AL021938 match with the non- protein-coding strand of JMJ. [0116] Nucleotides 1 to 66 of CPS 55 have 100% sequence identity with a chromosomal region in genomic locus NT_010463 (NCBI Genome Annotation). This chromosomal region is located 3 ' to LOC222499 which has reported cytogenetic location
16q21.
[0117] CPS 56 corresponds to UGP2 which encodes UDP-glucose pyrophosphorylase 2. This gene has LocusID: 7360, and is located on chromosome 2 with reported cytogenetic location 2pl4-pl3. The gene resides in genomic locus NT_005375
(NCBI Genome Annotation). The enzyme encoded by UGP2 is an intermediary in mammalian carbohydrate interconversions. It can transfer a glucose moiety from glucose-
1 -phosphate to MgUTP and form UDP-glucose and MgPPi.
[0118] Nucleotides 155 to 433 of CPS .56 have 96% sequence identity with
LOC253415. LOC253415 encodes a protein similar to UDP-glucose pyrophosphorylase 2
(UTP-glucose-1 -phosphate uridyltransferase) (UDP-glucose diphosphorylase) (UGPase 2).
LOC253415 is located on chromosome 2.
[0119] CPS 57 corresponds to KIAA0410. This gene has LocusID: 9818, and is located on chromosome 13 with reported cytogenetic location 13ql2.12. The gene resides in genomic locus NT_009799 (NCBI Genome Annotation).
[0120] CPS 58 corresponds to NDUFS7 which encodes NADH dehydrogenase
(ubiquinone) Fe-S protein 7 (20kD) (NADH-coenzyme Q reductase). This gene has
LocusID: 4727, and is located on chromosome 19 with reported cytogenetic location 19pl3.
The gene resides in genomic locus NT_011268 (NCBI Genome Annotation).
[0121] CPS 59 corresponds to KIAA0645. This gene has LocusID: 9681, and is located on chromosome 22 with reported cytogenetic location 22ql2.3. The gene resides in genomic locus NT_011520 (NCBI Genome Annotation).
[0122] CPS 60 corresponds to GSTP1 which encodes glutathione S-transferase pi.
This gene has LocusID: 2950, and is located on chromosome 11 with reported cytogenetic location l lql3. The gene resides in genomic locus NT_033241 (NCBI Genome
Annotation). Glutathione S-transferases (GSTs) are a family of enzymes that play a role in detoxification by catalyzing the conjugation of many hydrophobic and electrophilic compounds with reduced glutathione. The soluble GSTs are categorized into 4 main classes: alpha, mu, pi, and theta. The glutathione S-transferase pi gene (GSTP1) is a polymorphic gene encoding active, functionally different GSTP1 variant proteins that are thought to function in xenobiotic metabolism and play a role in susceptibility to cancer, and other diseases.
[0123] Nucleotides 180 to 558 have 86% sequence identity with a chromosomal region near DGKA. DGK encodes diacylglycerol kinase, alpha (80kD), and is located at chromosome 12ql3.3 with LocusID: 1606.
[0124] CPS 61 corresponds to DECR1 which encodes 2,4-dienoyl CoA reductase 1, mitochondrial. This gene has LocusID: 1666, and is located on chromosome 8 with reported cytogenetic location 8q21.3. The gene resides in genomic locus NT_034898 (NCBI Genome Annotation). The gene product is an accessory enzyme which participates in the beta-oxidation and metabolism of unsaturated fatty enoyl-CoA esters. [0125] CPS 62 corresponds to PLXNCl which encodes plexin Cl. This gene has
LocusID: 10154, and is located on chromosome 12 with reported cytogenetic location 12q23.3. The gene resides in genomic locus NT_009575 (NCBI Genome Annotation). Plexin Cl can function as a receptor for virally-encoded semphorin. It is a member of the plexin family.
[0126] CPS 63 corresponds to TUBA2 which encodes tubulin, alpha 2. This gene has LocusID: 7278, and is located on chromosome 13 with reported cytogenetic location 13ql l. The gene resides in genomic locus NT_009799 (NCBI Genome Annotation). Microtubules of the eukaryotic cytoskeleton perform essential and diverse functions and are composed of a heterodimer of alpha and beta tubulin. The genes encoding these microtubule constituents are part of the tubulin superfamily, which is composed of six distinct families. Genes from the alpha, beta and gamma tubulin families are found in all eukaryotes. The alpha and beta tubulins represent the major components of microtubules, while gamma tubulin plays a critical role in the nucleation of microtubule assembly. There are multiple alpha and beta tubulin genes and they are conserved among and between species. TUBA2 gene is an alpha tubulin gene that encodes a protein similar to the mouse testis-specific Tuba3 and Tuba7 gene products. TUBA2 gene is located in the 13ql l region, which is associated with the genetic diseases Clouston hidrotic ectodermal dysplasia and Kabuki syndrome. Alternative splicing has been observed for this gene and at least two variants have been identified.
[0127] CPS 63 has 95-96% sequence identity with H2-ALPHA and LOCI 12714.
H2-ALPHA encodes alpha-tubulin isotype H2-alpha, and has LocusID: 113457 with reported cytogenetic location 2q22.1. LOC112714 encodes a protein similar to alpha tubulin, and has LocusID: 112714 with reported cytogenetic location 2ql4.2. [0128] In addition, CPS 63 shows 85-90% sequence identity with a chromosomal region near MGC 16703. MGC 16703 encodes alpha tubulin-like, and is located at chromosome 22ql l.21 with LocusID: 113691. Fragments of CPS have 83-91% sequence identity with regions on chromosomes 1 and 22.
[0129] CPS 64 corresponds to NMEl which encodes non-metastatic cells 1, protein
(NM23A) expressed in. This gene has LocusID: 4830, and is located on chromosome 17 with reported cytogenetic location 17q21.3. The gene resides in genomic locus NT_010783 (NCBI Genome Annotation). NMEl was reported to have reduced mRNA transcript levels in highly metastatic cells. NMEl encodes the "A" isoform of nucleoside diphosphate kinase (NDK). NDK exists as a hexamer composed of the "A" (encoded by NMEl) and "B" (encoded by NME2) isoforrns. Mutations in NMEl have been identified in aggressive neuroblastomas. NMEl gene product may have a role in the transcriptional regulation of c- myc expression. [0130] Blast search of the Entrez human genome sequence database shows that CPS
65 overlaps or includes a chromosomal region between SLC11A1 and NLI-IF. Both genes are located on chromosome 2 with reported cytogenetic location 2q35 and genomic locus NT_005403 (NCBI Genome Annotation). SLC11A1 encodes solute carrier family 11 (proton-coupled divalent metal ion transporters), member 1, and has LocusID: 6556. SLC11A1 gene product is similar to murine Beg (Nrampl), and may control antimicrobial activity of macrophages. NLI-IF encodes nuclear LIM interactor-interacting factor, and has LocusID: 58190. NLI-IF gene product is similar to a region of S. cerevisiae plasma membrane phosphatase Psr2p. CPS 65 is located 3' to the protein-coding sequence of SLC11 Al and 5' to the protein-coding sequence of NLI-IF.
[0131] Affymetrix annotation suggests that CPS 66 corresponds to RNAHP which encodes RNA helicase-related protein and has LocusID: 11325. The gene has reported cytogenetic location at chromosome 17q22. [0132] Blast search of the Entrez human genome sequence database shows that CPS
66 aligns with a chromosomal region at chromosome 16ql3 with at least 95% sequence identity. This region resides in genomic locus NT_010498 (NCBI Genome Annotation), and is located near MT1G. MT1G encodes metallothionein 1G, and has LocusID: 4495. [0133] In addition, CPS 66 has 83-92% sequence identity with various regions on chromosomes 1, 4, 9, 16 and 20. These regions include MTIH, MT2P1, LOC255917, LOC127544, LOC149450, a region near MGC10731, and a region near MMP24. MTIH encodes metallothionein 1H, and has LocusID: 4496 with reported cytogenetic location 16ql3. MT2P1 encodes metallothionein 2 pseudogene 1 (processed), and is located at chromosome 4pl l-q21 (LocusID: 4503). MGC10731 has LocusID: 79363, and is located at chromosome lp36.13. LOC255917 encodes a protein similar to Metallothionein-IE (MT- 1E), and is located on chromosome 9. LOCI 27544 encodes a protein similar to dJ1174N9.1 (novel protein with IBR domain), and is located at chromosome lp34.3. LOC149450 has reported cytogenetic location lq42.3. MMP24 encodes matrix metalloproteinase 24 (membrane-inserted), and is located at chromosome 20ql l.2 with LocusID: 10893.
[0134] CPS 67 corresponds to ZNF198 which encodes zinc finger protein 198. This gene has LocusID: 7750, and is located on chromosome 13 with reported cytogenetic location 13ql l-ql2. The gene resides in genomic locus NT_009799 (NCBI Genome Annotation). Zinc-finger protein 198 contains zinc fingers.
[0135] Nucleotides 185-221 of CPS 67 have 100% sequence identity with an intron sequence of LOC205936. LOC205936 is located at chromosome 4pl6.2, and resides in genomic locus NT_006051 (NCBI Genome Annotation).
[0136] CPS 68 corresponds to PRCP which encodes prolylcarboxypeptidase
(angiotensinase C). This gene has LocusID: 5547, and is located on chromosome 11 with reported cytogenetic location 1 lql4. The gene resides in genomic locus NT_033927 (NCBI Genome Annotation). Prolylcarboxypeptidase (angiotensinase C) is a serine carboxypeptidase and can remove residues linked to proline.
[0137] CPS 70 corresponds to DKFZP586A0522 which encodes DKFZP586A0522 protein. This gene has LocusID: 25840, and is located on chromosome 12 with reported cytogenetic location 12qll. The gene resides in genomic locus NT_009782 (NCBI Genome Annotation). The gene product include a region with low sequence similarity to a region of S. cerevisiae Coq5p. DKFZP586A0522 overlaps with LOC196529 which is encodes a protein similar to DKFZP586A0522 protein.
[0138] CPS 71 corresponds to SFRS11 which encodes splicing factor, arginine/serine-rich 11. This gene has LocusID: 9295, and is located on chromosome 1 with reported cytogenetic location Ip21-p34. The gene resides in genomic locus NT 304464 (NCBI Genome Annotation). The gene product contains arginine/serine-rich domain and an RRM domain, and may have a role in pre-mRNA splicing.
[0139] Nucleotides 1 to 234 of CPS 71 have 89% sequence identity with an intron sequence of PEPP2. PEPP2 encodes phosphoinositol 3 -phosphate-binding protein-2, and has LocusID: 54477 with reported cytogenetic location 12pl2.
[0140] CPS 72 corresponds to RAC2 which encodes ras-related C3 botulinum toxin substrate 2 (rho family, small GTP binding protein, Rac2). This gene has LocusID: 5880, and is located on chromosome 22 with reported cytogenetic location 22ql3.1. The gene resides in genomic locus NT_011520 (NCBI Genome Annotation). The protein encoded by
RAC2 is a GTPase which belongs to the RAS superfamily of small GTP-binding proteins.
Members of this superfamily appear to regulate a diverse array of cellular events, including the control of cell growth, cytoskeletal reorganization, and the activation of protein kinases.
The RAC2 gene product may be a target for ADP ribosylation by the C3 subunit of botulinum toxin.
[0141] Nucleotides 400 to 487 of CPS 72 have 95% sequence identity with
STK17A. STK17A encodes serine/threonine kinase 17a (apoptosis-inducing), and has
LocusID: 9263 with reported cytogenetic location 7pl2-pl4.
[0142] CPS 73 corresponds to ZFP103 which encodes zinc finger protein 103 homolog (mouse). This gene has LocusID: 7844, and is located on chromosome 2 with reported cytogenetic location 2pl 1.2. The gene resides in genomic locus NT 315805 NCBI
Genome Annotation). The gene product contains a zinc-finger domain, and may be associated with membranous protein sorting.
[0143] CPS 74 corresponds to LOC51580 which encodes H-2K binding factor-2.
This gene has LocusID: 51580, and is located on chromosome 9. The gene resides in genomic locus NT_006316 (NCBI Genome Annotation). The gene product is a member of the recombination signal-sequence binding protein family. It is a transcription factor that binds to the NFkB site of MHC class I genes.
[0144] CPS 75 corresponds to NRDl which encodes nardilysin (N-arginine dibasic convertase). This gene has LocusID: 4898, and is located on chromosome 1 with reported cytogenetic location Ip32.2-p32.1. The gene resides in genomic locus NT_004424 (NCBI
Genome Annotation). N-arginine dibasic convertase (NRD convertase) is a zinc-dependent endopeptidase. It is a member of the insulinase family. [0145] Blast search of the Entrez human genome sequence database shows that CPS
76 overlaps or includes the 3' untranslated region of FLJ20154. FLJ20154 encodes hypothetical protein FLJ20154, and is located at chromosome 10q24.32 with LocusID: 54838. FLJ20154 resides in genomic locus NT_030059 (NCBI Genome Annotation). [0146] CPS 76 also has 85-88% sequence identity with an intron sequence of KIAA0103, a chromosome 13 region near LOCI 60822, and a chromosome 7 region near BAP29. KIAA0103 has LocusID: 9694 and reported cytogenetic location 8q23.1. LOCI 60822 encodes a protein similar to hypothetical protein FLJ12457, and is located at chromosome 13q31.1. BAP29 encodes B-cell receptor-associated protein BAP29, and is located at chromosome 7q22.2 with LocusID: 55973. Moreover, fragments of CPS 76 align with various other regions on chromosome 1, 6, 15 and 19 with 89-90% sequence identity. [0147] Affymetrix annotation suggests that CPS 77 corresponds to KIAA0906, also known as NUP210 which encodes nucleoporin 210. The gene has LocusID: 23225, and is located at chromosome 3p25.1.
[0148] Blast search of the Entrez human genome sequence database shows that CPS
77 aligns with FLJ22389 with at least 99% sequence identity. FLJ22389 has LocusID: 79985, and is located on chromosome 3 with reported cytogenetic location 3p25.1. The gene resides in genomic locus NT_005927 (NCBI Genome Annotation).
[0149] Affymetrix annotation suggests that CPS 78 corresponds to MT1F which encodes metallothionein IF (functional) and has LocusID: 4494. The gene is located at
16ql3.
[0150] Blast search of the Entrez human genome sequence database shows that CPS
78 has 100% sequence identity with a chromosomal region located 3' to the protein-coding sequence of MT1G. This chromosomal region, as well as MT1G, are within genomic locus NT_010498 (NCBI Genome Annotation) on chromosome 16. MT1G encodes metallothionein 1G, and has LocusID: 4495 with reported cytogenetic location 16ql3. [0151] Nucleotides 1 to 67 of CPS 78 have 90-95% sequence identity to various regions in genomic locus NT_010498. These regions include MT2A, LOC221228, MT1G and MT1L. MT2A encodes metallothionein 2 A, and has LocusID: 4502 with reported cytogenetic location 16ql3. LOC221228 is a hypothetical gene supported by AF495759. MT1G encodes metallothionein 1G. MT1L encodes metallothionein IL, and has LocusID: 4500 with reported cytogenetic location 16ql3. [0152] In addition, CPS 78, and certain fragments thereof, have 89-93% sequence identity with MT2P1, LOC127544, LOC255917, a chromosomal region near MMP24, a chromosomal region near MGC10731, and a chromosomal region near LOC149450. MT2P1 encodes metallothionein 2 pseudogene 1 (processed), and has LocusID: 4503 with reported cytogenetic location 4pl l-q21. LOC127544 encodes a protein similar to dJ1174N9.1 (novel protein with IBR domain), and is located at chromosome lp34.3. LOC255917 encodes a protein similar to metallothionein-IE (MT-1E), and is located on chromosome 9. MMP24 encodes matrix metalloproteinase 24 (membrane-inserted), and is located at chromosome 20ql l.2 with LocusID: 10893. MGC10731 encodes hypothetical protein MGC10731, and has LocusID: 79363 with reported cytogenetic location lp36.13. LOC149450 is located at chromosome lq42.3.
[0153] CPS 79 corresponds to P85SPR (ARHGEF7) which encodes Rho guanine nucleotide exchange factor (GEF) 7. This gene has LocusID: 8874, and is located on chromosome 13 with reported cytogenetic location 13q34. The gene resides in genomic locus NT_009952 (NCBI Genome Annotation). Rho GTPases are involved in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The protein encoded by P85SPR belongs to a family of cytoplasmic proteins that activate the Ras-like family of Rho proteins by exchanging bound GDP for GTP. The encoded protein may form a complex with G proteins and stimulate Rho- dependent signals. The protein can induce membrane ruffling. The gene product is also involved in Pak recruitment to Cdc42- and Racl -driven focal complexes. Multiple alternatively spliced transcript variants encoding different isoforms have been described for this gene.
[0154] CPS 80 corresponds to UNK_N53547 (MGC5508) which encodes hypothetical protein MGC5508. This gene has LocusID: 79073, and is located on chromosome 11 with reported cytogenetic location l lql3.1. The gene resides in genomic locus NT_033903 (NCBI Genome Annotation).
[0155] CPS 81 corresponds to GPRK5 which encodes G protein-coupled receptor kinase 5. This gene has LocusID: 2869, and is located on chromosome 10 with reported cytogenetic location 10q24-qter. The gene resides in genomic locus NT_008902 (NCBI Genome Annotation). G protein-coupled receptor kinases (GRKs) play a role in phosphorylating and regulating the activity of a variety of G protein-coupled receptors. G protein-coupled receptor kinase 5 can phosphorylate agonist-stimulated G protein-coupled receptors.
[0156] CPS 83 corresponds to SCYA7 which encodes small inducible cytokine A7
(monocyte chemotactic protein 3). This gene has LocusID: 6354, and is located on chromosome 17 with reported cytogenetic location 17ql l.2-ql2. The gene resides in genomic locus NT_010799 (NCBI Genome Annotation). Monocyte chemotactic protein 3 is a secreted chemokine which attracts macrophages, during inflammation and metastasis. It is a member of the C-C subfamily of chemokines 'which are characterized by having two adjacent cysteine residues. The protein is an in vivo substrate of matrix metalloproteinase
2, an enzyme which degrades components of the extracellular matrix. SCYA7 gene is part of a cluster of C-C chemokine family members on chromosome 17q.
[0157] CPS 84 corresponds to TUBA3 which encodes tubulin, alpha 3. This gene has LocusID: 7846, and is located on chromosome 12 with reported cytogenetic location
12ql2-12ql4.3. The gene resides in genomic locus NT_009526 (NCBI Genome
Annotation). There are multiple alpha and beta tubulin genes, which are conserved among species. TUBA3 encodes alpha tubulin and is similar to mouse and rat Tubal gene.
Northern blotting studies have shown the gene expression in morphologically differentiated neurologic cells. TUBA3 is one of three alpha-tubulin genes in a cluster on chromosome
12q.
[0158] CPS 84 also has 97% sequence identity with a chromosomal region located
3' to LOC134262. LOC134262 encodes a protein similar to alphaTub84B gene product.
The gene is located at 5pl 1, and resides in genomic locus NT_023098.
[0159] CPS 85 corresponds to SCML2 which encodes sex comb on midleg-like 2
(Drosophila). This gene has LocusID: 10389, and is located on chromosome X with reported cytogenetic location Xp22. The gene resides in genomic locus NT_011586 (NCBI
Genome Annotation). The gene product is similar to Drosophila Scm.
[0160] CPS 85 also aligns with SCML1 with 94% sequence identity. SCML1 encodes sex comb on midleg-like 1 (Drosophila). The gene has LocusID: 6322, and is located on chromosome X with reported cytogenetic location Xp22.2-p22.1. The gene resides in genomic locus NT_011586.
[0161] CPS 87 corresponds to IL1R1 which encodes interleukin 1 receptor, type I.
This gene has LocusID: 3554, and is located on chromosome 2 with reported cytogenetic location 2ql2. The gene resides in genomic locus NT_022171 (NCBI Genome Annotation). Type I interleukin- 1 receptor contains immniunoglobulin domains, and can bind to all three forms of interleukin- 1 (ILIA, IL1B, and IL1RN).
[0162] CPS 88 corresponds to UNK AL008729 (LOC221692) which encodes a protein similar to KIAA1733 protein. This gene is located on chromosome 6 with reported cytogenetic location 6p22.3. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation). LOC221692 is located within LOC51256 which has LocusID: 51256. CPS 88 matches with the non-protein-coding strand of LOC51256. [0163] CPS 89 corresponds to KIAA0191 which encodes KIAA0191 protein. This gene has LocusID: 23318, and is located on chromosome 1 with reported cytogenetic location lp32.3. The gene resides in genomic locus NT_004424 (NCBI Genome Annotation).
[0164] CPS 90 corresponds to EGFL5 which encodes EGF-like-domain, multiple 5.
This gene has LocusID: 1955, and is located on chromosome 9 with reported cytogenetic location 9q32-q33.3. The gene resides in genomic locus NT_017568 (NCBI Genome Annotation).
[0165] CPS 91 corresponds to DUSP1 which encodes dual specificity phosphatase
1. This gene has LocusID: 1843, and is located on chromosome 5 with reported cytogenetic location 5q34. The gene resides in genomic locus NT_023132 (NCBI Genome Annotation). The expression of DUSP1 gene can be induced in human skin fibroblasts by oxidative/heat stress and growth factors. The bacterially expressed and purified DUSP1 protein has intrinsic phosphatase activity, and can inactivate mitogen-activated protein (MAP) kinase in vitro by the concomitant dephosphorylation of both its phosphothreonine and phosphotyrosine residues. DUSP1 protein can also suppress the activation of MAP kinase by oncogenic ras in extracts of Xenopus oocytes. Thus, DUSP1 may play a role in the human cellular response to environmental stress as well as in the negative regulation of cellular proliferation.
[0166] CPS 92 corresponds to FBP1 which encodes fructose- 1 ,6-bisphosphatase 1.
This gene has LocusID: 2203, and is located on chromosome 9 with reported cytogenetic location 9q22.3. The gene resides in genomic locus NT_008476 (NCBI Genome Annotation). Fructose- 1,6-bisphosphatase 1 is a gluconeogenesis regulatory enzyme. It can catalyze the hydrolysis of fructose 1,6-bisphosphate to fructose 6-phosphate and inorganic phosphate. Fructose- 1,6-diphosphatase deficiency is associated with hypoglycemia and metabolic acidosis. [0167] CPS 93 corresponds to HRB which encodes HIV-1 Rev binding protein.
This gene has LocusID: 3267, and is located on chromosome 2 with reported cytogenetic location 2q36. The gene resides in genomic locus NT_005403 (NCBI Genome Annotation). HIV-1 Rev binding protein can interact with the activation domains of the HIV-1 Rev protein, and may be related to nucleoporins, a class of proteins that mediate nucleocytoplasmic transport. HIV-1 Rev binding protein binds to the Rev activation domain when Rev is assembled onto its RNA target and can significantly enhance Rev activity when over-expressed. The HRB gene can expressed as a major 2.8-kb and a minor 4.6-kb mRNA.
[0168] CPS 94 corresponds to NAGA which encodes N-acetylgalactosaminidase, alpha-. This gene has LocusID: 4668, and is located on cliromosome 22 with reported cytogenetic location 22ql3-qter. The gene resides in genomic locus NT_011520 (NCBI Genome Annotation). The lysosomal enzyme alpha-N-acetylgalactosaminidase can cleave alpha-N-acetylgalactosaminyl moieties from glycoconjugates. Mutations in NAGA have been implicated as the cause of Schindler disease types I and II (type II also known as Kanzaki disease).
[0169] CPS 95 corresponds to GLRX which encodes glutaredoxin (thioltransferase).
This gene has LocusID: 2745, and is located on chromosome 5 with reported cytogenetic location 5ql4. The gene resides in genomic locus NT_023148 (NCBI Genome Annotation). Glutaredoxin can function as a glutathione-dependent hydrogen donor for ribonucleotide reductase.
[0170] Fragments of CPS 95 also align with AF358259, a chromosomal region near
GABRA6 and a chromosomal region near GLRXP with 86-90% sequence identity. AF358259 encodes glutaredoxin pseudogene 2, and has LocusID: 171418 with reported cytogenetic location 14q32.13-q32.2. GABRA6 encodes gamma-aminobutyric acid (GABA) A receptor, alpha 6. It has LocusID: 2559, and is located at chromosome 5q34. GLRXP encodes glutaredoxin (thioltransferase) pseudogene, and is located at chromosome 20ql l.2 with LocusID: 170522. In addition, nucleotides 1 to 29 of CPS 95 has 100% sequence identity with LOC257079. LOC257079 encodes a protein similar to glutaredoxin (Thioltransferase) (TTase), and is located on chromosome 5.
[0171] CPS 96 corresponds to UNK .W28281 (GABARAPL1) which encodes
GABA(A) receptor-associated protein like 1. This gene has LocusID: 23710, and is located on chromosome 12 with reported cytogenetic location 12pl3.1. The gene resides in genomic locus NT_035207 (NCBI Genome Annotation).
[0172] SEQ ID NO: 360, which can be used as a probe sequence for detecting the expression level of UNK .W28281, also aligns with GABARAPL3 with about 95% sequence identity. GABARAPL3 encodes GABA(A) receptors associated protein like 3. It has LocusID: 23766, and is located on chromosome 15 with reported cytogenetic location 15q25.1.
[0173] CPS 97 corresponds to UNK_AL096740 (UBE3B) which encodes ubiquitin protein ligase. This gene has LocusID: 89910, and is located on chromosome 12 with reported cytogenetic location 12q24.11. The gene resides in genomic locus NT_009770 (NCBI Genome Annotation).
[0174] CPS 98 corresponds to TBXASl wliich encodes thromboxane A synthase 1
(platelet, cytochrome P450, subfamily V). This gene has LocusID: 6916, and is located on chromosome 7 with reported cytogenetic location 7q34-q35. The gene resides in genomic locus NT_007914 (NCBI Genome Annotation). The gene product is a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. TBXASl gene product is considered a member of the cytochrome P450 superfamily on the basis of sequence similarity rather than functional similarity. It is an endoplasmic reticulum membrane protein, and can catalyze the conversion of prostglandin H2 to thromboxane A2, a potent vasoconstrictor and inducer of platelet aggregation. TBXASl gene product may play a role in several pathophysiological processes including hemostasis, cardiovascular disease, and stroke. The gene expresses at least two transcript variants.
[0175] CPS 99 corresponds to DPYD which encodes dihydropyrimidine dehydrogenase. This gene has LocusID: 1806, and is located on chromosome 1 with reported cytogenetic location lp22. The gene resides in genomic locus NT_034389 (NCBI Genome Annotation). Dihydropyrimidine dehydrogenase is a pyrimidine catabolic enzyme which is involved in the initial and rate-limiting step in the pathway of uracil and thymidine catabolism and also in the pathway leading to the formation of beta-alanine. The DPYD gene is a large gene of approximately 150 kb consisting of at least 23 exons encoding a protein of approximately 111-kDa. Genetic deficiency of DPYD enzyme results in an error in pyrimidine metabolism associated with thymine-uraciluria and an increased risk of toxicity in cancer patients receiving 5-flourouracil chemotherapy.
[0176] Affymetrix annotation suggests that CPS 100 corresponds to ECGF1 which encodes endothelial cell growth factor 1 (platelet-derived). Nucleotides 4 to 120 of M63193 (SEQ ID NO: 94) align with ECGF1. This gene has LocusID: 1890, and is located on chromosome 22 with reported cytogenetic location 22ql3.33. The gene resides in genomic locus NT_011526 (NCBI Genome Annotation). ,
[0177] CPS 100 aligns with SC02 with at least 98% sequence identity. SC02 encodes SCO cytochrome oxidase deficient homolog 2 (yeast). It has LocusID: 9997, and is located on chromosome 22 with reported cytogenetic location 22ql3.33. The gene resides in genomic locus NT_011526. Mammalian cytochrome c oxidase (COX) catalyzes the transfer of reducing equivalents from cytochrome c to molecular oxygen and pumps protons across the inner mitochondrial membrane. In yeast, two related COX assembly genes, yeast SCOl and SC02 (synthesis of cytochrome c oxidase), enable subunits 1 and 2 to be incorporated into the holoprotein. SC02 is the human homolog of the yeast SC02 gene.
[0178] Affymetrix annotation suggests that CPS 101 corresponds to TIMP1 which encodes tissue inhibitor of metalloproteinase 1 (erythroid potentiating activity, collagenase inhibitor). The gene has LocusID: 7076, and is located at chromosome Xpl 1.3-pl 1.23. [0179] Blast search of the Entrez human genome sequence database shows that CPS
101 aligns with an intron sequence of SYN1 with at least 97% sequence identity. SYN1 encodes synapsin I, and has LocusID: 6853 with reported cytogenetic location Xpl 1.23. The gene resides in genomic locus NT_011568. CPS 101 matches with the non-protein- coding strand of SYN1.
[0180] CPS 102 corresponds to GSN which encodes gelsolin (amyloidosis, Finnish type). This gene has LocusID: 2934, and is located on chromosome 9 with reported cytogenetic location 9q33. The gene resides in genomic locus NT_017568 (NCBI Genome Annotation). Gelsolin is a calcium-dependent protein which may function to sever and cap actin filaments.
[0181] CPS 103 corresponds to SECTM1 which encodes secreted and transmembrane 1. This gene has LocusID: 6398, and is located on chromosome 17 with reported cytogenetic location 17q25. The gene resides in genomic locus NT_025911 (NCBI Genome Annotation). The gene product is a transmembrane and secreted protein with characteristics of a type la transmembrane protein. It is found in a perinuclear Golgi-like pattern and thought to be involved in hematopoietic or immune system processes. The gene product may have a role in hematopoiesis or immune processes.
[0182] CPS 104 corresponds to OLR1 which encodes oxidised low density lipoprotein (lectin-like) receptor 1. This gene has LocusID: 4973, and is located on chromosome 12 with reported cytogenetic location 12pl3.2-pl2.3. The gene resides in genomic locus NT_035207 (NCBI Genome Annotation). Lectin-like oxidized low density lipoprotein receptor is a member of the C-type lectin receptor family, and may be involved in degradation of oxidized LDL by vascular endothelial cells.
[0183] CPS 105 corresponds to D6S49E (LST1) which encodes leukocyte specific transcript 1. This gene has LocusID: 7940, and is located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation). The gene product is expressed in leukocytes and induced by IFN- gamma. It possibly functions in the immune response of monocytes and T cells. [0184] CPS 106 corresponds to JUNB which encodes jun B proto-oncogene. This gene has LocusID: 3726, and is located on chromosome 19 with reported cytogenetic location 19pl3.2. The gene resides in genomic locus NT_011176 (NCBI Genome Annotation). The gene product may participate in AP-1 transcriptional activation. [0185] CPS 107 corresponds to PFC which encodes properdin P factor, complement. This gene has LocusID: 5199, and is located on chromosome X with reported cytogenetic location Xpl 1.3-pl 1.23. The gene resides in genomic locus NT_011568 (NCBI Genome Annotation). The gene product contains a related type-I repeat sequence, and may play a role in complement-mediated clearance.
[0186] CPS 108 corresponds to POLR2E which encodes polymerase (RNA) II
(DNA directed) polypeptide E (25kD). This gene has LocusID: 5434, and is located on chromosome 19 with reported cytogenetic location 19pl3.3. The gene resides in genomic locus NT_011277 (NCBI Genome Annotation). This gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. The encoded subunit is shared by the other two DNA-directed RNA polymerases and is present in two-fold molar excess over the other polymerase subunits. An interaction between this subunit and a hepatitis virus transactivating protein has been demonstrated, suggesting that interaction between transcriptional activators and the polymerase can occur through this subunit. A pseudogene is located on chromosome 11. [0187] CPS 109 corresponds to THBS1 which encodes thrombospondin 1. This gene has LocusID: 7057, and is located on chromosome 15 with reported cytogenetic location 15ql5. The gene resides in genomic locus NT_030828 (NCBI Genome Annotation). Thrombospondin- 1 is a member of a family of adhesive molecules. It has a role in blood clotting and in angiogenesis.
[0188] CPS 110 corresponds to HK2 which encodes hexokinase 2. This gene has
LocusID: 3099, and is located on chromosome 2 with reported cytogenetic location 2pl3. The gene resides in genomic locus NT_022184 (NCBI Genome Annotation). Hexokinases phosphorylate glucose to produce glucose-6-phosphate, thus committing glucose to the glycolytic pathway. HK2 gene encodes hexokinase 2, the predominant form found in skeletal muscle. The gene product localizes to the outer membrane of mitochondria. Expression of this gene is insulin-responsive, and studies in rat suggest that it is involved in the increased rate of glycolysis seen in rapidly growing cancer cells.
[0189] CPS 110 also aligns with a chromosomal region near LOG 139132 with about
95% sequence identity. Both the chromosomal region and LOCI 39132 reside in genomic locus NT_011597. LOG 139132 encodes a protein similar to WW domain binding protein
I I (SH3 domain-binding protein SNP70) (Npw38-binding protein NpwBP), and is located at Xql3.2.
[0190] Affymetrix annotation suggests that CPS 111 corresponds to INSIG1 which encodes insulin induced gene 1. The gene has LocusID: 3638 and is located at chromosome
7q36.
[0191] Blast search of the Entrez human genome sequence database shows that CPS
II I has about 80-85% sequence identity with a chromosomal region near LOCI 31742. This chromosomal region and LOCI 31742 reside in genomic locus NT_022554. LOCI 31742 has reported cytogenetic location 3p 12.1 -pi 1.2.
[0192] CPS 112 corresponds to HCK which encodes hemopoietic cell kinase. This gene has LocusID: 3055, and is located on chromosome 20 with reported cytogenetic location 20ql l-ql2. The gene resides in genomic locus NT_028392 (NCBI Genome Annotation). The gene product can function as a non-receptor protein tyrosine kinase. [0193] Affymetrix annotation suggests that CPS 113 corresponds to HP10390, alos known as TMEM4 which encodes transmembrane protein 4. The gene has LocusID: 10330, and is located at chromosome 12ql5. [0194] Blast search of the Entrez human genome sequence database shows that CPS
113 aligns with two regions with 100% sequence identity. The first region is located in an intron of TIMELESS. The second region is located 5' to the polypeptide-coding sequence of TIMELESS. TIMELESS encodes timeless homolog (Drosophila). It has LocusID: 8914, and is located on chromosome 12 with reported cytogenetic location 12ql2-ql3. The gene resides in genomic locus NT_009458 (NCBI Genome Annotation). The gene product may be involved in circadian oscillation autoregulation.
[0195] CPS 114 corresponds to UNK U51712 (LAGY) which encodes lung cancer- associated Y protein. This gene has LocusID: 84525, and is located on chromosome 4 with reported cytogenetic location 4ql l-ql2. The gene resides in genomic locus NT_022853 (NCBI Genome Annotation). Multiple alternatively spliced transcript variants have been described for this gene.
[0196] CPS 115 corresponds to which encodes KIAA0217 wliich encodes
KIAA0217 protein. This gene has LocusID: 23185, and is located on chromosome 10 with reported cytogenetic location 1 Op 15.3. The gene resides in genomic locus NT_024115 (NCBI Genome Annotation).
[0197] CPS 116 corresponds to NCF1 which encodes neutrophil cytosolic factor 1
(47kD, chronic granulomatous disease, autosomal 1). This gene has LocusID: 4687, and is located on chromosome 7 with reported cytogenetic location 7qll.23. The gene resides in genomic locus NT_034886 (NCBI Genome Annotation). NCF1 gene product can produce a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
[0198] CPS 116 also aligns with LOC220830 and LOC256379 with 95-99% sequence identity. Both genes reside in genomic locus NT 307758 on chromosome 7. Both genes encode proteins similar to neutrophil cytosolic factor 1 (47kD, chronic granulomatous disease, autosomal 1).
[0199] CPS 117 corresponds to KIAA1113 (TRIM33) which encodes tripartite motif-containing 33. This gene has LocusID: 51592, and is located on chromosome 1 with reported cytogenetic location lpl3.1. The gene resides in genomic locus NT_019273 (NCBI Genome Annotation). The protein encoded by this gene is thought to be a transcriptional corepressor. The protein is a member of the tripartite motif family. The tripartite motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. At least three alternatively spliced transcript variants for this gene have been described.
[0200] CPS 118 corresponds to DKFZP566H073 which encodes DKFZP566H073 protein. This gene has LocusID: 26001, and is located on chromosome 17 with reported cytogenetic location 17pl3.3. The gene resides in genomic . locus NT_033299 (NCBI
Genome Annotation). The gene product contains a C3HC4 type (RING) zinc finger, and can mediate protein-protein interactions. ,
[0201] CPS 119 corresponds to ATP2B1 which encodes ATPase, Ca++ transporting, plasma membrane 1. This gene has LocusID: 490, and is located on chromosome 12 with reported cytogenetic location 12q21-q23. The gene resides in genomic locus NT_009729 (NCBI Genome Annotation).
[0202] Nucleotides 305 to 338 of CPS 119 align with an intron sequence of
FLJ14075 with 100% sequence identity. FLJ14075 encodes hypothetical protein FLJ14075, and has LocusID: 79954 with reported cytogenetic location 2p25.1.
[0203] CPS 120 corresponds to IFITM2 which encodes interferon induced transmembrane protein 2 (1-8D). This gene has LocusID: 10581, and is located on chromosome 11 with reported cytogenetic location l lpl5.5. The gene resides in genomic locus NT_009407 (NCBI Genome Annotation). The expression of this gene can be induced by interferon.
[0204] CPS 121 corresponds to MNDA which encodes myeloid cell nuclear differentiation antigen. This gene has LocusID: 4332 , and is located on chromosome 1 with reported cytogenetic location lq22. The gene resides in genomic locus NT_004982
(NCBI Genome Annotation). The myeloid cell nuclear differentiation antigen can be detected in nuclei of cells of the granulocyte-monocyte lineage. A 200-amino acid region of the protein is similar to a region in the proteins encoded by a family of interferon-inducible mouse genes, designated Ifi-201, Ifi-202, and Ifi-203. The MNDA mRNA, which contains an interferon-stimulated response element in the 5-prime untranslated region, can be upregulated in human monocytes exposed to interferon alpha. MNDA is located within
2,200 kb of FCER1A, APCS, CRP, and SPTA1. In its pattern of expression or regulation,
MNDA resembles IFI16, suggesting that these genes participate in blood cell-specific responses to interferons.
[0205] CPS 122 corresponds to BTN3A2 which encodes butyrophilin, subfamily 3, member A2. This gene has LocusID: 11118, and is located on chromosome 6 with reported cytogenetic location 6p22.1. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation).
[0206] Nucleotides 213 to 250 of CPS 122 align with BTN3A3 and BTN3A1 with
94% and 97% sequence identity, respectively. BTN3A3 encodes butyrophilin, subfamily 3, member A3, and has LocusID: 10384. BTN3A1 encodes butyrophilin, subfamily 3, member Al, and has LocusID: 11119. Both genes are located at chromosome 6p22.1. [0207] CPS 123 corresponds to KIAA0776 which encodes KIAA0776 protein. This gene has LocusID: 23376, and is located on chromosome 6 with reported cytogenetic location 6ql6.3. The gene resides in genomic locus NT_019424 (NCBI Genome Annotation).
[0208] CPS 124 corresponds to D6S2245E (HSD17B8) which encodes hydroxysteroid (17-beta) dehydrogenase 8. This gene has LocusID: 7923, and is located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation). The protein encoded by this gene is similar to mouse Ke6 and is a member of the short-chain dehydrogenase superfamily. An alternatively spliced transcript of this gene has been detected.
[0209] CPS 125 corresponds to KIAA0628. This gene has LocusID: 9831, and is located on chromosome 8 with reported cytogenetic location 8q24.3. The gene resides in genomic locus NT_023684 (NCBI Genome Annotation). CPS 125 is in the 3' UTR of the gene.
[0210] CPS 126 corresponds to SELL which encodes selectin L (lymphocyte adhesion molecule 1). This gene has LocusID: 6402, and is located on chromosome 1 with reported cytogenetic location Iq23-q25. The gene resides in genomic locus NT_O34405 (NCBI Genome Annotation). Selectin L is a cell surface component that is a member of a family of adhesion homing receptors which are involved in leukocyte-endothelial cell interactions. Selectin L is composed of multiple domains including one domain homologous to lectins, one to epidermal growth factor, and two to the consensus repeat units found in C3/C4 binding proteins. The protein can attach lymphocytes to lymph node high endothelial venules.
[0211] CPS 127 corresponds to MPN17 which encodes MpN17 transgene, murine homolog, glomerulosclerosis. This gene has LocusID: 4358, and is located on chromosome 2 with reported cytogenetic location 2p23-p21. The gene resides in genomic locus NT_005204 (NCBI Genome Annotation). The gene product is similar to murine Mpvl7. It is a predicted membrane protein and may be associated with nephrotic syndrome. [0212] CPS 128 corresponds to FABP5 which encodes fatty acid binding protein 5
(psoriasis-associated). This gene has LocusID: 2171, and is located on chromosome 8 with reported cytogenetic location 8q21.13. The gene resides in genomic locus NT_007972 (NCBI Genome Annotation). The gene product can be found in epidermal cells, and was identified as being upregulated in psoriasis tissue. Fatty acid binding proteins (FABPs) are a family of small, conserved, cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. It is thought that FABPs roles include fatty acid uptake, transport, and metabolism.
[0213] Nucleotides 1 to 260 of CPS 128 have 100% sequence identity with an intron sequence of STX3A. STX3A encodes syntaxin 3A, and has LocusID: 6809 with reported cytogenetic location l lql2.3. STX3A resides in genomic locus NT_033903. The alignment between CPS 128 and STX3A is in the non-protein-coding of the gene. Nucleotides 1 to 260 of CPS 128, or fragments thereof, also align to various regions on chromosomes 7, 13 and 15 with 95-97% sequence identity. In addition, nucleotides 1-260 of CPS 128, or fragments thereof, align to various region on chromosomes 2, 4, 13, 15 and 22 with sequence identity 88-93%.
[0214] CPS 129 corresponds to CASP1 which encodes caspase 1, apoptosis-related cysteine protease (interleukin 1, beta, convertase). This gene has LocusID: 834, and is located on chromosome 11 with reported cytogenetic location llq23. The gene resides in genomic locus NT_009151 (NCBI Genome Annotation). The gene product is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes which undergo proteolytic processing at conserved aspartic residues to produce 2 subunits, large and small, that dimerize to form the active enzyme. The CASP1 gene product can proteolytically cleave and activate the inactive precursor of interlukin-1, a cytokine involved in the processes such as inflammation, septic shock, and wound healing. CASP1 gene has been shown to induce cell apoptosis and may function in various developmental stages. It may have a role in the pathogenesis of Huntington disease. Alternative splicing of this gene results in at least five transcript variants encoding distinct isoforms. [0215] Nucleotides 1 to 487 of CPS 129, or fragments thereof, aligns with various chromosome 11 regions with sequence identity 86-95%. These regions include LOC120332 and LOC160131, both of which have reported cytogenetic location llq22.3 and encode proteins similar to caspase 1, isoform beta precursor {interleukin 1-beta convertase) (interleukin 1-B converting enzyme) (ILlB-convertase). [0216] CPS 130 corresponds to PTPN6 which encodes protein tyrosine phosphatase, non-receptor type 6. This gene has LocusID: 5777,, and is located on chromosome 12 with reported cytogenetic location 12p 13. The gene resides in genomic locus NT_035206 (NCBI Genome Annotation). The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. N-terminal part of PTPN6 gene product contains two tandem Src homolog (SH2) domains, which act as protein phospho-tyrosine binding domains and mediate the interaction of the gene product with its substrates. PTPN6 gene product can be expressed in hematopoietic cells, and may function as a regulator of multiple signaling pathways in hematopoietic cells. The gene product has been shown to interact with, and dephosphorylate a wide spectrum of phospho-proteins involved in hematopoietic cell signaling. At least three alternatively spliced variants of this gene, which encode distinct isoforms, have been reported.
[0217] CPS 130 also aligns with a chromosomal region near REA with at least 98% sequence identity. REA encodes repressor of estrogen receptor activity. It has LocusID: 11331 and reported cytogenetic location 12pl3.
[0218] CPS 131 corresponds to SKAP-HOM (SCAP2) which encodes src family associated phosphoprotein 2. This gene has LocusID: 8935, and is located on chromosome 7 with reported cytogenetic location 7p21-pl5. The gene resides in genomic locus NT_007819 (NCBI Genome Annotation). The protein encoded by this gene belongs to the src family kinases. The encoded protein is similar to the src family associated phosphoprotein 1 and may function as an adaptor protein. The encoded protein has coiled- coil and SH3 domains, and is thought to play a role in the src signaling pathway in various cells. In a mouse study the SCAP2 gene is implicated in the processes of myeloid differentiation and growth arrest.
[0219] CPS 133 corresponds to GR02 which encodes GR02 oncogene. This gene has LocusID: 2920, and is located on chromosome 4 with reported cytogenetic location 4q21. The gene resides in genomic locus NT_006216 (NCBI Genome Annotation). GR02 is similar to GROl. The gene product is a chemotactic agent for polymorphonuclear leukocytes.
[0220] CPS 133 also aligns with GROl with 85% sequence identity. GROl encodes GROl oncogene (melanoma growth stimulating activity, alpha), and has LocusID:
2919 with reported cytogenetic location 4q21. The gene resides in genomic locus
NT_006216.
[0221] CPS 134 corresponds to RRAS which encodes related RAS viral (r-ras) oncogene homolog. This gene has LocusID: 6237, and is located on chromosome 19 with reported cytogenetic location 19ql3.3-qter. The gene resides in genomic locus NT_011109
(NCBI Genome Annotation). The gene product is a member of the ras family of GTP binding proteins.
[0222] CPS 135 corresponds to KIAA0022 which encodes KIAA0022 gene product.
This gene has LocusID: 9936, and is located on chromosome 2 with reported cytogenetic location 2q24.2. The gene resides in genomic locus NT_005151 (NCBI Genome
Annotation).
[0223] CPS 136 corresponds to FLU which encodes flightless I homolog
(Drosophila). This gene has LocusID: 2314, and is located on chromosome 17 with reported cytogenetic location 17pl l.2. The gene resides in genomic locus NT_030843
(NCBI Genome Annotation). The gene product is a homolog of Drosophila flightless I. It has a gelsolin-like actin-binding domain and a leucine-rich interaction domain.
[0224] CPS 137 corresponds to PRFl which encodes perform 1 (pore forming protein). This gene has LocusID: 5551, and is located on chromosome 10 with reported cytogenetic location 10q22. The gene resides in genomic locus NT_024037 (NCBI
Genome Annotation). Perform is a cytolytic, channel-forming protein which plays a role in clearing virally infected host cells and tumor cells.
[0225] CPS 138 corresponds to KIAA0102 which encodes KIAA0102 gene product.
This gene has LocusID: 9789, and is located on chromosome 11 with reported cytogenetic location l lql3.3. The gene resides in genomic locus NT_033927 (NCBI Genome
Annotation).
[0226] CPS 138 also aligns with a chromosomal region near PTAFR with 99% sequence identity. PTAFR encodes platelet-activating factor receptor, and has LocusID: 5724 and reported cytogenetic location Ip35-p34.3. It resides in genomic locus NT_028053. Platelet-activating factor receptor is a G protein-coupled receptor. [0227] In addition, CPS 138 has 86-94% sequence identity with various regions in chromosomes 5, 10, 11 and 15. For instance, CPS 138 aligns with a chromosome 15 region with 94% sequence identity. The chromosome 15 region includes LOC255320 which encodes a protein similar to microsomal signal peptidase 25 kDa subunit (SPase 25 kDa subunit) (SPC25). ,
[0228] CPS 139 corresponds to UNK AA176780 (HSA249128) which encodes
DIPB protein. This gene has LocusID: 54765, and is located on chromosome 11 with reported cytogenetic location lip 11.2. The gene resides in genomic locus NT_009237 (NCBI Genome Annotation). The gene product is member of the B-box zinc finger family, and contains a region of low similarity to a region of murine Mid2. CPS 139 aligns with the region 3' to the protein-coding strand of UNK_AA176780.
[0229] CPS 140 corresponds to PYGL which encodes phosphorylase, glycogen; liver (Hers disease, glycogen storage disease type VI). This gene has LocusID: 5836, and is located on chromosome 14 with reported cytogenetic location 14q21-q22. The gene resides in genomic locus NT_025892 (NCBI Genome Annotation).
[0230] CPS 141 corresponds to DBP which encodes D site of albumin promoter
(albumin D-box) binding protein. This gene has LocusID: 1628, and is located on chromosome 19 with reported cytogenetic location 19ql3.3. The gene resides in genomic locus NT_011109 (NCBI Genome Annotation). Albumin D-site-binding protein is a transcription factor which may play a role in the diurnal regulation of liver-specific genes. It is a member of the PAR (proline and acidic amino acid-rich) b/ZIP family. [0231] CPS 143 corresponds to RANBP2L1 which encodes RAN binding protein 2- like 1. This gene has LocusID: 84220, and is located on chromosome 2 with reported cytogenetic location 2ql2.3. The gene resides in genomic locus NT_022135 (NCBI Genome Annotation). RAN is a small GTP-binding protein of the RAS superfamily that is associated with the nuclear membrane and is thought to control a variety of cellular functions through its interactions with other proteins. RANBP2L1 gene shares sequence similarity with RANBP2, a large RAN-binding protein localized at the cytoplasmic side of the nuclear pore complex. It is believed that this RANBP2 gene family member arose from a duplication event 3 Mb distal to RANBP2. Alternative splicing has been observed for this locus and two variants are described. Additional splicing is suggested. [0232] CPS 143 also aligns with LOC220692 with 100% sequence identity.
LOC220692 encodes protein similar to RAN-binding protein 2-like 1, isoform 1 (sperm membrane protein BS-63). The gene is located at chromosome 2ql2.3 and resides in genomic locus NT_022135 (NCBI Genome Annotation).
[0233] In addition, CPS 143 has 94-96% sequence identity with KIAA0336, a chromosomal region near LOC256197, and an intron of LOC150821. KIAA0336 encodes
KIAA0336 gene product, and has LocusID: 9648 and reported cytogenetic location 2ql2.2.
LOC256197 encodes a protein similar to ribosomal protein L22, and is located on chromosome 2. LOCI 50821 encodes a protein similar to KIAA0336 gene product, and is located at chromosome 2pl 1.1.
[0234] CPS 144 corresponds to TNFSF10 which encodes tumor necrosis factor
(ligand) superfamily, member 10. This gene has LocusID: 8743, and is located on chromosome 3 with reported cytogenetic location 3q26. The gene resides in genomic locus
NT_025667 (NCBI Genome Annotation). Tumor necrosis factor (TNF) family cytokines function as mediators. of immune regulation and the inflammatory response. TNFSF10 gene product is a Type II glycoprotein of the tumor necrosis factor ligand superfamily, and can mediate cell death.
[0235] CPS 145 corresponds to KIAA0854 which encodes KIAA0854 protein. This gene has LocusID: 22882, and is located on chromosome 8 with reported cytogenetic location 8q24.13. The gene resides in genomic locus NT_023663 (NCBI Genome
Annotation).
[0236] CPS 146 corresponds to SDS which encodes serine dehydratase. This gene has LocusID: 10993, and is located on chromosome 12 with reported cytogenetic location
12q24.12. The gene resides in genomic locus NT_009601 (NCBI Genome Annotation).
Serine dehydratase catalyzes the PLP-dependent alpha,beta-elimination of L-serine to pyruvate and ammonia. It is one of three enzymes that are regarded as metabolic exits of the serine-glycine pool. Serine dehydratase can be found in the liver.
[0237] CPS 147 corresponds to KIAA0576 (COASTER) which encodes coactivator for steroid receptors. This gene has LocusID: 26036, and is located on chromosome 6 with reported cytogenetic location όpl l.l. The gene resides in genomic locus NT_007592
(NCBI Genome Annotation). [0238] Affymetrix annotation suggests that CPS 148 corresponds to FCGR3B which encodes Fc fragment of IgG, low affinity Illb, receptor for (GDI 6). The gene has LocusID: 2215, and is located at chromosome lq23.
[0239] Blast search of the Entrez human genome sequence database shows that CPS
148 aligns with FCGR3A with at least 97% sequence identity. FCGR3A encodes Fc fragment of IgG, low affinity Ilia, receptor for (CD 16). The gene has LocusID: 2214, and is located on chromosome 1 with reported cytogenetic location lq23. The gene resides in genomic locus NT_004668 (NCBI Genome Annotation). The gene product is a Type III Fc gamma receptor and a member of the immunoglobulin superfamily. It can associate with zeta chain of the T-cell receptor complex (CD3Z).
[0240] CPS 149 corresponds to FCER1A which encodes Fc fragment of IgE, high affinity I, receptor for; alpha polypeptide. The gene product is the alpha subunit of the high affinity IgE receptor, and may be involved in triggering allergic responses. This gene has LocusID: 2205, and is located on chromosome 1 with reported cytogenetic location lq23. The gene resides in genomic locus NT_004982 (NCBI Genome Annotation). The IgE receptor contains 3 subunits: alpha, beta (MIM 147138 ), and gamma (MIM 147139 ). The alpha subunit can be glycosylated.
[0241] CPS 150 corresponds to CD44 which encodes CD44 antigen (homing function and Indian blood group system). This gene has LocusID: 960, and is located on chromosome 11 with reported cytogenetic location l ip 13. The gene resides in genomic locus NT_009237 (NCBI Genome Annotation).
[0242] CPS 151 corresponds to ID1 which encodes inhibitor of DNA binding 1, dominant negative helix-loop-helix protein. This gene has LocusID: 3397, and is located on chromosome 20 with reported cytogenetic location 20ql l. The gene resides in genomic locus NT_028392 (NCBI Genome Annotation). The gene product is a member of the Id helix-loop-helix family of proteins, and may negatively regulate cell differentiation. [0243] Blast search of the Entrez human genome sequence database shows that CPS
152 overlaps KIAA0963. CPS 152 aligns with the non-protein-coding strand of KIAA0963. KIAA0963 encodes KIAA0963 protein, and has LocusID: 22904 with reported cytogenetic location 19pl3.3. KIAA0963 resides in genomic locus NT_011277 (NCBI Genome Annotation).
[0244] CPS 153 corresponds to ADTB2 (AP2B1) which encodes adaptor-related protein complex 2, beta 1 subunit. This gene has LocusID: 163, and is located on chromosome 17 with reported cytogenetic location 17ql l.2-ql2. The gene resides in genomic locus NT_010799 (NCBI Genome Annotation). The beta adaptin subunit is part of the clathrin coat assembly complex which links clathrin to receptors in coated pits and vesicles. These vesicles are involved in endocytosis and Golgi processing. The beta 1 subunit is one of the assembly proteins which binds to clathrin and initiates coat formation. [0245] CPS 154 corresponds to MADH3 which encodes MAD, mothers against decapentaplegic homolog 3 (Drosophila). This gene has LocusID: 4088, and is located on chromosome 15 with reported cytogenetic location 15q21-q22. The gene resides in genomic locus NT_010265 (NCBI Genome Annotation). The gene product is similar to murine Madh3. It is a member of the Smad family of proteins, and may affect transcription in response to TGF-beta signaling pathways.
[0246] CPS 155 corresponds to ARPC1B which encodes actin related protein 2/3 complex, subunit IB (41 kD). This gene has LocusID: 10095, and is located on chromosome 7 with reported cytogenetic location 7ql 1.21. The gene resides in genomic locus NT_007933 (NCBI Genome Annotation). The gene product is involved in assembly of the actin cytoskeleton, and may have a role in protrusion of lamellipodia. [0247] CPS 155 also aligns with two chromosome 12 regions with 99-100% sequence identity. These two regions reside in genomic locus NT_035283 (NCBI Genome Annotation). The first region is located near LOG 196489 which encodes a protein similar to FLJ00209 protein. The second region is located near LOCI 44584 which encodes a protein similar to vacuolar protein sorting 35 (yeast); maternal-embryonic 3. Both LOC196489 and LOC144584 have reported cytogenetic location 12ql3.12. [0248] Affymetrix annotation suggests that CPS 156 corresponds to TALDOl which encodes transaldolase 1 and has LocusID: 6888. The gene is located at chromosome Ilpl5.5-pl5.4.
[0249] CPS 157 corresponds to UNK_AL035079 (CAT) which encodes catalase.
This gene has LocusID: 847, and is located on chromosome 11 with reported cytogenetic location l lpl3. The gene resides in genomic locus NT_009237 (NCBI Genome Annotation). Catalase is a tetrameric hemoprotein that can detoxify hydrogen peroxide. [0250] CPS 158 corresponds to STXBP1 which encodes syntaxin binding protein 1.
This gene has LocusID: 6812, and is located on chromosome 9 with reported cytogenetic location 9q34.1. The gene resides in genomic locus NTJ329366 (NCBI Genome Annotation). [0251] CPS 159 corresponds to UNG2 which encodes uracil-DNA glycosylase 2.
This gene has LocusID: 10309, and is located on chromosome 5 with reported cytogenetic location 5pl5.2-pl3.1. The gene resides in genomic locus NT_006431 (NCBI Genome Annotation). Uracil-DNA glycosylase 2 possesses uracil-DNA glycosylase activity. [0252] CPS 160 corresponds to DUSP5 which encodes dual specificity phosphatase
5. This gene has LocusID: 1847, and is located on chromosome 10 with reported cytogenetic location 10q25. The gene resides in genomic locus NT_030059 (NCBI Genome Annotation). The protein encoded by DUSP5 gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases can inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They can negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and differentiation. Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. DUSP5 gene product can inactivate ERK1, and is expressed in a variety of tissues including pancreas and brain.
[0253] CPS 161 corresponds to PTPRO which encodes protein tyrosine phosphatase, receptor type, O. This gene has LocusID: 5800, and is located on chromosome 12 with reported cytogenetic location 12pl3.3-pl3.2. The gene resides in genomic locus NT_009714 (NCBI Genome Annotation). The gene product is an integral membrane protein containing a transmembrane domain and an intracellular catalytic domain with a characteristic signature motif. Several alternatively spliced transcript variants, some of which encode different isoforms of the protein, have been described. The gene product also contains fibronectin type Ill-like repeats and putative glycosylation sites in the extracellular domain.
[0254] Affymetrix annotation suggests that CPS 162 corresponds to SLA which encodes Src-like-adaptor and has LocusID: 6503. The gene has reported cytogenetic location at chromosome 8q24.
[0255] Blast search of the Entrez human genome sequence database shows that CPS
162 aligns with an intron sequence of TG with at least 99% sequence identity. TG encodes thyroglobulin, and has LocusID: 7038 with reported cytogenetic location 8q24.2-q24.3. The gene resides in genomic locus NT_008150 (NCBI Genome Annotation). CPS 162 matches with the non-protein-coding strand of TG. Thyroglobulin is a precursor of thyroid hormones.
[0256] CPS 163 corresponds to RFP2 which encodes ret finger protein 2. This gene has LocusID: 10206, and is located on chromosome 13 with reported cytogenetic location
13ql4. The gene resides in genomic locus NT_033922 (NCBI Genome Annotation). The protein encoded by RFP2 gene is a member of the tripartite motif (TRIM) family. The
TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type
2, and a coiled-coil region. The RFP2 gene product localizes to cytoplasmic bodies near the nucleus. The gene is located on chromosome 13 within the minimal deletion region for B- cell chronic lymphocytic leukemia. Alternative splicing of this gene has been described.
[0257] Nucleotides 3 to 255 of CPS 163 have 83% sequence identity with a chromosomal region near LOCI 62162. LOCI 62162 encodes a protein similar to spinocerebellar ataxia type 1, and has reported cytogenetic location 16q23.1.
[0258] CPS 164 corresponds to ADFP which encodes adipose differentiation-related protein (adipophilin). This gene has LocusID: 123, and is located on chromosome 9 with reported cytogenetic location 9p21.2. The gene resides in genomic locus NT_023974
(NCBI Genome Annotation). Adipocyte differentiation-related protein is associated with the globule surface membrane material. The protein is a major constituent of the globule surface. Increase in mRNA levels is one of the earliest indications of adipocyte differentiation. The product is also a component of milk lipid globules.
[0259] CPS 164 aligns with a chromosome 1 region near LOC254424 with 94% sequence identity. LOC254424 encodes a protein similar to adipophilin (adipose differentiation-related protein) (ADRP).
[0260] CPS 165 corresponds to ADTD (AP3D1) which encodes adaptor-related protein complex 3, delta 1 subunit. This gene has LocusID: 8943, and is located on chromosome 19 with reported cytogenetic location 19pl3.3. The gene resides in genomic locus NT_011268 (NCBI Genome Annotation). Delta-adaptin is a component of the AP-3 complex which is involved in intracellular transport.
[0261] Affymetrix annotation suggests that CPS 166 corresponds to ADTG which is also known as AP1G1. The gene encodes adaptor-related protein complex 1, gamma 1 subunit. The gene has LocusID: 164, and is located at chromosome 16q23.
[0262] Blast search of the Entrez human genome sequence database shows that CPS
166 aligns to LOC255980 with at least 99% sequence identity. LOC255980 encodes a protein similar to hypothetical protein FLJ20151, and is located on chromosome 15. LOC255980 resides in genomic locus NT_010265. It overlaps FLJ20151 which has LocusID: 54837 and reported cytogenetic location 15q21.3.
[0263] CPS 168 corresponds to UNK_X87344 (PSMB8) which encodes proteasome
(prosome, macropain) subunit, beta type, 8 (large multifunctional protease 7). This gene has LocusID: 5696, and is located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation). Beta subunit 8 of the proteasome (prosome macropain) 'can replace beta subunit PSMB5 when cells are stimulated by interferon gamma, thereby altering proteasome specificity. [0264] CPS 170 corresponds to CAPG which encodes capping protein (actin filament), gelsolin-like. This gene has LocusID: 822, and is located on chromosome 2 with reported cytogenetic location 2cen-q24. The gene resides in genomic locus NT_015805 (NCBI Genome Annotation). The gene product is a macrophage capping protein. It can reversibly block the barbed ends. The gene product is a member of the gelsolin/villin protein family.
[0265] CPS 171 corresponds to ARL7 which encodes ADP-ribosylation factor-like
7. This gene has LocusID: 10123, and is located on chromosome 2 with reported cytogenetic location 2q37.2. The gene resides in genomic locus NT_005414 (NCBI Genome Annotation). ADP-ribosylation factor-like 7 (ARL7) is a member of the ADP- ribosylation factor family of GTP-binding proteins. ARL7 is similar to ARL4 and ARL6, and each has a nuclear localization signal and a high guanine nucleotide exchange rate. [0266] CPS 172 corresponds to RCNl which encodes recoverin. This gene has
LocusID: 5957, and is located on chromosome 17 with reported cytogenetic location 17p 13.1. The gene resides in genomic locus ΝT_010718 (NCBI Genome Annotation). Recoverin is a calcium-binding protein which can activate guanylate cyclase activity. [0267] CPS 173 corresponds to GR03 which encodes GR03 oncogene. This gene has LocusID: 2921, and is located on chromosome 4 with reported cytogenetic location 4q21. The gene resides in genomic locus NT_006216 (NCBI Genome Annotation). GR03 is similar to human GROl, and may encode a mitogenic factor.
[0268] CPS 175 corresponds to UNK_AF070606 (ATP2B1) which encodes
ATPase, Ca++ transporting, plasma membrane 1. This gene has LocusID: 490, and is located on chromosome 12 with reported cytogenetic location 12q21-q23. The gene resides in genomic locus NT_009729 (NCBI Genome Annotation). CPS 175 aligns with the 3' UTR of the protein-coding strand of the gene.
[0269] CPS 176 corresponds to AOAH which encodes acyloxyacyl hydrolase
(neutrophil). This gene has LocusID: 313, and is located on chromosome 7 with reported cytogenetic location 7pl4-pl2. The gene resides in genomic locus NT_007819 (NCBI Genome Annotation). Acyloxyacyl hydrolase is a 2-subunit lipase which can hydrolyze the secondary (acyloxyacyl-linked) fatty acyl chains from the lipid A region of bacterial endotoxins. Acyloxyacyl hydrolase may modulate' host inflammatory responses to gram- negative bacterial invasion. The 2 subunits are encoded by a single mRNA. [0270] CPS 177 corresponds to UNK_AJ224442 (WBSCR22) which encodes
Williams Beuren syndrome chromosome region 22. This gene has LocusID: 114049, and is located on chromosome 7. The gene resides in genomic locus NT_007758 (NCBI Genome Annotation).
[0271] CPS 178 corresponds to MSR1 which encodes macrophage scavenger receptor 1. This gene has LocusID: 4481, and is located on chromosome 8 with reported cytogenetic location 8p22. The gene resides in genomic locus NT_015280 (NCBI Genome Annotation). This gene encodes the class A macrophage scavenger receptors, which include at least three different types (1, 2, 3) generated by alternative splicing of this gene. These receptors or isoforms are macrophage-specific trimeric integral membrane glycoproteins and have been implicated in many macrophage-associated physiological and pathological processes including atherosclerosis, Alzheimer's disease, and host defense. The isoforms type 1 and type 2 are functional receptors and are able to mediate the endocytosis of modified low density lipoproteins (LDLs). The isoform type 3 may have an altered intracellular processing and can be trapped within the endoplasmic reticulum. The isoform type 3 can inhibit the function of isoforms type 1 and type 2 when co-expressed, indicating a dominant negative effect and suggesting a mechanism for regulation of scavenger receptor activity in macrophages.
[0272] CPS 179 corresponds to XAP4 (C20orfl8) which encodes chromosome 20 open reading frame 18. This gene has LocusID: 10616, and is located on chromosome 20 with reported cytogenetic location 20pl3. The gene resides in genomic locus NT_011387 (NCBI Genome Annotation). The gene contains at least 12 exons. Several alternatively spliced transcript variants have been described. The gene product is similar to mouse UIP28/UbcM4 interacting protein at the amino acid level, and contains a C3HC4 type (RING) zinc finger. It may interact with PKC and mediate protein-protein interactions. [0273] CPS 180 corresponds to C1QR1 which encodes complement component 1, q subcomponent, receptor 1. This gene has LocusID: 22918, and is located on chromosome 20 with reported cytogenetic location 20p 11.21. The gene resides in genomic locus NT_011387 (NCBI Genome Annotation). The gene product is a type I membrane protein and can act as a receptor for complement protein Clq, mannose-binding lectin, and pulmonary surfactant protein A. The gene product is a functional receptor involved in ligand-mediated enhancement of phagocytosis.
[0274] CPS 181 corresponds to STAT4 which encodes signal transducer and activator of transcription 4. This gene has LocusID: 6775, and is located on chromosome 2 with reported cytogenetic location 2q32.2-q32.3. The gene resides in genomic locus NT_022197 (NCBI Genome Annotation). The gene product is a member of the STAT family of transcription factors. In response to cytokines and growth factors, STAT family members may be phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. STAT4 gene product may be involved in mediating responses to IL12 in lymphocytes and regulating the differentiation of T helper cells.
[0275] CPS 182 corresponds to UNK_AL049309 (SFRS12) which encodes splicing factor, arginine/serine-rich 12. This gene has LocusID: 140890, and is located on chromosome 5 with reported cytogenetic location 5ql2.3. The gene resides in genomic locus NT_006431 (NCBI Genome Annotation).
[0276] CPS 183 corresponds to RNASE2 which encodes ribonuclease, RNase A family, 2 (liver, eosinophil-derived neurotoxin). This gene has LocusID: 6036, and is located on chromosome 14 with reported cytogenetic location 14q24-q31. The gene resides in genomic locus NT_025892 (NCBI Genome Annotation). The gene product is a member of ribonuclease superfamily, and has neurotoxic and ribonuclease activities. [0277] CPS 183 also aligns with LOCI 22661 and RNASE3 with about 86-93% sequence identity. LOG 122661 encodes a protein similar to nonsecretory ribonuclease precursor (Ribonuclease US) (Eosinophil-derived neurotoxin) (RNase UpI-2) (Ribonuclease 2) (RNase 2). It is located at chromosome 14ql l.l, and resides in genomic locus NT_025892. RNASE3 encodes ribonuclease, RNase A family, 3 (eosinophil cationic protein). RNASE3 has LocusID: 6037 and reported cytogenetic location 14q24-q31. It also resides in genomic locus NT_025892. RNASE3 gene product is a member of the ribonuclease superfamily, and has both neurotoxic and ribonuclease activities. [0278] CPS 184 corresponds to NCF4 which encodes neutrophil cytosolic factor 4
(40kD). This gene has LocusID: 4689, and is located on chromosome 22 with reported cytogenetic location 22ql3.1. The gene resides in genomic locus NT_011520 (NCBI Genome Annotation). The protein encoded by this gene is a cytosolic element of nicotinamide adenine dinucleotide phosphate-oxidase. Upon neutrophil stimulation, the encoded protein and other cytosolic elements are sent to the cell membrane from the cytosol to form a complex which produces phagocytic oxygen radicals. Two motifs found in this protein, an SH3 domain and a PC motif, are significant to protein-protein interactions. Through interaction with the SH3 domain, NCF4 gene product is responsible for the downregulation of NADPH-oxidase. Alternative splicing has been observed in this gene. CPS 184 aligns to the 3' UTR of the gene.
[0279] CPS 185 corresponds to ANXA4 which encodes annexin A4. This gene has
LocusID: 307, and is located on chromosome 2 with reported cytogenetic location 2pl3. The gene resides in genomic locus NT_022184 (NCBI Genome Annotation). Annexin A4 belongs to the annexin family of calcium-dependent phospholipid binding proteins. Several members of the annexin family have been implicated in membrane-related events along exocytotic and endocytotic pathways. Annexin A4 may interact with ATP, have in vitro anticoagulant activity, and inhibit phospholipase A2 activity. Annexin A4 can be detected in epithelial cells.
[0280] CPS 186 corresponds to ME2 which encodes malic enzyme 2, NAD(+)- dependent, mitochondrial. This gene has LocusID: 4200, and is located on chromosome 18 with reported cytogenetic location 18q21. The gene resides in genomic locus NT_033905 (NCBI Genome Annotation). The gene product is a homotetrameric protein which can catalyze the oxidative decarboxylation of malate to pyruvate.
[0281] CPS 183 also has 91% sequence identity with a chromosome 9 region near
LOC 169570. LOC 169570 has reported cytogenetic location 9p 13.1. [0282] CPS 187 corresponds to IFI35 which encodes interferon-induced protein 35.
This gene has LocusID: 3430, and is located on chromosome 17 with reported cytogenetic location 17q21. The gene resides in genomic locus NT_035490 (NCBI Genome Annotation). Interferon-induced protein 35 associates with B-ATF transcription factor on interferon treatment. It contains a leucine-zipper motif. [0283] CPS 188 corresponds to NAMP5 which encodes vesicle-associated membrane protein 5 (myobrevin). This gene has LocusID: 10791, and is located on chromosome 2 with reported cytogenetic location 2pll.2. The gene resides in genomic locus ΝT_015805 (NCBI Genome Annotation). The gene product is a member of the synaptobrevin/VAMP family of proteins.
[0284] CPS 188 also has 87% sequence identity with an intron sequence of NSF.
NSF encodes N-ethylmaleimide-sensitive factor. It has LocusID: 4905 and reported cytogenetic location 17q21.
[0285] CPS 189 corresponds to IMPA2 which encodes inositol(myo)-l(or 4)- monophosphatase 2. This gene has LocusID: 3613, and is located on chromosome 18 with reported cytogenetic location 18pl l.2. The gene resides in genomic locus NT_010859 (NCBI Genome Annotation).
[0286] CPS 190 corresponds to GZMK which encodes granzyme K (serine protease, granzyme 3; tryptase II). This gene has LocusID: 3003, and is located on chromosome 5 with reported cytogenetic location 5ql l-ql2. The gene resides in genomic locus NT_006431 (NCBI Genome Annotation). This gene product is a member of a group of related serine proteases from the cytoplasmic granules of cytotoxic lymphocytes. Cytolytic T lymphocytes (CTL) and natural killer (NK) cells can recognize, bind, and lyse specific target cells. They are thought to protect their host by lysing cells bearing on their surface "nonself ' antigens, usually peptides or proteins resulting from infection by intracellular pathogens.,
[0287] CPS 191 corresponds to BGN which encodes biglycan. This gene has
LocusID: 633, and is located on chromosome X with reported cytogenetic location Xq28. The gene resides in genomic locus NT_025965 (NCBI Genome Annotation). The protein encoded by this gene is a small cellular or pericellular matrix proteoglycan that is related in structure to two other small proteoglycans, decorin and fibromodulin. The encoded protein and decorin are thought to be the result of a gene duplication. Decorin contains one attached glycosaminoglycan chain, while biglycan probably contains two chains. Biglycan is thought to function in connective tissue metabolism by binding to collagen fibrils and transfering growth factor-beta. It may promote neuronal survival. This gene is a candidate gene for the Happle syndrome.
[0288] CPS 192 corresponds to AIF1 which encodes allograft inflammatory factor
1. This gene has LocusID: 199, and is located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI Genome
Annotation). This gene is induced by cytokines and interferon. Its protein product is thought to be involved in negative regulation of growth of vascular smooth muscle cells, which contributes to the anti-inflammatory response to vessel wall trauma. The gene expresses at least three transcripts.
[0289] CPS 193 corresponds to CBP2 (SERPINH2) which encodes serine (or cysteine) proteinase inhibitor, clade H (heat shock protein 47), member 2. This gene has
LocusID: 872, and is located on chromosome 11 with reported cytogenetic location
1 lql3.5. The gene resides in genomic locus NT_033927 (NCBI Genome Annotation). The gene product is also known as Colligin-2, which is a collagen-binding protein that acts as a heat shock protein.
[0290] CPS 193 also has 92% sequence identity with a chromosome 9 region near pshsp47. Pshsp47 encodes serine (or cysteine) proteinase inhibitor, clade H (heat shock protein 47), member 2 pseudogene. It has LocusID: 158172 and reported cytogenetic location 9p 11.2.
[0291] Affymetrix annotation suggests that CPS 195 corresponds to IQGAP1 which encodes IQ motif containing GTPase activating protein 1. The gene has LocusID: 8826 with reported cytogenetic location 15q26.1.
[0292] Blast search of the Entrez human genome sequence database shows that nucleotides 395 to 5984 of L33075 align with IQGAP1 with 98% sequence identity.
IQGAP1 encodes IQ motif containing GTPase activating protein 1. The gene has LocusID:
8826, and is located on chromosome 15 with reported cytogenetic location 15q26.1. The gene resides in genomic locus NT_033276. The gene product contains a GTPase activating domain and multiple calmodulin binding domains, and can bind to actin cytoskeleton and inhibit GTPase activity of ras family of GTP binding proteins Cdc42Hs and rac.
[0293] CPS 196 corresponds to KIAA0823 (PPP1R16B) which encodes protein phosphatase 1, regulatory (inhibitor) subunit 16B. This gene has LocusID: 26051, and is located on chromosome 20 with reported cytogenetic location 20ql 1.23. The gene resides in genomic locus NT_011362 (NCBI Genome Annotation).
[0294] CPS 197 corresponds to UNK AI018098 (MGC 15523) which encodes hypothetical protein MGC 15523. This gene has LocusID: 124565, and is located on chromosome 17 with reported cytogenetic location 17q25.3. The gene resides in genomic locus NT_010661 (NCBI Genome Annotation). [0295] Affymetrix annotation suggests that CPS 198 corresponds to CD 163.
CD163 encodes CD163 antigen, and has LocusID: 9332. The gene is located at chromosome 12pl3.3.
[0296] CPS 199 corresponds to HK3 which encodes hexokinase 3 (white cell). This gene has LocusID: 3101, and is located on chromosome 5 with reported cytogenetic location 5q35.2. The gene resides in genomic locus NT_023132 (NCBI Genome Annotation). Hexokinases phosphorylate glucose to produce glucose-6-phosphate, thus committing glucose to the glycolytic pathway. HK3 gene encodes hexokinase 3 which is similar to hexokinases 1 and 2. Hexokinase 3 can be inhibited by its product glucose-6- phosphate.
[0297] CPS 200 corresponds to FOS which encodes v-fos FBJ murine osteosarcoma viral oncogene homolog. This gene has LocusID: 2353, and is located on chromosome 14 with reported cytogenetic location 14q24.3. The gene resides in genomic locus NT_026437 (NCBI Genome Annotation). The Fos gene family has 4 members: FOS, FOSB, FOSL1, and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation. In some cases, expression of the FOS gene has also been associated with apoptotic cell death. FOS gene product may also be involved in alteration of DNA methylation.
[0298] CPS 201 corresponds to DIFF48 (C6orf32) which encodes chromosome 6 open reading frame 32. This gene has LocusID: 9750, and is located on chromosome 6 with reported cytogenetic location 6p22.3-p21.32. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation). The protein encoded by DIFF48 can stimulate the formation of a non-mitotic multinucleate syncytium from proliferative cytotrophoblasts during trophoblast differentiation. An alternatively spliced transcript variant of this gene has been described.
[0299] CPS 202 corresponds to UNK_L23134 (GZMM) which encodes granzyme
M (lymphocyte met-ase 1). This gene has LocusID: 3004, and is located on chromosome 19 with reported cytogenetic location 19pl3.3. The gene resides in genomic locus NT_011227 (NCBI Genome Annotation). Human natural killer (NK) cells and activated lymphocytes express and store a distinct subset of neutral serine proteases together with proteoglycans and other immune effector molecules in large cytoplasmic granules. These serine proteases are collectively termed granzymes and include at least four distinct gene products: granzyme A, granzyme B, granzyme H, and Met-ase which is also known as granzyme M.
[0300] CPS 203 corresponds to LM02 which encodes LIM domain only 2
(rhombotin-like 1). This gene has LocusID: 4005, and is located on chromosome 11 with reported cytogenetic location 1 lpl3. The gene resides in genomic locus NT_009237 (NCBI Genome Annotation). LM02 encodes a cysteine-rich, two LIM-domain protein that is involved in yolk sac erythropoiesis. The LM02 protein has a role in hematopoietic development. The LM02 transcription start site is located approximately 25 kb downstream from the l ip 13 T-cell translocation cluster (lip 13 ttc), where a number T-cell acute lymphoblastic leukemia-specific translocations occur. The LM02 protein is a member of the rhombotin family.
[0301] CPS 204 corresponds to PDI2 (PADI2) which encodes peptidyl arginine deiminase, type II. This gene has LocusID: 11240, and is located on chromosome 1 with reported cytogenetic location Ip35.2-p35.1. The gene resides in genomic locus NT_034376 (NCBI Genome Annotation). The gene product is similar to rat skeletal muscle peptidyl ! arginine deiminase, type II. It may convert arginine residues within proteins to citrulline residues.
[0302] CPS 205 corresponds to UNK AL031282 (FLJ13052) which encodes NAD kinase. This gene has LocusID: 65220, and is located on chromosome 1 with reported cytogenetic location Ip36.33-p36.21. The gene resides in genomic locus NT_004350 (NCBI Genome Annotation).
[0303] CPS 205 also aligns with a chromosomal region near MMP23B with at least
98% sequence identity. MMP23B encodes matrix metalloproteinase 23B, and has LocusID: 8510 and reported cytogenetic location lp36.3. The gene resides in genomic locus NT_004350. CPS 205 aligns with the non-protein-coding strand of MMP23B. [0304] CPS 206 corresponds to GABRG2 wliich encodes gamma-aminobutyric acid
(GABA) A receptor, gamma 2. This gene has LocusID: 2566, and is located on chromosome 5 with reported cytogenetic location 5q31.1-q33.1. The gene resides in genomic locus NT_030698 (NCBI Genome Annotation). The gamma-aminobutyric acid (GABA) A receptor, gamma 2 is found as an inhibitory neurotransmitter receptor in the brain. [0305] CPS 207 corresponds to KIAA0404 which encodes KIAA0404 protein. This gene has LocusID: 23130, and is located on chromosome 11 with reported cytogenetic location l lql3.1. The gene resides in genomic locus NT_033241 (NCBI Genome Annotation).
[0306] CPS 208 corresponds to UNK U12471 (THBS1) which encodes thrombospondin 1. This gene has LocusID: 7057, and is located on chromosome 15 with reported cytogenetic location 15ql5. The gene resides in genomic locus NT_030828 (NCBI Genome Annotation). Thrombospondin- 1 has a role in blood clotting and in angiogenesis. It is a member of a family of adhesive molecules.
[0307] Affymetrix annotation suggests that CPS 209 corresponds to IGHA1 which encodes immunoglobulin heavy constant alpha 1. The gene has LocusID: 3493 and reported cytogenetic location 14q32.33.
[0308] CPS 210 corresponds to ABCB1 which encodes ATP-binding cassette, subfamily B (MDR/TAP), member 1. This gene has LocusID: 5243, and is located on chromosome 7 with reported cytogenetic location 7q21.1. The gene resides in genomic locus NT_007933 (NCBI Genome Annotation). The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein encoded by ABCB1 gene is a member of the MDR/TAP subfamily. Members of the MDR/TAP subfamily are involved in multidrug resistance. The ABCB gene product is an ATP-dependent drug efflux pump for xenobiotic compounds with broad substrate specificity. It is responsible for decreased drug accumulation in multidrug- resistant cells and often mediates the development of resistance to anticancer drugs. It can also function as a transporter in the blood-brain barrier. The ABCB gene product is also known as P glycoprotein 1.
[0309] CPS 211 corresponds to NRIPl which encodes nuclear receptor interacting protein 1. This gene has LocusID: 8204, and is located on chromosome 21 with reported cytogenetic location 21ql l.2. The gene resides in genomic locus NT_011512 (NCBI Genome Annotation). Nuclear receptor interacting protein 1 is a nuclear protein that can interact with the hormone-dependent activation domain AF2 of nuclear receptors. Also known as RIP 140, this protein modulates transcriptional activity of the estrogen receptor. [0310] CPS 213 corresponds to GLB1 which encodes galactosidase, beta 1. This gene has LocusID: 2720, and is located on chromosome 3 with reported cytogenetic location 3p21.33. The gene resides in genomic locus NT_005580 (NCBI Genome Annotation). The gene product can catalyze cleavage of the terminal galactose. [0311] CPS 214 corresponds to MSC which encodes musculin (activated B-cell factor-1). This gene has LocusID: 9242, and is located on chromosome 8 with reported cytogenetic location 8q21. The gene resides in genomic locus NT_034895 (NCBI Genome Annotation). The gene product contains a bHLH motif, a putative nuclear localization signal, a glycine-rich region, and a stretch of acidic residues. The gene product is capable of binding an E-box element either as a homodimer or as a heterodimer with E2A in vitro, and forms heterodimers with E2A proteins in vivo. It also contains a transcriptional repression domain and is capable of inhibiting the transactivation capability of E47, an E2A protein, in mammalian cells. MSC is thought to be a downstream target of the B-cell receptor signal transduction pathway.
[0312] CPS 216 corresponds to UNK J95006 (MGC14480) which encodes hypothetical protein MGC 14480. This gene has LocusID: 201254, and is located on chromosome 17 with reported cytogenetic location 17q25.3. The gene resides in genomic locus NT_035479 (NCBI Genome Annotation).
[0313] Nucleotides 400 to 561 of CPS 216 also have 84% sequence identity with an intron sequence of EG1. EG1 encodes endothelial-derived gene 1, and has LocusID: 80306 and reported cytogenetic location 4pl6.
[0314] CPS 217 corresponds to NK4 which encodes natural killer cell transcript 4.
This gene has LocusID: 9235, and is located on chromosome 16 with reported cytogenetic location 16pl3.3. The gene resides in genomic locus NT_010552 (NCBI Genome Annotation). The gene product may play a role in cell adhesion. It contains an RGD motif. [0315] CPS 217 also has 96% sequence identity with LOC 124213 which encodes a protein similar to natural killer cell transcript 4. LOC 124213 is located at chromosome 16p 13.11, and also resides in genomic locus NT_010552.
[0316] CPS 218 corresponds to UNK_Z24724 (FLJ20986) which encodes hypothetical protein FLJ20986. This gene has LocusID: 79572, and is located on chromosome 3 with reported cytogenetic location 3q29. The gene resides in genomic locus NT_005535 (NCBI Genome Annotation). The alignment between CPS 218 and UNK_Z24724 is within an intron of GP5 which encodes glycoprotein V (platelet) and has
LocusID: 2814.
[0317] CPS 219 corresponds to GRN which encodes granulin. This gene has
LocusID: 2896, and is located on chromosome 17 with reported cytogenetic location
17q21.32. The gene resides in genomic locus NT_010755 (NCBI Genome Annotation).
Granulin is a putative growth factor. It is cysteine rich and contains multiple granulin repeats. ,
[0318] CPS 220 corresponds to EREG which encodes epiregulin. This gene has
LocusID: 2069, and is located on chromosome 4 with reported cytogenetic location 4ql3.3.
The gene resides in genomic locus NT_006216 (NCBI Genome Annotation). Epiregulin is a member of the epidermal growth factor family. Epiregulin can function as a ligand of
EGFR (epidermal growth factor receptor), as well as a ligand of most members of the
ERBB (v-erb-b2 oncogene homolog) family of tyrosine-kinase receptors. Epiregulin may promote cell proliferation.
[0319] CPS 221 corresponds to KIAA0382 (ARHGEF12) which encodes Rho guanine nucleotide exchange factor (GEF) 12. This gene has LocusID: 23365, and is located on chromosome 11 with reported cytogenetic location 1 lq23.3. The gene resides in genomic locus NTJ333899 (NCBI Genome Annotation). Rho GTPases play a role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. Rho guanine nucleotide exchange factor (GEF) 12 may form a complex with G proteins and stimulate Rho-dependent signals. This protein is observed to form myeloid lymphoid fusion partner in acute myeloid leukemia.
[0320] CPS 222 corresponds to RNASE6 which encodes ribonuclease, RNase A family, k6. This gene has LocusID: 6039, and is located on chromosome 14 with reported cytogenetic location 14ql 1.1. The gene resides in genomic locus NT_025892 (NCBI
Genome Annotation). Ribonuclease k6 may function in host defense. It is a member of eosinophil-derived neurotoxin ribonuclease A superfamily.
[0321] CPS 223 corresponds to KIAA0442 which encodes autism-related protein 1.
This gene has LocusID: 26053, and is located on chromosome 7 with reported cytogenetic location 7pl3. The gene resides in genomic locus NT_007758 (NCBI Genome
Annotation).
[0322] Blast search of the Entrez human genome sequence database shows that nucleotides 35 to 1907of CPS 225 align with various chromosomal regions with 89-90% sequence identity. These regions include LOC255718, LOC140070, and a chromosome 2 region between GPR39 and LOC255521. LOC255718 is located on chromosome 2 and has genomic locus NT_034486 (NCBI Genome Annotation). LOC 140070 encodes a protein similar to Protein CDC27Hs (Cell division cycle protein 27 homolog) (H-NUC). It is located on chromosome Yql l.l and has genomic locus NT_011878. GPR39 and LOC255521 reside in genomic locus NT_034487. GPR39 encodes G protein-coupled receptor 39, and has LocusID: 2863 and reported cytogenetic location 2q21-q22. [0323] Nucleotides 1317 to 1481 of CPS 225 have 98% sequence identity with a chromosome 12 region near LOC256746. LOC256746 encodes phosphodiesterase 3 A, cGMP-inhibited. Nucleotides 1381 to 1418 and 1461 to 1502 have 100% sequence identity with an intron sequence of LOC126500. LOC126500 encodes a protein similar to hypothetical zinc finger protein KIAA1473. It is located on chromosome 19pl3.11, and resides in genomic locus NT_033317. Other fragments of CPS 225 have 84-93% sequence identity with regions on chromosomes 7, 11, 14, 18 and 21.
[0324] Blast search of the Entrez human genome sequence database shows that CPS
226 aligns to a region located in an intron of ATP8A. CPS 226 aligns with the non-protein- coding strand of ATP8A2. ATP8A2 encodes ATPase, aminophospholipid transporter-like, Class I, type 8 A, member 2. ATP8A2 gene has LocusID: 51761, and is located on chromosome 13 with reported cytogenetic location 13ql2-13. ATP8A2 resides in genomic locus NT_009799 (NCBI Genome Annotation). ATP8A2 gene product may be a tumor suppressor. Loss of its function can convert cells to tumorigenic phenotype. [0325] CPS 227 corresponds to ARHC which encodes ras homolog gene family, member C. This gene has LocusID: 389, and is located on chromosome 1 with reported cytogenetic location Ip21-pl3. The gene resides in genomic locus NT_019273 (NCBI Genome Annotation). The gene product can regulate reorganization of the actin cytoskeleton. CPS 227 aligns with the 3' UTR region of ARHC. The alignment is also 3' to MOV10 which encodes MovlO, Moloney leukemia virus 10, homolog (mouse). MOV10 has LocusID: 4343 and reported cytogenetic location lpl2. It resides in genomic locus NT_019273.
[0326] CPS 228 corresponds to CREM which encodes cAMP responsive element modulator. This gene has LocusID: 1390, and is located on chromosome 10 with reported cytogenetic location 10pl2.1-pll.l. The gene resides in genomic locus NT_033896-(NCBI Genome Annotation). Cyclic AMP-responsive element modulator is a regulator of the transcription of cAMP-inducible genes.
[0327] CPS 229 corresponds to ISG15 which encodes interferon-stimulated protein,
15 kDa. This gene has LocusID: 9636, and is located on chromosome 1 with reported cytogenetic location lp36.33. The gene resides in genomic locus NT_004350 (NCBI
Genome Annotation). The expression of ISG15 can be induced by interferon.
[0328] CPS 230 corresponds to CCR7 which encodes chemokine (C-C motif) receptor 7. This gene has LocusID: 1236, and is located on chromosome 17 with reported cytogenetic location 17ql2-q21.2 . The gene resides in genomic locus NT_024901 (NCBI
Genome Annotation). The gene product is a G protein-coupled receptor, and can bind to
CC chemokine ELC and mediate intracellular calcium flux.
[0329] CPS 231 corresponds to Fl l which encodes coagulation factor XI (plasma thromboplastin antecedent). This gene has LocusID: 2160, and is located on chromosome 4 with reported cytogenetic location 4q35. The gene resides in genomic locus NT_022792
(NCBI Genome Annotation). Alternative splicing of the gene results in at least two transcripts. One transcript encodes the circulating plasma factor XI and an alternate transcript lacking exon 5 encodes the platelet factor XI. The plasma factor XI is present in plasma as a zymogen. The platelet factor XI is localized to platelets and megakaryocytes, and may play a role both in the maintenance of normal hemostasis and as a substitute for plasma factor XI.
[0330] Blast search of the Entrez human genome sequence database shows that CPS
232 aligns with a chromosome 4 region near LOC 166760. This chromosome 14 region resides in genomic locus NT_006344 (NCBI Genome Annotation). LOC 166760 has reported cytogenetic location 4pl5.32.
[0331] CPS 233 corresponds to UNK AC002073 (MGC17330) which encodes hypothetical protein MGC17330. This gene has LocusID: 113791, and is located on chromosome 22 with reported cytogenetic location 22qll.2-q22. The gene resides in genomic locus NT_011520 (NCBI Genome Annotation).
[0332] CPS 234 corresponds to STK17A which encodes serine/threonine kinase 17a
(apoptosis-inducing). This gene has LocusID: 9263, and is located on chromosome 7 with reported cytogenetic location 7pl2-pl4. The gene resides in genomic locus NT_007819
(NCBI Genome Annotation). The alignment of CPS 234 and STK17A resides in an intron of FLJ10803. FLJ10803 encodes hypothetical protein FLJ10803, and has LocusID: 55744 and reported cytogenetic location 7pl5.1.
[0333] CPS 235 corresponds to BLVRA which encodes biliverdin reductase A.
This gene has LocusID: 644, and is located on chromosome 7 with reported cytogenetic location 7pl4-cen. The gene resides in genomic locus NT_007819 (NCBI Genome
Annotation).
[0334] CPS 236 corresponds to SPP1 which encodes secreted phosphoprotein 1
(osteopontin, bone sialoprotein I, early T-lymphocyte activation 1). This gene has LocusID:
6696, and is located on chromosome 4 with reported cytogenetic location 4q21-q25. The gene resides in genomic locus NT_006204 (NCBI Genome Annotation). The gene product is a bone and blood vessel extracellular matrix protein involved in calcification and atherosclerosis.
[0335] CPS 237 corresponds to STX1A which encodes syntaxin 1A (brain). This gene has LocusID: 6804, and is located on chromosome 7 with reported cytogenetic location 7ql l.23. The gene resides in genomic locus NT_007758 (NCBI Genome
Annotation). Syntaxin 1A (brain) may be involved in intracellular transport and neurotransmitter release.
[0336] CPS 238 corresponds to TKTL1 which encodes transketolase-like 1. This gene has LocusID: 8277, and is located on chromosome X with reported cytogenetic location Xq28. The gene resides in genomic locus NT_025965 (NCBI Genome
Annotation). Transketolase-like 1 is a thiamine pyrophosphate-dependent enzyme of pentose phosphate pathway.
[0337] CPS 239 corresponds to IER3 which encodes immediate early response 3.
This gene has LocusID: 8870, and is located on chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI Genome
Annotation). This gene functions in the protection of cells from Fas or tumor necrosis factor type alpha-induced apoptosis. Alternative splicing of this gene results in at least two transcript variants.
[0338] CPS 241 corresponds to PLAU which encodes plasminogen activator, urokinase. This gene has LocusID: 5328, and is located on chromosome 10 with reported cytogenetic location 10q24. The gene resides in genomic locus NT_033890 (NCBI
Genome Annotation). Urokinase plasminogen activator is a serine protease that cleaves plasminogen to form plasmin. [0339] CPS 242 corresponds to ASS which encodes argininosuccinate synthetase.
This gene has LocusID: 445, and is located on chromosome 9 with reported cytogenetic location 9q34.1. The gene resides in genomic locus NT_008338 (NCBI Genome Annotation). The protein encoded by this gene catalyzes the penultimate step of the arginine biosynthetic pathway. There are approximately 10 to 14 copies of this gene including the pseudogenes scattered across the human genome, among which the one located on chromosome 9 appears to be the only, functional gene for argininosuccinate synthetase. Mutations in the chromosome 9 copy of ASS cause citrullinemia. At least two alternatively spliced transcript variants of this gene have been reported. [0340] CPS 242 also has 87-94% sequence identity with various regions on chromosome 2, 4, 5, 6, 7, 9, 11, 12 and X. These regions include LOC 167449, LOC222906, ASSP2, ASSP3, ASSP4, ASSP5, LOC158452, LOC253843, LOC120341, LOC167519, and an intron of intron of LOC90957. LOC167449 encodes a protein similar to argininosuccinate synthetase and resides in genomic locus NT_006431 with reported cytogenetic location 5ql l.2. LOC222906 encodes a protein similar to argininosuccinate synthetase, and resides in genomic locus NT_007819 with reported cytogenetic location 7p21.3. ASSP2 (LocusID: 447) encodes argininosuccinate synthetase pseudogene 2, and resides in genomic locus NT_007592 with reported cytogenetic location 6p22.1. ASSP3 (LocusID: 448) encodes argininosuccinate synthetase pseudogene 3, and resides in genomic locus NT_033256 with reported cytogenetic location 9ql l-q22. ASSP4 (LocusID: 449) encodes argininosuccinate synthetase pseudogene 4, and resides in genomic locus NT _025302 with reported cytogenetic location Xpter-ρ22. ASSP5 (LocusID: 450) encodes argininosuccinate synthetase pseudogene 5, and resides in genomic locus NT_028405 with reported cytogenetic location Xq22-q26. LOC 158452 encodes a protein similar to argininosuccinate synthetase, and resides in genomic locus NT_023935 with reported cytogenetic location 9q21.31. LOC253843 encodes a protein similar to argininosuccinate synthetase, and resides in genomic locus NT_006258 on chromosome 4. LOC 120341 encodes a protein similar to argininosuccinate synthetase, and resides in genomic locus NT_009151 with reported cytogenetic location l lq23.1. LOC 167519 encodes a protein similar to argininosuccinate synthetase, and resides in genomic locus NT_034778 with reported cytogenetic location 5q31.3. LOC90957 (LocusID: 90957) encodes DEAH-box RNA/DNA helicase AAM73547, and resides in genomic locus NT__005367 with -reported cytogenetic location 2p22.3. [0341] Affymetrix annotation suggests that CPS 244 corresponds to DUX1 which encodes double homeobox, 1. The gene has LocusID: 26584.
[0342] Blast search of the Entrez human genome sequence database shows that nucleotides 1 to 689 of A JOO 1481 (SEQ ID NO: 227) align with two chromosome 4 regions with 85-88% sequence identity. The first region is located 3' to LOC131308. LOC131308 encodes a protein similar to FSHD Region Gene 2 protein. It has reported cytogenetic location 3pl4.1 and resides in genomic locus NT_022665. The second region is located near TRAP95. TRAP95 encodes thyroid hormone receptor-associated protein, 95-kD subunit. TRAP95 has LocusID: 10025 and reported cytogenetic location 19pl3.3. It resides in genomic locus NT_011277. TRAP95 gene product is a subunit of TRAP thyroid hormone receptor-associated protein complex, and functions as a coactivator for nuclear receptors.
[0343] CPS 245 corresponds to GPA33 which encodes glycoprotein A33
(transmembrane). This gene has LocusID: 10223, and is located on chromosome 1 with reported cytogenetic location lq23.2. The gene resides in genomic locus NT 304668 (NCBI Genome Annotation). The glycoprotein encoded by this gene is a cell surface antigen that is expressed in human colon cancers. The sequence of the extracellular region of the encoded protein contains 2 domains characteristic of the CD2 subgroup of the immunoglobulin (Ig) superfamily. The encoded protein may play a role in cell adhesion. [0344] CPS 246 corresponds to PAH (SERPINEl) which encodes serine (or cysteine) proteinase inhibitor, clade E (nexin, plasminogen activator inhibitor type 1), member 1. This gene has LocusID: 5054, and is located on chromosome 7 with reported cytogenetic location 7q21.3-q22. The gene resides in genomic locus NT_007933 (NCBI Genome Annotation). The gene product may regulate fibrinolysis. It is a member of the serpin family of serine protease inhibitors.
[0345] CPS 247 corresponds to CYP1B1 which encodes cytochrome P450, subfamily I (dioxin-inducible), polypeptide 1 (glaucoma 3, primary infantile). This gene has LocusID: 1545, and is located on chromosome 2 with reported cytogenetic location 2p21. The gene resides in genomic locus NT_005367 (NCBI Genome Annotation). This gene product is a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. The enzyme encoded by CYP1B1 gene localizes to the endoplasmic reticulum and metabolizes procarcmogens such as polycyclic aromatic hydrocarbons and 17beta-estradiol. Mutations in this gene have been associated with primary congenital glaucoma.
[0346] CPS 249 corresponds to HU-K5 (MGLL) which encodes monoglyceride lipase. This gene has LocusID: 11343, and is located on chromosome 3 with reported cytogenetic location 3q21.3. The gene resides in genomic locus NT_005588 (NCBI
Genome Annotation). The gene product may function in regulating levels of lysophospholipids. ,
[0347] Affymetrix annotation suggests that CPS 250 corresponds to DORA. DORA is also known as IGSF6 which encodes immunoglobulin superfamily, member 6. The gene has LocusID: 10261, and is located at chromosome 16pl2-pl3.
[0348] Blast search of the Entrez human genome sequence database shows that CPS
250 has at least 97% sequence identity with an intron sequence of DREV1. DREVl encodes CGI-81 protein. DREVl has LocusID: 51108, and is located on chromosome 16 with reported cytogenetic location 16pl3-pl2. The gene resides in genomic locus
NT_010441 (NCBI Genome Annotation).
[0349] CPS 253 corresponds to MD-2 which encodes MD-2 protein. This gene has
LocusID: 23643, and is located on chromosome 8 with reported cytogenetic location
8ql3.2. The gene resides in genomic locus NT_008209 (NCBI Genome Annotation). The
MD-2 protein appears to associate with toll-like receptor 4 on the cell surface and confer responsiveness to lipopolysaccyaride (LPS), thus providing a link between the receptor and
LPS signaling.
[0350] CPS 254 corresponds to UNK_U07563 (RRP4) which encodes homolog of yeast RRP4 (ribosomal RNA processing 4), 3'-5'-exoribonuclease. This gene has LocusID:
23404, and is located on chromosome 9 with reported cytogenetic location 9q34. The gene resides in genomic locus NT_008338 (NCBI Genome Annotation). The gene product, also known as ribosomal RNA processing 4, is similar to S. cerevisiae RRP4 which is a component of both the nuclear and cytoplasmic forms of the ribosomal RNA processing 3'-
5' exosome complex. UNKJU07563 or RRP4 gene product may function in ribosomal
RNA processing.
[0351] Affymetrix annotation suggests that CPS 255 corresponds to SPINK1 which encodes serine protease inhibitor, Kazal type 1. The gene has LocusID: 6690 and reported cytogenetic location 5q32. [0352] Blast search of the Entrez human genome sequence database shows that CPS
255 has at least 97% sequence identity with a chromosome 5 region between LOC256135 and KIAA0555. Both LOC256135 and KIAA0555 reside in genomic locus NT_006859
(NCBI Genome Annotation). LOC256135 encodes a protein similar to N-formyl peptide receptor. KIAA0555 encodes KIAA0555 gene product, and has LocusID: 9832 and reported cytogenetic location 5q32.
[0353] CPS 256 corresponds to UNK_S82470 (also known as BB1 or LENG4) which encodes leukocyte receptor cluster (LRC) member 4. This gene has LocusID: 79143, and is located on chromosome 19 with reported cytogenetic location 19ql3.4. The gene resides in genomic locus NT_011148 (NCBI Genome Annotation).
[0354] Affymetrix annotation suggests that CPS 257 corresponds to IGHG3. The gene encodes immunoglobulin heavy constant gamma 3 (G3m marker), and has LocusID:
3502. The gene is located at chromosome 14q32.33.
[0355] Blast search of the Entrez human genome sequence database shows that CPS
257 has at least 96% sequence identity with a chromosome 14 region near LOC122595.
LOC122595 has reported cytogenetic location 14q32.33 and resides in genomic locus
NT_010168 (NCBI Genome Annotation).
[0356] CPS 260 corresponds to TNFRSF12 which encodes tumor necrosis factor receptor superfamily, member 12 (translocating chain-association membrane protein). This gene has LocusID: 8718, and is located on chromosome 1 with reported cytogenetic location lp36.2. The gene resides in genomic locus NT_028054 (NCBI Genome
Annotation). The gene product contains a cytoplasmic death domain and transmembrane domains. It can induce apoptosis and activates NF-kappaB.
[0357] CPS 261 corresponds to UNK AL031685 (KIAA0939) which encodes
KIAA0939 protein. This gene has LocusID: 23315, and is located on chromosome 20 with reported cytogenetic location 20ql3.13. The gene resides in genomic locus NT_011362
(NCBI Genome Annotation).
[0358] CPS 262 corresponds to UNK_AL049963 (LOC64116) which encodes a protein up-regulated by BCG-CWS. This gene has LocusID: 64116, and is located on chromosome 4 with reported cytogenetic location 4q22-q24. The gene resides in genomic locus NT_006383 (NCBI Genome Annotation). [0359] Affymetrix annotation suggests that CPS 263 corresponds to APELIN which encodes apelin, a peptide ligand for APJ receptor. The gene has LocusID: 8862, and is located at chromosome Xq25-26.3.
[0360] Blast search of the Entrez human genome sequence database shows that CPS
263 aligns with OCRL with at least 97% sequence identity. OCRL refers to oculocerebrorenal syndrome of Lowe. This gene has LocusID: 4952, and is located on chromosome X with reported cytogenetic location Xq25-q26.1. The gene resides in genomic locus NT_011786 (NCBI Genome Annotation). Mutations in this gene are linked to the disease oculocerebrorenal syndrome of Lowe. The encoded protein is a phospatidylinositol polyphosphate 5-phosphatase that can be found in golgi cisternae. [0361] CPS 264 corresponds to VEGF which encodes vascular endothelial growth factor. This gene has LocusID: 7422, and is located on chromosome 6 with reported cytogenetic location 6pl2. The gene resides in genomic locus NT_007592 (NCBI Genome Annotation). Vascular endothelial growth factor can induce endothelial cell proliferation and vascular permeability.
[0362] CPS 265 corresponds to ELAVL2 which encodes ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 (Hu antigen B). This gene has LocusID: 1993, and is located on chromosome 9 with reported cytogenetic location 9p21. The gene resides in genomic locus NT_023974 (NCBI Genome Annotation). The gene product can bind to, 3 - untranslated regions of mRNA.
[0363] CPS 266 corresponds to PPP2R2A which encodes protein phosphatase 2
(formerly 2A), regulatory subunit B (PR 52), alpha isoform. This gene has LocusID: 5520, and is located on chromosome 8 with reported cytogenetic location 8p21.1. The gene resides in genomic locus NT_023666 (NCBI Genome Annotation).
[0364] Nucleotides 163 to 505 of CPS 266 have 96% sequence identity with an intron sequence of TSC22. TSC22 encodes transforming growth factor beta-stimulated protein TSC-22. It has LocusID: 8848, and resides in genomic locus NT_033922 with reported cytogenetic location 13ql4. CPS 266 aligns with the non-protein-coding strand of the gene.
[0365] CPS 267 corresponds to UNK_AF001862 (FYB) which encodes FYN binding protein (FYB-120/130). This gene has LocusID: 2533, and is located on chromosome 5 with reported cytogenetic location 5pl3.1. The gene resides in genomic locus NT_023195 (NCBI Genome Annotation). FYN-binding protein can modulate interleukin 2 production.
[0366] CPS 268 corresponds to OAS1 which encodes 2',5'-oligoadenylate synthetase 1 (40-46 kD). This gene has LocusID: 4938, and is located on chromosome 12 with reported cytogenetic location 12q24.1. The gene resides in genomic locus NT_009770 (NCBI Genome Annotation). This gene product is a member of the 2', 5' oligoadenylate synthase family. It can be induced by interferons and catalyze the 2', 5' oligomers of adenosine in order to bind and activate RNase L. This gene family plays a significant role in the inhibition of cellular protein synthesis and viral infection resistance. Alternative splicing of OAS1 gene produces at least two isoforms.
[0367] CPS 269 corresponds to CYBB which encodes cytochrome b-245, beta polypeptide (chronic granulomatous disease). This gene has LocusID: 1536, and is located on chromosome X with reported cytogenetic location Xp21.1. The gene resides in genomic locus NT_011657 (NCBI Genome Annotation). Cytochrome b (-245) is composed of cytochrome b alpha (CYBA) and beta (CYBB) chain. It has been proposed as a primary component of the microbicidal oxidase system of phagocytes. CYBB deficiency is one of five described biochemical defects associated with chronic granulomatous disease (CGD). In this disorder, there is decreased activity of phagocyte NADPH oxidase. Neutrophils are able to phagocytize bacteria but cannot kill them in the phagocytic vacuoles. The cause of the killing defect may be an inability to increase the cell's respiration and consequent failure to deliver activated oxygen into the phagocytic vacuole.
[0368] CPS 270 corresponds to GFPT2 which encodes glutamine-fructose-6- phosphate transaminase 2. This gene has LocusID: 9945, and is located on chromosome 5 with reported cytogenetic location 5q34-q35. The gene resides in genomic locus NT_006519 (NCBI Genome Annotation). The encoded protein is an enzyme of the hexosamine biosynthetic pathway.
[0369] Affymetrix annotation suggests that CPS 271 corresponds to BNIP3 which encodes BCL2/adenovirus E1B 19kDa interacting protein 3. The gene has LocusID: 664, and is located at chromosome 14ql 1.2-ql2.
[0370] Blast search of the Entrez human genome sequence database shows that CPS
271 has 100% sequence identity with an intron sequence of LOC159348. LOC159348 is located at chromosome 10q26.3 and has genomic locus NT_024040 (NCBI Genome Annotation). Nucleotides 1 to 451 of CPS 270 have 97% sequence identity with a region 3' to the protein-coding sequence of LOC254590. LOC254590 resides in genomic locus NT_025892 on chromosome 14. In addition, nucleotides 4 to 450 align to a chromosome 15 region near CAPN3 with 81% sequence identity. CAPN3 encodes calpain 3, (p94), and has LocusID: 825 and reported cytogenetic location 15ql5.1-q21.1. CAPN3 resides in genomic locus NT_030828.
[0371] CPS 272 corresponds to BST1 which encodes bone marrow stromal cell antigen 1. This gene has LocusID: 683, and is located on chromosome 4 with reported cytogenetic location 4pl5. The gene resides in genomic locus NT_006344 (NCBI Genome Annotation). Bone marrow stromal cell antigen 1 is a stromal cell line-derived glycosylphosphatidylinositol-anchored molecule that facilitates pre-B-cell growth. BST1 expression can be enhanced in bone marrow stromal cell lines derived from patients with rheumatoid arthritis. The polyclonal B-cell abnormalities in rheumatoid arthritis may be, at least in part, attributed to BST1 over-expression in the stromal cell population. [0372] CPS 273 corresponds to UNK_U50535 (CG005) wliich encodes hypothetical protein from BCRA2 region. This gene has LocusID: 10443, and is located on chromosome 13 with reported cytogenetic location 13ql2-ql3. The gene resides in genomic locus NTJ309984 (NCBI Genome Annotation). The gene product has a region of low similarity to a region of rat 2',3 '-cyclic nucleotide 3 '-phosphodiesterase
[0373] CPS 274 corresponds to CX3CR1 which encodes chemokine (CX3C) receptor 1. This gene has LocusID: 1524, and is located on chromosome 3 with reported cytogenetic location 3p21.3. The gene resides in genomic locus NT_005498 (NCBI Genome Annotation). CX3C chemokine receptor is a G protein-coupled receptor. It can mediate leukocyte migration and adhesion, bind the CX3C chemokine fractalkine and signal through a pertussis toxin sensitive G-protein.
[0374] CPS 275 corresponds to CKTSF1B1 which encodes cysteine knot superfamily 1, BMP antagonist 1. This gene has LocusID: 26585, and is located on chromosome 15 with reported cytogenetic location 15ql3-ql5. The gene resides in genomic locus NT_024680 (NCBI Genome Annotation). The gene product is a homolog of Xenopus laevis Gremlin wliich is a secreted protein that blocks signaling of bone morphogenetic protein (BMP) by preventing access to the BMP receptor. [0375] CPS 276 corresponds to VNN2 wliich encodes vanin 2. This gene has
LocusID: 8875, and is located on chromosome 6 with reported cytogenetic location 6q23- q24. The gene resides in genomic locus NT_025741 (NCBI Genome Annotation). This gene product is a member of the Vanin family of proteins which share sequence similarity with each other, and also with biotinidase. The family includes secreted and membrane- associated proteins, a few of which have been reported to participate in hematopoietic cell trafficking. Members of this family possess pantetheinase activity, and may play a role in oxidative-stress response. Vanin 2 is a GPI-anchored cell surface molecule that plays a role in transendothelial migration of neutrophils. VNN2 gene lies in close proximity to, and in same transcriptional orientation as two other vanin genes on chromosome 6q23-q24. Two transcript variants encoding different isoforms have 'been described for this gene. [0376] CPS 277 corresponds to KIAA0064 (SNX17) which encodes sorting nexin
17. This gene has LocusID: 9784, and is located on chromosome 2 with reported cytogenetic location 2p23-p22. The gene resides in genomic locus NT_005204 (NCBI Genome Annotation).
[0377] CPS 278 corresponds to GROl which encodes GROl oncogene (melanoma growth stimulating activity, alpha). This gene has LocusID: 2919, and is located on chromosome 4 with reported cytogenetic location 4q21. The gene resides in genomic locus NT_006216 (NCBI Genome Annotation). The gene product has melanoma growth stimulating activity. It may be a mitogenic factor involved in inflammatory processes. [0378] CPS 278 has 88% sequence identity with GRO2 which encodes GRO2 oncogene. GRO2 has LocusID: 2920, and resides in genomic locus NT_00621-6 with reported cytogenetic location 4q21.
[0379] CPS 279 corresponds to KIAA0655 (HIP12) which encodes huntingtin interacting protein 12. This gene has LocusID: 9026, and is located on chromosome 12 with reported cytogenetic location 12q24. The gene resides in genomic locus NT_009464 (NCBI Genome Annotation).
[0380] Nucleotides 13 to 332 of CPS 279 aligns to an intron of DNAH11 with 100% sequence identity. DNAH11 encodes dynein, axonemal, heavy polypeptide 11. It has LocusID: 8701, and
[0381] resides in genomic locus NT_007819 with reported cytogenetic location 7p21. CPS 279 aligns with the non-protein-coding strand of DNAH11.
[0382] CPS 280 corresponds to RBI which encodes retinoblastoma 1 (including osteosarcoma). This gene has LocusID: 5925, and is located on chromosome 13 with reported cytogenetic location 13ql4.2. The gene resides in genomic locus NT_033922 (NCBI Genome Annotation). Retinoblastoma protein 1 is a nuclear phosphoprotein with DNA binding activity. It can interact with histone deacetylase to repress transcription. [0383] CPS 281 corresponds to KIAA0073 which encodes KIAA0073 protein. This gene has LocusID: 23398, and is located on chromosome 5 with reported cytogenetic location 5ql2.3. The gene resides in genomic locus NT_006431 (NCBI Genome Annotation).
[0384] CPS 282 corresponds to CAPN4 (CAPNS1) which encodes calpain, small subunit 1. This gene has LocusID: 826, and is located on chromosome 19 with reported cytogenetic location 19ql3.13. The gene resides in genomic locus NT_011296 (NCBI Genome Annotation). Calpains are a ubiquitous, well-conserved family of calcium- dependent, cysteine proteases. Calpain I and II are heterodimeric with distinct large subunits associated with common small subunits. CAPN4 (CAPNS1) gene encodes a small subunit common to both calpain I and II and is associated with myotonic dystrophy. [0385] CPS 284 corresponds to MGST 1 L 1 (PTGES) which encodes prostaglandin E synthase. This gene has LocusID: 9536, and is located on chromosome 9 with reported cytogenetic location 9q34.3. The gene resides in genomic locus NTJ329366 (NCBI Genome Annotation). Prostaglandin (PG) E synthase is involved in eicosanoid and glutathione metabolism. It is a member of superfamily of membrane associated proteins. [0386] CPS 285 corresponds to C1NH (SERPINGl) which encodes serine (or cysteine) proteinase inhibitor, clade G (Cl inhibitor), member 1, (angioedema, hereditary). This gene has LocusID: 710, and is located on chromosome 11 with reported cytogenetic location I lql2-ql3.1. The gene resides in genomic locus NT_033903 (NCBI Genome Annotation). The protein encoded by C1NH gene can inhibit activated Clr and Cis of the first complement component and thus regulate complement activation. Deficiency of the encoded protein may be associated with hereditary angioneurotic oedema (HANE). The encoded protein is a member of the serine protease inhibitor (serpin) superfamily. [0387] Affymetrix annotation suggests that CPS 286 corresponds to H1FX. H1FX encodes HI histone family, member X. It has LocusID: 8971.
[0388] CPS 287 corresponds to IL1B which encodes interleukin 1, beta. This gene has LocusID: 3553, and is located on chromosome 2 with reported cytogenetic location 2ql4. The gene resides in genomic locus NT_022135 (NCBI Genome Annotation). Interleukin 1 beta may initiate and amplify the immune and inflammatory responses. [0389] CPS 288 corresponds to F3 which encodes coagulation factor III
(thromboplastin, tissue factor). This gene has LocusID: 2152, and is located on chromosome 1 with reported cytogenetic location Ip22-p21. The gene resides in genomic locus NT_021979 (NCBI Genome Annotation). Coagulation factor III which is a cell surface glycoprotein. This factor enables cells to initiate the blood coagulation cascades, and it functions as the high-affinity receptor for the coagulation factor NIL The resulting complex provides a catalytic event that is responsible for initiation of the coagulation protease cascades by specific limited proteolysis. There are at least three distinct domains of this factor: extracellular, transmembrane, and cytoplasmic. The factor functions in normal hemostasis, and is a component of the cellular immune response. [0390] CPS 289 corresponds to DUSP4 which encodes dual specificity phosphatase
4. This gene has LocusID: 1846, and is located on chromosome 8 with reported cytogenetic location 8pl2-pl l. The gene resides in genomic locus ΝT_030743 (NCBI Genome Annotation). The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK JNK, p38), which are associated with cellular proliferation and differentiation. DUSP4 gene product can inactivate ERK1 , ERK2 and JNK, is expressed in a variety of tissues, and is localized in the nucleus. Two alternatively spliced transcript variants, encoding distinct isoforms, have been observed for this gene. In addition, multiple polyadenylation sites have been reported.
[0391] Affymetrix annotation suggests that CPS 290 corresponds to HUMRTVLH3.
HUMRTNLH3 encodes endogenous retroviral protease. The gene has LocusID: 51354. [0392] CPS 291 corresponds to UΝK_AL049250 (BANP) which encodes BTG3 associated nuclear protein. This gene has LocusID: 54971, and resides in genomic locus NT_010859 on chromosome 18 (NCBI Genome Annotation). The gene product can interact with CAF1, a component of the general transcription multisubunit complex. It is thought that BTG3 is involved in negative control of the cell cycle. The protein encoded by BTG3 gene binds to BTG3. Studies with mouse homolog suggest that this protein may also interact with a specific nuclear matrix/scaffold-associated region (MAR). Transcript variants encoding different isoforms have been described for this gene. The alignment between CPS 291 and BANP overlaps LOC 124302 which encodes a protein similar to nuclear pore complex interacting protein. LOC 124302 resides in genomic locus NT_010859 with reported cytogenetic location 18pl 1.1.
[0393] CPS 291 also aligns with other genes and chromosomal regions with at least
97% sequence identity. These genes and chromosomal regions include LOC92267, LOC146181, LOC146136, KIAA0220, LOC253666, and an intron of LOC220565, a region near LOC255019, a region near LOC83985, a region 5' to the protein-coding sequence of NPIP, and a region 3' to the protein-coding sequence of EIF3S8. LOC92267 encodes a protein similar to hypothetical protein FLJ12363. LOC92267 resides in NT_010441 with reported cytogenetic location 16pl2.1. CPS 291 aligns with the non-protein-coding strand of LOC92267. LOCI 46181 encodes a protein similar to Hypothetical protein KIAA0220, and resides in genomic locus NT_010441 with reported cytogenetic location 16pl2.2. LOC146136 encodes a protein similar to nuclear pore complex interacting protein, and resides in genomic locus NT_010441 with reported cytogenetic location 16pl2.1. KIAA0220 has LocusID: 23117, and resides in genomic locus NT_035362 with reported cytogenetic location 16pl2.1. LOC253666 has similarity to rat kidney-specific (KS) gene. LOC253666 resides in genomic locus NT_010604 on chromosome 16. LOC220565 encodes a protein similar to PI-3-kinase-related kinase SMG-1, isoform 1; lambda/iota protein kinase C-interacting protein; phosphatidylinositol 3-kinase-related protein kinase. LOC220565 is located at chromosome 16ql3, and resides in genomic locus NT_033291. LOC255019 encodes a protein similar to nuclear pore complex interacting protein, and resides in genomic locus NT_035360 on chromosome 16. LOC83985 encodes spinster-like protein, and has LocusID: 83985. LOC83985 resides in genomic locus NT_035372 with reported cytogenetic location 16ql3. NPIP (LocusID: 9284) encodes a nuclear pore complex interacting protein, and resides in genomic locus NT_035359 with reported cytogenetic location 16pl3-pl l. EIF3S8 encodes eukaryotic translation initiation factor 3, subunit 8 (HOkD). EIF3S8 has LocusID: 8663, and resides in genomic locus NT_010589 with reported cytogenetic location 16pll.2. LOC146452 encodes a protein similar to KIAA0251 hypothetical protein. It resides in genomic locus NT_010478 with reported cytogenetic location 16q22.3.
[0394] Nucleotides 1 to 196 of CPS 291 have about 86% sequence identity with
LOCI 18735. LOCI 18735 encodes a protein similar to apoptosis response protein (prostate apoptosis response protein 4). LOCI 18735 resides in NT_030059 with reported cytogenetic location 10q24.2. [0395] CPS 292 corresponds to UNK .AB002308 (KIAA0310) wliich encodes
KIAA0310 gene product. This gene has LocusID: 9919, and is located on chromosome 9 with reported cytogenetic location 9q34.3. The gene resides in genomic locus NT_033215
(NCBI Genome Annotation).
[0396] Affymetrix annotation suggests that CPS 293 corresponds to EDN1 which encodes endothelin 1. The gene has LocusID: 1906, and is located at chromosome 6p24.1.
[0397] CPS 293 aligns with IL1B with at least 99% sequence identity. IL1B encodes interleukin 1, beta. '
[0398] CPS 295 corresponds to FRAT2 which encodes frequently rearranged in advanced T-cell lymphomas 2. This gene has LocusID: 23401, and is located on chromosome 10 with reported cytogenetic location 10q23-q24.1. The gene resides in genomic locus NT_03005 (NCBI Genome Annotation).
[0399] CPS 297 corresponds to IL8 which encodes interleukin 8. This gene has
LocusID: 3576, and is located on chromosome 4 with reported cytogenetic location 4ql3- q21. The gene resides in genomic locus NT_006216 (NCBI Genome Annotation).
Interleukin 8 is a cytokine that plays a role in chemoattraction and activation of neutrophils.
It has similarity to several platelet-derived factors.
[0400] Affymetrix annotation suggests that CPS 298 corresponds to FSCN2.
FSCN2 encodes fascin homolog 2, actin-bundling protein, retinal (Strongylocentrotus purpuratus). The gene has LocusID: 25794, and is located at chromosome 17q25.
[0401] CPS 300 corresponds to CAMP which encodes cathelicidin antimicrobial peptide. This gene has LocusID: 820, and is located on chromosome 3 with reported cytogenetic location 3p21.3. The gene resides in genomic locus NT_022567 (NCBI
Genome Annotation). Cathelicidin antimicrobial peptide is a precursor of a peptide with antibacterial activity.
[0402] CPS 301 corresponds to FCGR1A which encodes Fc fragment of IgG, high affinity la, receptor for (CD64). This gene has LocusID: 2209, and is located on chromosome 1 with reported cytogenetic location Iq21.2-q21.3. The gene resides in genomic locus NT_032962 (NCBI Genome Annotation). The gene product, also known as
Fc gamma RI, is a receptor for the Fc domain of IgG. It is a member of the immunoglobulin superfamily, and may have a role in immune response.
[0403] CPS 302 corresponds to MAT1A which encodes methionine adenosyltransferase I, alpha. This gene has LocusID: 4143, and is located on chromosome 10 with reported cytogenetic location 10q22. The gene resides in genomic locus
NT_033890 (NCBI Genome Annotation). Methionine adenosyltransferase I (alpha isoform) catalyzes the formation of S-adenosylmethionine from methionine and ATP. Both the beta and alpha isoforms may be encoded by MATIA. Methionine adenosyltransferase deficiency is known to be caused by recessive as well as dominant mutations, the latter identified in autosomal dominant persistant hypermethioninemia.
[0404] CPS 303 corresponds to CYP4B1 which encodes cytochrome P450, subfamily IVB, polypeptide 1. This gene has LocusID: 1580, and is located on chromosome 1 with reported cytogenetic location Ip34-pl2. The gene resides in genomic locus NT_004386 (NCBI Genome Annotation). This gene product is a member of the cytochrome P450 heme-binding monooxygenase superfamily, and can metabolize steroids, fatty acids and xenobiotics.
[0405] Affymetrix annotation suggests that CPS 304 corresponds to MUC3. MUC3 is also known as MUC3A, and encodes mucin 3A, intestinal. The gene has LocusID: 4584 and reported cytogenetic location 7q22.
[0406] Blast search of the Entrez human genome sequence database shows that CPS
304 aligns to regions 5' to the protein-coding strand of MUC3B with 80-81% sequence identity. MUC3B encodes mucin 3B. It has LocusID: 57876 and reported cytogenetic location 7q22. The gene resides in genomic locus NT_007933.
[0407] CPS 305 corresponds to B7 which encodes B7 protein. This gene has
LocusID: 10233, and is located on chromosome 12 with reported cytogenetic location
12pl3. The gene resides in genomic locus NT_035206 (NCBI Genome Annotation). The gene product has similarity to the regulatory subunit of protein phosphatases. It contains leucine rich repeats, and may mediate protein-protein interactions.
[0408] CPS 306 corresponds to SCYA20 which encodes small inducible cytokine subfamily A (Cys-Cys), member 20. This gene has LocusID: 6364, and is located on chromosome 2 with reported cytogenetic location 2q33-q37. The gene resides in genomic locus NT_005403 (NCBI Genome Annotation). The protein encoded by this gene is a chemotactic factor for lymphocytes.
[0409] CPS 307 corresponds to ILIRN which encodes interleukin 1 receptor antagonist. This gene has LocusID: 3557, and is located on chromosome 2 with reported cytogenetic location 2ql4.2. The gene resides in genomic locus NT_02213 (NCBI Genome Annotation). Interleukin 1 receptor antagonist binds to and inhibits the IL-1 receptor. It is a member of the interleukin- 1 (IL-1) family.
[0410] CPS 308 corresponds to ZNF261 which encodes zinc finger protein 261.
This gene has LocusID: 9203, and is located on chromosome X with reported cytogenetic location Xql3.1. The gene resides in genomic locus NT_019696 (NCBI Genome Annotation). The gene product contains a putative zinc-binding motif (MYM). [0411] CPS 309 corresponds to LGALS2 which encodes lectin, galactoside-binding, soluble, 2 (galectin 2). This gene has LocusID: 3957, and is located on chromosome 22 with reported cytogenetic location 22ql3.1. The gene resides in genomic locus NT_011520 (NCBI Genome Annotation). Galectin 2 is an actose-binding lectin involved in cell growth regulation.
[0412] CPS 310 corresponds to TGN51 (TGOLN2) which encodes trans-golgi network protein 2. This gene has LocusID: 10618, and is located on chromosome 2 with reported cytogenetic location 2pl l.2. The gene resides in genomic locus NT_015805 (NCBI Genome Annotation).
[0413] The biological mechanisms underlying the CCI-779 modulation of the expression levels of the CCI-779 activity genes have yet to be elucidated. Without being limited to any specific theory, the modulation may be attributed to the direct effect of CCI- 779 on PBMCs or other blood cells. It may also be caused by the effect of CCI-779 on renal cell carcinoma tumors, which in turn induce the change of the gene expression profile in the peripheral blood cells.
[0414] Some of the CCI-779 activity genes are RCC disease genes that are differentially expressed in PBMCs of RCC patients relative to tumor-free humans. These genes include, for example, TUBB, CTSL, SCYA2, PAI2, UNK_AI679353, SCYA7, IL1R1, THBS1, NCF1, ATP2B1, GRO2, PRF1, DBP, FCGR3B, ADFP, GRO3, C1QR, RNASE2, CBP2, HK3, FOS, PDI2, UNKJJ12471 (THBS1), EREG, SPP1, STX1A, TKTL1, DUX1, SPINK1, UNK AL049963 (LOC64116), BNIP3, GROl, IL1B, F3, UNK AL049250, EDN1, FCGR1A, MUC3, B7, SCYA20, IL1RN, and ZNF261. Over the course of CCI-779 therapy, at least one subset of these genes, which are elevated or suppressed in RCC patients, return to the normal baseline levels.
[0415] Table 6 shows examples of RCC disease genes whose expression profiles in
PBMCs can be modulated by CCI-779. Modulation of the expression profiles of these genes in ex vivo conditions is also indicated. [0416] In one experiment, peripheral blood mononuclear cells isolated from tumor- free humans were treated with CCI-779 ex vivo. Comparison of the genes sensitive to CCI- 779 treatment ex vivo with the CCI-779 activity genes of the present invention revealed a common set of genes. These genes represent surrogate markers of CCI-779 drug activity in vivo as well as ex vivo.
[0417] As appreciated by those skilled in the art, the above-described methodology can be employed to identify genes whose expression profiles in PBMCs can be modulated by other drugs.
Table 6. Modulation of RCC Disease Genes by CCI-779
11,5
o o o o
H U α.
vo o o o
O
C. Monitoring CCI-779 Drug Activities
[0418] The CCI-779 activity genes identified in the present invention can be used to monitor CCI-779 drug activities in a patient who is subject to a CCI-779 treatment. Peripheral blood samples can be isolated at different stages of the CCI-779 treatment. The expression profile of one or more CCI-779 activity genes in these peripheral blood samples can be determined and compared to a reference expression profile. A change in the gene expression profile indicates in vivo activities of CCI-779. In one embodiment, the patent has RCC or another solid tumor.
[0419] Numerous methods are available for detecting gene expression profiles. In one aspect, the expression profiles of CCI-779 activity genes are determined by measuring the levels of RNA transcripts of these genes in peripheral blood samples. Suitable methods for this purpose include, but are not limited to, RT-PCT, Northern Blot, in situ hybridization, slot-blotting, nuclease protection assay, and nucleic acid arrays. The peripheral blood samples can be, without limitation, whole blood samples or samples containing enriched PBMCs.
[0420] In one embodiment, RNA isolated from peripheral blood samples can be first amplified to cDNA or cRNA. The amplification can be specific or non-specific. Suitable amplification methods include, but are not limited to, reverse transcriptase PCR, isothermal amplification, ligase chain reaction, and Qbeta replicase. The amplified nucleic acid products can be detected or quantitated, for instance, through hybridization to labeled probes.
[0421] Amplification primers and hybridization probes for a CCI-779 activity gene can be prepared from the gene sequence or its corresponding CPS using numerous methods. Gene sequences suitable for this purpose include, but are not limited to, exons, introns, or the 3' or 5' untranslated regions, or any combination thereof. In one embodiment, probes/primers are designed based on the sequence in or near the 3 ' protein-coding region of a CCI-779 activity gene. For instance, the nucleotide sequence encoding the last 100 to 300 amino acid residues in the C-terminus region of the CCI-779 activity gene product can be selected to design probes or primers. Where a CCI-779 activity gene is a hypothetical or putative gene whose expression is supported only by EST or mRNA data, or where the genomic location(s) of a CCI-779 activity gene has not been determined or the gene may correspond to multiple genomic loci, the probes/primers for the gene can be designed based on the corresponding CPS, or the oligonucleotide probes of the corresponding qualifier. Table 5 lists example sequences that are useful for designing probes/primers for detecting the expression profiles of CCI-779 activity genes.
[0422] The length of the probes/primers can be selected to achieve the desired hybridization or amplification effect. For instance, each probe can comprise at least 15, 20, 25, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 400 or more nucleotides. In one embodiment, each probe/primer has relatively high sequence complexity and does not have any ambiguous residue (undetermined "n" residues). In another embodiment, the probes/primers can hybridize to the target gene; such as its RNA transcripts or the complements thereof, under stringent or highly stringent conditions. [0423] As used herein, "stringent conditions" are at least as stringent as, for example, conditions G-L shown in Table 7. "Highly stringent conditions" are at least as stringent as conditions A-F shown in Table 7. As used in Table 1, hybridization is carried out under the hybridization conditions (Hybridization Temperature and Buffer) for about four hours, followed by two 20-minute washes under the corresponding wash conditions (Wash Temp, and Buffer).
Table 7. Stringency Conditions
1: The hybrid length is that anticipated for the hybridized region(s) of the hybridizing polynucleotides. When hybridizing a polynucleotide to a target polynucleotide of unknown sequence, the hybrid length is assumed to be that of the hybridizing polynucleotide. When polynucleotides of known sequence are hybridized, the hybrid length can be determined by aligning the sequences of the polynucleotides and identifying the region or regions of optimal sequence complementarity.
H: SSPE (lxSSPE is 0.15M NaCl, lOmM NaH2PO4, and 1.25mM EDTA, pH 7.4) can be substituted for SSC (lxSSC is 0.15M NaCl and 15mM sodium citrate) in the hybridization and wash buffers.
TB* - TR*: The hybridization temperature for hybrids anticipated to be less than 50 base pairs in length should be 5-10°C less than the melting temperature (Tm) of the hybrid, where Tm is determined according to the following equations. For hybrids less than 18 base pairs in length, Tm(°C) = 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 18 and 49 base pairs in length, Tm(°C) = 81.5 + 16.6(log]0Na+) + 0.41(%G + C) - (600/N), where N is the number of bases in the hybrid, and Na+ is the molar concentration of sodium ions in the hybridization buffer (Na+ for lxSSC = 0.165M).
[0424] In one embodiment, the probes/primers for a CCI-779 activity gene are selected from regions which significantly diverge from the sequences of other genes. Such regions can be determined by checking the probe/primer sequences against a human genome sequence database, such as the Entrez database at the NCBI. One algorithm suitable for this purpose is the BLAST algorithm. This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold. These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence to increase the cumulative alignment score. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. These parameters can be adjusted for different purposes, as appreciated by one of ordinary skill in the art.
[0425] In one aspect, quantitative RT-PCR (such as TaqMan, ABI) is used for detecting and comparing the peripheral blood expression profiles of CCI-779 activity genes. Quantitative RT-PCR involves reverse transcription (RT) of RNA to cDNA followed by relative quantitative PCR (RT-PCR).
[0426] In PCR, the number of molecules of the amplified target DNA increases by a factor approaching two with every cycle of the reaction until some reagent becomes limiting. Thereafter, the rate of amplification becomes increasingly diminished until there is not an increase in the amplified target between cycles. If one plots a graph on which the cycle number is on the X axis and the log of the concentration of the amplified target DNA is on the Y axis, one observes that a curved line of characteristic shape is formed by connecting the plotted points. Beginning with the first cycle, the slope of the line is positive and constant. This is said to be the linear portion of the curve. After some reagent becomes limiting, the slope of the line begins to decrease and eventually becomes zero. At this point the concentration of the amplified target DNA becomes asymptotic to some fixed value. This is said to be the plateau portion of the curve. ,
[0427] The concentration of the target DNA in the linear portion of the PCR is proportional to the starting concentration of the target before the PCR was begun. By determining the concentration of the PCR products of the target DNA in PCR reactions that have completed the same number of cycles and are in their linear ranges, it is possible to determine the relative concentrations of the specific target sequence in the original DNA mixture. If the DNA mixtures are cDNAs synthesized from RNAs isolated from different tissues or cells, the relative abundances of the specific mRNA from which the target sequence was derived may be determined for the respective tissues or cells. This direct proportionality between the concentration of the PCR products and the relative mRNA abundances is true in the linear range portion of the PCR reaction.
[0428] The final concentration of the target DNA in the plateau portion of the curve is determined by the availability of reagents in the reaction mix and is independent of the original concentration of target DNA. Therefore, the sampling and quantifying of the amplified PCR products can be carried out when the PCR reactions are in the linear portion of their curves. In addition, relative concentrations of the amplifiable cDNAs can be normalized to some independent standard, which may be based on either internally existing RNA species or externally introduced RNA species. The abundance of a particular mRNA species may also be determined relative to the average abundance of all mRNA species in the sample.
[0429] In one embodiment, the PCR amplification utilizes internal PCR standards that are approximately as abundant as the target. This strategy is effective if the products of the PCR amplifications are sampled during their linear phases. If the products are sampled when the reactions are approaching the plateau phase, then the less abundant product may become relatively over-represented. Comparisons of relative abundances made for many different RNA samples, such as is the case when examining RNA samples for differential expression, may become distorted in such a way as to make differences in relative abundances of RNAs appear less than they actually are. This can be improved if the internal standard is much more abundant than the target. If the internal standard is more abundant than the target, then direct linear comparisons may be made between RNA samples.
[0430] A problem inherent in clinical samples is that they are of variable quantity or quality. This problem can be overcome if the RT-PCR is performed as a relative quantitative RT-PCR with an internal standard in which the internal standard is an amplifiable cDNA fragment that is larger than the target cDNA fragment and in which the abundance of the mRNA encoding the internal standard is roughly 5-100 times higher than the mRNA encoding the target. This assay measures relative abundance, not absolute abundance of the respective mRNA species.
[0431] In another embodiment, the relative quantitative RT-PCR uses an external standard protocol. Under this protocol, the PCR products are sampled in the linear portion of their amplification curves. The number of PCR cycles that are optimal for sampling can be empirically determined for each target cDNA fragment. In addition, the reverse transcriptase products of each RNA population isolated from the various samples can be normalized for equal concentrations of amplifiable cDNAs. While empirical determination of the linear range of the amplification curve and normalization of cDNA preparations are tedious and time-consuming processes, the resulting RT-PCR assays may, in certain cases, be superior to those derived from a relative quantitative RT-PCR with an internal standard. [0432] Nucleic acid arrays can also be used to detect and compare the expression patterns of CCI-779 activity genes in peripheral blood samples isolated at different CCI-779 treatment stages. Probes suitable for detecting the CCI-779 activity genes can be stably attached to known discrete regions on a support substrate. These probes maintain their positions relative to the respective discrete regions during hybridization and subsequent washes. Construction of nucleic acid arrays is well known in the art. Suitable substrates for making nucleic acid arrays include, but are not limited to, glasses, silica, ceramics, nylons, quartz wafers, gels, metals, papers, beads, tubes, fibers, films, membranes, column matrixes, or microtiter plate wells.
[0433] A nucleic acid array of the present invention can comprise at least 2, 5, 10,
15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, or more different polynucleotide probes, each different probe capable of hybridizing to a different respective CCI-779 activity gene. Multiple probes for the same gene can be used on a single nucleic acid array. Examples of probes suitable for this invention are listed in the Qualifier Table. Probes for other drag activities genes can also be included in the nucleic acid arrays of this invention. The probe density on the array can be in any range. For instance, the density may be 50, 100, 200, 300, 400, 500, or more probes/cm2.
[0434] In one embodiment, a substantial portion of all polynucleotide probes on a nucleic acid array of the present invention are probes for CCI-779 or other drug activity genes. For instance, at least 10%, 20%, 30%, 40%, 50%, or more of all probes on the nucleic acid array can hybridize under stringent or nucleic acid array hybridization conditions to RNA transcripts, or the complements thereof, of drug activity genes. [0435] In another embodiment, nuclease protection assays are used to quantify
RNAs derived from the peripheral blood samples. There are many different versions of nuclease protection assays. The common characteristic of these nuclease protection assays is that they involve hybridization of an antisense nucleic acid with the RNA to be quantified. The resulting hybrid double-stranded molecule is then treated with a nuclease which digests single-stranded nucleic acids more efficiently than double-stranded molecules. The amount of antisense nucleic acid that survives digestion is a measure of the amount of the target RNA species to be quantified. An example of nuclease protection assays is the RNase protection assay manufactured by Ambion, Inc. (Austin, Texas). [0436] In another aspect, the peripheral blood expression profiles of CCI-779 activity genes are determined by measuring the levels of polypeptides encoded by these genes. Methods suitable for this purpose include, but are not limited to, immunoassays such as ELISA, RIA, FACS, dot blot, Western Blot, immunohistochemistry, and antibody-based radioimaging. Other methods such as 2-dimensional SDS-polyacrylamide gel electrophoresis can also be used.
[0437] One exemplary method suitable for detecting the levels of target proteins in peripheral blood samples is ELISA. In an exemplifying ELISA, antibodies capable of binding to the target proteins encoded by one or more CCI-779 activity genes are immobilized onto a selected surface exhibiting protein affinity, such as wells in a polystyrene or polyvinylchloride microtiter plate. Then, peripheral blood samples to be tested are added to the wells. After binding and washing to remove non-specifϊcally bound immunocomplexes, the bound antigen(s) can be detected. Detection can be achieved by the addition of a second antibody which is specific for the target proteins and is linked to a detectable label. Detection may also be achieved by the addition of a second antibody, followed by the addition of a third antibody that has binding affinity for the second antibody, with the third antibody being linked to a detectable label. Befor-e being added to the microtiter plate, cells in the peripheral blood samples can be lysed using various methods known in the art. Proper extraction procedures can be used to separate the target proteins from potentially interfering substances.
[0438] In another exemplifying ELISA, the peripheral blood samples suspected of containing the target proteins are immobilized onto the well surface and then contacted with the antibodies of the invention. After binding and washing to remove non-specifϊcally bound immunocomplexes, the bound antigen is detected. Where the initial antibodies are linked to a detectable label, the immunocomplexes can be detected directly. The immunocomplexes can also be detected using a second antibody that has binding affinity for the first antibody, with the second antibody being linked to a detectable label. [0439] Another exemplary ELISA involves the use of antibody competition in the detection. In this ELISA, the target proteins are immobilized on the well surface. The labeled antibodies are added to the well, allowed to bind to the target proteins, and detected by means of their labels. The amount of the target proteins in an unknown sample is then determined by mixing the sample with the labeled antibodies before or during incubation with coated wells. The presence of the target proteins in the unknown sample acts to reduce the amount of antibody available for binding to the well and thus reduces the ultimate signal.
[0440] Different ELISA formats can have certain features in common, such as coating, incubating or binding, washing to remove non-specifically bound species, and detecting the bound immunocomplexes. For instance, in coating a plate with either antigen or antibody, the wells of the plate can be incubated with a solution of the antigen or antibody, either overnight or for a specified period of hours. The wells of the plate are then washed to remove incompletely adsorbed material. Any remaining available surfaces of the wells are then "coated" with a nonspecific protein that is antigenically neutral with regard to the test samples. Examples of these nonspecific proteins include bovine serum albumin (BSA), casein and solutions of milk powder. The coating allows for blocking of nonspecific adsorption sites on the immobilizing surface and thus reduces the background caused by nonspecific binding of antisera onto the surface.
[0441] In ELISAs, a secondary or tertiary detection means an also be used. After binding of a protein or antibody to the well, coating with a non-reactive material to reduce background, and washing to remove unbound material, the immobilizing surface is contacted with the control or clinical or biological sample to be tested under conditions effective to allow immunocomplex (antigen/antibody) formation. These conditions may include, for example, diluting the antigens and antibodies with solutions such as BSA, bovine garnma globulin (BGG) and phosphate buffered saline (PB$)/Tween and incubating the antibodies and antigens at room temperature for about 1 to 4 hours or at 4°C overnight. Detection of the immunocomplex then requires a labeled secondary binding ligand or antibody, or a secondary binding ligand or antibody in conjunction with a labeled tertiary antibody or third binding ligand.
[0442] Following all incubation steps in an ELISA, the contacted surface can be washed so as to remove non-complexed material. For instance, the surface may be washed with a solution such as PBS/Tween, or borate buffer. Following the formation of specific immunocomplexes between the test sample and the originally bound material, and subsequent washing, the occurrence of the amount of immunocomplexes can be determined. [0443] To provide a detection means, the second or third antibody can have an associated label. In one embodiment, the label is an enzyme that generates color development upon incubating with an appropriate chromogenic substrate. Thus, for example, one may contact and incubate the first or second immunocomplex with a urease, glucose oxidase, alkaline phosphatase or hydrogen peroxidase-conjugated antibody for a period of time and under conditions that favor the development of further immunocomplex formation (e.g., incubation for 2 hours at room temperature in a PBS-containing solution such as PBS-Tween).
[0444] After incubation with the labeled antibody, and subsequent to washing to remove unbound material, the amount of label is quantified, e.g., by incubation with a chromogenic substrate such as urea and bromocresol purple or 2,2'-azido-di-(3-ethyl)- benzthiazoline-6-sulfonic acid (ABTS) and H2O2, in the case of peroxidase as the enzyme label. Quantitation can be achieved by measuring the degree of color generation, e.g., using a spectrophotometer.
[0445] Another method suitable for this invention is RIA (radioimmunoassay). An exemplary RIA is based on the competition between radiolabeled-polypeptides and unlabeled polypeptides for binding to a limited quantity of antibodies. Suitable radiolabels include, but are not limited to, I125. In one embodiment, a fixed concentration of II25-labeled polypeptide is incubated with a series of dilution of an antibody specific to the polypeptide. When the unlabeled polypeptide is added to the system, the amount of the I125-polypeptide that binds to the antibody is decreased. A standard curve can therefore be constructed to represent the amount of antibody-bound I125-polypeptide as a function of the concentration of the unlabeled polypeptide. From this standard curve, the concentration of the polypeptide in unknown samples can be determined.
[0446] Suitable antibodies for this invention include, but are not limited to, polyclonal antibodies, monoclonal antibodies, chimeric antibodies, humanized antibodies, single chain antibodies, Fab fragments, or fragments produced by a Fab expression library. Methods for making these antibodies are well known in the art.
[0447] In one embodiment, the antibodies of the present invention can bind to the corresponding CCI-779 activity gene products or other desired antigens with a binding affinity constant Ka of at least 104 M"1, 105 M"1, 106 M"1, 107 M"1, 108 M"1, or more. [0448] The antibodies of this invention can be labeled with one or more detectable moieties to allow for detection of antibody-antigen complexes. The detectable moieties can include compositions detectable by spectroscopic, enzymatic, photochemical, biochemical, bioelectronic, immunochemical, electrical, optical or chemical means. The detectable moieties include, but are not limited to, radioisotopes, chemiluminescent compounds, labeled binding proteins, heavy metal atoms, spectroscopic markers such as fluorescent markers and dyes, magnetic labels, linked enzymes, mass spectrometry tags, spin labels, electron transfer donors and acceptors, and the like.
[0449] Moreover, the levels of polypeptides in peripheral blood samples can be determined by detecting the biological activities associated with the polypeptides. If a biological function/activity of a polypeptide is known, suitable in vitro bioassays can be designed to evaluate the biological function/activity, which in turn can be used to determine the amount of the polypeptide in the sample.
[0450] Comparison between the expression profile of a patient of interest and a reference expression profile can be conducted manually or electronically. The reference expression profile can be a baseline expression profile representing gene expression in peripheral blood samples isolated prior to a drag treatment. The reference profile can also be an expression profile in peripheral blood samples isolated after initiation of the drag treatment. The reference expression profile can be determine using sample isolated from the patient of interest or other reference patient or patients. In many embodiments, the process or methodology that is used to determine the reference expression profile and the expression profile being compared is identical or comparable.
[0451] In one example, comparison is carried out by comparing each component in the expression profile of the patient of interest to the corresponding component in the reference expression profile(s). The component can be the expression level of a drag activity gene, a ratio between the expression levels of two drag activity genes, or another measure capable of representing gene expression patterns. The expression level of a gene can be an absolute level, or a normalized or relative level. The difference between two corresponding components can be assessed by fold changes, absolute differences, or other suitable means.
[0452] Comparison between expression profiles can also be conducted using pattern recognition or comparison programs. In addition, the serial analysis of gene expression (SAGE) technology, the GEMTOOLS gene expression analysis program (Incyte Pharmaceuticals), the GeneCalling and Quantitative Expression Analysis technology (Curagen), and other suitable methods, programs or systems can be used. [0453] Multiple drag activity genes can be used in the comparison of expression profiles. For instance, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 30, 40, 50, or more drag activity genes can be used.
[0454] In one embodiment, the drag activity gene(s) used in the comparison can be selected to have relatively small p- values or large differential expression ratios. In one example, the drag activity genes used in the comparison have p-values (e.g., under an ANOVA analysis) of no greater than 0.05, 0.01, 0.001, 0.0005, 0.0001, or less. In another example, the expression level after or during a drag treatment is increased or decreased by at least 2-fold, 3-fold, 4-fold, 5-fold, or more over the baseline expression level. [0455] In another embodiment, comparison of the expression profiles is performed electronically, such as by using a computer system. The computer system includes a processor coupled to a memory or storage medium which stores data representing the expression profiles being compared. In one example, the memory or storage medium is readable or rewritable. The stored expression data can be changed, retrieved, or otherwise manipulated. The memory can also store one or more programs capable of causing the processor to compare the expression profiles. In one embodiment, the processor is coupled to a nucleic acid array scanner which sends signals to the processor for analysis. [0456] In vivo activities of other drags can be similarly monitored using the drug activities genes of the present invention, as appreciated by those skilled in the art. [0457] At least a subset of the drag activity genes of the present invention may be used as disease regression indicators. This subset of genes may be responsive to the effect of CCI-779 or other drags on RCC or other diseases. In one example, CCI-779 or another drug may cause regression of tumors, and the decrease in tumor size or activity may cause a decreased immune response which is reflected in gene expression changes in the peripheral blood cells. Consequently, the drag activity genes of the present invention can be used as molecular markers for monitoring the efficacy of CCI-779 or other drugs for treating RCC or other diseases. In many cases, diseases amendable to the present invention include those whose regression can produce changed immune responses.
[0458] It should be understood that the above-described embodiments and the following examples are given by way of illustration, not limitation. Various changes and modifications within the scope of the present invention will become apparent to those skilled in the art from the present description.
E. Examples
Example 1. Isolation of RNA and Preparation of Labeled Microarray Targets
[0459] PBMCs from the CCI-779 clinical trials were isolated from whole blood samples (8mL) and collected into CPT tubes according to the standard procedure. PBMCs were purified over Ficoll gradients, washed two times with PBS and counted. Total RNA was isolated from PBMC pellets using the RNeasy mini kit (Qiagen, Valencia, CA). Labeled target for oligonucleotide arrays was prepared using a modification of I the procedure described in Lockhart, et al, Nature Biotechnology 14: 1675-80 (1996). 2 μg total RNA was converted to cDNA by priming with an oligo-dT primer containing a T7 DNA polymerase promoter at the 5' end. The cDNA was used as the template for in vitro transcription using a T7 DNA polymerase kit (Ambion, Woodlands, TX) and biotinylated CTP and UTP (Enzo). Labeled cRNA was fragmented in 40 mM Tris-acetate pH 8.0, 100 mM KOAc, 30 mM MgO Ac for 35 minutes at 94°C in a final volume of 40 μl.
Example 2. Hybridization to Affymetrix Microarrays and Detection of Fluorescence [0460] Individual samples were hybridized to HgU95A genechip (Affymetrix). No samples were pooled. 110 RCC patients were involved in the study. Each patient received 25, 75, or 250 mg of CCI-779 once weekly through intravenous infusion. Blood samples were collected immediately before the first CCI-779 infusion, eight weeks after the first infusion, and sixteen weeks after the first infusion.
[0461] 10 μg of labeled target was diluted in,l x MES buffer with 100 μg/ml herring sperm DNA and 50 μg/ml acetylated BSA. To normalize arrays to each other and to estimate the sensitivity of the oligonucleotide arrays, in vitro synthesized transcripts of 11 bacterial genes were included in each hybridization reaction as described in Hill et al, Science, 290: 809-812 (2000). The abundance of these transcripts ranged from 1:300,000 (3 ppm) to 1:1000 (1000 ppm) stated in terms of the number of control transcripts per total transcripts. As determined by the signal response from these control transcripts, the sensitivity of detection of the arrays ranged between about 1:300,000 and 1:100,000 copies/million. Labeled probes were denatured at 99°C for 5 minutes and then 45°C for 5 minutes, and hybridized to oligonucleotide arrays comprised of over 12,500 human gene probes (HgU95A, Affymetrix). Arrays were hybridized for 16 hours at 45°C. The hybridization buffer was comprised of 100 mM MES, 1 M fNa+], 20 mM EDTA, and 0.01% Tween 20. After hybridization, the cartridges were washed extensively with non-stringent wash buffer (6 x SSPET), such as three 10-minute washes at room temperature. These hybridization and wash conditions are herein collectively referred to as "nucleic acid array hybridization conditions." The washed cartridges were then stained with phycoerythrin coupled to streptavidin.
[0462] 12 x MES stock contains 1.22 M MES and 0.89 M [Na+]. For 1000 ml, the stock can be prepared by mixing 70.4 g MES free acid monohydrate, 193.3 g MES sodium salt and 800 ml of molecular biology grade water, and adjusting volume to 1000 ml. The pH should be between 6.5 and 6.7. 2 x hybridization buffer can be prepared by mixing 8.3 mL of 12 x MES stock, 17.7 mL of 5 M NaCl, 4.0 mL of 0.5 M EDTA, 0.1 mL of 10% Tween 20 and 19.9 mL of water. 6 x SSPET contains 0.9 M NaCl, 60 mM NaH2PO4, 6 mM EDTA, pH 7.4, and 0.005% Triton X-100. In some cases, the wash buffer can be replaced with a more stringent wash buffer. 1000 ml stringent wash buffer can be prepared by mixing 83.3 mL of 12 x MES stock, 5.2 mL of 5 M NaCl, 1.0 mL of 10% Tween 2ϋ and 910.5 mL of water. Example 3. Gene Expression Data Analysis
[0463] Data analysis was performed on raw fluorescent intensity values using
GENECHIP 3.2 software (Affymetrix). GeneChip 3.2 software uses an algorithm to calculate the likelihood as to whether a gene is "absent" or "present" as well as a specific hybridization intensity value or "average difference" for each transcript represented on the array. The algorithms used in these calculations are described in the Affymetrix GeneChip Analysis Suite User Guide (Affymetrix). The "average difference" for each transcript was normalized to "frequency" values according to the procedures of Hill et al, Science, 290: 809-812 (2000). This was accomplished by referring the average difference values on each chip to a calibration curve constructed from the average difference values for the 11 control transcripts with known abundance that were spiked into each hybridization solution. This process also served to normalize between arrays.
[0464] Specific transcripts were evaluated further if they met the following criteria.
First, genes that were designated "absent" by the GENECHIP 3.2 software in all samples were excluded from the analysis. Second, in comparisons of transcript levels between arrays, a gene was required to be present in at least one of the arrays. Third, for comparisons of transcript levels between groups, an ANOVA was applied to identify a subset of transcripts that had a significant (p < 0.05) differences in frequency values. [0465] A ANOVA was used to compare PBMC expression profiles measured immediately before the first infusion of CCI-779, to PBMC expression profiles measured at eight and sixteen weeks after the first infusion. In the comparisons, a p value < 0.05 was used to indicate statistical significance. Transcripts that were altered, on average, by 2-fold or more between any two time points can be selected.
Example 4. Ex Vivo Assays l
[0466] Drug modulable transcripts in the surrogate tissue, such as peripheral blood, can provide early evidence of drag exposure in vivo. PBMC ex vivo assays that mimic peripheral blood gene expression in disease states as well as the in vivo response to drag treatments can further validate the surrogate markers identified in in vivo studies. In addition, ex vivo assays that overlay with disease-associated biomarkers may provide evidence for markers of drag-dependent disease amelioration.
[0467] Many factors should be considered in designing suitable ex vivo assays.
These factors include, but are not limited to, PBMC seeding density, culture media and other conditions, vehicle control, dose response, time course, and dose schedule (e.g., preincubation, coincubation, or postincubation).
[0468] In one embodiment, phytohemagglutinin (PHA) was used to stimulate gene expression in PBMCs. The genes whose expression can be stimulated by PHA in PBMCs had a substantial overlap with disease-associated transcripts in RCC PBMCs that appear to result from T-cell activation. In some cases, PHA-stimulated PBMCs mimicked anti- CD3/anti-CD28 stimulated T-cell profiles in BioExpress. In addition, CCI-779 inhibited certain PHA-inducible transcripts in a dose-dependent manner. The inhibition by CCI-779 appeared to be specific, rather than global. In one example, CTSL, SCYA2, FABP5, SCYA7, ATP2B1, and IL1R1 genes were directly repressed by 300 nm CCI-779 in PHA- stimulated PBMCs in an ex vivo assay. Databases of disease-associated profiles and ex vivo activation signatures can be created to identity most appropriate culture conditions for identification of drag modulated biomarkers in specific disease settings. [0469] The foregoing description of the present invention provides illustration and description, but is not intended to be exhaustive or to limit the invention to the precise one disclosed. Modifications and variations are possible consistent with the above teachings or may be acquired from practice of the invention. Thus, it is noted that the scope of the invention is defined by the claims and their equivalents.

Claims

What is claimed is:
1. A method comprising comparing an expression profile of at least one gene in a peripheral blood sample of a patient to a reference expression profile of said at least one gene, wherein said at least one gene is differentially expressed in peripheral blood mononuclear cells (PBMCs) of patients who have a non-blood disease and are subject to a drag therapy as compared to PBMCs isolated from said patients before said drag therapy, and wherein the patient has the non-blood disease and is being treated by said drag therapy.
2. The method according to claim 1, wherein said drag therapy is a CCI-779 therapy.
3. The method according to claim 2, wherein the non-blood disease is a solid tumor.
4. The method according to claim 3, wherein the solid tumor is RCC.
5. The method according to claim 2, wherein said at least one gene includes one or more genes selected from Table 5.
6. The method according to claim 2, wherein said at least one gene includes at least two genes selected from Table 5.
7. The method according to claim 1, wherein the peripheral blood sample is a whole blood sample.
8. The method according to claim 1, wherein the peripheral blood sample comprises enriched PBMCs.
9. The method according to claim 1, wherein the expression profile is determined by RT-PCR or immunoassays.
10. The method according to claim 1, wherein the reference expression profile is an average expression profile of said at least one gene in peripheral blood samples isolated from said patients before said drag therapy.
11. The method according to claim 1 , wherein the reference expression profile is an expression profile of said at least one gene in a reference peripheral blood sample isolated from the patient before said drag therapy.
12. The method according to claim 1, wherein said at least one gene includes one or more genes which are over-expressed or under-expressed in PBMCs of patients who have the non-blood disease as compared to PBMCs of humans who do not have the non-blood disease, and wherein said drug therapy is capable of down-regulating or up-regulating expression of said one or more genes in PBMCs of patients who have the non-blood disease.
13. The method according to claim 1, wherein said at least one gene includes one or more genes whose expression in PBMCs is capable of being increased or reduced by a phytohemagglutinin treatment, and wherein said drag therapy is capable of down-regulating or up-regulating expression of said one or more genes in phytohemagglutinin-treated PBMCs.
14. The method according to claim 1, wherein RNA transcripts of said at least one gene are capable of hybridizing under stringent or nucleic acid array hybridization conditions to one or more qualifiers selected from the Qualifier Table.
15. The method according to claim 14, wherein said drug therapy is a CCI-779 therapy.
16. The method according to claim 15, wherein the non-blood disease is RCC.
17. A method for identifying drag activity genes, comprising: detecting an expression profile of genes in peripheral blood samples of patients who have a non-blood disease and are subject to a drag therapy; and comparing said expression profile to a baseline expression profile of said genes in peripheral blood samples isolated from said patients before said drag therapy so as to identify drug activity genes whose expression levels in peripheral blood samples can be modulated by said drag therapy.
18. A kit comprising a plurality of polynucleotides, wherein each of said polynucleotides is capable of hybridizing under stringent or nucleic acid array hybridization conditions to an RNA transcript, or the complement thereof, of a different respective gene selected from Table 5.
19. A kit comprising a plurality of antibodies, wherein each of said antibodies is capable of binding to a polypeptide encoded by a different respective gene selected from Table 5.
20. A nucleic acid array comprising polynucleotide probes, wherein a substantial portion of all polypeptide probes on the nucleic acid array can hybridize under stringent or nucleic acid array hybridization conditions to RNA transcripts, or the complements thereof, of genes selected from Table 5.
EP04710266A 2003-02-11 2004-02-11 Methods for monitoring drug activities in vivo Withdrawn EP1592811A2 (en)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US44613303P 2003-02-11 2003-02-11
US446133P 2003-02-11
US45978203P 2003-04-03 2003-04-03
US459782P 2003-04-03
US53824604P 2004-01-23 2004-01-23
US538246P 2004-01-23
PCT/US2004/004118 WO2004072265A2 (en) 2003-02-11 2004-02-11 Methods for monitoring drug activities in vivo

Publications (1)

Publication Number Publication Date
EP1592811A2 true EP1592811A2 (en) 2005-11-09

Family

ID=32872756

Family Applications (1)

Application Number Title Priority Date Filing Date
EP04710266A Withdrawn EP1592811A2 (en) 2003-02-11 2004-02-11 Methods for monitoring drug activities in vivo

Country Status (4)

Country Link
EP (1) EP1592811A2 (en)
AU (1) AU2004210986A1 (en)
CA (1) CA2515677A1 (en)
WO (1) WO2004072265A2 (en)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006006948A2 (en) * 2002-11-14 2006-01-19 Dharmacon, Inc. METHODS AND COMPOSITIONS FOR SELECTING siRNA OF IMPROVED FUNCTIONALITY
AU2003298689A1 (en) 2002-11-21 2004-06-18 Wyeth Methods for diagnosing rcc and other solid tumors
US20050287532A9 (en) * 2003-02-11 2005-12-29 Burczynski Michael E Methods for monitoring drug activities in vivo
US7643943B2 (en) 2003-02-11 2010-01-05 Wyeth Llc Methods for monitoring drug activities in vivo
US20060194211A1 (en) * 2003-04-29 2006-08-31 Burczynski Michael E Methods for prognosis and treatment of solid tumors
WO2006097388A2 (en) * 2005-02-08 2006-09-21 Innogenetics N.V. Nucleic acid variants in the c1qr1 gene associated with altered innate immunity.
ZA200800901B (en) 2005-07-14 2010-05-26 Takeda San Diego Inc Histone deacetylase inhibitors
AU2013206613B2 (en) * 2005-09-12 2017-03-02 Biontech Ag Identification of tumor-associated antigens for diagnosis and therapy
EP1762575A1 (en) * 2005-09-12 2007-03-14 Ganymed Pharmaceuticals AG Identification of tumor-associated antigens for diagnosis and therapy
CA2670019A1 (en) * 2006-11-16 2009-03-26 Gentron, Llc Methods, systems, and compositions for cancer diagnosis
KR101827015B1 (en) 2009-08-11 2018-02-07 큐알엔에이, 인크. Treatment of adiponectin (adipoq) related diseases by inhibition of natural antisense transcript to an adiponectin (adipoq)
US8785634B2 (en) 2010-04-26 2014-07-22 Merck Sharp & Dohme Corp Spiropiperidine prolylcarboxypeptidase inhibitors
WO2011137012A1 (en) 2010-04-27 2011-11-03 Merck Sharp & Dohme Corp. Prolylcarboxypeptidase inhibitors
WO2011143057A1 (en) 2010-05-11 2011-11-17 Merck Sharp & Dohme Corp. Novel prolylcarboxypeptidase inhibitors
US8669252B2 (en) 2010-05-17 2014-03-11 Merck Sharp & Dohme Corp. Prolylcarboxypeptidase inhibitors
WO2011156220A1 (en) 2010-06-08 2011-12-15 Merck Sharp & Dohme Corp. Novel prolylcarboxypeptidase inhibitors
EP2579873A4 (en) 2010-06-11 2013-11-27 Merck Sharp & Dohme Novel prolylcarboxypeptidase inhibitors
EP3710032A4 (en) * 2017-11-15 2021-12-15 The Translational Genomics Research Institute Biomarker proxy tests and methods for standard blood chemistry tests
CN112852964B (en) * 2021-03-08 2022-02-11 镇江市第一人民医院 Circular RNA hsa _ circ _0059707, specific amplification primer thereof and application

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002061144A2 (en) * 2001-01-31 2002-08-08 Whitehead Institute For Biomedical Research Brain tumor diagnosis and outcome prediction

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030104371A1 (en) * 1997-09-24 2003-06-05 Strom Terry B. Methods of evaluating transplant rejection
WO2000040749A2 (en) * 1999-01-06 2000-07-13 Genenews Inc. Method for the detection of gene transcripts in blood and uses thereof
TWI286074B (en) * 2000-11-15 2007-09-01 Wyeth Corp Pharmaceutical composition containing CCI-779 as an antineoplastic agent
AU2003298689A1 (en) * 2002-11-21 2004-06-18 Wyeth Methods for diagnosing rcc and other solid tumors

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002061144A2 (en) * 2001-01-31 2002-08-08 Whitehead Institute For Biomedical Research Brain tumor diagnosis and outcome prediction

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
IDEGUCHI H. ET AL.: "Structural and functional characterization of the USP11 deubiquitinating enzyme, which interacts with the RanGTP-associated protein RanBPM", THE BIOCHEMICAL JOURNAL, vol. 367, 2002, pages 87 - 95 *

Also Published As

Publication number Publication date
AU2004210986A1 (en) 2004-08-26
WO2004072265A2 (en) 2004-08-26
CA2515677A1 (en) 2004-08-26
WO2004072265A3 (en) 2005-03-03

Similar Documents

Publication Publication Date Title
WO2004072265A2 (en) Methods for monitoring drug activities in vivo
EP1824997B1 (en) Genetic alteration useful for the response prediction of malignant neoplasia to taxane-based medical treatment
US11591655B2 (en) Diagnostic transcriptomic biomarkers in inflammatory cardiomyopathies
JP4938672B2 (en) Methods, systems, and arrays for classifying cancer, predicting prognosis, and diagnosing based on association between p53 status and gene expression profile
US8017321B2 (en) Gefitinib sensitivity-related gene expression and products and methods related thereto
US20040018513A1 (en) Classification and prognosis prediction of acute lymphoblastic leukemia by gene expression profiling
US20120245235A1 (en) Classification of cancers
US20080226645A1 (en) Methods and compositions for assessment and treatment of asthma
CA2524173A1 (en) Methods for diagnosing aml and mds differential gene expression
AU2003298689A1 (en) Methods for diagnosing rcc and other solid tumors
US20110166028A1 (en) Methods for predicting treatment response based on the expression profiles of biomarker genes in notch mediated cancers
KR20130137054A (en) Urine markers for detection of bladder cancer
US20040029151A1 (en) Molecular genetic profiling of gleason grades 3 and 4/5 prostate cancer
US20060240441A1 (en) Gene expression profiles and methods of use
AU2004298604A1 (en) Molecular signature of the PTEN tumor suppressor
US20090004173A1 (en) Diagnosis and Treatment of Drug Resistant Leukemia
US20040191819A1 (en) Expression profiles for breast cancer and methods of use
US20080274906A1 (en) Novel Predictors of Transplant Rejection Determined by Peripheral Blood Gene-Expression Profiling
WO2012022634A1 (en) Classification, diagnosis and prognosis of multiple myeloma
US8568974B2 (en) Identification of novel subgroups of high-risk pediatric precursor B acute lymphoblastic leukemia, outcome correlations and diagnostic and therapeutic methods related to same
AU2005316291A1 (en) Methods for assessing patients with acute myeloid leukemia
US20050272052A1 (en) Molecular genetic profiling of gleason grades 3 and 4/5 prostate cancer
US20070134690A1 (en) Diagnosis of systemic onset juvenile idiopathic arthritis through blood leukocyte microarray analysis
AU2004256182A1 (en) Genes regulated in ovarian cancer as prognostic and therapeutic targets
US20110294686A1 (en) Egfr inhibitor therapy responsiveness

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20050804

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL LT LV MK

RIN1 Information on inventor provided before grant (corrected)

Inventor name: TREPICCHIO, WILLIAM, L.

Inventor name: DORNER, ANDREW, J.

Inventor name: TWINE, NATALIE

Inventor name: BURCZYNSKI, MICHAEL

DAX Request for extension of the european patent (deleted)
17Q First examination report despatched

Effective date: 20070824

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: WYETH LLC

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20110901