WO1999049084A1 - Methods and compositions for diagnosing and predicting the behavior of cancer - Google Patents

Methods and compositions for diagnosing and predicting the behavior of cancer Download PDF

Info

Publication number
WO1999049084A1
WO1999049084A1 PCT/US1999/006679 US9906679W WO9949084A1 WO 1999049084 A1 WO1999049084 A1 WO 1999049084A1 US 9906679 W US9906679 W US 9906679W WO 9949084 A1 WO9949084 A1 WO 9949084A1
Authority
WO
WIPO (PCT)
Prior art keywords
sixl
sample
polypeptide
mrna
tumor
Prior art date
Application number
PCT/US1999/006679
Other languages
French (fr)
Inventor
Arthur B. Pardee
Heide L. Ford
Original Assignee
Dana-Farber Cancer Institute, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dana-Farber Cancer Institute, Inc. filed Critical Dana-Farber Cancer Institute, Inc.
Priority to US09/647,115 priority Critical patent/US7153700B1/en
Publication of WO1999049084A1 publication Critical patent/WO1999049084A1/en

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/574Immunoassay; Biospecific binding assay; Materials therefor for cancer
    • G01N33/57484Immunoassay; Biospecific binding assay; Materials therefor for cancer involving compounds serving as markers for tumor, cancer, neoplasia, e.g. cellular determinants, receptors, heat shock/stress proteins, A-protein, oligosaccharides, metabolites
    • G01N33/57488Immunoassay; Biospecific binding assay; Materials therefor for cancer involving compounds serving as markers for tumor, cancer, neoplasia, e.g. cellular determinants, receptors, heat shock/stress proteins, A-protein, oligosaccharides, metabolites involving compounds identifable in body fluids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/574Immunoassay; Biospecific binding assay; Materials therefor for cancer
    • G01N33/57407Specifically defined cancers
    • G01N33/57415Specifically defined cancers of breast
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/112Disease subtyping, staging or classification

Definitions

  • cancer is known to be one of the leading causes of mortality and morbidity among men and women.
  • breast cancer is believed to be the leading cause of death among women (Harris, et al. (1992) New Engl. J. Med. 327: 319- 28; Harris, et al. (1992) New Engl. J. Med. 327: 390-8; Harris, et al. (1992) New Engl. J. Med. 327: 473-80; and McGuire and Clark (1992) New Engl J. Med. 326: 1756-61).
  • the development of cancer is accompanied by a number of genetic changes (For review see Porter- Jordan, (1994) Hematol Oncol. Clin. N. Am. 8:73).
  • Such changes include gross chromosomal alterations as well as loss of genetic markers (Devilee et al. (1994) Biochim. Biophys. Ada 1198:113 and Callahan et al. (1993) J. Cell Biochem. Si ⁇ pl 17:167).
  • the progression of breast neoplasia has also been shown to result in qualitative and quantitative changes in expression of previously identified genes that encode growth factors and their receptors (Zajchowski et al. (1988) Cancer Res. 48:7041), structural proteins (Trask et ⁇ /.(1990) Proc. Natl. Acad. Sci. 87:2319), second messenger proteins (Ohuchi et ⁇ /.(1986) Cancer Res.
  • the present invention relates to methods for diagnosing cancer, for example, breast, colon, lung, or cervical cancer, in a subject in which the presence of human SIXl (HSIXl) homeobox gene sequences bears a positive correlation to the - 2 -
  • the present invention is based, at least in part, on the demonstration of an aberrant expression of the HSIXl homeobox gene in primary breast cancers and metastatic lesions and in cells isolated from lung, colorectal, and cervical tumors, as well as from subjects having chronic myelogenous leukemia.
  • the present invention further relates to compositions of molecular probes which can be utilized in such diagnostic methods.
  • one aspect of the invention pertains to methods for detecting the presence of SIXl in a biological sample.
  • the method involves contacting a biological sample (e.g., a tissue or tumor sample or isolate of such a sample) with an agent capable of detecting SIXl protein or nucleic acid (e.g., mRNA or cDNA) molecule such that the presence of SIXl is detected in the biological sample.
  • a biological sample e.g., a tissue or tumor sample or isolate of such a sample
  • an agent capable of detecting SIXl protein or nucleic acid (e.g., mRNA or cDNA) molecule such that the presence of SIXl is detected in the biological sample.
  • the agent can be, for example, a labeled or labelable nucleic acid probe capable of hybridizing to a SIXl nucleic acid molecule or a labeled or labelable antibody capable of binding to SIXl protein.
  • Another aspect of the invention features a method of determining the metastatic potential of a tumor which involves contacting a sample of the tumor (or isolate) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the tumor sample or isolate, thereby determining the metastatic potential of the tumor.
  • Yet another aspect of the invention features a prognostic method for determining whether a subject is at risk for developing cancer which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby determining whether the subject is at risk for developing cancer.
  • Another aspect of the invention features a method for diagnosis of a tumor which involves contacting a tumor sample (or isolate) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing the tumor.
  • Yet another aspect of the invention features a method of diagnosing cancer in a subject which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing cancer in the subject.
  • Kits for detecting SIXl in a biological sample are also within the scope of the invention.
  • Figure 1 is the complete cDNA sequence and deduced amino acid sequence of human SIXl (SEQ ID NOs: 1 and 2, respectively).
  • Figure 2A-C Figure 2A depicts 3 H-thymidine incorporation following release from mimosine arrest showing progression of 21PT cells through S-phase.
  • Figure 2B is a photograph showing a section of a differential display gel demonstrating the differential expression of 6A (subsequently identified as HSIXl) in S-phase.
  • Figure 2C is a photograph of a Northern blot confirming the differential expression of HSIXl throughout S-phase of 21PT cells. RNA was isolated from cells following release from mimosine arrest and Northern blot analysis was performed with the HSIXl cDNA probe. Bottom panel shows EtBr staining as a loading control.
  • Figure 3 is a quantitative representation of a Northern blot analysis of 3 control tissues (normal adjacent breast, normal luminal cells, and normal myoepithelial cells- lanes 1,2, and 3 respectively) as well as on 25 primary breast tumor biopsies (lanes 4-28) and 10 metastatic lesions (lanes 29-38).
  • the blot was stripped and reprobed with 36B4 (Hatano et al. (1991) Science 253 J9-82) for normalization and relative HSIXl expression was plotted.
  • a 3 -fold increase over normal adjacent breast was considered positive for HSIXl and is marked by a dashed line.
  • Figure 4 depicts FACS analysis of HSIXl overexpressors which become polyploid over several months in culture.
  • His/lac7 control indicates cells transfected approximately 6 months prior to FACS analysis shown.
  • 21PT parent indicates cells transfected approximately 4.5 months prior to FACS analysis shown.
  • SIXFL4" and SIXFL6 indicates cells transfected approximately 4 and 6 months prior to FACS analysis shown, respectively.
  • FIG. 5 demonstrates that HSIXl overexpression abrogates the G2 cell cycle checkpoint.
  • the graph shows a summary of the percentage of either HSIXl - transfectanted cells or control cells (CAT transfectants) in G2 at various times after X- - 4 -
  • Figure 6 is a Western Blot demonstrating the immunoreactivity of an anti-HSIX antibody with SIX protein from transfected cells. Lanes marked “T” indicate lysates of MCF7 cells transfected with HSIXl . Lanes marked “M” indicate lysates from mock transfected MCF7 cells. Antibody dilutions are indicated below respective pairs of lanes.
  • HSIXl human SIXl gene
  • HSIXl a known homoebox gene
  • HSIXl refers to a gene obtained from human adult skeletal muscle (Boucher et al. (1996) Genomics 33:140-142) whose mouse counterpart has been implicated in the development of limb tendons (Oliver et al. (1995) Development 121 :693-705).
  • HSIXl is a member of a family of genes termed homeobox genes.
  • homeobox genes include genes which encode a family of proteins, termed homeodomain-containing proteins, which act as transcription factors that regulate the coordinated expression of genes involved in both development and differentiation.
  • Homeobox genes were identified initially in Drosophila, where they were found to be important in the control of sequence identity (Lewis (1978) Nature 276;565-570).
  • Homeobox genes contain a common 183-nt sequence encoding a 61-aa domain that is responsible for DNA binding (McGinnis and Krumlauff (1992) Cell 68:283-302). They are postulated to act as a network of transcriptional regulators effecting cell-cell communication during normal development, alterations of which may contribute to the neoplastic phenotype.
  • homeobox genes including members of the Hox and Pax families have been identified as oncogenic transcription factors (Lawrence et al. (1996) Stem Cell 14:281-291 and Stuart and Gruss (1995) Human Mol. Genet. 4:1717-1720).
  • Homeobox genes are often translocated to produce a chimeric protein with a new function, particularly in leukemias (Cillo (1994) Invasion Metastasis 14:38-49).
  • others retain their wild type function and are overexpressed (Lawrence et al, Cillo, and Stuart and Gruss, supra).
  • leukemias recent - 5 -
  • the present invention is further based, at least in part on the identification of HSIXl cDNA by its differential expression in cell cycle synchronized 21PT mammary adenocarcinoma cell line (a cell line derived from a patient who had an infiltrating and intraductal mammary adenocarcinoma) using the differential display method.
  • Direct sequencing of one differentially-expressed cDNA revealed its identity as HSIXl .
  • Further analysis revealed that HSIXl mRNA expression was very low in the first half of S phase and increased as 21PT cells are completing S phase.
  • HSIXl expression was also detected in other cell lines derived from the same patient including 2 INT, 21MT-1 and 21MT-2.
  • the 21PT and 2 INT cell lines were derived from a primary tumor, whereas the 21MT-1 and 21MT-2 cells lines were established from a metastatic pleural effusion.
  • HSIXl expression was not detected in a normal breast cell line, 70N (Band and Sager (1989) Proc. Natl. Acad. Sci. USA 86:1249-1253).
  • the invention is further based on the discovery that the HSIXl homeobox protein functions in a cell cycle-regulated manner and acts to abrogate G2 cell cycle arrest.
  • cells which overexpress HSIXl progress through X-ray irradiation- induced G2 arrest at a more rapid rate than control cells (e.g., normal cells).
  • continued passaging of 21PT cells which constitutivly overexpress HSIXl leads to ploidy changes over extended periods of time.
  • HSIXl The molecular weight of HSIXl appears to be unchanged in 21PT cells as compared to normal cells, suggesting that no gross genetic alterations exist. However, a translocation may occur upstream of the transcription start site resulting in aberrant expression, or point mutations or small deletions/insertions may exist in the gene.
  • overexpression of wild type HSIXl mRNA may contribute to the tumorigenic phenotype, consistent with a model proposed by Sager et al, which hypothesizes that tumorigenesis is not only the result of genetic mutations, but also of overexpression of wild type genes.
  • direct sequencing of HSIXl from 21PT cells resulted in wild type HSIX DNA sequence. The fact that SIXl overexpression correlates with both the tumorigenic phenotype as well as with abrogation of the G2 cell - 6 -
  • cycle check point indicates its utility as a growth-related or aberrant growth marker or marker of the tumorigenic phenotype.
  • the present invention features a method for detecting the presence of SIXl in a biological sample (e.g., a tumor sample) involving contacting a biological sample with an agent (e.g., a nucleic acid probe or antibody) capable of detecting SIXl protein or nucleic acid (e.g. , mRNA or cDNA) such that the presence of SIXl is detected in the biological sample.
  • an agent e.g., a nucleic acid probe or antibody
  • SIXl protein or nucleic acid e.g. , mRNA or cDNA
  • a "biological sample” refers to a sample of biological material obtained from a subject, preferably a human subject, or present within a subject, preferably a human subject, including a tissue, tissue sample, or cell sample (e.g., a tissue biopsy, for example, an aspiration biopsy, a brush biopsy, a surface biopsy, a needle biopsy, a punch biopsy, an excision biopsy, an open biobsy, an incision biopsy or an endoscopic biopsy), tumor, tumor sample, or biological fluid (e.g., blood, serum, lymph, spinal fluid).
  • tissue biopsy for example, an aspiration biopsy, a brush biopsy, a surface biopsy, a needle biopsy, a punch biopsy, an excision biopsy, an open biobsy, an incision biopsy or an endoscopic biopsy
  • tumor sample e.g., blood, serum, lymph, spinal fluid.
  • tissue sample refers to a portion, piece, part, segment, or fraction of a tissue which is obtained or removed from an intact tissue of a subject, preferably a human subject.
  • tissue samples can be obtained from the pancreas, stomach, liver, secretory gland, bladder, lung, skin, prostate gland, breast ovary, cervix, uterus, brain, eye, connective tissue, bone, muscles or vasculature.
  • the biological sample is a breast tissue sample.
  • the biological sample is a tissue sample, provided that it is not a breast tissue sample.
  • the biological sample is a tumor sample (e.g., a tumor biopsy).
  • a tumor sample refers to a portion, piece, part, segment, or fraction of a tumor, for example, a tumor which is obtained or removed from a subject (e.g., removed or extracted from a tissue of a subject), preferably a human subject.
  • a tumor sample can be obtained, for example, from a lung carcinoma, a colon carcinoma, a cervical carcinoma, an adenocarcinoma, a melanoma, a leukemia, a lymphoma, a glioma, a neuroblastoma, a retinoblastoma, and a sarcoma.
  • the tumor sample is obtained from a breast tumor (e.g., a breast tumor sample).
  • the tumor sample is obtained from a tumor, provided that the tumor is not - 7 -
  • the tumor sample is obtained from a primary tumor (e.g., is a primary tumor sample).
  • the biological sample is obtained metastatic lesion (e.g., is a metastatic lesion sample).
  • a "primary tumor” is a tumor appearing at a first site within the subject and can be distinguished from a “metastatic tumor” which appears in the body of the subject at a remote site from the primary tumor.
  • a “metastatic tumor” is a tumor resulting from the dissemination of cells from a primary tumor by the lymphatics or blood vessels or by direct extension through serum- contaning or serum-producing cavities or other spaces.
  • the present invention also encompasses the use of isolates of a biological sample in the methods of the invention.
  • an "isolate" of a biological sample refers to a material or composition (e.g., a biological material or composition) which has been separated, derived, extracted, purified or isolated from the sample and preferably is substantially free of undesireable compositions and/or impurities or contaminants associated with the biological sample.
  • Preferred isolates include, but are not limited to, DNA (e.g., cDNA or genomic DNA), RNA (e.g., mRNA), and protein (i.e., purified protein, protein extracts, polypeptides). Additional preferred isolates include cells as well as biological fluids (e.g., blood, serum, lymph, spinal fluid).
  • an "agent” refers to a substance which is cabable of identifying or detecting SIX in a biological sample (e.g., identifies or detects SIX mRNA, SIX DNA, SIX protein, SIX activity).
  • the agent is a labeled or labelable antibody which specifically binds to SIXl polypeptide.
  • labeled or labelable refers to the attaching or including of a label (e.g., a marker or indicator) or ability to attach or include include a label (e.g., a marker or indicator).
  • Markers or indicators include, but are not limited to, for example, radioactive molecules, colorimetric molecules, and enzymatic molecules which produce detectable changes in a substrate.
  • the agent is an antibody which specifically binds to all or a portion of a SIX protein (e.g., hSIXl).
  • SIX protein e.g., hSIXl
  • the agent is an antibody which specifically binds to all or a portion of HSIXl protein.
  • the agent is an antibody which specifically binds to all or a portion of a polypeptide selected from the group consisting of a polypeptide having the amino acid sequence of SEQ ID NO:2, a polypeptide comprising at least amino acids 183-284 of SEQ ID NO:2; and a polypeptide consisting of amino acids 183-284 of SEQ ID NO:2.
  • the antibody is a polyclonal antibody.
  • the agent is a labeled or labelable nucleic acid probe capable of hybridizing to SIXl mRNA.
  • the agent can be an oligonucleotide primer for the polymerase chain reaction which flank or lie within the nucleotide sequence encoding human SIXl .
  • the biological sample being tested is an isolate, for example, RNA.
  • the isolate e.g., the RNA
  • the isolate is subjected to an amplification process which results in amplification of SIXl nucleic acid.
  • an "amplification process" is designed to strengthen, increase, or augment a molecule within the isolate.
  • an amplification process such as RT-PCR can be utilized to amplify the mRNA, such that a signal is detectable or detection is enhanced.
  • amplification process is beneficial particularly when the biological, tissue, or tumor sample is of a small size or volume.
  • the present invention is also based in part on the discovery that HSIX is expressed in approximately one-half of primary breast cancers and nine-tenths of metastatic breast cancer lesions. While normal adjacent breast, normal breast luminal cells, and normal breast myoepithelial cells demonstrated almost no HSIXl expression, 44% of primary tumors and 90% of the metastatic lesions had elevated levels of HSIXl mRNA. HSIXl overexpression was likewise found in samples of lung tumors, when compared to adjacent normal lung tissue samples. Moreover, smaller scale analysis of several different tumor cell lines suggest that HSIXl may be expressed in a wide variety of tumors in addition to breast and lung. - 9 -
  • the invention further features diagnostic and prognostic methods useful in the detection and treatment of cancer, preferably breast cancer, described in detail herein.
  • the invention further involves kits useful in the detection and treatment of cancer, described in detail herein.
  • the invention features a method of determining the metastatic potential of a tumor which involves contacting a sample of the tumor (or isolate) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the tumor sample or isolate, thereby determining the metastatic potential of the tumor.
  • Another aspect of the invention features a prognostic method for determining whether a subject is at risk for developing cancer which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby determining whether the subject is at risk for developing cancer.
  • a subject "at risk for developing cancer” includes a subject which has been determined to have a higher probability of developing cancer when compared to an average representative of the population.
  • a subject's "risk of developing cancer” can be based on an analysis of empirical criteria or on a persons pedigree.
  • Yet another aspect of the invention features a method for diagnosis of a tumor which involved contacting a tumor sample (or isolate) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing the tumor.
  • Yet another aspect of the invention features a method of diagnosing cancer in a subject which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing cancer in the subject.
  • the diagnostic methods of the present invention further involve determining the level of SIXl polypeptide or mRNA in the sample or isolate.
  • determining the level includes measuring an amount (e.g., making a quantitative determination) or making a qualitative determination (e.g., a - 10 -
  • the diagnostic methods of the present invention involve comparing the level of SIXl polypeptide or mRNA in the sample or isolate with the level of SIXl polypeptide or mRNA in a control sample.
  • the phrase "comparing the level” includes evaluating, balancing or contrasting the amount or presence of, for example, SIX protein or nucleic acid in a first sample (e.g., a test sample) with the amount or presence of SIX protein or nucleic acid in a second sample (e.g.. a control sample).
  • the diagnostic or prognostic methods further includes the step of forming a prognosis or forming a diagnosis.
  • kits for detecting the presence of SIXl in a biological sample including a labeled or labelable agent capable of detecting SIXl polypeptide or mRNA in a biological sample.
  • the kit further includes a means for determining the amount of SIXl in the sample.
  • the agent of the kit is an antibody capable of specifically binding to SIXl polypeptide.
  • the agent of the kit is a nucleic acid probe capable of hybridizing to SIXl mRNA.
  • the kit further includes a means for comparing the amount of SIXl in the sample with a standard.
  • the kit further includes directions for use.
  • nucleic acid molecules that encode SIXl or biologically active portions thereof, as well as nucleic acid fragments sufficient for use as hybridization probes to identify SIXl -encoding nucleic acid (e.g., SIXl mRNA).
  • nucleic acid molecule is intended to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA).
  • the nucleic acid molecule may be single-stranded or double-stranded, but preferably is double-stranded DNA.
  • An "isolated" nucleic acid molecule is free of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' ends of the - 11 -
  • the isolated SIXl nucleic acid molecule may contain less than about 5 kb, 4kb, 3kb, 2kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived (e.g., a human mammary adenocarcinoma cell).
  • an "isolated" nucleic acid molecule such as a cDNA molecule, may be free of other cellular material.
  • an isolated nucleic acid molecule of the invention comprises the nucleotide sequence shown in SEQ ID NO: 1.
  • the sequence of SEQ ID NO: 1 corresponds to the human SIXl cDNA.
  • This cDNA comprises sequences encoding the SIXl protein (i.e., "the coding region", from nucleotides 276 to 1130), as well as 5' untranslated sequences (nucleotides 1 to 275) and 3' untranslated sequences (nucleotides 1131 to 1378).
  • the nucleic acid molecule may comprise only the coding region of SEQ ID NO: 1 (e.g., nucleotides 276 to 1130).
  • the nucleic acid molecule of the invention can comprise only a portion of the coding region of SEQ ID NO: 1, for example a fragment encoding a biologically active portion of SIXl .
  • biologically active portion of SIXl is intended to include portions of SIXl that retain the ability to enhance cell cycle progression, abrogate the G2 cell cycle checkpoint (e.g., accelerate the progression of cells through G2), or promote aberrant growth (e.g., tumorigenesis).
  • portions of SIXl to inhibit cell cycle progression can be determined in standard cell cycle progression assays, for example using 3H-thymidine as an indicator of progression through S phase, propidium iodide staining and FACS analysis as an indicator of cells in G2/M phase of the cell cycle (described further below and in Examples 1 and 3).
  • Nucleic acid fragments encoding biologically active portions of SIXl can be prepared by isolating a portion of SEQ ID NO: 1, expressing the encoded portion of SIXl protein or peptide (e.g., by recombinant expression in vitro as detailed below in Example 3) and assessing the ability of the encoded fragment to effect cell cycle progression.
  • the invention further encompasses nucleic acid molecules that differ from SEQ ID NO:l (and portions thereof) due to degeneracy of the genetic code and thus encode the same SIXl protein as that encoded by SEQ ID NO: 1. Accordingly, in another - 12 -
  • an isolated nucleic acid molecule of the invention has a nucleotide sequence encoding a protein having an amino acid sequence shown in SEQ ID NO: 2. Moreover, the invention encompasses nucleic acid molecules that encode biologically active portions of SEQ ID NO: 2.
  • a nucleic acid molecule having the nucleotide sequence of SEQ ID NO: 1 , or a portion thereof, can be isolated using standard molecular biology techniques and the sequence information provided herein.
  • a human SIXl cDNA can be isolated from a mammary adenocarcinoma cell line cDNA library using all or portion of SEQ ID NO: 1 as a hybridization probe and standard hybridization techniques (e.g., as described in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A
  • nucleic acid molecule encompassing all or a portion of SEQ ID NO: 1 can be isolated by the polymerase chain reaction using oligonucleotide primers designed based upon the sequence of SEQ ID NO: 1.
  • mRNA can be isolated from mammary adenocarcinoma cells (e.g. , by the guanidinium-thiocyanate extraction procedure of Chirgwin et al.
  • cDNA can be prepared using reverse transcriptase (e.g., Moloney MLV reverse transcriptase, available from Gibco/BRL, Bethesda, MD; or AMV reverse transcriptase, available from Seikagaku America, Inc., St. Russia, FL).
  • reverse transcriptase e.g., Moloney MLV reverse transcriptase, available from Gibco/BRL, Bethesda, MD; or AMV reverse transcriptase, available from Seikagaku America, Inc., St. Russia, FL.
  • Synthetic oligonucleotide primers for PCR amplification can be designed based upon the nucleotide sequence shown in SEQ ID NO: 1.
  • a nucleic acid of the invention can be amplified using cDNA or, alternatively, genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR amplification techniques.
  • the nucleic acid so amplified can be cloned into an appropriate vector and characterized by DNA sequence analysis.
  • oligonucleotides corresponding to SIXl nucleotide sequence can be prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer.
  • DNA sequence polymorphisms that lead to changes in the amino acid sequences of SIXl may exist within a population (e.g. , the human population).
  • Such genetic polymorphism in the SIXl gene may exist among individuals within a population due to natural allelic variation.
  • nucleotide variations can typically result in 1-5 % variance in the nucleotide sequence of the a gene. Any and all such nucleotide variations and resulting amino acid polymorphisms in SIXl that are the result of natural allelic variation and that do not alter the functional activity of SIXl are intended to be within the scope of the invention. Moreover, nucleic acid molecules encoding SIXl proteins from other species, and thus which have a nucleotide sequence which differs from the human sequence of SEQ ID NO: 1, are intended to be within the scope of the invention.
  • Nucleic acid molecules corresponding to natural allelic variants and nonhuman homologues of the human SIXl cDNA of the invention can be isolated based on their homology to the human SIXl nucleic acid disclosed herein using the human cDNA, or a portion thereof, as a hybridization probe according to standard hybridization techniques under stringent hybridization conditions. Accordingly, in another embodiment, an isolated nucleic acid molecule of the invention is at least 15 nucleotides in length and hybridizes under stringent conditions to the nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO: 1. In other embodiment, the nucleic acid is at least 30, 50, 100, 250 or 500 nucleotides in length.
  • hybridizes under stringent conditions is intended to describe conditions for hybridization and washing under which nucleotide sequences at least 60 % homologous to each other typically remain hybridized to each other.
  • the conditions are such that at least sequences at least 65 %, more preferably at least 70 %, and even more preferably at least 75 % homologous to each other typically remain hybridized to each other.
  • stringent conditions are known to those skilled in the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6.
  • a preferred, non-limiting example of stringent hybridization conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2 X SSC, 0.1 % SDS at 50-65°C.
  • an isolated nucleic acid molecule of the invention that hybridizes under stringent conditions to the sequence of SEQ ID NO: 1 corresponds to a naturally-occurring nucleic acid molecule.
  • a "naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a natural protein).
  • the nucleic acid encodes a natural human SIXl . - 14 -
  • allelic variants of the SIXl sequence that may exist in the population, the skilled artisan will further appreciate that changes may be introduced by mutation into the nucleotide sequence of SEQ ID NO: 1, thereby leading to changes in the amino acid sequence of the encoded SIXl protein, without altering the functional ability of the SIXl protein.
  • nucleotide substitutions leading to amino acid substitutions at "non-essential" amino acid residues may be made in the sequence of SEQ ID NO: 1.
  • a "non-essential" amino acid residue is a residue that can be altered from the wild-type sequence of SIXl (e.g., the sequence of SEQ ID NO: 2) without altering the activity of SIXl, whereas an "essential" amino acid residue is required for SIXl activity.
  • Amino acid residues of SIXl that are strongly conserved among, for example, among members of the subfamily of homeobox genes that share a lysine within the DNA binding helix of the homeodomain (e.g., the Drosophila sine oculis (so) gene, the human myotonic dystrophy (DM)-associated homeodomain protein (DMHAP) and its murine homologue (Boucher et al. (1995) Hum. Mol. Genet. 4:1919- 1925), the human SIXl gene and its murine counterpart, and the murine SIX2 gene.
  • SIXl conserved among proteins whose amino acid sequences are aligned for comparison purposes
  • Other amino acid residues may not be essential for SIXl activity and thus are more likely to be amenable to alteration.
  • nucleic acid molecules encoding SIXl proteins that contain changes in amino acid residues that are not essential for SIXl activity , e.g., residues that are not conserved or only semi-conserved among members of the subfamily.
  • SIXl proteins differ in amino acid sequence from SEQ ID NO: 2 yet retain SIXl activity.
  • the isolated nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein comprises an amino acid sequence at least 60 % homologous to the amino acid sequence of SEQ ID NO: 2 and retains SIXl activity.
  • the protein encoded by the nucleic acid molecule is at least 70 % homologous to SEQ ID NO: 2, more preferably at least 80 % homologous to SEQ ID NO: 2, even more preferably at least 90 % - 15 -
  • the sequences are aligned for optimal comparison purposes (e.g., gaps may be introduced in the sequence of one protein for optimal alignment with the other protein).
  • the amino acid residues at corresponding amino acid positions are then compared.
  • a position in one sequence e.g., SEQ ID NO: 2
  • amino acid residues are then occupied by the same amino acid residue as the corresponding position in the other sequence (e.g., a mutant form of SIXl)
  • amino acid "homology” is equivalent to amino acid "identity”
  • the percent homology between the two sequences is a function of the number of identical positions shared by the sequences (/. e. ,
  • % homology # of identical positions/total # of positions x 100).
  • Such an alignment can be performed using any one of a number of computer algorithms designed for such a purpose.
  • a preferred, non-limiting example of a mathematical algorithim utilized for the comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989).
  • Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the GCG sequence alignment software package.
  • ALIGN program version 2.0
  • An isolated nucleic acid molecule encoding a SIXl protein homologous to the protein of SEQ ID NO: 2 can be created by introducing one or more nucleotide substitutions, additions or deletions into the nucleotide sequence of SEQ ID NO: 1 such that one or more amino acid substitutions, additions or deletions are introduced into the encoded protein. Mutations can be introduced into SEQ ID NO: 1 by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at one or more predicted non-essential amino acid residues. A "conservative amino acid substitution" is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art, including basic side chains (e.g., lysine, arginine, histidine), acidic - 16 -
  • side chains e.g., aspartic acid, glutamic acid
  • uncharged polar side chains e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine
  • nonpolar side chains e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan
  • beta-branched side chains e.g., threonine, valine, isoleucine
  • aromatic side chains e.g. , tyrosine, phenylalanine, tryptophan, histidine.
  • a predicted nonessential amino acid residue in SIXl is preferably replaced with another amino acid residue from the same side chain family.
  • mutations can be introduced randomly along all or part of a SIXl coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for SIXl activity to identify mutants that retain SIXl activity.
  • the encoded protein can be expressed recombinantly (e.g., as described in Example 3) and the SIXl activity of the protein can be determined. Suitable assays for testing the activity of portions of SIXl proteins and mutated SIXl proteins are described in detail in Examples 1 and 3.
  • an antisense nucleic acid comprises a nucleotide sequence which is complementary to a "sense" nucleic acid encoding a protein, e.g.. complementary to the coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can hydrogen bond to a sense nucleic acid.
  • the antisense nucleic acid can be complementary to an entire SIXl coding strand, or to only a portion thereof.
  • an antisense nucleic acid molecule is antisense to a "coding region" of the coding strand of a nucleotide sequence encoding SIXl .
  • the term "coding region” refers to the region of the nucleotide sequence comprising codons which are translated into amino acid residues (e.g., the entire coding region of SEQ ID NO: 1 comprises nucleotides 276-1130).
  • the antisense nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence encoding SIXl.
  • noncoding region refers to 5' and 3' sequences which flank the coding region that are not translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions).
  • antisense nucleic acids of the invention can be designed according to the rules of Watson and Crick base pairing.
  • the antisense nucleic acid molecule may be complementary to the entire coding region of SIXl mRNA, but more preferably is an oligonucleotide which is antisense to only a portion of the coding or noncoding region of SIXl mRNA.
  • the antisense oligonucleotide may be complementary to the region surrounding the translation start site of SIXl mRNA.
  • An antisense oligonucleotide can be, for example, about 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length.
  • An antisense nucleic acid of the invention can be constructed using chemical synthesis and enzymatic ligation reactions using procedures known in the art.
  • an antisense nucleic acid e.g., an antisense oligonucleotide
  • an antisense nucleic acid e.g., an antisense oligonucleotide
  • the antisense nucleic acid can be produced biologically using an expression vector into which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described further in the following subsection).
  • an antisense nucleic acid of the invention is a ribozyme.
  • Ribozymes are catalytic RNA molecules with ribonuclease activity which are capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a complementary region.
  • a ribozyme having specificity for a SIXl -encoding nucleic acid can be designed based upon the nucleotide sequence of a SIXl cDNA disclosed herein (i.e., SEQ ID NO: 1).
  • a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the base sequence of the active site is complementary to the base sequence to be cleaved in a SIXl -encoding mRNA.
  • SIXl mRNA can be used to select a catalytic RNA having a specific ribonuclease - 18 -
  • RNA molecules activity from a pool of RNA molecules. See for example Bartel, D. and Szostak, J.W. (1993) Science 261 : 1411-1418.
  • vectors preferably expression vectors, containing a nucleic acid encoding SIXl (or a portion thereof).
  • vector refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
  • plasmid refers to a circular double stranded DNA loop into which additional DNA segments may be ligated.
  • viral vector is another type of vector, wherein additional DNA segments may be ligated into the viral genome.
  • vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors).
  • Other vectors e.g., non-episomal mammalian vectors
  • certain vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are are referred to herein as "expression vectors”.
  • expression vectors of utility in recombinant DNA techniques are often in the form of plasmids.
  • plasmid and “vector” may be used interchangeably as the plasmid is the most commonly used form of vector.
  • the invention is intended to include such other forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions.
  • the recombinant expression vectors of the invention comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory sequences, selected on the basis of the host cells to be used for expression, which is operatively linked to the nucleic acid sequence to be expressed.
  • "operably linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory sequence(s) in a manner which allows for expression - 19 -
  • regulatory sequence is intended to includes promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are described, for example, in Goeddel; Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990). Regulatory sequences include those which direct constitutive expression of a nucleotide sequence in many types of host cell and those which direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences).
  • the expression vectors of the invention can be introduced into host cells to thereby produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., SIXl proteins, mutant forms of SIXl, fusion proteins, etc.).
  • the recombinant expression vectors of the invention can be designed for expression of SIXl in prokaryotic or eukaryotic cells.
  • SIXl can be expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression vectors) yeast cells or mammalian cells.
  • telomeres Suitable host cells are discussed further in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990).
  • the recombinant expression vector may be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.
  • Fusion vectors add a number of amino acids to a protein encoded therein, usually to the amino terminus of the recombinant protein.
  • Such fusion vectors typically serve three purposes: 1) to increase expression of recombinant protein; 2) to increase the solubility of the recombinant protein; and 3) to aid in the purification of the recombinant protein by acting as a ligand in affinity purification.
  • a proteolytic cleavage site is introduced at the junction of the fusion moiety and the recombinant protein to enable separation of the recombinant protein from - 20 - the fusion moiety subsequent to purification of the fusion protein.
  • Such enzymes, and their cognate recognition sequences include Factor Xa, thrombin and enterokinase.
  • Typical fusion expression vectors include pGEX (Pharmacia Biotech Ine; Smith, D.B. and Johnson, K.S.
  • the coding sequence of the mature form of SIXl (i.e., encompassing amino acids 1-284) is cloned into a pGEX expression vector to create a vector encoding a fusion protein comprising, from the N- terminus to the C-terminus, GST-thrombin cleavage site-SIXl.
  • the fusion protein can be purified by affinity chromatography using glutathione-agarose resin. Recombinant SIXl unfused to GST can be recovered by cleavage of the fusion protein with thrombin.
  • Suitable inducible non-fusion E. coli expression vectors include pTrc (Amann et al, (1988) Gene 69:301-315) and pET l id (Studier et al, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 60-89).
  • Target gene expression from the pTrc vector relies on host RNA polymerase transcription from a hybrid trp-lac fusion promoter.
  • Target gene expression from the pET l id vector relies on transcription from a T7 gnlO-lac fusion promoter mediated by a coexpressed viral RNA polymerase (T7 gnl). This viral polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident ⁇ prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 promoter.
  • the SIXl expression vector is a yeast expression vector.
  • yeast expression vectors for expression in yeast S. cerivisae include pYepSecl (Baldari. et al, (1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933- 943), pJRY88 (Schultz et al, (1987) Gene 54:113-123), and pYES2 (Invitrogen Corporation, San Diego, CA).
  • SIXl can be expressed in insect cells using baculovirus expression vectors.
  • Baculovirus vectors available for expression of proteins in cultured insect cells include the pAc series (Smith et al, (1983) Mol. Cell Biol. 3:2156- 2165) and the pVL series (Lucklow, V.A., and Summers, M.D., (1989) Virology 170:31- 39).
  • a nucleic acid of the invention is expressed in mammalian cells using a mammalian expression vector.
  • mammalian expression vectors include pCDM8 (Seed, B., (1987) Nature 329:840) and pMT2PC (Kaufman et al. (1987), EMBO J. 6:187-195).
  • the expression vector's control functions are often provided by viral regulatory elements.
  • commonly used promoters are derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40.
  • Another aspect of the invention pertains to recombinant host cells into which a recombinant expression vector of the invention has been introduced.
  • host cell and “recombinant host cell” are used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
  • a host cell may be any prokaryotic or eukaryotic cell.
  • SIXl protein may be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells).
  • bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells).
  • mammalian cells such as Chinese hamster ovary cells (CHO) or COS cells.
  • Other suitable host cells are known to those skilled in the art. - 22 -
  • Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques.
  • transformation and “transfection” are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation. Suitable methods for transforming or transfecting host cells can be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory press (1989)), and other laboratory manuals.
  • a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host cells along with the gene of interest.
  • selectable markers include those which confer resistance to drugs, such as G418, hygromycin and methotrexate.
  • Nucleic acid encoding a selectable marker may be introduced into a host cell on the same vector as that encoding SIXl or may be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other cells die).
  • a host cell of the invention such as a prokaryotic or eukaryotic host cell in culture, can be used to produce (i.e., express) SIXl protein.
  • the invention further provides methods for producing SIXl protein using the host cells of the invention.
  • the method comprises culturing the host cell of invention (into which a recombinant expression vector encoding SIXl has been introduced) in a suitable medium until SIXl is produced.
  • the method further comprises isolating SIXl from the medium or the host cell.
  • Such an isolated SIXl protein can be used, for example, to raise antibodies to a SIXl protein for use in the diagnostic methods of the present invention (described further below).
  • SIXl proteins and biologically active portions thereof, as well as peptide fragments suitable as immunogens to raise anti-SIXl antibodies.
  • the invention provides an isolated preparation of SIXl, or a biologically active portion thereof.
  • An "isolated" protein is substantially free of cellular material or culture medium when produced by recombinant DNA techniques, or chemical precursors or other chemicals when chemically synthesized.
  • the SIXl protein has an amino acid sequence shown in SEQ ID NO: 2.
  • the SIXl protein is substantially homologous to SEQ ID NO: 2 and retains the functional activity of the protein of SEQ ID NO: 2 yet differs in amino acid sequence due to natural allelic variation or mutagenesis, as described in detail in subsection I above.
  • the SIXl protein is a protein which comprises an amino acid sequence at least 60% homologous to the amino acid sequence of SEQ ID NO: 2 and retains a SIXl activity.
  • the protein is at least 70% homologous to SEQ ID NO: 2, more preferably at least 80%) homologous to SEQ ID NO: 2, even more preferably at least 90%) homologous to SEQ ID NO: 2, and most preferably at least 95%> homologous to SEQ ID NO: 2.
  • An isolated SIXl protein may comprise the entire amino acid sequence of SEQ ID NO: 2 (i.e., amino acids 1-284), a biologically active portion thereof, or an immunogenic portion thereof.
  • an immunogenic portion of SIXl can comprise portion of SIXl in which hydrophobic, and thus predicted to comprise an surface portion of a SIXl protein.
  • An immunogenic portion can also comprise all or a portion of a SIXl protein which is unique to SIXl (e.g., does not share significant homology with other homeobox proteins, thereby reducing the risk of cross-reactivity with non-SIXl proteins).
  • an immunogenic portion of a SIXl protein includes all or a portion of human SIXl (SEQ ID NO:2) from about amino acids 183 to 284.
  • other biologically active and/or immunogenic portions in which other regions of the protein are deleted, can be prepared by recombinant techniques and evaluated for SIXl activity as described in detail above or alternatively, tested for immunogenicity.
  • SIX1 proteins are preferably produced by recombinant DNA techniques.
  • a nucleic acid molecule encoding all or a portion of the protein is cloned into an expression vector (as described above), the expression vector is introduced into a host cell (as described above) and the SIXl protein or portion thereof is expressed in the host cell.
  • the SIXl protein or portion thereof can then be isolated from the cells by an appropriate purification scheme using standard protein purification techniques.
  • a nucleic acid molecule comprising nucleotides 1 to 1130 of SEQ ID NO:l is cloned into an expresion vector.
  • a nucleic acid molecule comprising nucleotides 822 to 1130 is cloned into an expression vector.
  • a SIXl protein or polypeptide can be synthesized chemically using standard peptide synthesis techniques.
  • native SIXl protein can be isolated from cells (e.g., cultured human mammary adenocarcinoma cells), for example using an anti-SIXl antibody (discussed further below).
  • SIXl fusion protein comprises a SIXl polypeptide operatively linked to a non-SIXl polypeptide.
  • SIXl polypeptide refers to a polypeptide having an amino acid sequence corresponding to SIXl
  • non-SIXl polypeptide refers to a polypeptide having an amino acid sequence corresponding to another protein.
  • operatively linked is intended to indicate that the SIXl polypeptide and the non-SIXl polypeptide are fused in-frame to each other.
  • the non- SIXl polypeptide may be fused to the N-terminus or C-terminus of the SIXl polypeptide.
  • a non-SIXl polypeptide e.g., GST
  • the C-terminus of the SIXl polypeptide e.g., amino acids 183 to 284 of SEQ IDNO:2.
  • Such fusion proteins can facilitate the purification of recombinant SIXl (see, for example, the fusion proteins described in Example 5).
  • the fusion protein is a SIXl protein containing a heterologous signal sequence at its N- terminus. In certain host cells (e.g., mammalian host cells), expression and/or secretion of SIXl may be increased through use of a heterologous signal sequence.
  • a SIXl fusion protein of the invention is produced by standard recombinant DNA techniques.
  • DNA fragments coding for the different polypeptide sequences are ligated together in-frame in accordance with conventional - 25 -
  • a DNA fragment encoding a non-SIXl polypeptide (e.g., GST) is ligated in frame with a DNA fragment encoding a portion of SIXl (e.g., including all or a portion of SEQ ID NO:2, for example, amino acids 183 to 284 of SEQ IDNO:2).
  • the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers.
  • PCR amplification of gene fragments can be carried out using anchor primers which give rise to complementary overhangs between two consecutive gene fragments which can subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for example, Current Protocols in Molecular Biology, eds. Ausubel et al. John Wiley & Sons: 1992).
  • anchor primers which give rise to complementary overhangs between two consecutive gene fragments which can subsequently be annealed and reamplified to generate a chimeric gene sequence
  • many expression vectors are commercially available that already encode a fusion moiety (e.g., a GST polypeptide).
  • a SIXl -encoding nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked in-frame to the SIXl protein.
  • SIXl protein or fragment thereof, can be used as an immunogen to generate antibodies that bind SIXl using standard techniques for polyclonal and monoclonal antibody preparation.
  • the full-length SIXl protein can be used or, alternatively, the invention provides antigenic peptide fragments of SIXl for use as immunogens.
  • the antigenic peptide of SIXl comprises at least 8 amino acid residues of the amino acid sequence shown in SEQ ID NO: 2 and encompasses an epitope of SIXl such that an antibody raised against the peptide forms a specific immune complex with SIXl .
  • the antigenic peptide comprises at least 10 amino acid residues, more preferably at least 15 amino acid residues, even more preferably at least 20 amino acid residues, and most preferably at least 30 amino acid residues.
  • Antigenic polypeptides comprising at least 50, 100, 150, 200 or 250 amino acid residues are also within the scope of the present invention.
  • an antigenic polypeptide which includes 102 amino acids of SIXl e.g., amino acids 183 to 284 of SEQ ID NO:2
  • Preferred epitopes encompassed by the antigenic peptide are regions of SIXl that are located on the surface of the protein, e.g., hydrophilic - 26 -
  • SIXl immunogen typically is used to prepare antibodies by immunizing a suitable subject, (e.g., rabbit, goat, mouse or other mammal) with the immunogen.
  • An appropriate immunogenic preparation can contain, for examples, recombinantly expressed SIXl protein or a chemically synthesized SIXl peptide.
  • the preparation can further include an adjuvant, such as Freund's complete or incomplete adjuvant, or similar immunostimulatory agent. Immunization of a suitable subject with an immunogenic SIXl preparation induces a polyclonal anti-SIXl antibody response.
  • the immunogen can further include a portion of non-SIXl polypeptide, for example, a polypeptide useful to facilitate purification.
  • antibody refers to immunoglobulin molecules and immunologically active portions of immunoglobulin molecules, i.e., molecules that contain an antigen binding site which specifically binds (immunoreacts with) an antigen, such as SIXl .
  • the invention provides polyclonal and monoclonal antibodies that bind SIXl.
  • monoclonal antibody or “monoclonal antibody composition”, as used herein, refers to a population of antibody molecules that contain only one species of an antigen binding site capable of immunoreacting with a particular epitope of SIXl. A monoclonal antibody composition thus typically displays a single binding affinity for a particular SIXl protein with which it immunoreacts.
  • Polyclonal anti-SIXl antibodies can be prepared as described above by immunizing a suitable subject with a SIXl immunogen.
  • the anti-SIXl antibody titer in the immunized subject can be monitored over time by standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) using immobilized SIXl.
  • ELISA enzyme linked immunosorbent assay
  • the antibody molecules directed against SIXl can be isolated from the mammal (e.g., from the blood) and further purified by well known techniques, such as protein A chromatography to obtain the IgG fraction.
  • antibody-producing cells can be obtained from the subject and used to prepare monoclonal antibodies by standard techniques, such as the hybridoma technique originally described by Kohler and - 27 -
  • an immortal cell line (typically a myeloma) is fused to lymphocytes (typically splenocytes) from a mammal immunized with a SIXl immunogen as described above, and the culture supernatants of the resulting hybridoma cells are screened to identify a hybridoma producing a monoclonal antibody that binds SIXl.
  • lymphocytes typically splenocytes
  • Any of the many well known protocols used for fusing lymphocytes and immortalized cell lines can be applied for the purpose of generating an anti-SIXl monoclonal antibody (see, e.g., G. Galfre et al. (1977) Nature 266:55052; Gefter et al Somatic Cell Genet.
  • the immortal cell line (e.g., a myeloma cell line) is derived from the same mammalian species as the lymphocytes.
  • murine hybridomas can be made by fusing lymphocytes from a mouse immunized with an immunogenic preparation of the present invention with an immortalized mouse cell line.
  • Preferred immortal cell lines are mouse myeloma cell lines that are sensitive to culture medium containing hypoxanthine, aminopterin and thymidine ("HAT medium").
  • myeloma cell lines may be used as a fusion partner according to standard techniques, e.g., the P3-NSl/l-Ag4-l, P3-x63-Ag8.653 or Sp2/O-Agl4 myeloma lines. These myeloma lines are available from the American Type Culture Collection (ATCC), Rockville, Md. Typically, HAT-sensitive mouse myeloma cells are fused to mouse splenocytes using polyethylene glycol ("PEG"). Hybridoma cells resulting from the - 28 -
  • PEG polyethylene glycol
  • HAT medium kills unfused and unproductively fused myeloma cells (unfused splenocytes die after several days because they are not transformed).
  • Hybridoma cells producing a monoclonal antibody of the invention are detected by screening the hybridoma culture supernatants for antibodies that bind SIXl , e.g. , using a standard ELISA assay.
  • a monoclonal anti-SIXl antibody can be identified and isolated by screening a recombinant combinatorial immunoglobulin library (e.g., an antibody phage display library) with SIXl to thereby isolate immunoglobulin library members that bind SIXl .
  • Kits for generating and screening phage display libraries are commercially available (e.g. , the Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01 ; and the Stratagene SurfZAPTM Phage Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents particularly amenable for use in generating and screening antibody display library can be found in, for example, Ladner et al. U.S.
  • recombinant anti-SIXl antibodies such as chimeric and humanized monoclonal antibodies, comprising both human and non-human portions, which can be made using standard recombinant DNA techniques, are within the scope of the invention.
  • chimeric and humanized monoclonal antibodies can be produced by - 29 -
  • An anti-SIXl antibody (e.g., monoclonal antibody) can be used to isolate SIXl by standard techniques, such as affinity chromatography or immunoprecipitation.
  • An anti-SIXl antibody can facilitate the purification of natural SIXl from cells and of recombinantly produced SIXl expressed in host cells.
  • an anti-SIXl antibody can be used to detect SIXl protein (e.g., in a cellular lysate or cell supernatant). Detection may be facilitated by coupling (i.e., physically linking) the antibody to a detectable substance. Examples of detectable substances include various enzymes, prosthetic groups, fluorescent materials, luminescent materials and radioactive materials.
  • suitable enzymes include horseradish peroxidase, alkaline phosphatase, ⁇ - galactosidase, or acetylcholinesterase;
  • suitable prosthetic group complexes include streptavidin biotin and avidin/biotin;
  • suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes luminol; and examples of suitable radioactive material include 125 !, 13 1 I, 35 S or 3 H. - 30 -
  • SIXl expression correlates with tumorogenesis and metastasis (e.g., breast tumorigenesis and metastasis) Accordingly, detection of SIXl protein or nucleic acid molecules provides is useful for diagnosing cancer and monitoring both tumor progression and metastasis. Furthermore, inhibition of SIXl expression may result in inhibition of cancer and tumor metastasis ,(e.g., breast, colon, and lung cancer).
  • the isolated nucleic acid molecules of the invention can be used to inhibit SIXl protein expression (e.g., antisense SIXl nucleic acid molecules), to detect SIXl mRNA (e.g., SIXl nucleic acid probes based on the nucleotide sequence of SEQ ID NO:l) and to modulate SIXl activity, as discussed further below.
  • SIXl protein expression e.g., antisense SIXl nucleic acid molecules
  • SIXl mRNA e.g., SIXl nucleic acid probes based on the nucleotide sequence of SEQ ID NO:l
  • anti-SIXl antibodies of the invention can be used to detect and isolate SIXl protein and modulate SIXl activity, also discussed further below.
  • the invention provides a method for detecting the presence of SIXl in a biological sample.
  • the method involves contacting the biological sample with an agent capable of detecting SIXl protein or nucleic acid molecules (e.g., SIXl mRNA) such that the presence of SIXl is detected in the biological sample.
  • SIXl mRNA a preferred agent for detecting SIXl mRNA is a labeled or labelable nucleic acid probe capable of hybridizing to SIXl mRNA.
  • the nucleic acid probe can be, for example, the full-length SIXl cDNA of SEQ ID NO: 1, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent conditions to SIXl mRNA.
  • a preferred agent for detecting SIXl protein is a labeled or labelable antibody capable of binding to SIXl protein.
  • Antibodies can be polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used.
  • the term "labeled or labelable", with regard to the probe or antibody is intended to encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a detectable substance to the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity with another reagent that is directly labeled. Examples of indirect labeling include detection of a primary antibody using a fluorescently labeled secondary antibody and end-labeling of a DNA probe with biotin such that it can be detected with fluorescently labeled streptavidin. - 31 -
  • a biological sample comprises a sample which has been isolated from a subject and is subjected to a method of the present invention without further processing or manipulation subsequent to its isolation.
  • the biological sample can be processed or manipulated subsequent to being isolated and prior to being subjected to a method of the invention.
  • a sample can be refrigerated (e.g., stored at 4°C), frozen (e.g., stored at -20°C, stored at -135°C, frozen in liquid nitrogen, or cryopreserved using any one of many standard cryopreservation techniques known in the art).
  • a sample can be purified subsequent to isolation from a subject and prior to subjecting it to a method of the present invention.
  • the term "purified" when used in the context of a biological sample is intended to indicate that at least one component of the isolated biological sample has been removed from the biological sample such that fewer components, and consequently, purer components, remain following purification.
  • a serum sample can be separated into one or more components using centrifugation techniques known in the art to obtain partially-purified sample preparation.
  • a tissue or tumor sample can be purified such that substantially only the protein or mRNA component of the biological sample remains.
  • the mRNA component of a biological sample can be amplified (e.g., by RT-PCR) such that detection of SIXl mRNA is facilitated.
  • RT-PCR an abbreviation for reverse transcriptase-polymerase chain reaction
  • cDNA which is complementary to the base sequences of the mRNA.
  • Large amounts of selected cDNA can then be produced by means of the polymerase chain reaction which relies on the action of heat-stable DNA polymerase for its amplification action.
  • Alternative amplification methods include: self sustained sequence replication (Guatelli, J.C. et al, 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh, D.Y. et al, 1989, - 32 -
  • the detection methods of the present invention can be used to detect SIXl protein or nucleic acid molecules in a biological sample in vitro as well as in vivo.
  • in vitro techniques for detection of SIXl mRNA include Northern hybridizations and in situ hybridizations.
  • in vitro techniques for detection of SIXl DNA include Southern hybridizations.
  • in vitro techniques for detection of SIXl protein include enzyme linked immunosorbent assays (ELI S As), Western blots, immunoprecipitations and immunofluorescence.
  • SIXl protein can be detected in vivo in a subject by introducing into the subject a labeled anti-SIXl antibody.
  • the antibody can be labeled with a radioactive marker whose presence and location in a subject can be detected by standard imaging techniques.
  • the biological sample is a tissue sample or tumor sample.
  • the tissue sample or tumor sample may comprise tissue or a suspension of cells.
  • a tissue section for example, a freeze-dried, parafin- embedded, or fresh frozen section of tissue removed from a patient, or a section of a tumor biopsy can be used as the biological sample.
  • the sample may include a biological fluid obtained from a subject (e.g., blood, ascites, pleural fluid or spinal fluid). Following collection, tissue or tumor samples can be stored at temperatures below -20°C to prevent degradation until the detection method is to be performed.
  • a biological sample in which SIXl mRNA or protein is to be detected is a mammary tumor sample.
  • a biological sample in which SIXl mRNA is to be detected is, for example, a lung, colon, or cervical tumor.
  • the detection methods of the invention described above can be used as the basis for a method of diagnosis of a subject with a tumor (e.g., a breast tumor), can be used as the basis for a method of monitoring the progression of cancer in a subject, or can be used as the basis for a method of prognosing a person at risk for developing a cancer.
  • a tumor e.g., a breast tumor
  • the expression pattern of SIXl mRNA can - 33 -
  • SIXl mRNA levels are detectable in tissues (e.g., skeletal muscle, pituitary gland, salivary gland, lung and trachea) but was not detectable in normal mammary tissue. SIXl mRNA levels were elevated in 44% of primary breast tumors analysed and further elevated in 90%) of metastatic lesion examined.
  • the invention features a method of determining the metastatic potential of a tumor which involves contacting a sample of the tumor (or isolate) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the tumor sample or isolate, thereby determining the metastatic potential of the tumor.
  • Another aspect of the invention features a prognostic method for determining whether a subject is at risk for developing cancer which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby determining whether the subject is at risk for developing cancer.
  • Yet another aspect of the invention features a method of diagnosing cancer in a subject which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing cancer in the subject.
  • the diagnostic methods of the present invention further involve determining the level of SIXl polypeptide or mRNA in the sample or isolate.
  • the diagnostic methods of the present invention involve comparing the level of SIXl polypeptide or mRNA in the sample or isolate with the level of SIXl polypeptide or mRNA in a control sample.
  • the diagnostic or prognostic methods further include the step of forming a prognosis or forming a diagnosis.
  • control is from normal cells and the tumor sample is a suspected primary tumor sample.
  • Primary malignancy of the tumor cell sample can be diagnosed based on an increase in the level of expression of SIXl mRNA or protein in the tumor sample as compared to the control.
  • control is from normal cells or a primary tumor and the tumor sample is a suspected metastatic - 34 -
  • the prognostic methods of the present invention are of particular utility in the early detection and treatment of breast cancer. It will be appreciated by those skilled in the art that breast cancer may not be as amenable to early detection as, for instance, cervical cancer, due to the lack of cytomorphologic screening methods available (e.g., pap smears for the detection of cellular abnormalities of the cervix). Accordingly, the prognostic methods of the present invention feature, for example, careful histological examination of breast biopsies (e.g. , biopsies of pre-malignant or pre-invasive lesions, atypical hyperplasias and/or carcinoma in situ). Upon the morphological detection of such a lesion, hyperplasia or carcinoma, it may be desireable to utilize an amplification step of the present invention to detect, for example, SIX nucleic acid.
  • breast biopsies e.g. , biopsies of pre-malignant or pre-invasive lesions, atypical hyperplasias and/or carcinoma
  • kits for detecting the presence of SIXl in a biological sample e.g., a tumor sample
  • the kit can comprise a labeled or labelable agent capable of detecting SIXl protein or mRNA in a biological sample and a means for determining the amount of SIXl in the sample.
  • the agent can be packaged in a suitable container.
  • the kit can further comprise a means for comparing the amount of SIXl in the sample with a standard and/or can further comprise instructions for using the kit to detect SIXl mRNA or protein.
  • SIXl activity associated with a cell e.g., for therapeutic purposes.
  • SIXl activity "associated with a cell” is intended to include SIXl activity within the cell and/or within the nucleus of the cell.
  • HSIXl is a homeobox gene that is diferentially expressed in the cell cycle and whose overexpression leads to an abbrogation of the DNA damage-induced G2 cell cycle checkpoint.
  • the invention pertains to methods of modulating SIXl activity in a subject afflicted with a disease associated with G2 checkpoint control.
  • diseases include, but are not limited to Ataxia telangiectasia (Scott et al. (1994) Int.
  • the modulatory method of the invention involves contacting the cell with an agent that modulates SIXl activity associated with the cell.
  • the agent stimulates SIXl activity.
  • stimulatory agents include active SIXl protein and a nucleic acid molecule encoding SIXl that has been introduced into the cell.
  • the agent inhibits the SIXl activity.
  • inhibitory agents include antisense SIXl nucleic acid molecules and anti-SIXl antibodies.
  • the invention provides a method for inhibiting development or progression of a metastatic phenotype in a tumor cell comprising contacting the tumor cell with an agent which inhibits the amount of SIXl in the tumor cell.
  • the term "in the tumor cell” is intended to include SIXl within the cell and/or SIXl within the nucleus of the cell.
  • SIXl is predicted to be a transcription factor, it is likely that it exerts tumor suppressive effects nuclearly.
  • the agent that inhibits SIXl in the tumor cell can be an antisense SIXl nucleic acid or a SIXl antibody.
  • a SIXl inhibitory agent preferably in a pharmaceutically acceptable carrier, can be administered to a tumor-bearing subject by an appropriate route to inhibit the development or progression of the metastatic phenotype of the tumor. Suitable routes of administration include intravenous, intramuscular or subcutaneous injection, injection directly into the tumor site or implantation of a device containing a slow-release formulation.
  • the SIXl inhibitory agent preparation can also be incorporated into liposomes or other carrier vehicles to facilitate delivery to the tumor - 36 -
  • a non-limiting dosage range is 0.001 to 100 mg/kg/day, with the most beneficial range to be determined by routine pharmacological methods.
  • the development or progression of the metastatic phenotype can be inhibited in tumor cells by modifying them to express a SIXl inhibitory agent (e.g., a SIXl antisense nucleic acid molecule) by introducing into the tumor cells a SIXl antisense nucleic acid expresssion vector.
  • a SIXl inhibitory agent e.g., a SIXl antisense nucleic acid molecule
  • SIXl antisense nucleic acids can be delivered to the tumor cells.
  • adenoviral vectors carrying appropriate regulatory elements can be used to deliver the SIXl antisense nucleic acids to the tumor cells.
  • modulating SIXl activity may be desirable.
  • SIXl overexpression in cells results in abbrogation of the G2 cell cycle checkpoint. Accordingly, SIXl inhibition may be desireable to reconstitute growth arrest in a population of cells, such that DNA repair can take place.
  • EXAMPLE 1 Identification of HSIXl as a Cell-Cycle Regulated Gene by Differential Display Methodology
  • differential display was performed with a two- step polymerase chain reaction (PCR) and the LHA series of primers as described (Martin et al (1996) in Methods in Molecular Biology-Differential Display Methods and Protocols, eds. Pardee & Liang (Humana, Totowa, NJ), Vol. 85, pp.77-85).
  • PCR polymerase chain reaction
  • LHA LHA series of primers as described
  • HSIXl cDNA for use as a probe, primers were designed to the 5' end (5'-ATG TCG ATG CTG CCG TCG TTT-3') (SEQ ID NO:5) and 3' end (5'-CAC TTA GGA CCC CAA GTC CAC-3') (SEQ ID NO:6) of the HSIXl cDNA.
  • Reverse transcription (RT) reactions were performed with 0.2 ⁇ g RNA template, 25 ⁇ M dNTPs, 1 mM DTT, 5 ⁇ M oligo dT 12 .
  • lg , and lx reverse transcriptase buffer 50 mM Tris-HCl, pH 8.3, 75 mM KC1, 3 mM >MgC12.
  • the reaction conditions were as follows: 65°C, 5 min.; 37°C, 60 min. (5 min. into this cycle 200 units SuperscriptTM II was added to each reaction); 95 °C, 5min.
  • PCR conditions were as follows: (94°C, 45 sec; 69°C, 45 sec.;72°C, 45 sec. ⁇ x 25, followed by an extension at 72°C, 5 min.
  • the PCR products were subcloned utilizing the TA cloning system (InVitrogen).
  • HSIXl Drosophila sine oculis
  • the HSIXl protein displays approximately 98%> sequence homology to mouse SIXl (Boucher et al. (1996) Genomics 33:140-142) which was first cloned by virtue of its homology to the Drosophila gene sine oculis (so) (Oliver et al. (1995) Development 121 :693-705).
  • Mouse SIXl is 62% similar to the Drosophila gene, and 87% similar if sequences C- terminal to the homeodomain are excluded (supra). So plays a role in the development of the fly visual system.
  • Example 2 Expression of HSIXl in Primary Tumors, Metastatic Tumors, and Other Tumor-Derived Cell Lines
  • a Human RNA Master Blot from ClontechTM was probed to determine HSIXl expression in normal human adult mammary tissue as well as its expression pattern in other normal adult andf fetal tissues (as expression of HSIXl and its mouse homolog had previously only been demonstrated in developing mouse limb tendons and in human adult skeletal muscle).
  • the Human RNA Master Blot includes poly A+ RNA from the following tissues: whole brain, amygdala, caudate nucleus, cerebellum, cerebral cortex, frontal lobe, hippocampus, medulla oblongata, occipital lobe, putamen, substantia nigra, temporal lobe, thalamus, subthalamic nucleus, spinal cord, heart, aorta, skeletal muscle, colon, bladder, uterus, prostate, stomach, testis, ovary, pancreas, pituitary gland, adrenal gland, thyroid gland, salivary gland, mammary gland, kidney, liver, small intestine, spleen, thymus, peripheral leukocyte, lymph node, bone marrow, appendix, lung, trachea, placenta, fetal brain, fetal heart, fetal kidney, fetal liver, fetal spleen, fetal thymus, and fetal
  • the 21PT and 2 INT cell lines were derived from the primary tumor, whereas the 21MT-1 and 21MT-2 cell lines were established from a metastatic pleural effusion.
  • HSIXl expression was not detected in a normal breast cell line, 70N (Band and Sager, supra), but was detected in all cell lines derived from the above-mentioned patient.
  • Levels of expression in 21PT and 2 INT cells were approximately 3- and 2-fold less, respectively, than levels in 21MT1 cells, and 10- and 7-fold less, respectively, than levels in 21MT2 cells.
  • Relative HSIXl expression for each sample was as follows: 70N -0, 21PT ⁇ 5, 21NT -7, 21MT1 -14, and 21MT2 -46).
  • HSIXl expression was increased in a significant proportion of primary and metastatic breast cancer cases.
  • 35 human breast biopsy samples were obtained and expression was examined by Northern blot analysis.
  • Northern blot analysis was performed as described in Example 1, except that RNA was isolated from breast tumor specimens by the guanidinium thiocyanate/CsCl method as described in Maniatis et al, supra. Normalization to 36B4 was performed on these samples, as it has been shown to be a good control for breast cancer samples.
  • Figure 3 shows the results with 35 tumor samples examined for HSIXl expression. The results were quantitated and plotted as relative HSIXl expression. While normal adjacent breast, normal breast luminal cells, and normal breast myoepithelial cells demonstrated almost no HSIXl expression (lanes 1-3 respectively), 44%> of the primary tumors (lanes 4-27) and 90%> of - 40 -
  • the metastatic lesions (lanes 28-37) expressed greater than a three-fold increase in HSIXl mRNA expression over levels in normal adjacent breast.
  • the 10 metastatic lesions utilized in the analysis came from either the lymph nodes (6 samples), bone/soft tissue (2 samples), the lung (1 sample), or the pleural wall (1 sample).
  • the Human RNA Master blot allowed examination of HSIXl expression in normal lymph nodes and lung. Five of the six lymph node metastases expressed HSIXl, however HSIXl expression was not observed in normal lymph nodes, indicating that the high expression levels in lymph node lesions came from the metastatic tumor itself.
  • HSIX 1 expression was detected in mRNA isolated from cells of a colon adenocarcinoma of a patient (termed “SW480 cells”) and was significantly enhanced in mRNA from cells isolated from a metastatic lesion of the same patient (termed "SW620 cells”).
  • SW480 cells colon adenocarcinoma of a patient
  • SW620 cells metastatic lesion of the same patient
  • HSIXl mRNA was demonstrated to be overexpressed in multiple lung cancer cells.
  • the first sample being isolated from a lung tumor
  • the second sample being isolated from normal adjacent tissue of the same patient
  • HSIXl expression was found to increase from 1.5 to 10-fold, among the various tumor-derived samples tested, as compared to their normal counterparts.
  • the above-described data indicate that HSIXl is overexpressed in several types of cancer in addition to breast.
  • MCF7 mammary carcinoma cell line was transfected with SIXFL, a construct that allows for constitutive expression of the full length wild type HSIXl cDNA, or with the parent vector expressing the chloramphenicol acetyl transferase gene (CAT) as a control.
  • SIXFL chloramphenicol acetyl transferase gene
  • MCF7 cells were seeded in 60 mM dishes at 5 x 10 3 cells/dish and transfected with SIXFL or with pcDNA3.1(CAT) utilizing Superfect (Qiagen). Transfections were performed according to the manufacturers protocol. 24 h after transfections the cells were passaged 1 : 15 in appropriate media containing 600 mg/ml G418. Approximately two weeks later stable transfectants were selected utilizing cloning cylinders and examined for HSIXl expression via Northern blot analysis.
  • HSIXA1, A8, and A13 stable clones expressing HSIXl
  • CATB3 control transfectants
  • pcDNA3.1(CAT) pcDNA3.1(CAT)
  • Figure 5 depicts a summary of the percentage of cells in G2 at various time points before and after irradiation in the transfectants and controls.
  • the data graphed are from one experiment performed at 8 Gy and are representative of several experiments performed at 5 and 8 Gy. Note that cells expressing HSIXl progress through the G2 arrest at a more rapid rate than transfected controls.
  • a cell line transfected with SIXFL (HSIXA2) that did not express HSIXl (possibly due to silencing of the gene upon insertion into the chromosomal DNA) was tested in the X-ray irradiation assay.
  • This cell line behaved as the CAT controls, confirming that HSIXl expression was necessary for abrogation of the G2 cell cycle checkpoint, and that the expression of CAT did not affect the checkpoint in any way.
  • 21PT cells were transfected with HSIXl or an HSIXl fusion protein that contains an 8 amino acid epitope tag (XPRESS) for following protein expression.
  • Immunohistochemistry of the latter transfectants with the anti-XPRESS antibody revealed a punctate nuclear localization of the HSIXl protein, as is commonly observed with proteins involved in replication and/or transcription. The result was expected, as HSIXl is a putative transcription factor.
  • Figure 4 shows a representative FACS analysis on - 44 -
  • HOX11 another homeobox gene, HOX11, has recently been found to disrupt the G2 cell cycle checkpoint by interacting with PP2A protein phosphatase (Kawabe et al. (1997) Nature 385:454-458). HOX11 has been implicated in cancer (supra), as it was isolated from a chromosomal breakpoint in human T-cell leukemia (Hatano et al. (1991) Science 253:79-82; Kennedy et al. (1991) Proc. Natl Acad. Sci. USA 88:8900-8904; and Dube et al. (1991) Blood 78:2996-3002).
  • transgenic mice expressing HOX11 in the thymus demonstrated cell cycle alterations and progression to malignancy (Hatano et al (1992) Curr. Opin. Oncol. 4:24-26). Since HSIXl was originally cloned from a mammary carcinoma cell line (21PT), and since overexpression of this gene leads to altered cell cycle control similar to that seen with HOX11, it can be reasoned that HSIXl may be differentially expressed in cancer.
  • 21PT mammary carcinoma cell line
  • EXAMPLE 4 Generation of HSIXl-Specific Antibodies
  • the C-terminus of HSIXl (from nucleotide 822 until the stop codon) was amplified and subcloned into the pGEX2T bacterial expression vector to create a GST- HSIX1 fusion protein. Expression of the protein was induced with 0.1 mM IPTG, and the protein was then purified from bacterial extracts utilizing glutathione-sepharose beads. Following purification, the fusion protein was run on a SDS-PAGE gel, very lightly coomassie stained, and extracted from the gel.
  • the extracted gel piece containing the GST-HSIX1 C-terminus was then injected into rabbits (Spring Valley Laboratories (Woodbine, MD)). Following injection and two boosts, the rabbit was bled and the sera tested for HSIXl antibodies. Following demonstration of HSIXl immunoreactivity, the sera was passed over a GST affinity column (to remove any antibody recognizing the GST portion of the fusion), and was subsequently purified on a GST-HSIX C-terminus column. Affinity purified anti- 45
  • HSIXl antibody was then tested on cells transfected with HSIXl versus untransfected cells ( Figure 6).
  • EXAMPLE 6 HSIXl Expressing Cells Lead to Larger Tumors When Injected Into Nude Mice
  • mice Six nude mice each were injected in the thigh with either 1 x 10 7 A13 cells (HSIX-transfected) or B3 cells (control transfectants). Tumor size was measured after 4.5 weeks. Tumors from B3 cell-injected mice ranged in size from approximately 35- 140 mm 3 whereas tumors from A13 cell-injected mice ranged in size from approximately 110-370 mm 3 (Table I).

Abstract

The present invention relates to methods for detecting the presence of SIX1 protein or nucleic acid in a biological sample in which a biological sample is contacted with an agent capable of detecting SIX1 protein or mRNA such that the presence of SIX1 is detected in the biological sample. Diagnostic and prognostic methods utilizing HSIX1 as an indicator of cancer and cancer progression are also provided. Compositions and kits for detecting the presence of SIX1 in a biological sample are also described.

Description

- 1 -
METHODS AND COMPOSITIONS FOR DIAGNOSING AND PREDICTING THE BEHAVIOR OF CANCER
Background of the Invention Today, cancer is known to be one of the leading causes of mortality and morbidity among men and women. In particular, breast cancer is believed to be the leading cause of death among women (Harris, et al. (1992) New Engl. J. Med. 327: 319- 28; Harris, et al. (1992) New Engl. J. Med. 327: 390-8; Harris, et al. (1992) New Engl. J. Med. 327: 473-80; and McGuire and Clark (1992) New Engl J. Med. 326: 1756-61). The development of cancer is accompanied by a number of genetic changes (For review see Porter- Jordan, (1994) Hematol Oncol. Clin. N. Am. 8:73). Such changes include gross chromosomal alterations as well as loss of genetic markers (Devilee et al. (1994) Biochim. Biophys. Ada 1198:113 and Callahan et al. (1993) J. Cell Biochem. Siψpl 17:167). For example, the progression of breast neoplasia has also been shown to result in qualitative and quantitative changes in expression of previously identified genes that encode growth factors and their receptors (Zajchowski et al. (1988) Cancer Res. 48:7041), structural proteins (Trask et α/.(1990) Proc. Natl. Acad. Sci. 87:2319), second messenger proteins (Ohuchi et α/.(1986) Cancer Res. 26:2511), and transcription factors (Harris (1992) Adv. Cancer Res. 59:69). Furthermore, novel genes have been identified whose increased expression can be correlated with the occurrence of breast tumors (e.g., mammaglobin, Watson and Fleming, U.S. Pat. No. 5,668,267).
Although progress has been made in the identification of various potential breast cancer marker genes, as well as other biomolecular markers of cancer (e.g., Prostate- Specific Antigen in the case of Prostate cancer) there remains a continuing need for new marker genes along with their expressed proteins that can be used to specifically and selectively identify the appearance and pathogenic development of cancer in a patient.
Summary of the Invention
Briefly, the present invention relates to methods for diagnosing cancer, for example, breast, colon, lung, or cervical cancer, in a subject in which the presence of human SIXl (HSIXl) homeobox gene sequences bears a positive correlation to the - 2 -
existence of malignant disease. The present invention is based, at least in part, on the demonstration of an aberrant expression of the HSIXl homeobox gene in primary breast cancers and metastatic lesions and in cells isolated from lung, colorectal, and cervical tumors, as well as from subjects having chronic myelogenous leukemia. The present invention further relates to compositions of molecular probes which can be utilized in such diagnostic methods.
Accordingly, one aspect of the invention pertains to methods for detecting the presence of SIXl in a biological sample. In a preferred embodiment, the method involves contacting a biological sample (e.g., a tissue or tumor sample or isolate of such a sample) with an agent capable of detecting SIXl protein or nucleic acid (e.g., mRNA or cDNA) molecule such that the presence of SIXl is detected in the biological sample. The agent can be, for example, a labeled or labelable nucleic acid probe capable of hybridizing to a SIXl nucleic acid molecule or a labeled or labelable antibody capable of binding to SIXl protein. Another aspect of the invention features a method of determining the metastatic potential of a tumor which involves contacting a sample of the tumor (or isolate) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the tumor sample or isolate, thereby determining the metastatic potential of the tumor. Yet another aspect of the invention features a prognostic method for determining whether a subject is at risk for developing cancer which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby determining whether the subject is at risk for developing cancer. Another aspect of the invention features a method for diagnosis of a tumor which involves contacting a tumor sample (or isolate) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing the tumor. Yet another aspect of the invention features a method of diagnosing cancer in a subject which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing cancer in the subject. Kits for detecting SIXl in a biological sample are also within the scope of the invention.
Brief Description of the Drawings
Figure 1 is the complete cDNA sequence and deduced amino acid sequence of human SIXl (SEQ ID NOs: 1 and 2, respectively).
Figure 2A-C. Figure 2A depicts 3H-thymidine incorporation following release from mimosine arrest showing progression of 21PT cells through S-phase. Figure 2B is a photograph showing a section of a differential display gel demonstrating the differential expression of 6A (subsequently identified as HSIXl) in S-phase. Figure 2C is a photograph of a Northern blot confirming the differential expression of HSIXl throughout S-phase of 21PT cells. RNA was isolated from cells following release from mimosine arrest and Northern blot analysis was performed with the HSIXl cDNA probe. Bottom panel shows EtBr staining as a loading control.
Figure 3 is a quantitative representation of a Northern blot analysis of 3 control tissues (normal adjacent breast, normal luminal cells, and normal myoepithelial cells- lanes 1,2, and 3 respectively) as well as on 25 primary breast tumor biopsies (lanes 4-28) and 10 metastatic lesions (lanes 29-38). The blot was stripped and reprobed with 36B4 (Hatano et al. (1991) Science 253 J9-82) for normalization and relative HSIXl expression was plotted. A 3 -fold increase over normal adjacent breast was considered positive for HSIXl and is marked by a dashed line.
Figure 4 depicts FACS analysis of HSIXl overexpressors which become polyploid over several months in culture. "His/lac7 control" indicates cells transfected approximately 6 months prior to FACS analysis shown. "21PT parent" indicates cells transfected approximately 4.5 months prior to FACS analysis shown. "SIXFL4" and "SIXFL6" indicates cells transfected approximately 4 and 6 months prior to FACS analysis shown, respectively.
Figure 5 demonstrates that HSIXl overexpression abrogates the G2 cell cycle checkpoint. The graph shows a summary of the percentage of either HSIXl - transfectanted cells or control cells (CAT transfectants) in G2 at various times after X- - 4 -
ray irradiation. Stippled bars indicate the HSIX overexpressors (Al, A8, and A13). The shaded bars indicate CAT-transfected controls (Bl and B3).
Figure 6 is a Western Blot demonstrating the immunoreactivity of an anti-HSIX antibody with SIX protein from transfected cells. Lanes marked "T" indicate lysates of MCF7 cells transfected with HSIXl . Lanes marked "M" indicate lysates from mock transfected MCF7 cells. Antibody dilutions are indicated below respective pairs of lanes.
Detailed Description of the Invention The invention is based, at least, in part on the discovery that the human SIXl gene ("HSIXl"), a known homoebox gene, is aberrantly expressed (e.g., overexpressed) in tumorigenic cells. As used herein, HSIXl refers to a gene obtained from human adult skeletal muscle (Boucher et al. (1996) Genomics 33:140-142) whose mouse counterpart has been implicated in the development of limb tendons (Oliver et al. (1995) Development 121 :693-705). HSIXl is a member of a family of genes termed homeobox genes. As used herein "homeobox genes" include genes which encode a family of proteins, termed homeodomain-containing proteins, which act as transcription factors that regulate the coordinated expression of genes involved in both development and differentiation. Homeobox genes were identified initially in Drosophila, where they were found to be important in the control of sequence identity (Lewis (1978) Nature 276;565-570). Homeobox genes contain a common 183-nt sequence encoding a 61-aa domain that is responsible for DNA binding (McGinnis and Krumlauff (1992) Cell 68:283-302). They are postulated to act as a network of transcriptional regulators effecting cell-cell communication during normal development, alterations of which may contribute to the neoplastic phenotype. Recently, homeobox genes, including members of the Hox and Pax families have been identified as oncogenic transcription factors (Lawrence et al. (1996) Stem Cell 14:281-291 and Stuart and Gruss (1995) Human Mol. Genet. 4:1717-1720). Homeobox genes are often translocated to produce a chimeric protein with a new function, particularly in leukemias (Cillo (1994) Invasion Metastasis 14:38-49). However, others retain their wild type function and are overexpressed (Lawrence et al, Cillo, and Stuart and Gruss, supra). In addition to leukemias, recent - 5 -
studies have demonstrated homeobox gene involvement in solid tumors such as breast, kidney, lung and colon (Cillo, supra).
The present invention is further based, at least in part on the identification of HSIXl cDNA by its differential expression in cell cycle synchronized 21PT mammary adenocarcinoma cell line (a cell line derived from a patient who had an infiltrating and intraductal mammary adenocarcinoma) using the differential display method. Direct sequencing of one differentially-expressed cDNA revealed its identity as HSIXl . Further analysis revealed that HSIXl mRNA expression was very low in the first half of S phase and increased as 21PT cells are completing S phase. HSIXl expression was also detected in other cell lines derived from the same patient including 2 INT, 21MT-1 and 21MT-2. The 21PT and 2 INT cell lines were derived from a primary tumor, whereas the 21MT-1 and 21MT-2 cells lines were established from a metastatic pleural effusion. By contrast, HSIXl expression was not detected in a normal breast cell line, 70N (Band and Sager (1989) Proc. Natl. Acad. Sci. USA 86:1249-1253). The invention is further based on the discovery that the HSIXl homeobox protein functions in a cell cycle-regulated manner and acts to abrogate G2 cell cycle arrest. In particular, cells which overexpress HSIXl progress through X-ray irradiation- induced G2 arrest at a more rapid rate than control cells (e.g., normal cells). Furthermore, continued passaging of 21PT cells which constitutivly overexpress HSIXl leads to ploidy changes over extended periods of time.
The molecular weight of HSIXl appears to be unchanged in 21PT cells as compared to normal cells, suggesting that no gross genetic alterations exist. However, a translocation may occur upstream of the transcription start site resulting in aberrant expression, or point mutations or small deletions/insertions may exist in the gene. Alternatively, overexpression of wild type HSIXl mRNA may contribute to the tumorigenic phenotype, consistent with a model proposed by Sager et al, which hypothesizes that tumorigenesis is not only the result of genetic mutations, but also of overexpression of wild type genes. In fact, direct sequencing of HSIXl from 21PT cells resulted in wild type HSIX DNA sequence. The fact that SIXl overexpression correlates with both the tumorigenic phenotype as well as with abrogation of the G2 cell - 6 -
cycle check point, indicates its utility as a growth-related or aberrant growth marker or marker of the tumorigenic phenotype.
Accordingly, the present invention features a method for detecting the presence of SIXl in a biological sample (e.g., a tumor sample) involving contacting a biological sample with an agent (e.g., a nucleic acid probe or antibody) capable of detecting SIXl protein or nucleic acid (e.g. , mRNA or cDNA) such that the presence of SIXl is detected in the biological sample.
As used herein, a "biological sample" refers to a sample of biological material obtained from a subject, preferably a human subject, or present within a subject, preferably a human subject, including a tissue, tissue sample, or cell sample (e.g., a tissue biopsy, for example, an aspiration biopsy, a brush biopsy, a surface biopsy, a needle biopsy, a punch biopsy, an excision biopsy, an open biobsy, an incision biopsy or an endoscopic biopsy), tumor, tumor sample, or biological fluid (e.g., blood, serum, lymph, spinal fluid). As used herein, a "tissue sample" refers to a portion, piece, part, segment, or fraction of a tissue which is obtained or removed from an intact tissue of a subject, preferably a human subject. For example, tissue samples can be obtained from the pancreas, stomach, liver, secretory gland, bladder, lung, skin, prostate gland, breast ovary, cervix, uterus, brain, eye, connective tissue, bone, muscles or vasculature. In a preferred embodiment, the biological sample is a breast tissue sample. In another embodiment, the biological sample is a tissue sample, provided that it is not a breast tissue sample. In yet another embodiment, the biological sample is a tumor sample (e.g., a tumor biopsy).
As used herein, a "tumor sample" refers to a portion, piece, part, segment, or fraction of a tumor, for example, a tumor which is obtained or removed from a subject (e.g., removed or extracted from a tissue of a subject), preferably a human subject. A tumor sample can be obtained, for example, from a lung carcinoma, a colon carcinoma, a cervical carcinoma, an adenocarcinoma, a melanoma, a leukemia, a lymphoma, a glioma, a neuroblastoma, a retinoblastoma, and a sarcoma. In one embodiment, the tumor sample is obtained from a breast tumor (e.g., a breast tumor sample). In another embodiment, the tumor sample is obtained from a tumor, provided that the tumor is not - 7 -
a breast tumor. In yet another embodiment, the tumor sample is obtained from a primary tumor (e.g., is a primary tumor sample). In another embodiment, the biological sample is obtained metastatic lesion (e.g., is a metastatic lesion sample).
As defined herein, a "primary tumor" is a tumor appearing at a first site within the subject and can be distinguished from a "metastatic tumor" which appears in the body of the subject at a remote site from the primary tumor. As used herein, a "metastatic tumor" is a tumor resulting from the dissemination of cells from a primary tumor by the lymphatics or blood vessels or by direct extension through serum- contaning or serum-producing cavities or other spaces. The present invention also encompasses the use of isolates of a biological sample in the methods of the invention. As used herein, an "isolate" of a biological sample (e.g., an isolate of a tissue or tumor sample) refers to a material or composition (e.g., a biological material or composition) which has been separated, derived, extracted, purified or isolated from the sample and preferably is substantially free of undesireable compositions and/or impurities or contaminants associated with the biological sample. Preferred isolates include, but are not limited to, DNA (e.g., cDNA or genomic DNA), RNA (e.g., mRNA), and protein (i.e., purified protein, protein extracts, polypeptides). Additional preferred isolates include cells as well as biological fluids (e.g., blood, serum, lymph, spinal fluid). The present invention features agents which are capable of detecting SIXl polypeptide or mRNA such that the presence of SIX is detected. As defined herein, an "agent" refers to a substance which is cabable of identifying or detecting SIX in a biological sample (e.g., identifies or detects SIX mRNA, SIX DNA, SIX protein, SIX activity). In one embodiment, the agent is a labeled or labelable antibody which specifically binds to SIXl polypeptide. As used herein, the phrase "labeled or labelable" refers to the attaching or including of a label (e.g., a marker or indicator) or ability to attach or include include a label (e.g., a marker or indicator). Markers or indicators include, but are not limited to, for example, radioactive molecules, colorimetric molecules, and enzymatic molecules which produce detectable changes in a substrate. In one embodiment the agent is an antibody which specifically binds to all or a portion of a SIX protein (e.g., hSIXl). As used herein, the phrase "specifically binds" refers to - 8 -
binding of, for example, an antibody to an epitope or antigen or antigenic determinant in such a manner that binding can be displaced or competed with a second preparation of identical or similar epitope, antigen or antigenic determinant. In an exemplary embodiment, the agent is an antibody which specifically binds to all or a portion of HSIXl protein. In another embodiment, the agent is an antibody which specifically binds to all or a portion of a polypeptide selected from the group consisting of a polypeptide having the amino acid sequence of SEQ ID NO:2, a polypeptide comprising at least amino acids 183-284 of SEQ ID NO:2; and a polypeptide consisting of amino acids 183-284 of SEQ ID NO:2. In another embodiment, the antibody is a polyclonal antibody.
In yet another embodiment the agent is a labeled or labelable nucleic acid probe capable of hybridizing to SIXl mRNA. For example, the agent can be an oligonucleotide primer for the polymerase chain reaction which flank or lie within the nucleotide sequence encoding human SIXl . In a preferred embodiment, the biological sample being tested is an isolate, for example, RNA. In yet another embodiment, the isolate (e.g., the RNA) is subjected to an amplification process which results in amplification of SIXl nucleic acid. As defined herein, an "amplification process" is designed to strengthen, increase, or augment a molecule within the isolate. For example, where the isolate is mRNA, an amplification process such as RT-PCR can be utilized to amplify the mRNA, such that a signal is detectable or detection is enhanced. Such an amplification process is beneficial particularly when the biological, tissue, or tumor sample is of a small size or volume.
The present invention is also based in part on the discovery that HSIX is expressed in approximately one-half of primary breast cancers and nine-tenths of metastatic breast cancer lesions. While normal adjacent breast, normal breast luminal cells, and normal breast myoepithelial cells demonstrated almost no HSIXl expression, 44% of primary tumors and 90% of the metastatic lesions had elevated levels of HSIXl mRNA. HSIXl overexpression was likewise found in samples of lung tumors, when compared to adjacent normal lung tissue samples. Moreover, smaller scale analysis of several different tumor cell lines suggest that HSIXl may be expressed in a wide variety of tumors in addition to breast and lung. - 9 -
Accordingly, the invention further features diagnostic and prognostic methods useful in the detection and treatment of cancer, preferably breast cancer, described in detail herein. The invention further involves kits useful in the detection and treatment of cancer, described in detail herein. In one embodiment, the invention features a method of determining the metastatic potential of a tumor which involves contacting a sample of the tumor (or isolate) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the tumor sample or isolate, thereby determining the metastatic potential of the tumor. Another aspect of the invention features a prognostic method for determining whether a subject is at risk for developing cancer which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby determining whether the subject is at risk for developing cancer. As used herein, a subject "at risk for developing cancer" includes a subject which has been determined to have a higher probability of developing cancer when compared to an average representative of the population. A subject's "risk of developing cancer" can be based on an analysis of empirical criteria or on a persons pedigree.
Yet another aspect of the invention features a method for diagnosis of a tumor which involved contacting a tumor sample (or isolate) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing the tumor. Yet another aspect of the invention features a method of diagnosing cancer in a subject which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing cancer in the subject.
In another embodiment, the diagnostic methods of the present invention further involve determining the level of SIXl polypeptide or mRNA in the sample or isolate. As used herein, the phrase "determining the level" includes measuring an amount (e.g., making a quantitative determination) or making a qualitative determination (e.g., a - 10 -
determination of the presence versus the absence of SIX protein or nucleic acid). In yet another embodiment, the diagnostic methods of the present invention involve comparing the level of SIXl polypeptide or mRNA in the sample or isolate with the level of SIXl polypeptide or mRNA in a control sample. As used herein, the phrase "comparing the level" includes evaluating, balancing or contrasting the amount or presence of, for example, SIX protein or nucleic acid in a first sample (e.g., a test sample) with the amount or presence of SIX protein or nucleic acid in a second sample (e.g.. a control sample). In yet another embodiment, the diagnostic or prognostic methods further includes the step of forming a prognosis or forming a diagnosis. Another feature of the present invention includes a kit for detecting the presence of SIXl in a biological sample (or isolate of the sample) including a labeled or labelable agent capable of detecting SIXl polypeptide or mRNA in a biological sample. In one embodiment, the kit further includes a means for determining the amount of SIXl in the sample. In another embodiment, the agent of the kit is an antibody capable of specifically binding to SIXl polypeptide. In yet another embodiment, the agent of the kit is a nucleic acid probe capable of hybridizing to SIXl mRNA. In yet another embodiment, the kit further includes a means for comparing the amount of SIXl in the sample with a standard. In yet another embodiment, the kit further includes directions for use. Various aspects of the invention are described in further detail in the following subsections:
I. Isolated Nucleic Acid Molecules
One aspect of the invention involves isolated nucleic acid molecules that encode SIXl or biologically active portions thereof, as well as nucleic acid fragments sufficient for use as hybridization probes to identify SIXl -encoding nucleic acid (e.g., SIXl mRNA). As used herein, the term "nucleic acid molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA). The nucleic acid molecule may be single-stranded or double-stranded, but preferably is double-stranded DNA. An "isolated" nucleic acid molecule is free of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' ends of the - 11 -
nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. For example, in various embodiments, the isolated SIXl nucleic acid molecule may contain less than about 5 kb, 4kb, 3kb, 2kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived (e.g., a human mammary adenocarcinoma cell). Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, may be free of other cellular material.
In a preferred embodiment, an isolated nucleic acid molecule of the invention comprises the nucleotide sequence shown in SEQ ID NO: 1. The sequence of SEQ ID NO: 1 corresponds to the human SIXl cDNA. This cDNA comprises sequences encoding the SIXl protein (i.e., "the coding region", from nucleotides 276 to 1130), as well as 5' untranslated sequences (nucleotides 1 to 275) and 3' untranslated sequences (nucleotides 1131 to 1378). Alternatively, the nucleic acid molecule may comprise only the coding region of SEQ ID NO: 1 (e.g., nucleotides 276 to 1130). Moreover, the nucleic acid molecule of the invention can comprise only a portion of the coding region of SEQ ID NO: 1, for example a fragment encoding a biologically active portion of SIXl . The term "biologically active portion of SIXl " is intended to include portions of SIXl that retain the ability to enhance cell cycle progression, abrogate the G2 cell cycle checkpoint (e.g., accelerate the progression of cells through G2), or promote aberrant growth (e.g., tumorigenesis). The ability of portions of SIXl to inhibit cell cycle progression can be determined in standard cell cycle progression assays, for example using 3H-thymidine as an indicator of progression through S phase, propidium iodide staining and FACS analysis as an indicator of cells in G2/M phase of the cell cycle (described further below and in Examples 1 and 3). Nucleic acid fragments encoding biologically active portions of SIXl can be prepared by isolating a portion of SEQ ID NO: 1, expressing the encoded portion of SIXl protein or peptide (e.g., by recombinant expression in vitro as detailed below in Example 3) and assessing the ability of the encoded fragment to effect cell cycle progression.
The invention further encompasses nucleic acid molecules that differ from SEQ ID NO:l (and portions thereof) due to degeneracy of the genetic code and thus encode the same SIXl protein as that encoded by SEQ ID NO: 1. Accordingly, in another - 12 -
embodiment, an isolated nucleic acid molecule of the invention has a nucleotide sequence encoding a protein having an amino acid sequence shown in SEQ ID NO: 2. Moreover, the invention encompasses nucleic acid molecules that encode biologically active portions of SEQ ID NO: 2. A nucleic acid molecule having the nucleotide sequence of SEQ ID NO: 1 , or a portion thereof, can be isolated using standard molecular biology techniques and the sequence information provided herein. For example, a human SIXl cDNA can be isolated from a mammary adenocarcinoma cell line cDNA library using all or portion of SEQ ID NO: 1 as a hybridization probe and standard hybridization techniques (e.g., as described in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A
Laboratory Manual. 2nd, ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989). Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID NO: 1 can be isolated by the polymerase chain reaction using oligonucleotide primers designed based upon the sequence of SEQ ID NO: 1. For example, mRNA can be isolated from mammary adenocarcinoma cells (e.g. , by the guanidinium-thiocyanate extraction procedure of Chirgwin et al. (1979) Biochemistry 18: 5294-5299) and cDNA can be prepared using reverse transcriptase (e.g., Moloney MLV reverse transcriptase, available from Gibco/BRL, Bethesda, MD; or AMV reverse transcriptase, available from Seikagaku America, Inc., St. Petersburg, FL). Synthetic oligonucleotide primers for PCR amplification can be designed based upon the nucleotide sequence shown in SEQ ID NO: 1. A nucleic acid of the invention can be amplified using cDNA or, alternatively, genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to SIXl nucleotide sequence can be prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer. In addition to the human SIXl nucleotide sequence shown in SEQ ID NO: 1, it will be appreciated by those skilled in the art that DNA sequence polymorphisms that lead to changes in the amino acid sequences of SIXl may exist within a population (e.g. , the human population). Such genetic polymorphism in the SIXl gene may exist among individuals within a population due to natural allelic variation. Such natural allelic - 13 -
variations can typically result in 1-5 % variance in the nucleotide sequence of the a gene. Any and all such nucleotide variations and resulting amino acid polymorphisms in SIXl that are the result of natural allelic variation and that do not alter the functional activity of SIXl are intended to be within the scope of the invention. Moreover, nucleic acid molecules encoding SIXl proteins from other species, and thus which have a nucleotide sequence which differs from the human sequence of SEQ ID NO: 1, are intended to be within the scope of the invention. Nucleic acid molecules corresponding to natural allelic variants and nonhuman homologues of the human SIXl cDNA of the invention can be isolated based on their homology to the human SIXl nucleic acid disclosed herein using the human cDNA, or a portion thereof, as a hybridization probe according to standard hybridization techniques under stringent hybridization conditions. Accordingly, in another embodiment, an isolated nucleic acid molecule of the invention is at least 15 nucleotides in length and hybridizes under stringent conditions to the nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO: 1. In other embodiment, the nucleic acid is at least 30, 50, 100, 250 or 500 nucleotides in length. As used herein, the term "hybridizes under stringent conditions" is intended to describe conditions for hybridization and washing under which nucleotide sequences at least 60 % homologous to each other typically remain hybridized to each other. Preferably, the conditions are such that at least sequences at least 65 %, more preferably at least 70 %, and even more preferably at least 75 % homologous to each other typically remain hybridized to each other. Such stringent conditions are known to those skilled in the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2 X SSC, 0.1 % SDS at 50-65°C. Preferably, an isolated nucleic acid molecule of the invention that hybridizes under stringent conditions to the sequence of SEQ ID NO: 1 corresponds to a naturally-occurring nucleic acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a natural protein). In one embodiment, the nucleic acid encodes a natural human SIXl . - 14 -
In addition to naturally-occurring allelic variants of the SIXl sequence that may exist in the population, the skilled artisan will further appreciate that changes may be introduced by mutation into the nucleotide sequence of SEQ ID NO: 1, thereby leading to changes in the amino acid sequence of the encoded SIXl protein, without altering the functional ability of the SIXl protein. For example, nucleotide substitutions leading to amino acid substitutions at "non-essential" amino acid residues may be made in the sequence of SEQ ID NO: 1. A "non-essential" amino acid residue is a residue that can be altered from the wild-type sequence of SIXl (e.g., the sequence of SEQ ID NO: 2) without altering the activity of SIXl, whereas an "essential" amino acid residue is required for SIXl activity. Amino acid residues of SIXl that are strongly conserved among, for example, among members of the subfamily of homeobox genes that share a lysine within the DNA binding helix of the homeodomain (e.g., the Drosophila sine oculis (so) gene, the human myotonic dystrophy (DM)-associated homeodomain protein (DMHAP) and its murine homologue (Boucher et al. (1995) Hum. Mol. Genet. 4:1919- 1925), the human SIXl gene and its murine counterpart, and the murine SIX2 gene.
(e.g., conserved among proteins whose amino acid sequences are aligned for comparison purposes) are predicted to be essential in SIXl and thus are not likely to be amenable to alteration. Other amino acid residues, however, (e.g., those that are not conserved or only semi-conserved among members of the subfamily) may not be essential for SIXl activity and thus are more likely to be amenable to alteration.
Accordingly, another aspect of the invention pertains to nucleic acid molecules encoding SIXl proteins that contain changes in amino acid residues that are not essential for SIXl activity , e.g., residues that are not conserved or only semi-conserved among members of the subfamily. Such SIXl proteins differ in amino acid sequence from SEQ ID NO: 2 yet retain SIXl activity. In one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein comprises an amino acid sequence at least 60 % homologous to the amino acid sequence of SEQ ID NO: 2 and retains SIXl activity. Preferably, the protein encoded by the nucleic acid molecule is at least 70 % homologous to SEQ ID NO: 2, more preferably at least 80 % homologous to SEQ ID NO: 2, even more preferably at least 90 % - 15 -
homologous to SEQ ID NO: 2, and most preferably at least 95 % homologous to SEQ ID NO: 2.
To determine the percent homology of two amino acid sequences (e.g., SEQ ID NO: 2 and a mutant form thereof), the sequences are aligned for optimal comparison purposes (e.g., gaps may be introduced in the sequence of one protein for optimal alignment with the other protein). The amino acid residues at corresponding amino acid positions are then compared. When a position in one sequence (e.g., SEQ ID NO: 2) is occupied by the same amino acid residue as the corresponding position in the other sequence (e.g., a mutant form of SIXl), then the molecules are homologous at that position (i. e. , as used herein amino acid "homology" is equivalent to amino acid "identity"). The percent homology between the two sequences is a function of the number of identical positions shared by the sequences (/. e. ,
% homology = # of identical positions/total # of positions x 100). Such an alignment can be performed using any one of a number of computer algorithms designed for such a purpose. A preferred, non-limiting example of a mathematical algorithim utilized for the comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989). Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the GCG sequence alignment software package. When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used.
An isolated nucleic acid molecule encoding a SIXl protein homologous to the protein of SEQ ID NO: 2 can be created by introducing one or more nucleotide substitutions, additions or deletions into the nucleotide sequence of SEQ ID NO: 1 such that one or more amino acid substitutions, additions or deletions are introduced into the encoded protein. Mutations can be introduced into SEQ ID NO: 1 by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at one or more predicted non-essential amino acid residues. A "conservative amino acid substitution" is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art, including basic side chains (e.g., lysine, arginine, histidine), acidic - 16 -
side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g. , tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential amino acid residue in SIXl is preferably replaced with another amino acid residue from the same side chain family. Alternatively, in another embodiment, mutations can be introduced randomly along all or part of a SIXl coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for SIXl activity to identify mutants that retain SIXl activity. Following mutagenesis of SEQ ID NO: 1. the encoded protein can be expressed recombinantly (e.g., as described in Example 3) and the SIXl activity of the protein can be determined. Suitable assays for testing the activity of portions of SIXl proteins and mutated SIXl proteins are described in detail in Examples 1 and 3. In addition to the nucleic acid molecules encoding SIXl proteins described above, another aspect of the invention pertains to isolated nucleic acid molecules which are antisense thereto. An "antisense" nucleic acid comprises a nucleotide sequence which is complementary to a "sense" nucleic acid encoding a protein, e.g.. complementary to the coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can hydrogen bond to a sense nucleic acid.
The antisense nucleic acid can be complementary to an entire SIXl coding strand, or to only a portion thereof. In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the coding strand of a nucleotide sequence encoding SIXl . The term "coding region" refers to the region of the nucleotide sequence comprising codons which are translated into amino acid residues (e.g., the entire coding region of SEQ ID NO: 1 comprises nucleotides 276-1130). In another embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence encoding SIXl. The term "noncoding region" refers to 5' and 3' sequences which flank the coding region that are not translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions). - 17 -
Given the coding strand sequences encoding SIXl disclosed herein (e.g., nucleotides 276-1130 of SEQ ID NO: 1), antisense nucleic acids of the invention can be designed according to the rules of Watson and Crick base pairing. The antisense nucleic acid molecule may be complementary to the entire coding region of SIXl mRNA, but more preferably is an oligonucleotide which is antisense to only a portion of the coding or noncoding region of SIXl mRNA. For example, the antisense oligonucleotide may be complementary to the region surrounding the translation start site of SIXl mRNA. An antisense oligonucleotide can be, for example, about 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be constructed using chemical synthesis and enzymatic ligation reactions using procedures known in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or variously modified nucleotides designed to increase the biological stability of the molecules or to increase the physical stability of the duplex formed between the antisense and sense nucleic acids, e.g. , phosphorothioate derivatives and acridine substituted nucleotides can be used. Alternatively, the antisense nucleic acid can be produced biologically using an expression vector into which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described further in the following subsection).
In another embodiment, an antisense nucleic acid of the invention is a ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. A ribozyme having specificity for a SIXl -encoding nucleic acid can be designed based upon the nucleotide sequence of a SIXl cDNA disclosed herein (i.e., SEQ ID NO: 1). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the base sequence of the active site is complementary to the base sequence to be cleaved in a SIXl -encoding mRNA. See for example Cech et al U.S. Patent No. 4,987,071; and Cech et α/. U.S. Patent No. 5,116,742. Alternatively, SIXl mRNA can be used to select a catalytic RNA having a specific ribonuclease - 18 -
activity from a pool of RNA molecules. See for example Bartel, D. and Szostak, J.W. (1993) Science 261 : 1411-1418.
II. Recombinant Expression Vectors and Host Cells Another aspect of the invention pertains to vectors, preferably expression vectors, containing a nucleic acid encoding SIXl (or a portion thereof). As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers to a circular double stranded DNA loop into which additional DNA segments may be ligated. Another type of vector is a viral vector, wherein additional DNA segments may be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are are referred to herein as "expression vectors". In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. In the present specification, "plasmid" and "vector" may be used interchangeably as the plasmid is the most commonly used form of vector. However, the invention is intended to include such other forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions. The recombinant expression vectors of the invention comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory sequences, selected on the basis of the host cells to be used for expression, which is operatively linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably linked" is intended to mean that the nucleotide sequence of interest is linked to the regulatory sequence(s) in a manner which allows for expression - 19 -
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell). The term "regulatory sequence" is intended to includes promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are described, for example, in Goeddel; Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990). Regulatory sequences include those which direct constitutive expression of a nucleotide sequence in many types of host cell and those which direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the design of the expression vector may depend on such factors as the choice of the host cell to be transformed, the level of expression of protein desired, etc. The expression vectors of the invention can be introduced into host cells to thereby produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., SIXl proteins, mutant forms of SIXl, fusion proteins, etc.). The recombinant expression vectors of the invention can be designed for expression of SIXl in prokaryotic or eukaryotic cells. For example, SIXl can be expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990). Alternatively, the recombinant expression vector may be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.
Expression of proteins in prokaryotes is most often carried out in E. coli with vectors containing constitutive or inducible promotors directing the expression of either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded therein, usually to the amino terminus of the recombinant protein. Such fusion vectors typically serve three purposes: 1) to increase expression of recombinant protein; 2) to increase the solubility of the recombinant protein; and 3) to aid in the purification of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the recombinant protein to enable separation of the recombinant protein from - 20 - the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors include pGEX (Pharmacia Biotech Ine; Smith, D.B. and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the target recombinant protein. In a preferred embodiment, exemplified in Example 3, the coding sequence of the mature form of SIXl (i.e., encompassing amino acids 1-284) is cloned into a pGEX expression vector to create a vector encoding a fusion protein comprising, from the N- terminus to the C-terminus, GST-thrombin cleavage site-SIXl. The fusion protein can be purified by affinity chromatography using glutathione-agarose resin. Recombinant SIXl unfused to GST can be recovered by cleavage of the fusion protein with thrombin.
Examples of suitable inducible non-fusion E. coli expression vectors include pTrc (Amann et al, (1988) Gene 69:301-315) and pET l id (Studier et al, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 60-89). Target gene expression from the pTrc vector relies on host RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene expression from the pET l id vector relies on transcription from a T7 gnlO-lac fusion promoter mediated by a coexpressed viral RNA polymerase (T7 gnl). This viral polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident λ prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 promoter.
One strategy to maximize recombinant protein expression in E. coli is to express the protein in a host bacteria with an impaired capacity to proteolytically cleave the recombinant protein (Gottesman, S., Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, California (1990) 119-128). Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the individual codons for each amino acid are those preferentially utilized in E. coli (Wada et al, (1992) Nuc. Acids Res. 20:2111-2118). Such alteration of nucleic acid sequences of the invention can be carried out by standard DNA synthesis techniques. - 21 -
In another embodiment, the SIXl expression vector is a yeast expression vector. Examples of vectors for expression in yeast S. cerivisae include pYepSecl (Baldari. et al, (1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933- 943), pJRY88 (Schultz et al, (1987) Gene 54:113-123), and pYES2 (Invitrogen Corporation, San Diego, CA).
Alternatively, SIXl can be expressed in insect cells using baculovirus expression vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al, (1983) Mol. Cell Biol. 3:2156- 2165) and the pVL series (Lucklow, V.A., and Summers, M.D., (1989) Virology 170:31- 39).
In yet another embodiment, a nucleic acid of the invention is expressed in mammalian cells using a mammalian expression vector. Examples of mammalian expression vectors include pCDM8 (Seed, B., (1987) Nature 329:840) and pMT2PC (Kaufman et al. (1987), EMBO J. 6:187-195). When used in mammalian cells, the expression vector's control functions are often provided by viral regulatory elements. For example, commonly used promoters are derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40.
Another aspect of the invention pertains to recombinant host cells into which a recombinant expression vector of the invention has been introduced. The terms "host cell" and "recombinant host cell" are used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
A host cell may be any prokaryotic or eukaryotic cell. For example, SIXl protein may be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those skilled in the art. - 22 -
Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation. Suitable methods for transforming or transfecting host cells can be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory press (1989)), and other laboratory manuals. For stable transfection of mammalian cells, it is known that, depending upon the expression vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify and select these integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host cells along with the gene of interest. Preferred selectable markers include those which confer resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a selectable marker may be introduced into a host cell on the same vector as that encoding SIXl or may be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other cells die).
A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can be used to produce (i.e., express) SIXl protein. Accordingly, the invention further provides methods for producing SIXl protein using the host cells of the invention. In one embodiment, the method comprises culturing the host cell of invention (into which a recombinant expression vector encoding SIXl has been introduced) in a suitable medium until SIXl is produced. In another embodiment, the method further comprises isolating SIXl from the medium or the host cell. Such an isolated SIXl protein can be used, for example, to raise antibodies to a SIXl protein for use in the diagnostic methods of the present invention (described further below). - 23 -
III. Isolated SIXl Proteins and Anti-SIXl Antibodies
Another aspect of the invention pertains to isolated SIXl proteins, and biologically active portions thereof, as well as peptide fragments suitable as immunogens to raise anti-SIXl antibodies. The invention provides an isolated preparation of SIXl, or a biologically active portion thereof. An "isolated" protein is substantially free of cellular material or culture medium when produced by recombinant DNA techniques, or chemical precursors or other chemicals when chemically synthesized. In a preferred embodiment, the SIXl protein has an amino acid sequence shown in SEQ ID NO: 2. In other embodiments, the SIXl protein is substantially homologous to SEQ ID NO: 2 and retains the functional activity of the protein of SEQ ID NO: 2 yet differs in amino acid sequence due to natural allelic variation or mutagenesis, as described in detail in subsection I above. Accordingly, in another embodiment, the SIXl protein is a protein which comprises an amino acid sequence at least 60% homologous to the amino acid sequence of SEQ ID NO: 2 and retains a SIXl activity. Preferably, the protein is at least 70% homologous to SEQ ID NO: 2, more preferably at least 80%) homologous to SEQ ID NO: 2, even more preferably at least 90%) homologous to SEQ ID NO: 2, and most preferably at least 95%> homologous to SEQ ID NO: 2.
An isolated SIXl protein may comprise the entire amino acid sequence of SEQ ID NO: 2 (i.e., amino acids 1-284), a biologically active portion thereof, or an immunogenic portion thereof. For example, an immunogenic portion of SIXl can comprise portion of SIXl in which hydrophobic, and thus predicted to comprise an surface portion of a SIXl protein. An immunogenic portion can also comprise all or a portion of a SIXl protein which is unique to SIXl (e.g., does not share significant homology with other homeobox proteins, thereby reducing the risk of cross-reactivity with non-SIXl proteins). Accordingly, in one embodiment, an immunogenic portion of a SIXl protein includes all or a portion of human SIXl (SEQ ID NO:2) from about amino acids 183 to 284. Moreover, other biologically active and/or immunogenic portions, in which other regions of the protein are deleted, can be prepared by recombinant techniques and evaluated for SIXl activity as described in detail above or alternatively, tested for immunogenicity. - 24 -
SIX1 proteins are preferably produced by recombinant DNA techniques. For example, a nucleic acid molecule encoding all or a portion of the protein is cloned into an expression vector (as described above), the expression vector is introduced into a host cell (as described above) and the SIXl protein or portion thereof is expressed in the host cell. The SIXl protein or portion thereof can then be isolated from the cells by an appropriate purification scheme using standard protein purification techniques. In an exemplary embodiment, a nucleic acid molecule comprising nucleotides 1 to 1130 of SEQ ID NO:l is cloned into an expresion vector. In another embodiment, a nucleic acid molecule comprising nucleotides 822 to 1130 is cloned into an expression vector. Alternative to recombinant expression, a SIXl protein or polypeptide can be synthesized chemically using standard peptide synthesis techniques. Moreover, native SIXl protein can be isolated from cells (e.g., cultured human mammary adenocarcinoma cells), for example using an anti-SIXl antibody (discussed further below).
The invention also provides SIXl fusion proteins. As used herein, a SIXl "fusion protein" comprises a SIXl polypeptide operatively linked to a non-SIXl polypeptide. A "SIXl polypeptide" refers to a polypeptide having an amino acid sequence corresponding to SIXl, whereas a "non-SIXl polypeptide" refers to a polypeptide having an amino acid sequence corresponding to another protein. Within the fusion protein, the term "operatively linked" is intended to indicate that the SIXl polypeptide and the non-SIXl polypeptide are fused in-frame to each other. The non- SIXl polypeptide may be fused to the N-terminus or C-terminus of the SIXl polypeptide. In one embodiment, a non-SIXl polypeptide (e.g., GST) can be fused to the C-terminus of the SIXl polypeptide (e.g., amino acids 183 to 284 of SEQ IDNO:2). Such fusion proteins can facilitate the purification of recombinant SIXl (see, for example, the fusion proteins described in Example 5). In another embodiment, the fusion protein is a SIXl protein containing a heterologous signal sequence at its N- terminus. In certain host cells (e.g., mammalian host cells), expression and/or secretion of SIXl may be increased through use of a heterologous signal sequence.
Preferably, a SIXl fusion protein of the invention is produced by standard recombinant DNA techniques. For example, DNA fragments coding for the different polypeptide sequences are ligated together in-frame in accordance with conventional - 25 -
techniques, for example employing blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In one embodiment, a DNA fragment encoding a non-SIXl polypeptide (e.g., GST) is ligated in frame with a DNA fragment encoding a portion of SIXl (e.g., including all or a portion of SEQ ID NO:2, for example, amino acids 183 to 284 of SEQ IDNO:2). In another embodiment, the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using anchor primers which give rise to complementary overhangs between two consecutive gene fragments which can subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for example, Current Protocols in Molecular Biology, eds. Ausubel et al. John Wiley & Sons: 1992). Moreover, many expression vectors are commercially available that already encode a fusion moiety (e.g., a GST polypeptide). A SIXl -encoding nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked in-frame to the SIXl protein.
An isolated SIXl protein, or fragment thereof, can be used as an immunogen to generate antibodies that bind SIXl using standard techniques for polyclonal and monoclonal antibody preparation. The full-length SIXl protein can be used or, alternatively, the invention provides antigenic peptide fragments of SIXl for use as immunogens. The antigenic peptide of SIXl comprises at least 8 amino acid residues of the amino acid sequence shown in SEQ ID NO: 2 and encompasses an epitope of SIXl such that an antibody raised against the peptide forms a specific immune complex with SIXl . Preferably, the antigenic peptide comprises at least 10 amino acid residues, more preferably at least 15 amino acid residues, even more preferably at least 20 amino acid residues, and most preferably at least 30 amino acid residues. Antigenic polypeptides comprising at least 50, 100, 150, 200 or 250 amino acid residues are also within the scope of the present invention. For example, an antigenic polypeptide which includes 102 amino acids of SIXl (e.g., amino acids 183 to 284 of SEQ ID NO:2) is within the scope of the present invention. Preferred epitopes encompassed by the antigenic peptide are regions of SIXl that are located on the surface of the protein, e.g., hydrophilic - 26 -
regions. Other preferred antigenic polypeptides include portions of SIXl which do not share significant homology with other homeobox proteins (e.g., non-SIXl proteins). A SIXl immunogen typically is used to prepare antibodies by immunizing a suitable subject, (e.g., rabbit, goat, mouse or other mammal) with the immunogen. An appropriate immunogenic preparation can contain, for examples, recombinantly expressed SIXl protein or a chemically synthesized SIXl peptide. The preparation can further include an adjuvant, such as Freund's complete or incomplete adjuvant, or similar immunostimulatory agent. Immunization of a suitable subject with an immunogenic SIXl preparation induces a polyclonal anti-SIXl antibody response. The immunogen can further include a portion of non-SIXl polypeptide, for example, a polypeptide useful to facilitate purification.
Accordingly, another aspect of the invention pertains to anti-SIXl antibodies. The term "antibody" as used herein refers to immunoglobulin molecules and immunologically active portions of immunoglobulin molecules, i.e., molecules that contain an antigen binding site which specifically binds (immunoreacts with) an antigen, such as SIXl . The invention provides polyclonal and monoclonal antibodies that bind SIXl. The term "monoclonal antibody" or "monoclonal antibody composition", as used herein, refers to a population of antibody molecules that contain only one species of an antigen binding site capable of immunoreacting with a particular epitope of SIXl. A monoclonal antibody composition thus typically displays a single binding affinity for a particular SIXl protein with which it immunoreacts.
Polyclonal anti-SIXl antibodies can be prepared as described above by immunizing a suitable subject with a SIXl immunogen. The anti-SIXl antibody titer in the immunized subject can be monitored over time by standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) using immobilized SIXl. If desired, the antibody molecules directed against SIXl can be isolated from the mammal (e.g., from the blood) and further purified by well known techniques, such as protein A chromatography to obtain the IgG fraction. At an appropriate time after immunization, e.g., when the anti-SIXl antibody titers are highest, antibody-producing cells can be obtained from the subject and used to prepare monoclonal antibodies by standard techniques, such as the hybridoma technique originally described by Kohler and - 27 -
Milstein (1975, Nature 256:495-497) (see also, Brown et al. (1981) J. Immunol 127:539- 46; Brown et al. (1980) J Biol Chem 255:4980-83; Yeh et al. (1976) PNAS 76:2927-31 ; and Yeh et al. (1982) Int. J. Cancer 29:269-75), the more recent human B cell hybridoma technique (Kozbor et al. (1983) Immunol Today 4:72), the EBV-hybridoma technique (Cole et al. (1985), Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or trioma techniques. The technology for producing monoclonal antibody hybridomas is well known (see generally R. H. Kenneth, in Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing Corp., New York, New York (1980); E. A. Lerner (1981) Yale J. Biol. Med., 54:387-402; M. L. Gefter et al. (1977) Somatic Cell Genet., 3:231-36). Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes (typically splenocytes) from a mammal immunized with a SIXl immunogen as described above, and the culture supernatants of the resulting hybridoma cells are screened to identify a hybridoma producing a monoclonal antibody that binds SIXl. Any of the many well known protocols used for fusing lymphocytes and immortalized cell lines can be applied for the purpose of generating an anti-SIXl monoclonal antibody (see, e.g., G. Galfre et al. (1977) Nature 266:55052; Gefter et al Somatic Cell Genet. , cited supra; Lerner, Yale J. Biol. Med. , cited supra; Kenneth, Monoclonal Antibodies, cited supra). Moreover, the ordinary skilled worker will appreciate that there are many variations of such methods which also would be useful. Typically, the immortal cell line (e.g., a myeloma cell line) is derived from the same mammalian species as the lymphocytes. For example, murine hybridomas can be made by fusing lymphocytes from a mouse immunized with an immunogenic preparation of the present invention with an immortalized mouse cell line. Preferred immortal cell lines are mouse myeloma cell lines that are sensitive to culture medium containing hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a number of myeloma cell lines may be used as a fusion partner according to standard techniques, e.g., the P3-NSl/l-Ag4-l, P3-x63-Ag8.653 or Sp2/O-Agl4 myeloma lines. These myeloma lines are available from the American Type Culture Collection (ATCC), Rockville, Md. Typically, HAT-sensitive mouse myeloma cells are fused to mouse splenocytes using polyethylene glycol ("PEG"). Hybridoma cells resulting from the - 28 -
f sion are then selected using HAT medium, which kills unfused and unproductively fused myeloma cells (unfused splenocytes die after several days because they are not transformed).
Hybridoma cells producing a monoclonal antibody of the invention are detected by screening the hybridoma culture supernatants for antibodies that bind SIXl , e.g. , using a standard ELISA assay.
Alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal anti-SIXl antibody can be identified and isolated by screening a recombinant combinatorial immunoglobulin library (e.g., an antibody phage display library) with SIXl to thereby isolate immunoglobulin library members that bind SIXl . Kits for generating and screening phage display libraries are commercially available (e.g. , the Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01 ; and the Stratagene SurfZAP™ Phage Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents particularly amenable for use in generating and screening antibody display library can be found in, for example, Ladner et al. U.S.
Patent No. 5,223,409; Kang et al. International Publication No. WO 92/18619; Dower et al. International Publication No. WO 91/17271; Winter et al. International Publication WO 92/20791 ; Markland et al. International Publication No. WO 92/15679; Breitling et al. International Publication WO 93/01288; McCafferty et al. International Publication No. WO 92/01047; Garrard et al. International Publication No. WO 92/09690; Ladner et al International Publication No. WO 90/02809; Fuchs et α/. (1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum Antibod Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-1281; Griffiths et al. (1993) EMBO J 12:725-734; Hawkins et al. (1992) J Mol Biol 226:889-896; Clarkson et al. (1991) Nature 352:624-628; Gram et al. (1992) PNAS 89:3576-3580; Gaπad et al (1991) Bio/Technology 9:1373-1377; Hoogenboom et al (1991) Nuc Acid Res 19:4133-4137; Barbas et al (1991) PNAS 88:7978-7982; and McCafferty et al. Nature (1990) 348:552-554.
Additionally, recombinant anti-SIXl antibodies, such as chimeric and humanized monoclonal antibodies, comprising both human and non-human portions, which can be made using standard recombinant DNA techniques, are within the scope of the invention. Such chimeric and humanized monoclonal antibodies can be produced by - 29 -
recombinant DNA techniques known in the art, for example using methods described in Robinson et al. International Patent Publication PCT/US86/02269; Akira, et al. European Patent Application 184,187; Taniguchi, European Patent Application 171,496; Morrison et al. European Patent Application 173,494; Neuberger et al. PCT Application WO 86/01533; Cabilly et al. U.S. Patent No. 4,816,567; Cabilly et al. European Patent Application 125,023; Better et al. (1988) Science 240:1041-1043; Liu et al. (1987) PNAS 84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521-3526; Sun et al. (1987) PNAS 84:214-218; Nishimura et al. (1987) Cane. Res. 47:999-1005; Wood et al. (1985) Nature 314:446-449; and Shaw et al. (1988) J. Natl Cancer Inst. 80: 1553-1559); Morrison, S. L. (1985) Science 229:1202-1207; Oi et al. (1986) BioTechniques 4:214; Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature 321 :552-525; Verhoeyan et al. (1988) Science 239:1534; and Beidler et α/. (1988) J. Immunol. 141 :4053-4060.
An anti-SIXl antibody (e.g., monoclonal antibody) can be used to isolate SIXl by standard techniques, such as affinity chromatography or immunoprecipitation. An anti-SIXl antibody can facilitate the purification of natural SIXl from cells and of recombinantly produced SIXl expressed in host cells. Moreover, an anti-SIXl antibody can be used to detect SIXl protein (e.g., in a cellular lysate or cell supernatant). Detection may be facilitated by coupling (i.e., physically linking) the antibody to a detectable substance. Examples of detectable substances include various enzymes, prosthetic groups, fluorescent materials, luminescent materials and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, β- galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes include streptavidin biotin and avidin/biotin; examples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes luminol; and examples of suitable radioactive material include 125!, 13 1I, 35S or 3H. - 30 -
IV. Uses and Methods of the Invention
As described in more detail in Example 2, SIXl expression correlates with tumorogenesis and metastasis (e.g., breast tumorigenesis and metastasis) Accordingly, detection of SIXl protein or nucleic acid molecules provides is useful for diagnosing cancer and monitoring both tumor progression and metastasis. Furthermore, inhibition of SIXl expression may result in inhibition of cancer and tumor metastasis ,(e.g., breast, colon, and lung cancer). The isolated nucleic acid molecules of the invention can be used to inhibit SIXl protein expression (e.g., antisense SIXl nucleic acid molecules), to detect SIXl mRNA (e.g., SIXl nucleic acid probes based on the nucleotide sequence of SEQ ID NO:l) and to modulate SIXl activity, as discussed further below. Moreover, the anti-SIXl antibodies of the invention can be used to detect and isolate SIXl protein and modulate SIXl activity, also discussed further below.
The invention provides a method for detecting the presence of SIXl in a biological sample. The method involves contacting the biological sample with an agent capable of detecting SIXl protein or nucleic acid molecules (e.g., SIXl mRNA) such that the presence of SIXl is detected in the biological sample. A preferred agent for detecting SIXl mRNA is a labeled or labelable nucleic acid probe capable of hybridizing to SIXl mRNA. The nucleic acid probe can be, for example, the full-length SIXl cDNA of SEQ ID NO: 1, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent conditions to SIXl mRNA.
A preferred agent for detecting SIXl protein is a labeled or labelable antibody capable of binding to SIXl protein. Antibodies can be polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used. The term "labeled or labelable", with regard to the probe or antibody, is intended to encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a detectable substance to the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity with another reagent that is directly labeled. Examples of indirect labeling include detection of a primary antibody using a fluorescently labeled secondary antibody and end-labeling of a DNA probe with biotin such that it can be detected with fluorescently labeled streptavidin. - 31 -
As used herein, the term "isolated", when used in the context of a biological sample, is intended to indicate that the biological sample has been removed from the subject. In one embodiment, a biological sample comprises a sample which has been isolated from a subject and is subjected to a method of the present invention without further processing or manipulation subsequent to its isolation. In another embodiment, the biological sample can be processed or manipulated subsequent to being isolated and prior to being subjected to a method of the invention. For example, a sample can be refrigerated (e.g., stored at 4°C), frozen (e.g., stored at -20°C, stored at -135°C, frozen in liquid nitrogen, or cryopreserved using any one of many standard cryopreservation techniques known in the art). Furthermore, a sample can be purified subsequent to isolation from a subject and prior to subjecting it to a method of the present invention. As used herein, the term "purified" when used in the context of a biological sample, is intended to indicate that at least one component of the isolated biological sample has been removed from the biological sample such that fewer components, and consequently, purer components, remain following purification. For example, a serum sample can be separated into one or more components using centrifugation techniques known in the art to obtain partially-purified sample preparation. Furthermore, it is possible to purify a biological sample such that substantially only one component remains. For example, a tissue or tumor sample can be purified such that substantially only the protein or mRNA component of the biological sample remains.
Furthermore, it may be desirable to amplify a component of a biological sample such that detection of the component is facilitated. For example, the mRNA component of a biological sample can be amplified (e.g., by RT-PCR) such that detection of SIXl mRNA is facilitated. As used herein, the term "RT-PCR" (an abbreviation for reverse transcriptase-polymerase chain reaction) involves subjecting mRNA to the reverse transcriptase enzyme results in the production of cDNA which is complementary to the base sequences of the mRNA. Large amounts of selected cDNA can then be produced by means of the polymerase chain reaction which relies on the action of heat-stable DNA polymerase for its amplification action. Alternative amplification methods include: self sustained sequence replication (Guatelli, J.C. et al, 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh, D.Y. et al, 1989, - 32 -
Proc. Natl. Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi, P.M. et all, 1988, Bio/Technology 6:1197), or any other nucleic acid amplification method, followed by the detection of the amplified molecules using techniques well known to those of skill in the art. These detection schemes are especially useful for the detection of nucleic acid molecules if such molecules are present in very low numbers.
The detection methods of the present invention can be used to detect SIXl protein or nucleic acid molecules in a biological sample in vitro as well as in vivo. For example, in vitro techniques for detection of SIXl mRNA include Northern hybridizations and in situ hybridizations. In vitro techniques for detection of SIXl DNA include Southern hybridizations. In vitro techniques for detection of SIXl protein include enzyme linked immunosorbent assays (ELI S As), Western blots, immunoprecipitations and immunofluorescence. Alternatively, SIXl protein can be detected in vivo in a subject by introducing into the subject a labeled anti-SIXl antibody. For example, the antibody can be labeled with a radioactive marker whose presence and location in a subject can be detected by standard imaging techniques.
In a preferred embodiment of the detection method, the biological sample is a tissue sample or tumor sample. The tissue sample or tumor sample may comprise tissue or a suspension of cells. A tissue section, for example, a freeze-dried, parafin- embedded, or fresh frozen section of tissue removed from a patient, or a section of a tumor biopsy can be used as the biological sample. Moreover, the sample may include a biological fluid obtained from a subject (e.g., blood, ascites, pleural fluid or spinal fluid). Following collection, tissue or tumor samples can be stored at temperatures below -20°C to prevent degradation until the detection method is to be performed. In one embodiment, a biological sample in which SIXl mRNA or protein is to be detected is a mammary tumor sample. In another embodiment, a biological sample in which SIXl mRNA is to be detected is, for example, a lung, colon, or cervical tumor.
The detection methods of the invention described above can be used as the basis for a method of diagnosis of a subject with a tumor (e.g., a breast tumor), can be used as the basis for a method of monitoring the progression of cancer in a subject, or can be used as the basis for a method of prognosing a person at risk for developing a cancer. As described in further detail in Example 2, the expression pattern of SIXl mRNA can - 33 -
differ between normal cells and malignant cells and between primary tumor cells and metastatic tumor cells. For example, SIXl mRNA levels are detectable in tissues (e.g., skeletal muscle, pituitary gland, salivary gland, lung and trachea) but was not detectable in normal mammary tissue. SIXl mRNA levels were elevated in 44% of primary breast tumors analysed and further elevated in 90%) of metastatic lesion examined.
In one embodiment, the invention features a method of determining the metastatic potential of a tumor which involves contacting a sample of the tumor (or isolate) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the tumor sample or isolate, thereby determining the metastatic potential of the tumor. Another aspect of the invention features a prognostic method for determining whether a subject is at risk for developing cancer which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby determining whether the subject is at risk for developing cancer. Yet another aspect of the invention features a method of diagnosing cancer in a subject which involves contacting a biological sample obtained from the subject (or isolate of the sample) with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing cancer in the subject. In another embodiment, the diagnostic methods of the present invention further involve determining the level of SIXl polypeptide or mRNA in the sample or isolate. In yet another embodiment, the diagnostic methods of the present invention involve comparing the level of SIXl polypeptide or mRNA in the sample or isolate with the level of SIXl polypeptide or mRNA in a control sample. In yet another embodiment, the diagnostic or prognostic methods further include the step of forming a prognosis or forming a diagnosis.
In one embodiment, the control is from normal cells and the tumor sample is a suspected primary tumor sample. Primary malignancy of the tumor cell sample can be diagnosed based on an increase in the level of expression of SIXl mRNA or protein in the tumor sample as compared to the control. In another embodiment, the control is from normal cells or a primary tumor and the tumor sample is a suspected metastatic - 34 -
tumor sample. Acquisition of the metastatic phenotype by the suspected metastatic tumor sample can be diagnosed based on an increase in the level of SIXl mRNA or protein in the tumor sample compared to the control.
The prognostic methods of the present invention are of particular utility in the early detection and treatment of breast cancer. It will be appreciated by those skilled in the art that breast cancer may not be as amenable to early detection as, for instance, cervical cancer, due to the lack of cytomorphologic screening methods available (e.g., pap smears for the detection of cellular abnormalities of the cervix). Accordingly, the prognostic methods of the present invention feature, for example, careful histological examination of breast biopsies (e.g. , biopsies of pre-malignant or pre-invasive lesions, atypical hyperplasias and/or carcinoma in situ). Upon the morphological detection of such a lesion, hyperplasia or carcinoma, it may be desireable to utilize an amplification step of the present invention to detect, for example, SIX nucleic acid.
The invention also encompasses kits for detecting the presence of SIXl in a biological sample (e.g., a tumor sample). For example, the kit can comprise a labeled or labelable agent capable of detecting SIXl protein or mRNA in a biological sample and a means for determining the amount of SIXl in the sample. The agent can be packaged in a suitable container. The kit can further comprise a means for comparing the amount of SIXl in the sample with a standard and/or can further comprise instructions for using the kit to detect SIXl mRNA or protein.
Another aspect of the invention pertains to methods of modulating SIXl activity associated with a cell, e.g., for therapeutic purposes. SIXl activity "associated with a cell" is intended to include SIXl activity within the cell and/or within the nucleus of the cell. In particular, HSIXl is a homeobox gene that is diferentially expressed in the cell cycle and whose overexpression leads to an abbrogation of the DNA damage-induced G2 cell cycle checkpoint. Accordingly, in a preferred embodiment, the invention pertains to methods of modulating SIXl activity in a subject afflicted with a disease associated with G2 checkpoint control. Such diseases include, but are not limited to Ataxia telangiectasia (Scott et al. (1994) Int. J. Radiat. Biol. 66, Suppl., 157s-163s and Paules et al. (1995) Cancer Res. 55:1763-1773), Li Fraumeni (Paules et al supra), Bloom's Syndrome (Davey et α/.(1998) Mol. Cell. Biol. 18:2721-2728), and Fanconi - 35 -
Anemia (D' Andrea and Kupfer (1996) Blood 88:1019-1025), as well as other diseases which demonstrate cancer susceptability. The modulatory method of the invention involves contacting the cell with an agent that modulates SIXl activity associated with the cell. In one embodiment, the agent stimulates SIXl activity. Examples of such stimulatory agents include active SIXl protein and a nucleic acid molecule encoding SIXl that has been introduced into the cell. In another embodiment, the agent inhibits the SIXl activity. Examples of such inhibitory agents include antisense SIXl nucleic acid molecules and anti-SIXl antibodies. These modulatory methods can be performed in vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g, by administering the agent to a subject).
Inhibition of SIXl activity is desirable in situations in which SIXl is abnormally upregulated and/or in which decreased SIXl activity is likely to have a beneficial effect. One example of such a situation is in tumor cells, and in particular in inhibiting or preventing tumor cell metastatis. As demonstrated in Example 2, acquisition of a metastatic phenotype by tumor cells is associated with upregulation of SIXl expression. Thus, decreasing the expression and/or activity of SIXl in or around the tumor cells is expected to inhibit the development or progression of the metastatic phenotype. Accordingly, in a specific embodiment, the invention provides a method for inhibiting development or progression of a metastatic phenotype in a tumor cell comprising contacting the tumor cell with an agent which inhibits the amount of SIXl in the tumor cell. The term "in the tumor cell" is intended to include SIXl within the cell and/or SIXl within the nucleus of the cell. For example, since SIXl is predicted to be a transcription factor, it is likely that it exerts tumor suppressive effects nuclearly. The agent that inhibits SIXl in the tumor cell can be an antisense SIXl nucleic acid or a SIXl antibody. Thus, a SIXl inhibitory agent, preferably in a pharmaceutically acceptable carrier, can be administered to a tumor-bearing subject by an appropriate route to inhibit the development or progression of the metastatic phenotype of the tumor. Suitable routes of administration include intravenous, intramuscular or subcutaneous injection, injection directly into the tumor site or implantation of a device containing a slow-release formulation. The SIXl inhibitory agent preparation can also be incorporated into liposomes or other carrier vehicles to facilitate delivery to the tumor - 36 -
site. A non-limiting dosage range is 0.001 to 100 mg/kg/day, with the most beneficial range to be determined by routine pharmacological methods.
Alternative to administration of a SIXl inhibitory agent, the development or progression of the metastatic phenotype can be inhibited in tumor cells by modifying them to express a SIXl inhibitory agent (e.g., a SIXl antisense nucleic acid molecule) by introducing into the tumor cells a SIXl antisense nucleic acid expresssion vector.
Expression vectors suitable for gene therapy, including retroviral and adenoviral vectors carrying appropriate regulatory elements, can be used to deliver the SIXl antisense nucleic acids to the tumor cells. In addition to tumor therapy, there are other situations in which modulating SIXl activity may be desirable. As demonstrated in Example 3, SIXl overexpression in cells results in abbrogation of the G2 cell cycle checkpoint. Accordingly, SIXl inhibition may be desireable to reconstitute growth arrest in a population of cells, such that DNA repair can take place. This invention is further illustrated by the following examples which should not be construed as limiting. The contents of all references, patents and published patent applications cited throughout this application are hereby incorporated by reference.
EXAMPLE 1: Identification of HSIXl as a Cell-Cycle Regulated Gene by Differential Display Methodology
To identify genes differentially expressed in S-phase of the 21PT mammary adenocarcinoma cell line, cells were first synchronized with mimosine. Briefly, cell synchrony was performed as described (Alpan and Pardee (1996) Cell Growth Differ. 7:893-901); however, 150μM mimosine was used rather than 400μM. S phase syncrhonized cells were released from late Gl/S phase arrest, S-phase progression was monitored by ^H-thymidine incorporation at hourly intervals (Keyomarsi et αl (1991) Cancer Res. 51:3602-3609) (Figure 2A), and RNA was isolated from duplicate samples to perform differential display. Briefly, differential display was performed with a two- step polymerase chain reaction (PCR) and the LHA series of primers as described (Martin et al (1996) in Methods in Molecular Biology-Differential Display Methods and Protocols, eds. Pardee & Liang (Humana, Totowa, NJ), Vol. 85, pp.77-85). The - 37 -
anchored and the arbitrary primers which led to detection of HSIXl were LHTπC (TGC CGA AGC T„C) (SEQ ID NO:3) and LHA6 (TGC CGA AGC TTG CAG CGA) (SEQ ID NO:4). Band isolation and direct sequencing of the DD band were performed as described (Martin et al, supra). Figure 2B demonstrates increased expression of a cDNA band labeled as 6A.
Direct sequencing of 6A revealed its identify as HSIXl, a homeobox gene whose mouse counterpart has been implicated in the development of limb tendons (Oliver et al. (1995) Development 121 :693-705) and that was recently cloned from human adult skeletal muscle (Boucher et al. (1996) Genomics 33:140-142). A Northern blot probed with HSIX 1 cDN A (cloned from 21 PT cells by RT-PCR) confirmed its differential expression in S-phase (Figure 2C). (Briefly, to clone HSIXl cDNA for use as a probe, primers were designed to the 5' end (5'-ATG TCG ATG CTG CCG TCG TTT-3') (SEQ ID NO:5) and 3' end (5'-CAC TTA GGA CCC CAA GTC CAC-3') (SEQ ID NO:6) of the HSIXl cDNA. Reverse transcription (RT) reactions were performed with 0.2 μg RNA template, 25 μM dNTPs, 1 mM DTT, 5 μM oligo dT12.lg, and lx reverse transcriptase buffer (50 mM Tris-HCl, pH 8.3, 75 mM KC1, 3 mM >MgC12). The reaction conditions were as follows: 65°C, 5 min.; 37°C, 60 min. (5 min. into this cycle 200 units SuperscriptTM II was added to each reaction); 95 °C, 5min. PCR conditions were as follows: (94°C, 45 sec; 69°C, 45 sec.;72°C, 45 sec.} x 25, followed by an extension at 72°C, 5 min. The PCR products were subcloned utilizing the TA cloning system (InVitrogen). (Sequencing was performed on multiple clones as it is known that PCR may introduce point mutations.) For Northern blot analysis, RNA was isolated with TRIzol reagent and analysis was performed according to Maniatis et al. (1989) Molecular Cloning (Cold Spring Harbor Lab. Press, Cold Sprng Harbor, NY), 2nd Ed. Levels of HSIXl were very low in the first half of S-phase, and increased as cells completed S phase. Similar experiments in a related cell line (21MT1), also revealed cell cycle-specific expression of the gene indicating that HSIXl lays a role at or near the end of the cell cycle. - 38 -
Additional evidence supporting a function of HSIXl in cell cycle control can be obtained by comparison of SIX to the Drosophila sine oculis (so) gene. The HSIXl protein displays approximately 98%> sequence homology to mouse SIXl (Boucher et al. (1996) Genomics 33:140-142) which was first cloned by virtue of its homology to the Drosophila gene sine oculis (so) (Oliver et al. (1995) Development 121 :693-705). Mouse SIXl is 62% similar to the Drosophila gene, and 87% similar if sequences C- terminal to the homeodomain are excluded (supra). So plays a role in the development of the fly visual system. Interestingly, Drosophila eye development involves coordinate regulation of cell cycle progression and so has been suggested to play a role in the synchronization of the cell cycle because its expression precedes a burst of cell divisions (Cheyette et al (1994) Neuron 12:977-996). In addition, complete loss of function alleles of so are embryonic lethals (supra), suggesting that the gene's expression is important for more than just eye development. These results in conjunction with the cell cycle regulated expression of HSIXl in 21PT cells, suggest that HSIXl plays a role in regulating the onset of mitosis.
Example 2: Expression of HSIXl in Primary Tumors, Metastatic Tumors, and Other Tumor-Derived Cell Lines
For comparison with 21PT mammary carcinoma cells, a Human RNA Master Blot from Clontech™ was probed to determine HSIXl expression in normal human adult mammary tissue as well as its expression pattern in other normal adult andf fetal tissues (as expression of HSIXl and its mouse homolog had previously only been demonstrated in developing mouse limb tendons and in human adult skeletal muscle). The Human RNA Master Blot includes poly A+ RNA from the following tissues: whole brain, amygdala, caudate nucleus, cerebellum, cerebral cortex, frontal lobe, hippocampus, medulla oblongata, occipital lobe, putamen, substantia nigra, temporal lobe, thalamus, subthalamic nucleus, spinal cord, heart, aorta, skeletal muscle, colon, bladder, uterus, prostate, stomach, testis, ovary, pancreas, pituitary gland, adrenal gland, thyroid gland, salivary gland, mammary gland, kidney, liver, small intestine, spleen, thymus, peripheral leukocyte, lymph node, bone marrow, appendix, lung, trachea, placenta, fetal brain, fetal heart, fetal kidney, fetal liver, fetal spleen, fetal thymus, and fetal lung. Yeast total RNA - 39 -
(lOOng), yeast tRNA (lOOng), E. coli rRNA (lOOng), E. coli DNA (lOOng), Poly r(A) (lOOng), human C0t 1 DNA (lOOng), human DNA (lOOng), and human DNA (500ng) were included as controls.
It was determined that normal mammary tissue (pooled from 20 women ages 24- 40 who died of trauma) does not express HSIXl, whereas, expression was confirmed in normal adult skeletal muscle and was also observed in pituitary gland, salivary gland, lung and trachea, with very low levels of expression in the kidney. These data further indicate that expression of HSIXl expression in mammary carcinoma cells is aberrant. 21PT cells, the source of HSIXl, were derived from a patient who had an infiltrating and intraductal mammary adenocarcinoma (Band et al. (1990) Cancer Res. 50:7351-7357). Several other cell lines derived from the same patient include 21NT, 21MT-1, and 21 MT-2. The 21PT and 2 INT cell lines were derived from the primary tumor, whereas the 21MT-1 and 21MT-2 cell lines were established from a metastatic pleural effusion. As shown in Figure 3, HSIXl expression was not detected in a normal breast cell line, 70N (Band and Sager, supra), but was detected in all cell lines derived from the above-mentioned patient. Levels of expression in 21PT and 2 INT cells were approximately 3- and 2-fold less, respectively, than levels in 21MT1 cells, and 10- and 7-fold less, respectively, than levels in 21MT2 cells. (Relative HSIXl expression for each sample was as follows: 70N -0, 21PT ~5, 21NT -7, 21MT1 -14, and 21MT2 -46).
To determine whether HSIXl expression is increased in a significant proportion of primary and metastatic breast cancer cases, 35 human breast biopsy samples were obtained and expression was examined by Northern blot analysis. Northern blot analysis was performed as described in Example 1, except that RNA was isolated from breast tumor specimens by the guanidinium thiocyanate/CsCl method as described in Maniatis et al, supra. Normalization to 36B4 was performed on these samples, as it has been shown to be a good control for breast cancer samples. Figure 3 shows the results with 35 tumor samples examined for HSIXl expression. The results were quantitated and plotted as relative HSIXl expression. While normal adjacent breast, normal breast luminal cells, and normal breast myoepithelial cells demonstrated almost no HSIXl expression (lanes 1-3 respectively), 44%> of the primary tumors (lanes 4-27) and 90%> of - 40 -
the metastatic lesions (lanes 28-37) expressed greater than a three-fold increase in HSIXl mRNA expression over levels in normal adjacent breast.
As the metastatic lesions came from a secondary site, it was necessary to consider the expression levels of the tissue at this site to confirm that the expression observed is from the lesion and not from contaminating adjacent tissue. The 10 metastatic lesions utilized in the analysis came from either the lymph nodes (6 samples), bone/soft tissue (2 samples), the lung (1 sample), or the pleural wall (1 sample). The Human RNA Master blot allowed examination of HSIXl expression in normal lymph nodes and lung. Five of the six lymph node metastases expressed HSIXl, however HSIXl expression was not observed in normal lymph nodes, indicating that the high expression levels in lymph node lesions came from the metastatic tumor itself. Normal lung does express the gene at low levels, but densitometric scanning and subsequent normalization demonstrated that expression in the metastatic lesion from the lung was equal to that in normal adult skeletal muscle, which expresses four times more HSIXl than normal lung. This suggests that HSIXl expression in the lung metastases cannot be explained by normal tissue contaminating the sampled metastasis.
Having demonstrated the high expression in primary breast cancer tumors as well in metastatic tumors, multiple lung cancer cell lines as well as cell lines isolated from a range of other tumors, were analyzed for HSIXl expression by Norhtern blot (2μg poly(A)+ RNA isolated from each human cancer cell line, Clonetech Blot). HSIX 1 expression was detected in mRNA isolated from cells of a colon adenocarcinoma of a patient (termed "SW480 cells") and was significantly enhanced in mRNA from cells isolated from a metastatic lesion of the same patient (termed "SW620 cells"). HSIX mRNA was also detected in cell lines isolated from the following rumor sources: 41
CELL LINE Tumor Source relative HSIXl expression
HL-60 promyelocytic leukemia
HeLa HeLa Cell S3 - +++ cervical carcinoma
K562 chronic myelogenous +++ leukemia
MOLT4* lymphoblastic leukemia
Raji Burkitt's lymphoma +
SW480 colorectal adenocarcinoma +++
A549 lung carcinoma ++
G361 melanoma +
Figure imgf000043_0001
"Inadequate RNA loading may be responsible for absence of HSIXl expression in MOLT4.
Furthermore, HSIXl mRNA was demonstrated to be overexpressed in multiple lung cancer cells. When six pairs of cell line mRNAs were analyzed for HSIXl expression, the first sample being isolated from a lung tumor, and the second sample being isolated from normal adjacent tissue of the same patient, HSIXl expression was found to increase from 1.5 to 10-fold, among the various tumor-derived samples tested, as compared to their normal counterparts. CoUetively, the above-described data indicate that HSIXl is overexpressed in several types of cancer in addition to breast.
EXAMPLE 3: Overexpression of HSIXl in Cells Abrogates the G2 Cell Cycle Checkpoint
To determine if HSIXl plays a role in regulating the cell cycle, the MCF7 mammary carcinoma cell line was transfected with SIXFL, a construct that allows for constitutive expression of the full length wild type HSIXl cDNA, or with the parent vector expressing the chloramphenicol acetyl transferase gene (CAT) as a control. MCF7 cells are mammary carcinoma cells with a lower endogenous HSIXl level than - 42 -
that in 21PT cells. Briefly, MCF7 cells were seeded in 60 mM dishes at 5 x 103 cells/dish and transfected with SIXFL or with pcDNA3.1(CAT) utilizing Superfect (Qiagen). Transfections were performed according to the manufacturers protocol. 24 h after transfections the cells were passaged 1 : 15 in appropriate media containing 600 mg/ml G418. Approximately two weeks later stable transfectants were selected utilizing cloning cylinders and examined for HSIXl expression via Northern blot analysis. For all subsequent analysis, three stable clones expressing HSIXl (HSIXA1, A8, and A13) and two control transfectants (CATB1 and CATB3, expressing pcDNA3.1(CAT)) were examined in the X-Ray irradiation assay. X-Ray irradiation and subsequent FACS analysis of transfected cells was performed as follows: MCF7 transfectants were seeded at 8 x 103 cells/60 mM dish. Approximately 48h later the cells were treated with X-rays (5 or 8 Gy) at a dose rate of 1.25 Gy/min using a Phillips 250 kVp X-ray machine. Sham treated controls as well as irradiated cells were labeled with propidium iodide according to Vindelov et al. (1983) Cytometry 3:323-327, at various time points following irradiation. Experiments were performed singly or in duplicate, and repeated several times. FACS analysis was performed on the Becton Dickinson FACScan utilizing CellQuest (Becton Dickinson) and ModFit (Verity Software) to obtain cell cycle profiles.
In a representative experiment, exponentially growing cells overexpressing HSIXl (the HSIXA13 cell line) showed cell cycle profiles similar to the transfected control (the CATB3 cell line) (HSIXA13: Gl=55.1%, S=26.9%, G2/M=18.1% versus CATB3: Gl=53.2%, S=31.3%, G2/M=15.5%). In contrast, when the cells were irradiated at a dose of 8 Gy to examine the DNA damage-induced G2 cell cycle checkpoint, a marked difference was observed in the G2/M population in HSIXl transfectants versus the CAT controls. In the representative experiment, both HSIXl expressors and CAT controls were arrested in G2 17 h after irradiation, as was expected (HSIXA13: Gl=49.8%, S=6.8%, G2/M=43.5% versus CATB3: Gl=57.1%, S=6.9%, G2/M=36%). However, by 24 h post-irradiation, all cell lines expressing HSIXl had progressed beyond the G2 arrest, whereas the non-expressors remained arrested in G2 (HSIXA13: Gl=75.5%, S=4.9%, G2/M=19.6% versus CATB3: Gl=60.2%, S=5.6%, G2/M=34.2%>). The CAT control transfectants were blocked in G2 as long as 30 h post- - 43 -
irradiation, whereas the HSIXl transfectants had exited the G2 arrest significantly earlier (HSIXA13: Gl=74.6%, S=5.4%, G2/M=20.0% versus CATB3: Gl=58.7%, S=5.3%, G2/M=36%>). Although absolute percentages varied from experiment to experiment, the passage of HSIXl expressors through G2 following X-ray irradiation was always more rapid than that of the controls. Note that MCF7 cells have an intact Gl/S arrest in response to irradiation, and that cells passing through G2 will subsequently arrest at the Gl/S boundary. A summary of data collected from several experiments is presented as Figure 5. In particular, Figure 5 depicts a summary of the percentage of cells in G2 at various time points before and after irradiation in the transfectants and controls. The data graphed are from one experiment performed at 8 Gy and are representative of several experiments performed at 5 and 8 Gy. Note that cells expressing HSIXl progress through the G2 arrest at a more rapid rate than transfected controls.
In addition to the CAT controls, a cell line transfected with SIXFL (HSIXA2) that did not express HSIXl (possibly due to silencing of the gene upon insertion into the chromosomal DNA) was tested in the X-ray irradiation assay. This cell line behaved as the CAT controls, confirming that HSIXl expression was necessary for abrogation of the G2 cell cycle checkpoint, and that the expression of CAT did not affect the checkpoint in any way. Furthermore, it was generally observed that the growth rates of the HSIXl transfectants and controls in the absence of irradiation were not appreciably different, indicating that the rapid transit of HSIXl transfectants through the G2 arrest following DNA damage was not merely a consequence of faster growth. These data demonstrate that overexpression of HSIXl leads to an abrogation of the DNA damage- induced G2 cell cycle checkpoint.
In yet another series of experiments, 21PT cells were transfected with HSIXl or an HSIXl fusion protein that contains an 8 amino acid epitope tag (XPRESS) for following protein expression. Immunohistochemistry of the latter transfectants with the anti-XPRESS antibody revealed a punctate nuclear localization of the HSIXl protein, as is commonly observed with proteins involved in replication and/or transcription. The result was expected, as HSIXl is a putative transcription factor. Moreover, after passaging the cells over several months, a change in the DNA content of the transfectants was observed. Figure 4 shows a representative FACS analysis on - 44 -
propidium iodide stained SIXFL4 and 6 cells and the His/lac7 and parent 21PT control cells. The DNA content in the HSIXl expressing transfectants doubled in a large proportion of the cells, whereas the doubling was not observed in the control cells passaged over an even longer time period. This analysis demonstrates that HSIX overexpressors become polyploidy over several months in culture, further demonstrating an effect at the level of cell cycle control.
Interestingly, another homeobox gene, HOX11, has recently been found to disrupt the G2 cell cycle checkpoint by interacting with PP2A protein phosphatase (Kawabe et al. (1997) Nature 385:454-458). HOX11 has been implicated in cancer (supra), as it was isolated from a chromosomal breakpoint in human T-cell leukemia (Hatano et al. (1991) Science 253:79-82; Kennedy et al. (1991) Proc. Natl Acad. Sci. USA 88:8900-8904; and Dube et al. (1991) Blood 78:2996-3002). In addition, transgenic mice expressing HOX11 in the thymus demonstrated cell cycle alterations and progression to malignancy (Hatano et al (1992) Curr. Opin. Oncol. 4:24-26). Since HSIXl was originally cloned from a mammary carcinoma cell line (21PT), and since overexpression of this gene leads to altered cell cycle control similar to that seen with HOX11, it can be reasoned that HSIXl may be differentially expressed in cancer.
EXAMPLE 4: Generation of HSIXl-Specific Antibodies The C-terminus of HSIXl (from nucleotide 822 until the stop codon) was amplified and subcloned into the pGEX2T bacterial expression vector to create a GST- HSIX1 fusion protein. Expression of the protein was induced with 0.1 mM IPTG, and the protein was then purified from bacterial extracts utilizing glutathione-sepharose beads. Following purification, the fusion protein was run on a SDS-PAGE gel, very lightly coomassie stained, and extracted from the gel.
The extracted gel piece containing the GST-HSIX1 C-terminus was then injected into rabbits (Spring Valley Laboratories (Woodbine, MD)). Following injection and two boosts, the rabbit was bled and the sera tested for HSIXl antibodies. Following demonstration of HSIXl immunoreactivity, the sera was passed over a GST affinity column (to remove any antibody recognizing the GST portion of the fusion), and was subsequently purified on a GST-HSIX C-terminus column. Affinity purified anti- 45
HSIXl antibody was then tested on cells transfected with HSIXl versus untransfected cells (Figure 6).
EXAMPLE 6: HSIXl Expressing Cells Lead to Larger Tumors When Injected Into Nude Mice
Six nude mice each were injected in the thigh with either 1 x 107 A13 cells (HSIX-transfected) or B3 cells (control transfectants). Tumor size was measured after 4.5 weeks. Tumors from B3 cell-injected mice ranged in size from approximately 35- 140 mm3 whereas tumors from A13 cell-injected mice ranged in size from approximately 110-370 mm3 (Table I).
TABLE I:
Control HSIXl - Transfectants Transfected
53.2 365.9
138.5 112.8
76.2 208J
35.5 194.0
95.4 282.9
91.1 110.6
Figure imgf000047_0001
These data demonstrate the significant tumorigenic activity of HSIXl.
EQUIVALENTS
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be encompassed by the following claims.

Claims

- 46 -
What is claimed:
1. A method for detecting the presence of SIXl in a biological sample comprising contacting the biological sample with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl is detected in the biological sample.
2. The method of claim 1, wherein the biological sample is a tissue sample, or isolate thereof.
3. The method of claim 2, wherein the tissue sample is derived from the pancreas, stomach, liver, secretory gland, bladder, lung, skin, prostate gland, ovary, cervix, uterus, brain, eye, connective tissue, bone, muscles or vasculature.
4. The method of claim 1 , wherein the biological sample is a breast tissue sample, or isolate thereof.
5. The method of claim 1, wherein the biological sample is a tumor sample, or isolate thereof.
6. The method of claim 5, wherein the tumor sample is selected from the group consisting of a lung carcinoma, a colon carcinoma, a cervical carcinoma, an adenocarcinoma, a melanoma, a leukemia, a lymphoma, a glioma, a neuroblastoma, a retinoblastoma, and a sarcoma.
7. The method of claim 5, wherein the tumor sample is a breast tumor sample.
8. The method of claim 1 , wherein the biological sample is a primary tumor sample, or isolate thereof. - 47 -
9. The method of claim 1, wherein the biological sample is a metastatic lesion sample, or isolate thereof.
10. The method of claim 1 , wherein the agent is a labeled or labelable antibody which specifically binds to SIXl polypeptide.
11. The method of claim 10, wherein the antibody specifically binds to a polypeptide selected from the group consisting of:
(a) a polypeptide comprising all or a portion of the polypeptide having the amino acid sequence of SEQ ID NO:2;
(b) a polypeptide comprising at least amino acids 183-284 of SEQ ID NO:2; and
(c) a polypeptide consisting of amino acids 183-284 of SEQ ID NO:2.
12. The method of claim 10, wherein the antibody is a polyclonal antibody.
13. The method of claim 1 , wherein the agent is a labeled or labelable nucleic acid probe capable of hybridizing to SIXl mRNA.
14. The method of claim any of claims 2, 6, 8 or 9, wherein the isolate is RNA.
15. The method of claim 14, wherein the RNA is subjected to an amplification process which results in amplification of SIXl nucleic acid. - 48 -
16. A method of determining the metastatic potential of a tumor comprising contacting a sample of the tumor, or isolate thereof, with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the tumor sample or isolate, thereby determining the metastatic potential of the tumor.
17. A prognostic method for determining whether a subject is at risk for developing cancer comprising contacting a biological sample obtained from the subject, or isolate of the sample, with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby determining whether the subject is at risk for developing cancer.
18. A method for diagnosis of a tumor comprising contacting a tumor sample, or isolate thereof, with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing the tumor.
19. A method of diagnosing cancer in a subject comprising contacting a biological sample obtained from the subject, or isolate of the sample with an agent capable of detecting SIXl polypeptide or mRNA such that the presence of SIXl polypeptide or mRNA is detected in the biological sample or isolate, thereby diagnosing cancer in the subject.
20. The method of any of claims 16 to 19, further comprising determining the level of SIXl polypeptide or mRNA in the sample or isolate.
21. The method of claim 20, further comprising comparing the level of SIX 1 polypeptide or mRNA in the sample or isolate with the level of SIXl polypeptide or mRNA in a control sample. - 49 -
22. A kit for detecting the presence of SIXl in a biological sample, or isolate thereof, comprising a labeled or labelable agent capable of detecting SIXl polypeptide or mRNA in a biological sample
23. The kit of claim 22, further comprising a means for determining the amount of SIXl in the sample.
24. The kit of claim 22, wherein the agent is an antibody capable of specifically binding to SIXl polypeptide.
25. The kit of claim 22, wherein the agent is a nucleic acid probe capable of hybridizing to SIXl mRNA.
26. The kit of claim 24, further comprising a means for comparing the amount of SIXl in the sample with a standard.
28. The kit of claim 24, further comprising directions for use.
PCT/US1999/006679 1998-03-26 1999-03-26 Methods and compositions for diagnosing and predicting the behavior of cancer WO1999049084A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US09/647,115 US7153700B1 (en) 1999-03-26 1999-03-26 Methods and compositions for diagnosing and predicting the behavior of cancer

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US7951198P 1998-03-26 1998-03-26
US60/079,511 1998-03-26

Publications (1)

Publication Number Publication Date
WO1999049084A1 true WO1999049084A1 (en) 1999-09-30

Family

ID=22151026

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/006679 WO1999049084A1 (en) 1998-03-26 1999-03-26 Methods and compositions for diagnosing and predicting the behavior of cancer

Country Status (1)

Country Link
WO (1) WO1999049084A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2101784A2 (en) * 2006-12-11 2009-09-23 The Regents of the University of Colorado Methods for determining prognoses and therapeutic interventions for ovarian carcinomas
US20110223171A1 (en) * 2004-04-26 2011-09-15 Ford Heide L Methods and Compositions for the Diagnosis and Treatment of Cyclin A-1 Associated Conditions
CN116171942A (en) * 2023-03-10 2023-05-30 四川大学 Method for constructing vortex worm optic nerve system deletion model by SIX1 gene interference

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
BOUCHER C A, ET AL.: "CLONING OF THE HUMAN SIX1 GENE AND ITS ASSIGNMENT TO CHROMOSOME 14", GENOMICS, ACADEMIC PRESS, SAN DIEGO., US, vol. 33, 1 January 1996 (1996-01-01), US, pages 140 - 142, XP002919711, ISSN: 0888-7543, DOI: 10.1006/geno.1996.0172 *
OLIVER G, ET AL.: "HOMEOBOX GENES AND CONNECTIVE TISSUE PATTERNING", DEVELOPMENT, THE COMPANY OF BIOLOGISTS LTD., GB, vol. 121, 1 January 1995 (1995-01-01), GB, pages 693 - 705, XP002919712, ISSN: 0950-1991 *
SPITZ F, ET AL.: "EXPRESSION OF MYOGENIN DURING EMBRYOGENESIS IS CONTROLLED BY SIX/SINE OCULIS HOMEOPROTEINS THROUGH A CONSERVED MEF3 BINDING SITE", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, NATIONAL ACADEMY OF SCIENCES, US, vol. 95, 1 November 1998 (1998-11-01), US, pages 14220 - 14225, XP002919713, ISSN: 0027-8424, DOI: 10.1073/pnas.95.24.14220 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110223171A1 (en) * 2004-04-26 2011-09-15 Ford Heide L Methods and Compositions for the Diagnosis and Treatment of Cyclin A-1 Associated Conditions
EP2101784A2 (en) * 2006-12-11 2009-09-23 The Regents of the University of Colorado Methods for determining prognoses and therapeutic interventions for ovarian carcinomas
EP2101784A4 (en) * 2006-12-11 2010-04-14 Univ Colorado Methods for determining prognoses and therapeutic interventions for ovarian carcinomas
US8283119B2 (en) 2006-12-11 2012-10-09 The Regents Of The University Of Colorado, A Body Corporate Methods for determining prognoses and therapeutic interventions for ovarian carcinomas
CN116171942A (en) * 2023-03-10 2023-05-30 四川大学 Method for constructing vortex worm optic nerve system deletion model by SIX1 gene interference

Similar Documents

Publication Publication Date Title
US7153700B1 (en) Methods and compositions for diagnosing and predicting the behavior of cancer
CA2281952C (en) Compounds for immunotherapy of prostate cancer and methods for their use
US7141417B1 (en) Compositions, kits, and methods relating to the human FEZ1 gene, a novel tumor suppressor gene
US6512102B1 (en) Compositions and methods of diagnosis and treatment using casein kinase I
JP2001513886A (en) Compounds for immunodiagnosis of prostate cancer and methods of using them
US20040053262A1 (en) Supressor gene
AU2008259792B2 (en) Cancer related isoforms of components of transcription factor complexes as biomarkers and drug targets
AU2001263952B2 (en) Tumour suppressor and uses thereof
EP1163252A2 (en) Compositions, kits, and methods relating to the human fez1 gene, a novel tumor suppressor gene
US7176294B2 (en) Transcription factor, BP1
WO1998012327A2 (en) Compositions and methods comprising bard1 and other brca1 binding proteins
CA2391805A1 (en) Differentially expressed genes associated with her-2/neu overexpression
WO1999049084A1 (en) Methods and compositions for diagnosing and predicting the behavior of cancer
Den Bakker et al. Evidence for a cytoskeleton attachment domain at the N‐terminus of the NF2 protein
JP2004527240A (en) Polynucleotides useful for regulating the growth of cancer cells
US20020142003A1 (en) Tumor-associated antigen (B345)
US7883896B2 (en) Marker molecules associated with lung tumors
WO1994021791A1 (en) Agents for the prevention and treatment of breast cancer
EP1490398B1 (en) Tumour associated antigens
US20050158737A1 (en) Tumour associated antigens
EP1365030A1 (en) G-protein coupled receptor marker molecules associated with colorectal lesions
WO2002088309A2 (en) Fusion genes associated with acute megakaryoblastic leukemias
CA2365278A1 (en) Tumour-associated antigen
WO2000000503A1 (en) Human az-1 gene, variants thereof and expressed gene products
US20120183551A1 (en) Novel human p53 splice variant displaying differential transcriptional activity

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): CA JP US

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 09647115

Country of ref document: US

122 Ep: pct application non-entry in european phase