DESCRIPTION
HEPATOCELLULAR CARCINOMA-RELATED GENES AND POLYPEPTIDES, AND METHOD FOR DETECTING HEPATOCELLULAR CARCINOMAS
Technical Field
The present invention relates to genes up-regulated in hepatocellular carcinomas, polypeptides encoded by the genes, and a method for detecting hepatocellular carcinomas.
Background Art cDNA microarray technologies have enabled one to obtain comprehensive profiles of gene expression in normal versus malignant cells (Perou, C. M. et al . , Nature. 406: 747-752, 2000; Clark, E. A. et al . , Nature. 406: 532-535, 2000; Okabe, H. et al., Cancer Res. 61 : 2129-2137, 2001). This approach discloses the complex nature of cancer cells , andhelps to improve understanding of carcinogenesis . Identification of genes that are deregulated in tumors can lead to more precise and accurate diagnosis of individual cancers, and to development of novel therapeutic targets (Golub, T. R. etal., Science 286: 531-537, 1999) .
Hepatocellular carcinoma (HCC) is a leading cause of cancer deaths worldwide. In spite of recent progress in therapeutic strategies, prognosis of patients with advanced HCC remains verypoor . Althoughmolecular studies have revealed that alterations of TP53 , CTNNB1 and/or AXIN1 genes can be involved in hepatocarcinogenesis (Perou, C. M. et al. , Nature. 406: 747-752, 2000; Satoh, S. et al., Nat Genet. 24 : 245-250, 2000) , these changes appear to be implicated in only a fraction of HCCs. Accordingly, a ultimate gene that can be a novel diagnostic marker and/or drug target for treatment of cancers has been desired.
The present inventors previously reported that a novel gene, VANGLl , was identified by genome-wide analysis of HCCs (Yagyu, R. et al., International Journal of Oncology 20 :
1173-1178, 2002) .
Disclosure of the Invention
An ob ective of the present invention is to provide genes up-regulated in hepatocellular carcinomas, polypeptides encoded by the genes , and a method for detecting hepatocellular carcinomas .
The present inventors have analyzed expression profiles of HCCs by means of a cDNAmicroarray representing 23,040 genes. These efforts have pinpointed 165 genes, including 69 ESTs, which appear to be up-regulated frequently in cancer tissues compared with corresponding non-cancerous liver cells. The inventors isolated three genes from among the transcripts whose expression was frequently elevated in HCCs . These genes encode products that shared structural features with centaurin-family proteins .
One of the three genes corresponds to an EST, Hs.44579 of a UniGene cluster, and was found to be a novel gene over-expressed at chromosomal band lp36.13. Since an open reading frame of this gene encoded a protein approximately 60% identical to that of development and differentiation enhancing factor 2 (DDEF2) , the inventors termed this gene development and differentiation enhancing factor-like 1 (DDEFL1 ) .
Another gene up-regulated in HCCs corresponds to an EST (Hs. 122730) of a UniGene cluster. The predicted amino acid sequence shared 40% and 63% identity with strabismus (Van Gogh) , which is involved in cell polarity and cell fate decisions in Drosophila , and Van Gogh Like 2 (VANGL2) . Hence, this gene was termed Van Gogh Like 1 (VANGL1 ) . Another gene up-regulated in HCCs was found to be LGN
(GenBank accession number U54999) . LGN protein interacts with alpha subunit of inhibitory heterotrimeric G proteinis (Gαι2) .
Gene transfer of DDEFL1 or LGN promoted proliferation of cells that lacked endogenous expression of either of these genes. Furthermore, reduction of DDEFL1 , VANGL1 or LGN expression by transfection of their specific anti-sense
S-oligonucleotides inhibited the growth of hepatocellular carcinoma cells.
The above findings would contribute to clarify the mechanisms of HCC and to develop new strategies for diagnosis and treatment of HCC.
The present invention specifically provides
(1) an isolated nucleic acid selected from the group consisting of:
(a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO: 1 or NO: 3;
(b) a nucleic acid encoding a polypeptide comprising the amino acid sequence of SEQ ID NO: 2 or NO: 4;
(c) a nucleic acid comprising a strand that hybridizes under high stringent conditions to a nucleotide sequence consisting of SEQ ID NO: 1 or NO: 3 or the complement thereof,
(2) an isolated polypeptide selected from the group consisting of:
(a) a polypeptide encoded by the nucleotide sequence of SEQ ID NO: 1 or NO: 3; (b) a polypeptide comprising the amino acid sequence of SEQ ID NO: 2 or NO: 4;
(c) a polypeptide having at least 65% identity to SEQ ID NO: 2 or NO: 4,
(3) a vector carrying the nucleic acid of (1) , (4) a transformant carrying the nucleic acid of (1) or the vector of (3) ,
(5) a method of producing a polypeptide, the method comprising culturing the transformant of (4) in a culture, expressing the polypeptide in the transformant, and recoverying the polypeptide from the culture,
(6) an antibody that specifically binds to the polypeptide of (2) ,
(7) a method for detecting hepatoceullar carcinoma, the method comprising the steps of: (a) preparing a biological sample from a subject;
(b) measuring the expression level of at least one of
polypeptides selected from the group consisting of the polypeptide of SEQ ID NO: 1, a polypeptide of SEQ ID NO: 3, and the polypeptide of SEQ ID NO: 5;
(c) comparing the expression level with that measured in a non-cancerous sample; and
(d) determining the presence or absence of the cancer in the subject,
( 8 ) a reagent for detecting hepatocellular carcinomas , comprising a nucleic acid comprising a strand that hybridizes under high stringent conditions to a nucleotide sequence consisting of SEQ ID NO: 1, NO: 3, or NO: 5 or the complement thereof,
(9) a reagent for detecting hepatocellular carcinomas , comprising the antibody of (6) , and (10) a method for inhibiting growth of hepatocellular carcinomas, the method comprising introducing at least one of antisense oligonucleotides that hybridizes with the nucleotide sequence of SEQ ID NO: 1, NO: 3, or NO: 5 into hepatocelluar carcinomas . The present invention will be illustrated below in more detail.
Nucleic Acids
The present invention provides genes up-regulated in hepatocellular carcinomas.
The nucleotide sequence and the amino acid sequence of
DDEFLl are shown as SEQ ID NO: 1 and NO: 2, respectively. The complete cDNA of DDEFLl consisted of 4050 nucleotides, with an open reading frame of 2712 uncleotides encoding a
903-amino-acid protein (GenBank accession number AB051853) .
The amino acid sequence of DDFEL1 showed 60% identity to DDEFL2 and 46% identity to DDEF/ASAP1 , and contained an Arf GTPase-activating protein (ArfGAP) domain and two ankyrin repeats .
DDEFLl showed 60% identity to a member of the centaurin family, DDEF2 , a protein that regulates re-organization of the actin cytoskeleton. This suggests that DDEFLl may also play a role in organization of cellular structure (Randazzo, P. A. et al. , The Arf GTPase-activating protein ASAP1 regulates the actin cytoskeleton, Proc. Natl. Acad. Sci. U S A 97 : 4011-4016, 2000) . Because DDEFLl also conserves a PH domain and an ArfGAP motif it appears to be a new member of the centaurin family, regulating Arf small GTPase by means of GAP activity. The PH domain, observed in the majority of molecules belonging to the Dbl family of GEFs , is thought to play a crucial role in relocation of proteins by interacting with specific targetmolecules and/or by directly regulating catalytic domains (Jackson, T. R. et al. , Trends Biochem Sci. 25 : 489-495, 2000; Cerione, R. A. and Zheng, Y. , Curr. Opin. Cell. Biol. 8 : 216-222, 1996; Chardin, P. et al., Nature 384 : 481-484, 1996). Although DDEF2 is localized in peripheral focal adhesions, the inventors found myc-tagged DDEFLl protein to be diffuse in cytoplasm.
Arf proteins have been implicated in important cellular processes such as vesicular membrane transport, maintenance of the integrity of ER and Golgi compartments, and regulation of the peripheral cytoskeleton (Cukierman, E. et al. , Science 270 : 1999-2002, 1995) . Six members of Arf family (Arfl-Arf6) and their functions have been identified so far (Moss, J. and Vaughan, M. , J. Biol. Chem. 270 : 12327-12330, 1995). For example, Arf6 proteins have been implicated as regulators of the cytoskeleton to alter the morphology of focal adhesions and to block spreading of cells , and DDEF2 displays GAP activity toward Arf1. Over-expression of DDEFLl promoted growth promotion and survival of cells under low-serum conditions. This suggests that DDEFLl may provide a growth advantage to cancer cells in poor nutritional and hypoxic conditions. The frequent up-regulation of DDEFLl in HCCs underscores the importance of this gene in hepatocarcinogenesis.
The nucleotide sequence and the amino acid sequence of VANGL1 are shown as SEQ ID NO: 3 and NO: 4, respectively. The
determined cDNA sequence consisted of 1879 nucleotides containing an open reading frame of 1572 nucleotides encoding a 524-amino-acid protein (GenBank accession number AB057596) . Strabismus (stbm) was identified as a gene responsible for a mutant fruit fly with rough eye phenotype (Wolff T. and Rubin G.M. , Development 125:1149-1159, 1998). The gene is required to maintain polarity in the eye, legs and bristles and to decide cell fate of R3 and R4 photoreceptors in the Drosophila . A mouse gene homologous to stbm , Ltap , was altered in the neural tube mutant mouse Loop-tail, which is a human model of neural tube defects (NTDs) (Kibar Z et al. , Nat Genet. 28 : 251-255, 2001) . Hence, VANGL1 may also play important roles in cellular polarity, cell fate decision, and/or organization of tissues. Since VANGL1 is frequently up-regulated in HCCs and suppression of its expression significantly reduced growth or survival of cancer cells , VANGL1 may confer prolonged survival and/or depolarized growth to cancer cells.
The nucleotide sequence and the amino acid sequence of LGN are shown as SEQ ID NO: 5 and NO: 6, respectively. LGN cDNA consists of 2336 nucleotides and encodes a 677 amino acid peptide.
LGN protein was previously reported as a protein interacting with alpha subunit of inhibitory heterotrimeric G proteins (Gαi2) (Mochizuki , N. et al ., Gene 181 : 39-43 , 1996) . The activating mutations of Gαi2 have ever been reported in pituitary tumor and other endocrine tumors (Hermouet, S. et al., Proc. Natl. Acad. Sci. USA 88 : 10455-10459, 1991; Pace, A. M. etal., Proc. Natl. Acad. Sci. USA. 88 : 7031-7035, 1991; Lyons, J. et al, Science 249 : 655-659, 1990). However, involvement of LGN in tumorigenesis or carcinogenesis has not yet been reported. Colony formation assay suggested that LGN might have oncogenic activity. Enhanced expression of LGN may activate Gαi2 and mediate oncogenic signals in hepatocarcinogenesis . The nucleic acid of the present invention includes cDNA, genomic DNA, chemically synthesized DNA, and RNA. It may be
single-stranded or double-stranded.
The "isolated nucleic acid" used herein means a nucleic acid the structure of which is not identical to that of any naturally occurring nucleic acid or to that of any fragment of a naturally occurring genomic nucleic acid spanning more than three separate genes. The term therefore includes, for example, (a) a DNA which has the sequence of part of a naturally occurring genomic DNA molecule in the genome of the organism in which it naturally occurs; (b) a nucleic acid incorporated into a vector or into the genomic DNA of a prokaryote or eukaryote in a manner such that the resulting molecule is not identical to any naturally occurring vector or genomic DNA; (c) a separate molecule such as a cDNA, a genomic fragment, a fragment produced by polymerase chain reaction (PCR) , or a restriction fragment; and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion protein. Specifically excluded from this definition are nucleic acids of DNA molecules present in mixtures of different (i) DNA molecules, (ii) transfected cells , or (iii) cell clones; e.g., as these occur in a DNA library such as a cDNA or genomic DNA library.
In one embodiment, the nucleic acid of the present invention includes a nucleic acid comprising the nucleotide sequence of DDEFLl or VANGL1 , specifically SEQ ID NO: 1 or NO: 3.
In another embodiment, the nucleic acid of the present invention includes a nucleic acid encoding a polypeptide comprising the amino acid sequence of DDEFLl or VANGL1 , specifically, SEQ ID NO: 2 or NO: 4. Thus, the nucleic acid comprising arbitrary sequences based on the degeneracy of the genetic code are included.
In still another embodiment, the nucleic acid of the present invention includes a variant nucleic acid of SEQ ID NO: 1 or NO: 3. The variant includes a nucleic acid comprising a strand that hybridizes under high stringent conditions to a nucleotide sequence consisting of SEQ ID NO: 1 or NO: 3 or
the complement thereof.
The term "complement" used herein means one strand of a double-stranded nucleic acid, in which all the bases are able to form base pairs with a sequence of bases in another strand. Also, "complementary" is defined as not only those completely matching within a continuous region of at least 15 contiguous nucleotides, but also those having identity of at least 65%, preferably 70% , more preferably 80% , still more preferably 90% , and most preferably 95% or higher within that region. As used herein, "percent identity" of two nucleic acids is determined using the algorithm of Karlin and Altschul (Proc. Natl. Acad. Sci. USA 87: 2264-2268, 1990) modified as in Karlin and Altschul (Proc. Natl. Acad. Sci. USA 90 : 5873-5877 , 1993). Such an algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et al. (J. Mol. Biol. 215:403-410, 1990).
BLAST nucleotide searches are performed with the NBLAST program, score = 100, wordlength = 12. Homology search of protein can readily be performed, for example, in DNA Databank of JAPAN
(DDBJ) , by using the FASTA program, BLAST program, etc. BLAST protein searches are performed with the XBLAST program, score = 50, wordlength = 3. Where gaps exist between two sequences, Gapped BLAST is utilized as described in Altsuchl et al . (Nucleic Acids Res. 25: 3389-3402, 1997). When utilizing BLAST and Gapped BLAST programs , the default parameters of the respective programs (e.g, XBLAST and NBLAST) are used.
Preferably, the variant includes a nucleotide sequence that is at least 65% identical to the nucleotide sequence shown in SEQ ID NO: 1 or NO: 3. More preferably, the variant is at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, identical to the nucleotide sequence shown in SEQ ID NO: 1 or NO: 3. In the case of a variant which is longer than or equivalent in length to the reference sequence, e.g. , SEQ ID NO: 1 or NO: 3, the comparison is made with the full length of the reference sequence. Where the variant is shorter than the reference sequence, e.g. , shorter than SEQ ID NO: 1 or NO: 3, the comparison is made to segment of the reference sequence
of the same length (excluding any loop required by the homology calculation) .
The stringency of hybridization is defined as equilibrium hybridization under the following conditions: 42 !, 2 x SSC, 0.1% SDS (low stringency); 50°C, 2 x SSC, 0.1% SDS (medium stringency); and 65 , 2 x SSC, 0,1% SDS (high stringency). If washings are necessary to achieve equilibrium, the washings are performedwith the hybridization solution for the particular stringency desired. In general, the higher the temperature, the higher is the homology between two strands hybridizing at equilibrium.
There is no restriction on length of the nucleic acid of the present invention, but it preferably comprises at least 15, 20, 30, 40, 50, 100, 150, 200, 300, 400, 500, 1000, 1500, 2000, 2500, or 3000 nucleotides.
The nucleic acid of the present invention includes polynucleotides used as probes or primers specifically hybridizing with the nucleotide sequence of SEQ ID NO: 1 or NO: 3 or its complement. The term "specifically hybridizing" means that hybridizing under a normal hybridization condition, preferably a stringent condition with the nucleotide sequence of SEQ ID NO: 1 or NO: 3, but not crosshybridizing with DNAs encoding other polypeptides.
The primers and probes comprise at least 15 continuous nucleotides within the nucleotide sequence of SEQ ID NO: 1 or 3 or complementary to the sequence. In general, the primers comprises 15 to 100 nucleotides, and preferably 15 to 35 nucleotides, and the probes comprise at least 15 nucleotides, preferably at least 30 nucleotides , containing at least a portion or the whole sequence of SEQ ID NO: 1 or NO: 3. The primers can be used for amplification of the nucleic acid encoding the polypeptide of the present invention and the probes can be used for the isolation or detection of the nucleic acid encoding the polypeptide of the present invention . The primers and probes of the present invention can be prepared, for example, by a commercially available oligonucleotide synthesizing machine .
The probes can be also prepared as double-stranded DNA fragments which are obtained by restriction enzyme treatments and the like.
The nucleic acid of the present invention includes an antisense oligonucleotide that hybridizes with any site within the nucleotide sequence of SEQ ID NO: 1 or 3. The term "antisense oligonucleotides" as used herein means , not only those in which the entire nucleotides corresponding to those constituting a specified region of a DNA or mRNA are complementary, but also those having a mismatch of one or more nucleotides, as long as DNA or mRNA and an oligonucleotide can specifically hybridize with the nucleotide sequence of SEQ ID NO: 1 or NO: 3.
The antisense oligonucleotide is preferably that against at least 15 continuous nucleotides in the nucleotide sequence of SEQ ID NO: 1 or NO: 3. The above-mentioned antisense oligonucleotide, which contains an initiation codon in the above-mentioned at least 15 continuous nucleotides, is even more preferred.
The antisense oligonucleotides of the present invention includes analogs containing lower alkyl phosphonate (e.g., methyl-phosphonate or ethyl-phosphonate) , phosphothioate, and phosphoamidate .
The antisense oligonucleotide of the present invention, acts upon cells producing the polypeptide of the invention by binding to the DNA or mRNA encoding the polypeptide and inhibits its transcription or translation, promotes the degradation of the mRNA, inhibiting the expression of the polypeptide of the invention.
The nucleic acid of the present invention can be prepared as follows. cDNA encoding the polypeptide of the present invention can be prepared, for example, by preparing a primer based on nucleotide information (for example, SEQ ID NO: 1 or NO: 3) of DNA encoding the polypeptide of the present invention and performing plaque PCR (Affara NA et al . (1994) Genomics 22, 205-210). Genomic DNA can be prepared, for example, by the method using commercially available "Qiagen genomic DNA
kits" (Qiagen, Hilden, Germany) . The nucleotide sequence of the DNA acquired can be decided by ordinary methods in the art by using, for example, the commercially available "dye terminator sequencing kit" (Applied Biosystems) . The nucleic acid of the present invention, as stated later, can be utilized for the production of a recombinant protein and detection of hepatocellular carcinoma.
Vectors, Transformants, and Production of Recombinant Polypeptide
The present invention also features a vector into which the nucleic acid of the present invention has been inserted.
The vector of the present invention includes a vector for preparing the recombinant polypeptide of the present invention. Any vector can be used as long as it enables expression of the polypeptide of the present invention.
Examples of the expression vector include bacterial (e.g. Escherichia coli) expressionvectors , yeastexpressionvectors , insect expression vectors, and mammalian expression vectors. In the present invention, mammalian expression vectors such as pcDNA3.1-myc/His or pcDNA 3.1 vector (Invitrogen) can be used. Insertion of the nucleic acid of the present invention into a vector can be done using ordinary methods in the art. The vector of the present invention also includes a vector for expressing the polypeptide of the present invention in vivo
(especially for gene therapy) . Various viral vectors and non-viral vectors can be used as long as they enable expression of the polypeptide of the present invention in vivo . Examples of viral vectors are adenovirus vectors, retrovirus vectors, etc. Cationic liposomes can be given as examples of non-viral vectors.
The present invention also provides a transformant carrying, in an expressible manner, the nucleic acid of the present invention. The transformant of the present invention includes , those carrying the above-mentioned expression vector
intowhich nucleic acidof thepresent inventionhas been inserted, and those having host genomes into which the nucleic acid of the present invention has been integrated. The nucleic acid of the invention is retained in the transformant in any form as long as the transformant can express the nucleic acid.
There is no particular restriction as to the cells into which the vector is inserted as long as the vector can function in the cells to express the nucleic acid of the present invention. For example, E. coli , yeast, mammalian cells and insect cells can be used as hosts. Preferably, mammalian cells such as COS7 cells and NIH3T3 cells. Introduction of a vector into a cell can be done using known methods such as electroporation and calcium phosphate method.
Common methods applied in the art may be used to isolate and purify said recombinant polypeptide from the transformant. For example, after collecting the transformant and obtaining the extracts, the objective polypeptide can be purified and prepared by, ion exchange chromatography, reverse phase chromatography, gel filtration, or affinity chromatography where an antibody against the polypeptide of the present invention has been immobilized in the column, or by combining several of these columns.
Also when the polypeptide of the present invention is expressed within host cells (for example, animal cells , E. coli ) as a fusion protein with glutathione-S-transferase protein or as a recombinant polypeptide supplemented with multiple histidines, the expressed recombinant polypeptide can be purified using a glutathione column or nickel column. After purifying the fusion protein, it is also possible to exclude regions other than the objective polypeptide by cutting with thrombin or factor-Xa as required.
Polypeptides
The present invention provides isolated polypeptides encoded by DDEFLl or VANGL1 (e.g. SEQ ID NO: 1 or NO: 3) . In
specific embodiments , the polypeptides of the present invention includes a polypeptide encoded by the nucleotide sequence of SEQ ID NO: 1 or NO: 3 and a polypeptide comprising the amino acid sequence of SEQ ID NO: 2 or NO: 4. The "isolatedpolypeptide" usedhereinmeans apolypeptide that is substantially pure and free from other biological macromolecules . The substantially pure polypeptide is at least 75% (e.g., at least 80, 85, 95, or 99%) pure by dry weight. Purity can be measured by any appropriate standard method, for example by column chromatography, polyacryiamide gel electrophoresis, or HPLC analysis.
Thepolypeptide of thepresent invention includes variants of SEQ ID NO: 2 or NO: 4 as long as the variants are at least 65% identical to SEQ ID NO: 2 or NO: 4. The variants may be a polypeptide comprising the amino acid sequence of SEQ ID NO: 2 or NO: 4 in which one or more amino acids have been substituted, deleted, added, and/or inserted. The variants may also be a polypeptide encoded by a nucleic acid comprising a strand that hybridizes under high stringent conditions to a nucleotide sequence consisting of SEQ ID NO: 1 or NO: 3.
Polypeptides having amino acid sequences modified by deleting, adding and/or replacing one ormore amino acid residues of a certain amino acid sequence, have been known to retain the original biological activity (Mark, D. F. et al. , Proc. Natl. Acad. Sci. USA (1984) 81, 5662-5666, Zoller, M. J. & Smith, M. , Nucleic Acids Research (1982) 10, 6487-6500, Wang, A. et al., Science 224, 1431-1433, Dalbadie-McFarland, G. et al., Proc. Natl. Acad. Sci. USA (1982) 79, 6409-6413).
The number of amino acids that are mutatedby substitution , deletion, addition, and/or insertion is not particularly restricted. Normally, it is 10% or less, preferably 5% or less, and more preferably 1% or less of the total amino acid residues.
As for the amino acid residue tobemutated, it is preferable to be mutated into a different amino acid in which the properties of the amino acid side-chain are conserved. Examples of properties of amino acid side chains are, hydrophobic amino
acids (A, I, L, M, F, P, W, Y, V), hydrophilic amino acids
(R, D, N, C, E, Q, G, H, K, S, T) , and amino acids comprising the following side chains: an aliphatic side-chain (G, A, V,
L, I, p) ; a hydroxyl group containing side-chain (s , T, Y) ; a sulfur atom containing side-chain (c, M) ; a carboxylic acid and amide containing side-chain (D,N,E, Q) ; a base containing side-chain (R, K, H) ; and an aromatic containing side-chain
(H, F, Y, W) (The parenthetic letters indicate the one-letter codes of amino acids) . A "conservative amino acid substitution" is a replacement of one amino acid belonging to one of the above groups with another amino acid in the same group.
A deletion variant includes a fragment of the amino acid sequence of SEQ ID NO: 1 or NO: 3. The fragment is a polypeptide having an amino acid sequence which is partly, but not entirely, identical to the above polypeptides of this invention. The polypeptide fragments of this invention usually consist of 8 amino acidresidues ormore, andpreferably 12 amino acidresidues or more (for example, 15 amino acid residues or more) . Examples of preferred fragments include truncation polypeptides, having amino acid sequences lacking a series of amino acid residues including either the amino terminus or carboxyl terminus, or two series of amino acid residues, one including the amino terminus and the other including the carboxyl terminus. Furthermore, fragments featured by structural or functional characteristics are also preferable , which include those having α-helix andα-helix forming regions , β-sheet andβ -sheet forming regions, turn and turn forming regions, coil and coil forming regions, hydrophilic regions, hydrophobic regions, α -amphipathic regions , β -amphipathic regions , variable regions , surface forming regions, substrate-binding regions, and high antigenicity index region. Biologically active fragments are also preferred. Biologically active fragments mediate the activities of the polypeptides of this invention, which fragments include those having similar or improved activities, or reduced undesirable activities. For example, fragments having the activity to transduce signals into cells via binding
of a ligand, and furthermore, fragments having antigenicity or immunogenicity in animals, especially humans are included. These polypeptide fragments preferably retain the antigenicity of the polypeptides of this invention. Further, an addition variant includes a fusion protein of the polypeptide of the present invention and another peptide or polypeptide. Fusion proteins can be made by techniques well known to a person skilled in the art, such as by linking the DNA encoding the polypeptide of the invention with DNA encoding other peptides or polypeptides , so as the frames match , inserting this into an expression vector and expressing it in a host. There is no restriction as to the peptides or polypeptides fused to the polypeptide of the present invention.
Known peptides, for example, FLAG (Hopp, T.P. et al . , Biotechnology (1988) 6, 1204-1210), 6xHis containing six His (histidine) residues, lOxHis, Influenza agglutinin (HA) , human c-myc fragment, VSP-GP fragment, plδHIV fragment, T7-tag,
HSV-tag, E-tag, SV40T antigen fragment, Ick tag, α-tubulin fragment, B-tag, Protein C fragment, and such, can be used as peptides that are fused to the polypeptide of the present invention. Examples of polypeptides that are fused to polypeptide of the invention are, GST
(glutathione-S-transferase) , Influenza agglutinin (HA) , immunoglobulin constant region, β-galactosidase, MBP (maltose-binding protein) , and such.
Fusion proteins can be prepared by fusing commercially available DNA encoding these peptides or polypeptides with the DNA encoding the polypeptide of the present invention and expressing the fused DNA prepared. The variant polypeptide is preferably at least 65% identical to the amino acid sequence shown in SEQ ID NO: 2 or NO: 4. More specifically, the modified polypeptide is at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or more, identical to the amino acid sequence shown in SEQ ID NO: 2 or NO: 4. In the case of a modified polypeptide which is longer than or equivalent in length to the reference sequence, e.g. ,
SEQ ID NO: 2 or NO: 4, the comparison is made with the full length of the reference sequence. Where the modified polypeptide is shorter than the reference sequence, e.g., shorter than SEQ ID NO: 2 or NO: 4, the comparison is made to segment of the reference sequence of the same length.
As used herein, "percent identity" of two amino acid sequences is determined in the same manner as described above for the nucleic acids.
The polypeptide of the present invention can be prepared by methods known to one skilled in the art, as a natural polypeptide or a recombinant polypeptide made using genetic engineering techniques as described above. For example, a natural polypeptide canbe obtainedbypreparing a column coupled with an antibody obtained by immunizing a small animal with the recombinant polypeptide, and performing affinity chromatography for extracts of liver tissues or cells expressing high levels of the polypeptide of the present invention. A recombinant polypeptide can be prepared by inserting DNA encoding the polypeptide of the present invention (for example, DNA comprising the nucleotide sequence of SEQ ID NO: 1 or 3) into a suitable expression vector, introducing the vector into a host cell, allowing the resulting transformant to express the polypeptide, and recovering the expressed polypeptide.
The variant polypeptide can be prepared, for example, by inserting a mutation into the amino acid sequence of SEQ ID NO: 1 or NO: 3 by a known method such as the PCR-mediated, site-directed-mutation-induction system (GIBCO-BRL, Gaithersburg, Maryland) , oligonucleotide-mediated, sight-directed-mutagenesis (Kramer, W. and Fritz, HJ (1987) Methods in Enzymol. 154:350-367).
Antibodies
The present invention also features an antibody that specifically binds to the polypeptide of the present invention.
There is no particular restriction as to the form of the antibody
of the present invention and include polyclonal antibodies and monoclonal antibodies. The antiserum obtained by immunizing animals such as rabbits with the polypeptide of the present invention, polyclonal andmonoclonal antibodies of all classes , humanized antibodies made by genetic engineering, human antibodies, are also included.
Polyclonal antibodies can be made by, obtaining the serum of small animals such as rabbits immunized with the polypeptide of the present invention, attaining a fraction recognizing only the polypeptide of the invention by an affinity column coupled with the polypeptide of the present invention, and purifying immunoglobulin G or M from this fraction by a protein G or protein A column.
Monoclonal antibodies can be made by immunizing small animals such as micewiththepolypeptide of thepresent invention , excising the spleen from the animal, homogenizing the organ into cells, fusing the cells with mouse myeloma cells using a reagent such as polyethylene glycol, selecting clones that produce antibodies against the polypeptide of the invention from the fused cells (hybridomas) , transplanting the obtained hybridomas into the abdominal cavity of a mouse, and extracting ascites. The obtained monoclonal antibodies can be purified by, for example, ammonium sulfate precipitation, protein A or protein G column, DEAE ion exchange chromatography, or an affinity column to which thepolypeptide of thepresent invention is coupled. The antibody of the invention can be used for purifying and detecting the polypeptide of the invention. In particular, it can be used for detecting hepatocellular carcinoma. The human antibodies or humanized antibodies can be prepared by methods commonly known to one skilled in the art. For example, human antibodies can be made by, immunizing a mouse whose immune system has been changed to that of humans , with the polypeptide of the present invention. Also, humanized antibodies can be prepared by , for example, cloning the antibody gene from monoclonal antibody producing cells and using the
CDR graft method which transplants the antigen-recognition site of the gene into a known human antibody.
Detection Methods
The present invention further provides a method of detecting hepatocellular carcinoma using the DDEFLl , VANGLl , or LGN polypeptide as a marker.
The detection can be performed by measuring an expression level of at least one of DDEFLl , VANGLl , and LGN polypeptides in a biological sample from a subject, comparing the expression level with that in a non-cancerous sample, and determining the presence or absence of the cancer in a subject.
A biological sample used herein include any liver tissues or cells obtained from a subject who is in need of detection of hepatocellular carcinoma. In particular, liver biopsy specimen can be used. The biological sample also includes an mRNA, cRNA or cDNA sample prepared from liver tissues or cells. mRNA and cDNA samples can be prepared by a conventional method. cRNA refers to RNA transcribed from a template cDNA with RNA polymerase. cRNA can be synthesized from T7 promoter-attached cDNA as a template by using T7 RNA polymerase. A commercially available cRNA transcription kit for DNA chip-based expression profiling can be used. In specific embodiments, the expression level of DDEFLl , VANGLl or LGN polypeptide can be measured in the RNA, cDNA, or polypeptide level.
The mRNA expression level can be measured by , for example , a Northern blotting method using a probe that hybridizes with the nucleotide sequence of DDEFLl , VANGLl , or LGN, an RT-PCR method using a primer that hybridizes with the nucleotide sequence of DDEFLl , VANGLl , or LGN, and such.
The probes or primers used in the detection method of the present invention include a nucleic acid specifically hybridizing with the nucleotide sequence of SEQ ID NO: 1, NO: 3, or NO: 5, or its complement. The term "specifically
hybridizing" means that hybridizing under a normal hybridization condition, preferably a stringent condition with the nucleotide sequence of SEQ ID NO: 1, NO: 3, or NO: 5, but not crosshybridizing with DNAs encoding other polypeptides. The primers and probes comprise at least 15 continuous nucleotides within the nucleotide sequence of SEQ ID NO: 1, NO: 3, or NO: 5 or complementary to the sequence. In general, the primers comprises 15 to 100 nucleotides, and preferably 15 to 35 nucleotides, and the probes comprise at least 15 nucleotides, preferably at least 30 nucleotides, containing at least a portion or the whole sequence of SEQ ID NO: 1, NO: 3, orNO: 5. The primers andprobes canbe prepared, for example, by a commercially available oligonucleotide synthesizing machine. The probes can be also prepared as double-stranded DNA fragments which are obtained by restriction enzyme treatments and the like.
The cDNA expression level can be measured by, for example , a method utilizing a DNA array (Masami Muramatsu and Masashi Yamamoto, New Genetic Engineering Handbook pp.280-284, YODOSHA Co. , LTD.) . Specifically, first, a cDNA sample prepared from a subject and a solid support, on which polynucleotide probes hybridizing with the nucleotide sequence of DDEFLl, VANGLl, or LGN are fixed, areprovided. As the probes , those as described above can be used. Plural kinds of probes can be fixed on the solid support in order to detect plural kinds of target polynucleotides. The cDNA sample is labeled for detection according to needs. The label is not specifically limited so longas it canbe detected, and includes , for example, fluorescent labels, radioactive labels, and so on. The labeling can be carried out by conventional methods (L. Luo et al., "Gene expression profiles of laser-captured adjacent neuronal subtypes", Nat. Med. (1999) pp. 117-122).
The cDNA sample is then contacted with the probes on the solid support to allow the cDNA sample to hybridize with the probes . Although the reaction solution and the reaction condition for hybridization varies depending on various factors ,
such as the length of the probe , they can be determined according to usual methods well known to those skilled in the art.
The intensity of hybridization between the cDNA sample and the probes on the solid support is measured depending on the kind of the label of the cDNA sample. For example, a fluorescent label can be detected by reading out the fluorescent signal with a scanner.
The hybridization intensity of the test cDNA sample and the control cDNA sample (e.g. cDNA from non-cancerous tissues or cells) can be measured simultaneously in one measurement by labeling themwith different fluorescent labels . For example , one of the above-mentioned cDNA samples can be labeled with Cy5 , and the other with Cy3. The intensity of Cy5 and Cy3 fluorescent signals show the expression level of the respective cDNA samples (Duggan et al., Nat. Genet. 21:10-14, 1999).
In this method, cRNA can be measured in place of cDNA. Furthermore, the polypeptide expression level can be measured using an antibody against DDEFLl , VANGLl , or LGN polypeptide by , for example, SDS polyacryiamide electrophoresis , Western blotting, dot-blotting, immunoassay such as immunoprecipitation, fluoroimmunoassay , radioimmunoassay, enzyme immunoassay (e.g. enzyme-linked immunosorbent assay (ELISA)), and immunohistochemical staining, etc.
In specific embodiments , a biological sample is contacted with an antibody against DDEFLl , VANGLl , or LGN polypeptide immobilized on a solid support, the antibody-antigen complex on the solid support is contacted with a second antibody labeled with a detectable label, and the label is detected by an appropriate method. The antibody used in the detection method of the present invention includes any antibody thatbinds to the DDEFLl , VANGLl , or LGN polypeptide , specifically the polypeptide with the amino acid sequence of SEQ ID NO: 2 , NO: 4 , or NO: 6 , including antiserum obtained by immunizing animals such as rabbits with the DDEFLl , VANGLl , or LGN polypeptide, polyclonal and monoclonal antibodies of all classes , humanized antibodies made by genetic
engineering, and human antibodies. These antibodies can be prepared as described above.
The expression level measured as described above is compared with that measured in a non-cancerous sample to determine the presence or absence of hematocellular carcinoma in the subj ect . When the expression level measured in the sample from the subj ect is higher than thatmeasured in the non-cancerous sample, the subject is judged to have the cancer or the risk of the cancer. On the other hand, the expression level in the subject sample is not higher compared with that in the non-cancerous sample, then, the subject is judged to be free from the cancer. Specifically, whether the expression level in the subject sample is higher than that in the non-cancerous sample , can be determined based on the relative expression ratio (subject sample/non-cancerous sample) ; the expression level is judged as being higher when the relative expression ratio is more than 2.0.
Detection Reagents
The present invention provides detection reagents for hepatocellular carcinomas.
In one embodiment, the detection reagent of the present invention comprises a polynucleotide having at least 15 nucleotides which hybridizes with DDEFLl , VANGLl or LGN, specifically SEQ ID NO: 1, NO: 3, or NO: 5. The polynucleotide canbeused in the above-mentioneddetectionmethod of thepresent invention as a probe or a primer. When used as a probe, the polynucleotides contained in the detection reagent of the present invention can be labeled. The method of labeling includes, for example, a labelingmethodusingT4 polynucleotide kinase to phosphorylate the 5 '-terminus of the polynucleotide with 32P; and a method of introducing substrate bases, which are labeled with isotopes such as 32P , fluorescent dyes , biotin , and so on using random hexamer oligonucleotides and such as primers and DNA polymerase such as Klenow enzyme (the random
prime method, etc.).
In another embodiment, the detection reagent of the present invention comprises an antibody thatbinds to the DDEFLl , VANGLl , or LGNpolypeptide, specifically the polypeptide having the amino acid sequence of SEQ ID NO: 2, NO: 4, or NO: 6. The antibodies are used to detect the polypeptides of the present invention in the above-mentioned detectionmethod of the present invention. The antibodies may be labeled according to the dictionmethod. Furthermore, the antibodies maybe immobilized on a solid support.
The detection reagent of the present invention may further comprise a medium or additive, including sterilized water, physiological saline, vegetable oils, surfactants, lipids, solubilizers, buffers, protein stabilizers (such as bovine serum albumin and gelatin) , preservatives, and such, as long as it does not affect the reactions used in the detection method of the present invention.
Methods for Inhibiting Growth of Hematocellular Carcinomas
The present invention further provides a method for inhibiting growth of hepatocellular carcinomas. In specific embodiments, this method can be performed by introducing an antisense oligonucleotide of DDEFLl , VANGLl , or LGN into the target cells.
The antisense oligonucleotide used in this method hybridizes with any site within the nucleotide sequence of SEQ IDNO: l,NO: 3, or NO: 5. The antisense oligonucleotides include not only those in which the entire nucleotides corresponding to those constituting a specified region of a DNA or mRNA are complementary, but also those having a mismatch of one or more nucleotides, as long as DNA or mRNA and an oligonucleotide can specifically hybridize with the nucleotide sequence of SEQ ID NO: 1, NO: 3, or NO: 5. The antisense oligonucleotide is preferably that against at least 15 continuous nucleotides in the nucleotide sequence
of SEQ ID NO: 1, NO: 3, or NO: 5. The above-mentioned antisense oligonucleotide, which contains an initiation codon in the above-mentioned at least 15 continuous nucleotides, is even more preferred. The antisense oligonucleotides includes analogs containing lower alkyl phosphonate (e.g., methyl-phosphonate or ethyl-phosphonate) , phosphothioate , and phosphoamidate . Herein, the target cells may be mammalian cells, preferably human cells . The introduction method may be in vi tro , in vivo , or ex vivo transfer method. In one embodiment, the antisense oligonucleotides can be introduced into the target cells by a conventional transfection method. Alternatively, the introduction can be made by conventional gene transfer technique using a vector carrying the antisense oligonucleotide, such as adenovirus vectors, retrovirus vectors, or cationic liposomes .
Any patents , patent applications , and publications cited herein are incorporated by reference.
Brief Description of Drawings
Figure la-lb show expression of a gene termed B9362 in HCCs. Fig. la shows relative expression ratio (cancer/non-cancer) of B9362 in primary 20 HCCs examined by cDNA microarray. Fig. lb presents photographs showing expression of B9362 analyzed by semi-quantitative RT-PCR using additional 11 HCC cases. Expression of GAPDH served as an internal control .
Figure 2a-2d show the results of identification of DDEFLl . Fig. 2a is a photograph showing the results of Northern blot analysis of DDEFLl in various human tissues. Fig. 2b shows the structure of DDEFLl . Fig. 2c shows similarity between the expected DDEFLl protein and members of ArfGAP family. Fig. 2d shows identity between the amino acid sequence of the ArfGAP
motif in DDEFLl and that in DDEF2. The arrows indicate a CXXCXiεCXXC motif, representing a zinc finger structure essential to GAP activity.
Figure 3a-3b show subcellular localization of DDEFLl . Fig. 3a is a photograph showing the results of Western blot analysis, indicating that cMyc-tagged DDEFLl protein was expressed in COS7 cells transfected with pcDNA-DDFF l-myc plasmid. Fig. 3b presents photographs showing immunocytochemistry of the cells, suggesting that cMyc-tagged DDEFLl protein localized in the cytoplasm.
Figure 4a-4d show growth-promoting effect of DDEFLl . Fig. 4a presents photographs showing the results of colony formation assays, indicating that DDEFLl promotes cell growth in NIH3T3, SNU423, and Alexander cells. Fig. 4b presents photographs showing stable expression of exogeneous DDEFLl by NIH3T3-DDFFL1 cells. Fig.4c is a graph showing growth of NIH3T3-DDEFLl cells stably expressing exogeneous DDEFLl in culture media containing 10% FBS. Fig. 4d is a graph showing growth of NIH3T3-DDFFL1 cells in culture media containing 0.1% FBS (P<0.01). Figure 5a-5b show growth suppression by antisense S-oligonucleotides designated to suppress DDEFLl in SNU475 cells. Fig. 5a shows designation of antisense S-oligonucleotides and photographs showing reduced expression of DDEFLl by the transfection of AS1 or AS5 antisense S-oligonucleotides. Fig.5b presents photographs showing that AS1 and AS5 suppressed growth of SNU475 cells.
Figure 6a-6b show expression of VANGLl in HCCs. Fig. 6a shows relative expression ratios (cancer/non-cancer) of VANGLl inprimary 20 HCCs examinedby cDNAmicroarray. Fig.6b presents photographs showing expression of D3244 analyzed by semi-quantitative RT-PCR using additional 10 HCC cases. T, tumor tissue; N, normal tissue. Expression of GAPDH served as an internal control.
Figure 7a and 7B show the results of identification of VANGLl . Fig. 7a is a photograph showing the results of multiple-tissue Northern blot analysis of VANGLl in various
human tissues. Fig. 7b shows predicted protein structure of VANGLl .
Figure 8a and 8b show subcellular localization of VANGLl . Fig. 8a presents photographs of SNU475 cells transfected with pcDNA3.1-myc/His- VANGLl stainedwithmouse anti-mycmonoclonal antibody and visualized by Rhodamine conjugated secondary anti-mouse IgG antibody . Nuclei were counter-stained with DAPI . Fig. 8b presents photographs of mock cells similarly stained and visualized. Figure 9a-9d show growth suppressive effect of antisense S-oligonucleotide designated to suppress VANGLl . Fig. 9a presents photographs showing expression of VANGLl in SNU475 cells treated with either control or antisense oligonucleotide for 12 hours. Fig. 9b is a photograph showing that S-oligonucleotide suppressed growth of SNU423 cells. Fig. 9c is a graph showing the results of analysis of cell viability by MTT assay. Fig. 9d shows the results of fluorescence activated cell sorting (FACS) analysis of cells treated with sense or antisense oligonucleotide. Figure 10a and 10b show LGNgene expressionof HCCs compared with their corresponding non-cancerous liver tissues. Fig. 10a shows relative expression ratios (cancer/non-cancer) of LGN in primary 20 HCCs studied by cDNA microarray. Fig. 10b presents photographs showing expression of LGN analyzed by semi-quantitative RT-PCR using additional ten HCCs. Expression of GAPDH served as an internal control. T, tumor tissue; N, normal tissue.
Figure 11 shows genomic structure of LGN.
Figure 12a-12c show subcellular localization of LGN. Fig. 12a is a photograph of COS7 cells transfected with pcDNA3.1-myc/His-LGN, in which nuclei was counter-stained with DAPI. Fig. 12b is a photograph of COS7 cells transfected with pcDNA3.1-myc/His-LGN, which were stained with mouse anti c-myc antibody and visualized by Rhodamine conjugated secondary anti-mouse IgG antibody. Fig. 12c is a merge of a and b.
Figure 13a and 13b show growth-promoting effect of LGN.
Fig. 13a presents photographs showing the results of colony formation assays, indicating that LGN promotes cell growth in NIH3T3, SNU423, Alexander, and SNU475 cells. Fig. 13b is a graph showing growth of NIH3T3-LGN cells stably expressing exogeneous LGN was higher than that of mock (NIH3T3-LacZ) cells in culture media containing 10% FBS.
Figure 14a and 14b show growth suppression by antisense S-oligonucleotide designated to suppress LGN expression in human hepatoma SNU423 cells. Fig. 14a presents photographs showing reduced expression of LGN by the transfection of antisense S-oligonucleotide, antisense 3. Fig. 4b is a photograph showing that antisense 3 suppressed growth of SNU423 cells .
Best Mode for Carrying out the Invention
The present invention will be illustrated with reference to the following examples , but is not construed as being limited thereto.
Example 1
1-1. Identification of DDEFLl commonly up-regulated in human hepatocellular carcinomas
Bymeans of a genome-wide cDNAmicroarray containing 23040 genes, expression profiles of 20 hepatocellular carcinomas (HCC) were compared with their corresponding non-cancerous liver tissues . All HCC tissues and correspondingnon-cancerous tissues were obtained with informed consent from surgical specimens of patients who underwent hepatectomy. A gene with an in-house accession number of B9362 corresponding to an EST, Hs.44579 of a UniGene cluster, was found to be over-expressed in a range between 1.57 and 5.83 (Fig. la) . Its up-regulated expression (Cy3:Cy5 intensity ratio, >2.0) was observed in 11 of the 12 HCCs that passed through the cutoff filter (both Cy3 and Cy5 signals greater than 25,000) . Since an open reading frame of this gene encoded a protein approximately 60% identical to that of development and differentiation enhancing factor
2 (DDEF2) , this gene was termeddevelopment anddifferentiation enhancing factor-like 1 (DDEFLl ) . To clarify the results of the cDNA microarray, expression of this transcript was examined in an additional 11 HCCs by semi-quantitative RT-PCR. Expression of GAPDH served as an internal control. RT-PCR was performed as follows . Total RNA was extracted with a Qiagen RNeasy kit (Qiagen) or Trizol reagent (Life Technologies , Inc.) according to the manufacturers' protocols. Ten-microgram aliquots of total RNA were reversely transcribed for single-stranded cDNAs using poly dTι2-i8 primer (Amersham Pharmacia Biotech) with Superscript II reverse transcriptase (Life Technologies) . Single-stranded cDNA preparation was diluted for subsequent PCR amplification by standard RT-PCR experiments carried out in 20-μl volumes of PCRbuffer (TAKARA) . Amplification proceeded for 4 min at 94°C for denaturing, followed by 20 (for GAPDH) or 33 (for DDEFLl ) cycles of 94°C for 30 s, 56°C for 30 s, and 72°C for 45 s, in the GeneAmp PCR system 9700 (Perkin-Elmer , Foster City, CA) . Primer sequences were; for GAPDH: forward, 5 ' -ACAACAGCCTCAAGATCATCAG (SEQ ID NO: 7) and reverse, 5 '-GGTCCACCACTGACACGTTG (SEQ ID NO: 8); for DDEFLl : forward, 5 '-AGCTGAGACATTTGTTCTCTTG (SEQ ID NO: 9) and reverse: 5 ' -TATAAACCAGCTGAGTCCAGAG (SEQ ID NO: 10). The results confirmed increased expression of DDEFLl in nine of these tumors (Fig. lb) .
1-2. Isolation and structure of a novel gene DDEFLl
Expression of DDEFLl was analyzed by multiple-tissue northern-blot analysis using a PCR product of DDEFLl as a probe . Human multiple-tissue blots (Clontech, Palo Alto, CA) were hybridized with a 3P-labeled DDEFLl cDNA. Pre-hybridization, hybridization and washing were performed according to the supplier's recommendations. The blots were autoradiographed withintensifyingscreensat-80°Cfor 72 h. The results revealed a 4-kb transcript that was expressed in lung, liver, small intestine, placenta and peripheral blood leukocyte (Fig. 2a).
Since B9362 was smaller than that detected on the Northern blot, 5 'RACE experiments was carried out to determine the entire
coding sequence of the gene. 5' RACE experiments were carried out using a Marathon cDNA amplification kit (Clontech, Palo Alto, CA) according to the manufacturer's instructions. For the amplification of the 5' part of DDEFLl cDNAs , gene-specific reverse primers (5 '-CTCACTTGGCACGTCAGCAGGG (SEQ ID NO: 11)) and the AP-1 primer supplied in the kit were used. The cDNA template was synthesized from human liver mRNA. The PCR products were cloned using a TA cloning kit (Invitrogen) and their sequences were determined with an ABI PRISM 3700 DNA sequencer (Applied Biosysterns) .
The complete cDNA consisted of 4050 nucleotides, with an open reading frame of 2712 nucleotides encoding a 903-amino-acid protein (GenBank accession number AB051853) . The first ATG was flanked by a sequence (CCCGCCATGC (SEQ ID NO: 12) ) that agreed with the consensus sequence for initiation of translation in eukaryotes , with an in-frame stop codon upstream. The BLAST program to search for homologies in the NCBI (the National Center for Biotechnology Information) database, identified a genomic sequence with GenBank accession number AL357134, which had been assigned to chromosomal band lp36.12. Comparison of the cDNAandgenomic sequences disclosed that DDEFLl consisted of 25 exons (Fig. 2b) .
A search for protein motifs with the Simple Modular Architecture Research Tool (SMART) revealed that the predicted protein contained two coiled-coil regions (codons 141-172 and 241-278) , a PH (Pleckstrin homology) motif (codons 303-396) , a motif of ArfGAP (GTPase-activating protein for Arf) (codons 426-551) and two ankyrin repeats (codons 585-617 and 621-653) . This structure was similar to centaurin beta 1 and centaurin beta 2 (Fig. 2c) . In particular, DDEFLl shared features of centaurin-family proteins such as a PH domain, a target of phosphatidylinositol 3 ,4 , 5-trisphosphate, andamotif ofArfGAP. The amino acid sequence of the ArfGAP motif of DDEFLl was 67.8% identical to that of DDEF2 (Fig.2d) . Notably, the CXXCXι6CXXC motif, representing a zinc finger structure essential to GAP activity, was completely preserved.
1-3. Subcellular localization of DDEFLl
The coding sequence of DDEFLl was cloned into the pcDNA3.1-myc/His vector (Invitrogen) . The resulting plasmid expressingmyc-taggedDDEFLl protein (pDNA-myc/His-DDEFLI ) was transiently transfected into COS7 cells (American Type Culture Collection (ATCC) ) . The expected myc-tagged protein was detected by immunoblotting (Western blotting) as follows. Cells transfected with pcDNA3.1-myc/His-DDEFLl were washed twice with PBS and harvested in lysis buffer (150 mM NaCI, 1% Triton X-100, 50 mM Tris-HCl pH 7.4, ImM DTT, and IX complete Protease Inhibitor Cocktail (Boehringer) ) . After the cells were homogenized and centrifuged at 10,000xg for 30 min, the supernatant was standardized for protein concentration by the Bradford assay (Bio-Rad) . Proteins were separated by 10% SDS-PAGE and immunoblotted with mouse anti-myc antibody. HRP-conjugated goat anti-mouse IgG (Amersham) served as the secondary antibody for the ECL Detection System (Amersham) . As a result, the DDEFLl protein was detected on western blots with an anti-myc antibody (Fig. 3a) . Furthermore, immunocytochemical staining was performed as follows. First, the cells were fixed with PBS containing 4% paraformaldehyde for 15 min, then rendered permeable with PBS containing 0.1% Triton X-100 for 2.5 min at RT . Subsequently the cells were covered with 2% BSA in PBS for 24 h at 4°C to block non-specific hybridization. Mouse anti-myc monoclonal antibody (Sigma) at 1 : 1000 dilution or mouse anti-FLAG antibody (Sigma) at 1:2000 dilution was used for the first antibody, and the reaction was visualized after incubation with Rhodamine-conjugated anti-mouse second antibody (Leinco and ICN) . Nuclei were counter-stained with
4 ' ,6 '-diamidine-2 '-phenylindole dihydrochloride (DAPI). Fluorescent images were obtained under an ECLIPSE E800 microscope. The microscopic analysis indicated that the protein was present mainly in the cytoplasm (Fig. 3b) . DDEFLl was also localized in the cytoplasm of human embryonal kidney (HEK293) cells (ATCC) .
1-4. Effect of DDEFLl on cell growth
The coding sequence of DDEFLl was cloned into the pcDNA 3.1 vector (Invitrogen) . NIH3T3 cells (ATCC) plated onto 10-cm dishes (2X105 cells/dish) were transfected with the resulting plasmid expressing DDEFLl (pcDNA-DDEFLI ) and the control plasmid (pcDNA-LacZ) and cultured in Dulbecco's modified Eagle's medium supplemented with 10% fetal bovine serum and 1% antibiotic/antimycotic solution (Sigma) , and further with an appropriate concentration of geneticin for two weeks. The cells were then fixed with 100% methanol and stained by Giemsa solution. Cells transfected with pcDNA-DDEFLI produced markedly more colonies than control cells. An increase in colony formation similarly occurred with transfected human hepatoma SNU423 (Korea cell-line bank) and Alexander (ATCC) cells, in which endogenous expression of DDEFLl is very low (Fig. 4a) .
To investigate this growth-promoting effect further, NIH3T3 cells that stably expressed exogeneous DDEFLl were established. pDNA-myc/His-DDEFLI was transfected into NIH3T3 cells using FuGENE6 reagent (Boehringer) according to the supplier's recommendations. Twenty-four hours after transfection, geneticin was added to the cultures and single colonies were selected two weeks after transfection. Expression of DDEFLl was determined by semi-quantitative RT-PCR (Fig.4b). The growth rate of NIH3T3-DDEFLl cells was statistically higher than that of mock (NIH3T3-LacZ) cells in culture media containing 10% FBS (P<0.05) (Fig. 4c) . In media containing only 0.1% FBS, NIH3T3-DDEFLl cells survived for 6 days, while control NIH3T3 cells died within 6 days under the same conditions. In this case, growth of NIH3T3-DDEFL1 cells was statistically higher than that of mock cells in culture media containing 0.1% FBS (P<0.01) (Fig. 4d) .
1-5. Suppression of DDEFLl expression in human hepatoma SNU475 cells by antisense S-oligonucleotides
The following six pairs of control (sense) and antisense S-oligonucleotides corresponding to the DDEFLl gene were
synthesized.
Antisense:
DDEFL1-AS1 5'-TGCTCCGGCATGGCGG-3' (SEQ ID NO 13) DDEFL1-AS2 5'-GCTGAACTGCTCCGGC-3' (SEQ ID NO 14) DDEFL1-AS3 5 ' -TCCAAGATCTCCTCCC-3 (SEQ ID NO 15) DDEFL1-AS4 5 ' -TCTCCTTCCAAGATCT-3 (SEQ ID NO 16) DDEFL1-AS5 5 ' -GCGCTGAGCCGGCCTC-3 (SEQ ID NO: 17) ; and DDEFL1-AS6 5'-CCTCACCTCCTCCCGC-3' (SEQ ID NO: 18).
Control :
DDEFL1-S1 5 ' -CCGCCATGCCGGAGCA-3 (SEQ ID NO 19) DDEFL1-S2 5 ' -GCCGGAGCAGTTCAGC-3 (SEQ ID NO 20) DDEFL1-S3 5 '-GGGAGGAGATCTTGGA-3 (SEQ ID NO 21) DDEFL1-S4 5 '-AGATCTTGGAAGGAGA-3 (SEQ ID NO 22) DDEFL1-S5 5'-GAGGCCGGCTCAGCGC-3' ( (SSEEQQ I IDD N NOO:: 2 233)); and DDEFL1-S6 5 '-GCGGGAGGAGGTGAGG-3 (SEQ ID NO 24)
Using LIPOFECTIN Reagent (GIBCO BRL) , the synthetic S-oligonucleotides were transfected into SNU475 cells (Korea cell-line bank) , which had shown the highest level of DDEFLl expression among the six hepatoma cell lines we examined (data not shown) . Twelve and twenty-four hours after transfection, antisense S-oligonucleotides AS1 and AS5 significantly suppressed expression of DDEFLl compared to the respective control S-oligonucleotides Sl and S5 (Fig.5a) . Six days after transfection, surviving cells transfected with antisense
S-oligonucleotide AS1 or AS5 were markedly fewer than cells transfected with the respective control S-oligonucleotide Sl or S5 (Fig. 5b) . Consistent results were obtained in three independent experiments .
Example 2
2-1. Identification of VANGLl commonly up-regulated in human hepatocellular carcinomas
The genome-wide cDNA microarray analysis carried out in Example 1 also revealed that a gene with an in-house accession
number of D3244 corresponding to an EST (Hs.122730) of a UniGene cluster, was found to be significantly up-regulated in ten of twelve clinical HCCs compared with the corresponding non-cancerous liver tissues. The relative expression ratio compared to corresponding non-cancerous tissue of these 12 tumors ranged from 1.5 to 16.0 (Fig. 6a). Up-regulated expression (Cy3:Cy5 intensity ratio, >2.0) was observed in 10 of the 12 HCCs that passed through the cutoff filter (both Cy3 and Cy5 signals greater than 25,000) . The elevated expression of D3244 was also confirmed in ten additional HCC cases by semi-quantitative RT-PCR performed similarly to Example 1-1 using the primer set, forward: 5'- GAGTTGTATTATGAAGAGGCCGA
(SEQ ID NO: 25) ; reverse: 5'- ATGTCTCAGACTGTAAGCGAAGG (SEQ ID
NO: 26) (Fig. 6b) .
2-2. Expression of VANGLl in human adult tissues
Multi-tissue northern blot analysis using D3244 cDNA as a probe was performed in the same manner as in Example 1-2 and the results showed a 1.9-kb transcript abundantly expressed in testis and ovary in a tissue-specific manner (Fig.7a). NCBI database search for genomic sequences corresponding to D3244 found two sequences (GenBank accession number: AL450389 and AL592436) assigned to chromosomal band lp22. Using GENSCAN, and Gene Recognition and Assembly Internet Link program, candidate-exon sequences were predicted and exon-connection was performed. In addition, 5' RACE was carried out using a reverse primer (5 ' -TGTCAGCTCTCCGCTTGCGGAAAAAAAG (SEQ ID NO: 27) ) to determine the sequence of the 5 ' region of the transcript in the same manner as in Example 1-2. As a result, an assembled human cDNA sequence of 1879 nucleotides containing an open reading frame of 1572 nucleotides (GenBank accession number: AB057596) was obtained. The predicted amino acid sequence shared 40% and 63% identitywith strabismus (Van Gogh) and VANGL2. Hence, the gene corresponding D3244 was termed as Van Gogh Like 1 (VANGLl ) . Simple Modular Architecture Research Tool suggested that the predicted protein contained putative four transmembrane domains (codons 111-133, 148-170, 182-204,
219-241 ) (Fig . 7b) .
2-3. Subcellular localization of VANGLl
The pcDNA3.1-myc/His-VANGLl plasmid expressing c-myc-tagged VANGLl protein was transiently transfected into SNU475 cells (Korea cell-line bank) . Immunocytochemical staining was performed in the same manner as in Example 1-3. The results revealed that the tagged VANGLl protein was present in the cytoplasm (Fig. 8a and 8b) .
2-4. Growth suppression of hepatoma cells by antisense S-oligonucleotides designated to reduce expression of VANGLl To test whether suppression of VANGLl may result in cell cycle arrest and/or cell death of HCC cells, the following four pairs of antisense and control (sense) S-oligonucleotides were synthesized and transfected into SNU475 cells.
Antisense: antisense 1 5 '-GTATCCATAGCAATGG-3 ' (SEQ ID NO: 28); antisense 2 5 '-TGGATTGGGTATCCAT-3 ' (SEQ ID NO: 29); antisense 3 5 '-TAAGTGGATTGGGTAT-3 ' (SEQ ID NO: 30) ; and antisense 4 5 '-ACTCCTACCTGCCTGT-3 ' (SEQ ID NO: 31).
Control: sense 1 5 '-CCATTGCTATGGATAC-3 ' (SEQ ID NO 32) ; sense 2 5 '-ATGGATACCCAATCCA-3 ' (SEQ ID NO 33) ; sense 3 5 '-ATACCCAATCCACTTA-3 ' (SEQ ID NO 34) ; and sense 4 5 '-ACAGGCAGGTAGGAGT-3 ' (SEQ ID NO 35) .
Antisense S-oligonucleotide encompassing the initiation codon (antisense 3) significantly decreased endogenous expression of VANGLl in SNU475 cells (Fig. 9a) .
Cell viability was evaluated by 3- (4 ,5-dimethyl- thiazol-2-yl) -2 ,5-diphenyltetrazolium bromide (MTT) assay as follows. Cells were plated at a density of 5X105 cells/100 mm dish. At 24 hours after seeding, the cells were transfected in triplicate with sense or antisense S-oligonucleotide
designated to suppress VANGLl . At 72 hours after transfection , the medium was replaced with fresh medium containing 500 μg/ml of 3- (4 ,5-dimethylthiazol-2-yl) -2 ,5-diphenyl tetrazolium bromide (MTT) (Sigma) and the plates were incubated for four hours at 37 °C. Subsequently, the cells were lysed by the addition of 1 ml of 0.01 N HC1/10%SDS and absorbance of lysates was measured with an ELISA plate reader at a test wavelength of 570 nm (reference, 630 nm) . The cell viability was represented by the absorbance compared to that of control cells . Transfection of the antisense S-oligonucleotide, antisense 3, significantly reduced number of surviving cells compared with control sense S-oligonucleotide, sense 3 (Fig. 9b and 9c) . This result was confirmed by three independent experiments . Furthermore, flow cytometry analysis was performed as follows. Cells were plated at a density of 1X105 cells/100 mm dish and trypsinized at the given time course, followed by fixation in 70% cold ethanol. After RNase treatment, cells were stained with propidium iodide (50 μg/ml) in PBS. Flow cytometry was performed on a Becton Dickinson FACScan and analyzed by CellQuest and ModFit software (Verity Software House) . The percentages of nuclei in G0/G1, S and G2/M phases of the cell cycle, and any sub-Gl population were determined from at least 20,000 ungated cells. FACS analysis demonstrated that inhibition of VANGLl significantly increased number of cells at sub-Gl phase (Fig. 9d) . These results suggest that VANGLl may play an important role for cell growth and/or survival of hepatocellular carcinoma cells .
Example 3
3-1. LGN is commonly increased in human hepatocellular carcinomas
Among commonly up-regulated genes by the microarray analysis performed in Example 1-1, a gene, D3636 corresponding to LGN (GenBank accession number : U54999) was selected because it was significantly up-regulated in ten of twelve clinical
HCCs compared with the corresponding non-cancerous liver tissues. The relative expression ratio compared to corresponding non-cancerous tissue of these 12 tumors ranged from 0.7 to 16.0. Up-regulated expression of LGN (Cy3:Cy5 intensity ratio, >2.0) was observed in 10 of the 12 HCCs that passed through the cutoff filter (bothCy3 andCy5 signals greater than 25,000) (Fig. 10a) . The elevated expression of LGN was also confirmed in additional ten HCC cases by semi-quantitative RT-PCR performed using a primer set, forward: 5 ' -ATCTGAAGCACTTAGCAATTGC (SEQ ID NO: 36), reverse: 5 ' -CTGTAGCTCAGACCAAGAACC (SEQ ID NO: 37) , similarly to Example 1-1 (Fig. 10b) .
3-2. Genomic structure of LGN LGN cDNA consists of 2,336 nucleotides and encodes a 677 amino acid peptide. Comparison of the cDNA sequence with genomic sequences disclosed that the LGN gene consists of 14 exons (Fig. 11) .
3-3. Subcellular localization of LGN
The pcDNA3.1-myc/His-LGN plasmid expressing c-myc-tagged LGN protein was transiently transfected into COS7 cells. A 72 kDa-band corresponding to myc-tagged LGN protein was detected by immunoblot analysis in the same manner as in Example 1-3 (Fig.12) . Similarly, immunocytochemical staining was performed as in Example 1-3 and the results revealed that the tagged LGN protein was present in the cytoplasm and nucleus in the cells.
3-4. LGN gene transfer can promote cell growth
To analyze the effect of LGN on cell growth, a colony-formation assay was carried out as in Example 1-4 by transfecting NIH3T3, SNU423, Alexander and SNU475 cells with a plasmid expressing LGN (pcDNA3.1-myc/His-LGN) . Compared with a control plasmid (pcDNA3.1-myc/His-LacZ) , pcDNA3.1-myc/His-LGN produced markedly a larger number of colonies in these cells (Fig. 13a) . This result was confirmed
by three independent experiments .
To further investigate the effect of LGN on cell growth, NIH3T3 cells that stably expressed exogeneous LGN (NIH3T3-LGN cells) were established. NIH3T3-LGNcells showedhigher growth rate than control NIH3T3-LacZ cells (Fig. 13b) .
3-5. Antisense S-oligonucleotides of LGN suppressed growth of human hepatoma SNU475 cells
The following five pairs of control (sense) and antisense S-oligonucleotides corresponding to LGN were synthesized and then transfected into SNU423 cells.
Antisense: antisense 1 5 ' -CCATCGAGTCATATTA -3' (SEQ ID NO: 38) antisense 2 5 ' -TTCCTCCATCGAGTCA -3' (SEQ ID NO: 39) antisense 3 5 ' -AAATTTTCCTCCATCG -3' (SEQ ID NO: 40) antisense 4 5 ' -AGTCTTACCTGTAACG -3' (SEQ ID NO: 41) ; and antisense 5 5 ' -GCTTCCATTCTACAAA -3' (SEQ ID NO: 42).
Sense: sense 1 5 '-TAATATGACTCGATGG-3 ' (SEQ ID NO: 43) sense 2 5 '-TGACTCGATGGAGGAA-3 ' (SEQ ID NO: 44) sense 3 5 '-CGATGGAGGAAAATTT-3 ' (SEQ ID NO: 45) sense 4 5 '-CGTTACAGGTAAGACT-3 ' (SEQ ID NO: 46); and sense 5 5 '-TTTGTAGAATGGAAGC-3 ' (SEQ ID NO: 47)
The antisense S-oligonucleotide encompassing the initiation codon (antisense 3) significantly suppressed expression of LGN compared to control S-oligonucleotide (sense 3) 12 hours after transfection (Fig. 14a) . Six days after transfection, the number of surviving cells transfected with antisense 3 were markedly fewer than that with control sense 3 (Fig. 14b) . Consistent results were obtained in three independent experiments .
Industrial Applicability
The present invention provides cDNA nucleotide sequences and polypeptide amino acid sequence of DDEFLl , VANGLl or LGN, which have been found to be commonly up-regulated in hepatocellular carcinomas. Thus, these polypeptides can be used as markers to determine the presence or absence of liver cancers . The information of these nucleotide sequences enables one to design probes and primers to detect or amplify the DDEFLl , VANGLl or LGN genes. It also enables synthesis of antisense nucleotide sequence that inhibits expression of the DDEFLl , VANGLl orLGWpolypeptides . The amino acid sequence information enables one to prepare antibodies that bind to the DDEFLl , VANGLl or LGN polypeptides. The probes and primers as well as the antibodies are useful as a reagent for detecting hepatocellular carcinomas. Furthermore, the present inventors demonstrated that suppressing the expression of DDEFLl , VANGLl or LGN by antisense oligonucleotides markedly decreases growth of HCC cells. Thus, the antisense oligonucleotides can be used to inhibit growth of HCC cells. The present invention also contributes to further clarify the mechanisms of hepatocellular carcinogenesis and to discover molecular targets for development of effective drugs to treat liver cancers.