AU721946B2

AU721946B2 - Novel human chromosome 16 genes, compositions, methods of making and using same

Info

Publication number: AU721946B2
Application number: AU36838/99A
Authority: AU
Inventors: Timothy Burn; Timothy Connors; William Dackowski; Katherine Klinger; Gregory Landes; Terence Van Raay
Original assignee: Genzyme Corp
Current assignee: Genzyme Corp
Priority date: 1995-06-30
Filing date: 1999-06-28
Publication date: 2000-07-20
Anticipated expiration: 2016-06-17
Also published as: AU3683899A

Description

AUSTRALIA

Patents Act 1990 Genzyme Corporation

ORIGINAL

COMPLETE SPECIFICATION STANDARD PATENT S.

S

*5 *5

S

S. S *5

S

*5 555.

55** *5 *5 5* S *5 Invention Title: Novel human chivomosome 16 genes, compositions, methods of making and using same The following statement is a full description of this invention including the best method of performing it known to us:- NOVEL HMN CHROMOSOME GENES COMPOSITIONS METHODS OF MAKING AND USING SAME

ACKNOWLEDGEMENT

This invention was made in part with Government support under Grant No. DK44853, from the National Institutes of Health. The Government may have certain rights in this invention.

BACKGROUND OF THE INVENTION The assembly of contiguous cloned genomic :5 reagents is a necessary step in the process of disease-gene 15 identification using a positional cloning approach. The rapid development of high density genetic maps based on polymorphic simple sequence repeats has facilitated contig assembly using sequence tagged site (STS) content mapping.

Most contig construction efforts have relied on yeast artificial chromosomes (YACs), since their large insert size uses the current STS map density more advantageously than bacterial-hosted systems. This approach has been S validated for multiple human chromosomes with YAC coverage ranging from 65-95% for many chromosomes and contigs of 11 to 36 Mb being described (Chumakov et al., Nature 377 (Supp.):17 5 297 1995; Doggett et al., Nature 377 (Supp.):335- 365 1995b; Gemmill et al.,Nacure 377 (Supp.):29 9 -319, 1995; Krauter et al.,Nature 377 (Supp.):321- 333 1995; Shimizu et al., Cytogenet. Cell Genet. 70:147-182, 1995; van-Heyningen et al., Cytogenet.

Cell Genet. 69:127-158, 1995).

Despite numerous successes, the YAC cloning system is not a panacea for cloning the entire genome of complex organisms due to intrinsic limitations that result in substantial proportions of chimeric clones (Green et al., Genomics 11:658-669, 1991; Bellanne-Chantelot et al., Cell 70:1059-1068, 1992; Nagaraja et al., Nuc. Acids Res.

22:3406-3411, 1994), as well as clones that are rearranged, deleted or unstable (Neil et al.,Nuc. Acids Res. 18:1421- 1428, 1990; Wada et al., Am. J. Hum. Genet. 46:95-106, 1990; Zuo et al., Hum. Mol. Genet. 1:149-159, 1992; Szepetowski et al.,Cytogenet. Cell Genec. 69:101-107, 1995). At least some of these cloned artifacts are a product of the recombinational machinery of yeast acting on the various types of repetitive elements in mammalian DNA (Neil et al., supra. 1990; Green et al., supra. 1991; Schlessinger et al., Genomics 11:783-793, 1991; Ling et al., Nuc. Acids Res. 21:6045-6046, 1993; Kouprina et al., Genomics 21:7-17, 1994; Larionov et al., Nuc. Acids Res.

22:4154-4162, 1994).

Accordingly, alternative cloning systems must be used in concert with YAC-based approaches to complement localized YAC cloning deficiencies, to enhance the resolution of the physical map, and to provide a 20 sequence-ready resource for genome-wide DNA sequencing.

Several exon trapping methodologies and vectors have been described for the rapid and efficient isolation of coding regions from genomic DNA (Auch et al., Nuc. Acids Res.

18:6743-6744, 1990; Duyk et al., Proc. Natl. Acad. Sci., USA 87:8995-8999, 1990; Buckler et al., Proc. Nati. Acad.

Sci., USA 88:4005-4009, 1991; Church et al., Nature Genet.

6:98-105, 1994). The major advantage of exon trapping is that the expression of cloned genomic DNAs (cosmid, P1 or YAC) is driven by a heterologous promoter in tissue culture cells. This allows for coding sequences to be identified without prior knowledge of their tissue distribution or developmental stage of expression. A second advantage of exon trapping is that exon trapping allows for the identification of coding sequences from only the cloned template of interest, which eliminates the risk of characterizing highly conserved transcripts from duplicated loci. This is not the case for either cDNA selection or direct library screening.

Exon trapping has been used successfully to identify transcribed sequences in the Huntington's disease locus (Ambrose et al., Hum. Mol. Genet. 1:697-703, 1992; Taylor et al., Nature Genet.2:223-227, 1992; Duyao et al., Hum. Mol. Genet. 2:673-676, 1993) and BRCAl locus (Brody et al., Genomics 25:238-247, 1995; Brown et al.,Proc. Natl.

Acad. Sci., USA 92:4362-4366, 1995) In addition, a number of disease-causing genes have been identified using exon trapping, including the genes for Huntington's disease (The Huntington's Disease Collaborative Research Group, Cell 72:971-983, 1993), neurofibromatosis type 2 (Trofatter et al., Cell 72:791-800, 1993), Menkes disease (Vulpe et al., Nature Genet. 3:7-13, 1993), Batten Disease (The International Batten Disease Consortium, Cell 82:949-957, 1995), and the gene responsible for the majority of Long-QT syndrome cases (Wang et al., Nature Genet. 12:17-23, 1996).

A 700 kb CpG-rich region in band 16pl3.3 has been shown to contain the disease gene for 90% of the cases of autosomal dominant polycystic kidney disease (PKD1)(Germino et al., Genomics 13:144-151, 1992; Somlo et al., Genomics 13:152-158, 1992; The European Polycystic Kidney Disease Consortium, Cell 77:881-894, 1994) as well as the tuburin gene (TSC2), responsible for one form of tuberous sclerosis *o (The European Chromosome 16 Tuberous Sclerosis Consortium, Cell 75:1305-1315, 1993). An estimated 20 genes are present in this region of chromosome 16 (Germino et al., Kidney Int. Supp.39:S20-S25, 1993). Characterization of the region surrounding the PKD1 gene in 16p13.3, however, has been complicated by duplication of a portion of the genomic interval more proximally at 16pl3.1 (The European Polycystic Kidney Disease Consortium, supra. 1994).

This chromosomal segment serves as a challenging test for large-insert cloning systems in E. coli and yeast since it resides in a GC-rich isochore (Saccone et al., Proc. Natl. Acad. Sci., USA 89:4913-4917, 1992) with an abundance of CpG islands (Harris et al., Genomics 7:195- 206, 1990; Germino et al., supra. 1992), genes (Germino et al., supra. 1993) and Alu repetitive sequences (Korenberg et al., Cell 53:391-400, 1988). Chromosome 16 also contains more low-copy repeats than other chromosomes with almost 25% of its cosmid contigs hybridizing to more than one chromosomal location when analyzed by fluorescence in situ hybridization (FISH) (Okumura et al.,Cytogenet. Cell Genet. 67:61-67, 1994). These types of repeats and sequence duplications interfere with "chromosome walking" techniques that are widely used for identification of genomic DNA and pose a challenge to hybridization-based methods of contig construction. This is because these techniques rely on hybridization to identify clones containing overlapping fragments of genomic DNA; thus, there is a high likelihood of "walking" into clones derived from homologues instead of clones derived from the authentic gene. In a similar manner, the sequence duplications and chromosome 16-specific repeats also interfere with the unambiguous determination of a complete cDNA sequence that encodes the corresponding protein.

Furthermore, low copy repeats may lead to instability of this interval in bacteria, yeast and higher eukaryotes.

Thus, there is a need in the art for methods and 30 compositions which enable accurate identification of genomic and cDNA sequences corresponding to authentic genes present on highly repetitive portions of chromosome 16, as well as genes similarly situated on other chromosomes.

SUMMARY OF INVENTION In a first aspect, the present invention consists in an isolated nucleic acid encoding human ATPase binding cassette transporter (hABC3) or its complement.

In a preferred embodiment of the first aspect of the invention, the nucleic acid is mRNA. In a further embodiment the nucleic acid is DNA, more preferably the nucleic acid is DNA comprising the sequence set forth in Figure 8.

The invention further includes a nucleic acid that hybridises under stringent conditions to a nucleic acid encoding hABC3 or its complement.

More preferably, the nucleic acid may comprise any one of the following sequences: -GACGCTGGTGAAGGAGC-3'; 5' -TCGCTGACCGCCAGGAT-3'; 20 5' -CATTGCCCGTGCTGTCGTG-3'; 5' -CATCGCCGCCTCCTTCATG-3'; 5' -GCGGAGCCACCTTCATCA-3'; 5' -GACGCTGGTGAAGGAGC-3'; -ATCCTGGCGGTCAGCGA-3'; 25 5' -AGGGATTCGACATTGCC-3'; or 5' -CTTCAGAGACTCAGGGGCAT-3'.

In a second aspect, the present invention consists in antisense oligonucleotides that specifically bind to and modulate translation of the 30 isolated mRNA of hABC3.

In a third aspect, the present invention consists in an isolated human ATPase binding cassette transporter (hABC3) and biologically active fragments thereof. Preferably, the isolated hABC3 includes the amino acid sequence set forth in Figure 8.

In a fourth aspect, the present invention consists in vectors containing nucleic acids according to the first aspect of the invention together with host cells transformed with the vector.

In a fifth aspect, the present invention consists in methods for producing human ATPase binding cassette transporter (ILABC 3) comprising: culturing the host cell of the fourth aspect of the present invention in a medium and under conditions suitable for expression of said protein, and isolating said expressed protein.

In a sixth aspect, the present invention consists in an antibody that specifically binds to human ATPase binding cassette transporter. More preferably, the antibody is a monoclonal antibody.

In a seventh aspect, the present invention consists in a composition including an effective amount of oligonucleotide according to the second aspect of the present invention to modulate expression of hABC3 by passing through a cell membrane and binding specifically with rnRNA encoding ~*hABC3 in the cell so as to prevent its translation and an acceptable hydrophobic carrier capable of passing through a cell membrane.

In an eighth aspect, the present invention consists in a composition containing an effective amount of antibody according to the sixth aspect, to block binding of naturally occurring ligands to hABC3 and an acceptable 25 carrier.

In a ninth aspect, the present invention consists in a transgenic non- U. human mammnal expressing DNA encoding human ATPase binding cassette transporter.

In a final aspect, the present invention consists in a method for identifying compounds that bind to human ATPase binding cassette transporter (hABC3) said method comprising a competitive binding assay wherein cells according to the fourth aspect of the present invention are exposed to a plurality of compounds and identifying compounds which bind thereto. Such compounds are useful for modulating the activity of invention polypeptides.

BRIEF DESCRIPTION OF THE DRAWINGS Figure 1 shows a schematic diagram of the P1 contig and trapped exons. The horizontal line at the top shows the position of relevant DNA markers with the scale (in kilobases). The position of NotI sites is shown below the horizontal line. The position and orientation of the known genes is indicated by arrows with the number of exon traps obtained from each gene shown in parentheses. The position of the transcription units described in this report (A through M) are shown below the known genes. The Genbank Accession numbers of corresponding exon traps are shown below each transcriptional unit. P1 clones are indicated by the overlapping lines with the name of the clone shown above the line. The position of trapped exons which did not map to characterized transcripts are shown below the P1 contig. Vertical lines denote the interval within the P1 clone(s) detected by the exon traps in hybridization studies.

Figure 2 shows an alignment of selected exon traps with sequences in the databases. An alignment of sequences encoded by exon trap L48741 and N-acetylglucosamine-6-phosphate deacetylase from C.

elegans, E. coli and Haemophilus. The EGF repeat from netrin-l, netrin-2 and UNC-6 are shown aligned to one of the translated netrin-like exon trap (Genbank Accession No.

L75.917). An alignment of sequences from the second netrin-like exon trap (Genbank Accession No. L75916) and netrin-1 and netrin-2 is shown. An alignment of the translated Rab26-like RT-PCR product (Genbank Accession Nos. L48770-L48771) and rat Rab26. Sequences encoded by exon trap L48792 are shown aligned to sequences from the pilB transcriptional repressor from Neisseria gonorrhoeae sequences predicted by computer analysis to be encoded by cosmid F44E2.6 from C. elegans, the YCL33C gene product from yeast (Genbank Accession No. P25566), and a transcriptional repressor from Haemophilus. Periods denote positions where gaps were inserted in the protein sequence in order to maintain alignment.

Figure 3 shows 6803 bp of hNET genomic sequence from P1 clone 53.8B.

Figure 4 shows 1743 bp of hNET cDNA coding for the human homologue of the chicken netrin-2 gene.

Figure 5 shows an amino acid comparison between chicken netrin-l, chicken netrin-2 and the human homologue of netrin-2, hNET. Shaded boxes denote regions of identical homology. The laminin domains V and VI and the C-terminal domain are indicated by arrows with domain V divided into three sub-components (V-i to The asterisks identify a motif for adhesion/signaling receptors.

Figure 6 shows a graphical representation of the homology between domains of chicken netrin-1, chicken netrin-2 and the human homologue of netrin-2, hNET.

2 5 Figure 7 shows exon traps, RT-PCR products and cDNA from the hABC3 gene. Exon traps are shown above.

hABC3 cDNA is shown below the exon traps with the position of the Genetrapper selection and repair (R) oligonucleotides indicated. The position of the RT-PCR clones are shown below the cDNA. A 147 bp insertion is present in the RT-PCR product relative to the cDNA. This insertion does not disrupt the open reading frame.

Figure 8 shows 5.8 kb of cDNA encoding hABC3.

Figure 9 shows an amino acid alignment of murine ABC1 and ABC2 with hABC3. Hyphens denote gaps; asterisks denote identical residues, while periods denote conservative substitutions. The location of the ATP binding cassettes is shown by the boxed regions. Numbers at the right show the relative position of the proteins.

Figure 10 shows the region of the transcriptional map of the PKD1 locus from which Pl clones 49.10D, 109.8C and 47.2H were isolated. The open boxes represent trapped exons with their relative position indicated below the SEM L3 gene. c, r and h identify the location of the capture, repair and hybridization oligonucleotides, respectively.

Figure 11 shows the nucleotide and deduced amino acid sequence of the SEM L3 cDNA. The 5' upstream inframe stop codon is underlined and the arrows indicate the site of the polyA tract of the two shorter cDNA clones that were also isolated.

Figure 12 shows a comparison of the deduced amino acid sequences from human, bovine, murine and the SEM L3 genes. Dashes indicate sequence identity to the human L3 gene. The nuclear targeting sequence at the N-terminal end Sis shaded and the bipartite motif is boxed.

Figure 13 shows the nucleotide and deduced amino acid sequence of the hALR cDNA.

o* 9 Figure 14 shows a comparison of the deduced amino acid sequences from rat ALR and human ALR.

o9 DETAILED DESCRIPTION OF THE INVENTION All patent applications, patents, and literature references cited in this specification are hereby incorporated by reference in their entirety. In case of conflict or inconsistency, the present description, including definitions, will control.

Definitions: 1. "complementary DNA (cDNA)" is defined herein as a single-stranded or double-stranded intronless DNA molecule that is derived from the authentic gene and whose sequence, or complement thereof, encodes a protein.

2. As referred to herein, a "contig" is a continuous stretch of DNA or DNA sequence, which may be represented by multiple, overlapping, clones or sequences.

3. As referred to herein, a "cosmid" is a DNA plasmid that can replicate in bacterial cells and that accommodates large DNA inserts from about 30 to about 45 kb in length.

4. The term "P1 clones" refers to genomic DNAs cloned into vectors based on the P1 phage replication mechanisms. These vectors generally accommodate inserts of about 70 to about 105 kb (Pierce et al., Proc. Natl. Acad.

Sci., USA, 89:2056-2060, 1992).

As used herein, the term "exon trapping" refers to a method for isolating genomic DNA sequences that are flanked by donor and acceptor splice sites for RNA processing.

6. "Amplification" of DNA as used herein denotes a reaction that serves to increase the concentration of a particular DNA sequence within a mixture of DNA sequences. Amplification may be carried out using polymerase chain reaction (PCR) (Saiki et al., Science 239:487, 1988), ligase chain reaction (LCR), nucleic acidspecific based amplification (NSBA), or any method known in the art.

7. "RT-PCR" as used herein refers to coupled reverse transcription and polymerase chain reaction. This method of amplification uses an initial step in which a specific oligonucleotide, oligo dT, or a mixture of random primers is used to prime reverse transcription of RNA into single-stranded cDNA; this cDNA is then amplified using standard amplification techniques e.g. PCR.

A P1 contig containing approximately 700 kb of DNA surrounding the PKD1 and TSC2 gene was assembled from a set of 12 unique chromosome 16-derived P1 clones obtained by screening a 3 genome equivalent P1 library (Shepherd et al., Proc. Natl. Acad. Sci., USA 91:2629-2633, 1994) with 15 distinct probes. Exon trapping was used to identify transcribed sequences from this region in 16p13.3.

96 novel exon traps have been obtained containing sequences from a minimum of eighteen genes in this S: 'interval. The eighteen identified genes include five previously reported genes from the interval and a 30 previously characterized gene whose location was unknown (Table Additional exon traps have been mapped to genes based on their presence in cDNAs, RT-PCR products, or their hybridization to distinct mRNA species on Northern blots.

e C C -g Oe gee e ge C CC ge i eSeC

C

Ce.

egg Ce ge

C

.e Ce eeg'00 ego *0g 'o o 0 0~ 0 0 C0wo "CA1Vl DahCs Homoogie I I I I I Gn ca Independent Exon Trapsb Clonec Tra~nscript Size Dtabase Iluiniotufgyd1 Accession Number P valuef A 6 2 kb (c DNA) 8 kb Probable protein kinase IS. cerevifizej Z48149 6.3c-83 8 I~ 1.3 kb (cDNA) 2 5 No Signiicnthoolgy C I 0.55 kb (Exon Trap) 1.4 kb N-acelylglucosaminc-6-phosphate dcacetylase (C P34480 7.4c-73 0.6 kb RACE) eleganil D 2 Exon trap (159 bp) Ncrin-2 1G. gallitil B54665 3.7c-I I Exon trap (196 bp) -Nctrin-2 1G. &!Ii L_ B54665 6. 1 -33 E I Exon trap (100 bp) ABCI gene product IM, P41233 0.0047 F 3 1. 1 kb (RT-PCR) 7 kb ABC2 gene product [Al. inrrscidil P4 1234 3.0c-28 2.8 kb (cDNA) 7 kb ABICI gene product (Ml. mussculuil P4 1233 7.I1c-65 G 2 1.8 kb (cDNA) 2.5 kb RNA-Binding protein (Hoamo sapiens J L37368 2.6c- 176 1I 2 1.2 kb (RT-PCR) 2 5 kb _phi AP3 murcilusl S41688 2-.9c-l169 I I 0.45 lb (Exon Trap) 3.0 4.5 kb No signiicant homologies________ 2 0.24kb (RT-PCR) 2 kb Rab26 JR. non-egiciss U18771 3.6c-56 K I Exon trap (219 bp) 40S Ribosomnal protein S4 11onosapiens) P15880 7.3e- 18 I. 5 1.7 kb (cDNA) 1.6 kb 60s Ribosomil proein 1-31Lnrno sapiens) S34 195 6.7c-233 NI 1 0.7 lb (cDNA) 1.3 kb Hlypothectical 17.2 Kd pootcin IC eegains P34436 6.2cIO1 a. Gene as dctnoted in Fig. 1.

b. Number of lic trapped cxon presetnt in clotned cDNA or P1CR product.

r- Size of clone with type of clone indicated in parentheses.

d Significant homology in databases as determined by BLASIX.

e Accession Number of best htit.

f. Smallest surn probability for (the best database match.

Northern analysis was not performed due to the small size of the cxoii traps.

Up to 200 copies of LLREP3 are present in the genotne.

Exon trapping was performed using an improved trapping vector (Burn et al., Gene 161:183-187, 1995), with the resulting exon traps being characterized by DNA sequence analysis. In order to determine the relative efficiency of the exon trapping procedure, exon traps were compared to the cDNA sequences for those genes known to be in the interval around the PKD1 gene (Figure Single exon traps were obtained from the human homologue of the ERV1 (Lisowsky et al., Genomics 29:690-697, 1995) and the ATP6C proton pump genes (Gillespie et al., Proc. Natl.

Acad. Sci., USA 88:4289-4293, 1991) In contrast, eight individual exon traps were isolated from the TSC2 gene and ten from the CCNF gene (The European Chromosome 16 Tuberous Sclerosis Consortium, supra. 1993; Kraus et al., Genomics 24:27-33, 1994).

Trapped sequences from three of the exons present in the PKD1 gene were obtained (The American PKD1 Consortium, Hum.

Mol. Genet. 4:575-582, 1995; The International Polycystic Kidney Disease Consortium, Cell 81:289-298, 1995; Hughes et al., Nature Genes. 10:151-160, 1995). 16 additional exon traps from the 109.8C and 47.2H P1 clones were also obtained.

Sequences present in two exon traps (Genbank S. Accession Nos. L75926 and L75927), localizing to the region of overlap between the 96.4B and 64.12C P1 clones, were shown to contain sequences from the previously described human homologue to the murine RNPS1 gene (Genbank Accession No. L37368), encoding an S phase-prevalent DNA/RNA-binding protein (Schmidt et al.,Biochim. Biophys. Acta 1216:317- 320, 1993). A comparison of these exon traps to the dbEST database indicated that they were also contained in cDNA 52161 from the I.M.A.G.E. Consortium (Lennon et al., Genomics 33:151-152, 1996). Based on these data, the hRNPS1 gene can be mapped to 16p13.3 near DNA marker D16S291 (transcript G in Figure 1).

Two exon traps from the 1.8F P1 clone were found to have a high level of homology to the previously described murine OAP3 encoding a zinc finger-containing transcription factor (Fognani et al., EMBO J. 12:4985-4992, 1993). The mDAP3 protein, a zinc finger-containing transcription factor, is believed to function as a negative regulator for genes encoding proteins responsible for the inhibition of cell cycling (Fognani et al., supra.). The two exon traps were linked by PCR, with the resulting 1.2 kb PCR product being 85% identical at the nucleotide level to the murine <AP3 cDNA. Hybridization of the QAP3-like exon traps to the dot blotted P1 contig indicated that the gene lies in the non-overlapping region of the 1.8F P1, between the DNA markers KLH7 and GGG12 (transcript H in Figure 1).

Significant homology was also seen between two exon traps obtained from the 97.10G P1 and the rat Rab26 gene encoding a ras-related GTP-binding protein involved in the regulation of vesicular transport (Nuoffer et al, Ann.

Rev. Biochem. 63:949-990, 1994; Wagner et al., Biochem.

Biophys. Res. Comm. 207:950-956, 1995). The Rab26-like exon traps were linked by RT-PCR (transcript J in Figure 1) with the encoded sequences being 94% (83/88) identical at the protein level to Rab26 (Figure 2).

In order to correlate exon traps with individual transcripts, cDNA library screening and PCR based approaches were used to clone transcribed sequences containing selected exon traps. RT-PCR was used to link individual exon traps together in cases where the two exon traps had homology to similar sequences in the databases.

In cases where only single exon traps were available, 3' RACE or cDNA library screening was used to obtain additional sequences. Sequences from the exon traps and cloned products were used to map the position, and when possible the orientation, of the corresponding transcription units.

Six unique exon traps, containing sequences from at least eight exons, were shown to be from a transcriptional unit in the centromeric most P1 clone, 94.10H (transcript A in Figure A 2 kb cDNA linking the six exon traps was isolated and shown to hybridize to an 8 kb transcript. Additional hybridization studies indicated that the gene was oriented centromeric to telomeric, with at least 6 kb of the transcript originating from sequences centromeric of the P1 contig. Extensive homology was observed between the translated cDNA and a variety of protein kinases; however, the presence of the conserved HRDLKPEN motif encoded in exon trap L48734, as well as the partial cDNA, suggests that it encodes a serine/threonine kinase (van-der-Geer et al., Ann. Rev. Cell Bio. 10:251- 337, 1994).

cDNAs were isolated using sequences derived from a separate 94.10H exon trap (Genbank Accession No. L48738) 25 and the position and orientation of the corresponding transcription unit were determined. Two cDNA species were obtained using exon trap L48738 as a probe, with the only homology between the two species arising from the 109 bases contained in the exon trap. Using oligonucleotide probes, 30 the transcription unit was mapped to a position near the 26-6DIS DNA marker, in a telomeric to centromeric orientation; however, only one of the cDNA species mapped to the P1 contig (transcript B in Figure Based on these data, it is likely that the second cDNA species originated from a region outside of the P1 contig, possibly from the duplicated 26-6PROX marker located further centromeric in 16p13.3 (Gillespie et al., Nuc. Acids Res.

18:7071-7075, 1990).

The 110.1F P1 clone contains at least two genes in addition to the ATP6C gene. Using BLASTX to search the protein databases, significant homology was observed between sequences encoded by exon trap L48741 and the N-acetylglucosamine-6-phosphate deacetylase (nagA) proteins from C. elegans (Wilson et al., Nature 368:32-38, 1994),

E.

coli (Plumbridge, Mol. Microbiol. 3:505-515, 1989) and Haemophilus (Fleischmann et al.,Science 269:496-512, 1995).

An alignment of the nagA proteins to the translated exon trap revealed the presence of multiple conserved regions (Figure suggesting that the exon trap contains sequences from the human nagA gene. Additional sequences from the nagA-like transcript have been cloned using 3' RACE and the transcription unit mapped to a region between NotI sites 2 and 3 in Figure 1. The gene is oriented telomeric to centromeric with NotI site 2 being present in the 3' UTR of the RACE clone (transcript C in Figure 1).

o6 Two additional exon traps (Genbank Accession Nos.

L75916 and L75917), mapping to the region of overlap 0.0" between the 110.IF and 53.8B P1 clones (transcript D in Figure were shown to have homology with the chicken netrins (Kennedy et al., Cell 78:425-435, 1994; Serafini et al., Cell 78:409-424, 1994) and the C. elegans UNC-6 protein (Ishii et al., Neuron 9:873-881, 1992) (Figure 2).

The presence of the netrin-like exon traps in transcribed sequences based on RT-PCR using oligonucleotide primers (Table IV) was confirmed in spinal cord RNA. These results, indicate that a novel gene, hNET, encoding a human netrin gene has been isolated.

The netrins define a family of diffusible factors involved in axon outgrowth, while UNC-6 has been shown to play a role in the migration of mesodermal cells and axons (Ishii et al., supra. 1992). Given that netrins function as target-derived neural chemoattractants, it is likely that hNET has a restricted spatial and temporal pattern of expression.

Sequences encoded by exon trap, L75917, were shown to have significant homology with the C-terminal most epidermal growth factor (EGF) repeat found in the netrin and UNC-6 proteins (Figure The netrin-like trap, L75916, encodes sequences from the more divergent C-terminal domain of the netrins (Figure This region is the least conserved between UNC-6 and the netrins, with sequences being 63% conserved between netrin-1 and netrin-2 and 29% conserved between netrin-2 and UNC-6 (Serafini et 15 al., supra. 1994).

The high level of conservation exhibited between hNET, chicken netrins 1 and 2, C. elegans UNC-6 and the laminins suggests that hNET is implicated in signaling 20 developing neurons. For example, the chicken netrin gene product has been shown to guide and redirect migrating commissural axons to their targets in both in vitro and in vivo studies (Serafini et al., supra.; Kennedy et al., supra.). Such chemoattractants may have a significant role in axon regeneration.

Exon trapping results further show that there is a novel ATPase Binding Cassette (ABC) transporter in the PKD1 locus located between the LCN1 and D16S291 markers in a centromeric to telomeric orientation. Database searches with the exon trap sequences show homology to the murine ABC1 and ABC2 genes (Luciani et al., Genomics 21:150-159, 1994). The human homologs of murine ABC1 and ABC2 have been cloned and mapped to human chromosome 9 (Luciani et al. supra.).

Seven exon traps with homology to ABC transporters were isolated from P1 clones 30.1F, 64.12C and 96.4B. Additional sequences encoded by the ABC3 gene were obtained by RT-PCR (placenta and brain RNA as template) and library PCR (using commercially available lung cDNA library as template) using custom primers designed from the exon traps (Tables II and III). Three exon traps (L48758-L48760) were obtained from the region of overlap between the 30.1F, 64.12C and 96.4B P1 clones (transcript

F

Figure while a fourth exon (L48753) maps to the 7 9.2A P1 clone, exclusively (transcript E in Figure 1), 0 0 0* S. 0 55 0 TABLE 11: Oligonucleotides Used to Clone Additional Sequences Genea Metlodb Ollgonticleotide I' -Oligonticleotide 2 d Clone Slzee A Genetrapper IACGCCGTG~CCCATCCAGF I CAGjCGIfGGTG11I'ATG1'TCCF 2.0 kb B Genle trapper ITGGGCC7GTGCFGAACrAC', CGGCAAGCT'GGTrGAYI'AACA 1 .3 klb C 3TRACE CGGCAGAGGATGCTGrGT GCGGAGCCACCTCATCA 0.6 kb F R'r-PCR GACGCFGGTGAAGGAGC 1CGCFGACCGCCAG/AT 1 .1 kb 11 RTr-PCR CTFfCGGGAAGGI CTCACTG GrCACCGCCI1'GGAGGA*1*1 1.1 kb I RT-PCR GTGI'GGGGAAGACCrGTCTG AGGAGGCCITGYFGGT'GACA 0.24 kb L Genetrapper ACGGACACCrGGGCITC AAACGGGAGGAGGTGGA 1.7 kb M Geneirapper TGTFGGCrATGAGCT1CTFC> GCAGTCCCGATrFfGAATAI''. 0.7 kb a Gene as denoted in Fig. 1.

b. Method used to clone additional sequences. Lifetechnologies Genetrapper system, 3' RACE and RTr-PCR Nvere performed as described in the Materials and Methods.

c. Sequence of oligonuicleotides used to obtain additional sequences. For flhe Genecraipper system, this oligonucleotide was used in ihe direct selection step. In thie case of 3' RACE experiments, this oligo was the external primie. While in RT-PCR experiments, the designated Oligoiuleotide was used as a sense primer.

d. Sequence of oligonuclcoi ides. In the Genetrapper experiments, this oligoniucleotide wais used ill the repalir step. For 3' RACE experiments, this was the internal primer. For RT-PCR experiments, this wvas the antisense primer.

e Size of clone obtained using the primer pair.

'0 so S* '0 00 '0'S .A1CCW-lGCD A G-~YW7' CAI G* B3. 1) 5 8 kb RT-PCR CATI1rGCCGWfGIc GIOG GCGOAGCCACCflICAIICA ABC3 (A 12) 1.7 kt) RT-PCR GACCGOflxAcJGAGC A'1cCWIIjGjGGCACXXA ABC3 (3-12) 1.1 kb RT-PCR .AGGGATICGACATIMCC CIAGAGACTCAGGG3GCAT IA13C3 0.5 kb a Method used to clone additional sequences. Lifetechnologies Genetrapper system and RT-PCR were performed as described in the Materials and Methods.

b Sequence of oligonucleotides used to obtain additional sequences. For [the Genetrapper system, this oligonucleotide was used in the direct selection step. In the case of RT-1PCR experiments, the designated oligonucleotide was used as a sense primer.

c Sequence of oligonucleouides. In the Geneurapper experiments, this oligonucleotide was used in the repair step. For RT-PCR experiments, this was the antisense primer.

d Assigned name of isolated cDNA clones.

e. Size of clone obtained using the primer pair.

An additional exon trap from region of overlap between the 109.8C and 47.2H P1 clones was shown to contain human LLRep3 sequences (Slynn et al., Nuc. Acids Res.

18:681, 1990). Hybridization studies indicated that the LLRep3 sequences (transcript K in Figure 1) were located between the sazD and L3-like genes. The region of highest gene density appears to be at the telomeric end of this cloned interval, particularly the region between TSC2 and D16S84, with a minimum of five genes mapping to this region (transcription units K, L and M, sazD and hERV1).

Also mapped to this region, was an exon trap which is 86% identical (170/197) at the nucleotide level to the previously described rat augmenter of liver regeneration (Hagiya et al., Proc. Natl. Acad. Sci., USA 91:8142-8146, 1994). ALR is a growth factor which augments the growth of damaged liver tissue while having no effect on the resting liver. Studies have demonstrated that rat 20 ALR is capable of augmenting hepatocytic regeneration following hepacectomy 9 a a *9 9 This ALR-like exon trap was also shown to contain sequences from the recently described hERVI gene, which encodes a functional homologue to yeast ERV1 (Lisowsky et al., supra. 1995).

A 468 bp cDNA has been obtained from the human ALR gene (Figure 13). The ALR sequences encode a 119 amino acid protein which is 84.8% identical and 94.1% similar to the rat ALR protein (Figure 14).

The cloning of human ALR has significant implications in the treatment of degenerative liver diseases. For example, biologically active rat ALR has been produced from COS-7 cells expressing rat ALR cDNA (Hagiya et al. supra.). Accordingly, recombinant hALR could be used in the treatment of damaged liver. In addition, a construct expressing hALR could be used in gene therapy to treat chronic liver diseases.

Forty three of the trapped exons did not have significant homology to sequences in the protein or DNA databases, nor were ESTs (expressed sequence tags) containing sequences from the exon traps observed in dbEST.

The absence of ESTs containing sequences from these novel exon traps is not surprising since one of the criterion for selecting exon traps for further analysis was the presence of an EST in the database. These trapped exons are likely to represent bona fide products, since in many cases they were trapped multiple times from different P1 clones and in combination with flanking exons.

The present invention encompasses novel human genes an isolated nucleic acids comprising unique exon sequences from chromosome 16. The sequences described herein provide a valuable resource for transcriptional mapping and create a set of sequence-ready templates for a gene-rich interval responsible for at least two inheritable diseases.

o o Accordingly, the present invention provides isolated nucleic acids encoding human netrin (hNET), human ATPase Binding Cassette transporter (hABC3), human ribosomal L3 (SEM L3) and human augmenter of liver regeneration (hALR) polypeptides. The present invention further provides isolated nucleic acids comprising unique exon sequences from chromosome 16. The term "nucleic acids" (also referred to as polynucleotides) encompasses RNA as well as single and double-stranded DNA, cDNA and oligonucleotides. As used herein, the phrase "isolated" means a polynucleotide that is in a form that does not occur in nature.

One means of isolating polynucleotides encoding invention polypeptides is to probe a human tissue-specific library with a natural or artificially designed DNA probe using methods well known in the art. DNA probes derived from the human netrin gene, hNET, the human ABC transporter gene, hABC3, the human ribosomal protein L3 gene, SEM L3, or the human augmenter of liver regeneration gene, hALR, are particularly useful for this purpose. DNA and cDNA molecules that encode invention polypeptides can be used to obtain complementary genomic DNA, cDNA or RNA from human, mammalian, or other animal sources, or to isolate related cDNA or genomic clones by the screening of cDNA or genomic libraries, by methods described in more detail below.

15 The present invention encompasses isolated nucleic acid sequences, including sense and antisense oligonucleotide sequences, derived from the sequences shown in Figures 3, 4, 8 and 11. hNET-, hABC3-, SEM L3-, and hALR-derived sequences may also be associated with heterologous sequences, including promoters, enhancers, response elements, signal sequences, polyadenylation sequences, and the like. Furthermore, the nucleic acids can be modified to alter stability, solubility, binding affinity, and specificity. For example, invention-derived 25 sequences can further include nuclease-resistant phosphorothioate, phosphoroamidate, and methylphosphonate derivatives, as well as "protein nucleic acid" (PNA) formed by conjugating bases to an amino acid backbone as described in Nielsen et al., Science, 254: 1497, 1991. The nucleic acid may be derivatized by linkage of the a-anomer nucleotide, or by formation of a methyl or ethyl phosphotriester or an alkyl phosphoramidate linkage.

Furthermore, the nucleic acid sequences of the present invention may also be modified with a label capable of providing a detectable signal, either directly or indirectly. Exemplary labels include radioisotopes, fluorescent molecules, biotin, and the like.

In general, nucleic acid manipulations according to the present invention use methods that are well known in the art, as disclosed in, for example, Sambrook et al., Molecular Cloning, A Laboratory Manual 2d Ed. (Cold Spring Harbor, NY, 1989), or Ausubel et al., Current Protocols in Molecular Biology (Greene Assoc., Wiley Interscience,

NY

NY, 1992).

Examples of nucleic acids are RNA, cDNA, or genomic DNA encoding a human netrin, a human ABC transporter, a human ribosomal L3 subtype, or a human augmenter of liver regeneration polypeptide. Such nucleic acids may have coding sequences substantially the same as the coding sequence shown in Figures 3, 4, 8 and 11, respectively.

The present invention further provides isolated oligonucleotides corresponding to sequences within the hNET, hABC3, SEM L3, hALR genes, or within the respective cDNAs, which, alone or together, can be used to discriminate between the authentic expressed gene and homologues or other repeated sequences. Theseoligonucleotides may be from about 12 to about nucleotides in length, preferably about 18 nucleotides, may be single- or double-stranded, and may be labeled or modified as described below.

This invention also encompasses nucleic acids which differ from the nucleic acids shown in Figures 3, 4, 8 and 11, but which have the same phenotype, encode substantially the same amino acid sequence set forth in Figures 3, 4, 8 and 11, respectively. Phenotypically similar nucleic acids are also referred to as "functionally equivalent nucleic acids". As used herein, the phrase "functionally equivalent nucleic acids" encompasses nucleic acids characterized by slight and non-consequential sequence variations that will function in substantially the same manner to produce the same protein product(s) as the nucleic acids disclosed herein. In particular, functionally equivalent nucleic acids encode proteins that are the same as those disclosed herein or that have conservative amino acid variations. For example, conservative variations include substitution of a non-polar residue with another non-polar residue, or substitution of a charged residue with a similarly charged residue. These variations include those recognized by skilled artisans as those that do not substantially alter the tertiary structure of the protein.

Further provided are nucleic acids encoding human netrin, human ABC3 transporter, human ribosomal L3 subtype, and human augmenter of liver regeneration polypeptides that, by virtue of the degeneracy of the genetic code, do not necessarily hybridize to the invention nucleic acids under specified hybridization conditions. Preferred nucleic acids encoding the invention polypeptide are comprised of nucleotides that encode substantially the same amino acid sequence set forth in Figures 4, 8 and 11.

Alternatively, preferred nucleic acids encoding the invention polypeptide(s) hybridize under high stringency conditions to substantially the entire sequence, or substantial portions typically at least 12 to nucleotides) of the nucleic acid sequence set forth in Figures 3, 4, 8 and 11, respectively.

Stringency of hybridization, as used herein, refers to conditions under which polyvnucleotide hybrids are stable. As known to those of skill in the art, the stability of hybrids is a function of sodium ion concentration and temperature. (See, for example, Sambrook et al., supra.).

The present invention provides isolated polynucleotides operatively linked to a promoter of RNA transcription, as well as other regulatory sequences. As used herein, the phrase "operatively linked' refers to the functional relationship of the polynucleotide with regulatory and effector sequences of nucleotides, such as promoters, enhancers, transcriptional and translational stop sites, and other signal sequences. For example, operative linkage of a polynucleotide to a promoter refers to the physical and functional relationship between the polynucleotide and the promoter such that transcription of DNA is initiated from the promoter by an RNA polymerase that specifically recognizes and binds to the promoter, and .15 wherein the promoter directs the transcription of RNA from the polynucleotide.

S•-Promoter regions include specific sequences that are sufficient for RNA polymerase recognition, binding and transcription initiation. Additionally, promoter regions include sequences that modulate the recognition, binding and transcription initiation activity of RNA polvmerase.

Such sequences may be cis acting or may be responsive to trans acting factors. Depending upon the nature of the 5 regulation, promoters may be constitutive or regulated.

Examples of promoters are SP6, T4, T7, SV40 early promoter, cytomegalovirus (CMV) promoter, mouse mammary tumor virus (MMTV) steroid-inducible promoter, Moloney murine leukemia virus (€MLV) promoter, and the like.

Vectors that contain both a promoter and a cloning site into which a polynucleotide can be operatively linked are well known in the art. Such vectors are capable of transcribing RNA in vitro or in vivo, and are commercially available from sources such as Stratagene (La Jolla, CA) and Promega Biotech (Madison, WI). In order to optimize expression and/or in vitro transcription, it may be necessary to remove, add or alter 5' and/or 3' untranslated portions of the clones to eliminate extra, potential inappropriate alternative translation initiation codons or other sequences that may interfere with or reduce expression, either at the level of transcription or translation. Alternatively, consensus ribosome binding sites can be inserted immediately 5' of the start codon to enhance expression. Similarly, alternative codons, encoding the same amino acid, can be substituted for coding sequences of the human netrin, human ABC3 transporter, the human ribosomal L3 subtype, or the human augmenter of liver regeneration polypeptide in order to enhance transcription the codon preference of the host cell can be adopted, the presence of G-C rich domains can be reduced, 15 and the like).

Examples of vectors are viruses, such as baculoviruses and retroviruses, bacteriophages, cosmids, plasmids, fungal vectors and other recombination vehicles typically used in the art which have been described for expression in a variety of eukaryotic and prokaryotic hosts, and may be used for gene therapy as well as for simple protein expression.

25 Polynucleotides are inserted into vector genomes using methods well known in the art. For example, insert and vector DNA can be contacted, under suitable conditions, with a restriction enzyme to create complementary ends on each molecule that can pair with each other and be joined together with a ligase. Alternatively, synthetic nucleic acid linkers can be ligated to the termini of restricted polynucleotide. These synthetic linkers contain nucleic acid sequences that correspond to a particular restriction site in the vector DNA. Additionally, an oligonucleotide containing a termination codon and an appropriate restriction site can be ligated for insertion into a vector containing, for example, some or all of the following: a selectable marker gene, such as the neomycin gene for selection of stable or transient transfectants in mammalian cells; enhancer/promoter sequences from the immediate early gene of human CMV for high levels of transcription; transcription termination and RNA processing signals from for mRNA stability; SV40 polyoma origins of replication and ColE1 for proper episomal replication; versatile multiple cloning sites; and T7 and SP6 RNA promoters for in vitro transcription of sense and antisense RNA. Other means are well known and available in the art.

Also provided are vectors comprising a polynucleotide encoding human netrin, human ABC3 transporter, human ribosomal L3 subtype, and human augmenter of liver regeneration polypeptides, adapted for expression in a bacterial cell, a yeast cell, an amphibian cell, an insect cell, a mammalian cell and other animal :cells. The vectors additionally comprise the regulatory elements necessary for expression of the polynucleotide in 0 the bacterial, yeast, amphibian, mammalian or animal cells so located relative to the polynucleotide encoding human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptides as to permit expression thereof. As used herein, "expression" refers to the process by which polynucleotides are transcribed into mRNA and translated into peptides, Polypeptides, or proteins. If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA, if an appropriate eukaryotic host is selected.

Regulatory elements required for expression include promoter sequences to bind RNA polymerase and transcription initiation sequences for ribosome binding. For example, a bacterial expression vector includes a promoter such as the lac promoter and for transcription initiation the Shine- Dalgarno sequence and the start codon AUG (Sambrook et al., supra). Similarly, a eukaryotic expression vector includes a heterologous or homologous promoter for RNA polymerase II, a downstream polyadenylation signal, the start codon AUG, and a termination codon for detachment of the ribosome. Such vectors can be obtained commercially or assembled by the sequences described in methods well known in the art, for example, the methods described above for constructing vectors in general. Expression vectors are useful to produce cells that express the invention receptor.

This invention provides a transformed host cell that recombinantly expresses the human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptides. Invention host cells have been transformed with a polynucleotide encoding a 15 human netrin, a human ABC3 transporter, a human ribosomal L3 subtype, or a human augmenter of liver regeneration polypeptide. An example is a mammalian cell comprising a plasmid adapted for expression in a mammalian cell. The plasmid contains a polynucleotide encoding human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptide and the regulatory elements necessary for expression of the invention protein.

25 Appropriate host cells include bacteria, archebacteria, fungi, especially yeast, plant cells, insect cells and animal cells, especially mammalian cells. Of S" particular interest are E. coli, B. Subtilis, Saccharomyces cerevisiae, SF9 cells, C129 cells, 293 cells, Neurospora and CHO cells, COS cells,.HeLa cells, and immortalized mammalian myeloid and lymphoid cell lines. Preferred replication systems include M13, ColE1, SV40, baculovirus, lambda, adenovirus, artificial chromosomes, and the like.

A large number of transcription initiation and termination regulatory regions have been isolated and shown to be effective in the transcription and translation of heterologous proteins in the various hosts. Examples of these regions, methods of isolation, manner of manipulation, and the like, are known in the art. Under appropriate expression conditions, host cells can be used as a source of recombinantly produced hNET, hABC3, SEM L3 and/or hALR.

Nucleic acids (polynucleotides) encoding invention polypeptides may also be incorporated into the genome of recipient cells by recombination events. For example, such a sequence can be microinjected into a cell, and thereby effect homologous recombination at the site of an endogenous gene encoding hNET, hABC3, SEM L3, and/or hALR an analog or pseudogene thereof, or a sequence with substantial identity to a hNET-, hABC3-, SEM L3-, or hALRencoding gene. Other recombination-based methods such as nonhomologous recombinations or deletion of endogenous gene S by homologous recombination, especially in pluripotent cells, may also be used.

*99 9 9 The present invention provides isolated peptides, polypeptides(s) and/or protein(s) encoded by the invention nucleic acids. The present invention also encompasses isolated polypeptides having a sequence encoded by hNET, hABC3, SEM L3, and hALR genes, as well as peptides of six 25 or more amino acids derived therefrom. The polypeptide(s) may be isolated from human tissues obtained by biopsy or autopsy, or may be produced in a hecerologous cell by recombinant DNA methods as described herein.

As used herein, the term "isolated" means a protein molecule free of cellular components and/or contaminants normally associated with a native in vivo environment. Invention polypeptides and/or proteins include any natural occurring allelic variant, as well as recombinant forms thereof. Invention polypeptides can be isolated using various methods well known to a person of skill in the art.

The methods available for the isolation and purification of invention proteins include, precipitation, gel filtration, and chromatographic methods including molecular sieve, ion-exchange, and affinity chromatography using e.g. hNET-, hABC3-, SEM L3-, and/or hALR-specific antibodies or ligands. Other well-known methods are described in Deutscher et al., Guide to Protein Purification: Methods in Enzymology Vol. 182, (Academic Press, 1990). When the invention polypeptide to be purified is produced in a recombinant system, the recombinant expression vector may comprise additional sequences that encode additional amino-terminal or carboxyterminal amino acids; these extra amino acids act as "tags" 15 for immunoaffinity purification using immobilized antibodies or for affinity purification using immobilized ligands.

Peptides comprising hNET-, hABC3-, SEM L3-or hALR-specific sequences may be derived from isolated larger hNET, hABC3, SEM L3, or hALR polypeptides described above, using proteolytic cleavages by e.g. proteases such as trypsin and chemical treatments such as cyanogen bromide that are well-known in the art. Alternatively, peptides up 25 to 60 residues in- length can be routinely synthesized in milligram quantities using commercially available peptide synthesizers.

An example of the means for preparing the invention polypeptide(s) is to express polynucleotides encoding hNET, hABC3, SEM L3, and/or hALR in a suitable host cell, such as a bacterial cell, a yeast cell, an amphibian cell oocyte), an insect cell drosophila) or a mammalian cell, using methods well known in the art, and recovering the expressed polypeptide, again using well-known methods. Invention polypeptides can be isolated directly from cells that have been transformed with expression vectors, described below in more detail.

The invention polypeptide, biologically active fragments and functional equivalents thereof can also be produced by chemical synthesis. As used herein, "biologically active fragment" refers to any portion of the polypeptide represented by the amino acid sequence in Figures 4, 8 and 11 that can assemble into an active protein. Synthetic polypeptides can be produced using Applied Biosystems, Inc.

Model 430A or 431A automatic peptide synthesizer (Foster City, CA) employing the chemistry provided by the manufacturer.

Modification of the invention nucleic acids, polynucleotides, polypeptides, peptides or proteins with 15 the following phrases: "recombinantly expressed/produced", "isolated", or "substantially pure", encompasses nucleic acids, polynucleotides, polypeptides, peptides or proteins that have been produced in such form by the hand of man, and are thus separated from their native in vivo cellular environment. As a result of this human intervention, the recombinant nucleic acids, polynucleotides, polypeptides, peptides and proteins of the invention are useful in ways that the corresponding naturally occurring molecules are not, such as identification of selective drugs or 25 compounds.

Sequences having "substantial sequence homology" are intended to refer to nucleotide sequences that share at least about 90% identity with invention nucleic acids; and amino acid sequences that typically share at least about amino acid identity with invention polypeptides. It is recognized, however, that polypeptides or nucleic acids containing less than the above-described levels of homology arising as splice variants' or that are modified by conservative amino acid substitutions, or by substitution of degenerate codons are also encompassed within the scope of the present invention.

The present invention provides a nucleic acid probe comprising a polynucleotide capable of specifically hybridizing with a sequence included within the nucleic acid sequence encoding human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptide, for example, a coding sequence included within the nucleotide sequence shown in Figures 3, 4, 8 and 11, respectively.

As used herein, a "nucleic acid probe" may be a sequence of nucleotides that includes from about 12 to about 60 contiguous bases set forth in Figures 3, 4, 8 and 11, preferably about 18 nucleotides, may be single- or 15 double-stranded, and may be labeled or modified as described herein. Preferred regions from which to construct probes include 5' and/or 3' coding sequences, sequences predicted to encode transmembrane domains, sequences predicted to encode cytoplasmic loops, signal sequences, ligand binding sites, and the like.

.Full-length or fragments of cDNA clones can also be used as probes for the detection and isolation of related genes. When fragments are used as probes, 25 preferably the cDNA sequences will be from the carboxyl end-encoding portion of the cDNA, and most preferably will include predicted transmembrane domain-encoding portions of :the cDNA sequence. Transmembrane domain reaions can be predicted based on hydropathy analysis of the deduced amino acid sequence using, for example, the method of Kyte and Doolittle Mol. Biol. 157:105, 1982).

As used herein, the phrase "specifically hybridizing" encompasses the ability of a polynucleotide to recognize a sequence of nucleic acids that are complementary thereto and to form double-helical segments via hydrogen bonding between complementary base pairs.

Nucleic acid probe technology is well known to those skilled in the art who will readily appreciate that such probes may vary greatly in length and may be labeled with a detectable agent, such as a radioisotope, a fluorescent dye, and the like, to facilitate detection of the probe.

Invention probes are useful to detect the presence of nucleic acids encoding human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptides. For example, the probes can be used for in situ hybridizations in order to locate biological tissues in which the invention gene is expressed. Additionally, synthesized oligonucleotides complementary to the nucleic acids of a polynucleotide encoding human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptides are useful as probes for detecting the invention genes, their associated mRNA, or for the isolation of related genes using homology screening of genomic or cDNA libraries, or by using amplification 20 techniques well known to one of skill in the art.

Also provided are antisense oligonucleotides a* having a sequence capable of binding specifically with any portion of an mRNA that encodes human netrin, human ABC3 25 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptide so as to prevent translation of the mRNA. The antisense oligonucleotide may have a sequence capable of binding specifically with any portion of the sequence of the cDNA encoding human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptide. As used herein, the phrase "binding specifically" encompasses the ability of a nucleic acid sequence to recognize a complementary nucleic acid sequence and to form doublehelical segments therewith via the formation of hydrogen bonds between the complementary base pairs. An example of an antisense oligonucleotide is an antisense used herein, the term "acceptable carrier" encompasses any of the standard pharmaceutical carriers, such as phosphate buffered saline solution, water and emulsions such as an oil/water or water/oil emulsion, and various types of wetting agents.

Also provided are antibodies having specific reactivity with the human netrin, the human ABC3 transporter, the human ribosomal L3 subtype, or the human augmenter of liver regeneration polypeptides of the subject invention. Active fragments of antibodies are encompassed within the definition of "antibody". Invention antibodies can be produced by methods known in the art using the invention proteins or portions thereof as antigens. For 15 example, polyclonal and monoclonal antibodies can be produced by methods well known in the art, as described, for example, in Harlow and Lane, Antibodies: A Laboratory Manual (Cold Spring Harbor Laboratory 1988).

The polypeptides of the present invention can be used as the immunogen in generating such antibodies.

Alternatively, synthetic peptides can be prepared (using commercially available synthesizers) and used as immunogens. Where natural or synthetic hNET-, hABC3-,

SEM

25 L3-, and/or hALR-derived peptides are used to induce a hNET-, hABC3-, SEM L3-, and/or hALR-specific immune response, the peptides may be conveniently coupled to an a.

suitable carrier such as KLH and administered in a suitable adjuvant such as Freund's. Preferably, selected peptides are coupled to a lysine core carrier substantially according to the methods of Tam, Proc. Natl. Acad. Sci, USA 85:5409-5413, 1988. The resulting antibodies may be modified to a monovalent form, such as, for example, Fab, Fab 2 FAB', or FV. Anti-idiotypic antibodies may also be prepared using known methods.

In one embodiment, normal or mutated hNET, hABC3, SEM L3, or hALR polypeptides are used to immunize mice after which their spleens are removed, and splenocytes used to form cell hybrids with myeloma cells and obtain clones of antibody-secreted cells according to techniques that are standard in the art. The resulting monoclonal antibodies are screened for specific binding to hNET, hABC3, SEM L3, and/or hALR proteins or hNET-, hABC3-, SEM L3-, and/or hALR-related peptides.

In another embodiment, antibodies are screened for selective binding to normal or mutated hNET, hABC3,

SEM

L3, or hALR sequences. Antibodies that distinguish between normal and mutant forms of hNET, hABC3, SEM L3, or hALR may be used in diagnostic tests (see below) employing

ELISA,

EMIT, CEDIA, SLIFA, and the like. Anti- hNET, hABC3 SEM L3, or hALR antibodies may also be used to perform subcellular and histochemical localization studies.

Finally, antibodies may be used to block the function of Z0 the hNET, hABC3, SEM L3, and/or hALR polypeptide, whether normal or mutant, or to perform rational drug design studies to identify and test inhibitors of the function using an anti-idiotypic antibody approach).

25 Amino acid sequences can be analyzed by methods well known in the art to determine whether they encode hydrophobic or hydrophilic domains of the corresponding polypeptide. Altered antibodies such as chimeric, humanized, CDR-grafted or bifunctional antibodies can also be produced by methods well known in the art. Such antibodies can also be produced by hybridoma, chemical synthesis or recombinant methods described, for example, in Sambrook et al., supra., and Harlow and Lane, supra. Both anti-peptide and anti-fusion protein antibodies can be used. (see, for example, Bahouth et al., Trends Pharmacol.

Sci. 12:338, 1991; Ausubel et al., supra.).

Invention antibodies can be used to isolate invention polypeptides. Additionally, the antibodies are useful for detecting the presence of the invention polypeptides, as well as analysis of polypeptide localization, composition, and structure of functional domains. Methods for detecting the presence of a human netrin, a human ABC3 transporter, a human ribosomal L3 subtype, or a human augmenter of liver regeneration polypeptide comprise contacting the cell with an antibody that specifically binds to the polypeptide, under conditions permitting binding of the antibody to the polypeptide, detecting the presence of the antibody bound to the cell, and thereby detecting the presence of the invention polypeptide on the cell. With respect to the 15 detection of such polypeptides, the antibodies can be used S: for in vitro diagnostic or in vivo imaging methods.

Immunological procedures useful for in vitro detection of the target human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptide in a sample include immunoassays that employ a detectable antibody. Such immunoassays include, for example, ELISA, Pandex microfluorimetric assay, agglutination assays, flow 9 25 cytometry, serum diagnostic assays and immunohistochemical staining procedures which are well known in the art. An antibody can be made detectable by various means well known S" in the art. For example, a detectable marker can be directly or indirectly attached to the antibody. Useful markers include, for example, radionuclides, enzymes, fluorogens, chromogens and chemiluminescent labels.

For in vivo imaging methods, a detectable antibody can be administered to a subject and the binding of the antibody to the invention polypeptide can be detected by imaging techniques well known in the art.

Suitable imaging agents are known and include, for example, gamma-emitting radionuclides such as 111 In, 99 mTc, 51 Cr and the like, as well as paramagnetic metal ions, which are described in U.S. Patent No. 4,647,447. The radionuclides permit the imaging of tissues by gamma scintillation photometry, positron emission tomography, single photon emission computed tomography and gamma camera whole body imaging, while paramagnetic metal ions permit visualization by magnetic resonance imaging.

The invention provides a transgenic non-human mammal that is capable of expressing nucleic acids encoding a human netrin, a human ABC3 transporter, a human ribosomal L3 subtype, or a human augmenter of liver regeneration polypeptide. Also provided is a transgenic non-human 15 mammal capable of expressing nucleic acids encoding a human netrin, a human ABC3 transporter, a human ribosomal L3 subtype, or a human augmenter of liver regeneration polypeptide so mutated as to be incapable of normal activity, does not express native protein.

The present invention also provides a transgenic S non-human mammal having a genome comprising antisense nucleic acids complementary to nucleic acids encoding human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptide so placed as to be transcribed into antisense mRNA complementary to mRNA encoding a human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptide, which hybridizes thereto and, thereby, reduces the translation thereof. The polynucleotide may additionally comprise an inducible promoter and/or tissue specific regulatory elements, so that expression can be induced, or restricted to specific cell types. Examples of polynucleotides are DNA or cDNA having a coding sequence substantially the same as the coding sequence shown in Figures 3, 4, 8 and 11. Examples of non-human transgenic mammals are transgenic cows, sheep, goats, pigs, rabbits, rats and mice. Examples of tissue specificity-determining elements are the metallothionein promoter and the L7 promoter.

Animal model systems which elucidate the physiological and behavioral roles of invention polypeptides are produced by creating transgenic animals in which the expression of the polypeptide is altered using a variety of techniques. Examples of such techniques include the insertion of normal or mutant versions of nucleic acids encoding human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptide by microinjection, retroviral infection or other means well known to those skilled in the .15 art, into appropriate fertilized embryos to produce a ransgenic animal. See, for example, Carver, et al., Bio/Technology 11:1263-1270, 1993; Carver et al., Cytotechnology 9:77-84, 1992; Clark et al, Bio/Technology 7:487-492, 1989; Simons et al., Bio/Technology 6:179-183, 1988; Swanson et al., Bio/Technology 10:557-559, 1992; Velander et al., Proc. Natl. Acad. Sci. USA 89:12003-12007, 1992; Hammer et al., Nature 315:680-683, 1985; Krimpenfort et al., Bio/Technology 9:844-847, 1991; Ebert et al., Bio/Technology 9:835-838, 1991; Simons et al., Nature 328:530-532, 1987; Pittius et al., Proc. Natl. Acad. Sci.

USA 85:5874-5878, 1988; Greenberg et al., Proc. Natl. Acad.

Sci. USA 88:8327-8331, 1991; Whitelaw et al., Transg. Res.

1:3-13, 1991; Gordon et al., Bio/Technology 5:1183-1187, 1987; Grosveld et al., Cell 51:975-985, 1987; Brinster et al., Proc. Natl. Acad. Sci. USA 88:478-482, 1991; Brinster et al., Proc. Natl. Acad. Sci. USA 85:836-840, 1988; Brinster et al., Proc. Natl. Acad. Sci. USA 82:4438-4442, 1985; Al-Shawi et al., Mol. Cell. Biol. 10(3):1192-1198, 1990; Van Der Putten et al., Proc. Natl. Acad. Sci. USA 82:6148-6152, 1985; Thompson et al., Cell 56:313-321, 1989; Gordon et al., Science 214:1244-1246, 1981; and Hogan et al., Manipulating the Mouse Embryo: A Laboratory Manual (Cold Spring Harbor Laboratory, 1986).

Another technique, homologous recombination of mutant or normal versions of these genes with the native gene locus in transgenic animals, may be used to alter the regulation of expression or the structure of the invention polypeptides (see, Capecchi et al., Science 244:1288 ,1989; Zimmer et al., Nature 338:150, 1989). Homologous recombination techniques are well known in the art.

Homologous recombination replaces the native (endogenous) gene with a recombinant or mutated gene to produce an animal that cannot express native (endogenous) protein but can express, for example, a mutated protein which results 15 in altered expression of the human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptide.

00** In contrast to homologous recombination, microinjection adds genes to the host genome, without removing host genes. Microinjection can produce a transgenic animal that is capable of expressing both endogenous and exogenous human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptides. Inducible promoters can be linked to the coding region of the nucleic acids to provide a means to regulate expression of the transgene.

*0J Tissue-specific regulatory elements can be linked to the coding region to permit tissue-specific expression of the transgene. Transgenic animal model systems are useful for in vivo screening of compounds for identification of ligands, agonists and antagonists, which activate or inhibit polypeptide responses.

The nucleic acids, oligonucleotides (including antisense), vectors containing same, transformed host cells, polypeptides, as well as antibodies of the present invention, can be used to screen compounds in vitro to determine whether a compound functions as a potential agonist or antagonist to the invention protein. These in vitro screening assays provide information regarding the function and activity of the invention protein, which can lead to the identification and design of compounds that are capable of specific interaction with invention proteins.

In accordance with still another embodiment of the present invention, there is provided a method for identifying compounds which bind to human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptides. The invention proteins may be employed in a competitive binding 15 assay. Such an assay can accommodate the rapid screening of a large number of compounds to determine which compounds, if any, are capable of binding to invention polypeptides. Subsequently, more detailed assays can be 'carried out with those compounds found to bind, to further determine whether such compounds act as modulators, agonists or antagonists of invention polypeptides.

In accordance with another embodiment of the present invention, transformed host cells that 2 5 recombinantly express invention polypeptides can be contacted with a test compound, and the modulating effect(s) thereof can then be evaluated by comparing the human netrin, human ABC3 transporter, human ribosomal L3 subtype, or human augmenter of liver regeneration polypeptide-mediated response in the presence and absence of test compound, or by comparing the response of test cells or control cells cells that do not express invention polypeptides), to the presence of the compound.

As used herein, a compound or a signal that "modulates the activity" of an invention polypeptide refers to a compound or a signal that alters the activity of the human netrin, the human ABC3 transporter, the human ribosomal L3 subtype, or the human augmenter of liver regeneration polypeptide so that the activity of the invention polypeptide is different in the presence of the compound or signal than in the absence of the compound or signal. In particular, such compounds or signals include agonists and antagonists. An agonist encompasses a compound or a signal that activates polypeptide function.

Alternatively, an antagonist includes a compound or signal that interferes with polypeptide function. Typically, the effect of an antagonist is observed as a blocking of agonist-induced protein activation. Antagonists include competitive and non-competitive antagonists. A competitive antagonist (or competitive blocker) interacts with or near the site specific for agonist binding. A non-competitive antagonist or blocker inactivates the function of the polypeptide by interacting with a site other than the agonist interaction site.

The following examples are intended to illustrate the invention without limiting the scope thereof.

Example I: Contig Assembly 25 A. Cosmids Multiple cosmids were used as reagents to initiate walks in YAC and P1 libraries. Clones 16-166N (D16S277), 16-191N (D16S279), 16-198N (D16S280) and 16-140N (D16S275) were previously isolated from a cosmid library (Lerner et al., Mamm. Genome 3:92-100, 1992). Cosmids (D16S84), c291 (D16S291), cAJ42 (ATP6C) and cKG8 were recovered from total human cosmid libraries (made in-house or by Stratagene, La Jolla, CA) using either a cloned insert (CMM65) or sequence-specific oligonucleotides as probe. The c326 cosmid contig and clone 413C12 originated from a flow-sorted chromosome 16 library (Stallings et al., Genomics 13(4):1031-1039, 1992). The 42 c326 contig was comprised of clones 2H2, 77E8, 325A11 and 325B10.

B. YACs Screening of gridded interspersed-repetitive sequence (IRS pools from Mark I, Mark II and Mega-YAC libraries) with cosmid-specific IRS probes was as previously described (Liu et al., Genomics 26:178-191, 1995). IRS probes were made from cosmids 16-166N, 16-191N, cAJ42, 16-198N, 325A11, cCMM65, and 16-140N. Biotinylated YAC probes were generated by nick-translating complex mixtures of IRS products from each YAC. Mixtures of sufficient complexity were achieved by performing independent DNA amplifications of total yeast DNA using various Alu primers (Lichter et al., Proc. Natl. Acad.

Sci., USA 87:6634-6638, 1990) and then combining the appropriate reactions containing the most diverse products.

0 C. Pls Chromosome walking experiments were done using a single set of membranes which contained the gridded P1 library pools (Shepherd et al., supra. 1994). The gridded filters were kindly provided by Dr. Mark Leppert and the Technology Access Section of the Utah Center for Human 25 Genome Research at the University of Utah. P1 gridded membranes were screened using end probes derived from a set of chromosome 16 cosmids (see above) and P1 clones as they were identified. Both RNA transcripts and bubble-PCR products were utilized as end probes.

D. Probes Radiolabeled transcripts were generated using restriction enzyme digested cosmids or Pls (Alul, HaeIII, Rsal, TaqI) as template for phage RNA polymerases T3, T7 and SP6. The T3 and T7 promoter elements were present on the cosmid-derived templates while T7 and SP6 promoter sequences were contained on the Pl-based templates.

Transcription reactions were performed as recommended by the manufacturer (Stratagene, La Jolla, CA) in the presence of [aP 32 ]-ATP (Amersham, Arlington Heights,

IL).

Bubble-PCR products were synthesized from restriction enzyme digested Pls (AluI, HaeIII, Rsal, TaqI) Bubble adaptors with appropriate overhangs and phosphorylated 5' ends were ligated to digested P1 DNA basically as described for YACs (Riley et al., Nuc. Acids Res. 18:2887-2890, 1990). The sequence of the universal vectorette primer derived from the bubble adaptor sequence was 5'-GTTCGTACGAGAATCGCT-3', and differed from that of Riley and co-workers with 12 fewer 5' nucleotides. The Tm of the truncated vectorette primer more closely matched 15 that of the paired amplimer from the vector-derived promoter sequence (SP6, T7). The desired bubble-PCR product was gel purified prior to radiolabeling (Feinberg et al., Anal. Biochem. 132:6-13, 1983; Feinberg and S Vogelstein, Anal. Biochem. 137:266-267, 1984) The specificity of all end probes was determined prior to their use on the single set of gridded P1 filter arrays. Radiolabeled probes were pre-annealed to Cotl DNA as recommended (Life Technologies Inc., Gaithersburg,

MD)

25 and then hybridized to strips of nylon membrane to which were bound 10-20 ng each of the following DNAs: the cloned genomic template used to create the probe; one or more unrelated cloned genomic DNAs; cloned vector (no insert); and human genomic

DNA.

Hybridizations were performed in CAK solution SSPE, 1% SDS, 5x Denhardt's Solution, 100 mg/mL torula RNA) at 650C overnight. Individual end probes were present at a concentration of 5x10 5 cpm/mL. Hybridized membranes were washed to a final stringency of 0.x SSC/0.1% SDS at 650 C.

The hybridization results were visualized by autoradiography. Probes which hybridized robustly to their respective cloned template while not hybridizing to unrelated cloned DNAs, vector DNA or genomic DNA were identified and used to screen the gridded P1 filters.

Hybridization to the arrayed P1 pools was performed as described for the nylon membrane strips (above) except that multiple probes were used simultaneously. Positive clones were identified, plated at a density of 200-500 cfu per 100 mm plate (LB plus 25 mg/mL kanamycin), lifted onto 82 mm HATF membranes (Millipore, Bedford, MA), processed for hybridization (Sambrook et al., supra.) and then rescreened with the complex probe mixture.

15 A single positive clone from each pool was selected and replated onto a master plate. To identify the colony purified genomic P1 clone and its corresponding probe, multiple P1 DNA dot blots were prepared and each t.o hybridized to individual radiolabeled probes. All S 20 hybridizations contained a chromosome 16pl3.3 reference probe, e.g. cAJ42, as well as a uniquely labeled P1 DNA probe.

Example II: Exon Trapping Genomic P1 clones were prepared for exon trapping experiments by digestion with PstI, double digestion with BamHI/BglII, or by partial digestion with limiting amounts of Sau3AI. Digested P1 DNAs were ligated to BamHI-cut and dephosphorylated vector, pSPL3B, while PstI digested P1 DNA was subcloned into PstI-cut dephosphorylated vector, pSPL3B.

Ligations were performed in triplicate using ng of vector DNA and 1, 3 or 6 mass equivalents of digested P1 DNA. Transformations were performed following an overnight 16 0 C incubation, with 1/10 and 1/2 of the transformation being plated on LB (ampicillin) plates.

After overnight growth at 37 0 C, colonies were scraped off those plates having the highest transformation efficiency (based on a comparison to "no insert" ligation controls) and miniprepped using the alkaline lysis method. To examine the proportion of the pSPL3B containing insert, a small portion of the miniprep was digested with HindIII, which cuts pSPL3B on each side of the multiple cloning site.

Example III: RNA Preparation Approximately 10 gg of the remaining miniprep

DNA

was ethanol precipitated, resuspended in 100 p4 of sterile PBS and electroporated into approximately 2 x 106 COS-7 cells (in 0.7 ml of ice cold PBS) using a BioRad GenePulser electroporator (1.2 kV, 25 4F and 200 The electroporated cells were incubated for 10 min. on ice prior to their addition to a 100 mm tissue culture dish containing 10 ml of prewarmed complete DMEM.

Cytoplasmic RNA was isolated 48 hours S post-transfection. The transfected COS-7 cells were removed from tissue culture dishes using 0.25% trypsin/1 mM EDTA (Life Technologies Inc., Gaithersburg,

MD).

Trypsinized cells were washed in DMEM/10% FCS and resuspended in 400 p.1 of ice cold TKM (10 mM Tris-HC1 pH 7.5, 10 mM KC1, 1 mM MgC1 2 supplemented with 1 p1 of RNAsin (Promega, Madison, WI). After -adding 20 .l of Triton X-100, the cells were incubated for 5 min. on ice.

The nuclei were removed by centrifugation at 1200 rpm for min. at 40C. Thirty microliters of 5% SDS was added to the supernatant, with the cytoplasmic RNA being further purified by three rounds of extraction using phenol/chloroform/isoamyl alcohol (24:24:1). The cytoplasmic RNA was ethanol precipitated and resuspended in p. of H 2 0.

Reverse transcription and PCR were performed on the cytoplasmic RNA prepared above as described (Church et al., supra. 1994) using commercially available exon trapping oligonucleotides (Life Technologies Inc., Gaithersburg, MD). The resulting CUA-tailed products were shotgun subcloned into pAMP10 as recommended by the manufacturer (Life Technologies Inc.). Random clones from each ligation were analyzed by colony PCR using secondary PCR primers (Life Technologies Inc.).

Miniprep DNA containing the pAMP0l/exon traps was prepared from overnight cultures by alkaline lysis using the EasyPrep manifold or a QIAwell 8 system according to the manufacturers' instructions (Pharmacia, Pistcataway, NJ and Qiagen Inc., Chatsworth, CA, respectively). DNA products containing trapped exons, based on comDarison to the 177 bp "vector only" DNA product, were selected for sequencing.

oo* o* Example IV: Sequencing Trapped exon DNA was sequenced with fluoroscein-labeled M13 universal and reverse primers using the Pharmacia Autoread Sequencing kit and Pharmacia

ALF

automated DNA sequencers. Sequences were assembled and analyzed using the Sequencher sequence analysis software (GeneCodes, Ann Arbor, MI).

Sequences from the exon traps were used to search the nonredundanc nucleotide and dbEST databases using the BLASTN program (Altschul et al., J. Mol. Biol. 215:403-410, 1990), while protein databases were searched using BLASTX (Altschul et al., supra. 1990; Gish and States, Nature Genet. 3:266-272, 1993).

Example V: RT-PCR, RACE, and cDNA Isolation 20 Based upon the sequence determined (above) two oligonucleotide primers (Table II) were designed for each *a.

exon trap using Oligo 4.0 (National Biosciences Inc., Plymouth,

MN).

5 2 25 To determine which tissue-specific library to screen for transcript or cDNA, RT-PCR reactions and/or PCR reactions were performed using different tissue-derived RNAs and/or cDNA libraries, respectively, as template with the oligonucleotide primers designed for each exon trap (above).

The oligonucleotides designed from the exons (Table II), were then used in one or more of the following positive selection formats to screen the corresponding tissue-specific cDNA library.

For RT-PCR experiments, the first oligonucleotide was used as a sense primer and the second oligonucleotide was used as an antisense primer. RT-PCR was performed as described using polyA* RNA from adult brain and placenta (Kawasaki, In PCR Protocols: A Guide to Methods and Applications, Eds. Innis et al., Academic Press, San Diego, CA, pp. 21-27, 1990). All PCR products were cloned using the pGEM-T vector as described by the manufacturer (Promega, Madison, WI).

To clone sequences 3' to selected exon traps, rapid amplification of cDNA ends (RACE) was performed as described (Frohman, PCR Met. Appl. 4:S40-S58, 1994). In 3' RACE experiments, the first oligonucleotide was used as the external primer and the second oligonucleotide was used as the internal primer.

o For the Genetrapper cDNA Positive Selection System, the first oligonucleotide primer was biotinylated and used for direct selection, while the second oligonucleotide was used in the repair.

0* Example VI: Nucleotide Sequence Analysis 25 hNET: The method of exon trapping was used to recover two exon traps from the P1 clone 53.8B with significant homology to the chicken netrin-1 and netrin-2 (62.5%) genes (Serafini et al. supra.) (Figure Positive colonies were selected for sequencing by hybridization of a 6.0 kb BamHI fragment containing the two exon traps.

The resulting contig contains 6803 bp of contiguous genomic sequence (Figure This genomic sequence was then analyzed using the GRAIL2 program which identifies potential exons in addition to CpG islands and polyadenylation sites. GRAIL2 predicted five exons which had the most significant homology to the chicken netrin-2 gene. Based on this analysis and with the use of Pustell DNA and protein matrices, a gene model was constructed The gene model predicts an open reading frame of 1743 bp.

hNET cDNA has a 5' start methionine codon, a 3' stop codon (TGA) and is 581 amino acids in length (Figure Based on this model, the gene contains 5 exons which comprise 2404 bp of genomic sequence. RT-PCR primers were designed (Table IV) from this model and used to amplify spinal cord RNA since spinal cord has been previously shown to express low levels of chicken netrin (Serafini et al supra.) Nested PCR was required to detect RT-PCR products from human spinal cord RNA. Spinal cord RNA was reverse transcribed with random primers and primary PCR was performed using primers designed from the gene model (Table S* IV). The primary PCR product was then used as template for secondary PCR with a pair of nested primers (also designed 99* from the gene model). The inclusion of betaine in the PCR reactions dramatically increased the purity and yield of '0 the human netrin RT-PCR products (see, for example, International Publication No. WO 96/12041).

9 q 9. 9 oooo Sequence analysis of the RT-PCR products indicated that hNET contains at least six exons. The RT- PCR data indicate that the f-ourth predicted exon is actually split by an. intron in the human netrin gene and is present as two exons. Three of the RT-PCR exons were shown to be identical to the original exon traps. Aside from the S extra exon, the gene model is nearly identical to the RT- PCR products. The cDNA sequence and its predicted protein product are shown in Figure 3. The final 207 bp of the coding region are predicted from the gene model.

hABC3: A total of 5.8 kb of cDNA sequence (Figure 8) has been assembled for hABC3, using RT-PCR and cDNA library screening using oligonucleotide primers (Tables II and III). A 147 bp insertion is present in the RT-PCR product relative to the cDNA. This insertion does not disrupt the open reading frame and presumably results from alternative splicing of an exon. The assembled cDNA contains a 792 bp 3' untranslated region with a consensus polyadenylation cleavage site 20 bp upstream of the polyA tail.

SEM L3: The longest cDNA is 1548 nucleotides in length (Figure 11). All three cDNAs have an open reading frame (ORF) of 1224 nucleotide with the longest cDNA containing a 48 nucleotide 5' untranslated region. An inframe stop codon at position 7 is followed by the Kozak initiation sequence CCACCATGT (Kozak, J. Cell Biol. 115:887-903, 1991). The 3' UTR for each of the three cDNAs vary in length, and lacks a consensus polyadenylation cleavage site.

The longest cDNA was compared to the human, bovine and murine ribosomal L3 genes. At the nucleotide level there is only 74% identity between the SEM L3 cDNA and the consensus from these other ribosomal L3 cDNAs.

This is in sharp contrast to the 89% identity shared between human, bovine, and murine L3 nucleotide secuences.

There is no similarity between the 3' UTR of the cDNAs isolated here and the other L3 genes.

hALR: Sequences were cloned from the human ALR gene by 3' RACE using primers external TGGCCCAGTTCATACATTTA-3' and internal 5'-TTACCCCTGTGAGGAGTGTG-3') designed from the exon trap. A total of 468 bp have been obtained from the human ALR gene (Figure 13).

Example VII: Amino Acid Sequence Analysis hNET: hNET cDNA has at least 210 bp of 5' untranslated sequence and is 66% homologous to the chicken netrin-2 cDNA. The predicted protein product has 58.6% identity and 73.4% similarity to chicken netrin-2 (Figure The conserved N-terminal laminin domains found between chicken netrin-1 and netrin-2, C. elegans UNC-6 and the murine laminin protein B2 are also conserved in the human homologue (67.3% identity to chicken netrin-2 for domains

V

and VI; Figure Domain V contains three epidermal growth factor repeats and is 81.1% identical between the human homologue and chicken netrin-2. Domain C has the characteristic basic residue bias with an isoelectric point of 12.26 and has a partial RGD domain for potential adhesion/signaling receptors (Figure hABC3: hABC3 cDNA contains an open reading frame of 1684 amino acids (Figure Comparison of ABC1, ABC2 and hABC3 reveals significant conservation in the regions surrounding the two ATP binding cassettes. As is the case with ABC1 S and ABC2, the ATP binding cassettes of hABC3 flank a large "9 linker domain containing numerous polar residues. The presence of these features in the linker domain suggests :20 that this domain may play a regulatory role similar to the R-domain of CFTR (the ABC transporter responsible for oo. cystic fibrosis).

S* 4. •9 SEM L3: The SEM L3 cDNA open reading frame predicts a 407 amino acid polypeptide of 46.3 kD (Figure 11). In vitro transcription translation of SEM L3 cDNA resulted in a protein product with an apparent molecular weight of 46 kD which is in close agreement with the predicted weight of S 46.3 kD.

Two nuclear targeting sequences, which are 100% conserved between man, mouse and cow, diverged slightly in the SEM L3 amino acid sequence. The first targeting site is the 21 amino acid N-terminal oligopeptide. The serine and arginine present at positions 13 and 19 respectively, in human, bovine and murine L3 are replaced with histidines in SEM L3 (Figure 12). The second potential nuclear targeting site is the bipartite motif. Here the human, bovine and murine proteins have a KKR-(aa) 12 -KRR at position 341-358 while the SEM L3 gene has KKR-(aa) 1 0

-HHSRQ

at position 341-358. The second half of this bipartite motif, while remaining basic, does not match those found in other nuclear targeting motifs (Simonic et al., supra.

1994). Overall, there is 77.2% amino acid identity between the SEM L3 and the consensus from the other mammalian L3 ribosomal genes, with 56% of the nucleotide differences between SEM L3 and the human L3 being silent.

hALR: hALR cDNA sequences encode a 119 amino acid protein which is 84.8% identical and 94.1% similar to the rat ALR protein (see, Figures 13 and 14).

Although the invention has been described with reference to the disclosed embodiments, it should be *.understood that various modifications can be made without departing from the spirit of the invention. Accordingly, the invention is limited only by the following claims.

*00.

0*

Claims

1. Isolated nucleic acid encoding human ATPase binding cassette transporter (hABC3) or its complement.

2. Isolated nucleic acid according to claim 1, wherein said nucleic acid is mRNA.

3. Isolated nucleic acid according to claim 1, wherein said nucleic acid is DNA comprising the sequence set forth in Figure 8.

4. Isolated nucleic acid that hybridizes under stringent conditions to the nucleic acid of claim 1.

5. Isolated nucleic acid according to claim 4, comprising the sequence: -GACGCTGGTGAAGGAGC-3'.

6. Isolated nucleic acid according to claim 4, comprising the sequence: -TCGCTGACCGCCAGGAT-3'.

7. Isolated nucleic acid according to claim 4, comprising the sequence: 5' -CATTGCCCGTGCTGTCGTG-3'.

8. Isolated nucleic acid according to claim 4, comprising the sequence: 5' -CATCGCCGCCTCCTTCATG-3'.

9. Isolated nucleic acid according to claim 4, comprising the sequence: 5' -GCGGAGCCACCTTCATCA-3'. 20

10. Isolated nucleic acid according to claim 4, comprising the sequence: 5' -GACGCTGGTGAAGGAGC-3'.

11. Isolated nucleic acid according to claim 4, comprising the sequence: -ATCCTGGCGGTCAGCGA-3'.

12. Isolated nucleic acid according to claim 4, comprising the sequence: 25 5' -AGGGATTCGACATTGCC-3'.

13. Isolated nucleic acid according to claim 4, comprising the sequence: -CTTCAGAGACTCAGGGGCAT-3'.

14. An antisense oligonucleotide that specifically binds to and modulates translation of mRNA according to claim 2.

15. Isolated human ATPase binding cassette transporter (hABC3) and biologically active fragments thereof.

16. Isolated hABC3 according to claim 15 comprising the amino acid sequence set forth in Figure 8.

17. A vector comprising the isolated nucleic acid of claim 1.

18. A host cell comprising the vector of claim 17.

19. A method for producing human ATPase binding cassette transporter (hABC3), said method comprising: culturing the host cell of claim 18 in a medium and under conditions suitable for expression of said protein, and isolating said expressed protein.

An antibody that specifically binds to human ATPase binding cassette transporter (hABC3).

21. The antibody according to claim 20 being a monoclonal antibody.

22. A composition comprising an amount of the oligonucleotide according to claim 14, effective to modulate expression of hABC3 by passing through a cell membrane and binding specifically with nRNA encoding hABC3 in the cell so as to prevent its translation and an acceptable hydrophobic carrier capable of passing through a cell membrane.

23. A composition comprising an amount of the antibody according to claim 20, effective to block binding of naturally occurring ligands to hABC3 and an acceptable carrier.

24. A transgenic non-human mammal expressing DNA encoding human ATPase binding cassette tranpsorter (hABC3). A method for identifying compounds which bind to human ATPase binding cassette transporter (hABC3),. said method comprising a competitive binding assay wherein the cells according to claim 18 are exposed to a plurality of compounds and identifying compounds which bind thereto. Dated this second day of November 1999 "GENZYME CORPORATION Patent Attorneys for the Applicant: F B RICE CO