EP1075531A1 - Genomic sequences upstream of the coding region of the ifn-alpha2 gene for protein production and delivery - Google Patents

Genomic sequences upstream of the coding region of the ifn-alpha2 gene for protein production and delivery

Info

Publication number
EP1075531A1
EP1075531A1 EP99920375A EP99920375A EP1075531A1 EP 1075531 A1 EP1075531 A1 EP 1075531A1 EP 99920375 A EP99920375 A EP 99920375A EP 99920375 A EP99920375 A EP 99920375A EP 1075531 A1 EP1075531 A1 EP 1075531A1
Authority
EP
European Patent Office
Prior art keywords
cell
ifna2
seq
nucleic acid
isolated nucleic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP99920375A
Other languages
German (de)
French (fr)
Inventor
Douglas A. Treco
Michael W. Heartlein
Richard F. Selden
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shire Human Genetics Therapies Inc
Original Assignee
Transkaryotic Therapies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Transkaryotic Therapies Inc filed Critical Transkaryotic Therapies Inc
Publication of EP1075531A1 publication Critical patent/EP1075531A1/en
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/52Cytokines; Lymphokines; Interferons
    • C07K14/555Interferons [IFN]
    • C07K14/56IFN-alpha
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/12Antivirals
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P35/00Antineoplastic agents
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/67General methods for enhancing the expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • C12N15/90Stable introduction of foreign DNA into chromosome
    • C12N15/902Stable introduction of foreign DNA into chromosome using homologous recombination
    • C12N15/907Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2840/00Vectors comprising a special translation-regulating system
    • C12N2840/44Vectors comprising a special translation-regulating system being a specific part of the splice mechanism, e.g. donor, acceptor

Definitions

  • This invention relates to genomic DNA.
  • Gene therapy methods involve administering to a patient cells, plasmids," or viruses that contain a sequence encoding the therapeutic protein of interest.
  • Certain therapeutic proteins may also be produced by altering the expression of their endogenous genes in a desired manner with gene targeting techniques. See, e.g., U.S. Patent Nos. 5,641,670, 5,733,761, and 5,272,071, U.S. Patent Application Serial No. 08/406,030,
  • the present invention is based upon the identification and sequencing of genomic DNA 5' to the
  • IFNA2 human interferon- ⁇ 2
  • This DNA can be used, for example, in a DNA construct that alters (e.g., increases) expression of an endogenous IFNA2 gene in a mammalian cell upon integration into the genome of the cell via homologous
  • Endogenous IFNA2 gene refers to a genomic (i.e., chromosomal) gene that encodes IFNA2.
  • the construct contains a targeting sequence including or derived from the newly disclosed 5' noncoding sequence, and a transcriptional regulatory sequence.
  • the transcriptional regulatory sequence preferably differs in sequence from the transcriptional regulatory sequence of the endogenous IFNA2 gene.
  • the targeting sequence directs the integration of the regulatory sequence into a region upstream of the endogenous IFNA2 -coding sequence such that the regulatory sequence becomes operatively linked to the endogenous coding sequence.
  • operatively linked is meant that the regulatory sequence can direct expression of the endogenous IFNA2- coding sequence.
  • the construct may additionally contain a selectable marker gene to facilitate selection of cells that have stably integrated the construct, and/or another coding sequence linked to a promoter.
  • the DNA construct comprises: (a) a targeting sequence, (b) a regulatory sequence, (c) an exon, (d) a splice-donor site, (e) an intron, and (f) a splice-acceptor site, wherein the targeting sequence directs the integration of itself and elements (b) - (f) such that elements (b) - (f) are within or upstream of the endogenous gene .
  • the regulatory sequence then directs production of a transcript that includes not only elements (c) - (f) , but also the endogenous IFNA2 coding sequence.
  • the intron and the splice-acceptor site are situated in the construct downstream from the splice-donor site.
  • the targeting sequence is homologous to a preselected target site in the genome with which homologous recombination is to occur. It contains at least 20 (e.g., at least 30, 50, 100, or 1000) contiguous nucleotides of SEQ ID NO: 12; and can contain, for instance, at least 20 (e.g., at least 30, 50, or 100) contiguous nucleotides of SEQ ID NO:7, at least 20 (e.g., at least 30 or 50) contiguous nucleotides of SEQ ID NO:8, or at least 20 (e.g., at least 30, 50, 100, or 1000) contiguous nucleotides of SEQ ID NO: 13.
  • the targeting sequence can contain at least 20 (e.g., at least 30, 50, or 100) contiguous nucleotides of SEQ ID NO: 16, at least 20 contiguous nucleotides of SEQ ID NO:17, at least 20 (e.g., at least 30 or 50) contiguous nucleotides of SEQ ID NO: 18, or at least 20 (e.g., at least 30, 50, 100, or 1000) contiguous nucleotides of SEQ ID NO: 19.
  • at least 20 e.g., at least 30, 50, or 100
  • at least 20 e.g., at least 30, 50, or 100
  • SEQ ID NO : 7 corresponds to nucleotides 1 to 278 of SEQ ID NO: 12;
  • SEQ ID NO : 8 corresponds to nucleotides 3492 to 3564 of SEQ ID NO: 12; and
  • SEQ ID NO: 13 corresponds to nucleotides 279 to 3491 of SEQ ID NO: 12.
  • homologous is meant that the targeting sequence is identical or sufficiently similar to its genomic target site so that the targeting sequence and target site can undergo homologous recombination within a human cell . A small percentage of basepair mismatches is acceptable, as long as homologous recombination can occur at a useful frequency.
  • the targeting sequence is preferably at least about 20 (e.g., at least 50, 100, 250, 400, or 1,000) base pairs ("bp") long.
  • the targeting sequence can also include genomic sequences from outside the region covered by SEQ ID NO: 12, so long as it includes as least 20 nucleotides from within this region.
  • additional targeting sequence could be derived from the sequence lying between SEQ ID NO : 8 and the endogenous transcription initiation sequence of the IFNA2 gene. Due to polymorphism that exists at the IFNA2 genetic locus, minor variations in the nucleotide composition of any given genomic target site may occur in any given mammalian species. Targeting sequences that correspond to such polymorphic variants of SEQ ID NOs : 7 , 8, 12, 13, 16, 17, 18, and 19 (particularly human polymorphic variants) are within the scope of this invention.
  • the regulatory sequence of the construct is integrated into a pre- selected region upstream of the coding sequence of an IFNA2 gene in a chromosome of a cell.
  • the resulting new transcription unit containing the construct-derived regulatory sequence alters the expression of the target IFNA2 gene.
  • the IFNA2 protein so produced may be identical in sequence to the IFNA2 protein encoded by the unaltered, endogenous gene, or may contain additional, substituted, or fewer amino acid residues as compared to the wild type IFNA2 protein, due to changes introduced as a result of homologous recombination.
  • Altering gene expression encompasses activating (or causing to be expressed) a gene which is normally silent (i.e, essentially unexpressed) in the cell as obtained, increasing or decreasing the expression level of a gene, and changing the regulation pattern of a gene such that the pattern is different from that in the cell as obtained.
  • Cell as obtained refers to the cell prior to homologous recombination.
  • Also within the scope of the invention is a method of using the present DNA construct to alter expression of an endogenous IFNA2 gene in a mammalian cell.
  • This method includes the steps of (i) introducing the DNA construct into the mammalian cell, (ii) maintaining the cell under conditions that permit homologous recombination to occur between the construct and a genomic target site homologous to the targeting sequence, to produce a homologously recombinant cell; and (iii) maintaining the homologously recombinant cell under conditions that permit expression of the IFNA2 coding sequence under the control of the construct-derived regulatory sequence. At least a part of the genomic target site is 5' to the coding sequence of an endogenous IFNA2 gene.
  • the genomic target site can contain coding sequence as well as 5' non-coding sequence.
  • the invention also features transfected or infected cells in which the construct has undergone homologous recombination with genomic DNA upstream of the endogenous ATG initiation codon in one or both alleles of the endogenous IFNA2 gene.
  • Such transfected or infected cells also called homologously recombinant cells, have an altered IFNA2 expression pattern. These cells are particularly useful for in vi tro IFNA2 production and for delivering IFNA2 via gene therapy. Methods of making and using such cells are also embraced by the invention.
  • the cells can be of vertebrate origin such as mammalian (e.g., human, non-human primate, cow, pig, horse, goat, sheep, cat, dog, rabbit, mouse, guinea pig, hamster, or rat) origin.
  • mammalian e.g., human, non-human primate, cow, pig, horse, goat, sheep, cat, dog, rabbit, mouse, guinea pig, hamster, or rat
  • the invention further relates to a method of producing a mammalian IFNA2 protein in vi tro or in vivo by introducing the above-described construct into the genome of a host cell via homologous recombination. The homologously recombinant cell is then maintained under conditions that allow transcription, translation, and optionally, secretion of the IFNA2 protein.
  • the invention also features isolated nucleic acids comprising a sequence of at least 20 (e.g., at least 30, 50, 100, 200, or 1000) contiguous nucleotides of SEQ ID NO: 12 or its complement, or of a sequence identical to SEQ ID NO: 12 except for polymorphic variations or other minor variations (e.g., less than 5% of the sequence) which do not prevent homologous recombination with the target sequence.
  • isolated nucleic acids comprising a sequence of at least 20 (e.g., at least 30, 50, 100, 200, or 1000) contiguous nucleotides of SEQ ID NO: 12 or its complement, or of a sequence identical to SEQ ID NO: 12 except for polymorphic variations or other minor variations (e.g., less than 5% of the sequence) which do not prevent homologous recombination with the target sequence.
  • the isolated DNA can contain at least 20 (e.g., at least 30, 50, or 100) contiguous nucleotides of SEQ ID NO: 7 or its complement, at least 20 (e.g., at least 30 or 50) contiguous nucleotides of SEQ ID NO: 8 or its complement, at least 20 (e.g., at least 30, 50, 100, or 1000) contiguous nucleotides of SEQ ID NO: 13 or its complement, at least 20 (e.g., at least 30, 50, or 100) contiguous nucleotides of SEQ ID NO: 16 or its complement, at least 20 contiguous nucleotides of SEQ ID NO: 17 or its complement, at least 20 (e.g., at least 30 or 50) contiguous nucleotides of SEQ ID NO:18 or its complement, or at least 20 (e.g., at least 30, 50, 100, or 1000) contiguous nucleotides of SEQ ID NO: 19 or its complement.
  • at least 20 e.g.
  • the isolated nucleic acid of the invention includes a contiguous 100 bp block of SEQ ID NO: 12.
  • the isolated DNA can contain nucleotides 1 to 100, 101 to 200, 201 to 300, 301 to 400, 401 to 500, 501 to 600, 601 to 700, 701 to 800, 801 to 900, 901 to 1000, 1001 to 1100, 1101 to 1200, 1201 to 1300, 1301 to 1400, 1401 to 1500, 1501 to 1600, 1601 to 1700, 1701 to 1800, 1801 to 1900, 1901 to 2000, 2001 to 2100, 2101 to 2200, 2201 to 2300, 2301 to 2400, 2401 to 2500, 2501 to 2600, 2601 to 2700, 2701 to 2800, 2801 to 2900, 2901 to 3000, 3001 to 3100, 3101 to 3200, 3201 to 3300, 3301 to 3400, 3401 to 3500, or 3465 to 3564 of SEQ ID NO: 12 or its complement.
  • the SEQ ID NO: 12 -derived sequence is not linked to a sequence encoding intact IFNA2, or at least is not linked in the same configuration (i.e., separated by the same noncoding sequence) as occurs in any wild-type genome.
  • isolated DNA thus does not denote a chromosome or a large piece of genomic DNA (as might be incorporated into a cosmid or yeast artificial chromosome) that includes not only part or all of SEQ ID NO: 12, but also an intact IFNA2 -coding sequence and all of the sequence which lies between the IFNA2 coding sequence and the sequence corresponding to SEQ ID NO: 12 as it exists in the genome of a cell.
  • a DNA which is incorporated into a plasmid or virus; or (ii) which exists as a separate molecule independent of other sequences, e.g., a fragment produced by polymerase chain reaction ("PCR") or restriction endonuclease treatment .
  • the isolated DNA preferably does not contain a sequence which encodes intact IFNA2 precursor (i.e., IFNA2 complete with its endogenous secretion signal peptide) .
  • the invention also includes isolated DNA comprising a strand which contains a sequence that is at least 100 (e.g., at least 200, 400, or 1000) nucleotides in length and that hybridizes ' under either moderately stringent or highly stringent conditions with SEQ ID NO: 7, 8, 12, 13, 16, 17, 18, and/or 19, or the complement of SEQ ID NO: 7, 8, 12, 13, 16, 17, 18, and/or 19.
  • the sequence is not linked to an IFNA2-coding sequence, or at least is not linked in the same configuration as occurs in any wild-type genome.
  • moderately stringent conditions is meant hybridization at 50°C in Church buffer (7% SDS, 0.5% NaHP0 4 , 1 M EDTA, 1% bovine serum albumin) and washing at 50°C in 2X SSC.
  • Highly stringent conditions are defined as: hybridization at 42°C in the presence of 50% formamide; a first wash at 65°C with 2X SSC containing 1% SDS; followed by a second wash at 65°C with 0.1X SSC.
  • isolated DNA comprising a strand which contains a sequence that (1) is at least 50 (e.g., at least 70 or 100) nucleotides in length and (2) shares at least 80% (e.g., at least 85%, 90%, 95%, or 98%) sequence identity with a fragment or all of SEQ ID NO: 12, or with the complement of the fragment.
  • This fragment can include, for instance, a part or all of SEQ ID NO : 7 , 8, 13, 16, 17, 18, or 19.
  • the sequence is not linked to an intact IFNA2 -coding sequence, or at least is not linked in the same configuration as occurs in any wild-type genome.
  • polypeptide or nucleic acid molecule is said to have a specific percent identity or conservation to a reference polypeptide or nucleic acid molecule, the percent identity or conservation is determined by the algorithm of Myers and Miller, CABIOS (1989) , which is embodied in the ALIGN program (version 2.0), or its equivalent, using a gap length penalty of 12 and a gap penalty of 4 where such parameters are required. All other parameters are set to their default positions. Access to ALIGN is readily available. See, e.g., http: //www2. igh.cnrs . fr ' /bin/align-guess . cgi on the Internet .
  • the invention also features a method of delivering IFNA2 to an animal (e.g., a mammal such as a human, non- human primate, cow, pig, horse, goat, sheep, cat, dog, rabbit, mouse, guinea pig, hamster, or rat) by providing a cell whose endogenous IFNA2 gene has been activated as described herein, and implanting the cell in the animal, where the cell secretes IFNA2. Also included in the invention is a method of producing IFNA2 by providing a cell whose endogenous IFNA2 gene has been activated as described herein, and culturing the cell in vi tro under conditions which permit the cell to express and secrete IFNA2.
  • an animal e.g., a mammal such as a human, non- human primate, cow, pig, horse, goat, sheep, cat, dog, rabbit, mouse, guinea pig, hamster, or rat
  • an animal e.g., a ma
  • the invention further includes isolated DNA that shares at least 80% (e.g., at least 85%, 90%, or 95%) sequence identity, or hybridizes under highly or moderately stringent conditions, with a portion (e.g., at least about 20, 50, 100, 400, or 1000 bp in length) of the Hindlll-BamHI insert of plasmid pA2HB (described below) .
  • the 3' end of this portion of the insert is at least 511 bp upstream of the ATG translation initiation codon of the IFNA2 -coding sequence included in the plasmid insert .
  • the isolated DNA of the invention can be used, for example, as a source of an upstream PCR primer for use (when combined with a suitable downstream primer) in obtaining the regulatory region and/or complete coding sequence of an endogenous IFNA2 gene, or as a hybridization probe for indicating the presence of chromosome 9 in a preparation of human chromosomes. It can also be used, as described below, in a method for altering the expression of an endogenous IFNA2 gene in a vertebrate cell.
  • Fig. 1 is a representation of the published sequence (SEQ ID NO:l) of a human IFNA2-coding sequence and some flanking 5' and 3' non-coding sequences (GenBank HUMIFNAA) . Sequences of PCR primers IFN1, IFN2, IFN6, and IFN7 are indicated by arrows.
  • Fig. 2 is a schematic diagram showing the human IFNA2 genomic region encompassed by the insert of plasmid pA2HB .
  • Fig. 3 is a representation of the nucleotide sequence (SEQ ID NO: 7) of a region upstream of the coding sequence of a human IFNA2 gene. This nucleotide sequence has not been reported previously.
  • Fig. 4 is a representation of a sequence (SEQ ID NO: 9) of a human IFNA2-coding sequence and some flanking 5' and 3' non-coding sequences. The underlined sequence (SEQ ID NO: 8) has not been previously reported.
  • the polypeptide sequence (SEQ ID NO: 2) encoded by this gene is also shown. The N-terminus of the mature polypeptide is indicated by "Mature.”
  • Fig. 5 is a schematic diagram showing a construct of the invention.
  • the construct contains a first targeting sequence (1) ; an amplifiable marker gene (AM) ; a selectable marker gene (SM) ; a regulatory sequence; a CAP site; an exon; a splice-donor site (SD) ; an intron; a splice-acceptor site (SA) ; and a second targeting sequence (2) .
  • the black boxes represent coding DNA and the stippled boxes represent transcribed but untranslated sequences.
  • Fig. 6 is a representation of a sequence (SEQ ID NO: 11) of a human genomic sequence 5' to the IFNA2 coding sequence, and including some coding sequence.
  • the underlined sequence is previously reported while the remainder (-4074 to -511; SEQ ID NO: 12) is new.
  • the framed sequence is SEQ ID NO: 13.
  • the sequence 5' to SEQ ID NO: 13 is SEQ ID NO : 7.
  • the sequence between the framed area and the underlined sequence is SEQ ID NO: 8.
  • Nucleotides -4074 to -3270 is SEQ ID NO: 16.
  • Nucleotides -3267 to -3239 is SEQ ID NO: 17.
  • Nucleotides -3241 to - 3137 is SEQ ID NO: 18.
  • Nucleotides -3139 to -511 is SEQ ID NO: 19.
  • Fig. 7 is a representation of a first targeting sequence (SEQ ID NO: 14) used in a construct of the invention.
  • Fig. 8 is a representation of a second targeting sequence (SEQ ID NO: 15) used in a construct of the invention.
  • the present invention is based on the discovery of the nucleotide composition of sequences upstream to the coding sequence of a human IFNA2 gene.
  • Interferon- ⁇ constitutes a complex gene family with 14 genes clustered on the short arm of chromosome 9. None of these genes, including the IFNA2 gene, have introns. Interferon- ⁇ is produced by macrophages, T and B cells, and a variety of many other cells. Interferon- ⁇ has considerable antiviral effects, and has been shown to be efficacious in treating infections by papilloma virus, hepatitis B and C viruses, vaccinia, herpes simplex virus, herpes zoster varicellosus virus, and rhinovirus .
  • the human IFNA2 gene encodes a 188 amino acid precursor protein (SEQ ID NO:2) containing a 23 amino acid signal peptide.
  • the genomic map of the human IFNA2 gene is shown in Fig. 1. The map is constructed based on 1,733 base pair ("bp") published sequences (HUMIFNAA, GenBank accession number J00207 and V00544; SEQ ID NO:l) which begin at position -510 relative to the translational start site (unless otherwise specified, all positions referred to herein are relative to the translational start site), and end at position +1,223.
  • the cap site is located at position -67.
  • a human leukocyte genomic library in lambda EMBL3 (Clontech catalog # HL1006d) was screened with a 332 bp probe generated by PCR.
  • This probe corresponds to the genomic region between positions -263 and +69, and was amplified from human genomic DNA using oligonucleotide primers designated IFN7 and IFN6, both of which were designed from the available IFNA2 genomic DNA sequence (Fig. 1) .
  • the 5' end of primer IFN7 corresponds to position -263, and the primer's sequence is 5' -AGTTTCTAAAAAGGCTCTGGGGTA-3' (SEQ ID NO : 3 ) .
  • the 5' end of primer IFN6 corresponds to position +69, and this primer sequence is 5 ' -GCCCACAGAGCAGCTTGAC-3 ' (SEQ ID NO:4) .
  • IFN1 and IFN2 are derived from the 3 ' untranslated region of the IFNA2 gene; their sequences can be found at the website "http://www.ncbi.nlm.nih.gov/dbSTS,” using the identification code "NCBI_ID : 42433.
  • primer IFN1 corresponds to position +639, and the primer's sequence is 5' -AAAGACTCATGTTTCTGCTATGACC-3 ' (SEQ ID NO: 5) .
  • primer IFN2 corresponds to position +853, and the primer's sequence is 5'-
  • the pA2HB plasmid was sequenced by the Sanger method.
  • a 278 bp sequence whose 5' terminus is at the 5' end Hindlll site is shown below (see also Fig. 3) :
  • Hindlll site is located at position -4,073.
  • a previously unpublished sequence between positions -583 and -511 was also determined, as shown below and as underlined in Fig. 4.
  • SEQ ID NO: 13 The sequence (SEQ ID NO: 13) between the regions corresponding to SEQ ID NOs : 7 and 8 was also determined.
  • SEQ ID NO: 12 is the sequence which is not underlined in Fig. 6.
  • SEQ ID NOs : 7 and 8 correspond to nucleotides 1-278 and nucleotides 3492-3564 of SEQ ID NO: 12, respectively.
  • nucleotides 279-3311 of SEQ ID NO: 12 was cloned into a plasmid to produce targeting construct pGA402.
  • Nucleotides 279-3311 of SEQ ID NO: 12 was designated SEQ ID NO: 14.
  • the fragment was inserted upstream of a CMV promoter and a neomycin resistance gene and is schematically represented in Fig. 5.
  • a DNA fragment containing nucleotides -68 to 69 of the IFNA2 gene sequence shown in Fig. 1 was cloned downstream of the CMV promoter and neomycin resistance gene.
  • Nucleotides -68 to 69 of the IFNA2 gene was designated SEQ ID NO: 15.
  • the pGA402 plasmid was introduced into human fibroblast cells exhibiting little or no INFA2 gene expression, to allow homologous recombination with the endogenous INFA2 gene.
  • Cells resistant to G418 after plasmid introduction were screened to identify cells with increased INFA2 gene expression, as would be expected if a homologous recombination event between pGA402 and the genomic DNA took place in the vicinity of the endogenous INFA2 gene.
  • IFNA2 upstream sequences one can alter the expression of an endogenous human IFNA2 gene by a method as generally described in
  • a targeting construct is designed to include a first targeting sequence homologous to a first target site upstream of the gene, an amplifiable marker gene, a selectable marker gene, a regulatory region, a CAP site, an exon, a splice-donor site, an intron, a splice- acceptor site, and a second targeting sequence homologous to a second target site downstream of the first target site and terminating either within or upstream of the
  • the 5' end of the second target site is preferably less than 107 bp upstream of the normal IFNA2 translational initiation site, in order to avoid undesired ATG start codons within the transcribed sequence.
  • a transcript produced from the homologously recombined locus will include the construct-derived exon, the construct-derived splice-donor site, the construct-derived intron, the construct-derived splice-acceptor site, any sequence between any of those elements, and the sequence from the construct derived splice acceptor site through the entire endogenous coding sequence to the transcription termination site of the IFNA2 gene.
  • Splicing of this transcript will generate a mRNA which can be translated to produce a precursor of human IFNA2, having either the normal IFNA2 secretion signal sequence or a genetically engineered secretion signal sequence, depending on the characteristics of the construct-derived exon.
  • the size of the exogenous intron and thus the position of the exogenous regulatory region relative to the coding region of the gene can be varied to optimize the function of the regulatory region.
  • the first and second target sites need not be immediately adjacent or even be near each other. When they are not immediately adjacent to each other, a portion of the IFNA2 gene's normal upstream region and/or a portion of the coding region would be deleted upon homologous recombination. Mutations that facilitate alteration of endogenous IFNA2 expression may be introduced into the chromosomal DNA via homologous recombination. For instance, it may be desirable to abolish a spurious and undesired ATG initiation codon upstream of the correct ATG initiation codon and between the exogenous regulatory region and the endogenous IFNA2 coding region in the homologously recombined locus.
  • a targeting construct having a targeting sequence homologous to a genomic site that spans the undesired ATG initiation codon.
  • This targeting sequence contains nucleotides that correspond to the desired mutation, e.g., contains ATT instead of ATG.
  • the targeting construct optionally includes one or more selectable markers to facilitate selection of homologously recombined cells.
  • An exogenous regulatory region can then be introduced to the homologously recombined cells upstream of the altered sites, using the expression alteration method of the invention.
  • the exogenous regulatory region and the desired sequence mutation (s) may be introduced into the genomic DNA in a single step.
  • the DNA construct used in this embodiment may contain both the exogenous regulatory region and a targeting sequence that contains nucleotides corresponding to the desired mutation (s).
  • One may also co-transfect or .co-infect two separate constructs into target cells, with one construct containing the regulatory region and the other containing nucleotides corresponding to the desired mutation.
  • a mammalian splice-acceptor site may be introduced into the genomic DNA, e.g., at a site between an undesired ATG initiation codon and the correct ATG initiation codon, in a similar manner.
  • the DNA construct used for this purpose contains a targeting sequence homologous to a genomic site upstream of the correct INFA2 initiation codon, and adjacent to or embedded within the homologous sequence, a sequence corresponding to the desired splice-acceptor site.
  • Cells containing the correctly recombined IFNA2 locus are then transfected or infected with a second construct containing an exogenous regulatory region and an exon with an unpaired splice-donor site at its 3' end, together with targeting sequence (s) which target the second construct to a genomic region upstream of the inserted splice-acceptor site.
  • a primary transcript produced under the control of the exogenous regulatory region will include the exogenous exon, the exogenous splice-donor site, the exogenous splice-acceptor site, any sequences between those elements, and the sequence between the exogenous splice-acceptor site and the transcriptional termination site of the endogenous IFNA2 gene.
  • the splice-donor site of the transcript Upon splicing, the splice-donor site of the transcript will be spliced to the splice-acceptor site, and the intervening intronic RNA, which may contain undesirable AUG initiation codons, will be removed. Any problems associated with having a transcript with undesired AUG translation initiation codons between the transcription start site and the IFNA2 -coding sequence are thereby avoided.
  • the regulatory region, exon, splice donor site, and splice-acceptor site can instead be introduced in a single step.
  • the DNA construct used in this embodiment contains a regulatory region, an exon, a splice-donor site, an intron, a splice-acceptor site, a targeting sequence homologous to a genomic site between the correct INFA2 initiation codon and the undesired ATG codon, and optionally, one or more selectable markers.
  • two separate targeting constructs may be useful, with one containing the regulatory region, the exon, and the splice-donor site, and the other containing the splice-acceptor site. The two constructs can be introduced into target cells in a single step.
  • the DNA construct of the invention includes at least a targeting sequence and a regulatory sequence. It can additionally include an exon; or an exon and splice donor site; or an exon, a splice-donor site, an intron, and a splice-acceptor site.
  • the exon if present, is 3 ' of the regulatory sequence
  • the splice-donor site if present, is at the 3' end of the exon.
  • the intron and splice acceptor site are 3' of the splice donor site.
  • exogenous DNA may possess sequences identical to or different from portions of the endogenous genomic DNA present in the cell prior to transfection or infection by viral vector.
  • transfection means introduction of a plasmid into a cell by nonviral (e.g., chemical or physical) means, such as calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, electroporation, microinjection, microprojectiles, or biolistic-mediated uptake.
  • Infection means introduction of a viral vector into a cell by viral infection.
  • the DNA construct can also include cis-acting or trans-acting viral sequences (e.g., packaging signals), thereby enabling delivery of the construct into the nucleus of a cell via infection by a viral vector.
  • the DNA construct can be disengaged from various steps of a virus life cycle, such as integrase- mediated integration in retroviruses or episome maintenance. Disengagement can be accomplished by appropriate deletions or mutations of viral sequences, such as a deletion of the integrase coding region in a retrovirus vector. Additional details regarding the construction and use of viral vectors are found in Robbins et al . , Pharmacol. Ther. 80:35-47, 1998; and Gunzburg et al . , Mol . Med. Today 1:410-417, 1995, herein incorporated by reference. Targeting Sequences
  • Targeting sequences permit homologous recombination of a desired sequence into a selected site in the host genome.
  • Targeting sequences are homologous to (i.e., able to homologously recombine with) their respective target sites in the host genome.
  • a circular DNA construct can employ a single targeting sequence, or two or more separate targeting sequences.
  • a linear DNA construct may contain two or more separate targeting sequences.
  • the target site to which a given targeting sequence is homologous can reside within the coding region of the IFNA2 gene, upstream of and immediately adjacent to the coding region, or upstream of and at a distance from the coding region.
  • the first of the two .targeting sequences in the construct (or the entire targeting sequence, if there is only one targeting sequence in the construct) is derived from the newly disclosed genomic regions upstream of the IFNA2 -coding sequence.
  • This targeting sequence contains a portion (e.g., 20 or more contiguous nucleotides) of SEQ ID NO: 12, e.g., a portion of SEQ ID NO: 7, 8, or 13.
  • the second of the two targeting sequences in the construct may target a genomic region upstream of the coding sequence or target part or all of the coding sequence itself.
  • the second targeting sequence may contain, at its 3' end, an "exogenous" coding region identical to the first few codons of the IFNA2 coding sequences.
  • the exogenous coding region Upon homologous recombination, the exogenous coding region recombines with the targeted part of the endogenous IFNA2-coding sequence.
  • the exogenous coding region may encode a heterologous amino acid sequence, so long as the exogenous coding region remains sufficiently homologous to the endogenous coding region it replaces to permit homologous recombination.
  • the targeting sequence may additionally include sequence derived from a previously disclosed region of the IFNA2 gene, including those described herein, as well as a region further upstream which is structurally uncharacterized but can be mapped by one skilled in the art.
  • Genomic fragments useful as targeting sequences can be identified by their ability to hybridize to a probe containing all or a portion of SEQ ID NO: 12. Such a probe can be generated by PCR using primers derived from SEQ ID NO: 12.
  • the regulatory sequence of the DNA construct can contain one or more promoters (e.g., a constitutive, tissue-specific, or inducible ' promoter) , enhancers, scaffold-attachment regions or matrix attachment sites, negative regulatory elements, transcription factor binding sites, or combinations of these elements.
  • promoters e.g., a constitutive, tissue-specific, or inducible ' promoter
  • enhancers e.g., enhancers, scaffold-attachment regions or matrix attachment sites, negative regulatory elements, transcription factor binding sites, or combinations of these elements.
  • the regulatory sequence can be derived from a eukaryotic (e.g., mammalian) or viral genome.
  • Useful regulatory sequences include, but are not limited to, those that regulate the expression of SV40 early or late genes, cytomegalovirus genes, and adenovirus major late genes.
  • They also include regulatory regions derived from genes encoding mouse metallothionein-I, elongation factor-Ice, collagen (e.g., collagen I ⁇ l, collagen I ⁇ .2, and collagen IV), actin (e.g., ⁇ -actin) , immunoglobulin, HMG-CoA reductase, glyceraldehyde phosphate dehydrogenase, 3-phosphoglyceratekinase, collagenase, stromelysin, fibronectin, vimentin, plasminogen activator inhibitor I, thymosin ⁇ 4 , tissue inhibitors of etalloproteinase, ribosomal proteins, major histocompatibility complex molecules, and human leukocyte antigens .
  • actin e.g., ⁇ -actin
  • the regulatory sequence preferably contains a transcription factor binding site, such as a TATA Box, CCAAT Box, API, Spl, or a NF- ⁇ B binding site.
  • Marker Genes If desired, the construct can include a sequence encoding a desired polypeptide, operatively linked to its own promoter. An example of this would be a selectable marker gene, which can be used to facilitate the identification of a targeting event.
  • An amplifiable marker gene can also be included and used to facilitate selection of cells having co-amplified flanking DNA sequences. Cells containing amplified copies of the amplifiable marker gene can be identified by growth in the presence of an agent that selects for the expression of the amplifiable gene.
  • the activated endogenous IFNA2 gene will be amplified in tandem with the amplified selectable marker gene. Cells containing multiple copies of the activated endogenous gene may produce very high levels of IFNA2 and are thus useful for in vi tro protein production and gene therapy.
  • the selectable and amplifiable marker genes do not have to lie immediately adjacent to each other.
  • the amplifiable marker gene and selectable marker gene can be the same gene .
  • One or both of the marker genes can be situated in the intron of the DNA construct. Suitable amplifiable marker genes and selectable marker genes are described in U.S. Patent No. 5,641,670.
  • the Splice-Donor and Splice-Acceptor Sites may further contain an exon, a splice-donor site at the 3' end of the exon, an intron, and a splice-acceptor site.
  • a splice-donor site is a sequence which directs the splicing of one exon of an RNA transcript to the splice-acceptor site of another exon of the transcript, resulting in removal of the intron between the two sites.
  • the first exon lies 5' of the second exon, and the splice-donor site located at the 3' end of the first exon is paired with a splice-acceptor site flanking the second exon on the 5' side of the second exon.
  • Splice- donor sites have a characteristic consensus sequence represented as (A/C) AGGURAGU (where R denotes a purine nucleotide) , with the GU in the fourth and fifth positions being required (Jackson, Nucleic Acids Research 19: 3715-3798, 1991).
  • the first three bases of the splice-donor consensus site are the last three bases of the exon: i.e., they are not spliced out.
  • Splice-donor sites are functionally defined by their ability to effect the appropriate reaction within the mRNA splicing pathway.
  • a splice acceptor site in a construct of the invention directs, in conjunction with a splice donor site, the splicing of one exon to another exon.
  • Splice- acceptor sites have a characteristic sequence represented as (Y) 10 NYAG (SEQ ID NO:10), where Y denotes any pyrimidine and N denotes any nucleotide (Jackson, Nucleic Acids Research 19:3715-3798, 1991).
  • the CAP Site The DNA construct can optionally contain a CAP site.
  • a CAP site is a specific transcription start site which is associated with and utilized by the regulatory region.
  • This CAP site is located at a position relative to the regulatory sequence in the construct such that following homologous recombination, the regulatory sequence directs synthesis of a transcript that begins at the CAP site.
  • the transcriptional apparatus will locate by default an appropriate site in the targeted gene to be utilized as a CAP site. Additional DNA elements
  • the construct may additionally contain sequences which affect the structure or stability of the RNA or protein produced by homologous recombination.
  • the DNA construct can include a bacterial origin of replication and bacterial antibiotic resistance markers or other selectable markers, which allow for large-scale plasmid propagation in bacteria or any other suitable cloning/host system. All of the above-described elements of the DNA construct are operatively linked or functionally placed with respect to each other. That is, upon homologous recombination between the construct and the targeted genomic DNA, the regulatory sequence can direct the production of a primary RNA transcript which initiates at a CAP site (optionally included in the construct) and includes the sequence lying between the CAP site and the endogenous IFNA2 gene's transcription stop site.
  • a portion of this sequence may include the IFNA2 gene's endogenous regulatory region as well as sequences neighboring that region that are normally not transcribed. If an exon, a splice-donor site and a splice-acceptor site are present in the construct, the primary transcript will also include the exon, the two splice sites, and the intron between the two sites.
  • the order of elements in the DNA construct can vary. Where the construct is a circular plasmid or viral vector, the relative order of elements in the resulting structure can be, for example: a targeting sequence, plasmid DNA (comprised of sequences used for the selection and/or replication of the targeting plasmid in a microbial or other suitable host) , selectable marker(s), a regulatory sequence, an exon, a splice-donor site, an intron, and a splice-acceptor site.
  • the order can be, for example: a first targeting sequence, a selectable marker gene, a regulatory sequence, an exon, a splice- donor site, an intron, a splice-acceptor site, and a second targeting sequence; or, in the alternative, a first targeting sequence, a regulatory sequence, an exon, a splice-donor site, an intron, a splice-acceptor site, a selectable marker gene, optionally an internal ribosomal entry site, and a second targeting sequence.
  • the order can be : a first targeting sequence, a first selectable marker gene, a regulatory sequence, an exon, a splice-donor site, an intron, a splice-acceptor site, a second targeting sequence, and a second selectable marker gene; or, a first targeting sequence, a regulatory sequence, an exon, a splice-donor site, an intron, a splice-acceptor site, a first selectable marker gene, a second targeting sequence, and a second selectable marker gene. Recombination between the targeting sequences flanking the first selectable marker with homologous sequences in the host genome results in the targeted integration of the first selectable marker, while the second selectable marker is not integrated.
  • Desired transfected or infected cells are those that are stably transfected or infected with the first, but not second, selectable marker. Such cells can be selected for by growth in a medium containing an agent which selects for expression of the first marker and another agent which selects against the second marker. Transfected or infected cells that have improperly integrated the targeting construct by a mechanism other than homologous recombination would be expected to express the second marker gene and will thereby be killed in the selection medium.
  • a positively selectable marker is sometimes included in the construct to allow for the selection of cells containing amplified copies of that marker.
  • the order of construct components can be, for example: a first targeting sequence, an amplifiable positively selectable marker, a second selectable marker (optional) , a regulatory sequence, an exon, a splice-donor site, an intron, a splice-acceptor site, and a second targeting DNA sequence.
  • the various elements of the construct can be obtained from natural sources (e.g., genomic DNA), or can be produced using genetic engineering techniques or synthetic processes.
  • the regulatory region, CAP site, splice-donor site, intron, and splice acceptor site of the construct can be isolated as a complete unit from, e.g., the human elongation factor-l ⁇ (Genbank sequence HUMEFIA) gene or the cytomegalovirus (Genbank sequence HEHCMVP1) immediate early region. These components can also be isolated from separate genes.
  • the DNA construct of the invention can be intro- quizged into the cell, such as a primary, secondary, or immortalized cell, as a single DNA construct, or as separate DNA sequences which become incorporated into the chromosomal or nuclear DNA of a transfected or infected cell.
  • transfected cell is meant a cell into which (or into an ancestor of which) a DNA or RNA molecule has been introduced by a means other than using a viral vector.
  • infected cell is meant a cell into which (or into an ancestor of which) a DNA or RNA molecule has been introduced using a viral vector.
  • Viruses known to be useful as vectors include adenovirus, adeno-associated virus, Herpes virus, mumps virus, poliovirus, lentivirus, retroviruses, Sindbis virus, and vaccinia viruses such as canary pox virus.
  • the DNA can be introduced as a linear, double-stranded (with or without single-stranded regions at one or both ends) , single-stranded, or circular molecule.
  • the construct When the construct is introduced into host cells in two separate DNA fragments, the two fragments share DNA sequence homology (overlap) at the 3' end of one fragment and the 5' end of the other, while one carries a first targeting sequence and the other carries a second targeting sequence.
  • the two fragments Upon introduction into a cell, the two fragments can undergo homologous recombination to form a single molecule with the first and second targeting sequences flanking the region of overlap between the two original fragments.
  • the product molecule is then in a form suitable for homologous recombination with the cellular target sites. More than two fragments can be used, with each of them designed such that they will undergo homologous recombination with each other to ultimately form ' a product suitable for homologous recombination with the cellular target sites as described above .
  • the DNA construct of the invention if not containing a selectable marker itself, can be co-transfected or co-infected with another construct that contains such a marker.
  • a targeting plasmid may be cleaved with a restriction enzyme at one or more sites to create a linear or gapped molecule prior to transfection or infection.
  • the resulting free DNA ends increase the frequency of the desired homologous recombination event.
  • the free DNA ends may be treated with an exonuclease to create overhanging 5' or 3 ' single- stranded DNA ends (e.g., at least 30 nucleotides in length, and preferably 100-1000 nucleotides in length) to increase the frequency of the desired homologous recombination event.
  • DNA constructs may be transfected into cells (preferably in vi tro) by a variety of physical or chemical methods, including electroporation, microin- jection, microprojectile bombardment, calcium phosphate precipitation, liposome delivery, or polybrene- or DEAE dextran-mediated transfection.
  • the transfected or infected cell is maintained under conditions which permit homologous recombination, as described in the art (see, e.g., Capecchi, Science 24:1288-1292, 1989).
  • the homologously recombinant cell is maintained under conditions sufficient to permit transcription of the DNA, the regulatory region introduced by the DNA construct will alter transcription of the IFNA2 gene.
  • Homologously recombinant cells i.e., cells that have undergone the desired homologous recombination
  • ELISA enzyme-linked immunosorbent assays
  • Homologously recombinant cells can also be identified by Southern and Northern analyses or by polymerase chain reaction (PCR) screening.
  • primary cells includes (i) cells present in a suspension of cells isolated from a vertebrate tissue source (prior to their being plated, i.e., attached to a tissue culture substrate such as a dish or flask) , (ii) cells present in an explant derived from tissue, (iii) cells plated for the first time, and (iv) cell suspensions derived from these plated cells.
  • Primary cells can also be cells as they naturally occur within a human or an animal .
  • Secondary cells are cells at all subsequent steps in culturing. That is, the first time that plated primary cells are removed from the culture substrate and replated (passaged) , they are referred to herein as secondary cells, as are all cells in subsequent passages. Secondary cell strains consist of secondary cells which have been passaged one or more times. Secondary cells typically exhibit a finite number of mean population doublings in culture and the property of contact -inhibited, anchorage-dependent growth (anchorage-dependence does not apply to cells that are propagated in suspension culture) . Primary and secondary cells are not immortalized
  • Immortalized cells are cell lines (as opposed to cell strains, with the designation "strain” reserved for primary and secondary cells) that exhibit an apparently unlimited lifespan in culture.
  • Cells selected for tr.ansfection or infection can fall into four types or categories: (i) cells which do not, as obtained, make or contain more than trace amounts of the IFNA2 protein, (ii) cells which make or contain the protein but in quantities other than those desired (such as, in quantities less than the level which is physiologically normal for the type of cells as obtained) , (iii) cells which make the protein at a level which is physiologically normal for the type of cells as obtained, but are to be augmented or enhanced in their content or production, and (iv) cells in which it is desirable to change the pattern of regulation or induction of a gene encoding the protein.
  • Primary, secondary and immortalized cells to be transfected or infected by the present method can be obtained from a variety of tissues and include all appropriate cell types which can be maintained in culture.
  • suitable primary and secondary cells include fibroblasts, keratinocytes, epithelial cells (e.g., mammary epithelial cells, intestinal epithelial cells) , endothelial cells, glial cells, neural cells, formed elements of the blood (e.g., lymphocytes, bone marrow cells) , muscle cells, and precursors of these somatic cell types .
  • suitable primary and secondary cells include fibroblasts, keratinocytes, epithelial cells (e.g., mammary epithelial cells, intestinal epithelial cells) , endothelial cells, glial cells, neural cells, formed elements of the blood (e.g., lymphocytes, bone marrow cells) , muscle cells, and precursors of these somatic cell types .
  • primary cells are preferably obtained from the
  • primary cells can be obtained from a donor (i.e., an individual other than the recipient) of the same species.
  • immortalized human cell lines useful for protein production or gene therapy include, but are not limited to, 2780AD ovarian carcinoma cells (Van der Singh et al .
  • NCI-H1688 (ATCC CCL 257), NCI-H2126 (ATCC CCL 256), Raj i cells (ATCC CCL 86) , RD (ATCC CCL 136) , RPMI 2650 (ATCC CCL 30), RPMI 8226 cells (ATCC CCL 155), SNU-C2A (ATCC CCL 250.1), SNU-C2B (ATCC CCL 250), SW-13 (ATCC CCL 105), SW48 (ATCC CCL 231) , SW403 (ATCC CCL 230) , SW480 (ATCC CCL 227) , SW620 (ATCC CCL 227) , SW837 (ATCC CCL 235) , SW948 (ATCC CCL 237), SW1116 (ATCC CCL 233), SW1417 (ATCC CCL 238), SW1463 (ATCC CCL 234), T84 (ATCC CCL 248), U- 937 cells (ATCC CRL 1593), WiDr (ATCC CCL 218), and
  • Secondary human fibroblast strains such as WI-38 (ATCC CCL 75) and MRC-5 (ATCC CCL 171) , may be used.
  • primary, secondary, or immortalized human cells as well as primary, secondary, or immortalized cells from other species, can be used for in vi tro protein production or gene therapy.
  • IFNA2 -Expressing Cells IFNA2 -Expressing Cells
  • Homologously recombinant cells of the invention express IFNA2 at desired levels and are useful for both in vi tro production of IFNA2 and gene therapy.
  • Homologously recombinant cells according to this invention can be used for in vi tro production of IFNA2.
  • the cells are maintained under conditions, as described in the art, which result in expression of proteins.
  • the IFNA2 protein may be purified from cell lysates or cell supernatants .
  • a pharmaceutical composition containing the IFNA2 protein can be delivered to a human or an animal by conventional pharmaceutical routes known in the art (e.g., oral, intravenous, intramuscular, intranasal, pulmonary, transmucosal , intradermal, rectal, intrathecal , transdermal, subcutaneous, intraperitoneal , or intralesional) .
  • Oral administration may require use of a strategy for protecting the protein from degradation in the gastrointestinal tract: e.g., by encapsulation in polymeric microcapsules .
  • Homologously recombinant cells of the present invention are useful as populations of homologously recombinant cell lines, as populations of homologously recombinant primary or secondary cells, as homologously recombinant clonal cell strains or lines, as homologously recombinant heterogenous cell strains or lines, and as cell mixtures in which at least one representative cell of one of the four preceding categories of homologously recombinant cells is present.
  • Such cells may be used in a delivery system for treating (i) infections caused by such viruses as papilloma virus, hepatitis B and C viruses, vaccinia, herpes simplex virus, herpes zoster varicellosus virus, and rhinovirus, and (ii) any other conditions treatable with IFNA2.
  • viruses as papilloma virus, hepatitis B and C viruses, vaccinia, herpes simplex virus, herpes zoster varicellosus virus, and rhinovirus, and (ii) any other conditions treatable with IFNA2.
  • Homologously recombinant primary cells, clonal cell strains or heterogenous cell strains are administered to an individual " in whom the abnormal or undesirable condition is to be treated or prevented, in sufficient quantity and by an appropriate route, to express or make available the protein or exogenous DNA at physiologically relevant levels.
  • a physiologically rele- vant level is one which either approximates the level at which the product is normally produced in the body or results in improvement of the abnormal or undesirable condition.
  • the cells are syngeneic with respect to a immunocompetent recipient, the cells can be administered or implanted intravenously, intraarterially, subcutaneously, intraperitoneally, intraomentally, subrenal capsularly, intrathecally, intracranially, or intramuscularly .
  • the homologously recombinant cells to be administered can be enclosed in one or more semipermeable barrier devices.
  • the permeability properties of the device are such that the cells are prevented from leaving the device upon implantation into a subject, but the therapeutic protein is freely permeable and can leave the barrier device and enter the local space surrounding the implant or enter the systemic circulation. See, e.g., U.S. Patent Nos. 5,641,670, 5,470,731, 5,620,883, 5,487,737, and co-owned U.S. Patent Application entitled "Delivery of Therapeutic Proteins" (inventors: Justin C. Lamsa and Douglas A.
  • the barrier device can be implanted at any appropriate site: e.g., intraperitoneally, intrathecally, subcutaneously, intramuscularly, within the kidney capsule, or within the omentum.
  • Barrier devices are particularly useful and allow homologously recombinant immortalized cells, homologously recombinant cells from another species (homologously recombinant xenogeneic cells). , or cells from a nonhisto- compatibility-matched donor (homologously recombinant allogeneic cells) to be implanted for treatment of a subject.
  • the devices retain cells in a fixed position in vivo, while protecting the cells from the host's immune system.
  • Barrier devices also allow convenient short-term
  • Transient therapy by allowing ready removal of the cells when the treatment regimen is to be halted for any reason.
  • Transfected or infected xenogeneic and allogeneic cells may also be used in the absence of barrier devices for short-term gene therapy. In that case, the IFNA2 produced by the cells will be delivered in vivo until the cells are rejected by the host's immune system.
  • a number of synthetic, semisynthetic, or natural filtration membranes can be used for this purpose, including, but not limited to, cellulose, cellulose acetate, nitrocellulose, polysulfone, polyvinylidene difluoride, polyvinyl chloride polymers and polymers of polyvinyl chloride derivatives.
  • Barrier devices can be utilized to allow primary, secondary, or immortalized cells from another species to be used for gene therapy in humans .
  • Another type of device useful in the gene therapy of the invention is an implantable collagen matrix in which the cells are embedded.
  • Such a device which can contain beads to which the cells attach, is described in WO 97/15195, herein incorporated by reference. It can be implanted as described above.
  • the number of cells needed for a given dose or implantation depends on several factors, including the expression level of the protein, the size and condition of the host animal, and the limitations associated with the implantation procedure. Usually the number of cells implanted in an adult human or other similarly-sized animal is in the range of 1 X 10 4 to 5 X 10 10 , and preferably 1 X 10 8 to 1 X 10 9 . If desired, they may be implanted at multiple sites in the patient, either at one time or over a period of months or years. The dosage may be repeated as needed.
  • Transkaryotic Therapies, Inc. represents that the ATCC is a depository affording permanence of the deposit and ready accessibility thereto by the public if a patent is granted. All restrictions on the availability to the public of the material so deposited will be irrevocably removed upon the granting of the patent. The material will be available during the pendency of the patent application to one determined by the Commissioner to be entitled thereto under 37 CFR 1.14 and 35 U.S.C. ⁇ 122.
  • the deposited material will be maintained with all the care necessary to keep it viable and uncontaminated for a period of at least five years after the most recent request for the furnishing of a sample of the deposited material, and in any case, for a period of at least thirty (30) years after the date of deposit or for the enforceable life of the patent, whichever period is longer.
  • Applicants' assignee acknowledges its duty to replace the deposit should the depository be unable to furnish a sample when requested due to the condition of the deposit.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Wood Science & Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Medicinal Chemistry (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Public Health (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Animal Behavior & Ethology (AREA)
  • Veterinary Medicine (AREA)
  • Mycology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Cell Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Toxicology (AREA)
  • Oncology (AREA)
  • Communicable Diseases (AREA)
  • Virology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Peptides Or Proteins (AREA)

Abstract

An isolated nucleic acid molecule that hybridizes under stringent conditions, or shares at least 80 % sequence identity, with a defined genomic region upstream of the coding region of the IFNA2 gene, and a DNA construct containing that DNA molecule as a targeting sequence for homologous recombination.

Description

GENOMIC SEQUENCES UPSTREAM OF THE CODING REGION OF THE IFN-ALPHA2 GENE FOR PROTEIN PRODUCTION AND DELIVERY
Field of the Invention This invention relates to genomic DNA.
5 Background of the Invention
Current approaches to treating disease with therapeutic proteins include both administration of proteins produced in vi tro and gene therapy. In vi tro production of a protein generally involves the
10 introduction of exogenous DNA coding for the protein of interest into appropriate host cells in culture. Gene therapy methods, on the other hand, involve administering to a patient cells, plasmids," or viruses that contain a sequence encoding the therapeutic protein of interest.
15 Certain therapeutic proteins may also be produced by altering the expression of their endogenous genes in a desired manner with gene targeting techniques. See, e.g., U.S. Patent Nos. 5,641,670, 5,733,761, and 5,272,071, U.S. Patent Application Serial No. 08/406,030,
20 WO 91/06666, WO 91/06667, and WO 90/11354, all of which are incorporated by reference in their entirety.
Summary of the Invention The present invention is based upon the identification and sequencing of genomic DNA 5' to the
25 coding sequence of the human interferon-α 2 ("IFNA2") gene. This DNA can be used, for example, in a DNA construct that alters (e.g., increases) expression of an endogenous IFNA2 gene in a mammalian cell upon integration into the genome of the cell via homologous
30 recombination. "Endogenous IFNA2 gene" refers to a genomic (i.e., chromosomal) gene that encodes IFNA2. The construct contains a targeting sequence including or derived from the newly disclosed 5' noncoding sequence, and a transcriptional regulatory sequence. The transcriptional regulatory sequence preferably differs in sequence from the transcriptional regulatory sequence of the endogenous IFNA2 gene. The targeting sequence directs the integration of the regulatory sequence into a region upstream of the endogenous IFNA2 -coding sequence such that the regulatory sequence becomes operatively linked to the endogenous coding sequence. By "operatively linked" is meant that the regulatory sequence can direct expression of the endogenous IFNA2- coding sequence. The construct may additionally contain a selectable marker gene to facilitate selection of cells that have stably integrated the construct, and/or another coding sequence linked to a promoter.
In one embodiment, the DNA construct comprises: (a) a targeting sequence, (b) a regulatory sequence, (c) an exon, (d) a splice-donor site, (e) an intron, and (f) a splice-acceptor site, wherein the targeting sequence directs the integration of itself and elements (b) - (f) such that elements (b) - (f) are within or upstream of the endogenous gene . The regulatory sequence then directs production of a transcript that includes not only elements (c) - (f) , but also the endogenous IFNA2 coding sequence. Preferably, the intron and the splice-acceptor site are situated in the construct downstream from the splice-donor site.
The targeting sequence is homologous to a preselected target site in the genome with which homologous recombination is to occur. It contains at least 20 (e.g., at least 30, 50, 100, or 1000) contiguous nucleotides of SEQ ID NO: 12; and can contain, for instance, at least 20 (e.g., at least 30, 50, or 100) contiguous nucleotides of SEQ ID NO:7, at least 20 (e.g., at least 30 or 50) contiguous nucleotides of SEQ ID NO:8, or at least 20 (e.g., at least 30, 50, 100, or 1000) contiguous nucleotides of SEQ ID NO: 13. In addition, the targeting sequence can contain at least 20 (e.g., at least 30, 50, or 100) contiguous nucleotides of SEQ ID NO: 16, at least 20 contiguous nucleotides of SEQ ID NO:17, at least 20 (e.g., at least 30 or 50) contiguous nucleotides of SEQ ID NO: 18, or at least 20 (e.g., at least 30, 50, 100, or 1000) contiguous nucleotides of SEQ ID NO: 19. SEQ ID NO : 7 corresponds to nucleotides 1 to 278 of SEQ ID NO: 12; SEQ ID NO : 8 corresponds to nucleotides 3492 to 3564 of SEQ ID NO: 12; and SEQ ID NO: 13 corresponds to nucleotides 279 to 3491 of SEQ ID NO: 12. By "homologous" is meant that the targeting sequence is identical or sufficiently similar to its genomic target site so that the targeting sequence and target site can undergo homologous recombination within a human cell . A small percentage of basepair mismatches is acceptable, as long as homologous recombination can occur at a useful frequency. To facilitate homologous recombination, the targeting sequence is preferably at least about 20 (e.g., at least 50, 100, 250, 400, or 1,000) base pairs ("bp") long. The targeting sequence can also include genomic sequences from outside the region covered by SEQ ID NO: 12, so long as it includes as least 20 nucleotides from within this region. For example, additional targeting sequence could be derived from the sequence lying between SEQ ID NO : 8 and the endogenous transcription initiation sequence of the IFNA2 gene. Due to polymorphism that exists at the IFNA2 genetic locus, minor variations in the nucleotide composition of any given genomic target site may occur in any given mammalian species. Targeting sequences that correspond to such polymorphic variants of SEQ ID NOs : 7 , 8, 12, 13, 16, 17, 18, and 19 (particularly human polymorphic variants) are within the scope of this invention.
Upon homologous recombination, the regulatory sequence of the construct is integrated into a pre- selected region upstream of the coding sequence of an IFNA2 gene in a chromosome of a cell. The resulting new transcription unit containing the construct-derived regulatory sequence alters the expression of the target IFNA2 gene. The IFNA2 protein so produced may be identical in sequence to the IFNA2 protein encoded by the unaltered, endogenous gene, or may contain additional, substituted, or fewer amino acid residues as compared to the wild type IFNA2 protein, due to changes introduced as a result of homologous recombination. Altering gene expression encompasses activating (or causing to be expressed) a gene which is normally silent (i.e, essentially unexpressed) in the cell as obtained, increasing or decreasing the expression level of a gene, and changing the regulation pattern of a gene such that the pattern is different from that in the cell as obtained. "Cell as obtained" refers to the cell prior to homologous recombination.
Also within the scope of the invention is a method of using the present DNA construct to alter expression of an endogenous IFNA2 gene in a mammalian cell. This method includes the steps of (i) introducing the DNA construct into the mammalian cell, (ii) maintaining the cell under conditions that permit homologous recombination to occur between the construct and a genomic target site homologous to the targeting sequence, to produce a homologously recombinant cell; and (iii) maintaining the homologously recombinant cell under conditions that permit expression of the IFNA2 coding sequence under the control of the construct-derived regulatory sequence. At least a part of the genomic target site is 5' to the coding sequence of an endogenous IFNA2 gene. That is, the genomic target site can contain coding sequence as well as 5' non-coding sequence. The invention also features transfected or infected cells in which the construct has undergone homologous recombination with genomic DNA upstream of the endogenous ATG initiation codon in one or both alleles of the endogenous IFNA2 gene. Such transfected or infected cells, also called homologously recombinant cells, have an altered IFNA2 expression pattern. These cells are particularly useful for in vi tro IFNA2 production and for delivering IFNA2 via gene therapy. Methods of making and using such cells are also embraced by the invention. The cells can be of vertebrate origin such as mammalian (e.g., human, non-human primate, cow, pig, horse, goat, sheep, cat, dog, rabbit, mouse, guinea pig, hamster, or rat) origin.
The invention further relates to a method of producing a mammalian IFNA2 protein in vi tro or in vivo by introducing the above-described construct into the genome of a host cell via homologous recombination. The homologously recombinant cell is then maintained under conditions that allow transcription, translation, and optionally, secretion of the IFNA2 protein. The invention also features isolated nucleic acids comprising a sequence of at least 20 (e.g., at least 30, 50, 100, 200, or 1000) contiguous nucleotides of SEQ ID NO: 12 or its complement, or of a sequence identical to SEQ ID NO: 12 except for polymorphic variations or other minor variations (e.g., less than 5% of the sequence) which do not prevent homologous recombination with the target sequence. For instance, the isolated DNA can contain at least 20 (e.g., at least 30, 50, or 100) contiguous nucleotides of SEQ ID NO: 7 or its complement, at least 20 (e.g., at least 30 or 50) contiguous nucleotides of SEQ ID NO: 8 or its complement, at least 20 (e.g., at least 30, 50, 100, or 1000) contiguous nucleotides of SEQ ID NO: 13 or its complement, at least 20 (e.g., at least 30, 50, or 100) contiguous nucleotides of SEQ ID NO: 16 or its complement, at least 20 contiguous nucleotides of SEQ ID NO: 17 or its complement, at least 20 (e.g., at least 30 or 50) contiguous nucleotides of SEQ ID NO:18 or its complement, or at least 20 (e.g., at least 30, 50, 100, or 1000) contiguous nucleotides of SEQ ID NO: 19 or its complement.
In one embodiment, the isolated nucleic acid of the invention includes a contiguous 100 bp block of SEQ ID NO: 12. For example, the isolated DNA can contain nucleotides 1 to 100, 101 to 200, 201 to 300, 301 to 400, 401 to 500, 501 to 600, 601 to 700, 701 to 800, 801 to 900, 901 to 1000, 1001 to 1100, 1101 to 1200, 1201 to 1300, 1301 to 1400, 1401 to 1500, 1501 to 1600, 1601 to 1700, 1701 to 1800, 1801 to 1900, 1901 to 2000, 2001 to 2100, 2101 to 2200, 2201 to 2300, 2301 to 2400, 2401 to 2500, 2501 to 2600, 2601 to 2700, 2701 to 2800, 2801 to 2900, 2901 to 3000, 3001 to 3100, 3101 to 3200, 3201 to 3300, 3301 to 3400, 3401 to 3500, or 3465 to 3564 of SEQ ID NO: 12 or its complement. These blocks of SEQ ID NO: 12 or its complement are also useful as targeting sequences in the constructs of the invention.
In the isolated DNA, the SEQ ID NO: 12 -derived sequence is not linked to a sequence encoding intact IFNA2, or at least is not linked in the same configuration (i.e., separated by the same noncoding sequence) as occurs in any wild-type genome. The term "isolated DNA", as used herein, thus does not denote a chromosome or a large piece of genomic DNA (as might be incorporated into a cosmid or yeast artificial chromosome) that includes not only part or all of SEQ ID NO: 12, but also an intact IFNA2 -coding sequence and all of the sequence which lies between the IFNA2 coding sequence and the sequence corresponding to SEQ ID NO: 12 as it exists in the genome of a cell. It does include, but is not limited to, a DNA (i) which is incorporated into a plasmid or virus; or (ii) which exists as a separate molecule independent of other sequences, e.g., a fragment produced by polymerase chain reaction ("PCR") or restriction endonuclease treatment . The isolated DNA preferably does not contain a sequence which encodes intact IFNA2 precursor (i.e., IFNA2 complete with its endogenous secretion signal peptide) .
The invention also includes isolated DNA comprising a strand which contains a sequence that is at least 100 (e.g., at least 200, 400, or 1000) nucleotides in length and that hybridizes' under either moderately stringent or highly stringent conditions with SEQ ID NO: 7, 8, 12, 13, 16, 17, 18, and/or 19, or the complement of SEQ ID NO: 7, 8, 12, 13, 16, 17, 18, and/or 19. The sequence is not linked to an IFNA2-coding sequence, or at least is not linked in the same configuration as occurs in any wild-type genome. By moderately stringent conditions is meant hybridization at 50°C in Church buffer (7% SDS, 0.5% NaHP04, 1 M EDTA, 1% bovine serum albumin) and washing at 50°C in 2X SSC. Highly stringent conditions are defined as: hybridization at 42°C in the presence of 50% formamide; a first wash at 65°C with 2X SSC containing 1% SDS; followed by a second wash at 65°C with 0.1X SSC.
Also embraced by the invention is isolated DNA comprising a strand which contains a sequence that (1) is at least 50 (e.g., at least 70 or 100) nucleotides in length and (2) shares at least 80% (e.g., at least 85%, 90%, 95%, or 98%) sequence identity with a fragment or all of SEQ ID NO: 12, or with the complement of the fragment. This fragment can include, for instance, a part or all of SEQ ID NO : 7 , 8, 13, 16, 17, 18, or 19. The sequence is not linked to an intact IFNA2 -coding sequence, or at least is not linked in the same configuration as occurs in any wild-type genome. Where a particular polypeptide or nucleic acid molecule is said to have a specific percent identity or conservation to a reference polypeptide or nucleic acid molecule, the percent identity or conservation is determined by the algorithm of Myers and Miller, CABIOS (1989) , which is embodied in the ALIGN program (version 2.0), or its equivalent, using a gap length penalty of 12 and a gap penalty of 4 where such parameters are required. All other parameters are set to their default positions. Access to ALIGN is readily available. See, e.g., http: //www2. igh.cnrs . fr'/bin/align-guess . cgi on the Internet .
The invention also features a method of delivering IFNA2 to an animal (e.g., a mammal such as a human, non- human primate, cow, pig, horse, goat, sheep, cat, dog, rabbit, mouse, guinea pig, hamster, or rat) by providing a cell whose endogenous IFNA2 gene has been activated as described herein, and implanting the cell in the animal, where the cell secretes IFNA2. Also included in the invention is a method of producing IFNA2 by providing a cell whose endogenous IFNA2 gene has been activated as described herein, and culturing the cell in vi tro under conditions which permit the cell to express and secrete IFNA2.
The invention further includes isolated DNA that shares at least 80% (e.g., at least 85%, 90%, or 95%) sequence identity, or hybridizes under highly or moderately stringent conditions, with a portion (e.g., at least about 20, 50, 100, 400, or 1000 bp in length) of the Hindlll-BamHI insert of plasmid pA2HB (described below) . The 3' end of this portion of the insert is at least 511 bp upstream of the ATG translation initiation codon of the IFNA2 -coding sequence included in the plasmid insert .
The isolated DNA of the invention can be used, for example, as a source of an upstream PCR primer for use (when combined with a suitable downstream primer) in obtaining the regulatory region and/or complete coding sequence of an endogenous IFNA2 gene, or as a hybridization probe for indicating the presence of chromosome 9 in a preparation of human chromosomes. It can also be used, as described below, in a method for altering the expression of an endogenous IFNA2 gene in a vertebrate cell.
Unless otherwise defined, all technical and scientific terms used herein .have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Exemplary methods and materials are described below, although methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the present invention. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control. The materials, methods, and examples are illustrative only and not intended to be limiting.
Other features and advantages of the invention will be apparent from the following detailed description, and from the claims.
Brief Description of the Drawings Fig. 1 is a representation of the published sequence (SEQ ID NO:l) of a human IFNA2-coding sequence and some flanking 5' and 3' non-coding sequences (GenBank HUMIFNAA) . Sequences of PCR primers IFN1, IFN2, IFN6, and IFN7 are indicated by arrows.
Fig. 2 is a schematic diagram showing the human IFNA2 genomic region encompassed by the insert of plasmid pA2HB .
Fig. 3 is a representation of the nucleotide sequence (SEQ ID NO: 7) of a region upstream of the coding sequence of a human IFNA2 gene. This nucleotide sequence has not been reported previously. Fig. 4 is a representation of a sequence (SEQ ID NO: 9) of a human IFNA2-coding sequence and some flanking 5' and 3' non-coding sequences. The underlined sequence (SEQ ID NO: 8) has not been previously reported. The polypeptide sequence (SEQ ID NO: 2) encoded by this gene is also shown. The N-terminus of the mature polypeptide is indicated by "Mature."
Fig. 5 is a schematic diagram showing a construct of the invention. The construct contains a first targeting sequence (1) ; an amplifiable marker gene (AM) ; a selectable marker gene (SM) ; a regulatory sequence; a CAP site; an exon; a splice-donor site (SD) ; an intron; a splice-acceptor site (SA) ; and a second targeting sequence (2) . The black boxes represent coding DNA and the stippled boxes represent transcribed but untranslated sequences.
Fig. 6 is a representation of a sequence (SEQ ID NO: 11) of a human genomic sequence 5' to the IFNA2 coding sequence, and including some coding sequence. The underlined sequence is previously reported while the remainder (-4074 to -511; SEQ ID NO: 12) is new. The framed sequence is SEQ ID NO: 13. The sequence 5' to SEQ ID NO: 13 is SEQ ID NO : 7. The sequence between the framed area and the underlined sequence is SEQ ID NO: 8. Nucleotides -4074 to -3270 is SEQ ID NO: 16. Nucleotides -3267 to -3239 is SEQ ID NO: 17. Nucleotides -3241 to - 3137 is SEQ ID NO: 18. Nucleotides -3139 to -511 is SEQ ID NO: 19.
Fig. 7 is a representation of a first targeting sequence (SEQ ID NO: 14) used in a construct of the invention.
Fig. 8 is a representation of a second targeting sequence (SEQ ID NO: 15) used in a construct of the invention.
Detailed Description The present invention is based on the discovery of the nucleotide composition of sequences upstream to the coding sequence of a human IFNA2 gene.
Interferon-α constitutes a complex gene family with 14 genes clustered on the short arm of chromosome 9. None of these genes, including the IFNA2 gene, have introns. Interferon-α is produced by macrophages, T and B cells, and a variety of many other cells. Interferon-α has considerable antiviral effects, and has been shown to be efficacious in treating infections by papilloma virus, hepatitis B and C viruses, vaccinia, herpes simplex virus, herpes zoster varicellosus virus, and rhinovirus .
The human IFNA2 gene encodes a 188 amino acid precursor protein (SEQ ID NO:2) containing a 23 amino acid signal peptide. The genomic map of the human IFNA2 gene is shown in Fig. 1. The map is constructed based on 1,733 base pair ("bp") published sequences (HUMIFNAA, GenBank accession number J00207 and V00544; SEQ ID NO:l) which begin at position -510 relative to the translational start site (unless otherwise specified, all positions referred to herein are relative to the translational start site), and end at position +1,223. The cap site is located at position -67. Specific Sequences 5' to an IFNA2 Gene and Their Use in Altering Endogenous IFNA2 Gene Expression
To obtain genomic DNA containing sequence upstream to an IFNA2 gene, a human leukocyte genomic library in lambda EMBL3 (Clontech catalog # HL1006d) was screened with a 332 bp probe generated by PCR. This probe corresponds to the genomic region between positions -263 and +69, and was amplified from human genomic DNA using oligonucleotide primers designated IFN7 and IFN6, both of which were designed from the available IFNA2 genomic DNA sequence (Fig. 1) . The 5' end of primer IFN7 corresponds to position -263, and the primer's sequence is 5' -AGTTTCTAAAAAGGCTCTGGGGTA-3' (SEQ ID NO : 3 ) . The 5' end of primer IFN6 corresponds to position +69, and this primer sequence is 5 ' -GCCCACAGAGCAGCTTGAC-3 ' (SEQ ID NO:4) .
Approximately one million recombinant phage were screened with the radiolabelled 332 bp probe. Sixty positive plaques were isolated from the primary screening plates. Lambda phage DNA was isolated from thirty of these plaques and subjected to PCR assay using oligonucleotide primers IFN1 and IFN2. Both IFN1 and IFN2 are derived from the 3 ' untranslated region of the IFNA2 gene; their sequences can be found at the website "http://www.ncbi.nlm.nih.gov/dbSTS," using the identification code "NCBI_ID : 42433. " The 5' end of primer IFN1 corresponds to position +639, and the primer's sequence is 5' -AAAGACTCATGTTTCTGCTATGACC-3 ' (SEQ ID NO: 5) . The 5' end of primer IFN2 corresponds to position +853, and the primer's sequence is 5'-
GGTGCACATGACATAATATGAACA-3' (SEQ ID NO: 6) . Of the thirty phage samples, two generated the expected 215 bp PCR product. One of the two phage plaques was further purified by two additional rounds of hybridization screening, yielding phage clone 4-4-1. A 8.3 kb Hindlll-BamHI fragment from phage 4-4-1 was subcloned into pBluescript II SK+ (Stratagene, La Jolla, CA) to produce pA2HB, which contains approximately 4.3 kb of untranscribed upstream sequences, the protein- coding region (1.1 kb) , and approximately 2.8 kb of downstream sequences of the IFNA2 gene. A restriction map of the 8.3 kb Hindlll-BamHI fragment is shown in Fig. 2.
The pA2HB plasmid was sequenced by the Sanger method. A 278 bp sequence whose 5' terminus is at the 5' end Hindlll site is shown below (see also Fig. 3) :
AAGCTTTTATAGGTGTAAATTTTCCACTTAGTACTGCTTTTG TAATGTTGTCTTTTTATTTTCATTTATCTCAAGATGTTTTCT AATTTCTCTTGACTTCCTTCTTAAATTCTTACCTCATGTAGA CATACATTTTTGGCCCTATGCATTGGGATGCAAAACCAGACT
AATTTACTTTGTACAAAAAGAAAAATGAGAAAGAAATATATT TGGTCTTGTGAGCACTATATGGAAATACTTTATATTCCATTT GTTTCATCATATTCATATATCCCTTT (SEQ ID NO: 7)
The Hindlll site is located at position -4,073. A previously unpublished sequence between positions -583 and -511 was also determined, as shown below and as underlined in Fig. 4.
CATTGGATACTCCATCACCTGCTGTGATATTATGAATGTCTG CCTATATAAATATTCACTATTCCATAACACA (SEQ ID NO: 8)
The sequence (SEQ ID NO: 13) between the regions corresponding to SEQ ID NOs : 7 and 8 was also determined.
The genomic sequence between positions -4,074 and
-511 (SEQ ID NO: 12) is the sequence which is not underlined in Fig. 6. SEQ ID NOs : 7 and 8 correspond to nucleotides 1-278 and nucleotides 3492-3564 of SEQ ID NO: 12, respectively.
To alter the expression of an endogenous INFA2 gene, a DNA fragment containing nucleotides 279-3311 of SEQ ID NO: 12 was cloned into a plasmid to produce targeting construct pGA402. Nucleotides 279-3311 of SEQ ID NO: 12 was designated SEQ ID NO: 14. The fragment was inserted upstream of a CMV promoter and a neomycin resistance gene and is schematically represented in Fig. 5. For the second targeting sequence of Fig. 5, a DNA fragment containing nucleotides -68 to 69 of the IFNA2 gene sequence shown in Fig. 1 was cloned downstream of the CMV promoter and neomycin resistance gene.
Nucleotides -68 to 69 of the IFNA2 gene was designated SEQ ID NO: 15. The pGA402 plasmid was introduced into human fibroblast cells exhibiting little or no INFA2 gene expression, to allow homologous recombination with the endogenous INFA2 gene. Cells resistant to G418 after plasmid introduction were screened to identify cells with increased INFA2 gene expression, as would be expected if a homologous recombination event between pGA402 and the genomic DNA took place in the vicinity of the endogenous INFA2 gene.
General Methodologies
Alteration of Endogenous IFNA2 Expression
Using the above-described IFNA2 upstream sequences, one can alter the expression of an endogenous human IFNA2 gene by a method as generally described in
U.S. Patent
No. 5,641,670. One strategy is shown in Fig. 5. In this strategy, a targeting construct is designed to include a first targeting sequence homologous to a first target site upstream of the gene, an amplifiable marker gene, a selectable marker gene, a regulatory region, a CAP site, an exon, a splice-donor site, an intron, a splice- acceptor site, and a second targeting sequence homologous to a second target site downstream of the first target site and terminating either within or upstream of the
IFNA2 -coding sequence. According to this strategy, the 5' end of the second target site is preferably less than 107 bp upstream of the normal IFNA2 translational initiation site, in order to avoid undesired ATG start codons within the transcribed sequence. A transcript produced from the homologously recombined locus will include the construct-derived exon, the construct-derived splice-donor site, the construct-derived intron, the construct-derived splice-acceptor site, any sequence between any of those elements, and the sequence from the construct derived splice acceptor site through the entire endogenous coding sequence to the transcription termination site of the IFNA2 gene. Splicing of this transcript will generate a mRNA which can be translated to produce a precursor of human IFNA2, having either the normal IFNA2 secretion signal sequence or a genetically engineered secretion signal sequence, depending on the characteristics of the construct-derived exon. The size of the exogenous intron and thus the position of the exogenous regulatory region relative to the coding region of the gene can be varied to optimize the function of the regulatory region.
In any activation strategy, the first and second target sites need not be immediately adjacent or even be near each other. When they are not immediately adjacent to each other, a portion of the IFNA2 gene's normal upstream region and/or a portion of the coding region would be deleted upon homologous recombination. Mutations that facilitate alteration of endogenous IFNA2 expression may be introduced into the chromosomal DNA via homologous recombination. For instance, it may be desirable to abolish a spurious and undesired ATG initiation codon upstream of the correct ATG initiation codon and between the exogenous regulatory region and the endogenous IFNA2 coding region in the homologously recombined locus. To do so, one can employ a targeting construct having a targeting sequence homologous to a genomic site that spans the undesired ATG initiation codon. This targeting sequence contains nucleotides that correspond to the desired mutation, e.g., contains ATT instead of ATG. The targeting construct optionally includes one or more selectable markers to facilitate selection of homologously recombined cells. An exogenous regulatory region can then be introduced to the homologously recombined cells upstream of the altered sites, using the expression alteration method of the invention.
Alternatively, the exogenous regulatory region and the desired sequence mutation (s) may be introduced into the genomic DNA in a single step. The DNA construct used in this embodiment may contain both the exogenous regulatory region and a targeting sequence that contains nucleotides corresponding to the desired mutation (s). One may also co-transfect or .co-infect two separate constructs into target cells, with one construct containing the regulatory region and the other containing nucleotides corresponding to the desired mutation.
If desired, a mammalian splice-acceptor site may be introduced into the genomic DNA, e.g., at a site between an undesired ATG initiation codon and the correct ATG initiation codon, in a similar manner. The DNA construct used for this purpose contains a targeting sequence homologous to a genomic site upstream of the correct INFA2 initiation codon, and adjacent to or embedded within the homologous sequence, a sequence corresponding to the desired splice-acceptor site. Cells containing the correctly recombined IFNA2 locus are then transfected or infected with a second construct containing an exogenous regulatory region and an exon with an unpaired splice-donor site at its 3' end, together with targeting sequence (s) which target the second construct to a genomic region upstream of the inserted splice-acceptor site. A primary transcript produced under the control of the exogenous regulatory region will include the exogenous exon, the exogenous splice-donor site, the exogenous splice-acceptor site, any sequences between those elements, and the sequence between the exogenous splice-acceptor site and the transcriptional termination site of the endogenous IFNA2 gene. Upon splicing, the splice-donor site of the transcript will be spliced to the splice-acceptor site, and the intervening intronic RNA, which may contain undesirable AUG initiation codons, will be removed. Any problems associated with having a transcript with undesired AUG translation initiation codons between the transcription start site and the IFNA2 -coding sequence are thereby avoided. Of course, the regulatory region, exon, splice donor site, and splice-acceptor site can instead be introduced in a single step. The DNA construct used in this embodiment contains a regulatory region, an exon, a splice-donor site, an intron, a splice-acceptor site, a targeting sequence homologous to a genomic site between the correct INFA2 initiation codon and the undesired ATG codon, and optionally, one or more selectable markers. Alternatively, two separate targeting constructs may be useful, with one containing the regulatory region, the exon, and the splice-donor site, and the other containing the splice-acceptor site. The two constructs can be introduced into target cells in a single step. The DNA Construct
The DNA construct of the invention includes at least a targeting sequence and a regulatory sequence. It can additionally include an exon; or an exon and splice donor site; or an exon, a splice-donor site, an intron, and a splice-acceptor site. In the construct, the exon, if present, is 3 ' of the regulatory sequence, and the splice-donor site, if present, is at the 3' end of the exon. The intron and splice acceptor site, if present, are 3' of the splice donor site. In addition, there can be multiple exons and introns (with appropriate splice donor and acceptor sites) in the construct. The DNA in the construct is referred to as exogenous, since the DNA is not an original part of the genome of a host cell . Exogenous DNA may possess sequences identical to or different from portions of the endogenous genomic DNA present in the cell prior to transfection or infection by viral vector. As used herein, "transfection" means introduction of a plasmid into a cell by nonviral (e.g., chemical or physical) means, such as calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, electroporation, microinjection, microprojectiles, or biolistic-mediated uptake. "Infection" means introduction of a viral vector into a cell by viral infection. Various elements included in the DNA construct of the invention are described in detail below.
The DNA construct can also include cis-acting or trans-acting viral sequences (e.g., packaging signals), thereby enabling delivery of the construct into the nucleus of a cell via infection by a viral vector. Where necessary, the DNA construct can be disengaged from various steps of a virus life cycle, such as integrase- mediated integration in retroviruses or episome maintenance. Disengagement can be accomplished by appropriate deletions or mutations of viral sequences, such as a deletion of the integrase coding region in a retrovirus vector. Additional details regarding the construction and use of viral vectors are found in Robbins et al . , Pharmacol. Ther. 80:35-47, 1998; and Gunzburg et al . , Mol . Med. Today 1:410-417, 1995, herein incorporated by reference. Targeting Sequences
Targeting sequences permit homologous recombination of a desired sequence into a selected site in the host genome. Targeting sequences are homologous to (i.e., able to homologously recombine with) their respective target sites in the host genome.
A circular DNA construct can employ a single targeting sequence, or two or more separate targeting sequences. A linear DNA construct may contain two or more separate targeting sequences. The target site to which a given targeting sequence is homologous can reside within the coding region of the IFNA2 gene, upstream of and immediately adjacent to the coding region, or upstream of and at a distance from the coding region. The first of the two .targeting sequences in the construct (or the entire targeting sequence, if there is only one targeting sequence in the construct) is derived from the newly disclosed genomic regions upstream of the IFNA2 -coding sequence. This targeting sequence contains a portion (e.g., 20 or more contiguous nucleotides) of SEQ ID NO: 12, e.g., a portion of SEQ ID NO: 7, 8, or 13.
The second of the two targeting sequences in the construct may target a genomic region upstream of the coding sequence or target part or all of the coding sequence itself. By way of example, the second targeting sequence may contain, at its 3' end, an "exogenous" coding region identical to the first few codons of the IFNA2 coding sequences. Upon homologous recombination, the exogenous coding region recombines with the targeted part of the endogenous IFNA2-coding sequence. If desired, the exogenous coding region may encode a heterologous amino acid sequence, so long as the exogenous coding region remains sufficiently homologous to the endogenous coding region it replaces to permit homologous recombination. The targeting sequence may additionally include sequence derived from a previously disclosed region of the IFNA2 gene, including those described herein, as well as a region further upstream which is structurally uncharacterized but can be mapped by one skilled in the art.
Genomic fragments useful as targeting sequences can be identified by their ability to hybridize to a probe containing all or a portion of SEQ ID NO: 12. Such a probe can be generated by PCR using primers derived from SEQ ID NO: 12.
The Regulatory Sequence
The regulatory sequence of the DNA construct can contain one or more promoters (e.g., a constitutive, tissue-specific, or inducible' promoter) , enhancers, scaffold-attachment regions or matrix attachment sites, negative regulatory elements, transcription factor binding sites, or combinations of these elements.
The regulatory sequence can be derived from a eukaryotic (e.g., mammalian) or viral genome. Useful regulatory sequences include, but are not limited to, those that regulate the expression of SV40 early or late genes, cytomegalovirus genes, and adenovirus major late genes. They also include regulatory regions derived from genes encoding mouse metallothionein-I, elongation factor-Ice, collagen (e.g., collagen Iαl, collagen Iα.2, and collagen IV), actin (e.g., γ-actin) , immunoglobulin, HMG-CoA reductase, glyceraldehyde phosphate dehydrogenase, 3-phosphoglyceratekinase, collagenase, stromelysin, fibronectin, vimentin, plasminogen activator inhibitor I, thymosin β4 , tissue inhibitors of etalloproteinase, ribosomal proteins, major histocompatibility complex molecules, and human leukocyte antigens . The regulatory sequence preferably contains a transcription factor binding site, such as a TATA Box, CCAAT Box, API, Spl, or a NF-κB binding site. Marker Genes If desired, the construct can include a sequence encoding a desired polypeptide, operatively linked to its own promoter. An example of this would be a selectable marker gene, which can be used to facilitate the identification of a targeting event. An amplifiable marker gene can also be included and used to facilitate selection of cells having co-amplified flanking DNA sequences. Cells containing amplified copies of the amplifiable marker gene can be identified by growth in the presence of an agent that selects for the expression of the amplifiable gene. The activated endogenous IFNA2 gene will be amplified in tandem with the amplified selectable marker gene. Cells containing multiple copies of the activated endogenous gene may produce very high levels of IFNA2 and are thus useful for in vi tro protein production and gene therapy.
The selectable and amplifiable marker genes do not have to lie immediately adjacent to each other. The amplifiable marker gene and selectable marker gene can be the same gene . One or both of the marker genes can be situated in the intron of the DNA construct. Suitable amplifiable marker genes and selectable marker genes are described in U.S. Patent No. 5,641,670.
The Splice-Donor and Splice-Acceptor Sites The DNA construct may further contain an exon, a splice-donor site at the 3' end of the exon, an intron, and a splice-acceptor site.
A splice-donor site is a sequence which directs the splicing of one exon of an RNA transcript to the splice-acceptor site of another exon of the transcript, resulting in removal of the intron between the two sites. Typically, the first exon lies 5' of the second exon, and the splice-donor site located at the 3' end of the first exon is paired with a splice-acceptor site flanking the second exon on the 5' side of the second exon. Splice- donor sites have a characteristic consensus sequence represented as (A/C) AGGURAGU (where R denotes a purine nucleotide) , with the GU in the fourth and fifth positions being required (Jackson, Nucleic Acids Research 19: 3715-3798, 1991). The first three bases of the splice-donor consensus site are the last three bases of the exon: i.e., they are not spliced out. Splice-donor sites are functionally defined by their ability to effect the appropriate reaction within the mRNA splicing pathway.
A splice acceptor site in a construct of the invention directs, in conjunction with a splice donor site, the splicing of one exon to another exon. Splice- acceptor sites have a characteristic sequence represented as (Y)10NYAG (SEQ ID NO:10), where Y denotes any pyrimidine and N denotes any nucleotide (Jackson, Nucleic Acids Research 19:3715-3798, 1991). The CAP Site The DNA construct can optionally contain a CAP site. A CAP site is a specific transcription start site which is associated with and utilized by the regulatory region. This CAP site is located at a position relative to the regulatory sequence in the construct such that following homologous recombination, the regulatory sequence directs synthesis of a transcript that begins at the CAP site. Alternatively, no CAP site is included in the construct, and the transcriptional apparatus will locate by default an appropriate site in the targeted gene to be utilized as a CAP site. Additional DNA elements
The construct may additionally contain sequences which affect the structure or stability of the RNA or protein produced by homologous recombination. Optionally, the DNA construct can include a bacterial origin of replication and bacterial antibiotic resistance markers or other selectable markers, which allow for large-scale plasmid propagation in bacteria or any other suitable cloning/host system. All of the above-described elements of the DNA construct are operatively linked or functionally placed with respect to each other. That is, upon homologous recombination between the construct and the targeted genomic DNA, the regulatory sequence can direct the production of a primary RNA transcript which initiates at a CAP site (optionally included in the construct) and includes the sequence lying between the CAP site and the endogenous IFNA2 gene's transcription stop site. Depending on the location of the CAP site, a portion of this sequence may include the IFNA2 gene's endogenous regulatory region as well as sequences neighboring that region that are normally not transcribed. If an exon, a splice-donor site and a splice-acceptor site are present in the construct, the primary transcript will also include the exon, the two splice sites, and the intron between the two sites.
The order of elements in the DNA construct can vary. Where the construct is a circular plasmid or viral vector, the relative order of elements in the resulting structure can be, for example: a targeting sequence, plasmid DNA (comprised of sequences used for the selection and/or replication of the targeting plasmid in a microbial or other suitable host) , selectable marker(s), a regulatory sequence, an exon, a splice-donor site, an intron, and a splice-acceptor site. Where the construct is linear, the order can be, for example: a first targeting sequence, a selectable marker gene, a regulatory sequence, an exon, a splice- donor site, an intron, a splice-acceptor site, and a second targeting sequence; or, in the alternative, a first targeting sequence, a regulatory sequence, an exon, a splice-donor site, an intron, a splice-acceptor site, a selectable marker gene, optionally an internal ribosomal entry site, and a second targeting sequence. Alternatively, the order can be : a first targeting sequence, a first selectable marker gene, a regulatory sequence, an exon, a splice-donor site, an intron, a splice-acceptor site, a second targeting sequence, and a second selectable marker gene; or, a first targeting sequence, a regulatory sequence, an exon, a splice-donor site, an intron, a splice-acceptor site, a first selectable marker gene, a second targeting sequence, and a second selectable marker gene. Recombination between the targeting sequences flanking the first selectable marker with homologous sequences in the host genome results in the targeted integration of the first selectable marker, while the second selectable marker is not integrated. Desired transfected or infected cells are those that are stably transfected or infected with the first, but not second, selectable marker. Such cells can be selected for by growth in a medium containing an agent which selects for expression of the first marker and another agent which selects against the second marker. Transfected or infected cells that have improperly integrated the targeting construct by a mechanism other than homologous recombination would be expected to express the second marker gene and will thereby be killed in the selection medium.
A positively selectable marker is sometimes included in the construct to allow for the selection of cells containing amplified copies of that marker. In this embodiment, the order of construct components can be, for example: a first targeting sequence, an amplifiable positively selectable marker, a second selectable marker (optional) , a regulatory sequence, an exon, a splice-donor site, an intron, a splice-acceptor site, and a second targeting DNA sequence.
The various elements of the construct can be obtained from natural sources (e.g., genomic DNA), or can be produced using genetic engineering techniques or synthetic processes. The regulatory region, CAP site, splice-donor site, intron, and splice acceptor site of the construct can be isolated as a complete unit from, e.g., the human elongation factor-lα (Genbank sequence HUMEFIA) gene or the cytomegalovirus (Genbank sequence HEHCMVP1) immediate early region. These components can also be isolated from separate genes.
Transfection or Infection and Homologous Recombination The DNA construct of the invention can be intro- duced into the cell, such as a primary, secondary, or immortalized cell, as a single DNA construct, or as separate DNA sequences which become incorporated into the chromosomal or nuclear DNA of a transfected or infected cell. By "transfected cell" is meant a cell into which (or into an ancestor of which) a DNA or RNA molecule has been introduced by a means other than using a viral vector. By "infected cell" is meant a cell into which (or into an ancestor of which) a DNA or RNA molecule has been introduced using a viral vector. Viruses known to be useful as vectors include adenovirus, adeno-associated virus, Herpes virus, mumps virus, poliovirus, lentivirus, retroviruses, Sindbis virus, and vaccinia viruses such as canary pox virus. The DNA can be introduced as a linear, double-stranded (with or without single-stranded regions at one or both ends) , single-stranded, or circular molecule. When the construct is introduced into host cells in two separate DNA fragments, the two fragments share DNA sequence homology (overlap) at the 3' end of one fragment and the 5' end of the other, while one carries a first targeting sequence and the other carries a second targeting sequence. Upon introduction into a cell, the two fragments can undergo homologous recombination to form a single molecule with the first and second targeting sequences flanking the region of overlap between the two original fragments. The product molecule is then in a form suitable for homologous recombination with the cellular target sites. More than two fragments can be used, with each of them designed such that they will undergo homologous recombination with each other to ultimately form' a product suitable for homologous recombination with the cellular target sites as described above .
The DNA construct of the invention, if not containing a selectable marker itself, can be co-transfected or co-infected with another construct that contains such a marker. A targeting plasmid may be cleaved with a restriction enzyme at one or more sites to create a linear or gapped molecule prior to transfection or infection. The resulting free DNA ends increase the frequency of the desired homologous recombination event. In addition, the free DNA ends may be treated with an exonuclease to create overhanging 5' or 3 ' single- stranded DNA ends (e.g., at least 30 nucleotides in length, and preferably 100-1000 nucleotides in length) to increase the frequency of the desired homologous recombination event. In this embodiment, homologous recombination between the targeting sequence and the genomic target will result in two copies of the targeting sequences, flanking the elements contained within the introduced plasmid. The DNA constructs may be transfected into cells (preferably in vi tro) by a variety of physical or chemical methods, including electroporation, microin- jection, microprojectile bombardment, calcium phosphate precipitation, liposome delivery, or polybrene- or DEAE dextran-mediated transfection.
The transfected or infected cell is maintained under conditions which permit homologous recombination, as described in the art (see, e.g., Capecchi, Science 24:1288-1292, 1989). When the homologously recombinant cell is maintained under conditions sufficient to permit transcription of the DNA, the regulatory region introduced by the DNA construct will alter transcription of the IFNA2 gene. Homologously recombinant cells (i.e., cells that have undergone the desired homologous recombination) can be identified by phenotypic screening or by analyzing the culture supernatant in enzyme-linked immunosorbent assays (ELISA) for IFNA2. Commercial ELISA kits for detecting IFNA2 are available from Biosource International
(Camarillo, CA) . Homologously recombinant cells can also be identified by Southern and Northern analyses or by polymerase chain reaction (PCR) screening.
As used herein, the term "primary cells" includes (i) cells present in a suspension of cells isolated from a vertebrate tissue source (prior to their being plated, i.e., attached to a tissue culture substrate such as a dish or flask) , (ii) cells present in an explant derived from tissue, (iii) cells plated for the first time, and (iv) cell suspensions derived from these plated cells. Primary cells can also be cells as they naturally occur within a human or an animal .
Secondary cells are cells at all subsequent steps in culturing. That is, the first time that plated primary cells are removed from the culture substrate and replated (passaged) , they are referred to herein as secondary cells, as are all cells in subsequent passages. Secondary cell strains consist of secondary cells which have been passaged one or more times. Secondary cells typically exhibit a finite number of mean population doublings in culture and the property of contact -inhibited, anchorage-dependent growth (anchorage-dependence does not apply to cells that are propagated in suspension culture) . Primary and secondary cells are not immortalized
Immortalized cells are cell lines (as opposed to cell strains, with the designation "strain" reserved for primary and secondary cells) that exhibit an apparently unlimited lifespan in culture. Cells selected for tr.ansfection or infection can fall into four types or categories: (i) cells which do not, as obtained, make or contain more than trace amounts of the IFNA2 protein, (ii) cells which make or contain the protein but in quantities other than those desired (such as, in quantities less than the level which is physiologically normal for the type of cells as obtained) , (iii) cells which make the protein at a level which is physiologically normal for the type of cells as obtained, but are to be augmented or enhanced in their content or production, and (iv) cells in which it is desirable to change the pattern of regulation or induction of a gene encoding the protein.
Primary, secondary and immortalized cells to be transfected or infected by the present method can be obtained from a variety of tissues and include all appropriate cell types which can be maintained in culture. For example, suitable primary and secondary cells include fibroblasts, keratinocytes, epithelial cells (e.g., mammary epithelial cells, intestinal epithelial cells) , endothelial cells, glial cells, neural cells, formed elements of the blood (e.g., lymphocytes, bone marrow cells) , muscle cells, and precursors of these somatic cell types . Where the homologously recombinant cells are to be used in gene therapy, primary cells are preferably obtained from the individual to whom the transfected or infected primary or secondary cells are to be administered. However, primary cells can be obtained from a donor (i.e., an individual other than the recipient) of the same species. Examples of immortalized human cell lines useful for protein production or gene therapy include, but are not limited to, 2780AD ovarian carcinoma cells (Van der Blick et al . , Cancer Res., 48:5927-5932, 1988), A549 (American Type Culture Collection ("ATCC") CCL 185), BeWo (ATCC CCL 98) , Bowes Melanoma cells (ATCC CRL 9607) , CCRF-CEM (ATCC CCL 119), CCRF-HSB-2 (ATCC CCL 120.1), COLO201 (ATCC CCL 224) , COLO205 (ATCC CCL 222) , COLO 320DM (ATCC CCL 220), COLO 320HSR (ATCC CCL 220.1), Daudi cells (ATCC CCL 213), Detroit 562 (ATCC CCL 138), HeLa cells and derivatives of HeLa cells (ATCC CCL 2, 2.1 and 2.2), HCT116 (ATCC CCL 247), HL-60 cells (ATCC CCL 240), HT1080 cells (ATCC CCL 121) , IMR-32 (ATCC CCL 127) , Jurkat cells (ATCC TIB 152) , K-562 leukemia cells (ATCC CCL 243), KB carcinoma cells (ATCC CCL 17), KG-1 (ATCC CCL 246), KG-la (ATCC CCL 246.1), LS123 (ATCC CCL 255), LS174T (ATCC CCL CL-188) , LS180 (ATCC CCL CL-187) , MCF-7 breast cancer cells (ATCC BTH 22) , MOLT-4 cells (ATCC CRL 1582), Namalwa cells (ATCC CRL 1432), NCI-H498 (ATCC CCL 254), NCI-H508 (ATCC CCL 253), NCI-H548 (ATCC CCL 249), NCI-H716 (ATCC CCL 251), NCI-H747 (ATCC CCL 252),
NCI-H1688 (ATCC CCL 257), NCI-H2126 (ATCC CCL 256), Raj i cells (ATCC CCL 86) , RD (ATCC CCL 136) , RPMI 2650 (ATCC CCL 30), RPMI 8226 cells (ATCC CCL 155), SNU-C2A (ATCC CCL 250.1), SNU-C2B (ATCC CCL 250), SW-13 (ATCC CCL 105), SW48 (ATCC CCL 231) , SW403 (ATCC CCL 230) , SW480 (ATCC CCL 227) , SW620 (ATCC CCL 227) , SW837 (ATCC CCL 235) , SW948 (ATCC CCL 237), SW1116 (ATCC CCL 233), SW1417 (ATCC CCL 238), SW1463 (ATCC CCL 234), T84 (ATCC CCL 248), U- 937 cells (ATCC CRL 1593), WiDr (ATCC CCL 218), and WI- 38VA13 subline 2R4 cells (ATCC CLL 75.1), as well as heterohybridoma cells produced by fusion of human cells and cells of another species. Secondary human fibroblast strains, such as WI-38 (ATCC CCL 75) and MRC-5 (ATCC CCL 171) , may be used. In addition, primary, secondary, or immortalized human cells, as well as primary, secondary, or immortalized cells from other species, can be used for in vi tro protein production or gene therapy. IFNA2 -Expressing Cells
Homologously recombinant cells of the invention express IFNA2 at desired levels and are useful for both in vi tro production of IFNA2 and gene therapy.
Protein Production
Homologously recombinant cells according to this invention can be used for in vi tro production of IFNA2. The cells are maintained under conditions, as described in the art, which result in expression of proteins. The IFNA2 protein may be purified from cell lysates or cell supernatants . A pharmaceutical composition containing the IFNA2 protein can be delivered to a human or an animal by conventional pharmaceutical routes known in the art (e.g., oral, intravenous, intramuscular, intranasal, pulmonary, transmucosal , intradermal, rectal, intrathecal , transdermal, subcutaneous, intraperitoneal , or intralesional) . Oral administration may require use of a strategy for protecting the protein from degradation in the gastrointestinal tract: e.g., by encapsulation in polymeric microcapsules . Gene Therapy
Homologously recombinant cells of the present invention are useful as populations of homologously recombinant cell lines, as populations of homologously recombinant primary or secondary cells, as homologously recombinant clonal cell strains or lines, as homologously recombinant heterogenous cell strains or lines, and as cell mixtures in which at least one representative cell of one of the four preceding categories of homologously recombinant cells is present. Such cells may be used in a delivery system for treating (i) infections caused by such viruses as papilloma virus, hepatitis B and C viruses, vaccinia, herpes simplex virus, herpes zoster varicellosus virus, and rhinovirus, and (ii) any other conditions treatable with IFNA2.
Homologously recombinant primary cells, clonal cell strains or heterogenous cell strains are administered to an individual" in whom the abnormal or undesirable condition is to be treated or prevented, in sufficient quantity and by an appropriate route, to express or make available the protein or exogenous DNA at physiologically relevant levels. A physiologically rele- vant level is one which either approximates the level at which the product is normally produced in the body or results in improvement of the abnormal or undesirable condition. If the cells are syngeneic with respect to a immunocompetent recipient, the cells can be administered or implanted intravenously, intraarterially, subcutaneously, intraperitoneally, intraomentally, subrenal capsularly, intrathecally, intracranially, or intramuscularly .
If the cells are not syngeneic and the recipient is immunocompetent, the homologously recombinant cells to be administered can be enclosed in one or more semipermeable barrier devices. The permeability properties of the device are such that the cells are prevented from leaving the device upon implantation into a subject, but the therapeutic protein is freely permeable and can leave the barrier device and enter the local space surrounding the implant or enter the systemic circulation. See, e.g., U.S. Patent Nos. 5,641,670, 5,470,731, 5,620,883, 5,487,737, and co-owned U.S. Patent Application entitled "Delivery of Therapeutic Proteins" (inventors: Justin C. Lamsa and Douglas A. Treco) , filed April 16, 1999, all herein incorporated by reference. The barrier device can be implanted at any appropriate site: e.g., intraperitoneally, intrathecally, subcutaneously, intramuscularly, within the kidney capsule, or within the omentum.
Barrier devices are particularly useful and allow homologously recombinant immortalized cells, homologously recombinant cells from another species (homologously recombinant xenogeneic cells). , or cells from a nonhisto- compatibility-matched donor (homologously recombinant allogeneic cells) to be implanted for treatment of a subject. The devices retain cells in a fixed position in vivo, while protecting the cells from the host's immune system. Barrier devices also allow convenient short-term
(i.e., transient) therapy by allowing ready removal of the cells when the treatment regimen is to be halted for any reason. Transfected or infected xenogeneic and allogeneic cells may also be used in the absence of barrier devices for short-term gene therapy. In that case, the IFNA2 produced by the cells will be delivered in vivo until the cells are rejected by the host's immune system.
A number of synthetic, semisynthetic, or natural filtration membranes can be used for this purpose, including, but not limited to, cellulose, cellulose acetate, nitrocellulose, polysulfone, polyvinylidene difluoride, polyvinyl chloride polymers and polymers of polyvinyl chloride derivatives. Barrier devices can be utilized to allow primary, secondary, or immortalized cells from another species to be used for gene therapy in humans .
Another type of device useful in the gene therapy of the invention is an implantable collagen matrix in which the cells are embedded. Such a device, which can contain beads to which the cells attach, is described in WO 97/15195, herein incorporated by reference. It can be implanted as described above.
The number of cells needed for a given dose or implantation depends on several factors, including the expression level of the protein, the size and condition of the host animal, and the limitations associated with the implantation procedure. Usually the number of cells implanted in an adult human or other similarly-sized animal is in the range of 1 X 104 to 5 X 1010, and preferably 1 X 108 to 1 X 109. If desired, they may be implanted at multiple sites in the patient, either at one time or over a period of months or years. The dosage may be repeated as needed. Deposit
Under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure, the deposit of plasmid pA2HB was made with the American Type Culture Collection (ATCC) of Rockville, MD, USA on May
12, 1998. The deposit was given Accession Number 209872.
Applicants' assignee, Transkaryotic Therapies, Inc., represents that the ATCC is a depository affording permanence of the deposit and ready accessibility thereto by the public if a patent is granted. All restrictions on the availability to the public of the material so deposited will be irrevocably removed upon the granting of the patent. The material will be available during the pendency of the patent application to one determined by the Commissioner to be entitled thereto under 37 CFR 1.14 and 35 U.S.C. §122. The deposited material will be maintained with all the care necessary to keep it viable and uncontaminated for a period of at least five years after the most recent request for the furnishing of a sample of the deposited material, and in any case, for a period of at least thirty (30) years after the date of deposit or for the enforceable life of the patent, whichever period is longer. Applicants' assignee acknowledges its duty to replace the deposit should the depository be unable to furnish a sample when requested due to the condition of the deposit.
Other Embodiments It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not to limit the scope of the invention, which is defined by the scope of the appended claims .
Other aspects, advantages, and modifications are within the scope of the following claims.
What is claimed is:

Claims

1. A DNA construct that alters expression of an endogenous IFNA2 gene in a mammalian cell upon integration into the genome of the cell via homologous recombination, the construct comprising (i) a targeting sequence containing at least 20 contiguous nucleotides of SEQ ID NO: 12, and (ii) a transcriptional regulatory sequence .
2. The DNA construct of claim 1, wherein the construct further comprises an exon, a splice-donor site, an intron and a splice-acceptor site.
3. The DNA construct of claim 1, wherein the construct further comprises a selectable marker gene.
4. The DNA construct of claim 1, wherein the targeting sequence contains at least 50 contiguous nucleotides of SEQ ID NO: 12.
5. A DNA construct that alters expression of an endogenous IFNA2 gene in a mammalian cell upon integration into the genome of the cell via homologous recombination, the construct comprising (i) a targeting sequence containing at least 20 contiguous nucleotides of SEQ ID NO: 7, and (ii) a transcriptional regulatory sequence .
6. The DNA construct of claim 5, wherein the construct further comprises an exon, a splice-donor site, an intron and a splice-acceptor site.
7. A DNA construct that alters expression of an endogenous IFNA2 gene in a mammalian cell upon integration into the genome of the cell via homologous recombination, the construct comprising (i) a targeting sequence containing at least 20 contiguous nucleotides of SEQ ID NO: 8, and (ii) a transcriptional regulatory sequence .
8. The DNA construct of claim 7, wherein the construct further comprises an exon, a splice-donor site, an intron and a splice-acceptor site.
9. An isolated nucleic acid comprising at least 20 contiguous nucleotides of SEQ ID NO: 12 or its complement, wherein the isolated nucleic acid does not encode full-length interferon-α 2.
10. The isolated nucleic acid of claim 9, wherein the isolated nucleic acid comprises at least 50 contiguous nucleotides of SEQ ID NO: 12 or its complement.
11. The isolated nucleic acid of claim 9, wherein the isolated nucleic acid comprises at least 100 contiguous nucleotides of SEQ ID NO: 12 or its complement.
12. The isolated nucleic acid of claim 9, wherein the isolated nucleic acid comprises at least 200 contiguous nucleotides of SEQ ID NO: 12 or its complement.
13. The isolated nucleic acid of claim 9, wherein the isolated nucleic acid comprises SEQ ID NO: 7 or its complement .
14. The isolated nucleic acid of claim 9, wherein the isolated nucleic acid comprises SEQ ID NO: 8 or its complement .
15. The isolated nucleic acid of claim 9, wherein the isolated nucleic acid comprises SEQ ID NO: 12 or its complement .
16. The isolated nucleic acid of claim 9, wherein the isolated nucleic acid comprises at least 20 contiguous nucleotides of SEQ ID NO: 7 or its complement.
17. The isolated nucleic acid of claim 9, wherein the isolated nucleic acid comprises at least 20 contiguous nucleotides of SEQ' ID NO : 8 or its complement.
18. An isolated nucleic acid comprising a strand that comprises a nucleotide sequence that (i) is at least 100 nucleotides in length and (ii) hybridizes under highly stringent conditions with SEQ ID NO: 12, or the complement of SEQ ID NO: 12.
19. The isolated nucleic acid of claim 18, wherein the nucleotide sequence is at least 200 nucleotides in length.
20. The isolated nucleic acid of claim 18, wherein the nucleotide sequence is at least 400 nucleotides in length.
21. The isolated nucleic acid of claim 18, wherein the nucleotide sequence is at least 1,000 nucleotides in length.
22. An isolated nucleic acid comprising a strand that comprises a nucleotide sequence that (i) is at least 100 nucleotides in length and (ii) hybridizes under highly stringent conditions with SEQ ID NO: 7, or the complement of SEQ ID NO: 7.
23. An isolated nucleic acid comprising a strand that comprises a nucleotide sequence that (i) is at least 50 nucleotides in length and (ii) hybridizes under highly stringent conditions with SEQ ID NO: 8, or the complement of SEQ ID NO: 8.
24. An isolated nucleic acid comprising a strand that comprises a nucleotide sequence that (i) is at least 50 nucleotides in length and. (ii) shares at least 80% sequence identity with a fragment of SEQ ID NO: 12 having the same length as the nucleotide sequence.
25. The isolated nucleic acid of claim 24, wherein the nucleotide sequence is at least 100 nucleotides in length.
26. The isolated nucleic acid of claim 24, wherein the fragment is a part or all of SEQ ID NO:7.
27. The isolated nucleic acid of claim 24, wherein the fragment is a part or all of SEQ ID NO: 8.
28. A homologously recombinant mammalian cell stably transfected with the DNA construct of claim 1, the DNA construct having undergone homologous recombination with genomic DNA upstream of the ATG initiation codon of an endogenous IFNA2 coding sequence .
29. A homologously recombinant mammalian cell stably transfected with the DNA construct of claim 2, the DNA construct having undergone homologous recombination with genomic DNA upstream of the ATG initiation codon of an endogenous IFNA2 coding sequence.
30. A homologously recombinant cell stably transfected with the DNA construct of claim 3, the DNA construct having undergone homologous recombination with genomic DNA upstream of the ATG initiation codon of an endogenous IFNA2 coding sequence.
31. A homologously recombinant cell stably transfected with the DNA construct of claim 4, the DNA construct having undergone homologous recombination with genomic DNA upstream of the ATG initiation codon of an endogenous IFNA2 coding sequence.
32. A method of altering expression of an endogenous IFNA2 gene in a mammalian cell, the method comprising introducing the DNA construct of claim 1 into the cell and maintaining the cell under conditions which permit homologous recombination to occur between the construct and a target site 5' to the coding sequence of the endogenous IFNA2 gene.
33. A method of delivering IFNA2 to an animal, comprising providing the cell of claim 28; and implanting the cell in the animal, wherein the cell secretes IFNA2.
34. A method of delivering IFNA2 to an animal, comprising providing the cell of claim 29; and implanting the cell in the animal, wherein the cell secretes IFNA2.
35. A method of delivering IFNA2 to an animal, comprising providing the cell of claim 30; and implanting the cell in the animal, wherein the cell secretes IFNA2.
36. A method of delivering IFNA2 to an animal, comprising providing the cell of. claim 31; and implanting the cell in the animal, wherein the cell secretes IFNA2.
37. A method of producing IFNA2, comprising providing the cell of claim 28, and culturing the cell in vi tro under conditions which permit the cell to express and secrete IFNA2.
38. A method of producing IFNA2, comprising providing the cell of claim 29, and culturing the cell in vi tro under conditions which permit the cell to express and secrete IFNA2.
39. A method of producing IFNA2, comprising providing the cell of claim 30, and culturing the cell in vi tro under conditions which permit the cell to express and secrete IFNA2.
40. A method of producing IFNA2, comprising providing the cell of claim 31, and culturing the cell in vi tro under conditions which permit the cell to express and secrete IFNA2.
41. A DNA construct that alters expression of an endogenous IFNA2 gene in a mammalian cell upon integration into the genome of the cell via homologous recombination, the construct comprising (i) a targeting sequence containing at least 20 contiguous nucleotides of one or more of SEQ ID NOs: 16, 17, 18, and 19, and (ii) a transcriptional regulatory sequence.
42. An isolated nucleic acid comprising at least 20 contiguous nucleotides of. one or more of
SEQ ID NOs: 16, 17, 18, and 19, or the complement of one or more of SEQ ID NOs: 16, 17, 18, and 19, wherein the isolated nucleic acid does not encode full-length interferon-α 2.
43. A homologously recombinant mammalian cell stably transfected with the DNA construct of claim 39, the DNA construct having undergone homologous recombination with genomic DNA upstream of the ATG initiation codon of an endogenous IFNA2 coding sequence.
4 . A method of altering expression of an endogenous IFNA2 gene in a mammalian cell, the method comprising introducing the DNA construct of claim 39 into the cell and maintaining the cell under conditions which permit homologous recombination to occur between the construct and a target site 5' to the coding sequence of the endogenous IFNA2 gene.
45. A method of delivering IFNA2 to an animal, comprising providing the cell of claim 41; and implanting the cell in the animal, wherein the cell secretes IFNA2.
46. A method of producing IFNA2, comprising providing the cell of claim 41, and culturing the cell in vitro under conditions which permit the cell to express and secrete IFNA2.
EP99920375A 1998-05-07 1999-05-05 Genomic sequences upstream of the coding region of the ifn-alpha2 gene for protein production and delivery Withdrawn EP1075531A1 (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US8464898P 1998-05-07 1998-05-07
US84648P 1998-05-07
US8655598P 1998-05-21 1998-05-21
US86555P 1998-05-21
PCT/US1999/009925 WO1999057292A1 (en) 1998-05-07 1999-05-05 Genomic sequences upstream of the coding region of the ifn-alpha2 gene for protein production and delivery

Publications (1)

Publication Number Publication Date
EP1075531A1 true EP1075531A1 (en) 2001-02-14

Family

ID=26771248

Family Applications (1)

Application Number Title Priority Date Filing Date
EP99920375A Withdrawn EP1075531A1 (en) 1998-05-07 1999-05-05 Genomic sequences upstream of the coding region of the ifn-alpha2 gene for protein production and delivery

Country Status (14)

Country Link
EP (1) EP1075531A1 (en)
JP (1) JP2002513580A (en)
KR (1) KR20010052278A (en)
CN (1) CN1309710A (en)
AR (1) AR016264A1 (en)
AU (1) AU3788899A (en)
CA (1) CA2328459A1 (en)
CZ (1) CZ299418B6 (en)
HU (1) HUP0102517A3 (en)
IL (1) IL139433A0 (en)
NO (1) NO20005585L (en)
PL (1) PL344517A1 (en)
TR (1) TR200003271T2 (en)
WO (1) WO1999057292A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001042442A2 (en) 1999-12-10 2001-06-14 Cytos Biotechnology Ag Activation of endogenous genes by genomic introduction of a replicon
FR2817559B1 (en) * 2000-12-06 2003-12-12 Genodyssee METHOD FOR DETERMINING ONE OR MORE FUNCTIONAL POLYMORPHISM (S) IN THE NUCLEIC SEQUENCE OF A PRESELECTED FUNCTIONAL "CANDIDATE" GENE AND ITS APPLICATIONS
DK2697375T3 (en) * 2011-04-13 2018-01-15 Nat Res Council Canada EXPRESSION SYSTEM WITH A SAR (SCAFFOLD ATTACHMENT REGION) ELEMENT OF INTERFERON 2

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5641670A (en) * 1991-11-05 1997-06-24 Transkaryotic Therapies, Inc. Protein production and protein delivery

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
DATABASE EMBL [online] 6 January 1998 (1998-01-06), NICOLAEV A.I. ET AL: "Extrachromosomal maintenance and recombinant transgene rearrangements in transgenic insects and mice" *

Also Published As

Publication number Publication date
CZ299418B6 (en) 2008-07-16
CN1309710A (en) 2001-08-22
AU3788899A (en) 1999-11-23
WO1999057292A9 (en) 1999-12-16
NO20005585L (en) 2001-01-05
HUP0102517A3 (en) 2004-06-28
IL139433A0 (en) 2001-11-25
WO1999057292A1 (en) 1999-11-11
JP2002513580A (en) 2002-05-14
PL344517A1 (en) 2001-11-05
HUP0102517A2 (en) 2001-11-28
TR200003271T2 (en) 2001-03-21
NO20005585D0 (en) 2000-11-06
AR016264A1 (en) 2001-06-20
CZ20003705A3 (en) 2001-04-11
CA2328459A1 (en) 1999-11-11
KR20010052278A (en) 2001-06-25

Similar Documents

Publication Publication Date Title
EP1228237A1 (en) Expression vectors, transfection systems, and method of use thereof
JP2014036671A (en) Expression vector, transfection system, and method of use thereof
WO1999057291A1 (en) Genomic sequences upstream of the coding region of the g-csf gene for protein production and delivery
WO1999057263A9 (en) Modifying the expression of the fsh beta gene by homologous recombination
Merrill et al. Genetic and physical analysis of the chicken tk gene
CN113544257A (en) Characterization and inactivation of endogenous retroviruses in chinese hamster ovary cells
US5681942A (en) Fanconi Anemia Type C gene
EP1075531A1 (en) Genomic sequences upstream of the coding region of the ifn-alpha2 gene for protein production and delivery
EP0369458A2 (en) Bovine metallothionein regulatory region
GENES BERNARD G. FORGET

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20001109

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

DAX Request for extension of the european patent (deleted)
17Q First examination report despatched

Effective date: 20040518

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20040929

REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1034745

Country of ref document: HK