WO2002072017A2 - Method of drug target validation - Google Patents
Method of drug target validation Download PDFInfo
- Publication number
- WO2002072017A2 WO2002072017A2 PCT/US2002/007294 US0207294W WO02072017A2 WO 2002072017 A2 WO2002072017 A2 WO 2002072017A2 US 0207294 W US0207294 W US 0207294W WO 02072017 A2 WO02072017 A2 WO 02072017A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- gene
- human animal
- transgenic non
- cells
- transgenic
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/8509—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells for producing genetically modified animals, e.g. transgenic
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New breeds of animals
- A01K67/027—New breeds of vertebrates
- A01K67/0275—Genetically modified vertebrates, e.g. transgenic
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/70571—Receptors; Cell surface antigens; Cell surface determinants for neuromediators, e.g. serotonin receptor, dopamine receptor
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/05—Animals comprising random inserted nucleic acids (transgenic)
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/20—Animal model comprising regulated expression system
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/105—Murine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/03—Animal model, e.g. for test or diseases
- A01K2267/0393—Animal model comprising a reporter system for screening tests
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2799/00—Uses of viruses
- C12N2799/02—Uses of viruses as vector
- C12N2799/021—Uses of viruses as vector for the expression of a heterologous nucleic acid
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2799/00—Uses of viruses
- C12N2799/02—Uses of viruses as vector
- C12N2799/021—Uses of viruses as vector for the expression of a heterologous nucleic acid
- C12N2799/027—Uses of viruses as vector for the expression of a heterologous nucleic acid where the vector is derived from a retrovirus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/20—Pseudochromosomes, minichrosomosomes
- C12N2800/204—Pseudochromosomes, minichrosomosomes of bacterial origin, e.g. BAC
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/30—Vector systems comprising sequences for excision in presence of a recombinase, e.g. loxP or FRT
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/001—Vector systems having a special element relevant for transcription controllable enhancer/promoter combination
- C12N2830/002—Vector systems having a special element relevant for transcription controllable enhancer/promoter combination inducible enhancer/promoter combination, e.g. hypoxia, iron, transcription factor
- C12N2830/003—Vector systems having a special element relevant for transcription controllable enhancer/promoter combination inducible enhancer/promoter combination, e.g. hypoxia, iron, transcription factor tet inducible
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/15—Vector systems having a special element relevant for transcription chimeric enhancer/promoter combination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2840/00—Vectors comprising a special translation-regulating system
- C12N2840/20—Vectors comprising a special translation-regulating system translation of more than one cistron
- C12N2840/203—Vectors comprising a special translation-regulating system translation of more than one cistron having an IRES
- C12N2840/206—Vectors comprising a special translation-regulating system translation of more than one cistron having an IRES having multiple IRES
Definitions
- the present invention relates to methods for validating potential drug targets.
- the invention provides methods of screening numerous potential drug targets in one or more transgenic animal lines, but does not require the time-consuming production of a transgenic line for each potential drug target to be validated.
- a drug target is a gene, or the protein product of a gene, that is related to a particular indication, disease, or disorder, and that serves as a target for drug development.
- a potential drug target is a product of an endogenous gene, the expression of which has been observed to increase or decrease in a particular disease state.
- transgenic animal models in particular mouse models, have been developed for mammalian diseases and disorders.
- the analysis of transgenic animal models carrying genetic polymorphisms and mutations has shed light on the molecular mechanisms underlying mammalian (particularly human) diseases and disorders, leading to the identification of potential drug targets.
- One significant limitation is the amount of time (usually several months) required to produce a potential founder transgenic animal, such as a transgenic mouse, that bears a particular mutation or transgene and that exhibits a particular disease state. More time is then required to establish a stable line of transgenic individuals derived from the founder. It is only after several months of work to establish a transgenic animal line that the line is ready to be used to screen for and validate a potential drug target and/or to be tested as a model for potential therapeutic treatments. Thus there is an urgent need for methods that permit more rapid and efficient screening for potential drug targets in transgenic animal lines. Such a technology should permit the screening of numerous potential drug targets, but not require the time-consuming production of a transgenic line for each potential drug target to be validated. We describe such a technology here.
- the invention relates to a method of validating potential drug targets, i.e., a gene or protein product of a gene that is potentially related to a particular indication (e.g., a particular disease or disorder) and that potentially serves as target for drug development, for example, where the inhibition, altered expression, or increase in activity of the gene or protein product thereof treats, prevents or ameliorates the indication or symptom thereof.
- the potential drug target is the product of an endogenous gene, the expression of which has been observed to increase or decrease in a particular disease state.
- the drug validation system of the present invention allows the screening of numbers of potential drug targets using one or a collection of transgenic animal lines.
- the method does not involve the production of transgenic lines for each potential drug target to be validated but, rather, involves introduction of a potential drug target (or an inhibitor thereof) into one or more existing animal lines transgenic for a transactivator or transinhibitor that is conditionally expressed and that activates or inhibits expression of the potential drug target such that the potential drug target is either expressed or inhibited only in a particular subset of cells (i.e., expression is spatially or temporally restricted).
- the drug validation system of the invention is more flexible, convenient and efficient than other existing drug validation systems because it uses one of a limited set of transgenic animal lines not necessarily specific for the particular target, instead of requiring the production of a transgenic animal line for each target to be validated.
- the drug validation method of the invention uses one or more transgenic animal lines, preferably transgenic mouse lines, transgenic for a DNA sequence that encodes a "key protein."
- the key protein is a protein that can activate or inhibit expression of a gene under the control of an expression element that is turned off or on by the key protein (for example, but not limited to, promoters and/or enhancers whereby transcription is turned on or off by a specific transactivator; recombinase target sites for which recombination is effected by a recombinase, and recombination positions the target gene for expression or inhibition of expression).
- the expression of the key protein is regulated by regulatory sequences from a gene (herein a "characterizing gene") that is endogenously expressed in a particular subset of cells.
- the gene encoding the key protein (the “key gene") can be introduced (either by insertion or replacement) into a non-coding sequence or coding sequence of the characterizing gene (but preferably not into a regulatory sequence), (for example, by introduction of such a modified characterizing gene, i.e., a transgene, including all or a portion of the regulatory sequences into the genome of the animal), such that the expression of the key gene substantially reproduces the endogenous expression pattern of the characterizing gene.
- each transgenic line expressing a particular key gene under the control of the regulatory sequences of a characterizing gene is created by the introduction, for example by pronuclear injection, or by non-homologous recombination in embryonic stem cells that are introduced into embryos, of a vector containing the transgene into a founder animal, such that the transgene is transmitted to offspring in the line.
- the transgene preferably randomly integrates into the genome of the founder, but in specific embodiments, may be introduced by directed homologous recombination.
- the transgene is present at a location on the chromosome other than the site of the endogenous characterizing gene.
- homologous recombination in bacteria is used for target-directed insertion of the key gene sequence into a genomic DNA fragment containing all or a portion of the characterizing gene, including sufficient characterizing gene regulatory sequences to promote expression of the characterizing gene in its endogenous expression pattern.
- the characterizing gene sequences are on a bacterial artificial chromosome (BAC).
- the key gene coding sequences are inserted as a 5' fusion with the characterizing gene coding sequence such that the key gene coding sequences are inserted in frame and directly 3' from the initiation codon for the characterizing gene coding sequences.
- the key gene coding sequences are inserted into the 3' untranslated region (UTR) of the characterizing gene and, preferably, have their own internal ribosome entry sequence (IRES).
- the vector preferably a BAC comprising the key gene coding sequences and characterizing gene sequences is then introduced into the genome of a potential founder animal to generate a line of transgenic animals.
- founder animals can be screened for the selective expression of the key gene sequence in the population of cells characterized by expression of the endogenous characterizing gene.
- Transgenic animals that exhibit appropriate expression e.g., detectable expression of the key gene product having the same expression pattern within the animal as the endogenous characterizing gene are selected as founders for a line of transgenic animals.
- a "modulating construct" containing a nucleotide sequence encoding the potential drug target, or a product that specifically modulates (e.g., inhibits) the expression of the potential drug target is introduced into an appropriate transgenic animal cell line.
- the expression of the potential drug target, or modulator thereof, is regulated (either activated or inhibited) by the presence of the key protein.
- the key protein is a transcriptional activator
- the potential drug target is operably linked to a promoter activated by the key protein transcriptional activator.
- the modulating construct can contain a nucleotide sequence that is homologous to a selected endogenous gene sequence in the transgenic animal line or that is orthologously related to the endogenous gene sequence.
- the modulating construct can encode an inhibitor, including, but not limited to, a catalytic nucleic acid such as a ribozyme, inhibitory RNA (RNAi), or an inhibitor protein of the endogenous gene sequence.
- a catalytic nucleic acid such as a ribozyme, inhibitory RNA (RNAi), or an inhibitor protein of the endogenous gene sequence.
- the modulating construct is a viral vector that is used to infect a general type or population of cells (for example, the cells of a mouse in a global fashion) expressing the key protein in a select subpopulation of the general type or population of cells.
- the viral vector comprising the modulating construct is directly injected into a particular tissue region, e.g., a brain region.
- the viral vector is replication proficient; in an alternative embodiment, the viral vector is replication deficient.
- the invention provides a method of determining whether the modulation of expression of a potential target gene in a particular cell type is causally linked to a desired effect, for example, expression of the potential target causes the expression of a certain cell or tissue phenotype associated with a particular disease or disorder or with the treatment, prevention or amelioration of that disease or disorder.
- the subject methods are advantageous because they enable the validation of drug targets to proceed rapidly and efficiently, limited only by the rate at which modulating constructs can be produced, and not by the rate at which a transgenic animal line can be produced.
- a collection of transgenic animal lines expressing key proteins, for example, where the lines express the key protein in different cell populations, can be used repeatedly to validate many potential drug targets introduced via modulating constructs.
- the invention also provides non-human transgenic animals that express one or more potential drug target proteins (or inhibitors thereof) in a specific subset of cells.
- a transgenic animal of the invention comprises, for each potential drug target, a vector (or in certain embodiments a transgene) comprising a first nucleotide sequence encoding the potential drug target protein (or inhibitor thereof). The expression of each potential drug target protein (or inhibitor thereof) is under the control of a conditional expression element.
- the transgenic animal further comprises a transgene containing a key gene that encodes an inducer or suppressor of the conditional expression element.
- the key gene is operably linked to regulatory sequences of a characterizing gene corresponding to an endogenous gene or homolog of an endogenous gene such that the key gene is expressed in the transgenic animal with an expression pattern that is substantially the same as the expression pattern of the endogenous gene in a non-transgenic animal of the same species.
- the potential drug target protein(s) (or inhibitor(s) thereof) is selectively expressed in the cells expressing the key gene.
- FIGS. 1A and B A. DNA fingerprint gel showing putative co-integrate clones. Three different BAC clones containing the 5HT6 gene were used. B. Southern hybridization showing that all three clones were indeed co-integrates. Hindlll fragments containing the homology box were labeled and were duplicated in co-integrates. See . Section 6.9 for details.
- FIG. 2 Restriction mapping using DNA pulse-field gel (CHEF mapping protocol, Section 6.4) showing that one of the 5HT6-containing BAC clones (clone2) had a sufficiently large DNA fragment upstream of the 5HT6 transcription start site.
- FIGS. 3A and B A. DNA fingerprint gel showing putative resolvant clones.
- B Southern hybridization showing that two out of four clones tested were indeed resolvants; Hindlll fragments containing Emerald (GFP) were labeled; two copies of Emerald were present in co-integrate (col) and only one copy was left in the resolvants. See Section 6.9 for details.
- FIGS. 4A and B Fluorescence (A.) and light (B.) photomicrographs of a section through the cortex of a transgenic mouse expressing the 5HT6 receptor BAC. The section was immunohistochemically stained with an anti-GFP primary antibody and a fluorescently- conjugated secondary antibody. See Section 6.9 for details.
- FIG. 5. Fluorescence photomicrograph of a section of the hippocampus of a transgenic mouse expressing the 5HT6 receptor BAC. The section was immunohistochemically stained with an anti-GFP primary antibody and a fluorescently- conjugated secondary antibody. See Section 6.9 for details.
- FIG. 11 Fluorescence photomicrograph of a section of brain tissue showing that the 5HT2A transgene was indeed expressed in subsets of neurons in transgenic animals (arrows point to two fluorescent cells). See Section 6.10 for details.
- FIG. 12 A pLD53 shuttle vector designed to insert IRES-Emerald at the position specified by the A box, which is cloned into the vector using the indicated Ascl and Smal sites.
- the PCR product of the A box is cloned by digesting it with Ascl and then ligating with Ascl/Smal digested pLD53.
- FIG. 13 A pLD53 shuttle vector designed to insert Emerald at the position specified by the A box (normally, at the 5' end of the gene, such that Emerald is produced from the transcribed mRNA instead of the gene into which the insertion occurs).
- the A box is shown cloned into the vector.
- the drug validation system of the present invention allows the screening of numbers of potential drug targets using one or more transgenic animal lines.
- the drug validation system of the invention is more flexible, convenient and efficient than other existing drug validation systems because it uses one of a limited set of transgenic animal lines that is not necessarily specific for the particular drug target, instead of requiring the production of a transgenic animal line for each drug target to be validated.
- Each transgenic line is created by the introduction of a transgene into a founder animal, such that the transgene is transmitted to offspring in the line.
- Methods for producing transgenic animal lines and collections of transgenic animal lines are described in Serafmi, U.S. Patent Application Serial No. 09/783,487 entitled “Collections of Transgenic Animal Lines (Living Library)” filed February 14, 2001, and Serafmi, U.S. Patent Application Serial No. (to be assigned) (Attorney Docket No. 10239-0036-999) entitled “Collections of Transgenic Animal Lines (Living Library)” filed February 14, 2002, both of which are incorporated herein by reference in their entireties.
- a line may include transgenic animals that are derived from more than one founder animal but that contain the same transgene, preferably in the same chromosomal position and/or exhibiting the same level and pattern of expression within the animal. For example, in certain circumstances, it may be preferable to use more than one founder animal to maintain or rederive a line.
- a subset of cells of the transgenic animal that is characterized by expression of a particular endogenous gene also expresses the key gene, either constitutively or conditionally.
- the transgenic animal lines, collections of transgenic animal lines, and collections of vectors of the invention may be used for pharmacological, behavioral, physiological, electrophysiological, or drug discovery assays, for target validation, for gene expression analysis, etc.
- Each transgenic animal line of the invention contains a transgene that comprises key gene coding sequences under the control of the regulatory sequences for a characterizing gene, such that the key gene has substantially the same expression pattern as the endogenous characterizing gene.
- the expression of the key gene permits activation or inhibition of a gene comprised in the modulating construct that encodes a potential drug target.
- a transgene is a nucleotide sequence that has been or is designed to be incorporated into a cell, particularly a mammalian cell, that in turn becomes or is incorporated into a living animal such that the nucleic acid containing the nucleotide sequence is expressed (i e. , the mammalian cell is transformed with the transgene).
- the characterizing gene sequence is preferably endogenous to the transgenic animal, or is an ortholog of an endogenous gene, e.g., the human ortholog of a gene endogenous to the animal to be made transgenic.
- a transgene may be present as an extrachromosomal element in some or all of the cells of a transgenic animal or, preferably, stably integrated into some or all of the cells, more preferably into the germline DNA of the animal (i.e., such that the transgene is transmitted to all or some of the animal's progeny), thereby directing expression of an encoded gene product (i.e., the key gene product) in one or more cell types or tissues of the transgenic animal.
- an encoded gene product i.e., the key gene product
- a transgenic animal comprises stable changes to the chromosomes of germline cells.
- the transgene is present in the genome at a site other than where the endogenous characterizing gene is located.
- the transgene is incorporated into the genome of the transgenic animal at the site of the endogenous characterizing gene, for example, by homologous recombination.
- a transgenic animal is created by introducing a transgenic construct of the invention into the animal's genome using methods routine in the art, for example, the methods described in Section 5.4 and 5.5, infra, and using the vectors described in Section 5.3, infra.
- a construct is a recombinant nucleic acid, generally recombinant DNA, generated for the purpose of the expression of a specific nucleotide sequence(s), or to be used in the construction of other recombinant nucleotide sequences.
- a transgenic construct of the invention includes at least the coding region for a key gene operably linked to all or a portion of the regulatory sequences, e.g. a promoter and/or enhancer, of the characterizing gene.
- the transgenic construct optionally includes enhancer sequences and coding and other non-coding sequences (including intron and 5' and 3' untranslated sequences) from the characterizing gene such that the key gene is expressed in the same subset of cells as the characterizing gene in the same transgenic animal or in a comparable (e.g., same species, strain, gender, age, genetic background, etc. (e.g., a sibling) non-transgenic animal, i.e., an animal that is essentially the same but for the presence of the transgene).
- a comparable e.g., same species, strain, gender, age, genetic background, etc.
- the key gene coding sequences and the characterizing gene regulatory sequences are operably linked, meaning that they are connected in such a way so as to permit expression of the key gene when the appropriate molecules (e.g., transcriptional activator proteins) are bound to the characterizing gene regulatory sequences.
- the linkage is covalent, most preferably by a nucleotide bond.
- the promoter region is of sufficient length to promote transcription, as described in Alberts et al. (1989) in Molecular Biology of the Cell, 2d Ed. (Garland Publishing, Inc.).
- the regulatory sequence is the promoter of a characterizing gene.
- Other promoters that direct tissue-specific expression of the coding sequences to which they are operably linked are also contemplated in the invention.
- a promoter from one gene and other regulatory sequences (such as enhancers) from other genes are combined to achieve a particular temporal and spatial expression pattern of the key gene.
- the key gene coding sequences code for a protein that activates, enhances or suppresses the expression of a gene encoding a potential drug that is comprised in the modulating construct.
- the transgene comprises the key gene coding sequences operably linked to characterizing gene regulatory sequences.
- the modulating construct comprises sequences encoding a potential drug target operably linked to an expression control element that is activatable or suppressible by the protein product of the key gene coding sequences.
- sequences encoding the potential drug target operably linked to sequences that activate or suppress expression of the marker in the presence of the key gene protein product are present on a second transgene introduced into the transgenic animal containing the transgene with the key gene operably linked to the characterizing gene regulatory sequences, for example, but not by way of limitation, by injection of a viral vector, by random integration directly into the genome of the transgenic animal, or by breeding with a transgenic animal of the invention.
- the key gene coding sequences may be incorporated into some or all of the characterizing gene sequences such that the key gene is expressed in substantially the same expression pattern as the endogenous characterizing gene in the transgenic animal, or at least, in the anatomical region or tissue of the animal (by way of example, in the brain, spinal chord, heart, skin, bones, head, limbs, blood, muscle, peripheral nervous system, etc.) containing the population of cells to be marked by expression of the key gene coding sequences, so that tissue can be dissected from the transgenic animal, which tissue contains only cells of interest expressing the key gene coding sequences.
- substantially the same expression pattern is meant that the key gene coding sequences are expressed in at least 80%, 85%, 90%, 95%, and preferably 100% of the cells shown to express the endogenous characterizing gene by in situ hybridization. Because detection of the key gene expression product (or a marker expressed therewith) may be more sensitive than in situ hybridization detection of the endogenous characterizing gene messenger RNA, more cells may be detected to express the key gene product in the transgenic mice of the invention than are detected to express the endogenous characterizing gene by in situ hybridization or any other method known in the art for in situ detection of gene expression.
- the nucleotide sequences encoding the key gene protein product may replace the characterizing gene coding sequences in a genomic clone of the characterizing gene, leaving the characterizing gene regulatory non-coding sequences.
- the key gene coding sequences (either genomic or cDNA sequences) replace all or a portion of the characterizing gene coding sequence and the transgene only contains the upstream and downstream characterizing gene regulatory sequences.
- the key gene coding sequences are inserted into or replace transcribed coding or non-coding sequences of the genomic characterizing gene sequences, for example, into or replacing a region of an exon or of the 3' UTR of the characterizing gene genomic sequence.
- the key gene coding sequences are not inserted into or replace regulatory sequences of the genomic characterizing gene sequences.
- the key gene coding sequences are also not inserted into or replace characterizing gene intron sequences.
- the key gene coding sequence is inserted into or replaces a portion of the 3' untranslated region (UTR) of the characterizing gene genomic sequence.
- the coding sequence of the characterizing gene is mutated or disrupted to abolish characterizing gene expression from the transgene without affecting the expression of the key gene.
- the key gene coding sequence has its own internal ribosome entry site (IRES).
- IRESes see, e.g., Jackson et al, 1990, Trends Biochem Sci. 15(12):477-83; Jang et al, 1988, J. Virol. 62(8):2636-43; Jang et al, 1990, Enzyme 44(l-4):292-309; and Martinez-Salas, 1999, Curr. Opin. Biotechnol. 10(5):458-64.
- the key gene is inserted at the 3' end of the characterizing gene coding sequence.
- the key coding sequences are introduced at the 3' end of the characterizing gene coding sequence such that the transgene encodes a fusion of the characterizing gene and the key gene sequences.
- the key gene coding sequences are inserted using 5' direct fusion wherein the key gene coding sequences are inserted in-frame adjacent to the initial ATG sequence (or adjacent to the nucleotide sequence encoding the first two, three, four, five, six, seven or eight amino acids of the characterizing gene protein product) of the characterizing gene, so that translation of the inserted sequence produces a fusion protein of the first methionine (or first few amino acids) derived from the characterizing gene sequence fused to the key gene protein.
- the characterizing gene coding sequence 3' of the key gene coding sequences are not expressed.
- a key gene is inserted into a separate cistron in the 5 1 region of the characterizing gene genomic sequence and has an independent IRES sequence.
- an IRES is operably linked to the key gene coding sequence to direct translation of the key gene.
- the IRES permits the creation of polycistronic mRNAs from which several proteins can be synthesized under the control of an endogenous transcriptional regulatory sequence.
- Such a construct is advantageous because it allows marker proteins to be produced in the same cells that express the endogenous gene (Heintz, 2000, Hum. Mol. Genet. 9(6): 937-43; Heintz et al, WO 98/59060; Heintz et al, WO 01/05962; which are all incorporated herein by reference in their entireties).
- Shuttle vectors containing an IRES such as the pLD55 shuttle vector (see Heintz et al, WO 01/05962), may be used to insert the key gene sequence into the characterizing gene.
- the IRES in the pLD55 shuttle vector is derived from EMCV (encephalomyocarditis virus) (Jackson et al, 1990, Trends Biochem Sci. 15(12):477-83; and Jang et al, 1988, J. Virol. 62(8):2636-43, both of which are incorporated herein by reference in their entireties).
- the common sequence between the first and second IRES sites in the shuttle vector is shown below. This common sequence also matches pIRES (Clontech) from 1158-1710.
- more than one IRES site is present in the transgene to direct translation of more than one coding sequence.
- each IRES sequence must be a different sequence.
- the key gene coding sequence is embedded in the genomic sequence of the characterizing gene and is inactive unless acted on by a transactivator or recombinase, whereby expression of the key gene can then be driven by the characterizing gene regulatory sequences.
- a marker gene is expressed conditionally, through the activity of a key gene that is an activator or suppressor of gene expression.
- the key gene encodes a transactivator, e.g., tetR, or a recombinase, e.g., FLP, whose expression is regulated by the characterizing gene regulatory sequences.
- the marker gene is linked to a conditional element, e.g., the tet promoter, or is flanked by recombinase sites, e.g., FRT sites, and may be located anywhere within the genome.
- expression of the key gene as regulated by the characterizing gene regulatory sequences, activates the expression of the marker gene.
- exogenous translational control signals including, for example, the ATG initiation codon, can be provided by the characterizing gene or some other heterologous gene. The initiation codon must be in phase with the reading frame of the desired coding sequence of the key gene to ensure translation of the entire insert.
- exogenous translational control signals and initiation codons can be of a variety of origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer elements, transcription terminators, etc. (see Bittner et al, 1987, Methods in Enzymol. 153: 516-544).
- the transgene construct comprising the key gene can also comprise one or more genes encoding selectable markers that enable identification and/or selection of recombinant vectors.
- the selectable marker may be the key gene product itself or an additional selectable marker not necessarily tied to the expression of the characterizing gene.
- the transgene comprises all or a significant portion of the genomic characterizing gene, preferably, at least all or a significant portion of the 5' regulatory sequences of the characterizing gene, most preferably, sufficient sequence 5' of the characterizing gene coding sequence to direct expression of the key gene coding sequences in the same expression pattern (temporal and/or spatial) as the endogenous counterpart of the characterizing gene.
- the transgene comprises one exon, two exons, all but one exon, or all but two exons, of the characterizing gene. Nucleic acids comprising the characterizing gene sequences and key gene coding sequences can be obtained from any available source.
- genomic clones can be identified by probing a genomic DNA library under appropriate hybridization conditions, e.g., high stringency conditions, low stringency conditions or moderate stringency conditions, depending on the relatedness of the probe to the genomic DNA being probed.
- high stringency hybridization conditions may be used; however, if the probe and the genomic DNA are from different species, then low stringency hybridization conditions may be used. High, low and moderate stringency conditions are all well known in the art.
- Procedures for low stringency hybridization are as follows (see also Shilo and Weinberg, 1981, Proc. Natl. Acad. Sci. USA 78:6789-6792): Filters containing DNA are pretreated for 6 hours at 40°C in a solution containing 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.1% PVP, 0.1% Ficoll, 1% BSA, and 500 ⁇ g/ml denatured salmon sperm DNA.
- Hybridizations are carried out in the same solution with the following modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 ⁇ g/ml salmon sperm DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 6 cpm 32 P-labeled probe is used. Filters are incubated in hybridization mixture for 18-20 hours at 40°C, and then washed for 1.5 hours at 55°C in a solution containing 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM
- Procedures for high stringency hybridizations are as follows: Prehybridization of filters containing DNA is carried out for 8 hours to overnight at 65 °C in buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, " and 500 ⁇ g/ml denatured salmon sperm DNA. Filters are hybridized for 48 hours at 65°C in prehybridization mixture containing 100 ⁇ g/ml denatured salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe.
- Washing of filters is done at 37°C for 1 hour in a solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA. This is followed by a wash in 0.1 X SSC at 50°C for 45 minutes before autoradiography.
- Moderate stringency conditions for hybridization are as follows: Filters containing DNA are pretreated for 6 hours at 55°C in a solution containing 6X SSC, 5X Denhardt's solution, 0.5% SDS, and 100 ⁇ g/ml denatured salmon sperm DNA. Hybridizations are carried out in the same solution and 5-20 X 10 6 CPM 32 P-labeled probe is used. Filters are incubated in the hybridization mixture for 18-20 hours at 55°C, and then washed twice for 30 minutes at 60°C in a solution containing 1 X SSC and 0.1% SDS.
- the characterizing gene all or a portion of the genomic sequence is preferred, particularly, the sequences 5' of the coding sequence that contain the regulatory sequences.
- a preferred method for identifying BACs containing appropriate and sufficient characterizing gene sequences to direct the expression of the key gene coding sequences in substantially the same expression pattern as the endogenous characterizing gene is described in Section 6, infra.
- the characterizing gene genomic sequences are preferably in a vector that can accommodate significant lengths of sequence (for example, 10 kb's of sequence), such as cosmids, YACs, and, preferably, BACs, and encompass at least 50, 70, 80, 100, 120, 150, 200, 250 or 300 kb of sequence that comprises all or a portion of the characterizing gene sequence.
- sequence for example, 10 kb's of sequence
- cosmids for example, 10 kb's of sequence
- YACs preferably, BACs
- Vectors identified as containing characterizing gene sequences can then be screened for those that are most likely to contain sufficient regulatory sequences from the characterizing gene to direct expression of the key gene coding sequences in substantially the same pattern as the endogenous characterizing gene.
- the vector contains the characterizing gene sequence with the start, i.e., the most 5' end, of the coding sequence in the approximate middle of the vector insert containing the genomic sequences and/or has at least 20 kb, 30 kb, 40 kb, 50 kb, 60 kb, 80 kb or 100 kb of genomic sequence on either side of the start of the characterizing gene coding sequence.
- the clones used may be from a library that has been characterized (e.g., by sequencing and/or restriction mapping) and the clones identified can be analyzed, for example, by restriction enzyme digestion and compared to database information available for the library. In this way, the clone of interest can be identified and used to query publicly available databases for existing contigs correlated with the characterizing gene coding sequence start site. Such information can then be used to map the characterizing gene coding sequence start site within the clone.
- the key gene sequences can be targeted to the 5' end of the characterizing gene coding sequence by directed homologous recombination (for example as described in Sections 5.3 and 6) in such a way that a restriction site unique or at least rare in the characterizing gene clone sequence is introduced.
- the position of the integrated key gene coding sequences (and, thus, the 5' end of the characterizing gene coding sequence) can be mapped by restriction endonuclease digestion and mapping.
- the clone may also be mapped using internally generated fingerprint data and/or by an alternative mapping protocol based upon the presence of restriction sites and the T7 and SP6 promoters in the BAC vector, as described in Section 6, infra.
- the key gene coding sequences are to be inserted in a site in the characterizing gene sequences other than the 5' start site of the characterizing gene coding sequences, for example, in the 3'-most translated or untranslated regions.
- the clones containing the characterizing gene are preferably mapped to insure that the clone contains the site for insertion in as well as sufficient sequence 5' of the characterizing gene coding sequences library to contain the regulatory sequences necessary to direct expression of the key gene sequences in the same expression pattern as the endogenous characterizing gene.
- the key gene can be incorporated into the characterizing gene sequence by any method known in the art for manipulating DNA.
- homologous recombination in bacteria is used for target-directed insertion of the key gene sequence into the genomic DNA encoding the characterizing gene and sufficient regulatory sequences to promote expression of the characterizing gene in its endogenous expression pattern, which characterizing gene sequences have been inserted into a BAC (see Section 5.4, infra).
- the BAC comprising the key gene and characterizing gene sequences is then introduced into the genome of a potential founder animal for generating a line of transgenic animals, using methods well known in the art, e.g.
- transgenic animals are then screened for expression of the key gene coding sequences that mimics the expression of the endogenous characterizing gene.
- transgenic animals are then screened for expression of the key gene coding sequences that mimics the expression of the endogenous characterizing gene.
- Several different constructs containing transgenes of the invention may be introduced into several potential founder animals and the resulting transgenic animals then screened for the best expression (e.g. , highest level) and most accurate expression (i.e., best mimicking expression of the endogenous characterizing gene) of the key gene coding sequences.
- the transgenic construct can be used to transform a host or recipient cell or animal using well known methods, e.g., those described in Section 5.4, infra. Transformation can be either a permanent or transient genetic change, preferably a permanent genetic change, induced in a cell following incorporation of new DNA (i.e., DNA exogenous to the cell). Where the cell is a mammalian cell, a permanent genetic change is generally achieved by introduction of the DNA into the genome of the cell.
- a vector is used for stable integration of the transgenic construct into the genome of the cell. Vectors include plasmids, retroviruses and other animal viruses, BACs, YACs, and the like. Vectors are described in Section 5.3, infra.
- a characterizing gene is endogenous to a host cell or host organism (or is an ortholog of an endogenous gene) and is expressed or not expressed in a particular select population of cells of the organism.
- the population of cells comprises a discernable group of cells sharing a common characteristic. Because of its selective expression, the population of cells may be characterized or recognized based on its positive or negative expression of the characterizing gene. Accordingly, all or some of the regulatory sequences of the characterizing gene are incorporated into transgenes of the invention to regulate the expression of key gene coding sequences, as discussed above. Any gene which is not constitutively expressed, (/. e. , exhibits some spatial or temporal restriction in its expression pattern) can be a characterizing gene.
- the characterizing gene is a human or mouse gene associated with an adrenergic or noradrenergic neurotransmitter pathway, e.g., one of the genes listed in Table 1; a cholinergic neurotransmitter pathway, e.g., one of the genes listed in Table 2; a dopaminergic neurotransmitter pathway, e.g., one of the genes listed in Table 3; a GABAergic neurotransmitter pathway, e.g., one of the genes listed in Table 4; a glutaminergic neurotransmitter pathway, e.g., one of the genes listed in Table 5; a glycinergic neurotransmitter pathway, e.g., one of the genes listed in Table 6; a histaminergic neurotransmitter pathway, e.g., one of the genes listed in Table 7; a neuropeptidergic neurotransmitter pathway, e.g., one of the genes listed in Table 8; a serotonergic neurotransmitter pathway, e.
- an ion channel encoded by or associated with a characterizing gene is preferably involved in generating and modulating ion flux across the plasma membrane of neurons, including, but not limited to voltage-sensitive and/or cation- . sensitive channels, e.g., a calcium, sodium or potassium channel.
- GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences (Benson et al, 2000, Nucleic Acids Res. 28(1): 15-18).
- GenBank accession number is a unique identifier for a sequence record.
- An accession number applies to the complete record and is usually a combination of a letter(s) and numbers, such as a single letter followed by five digits (e.g., U12345), or two letters followed by six digits (e.g., AF123456). Accession numbers do not change, even if information in the record is changed at the author's request.
- An original accession number might become secondary to a newer accession number, if the authors make a new submission that combines previous sequences, or if, for some reason, a new submission supercedes an earlier record.
- UniGene (National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD; Schuler, 1997, J. Mol. Med. 75(10),694-698; Schuler et al, 1996, Science 274, 540-546; Boguski and Schuler, 1995, Nature Genetics 10, 369-371) is an experimental system for automatically partitioning GenBank sequences into a non- redundant set of gene-oriented clusters for cow, human, mouse, rat, and zebrafish.
- ESTs expressed sequence tags
- full-length mRNA sequences are organized into clusters that each represent a unique known or putative gene.
- Each UniGene cluster contains related information such as the tissue types in which the gene has been expressed and map location. Sequences are annotated with mapping and expression information and cross-referenced to other resources. Consequently, the collection may be used as a resource for gene discovery.
- the Mouse Genome Informatics (MGI) Database is sponsored by the Jackson Laboratory (Bar Harbor, Maine).
- the MGI Database contains information on mouse genetic markers, mRNA and genomic sequence information, phenotypes, comparative mapping data, experimental mapping data, and graphical displays for genetic, physical, and cytogenetic maps.
- the characterizing gene sequence is a promoter that directs tissue-specific expression of the key gene coding sequence to which it is operably linked.
- expression of the key gene coding sequences may be controlled by any tissue-specific promoter/enhancer element known in the art.
- Promoters that may be used to control expression include, but are not limited to, the following animal transcriptional control regions that exhibit tissue specificity and that have been utilized in transgenic animals: elastase I gene control region, which is active in pancreatic acinar cells . (Swift et al, 1984, Cell 38:639-646; Ornitz et al, 1986, Cold Spring Harbor Symp. Quant. Biol.
- enolase promoter which is active in brain regions, including the striatum, cerebellum, CA1 region of the hippocampus, or deep layers of cerebral neocortex (Chen et al, 1998, Molecular Pharmacology 54(3): 495-503); insulin gene control region, which is active in pancreatic beta cells (Hanahan, 1985, Nature 315:115-22); immunoglobulin gene control region, which is active in lymphoid cells (Grosschedl et al, 1984, Cell 38:647-58; Adames et al, 1985, Nature 318:533-38; Alexander et al, 1987, Mol. Cell.
- alpha 1-antitrypsin gene control region which is active in the liver (Kelsey et al, 1987, Genes and Devel. 1:161-71); ⁇ -globin gene control region, which is active in myeloid cells (Mogram et al, 1985, Nature 315:338-40; Kollias et al, 1986, Cell 46:89-
- myelin basic protein gene control region which is active in oligodendrocyte cells in the brain (Readhead et al, 1987, Cell 48:703-12); myosin light chain-2 gene control region, which is active in skeletal muscle (Sani, 1985, Nature 314:283-86); and gonadotropic releasing hormone gene control region which is active in the hypothalamus (Mason et al. ,
- the characterizing gene sequence is protein kinase C, gamma (GenBank Accession Number: Z15114 (human); MGI Database Accession Number: MGI.97597); fos (UniGene No. MM5043 (mouse)); TH-elastin; Pax7 (Mansouri, 1998, The role of Pax3 and Pax7 in development and cancer, Crit. Rev. Oncog. 9(2): 141-9); Eph receptor (Mellitzer et al. , 2000, Control of cell behaviour by signalling through Eph receptors and ephrins; Curr. Opin. Neurobiol.
- the transgenes of the invention include all or a portion of the characterizing gene genomic sequence, preferably at least all or a portion of the upstream regulatory sequences of the characterizing gene genomic sequences are present in the transgene, and at a minimum, the characterizing gene sequences that direct expression of the key gene coding sequences in substantially the same pattern as the endogenous characterizing gene in the transgenic mouse or anatomical region or tissue thereof are present on the transgene.
- genomic sequences and/or clones or other isolated nucleic acids containing the genomic sequences of the gene of interest are not available for the desired species, yet the genomic sequence of the counterpart from another species or all or a portion of the coding sequence (e.g., cDNA or EST sequences) for the same species or another species is available. It is routine in the art to obtain the genomic sequence for a gene when all or a portion of the coding sequence is known, for example, by hybridization of the cDNA or EST sequence or other probe derived therefrom to a genomic library to identify clones containing the corresponding genomic sequence.
- the identified clones may then be used to identify clones that map either 3' or 5' to the identified clones, for example, by hybridization to overlapping sequences present in the clones of a library and, by repeating the hybridization, "walking" to obtain clones containing the entire genomic sequence.
- libraries prepared with vectors that can accommodate and that contain large inserts of genomic DNA (for example, at least 25 kb, 50 kb, 100 kb, 150 kb, 200 kb, or 300 kb) such that it is likely that a clone can be identified that contains the entire genomic sequence of the characterizing gene or, at least, the upstream regulatory sequences of the characterizing gene (all or a portion of the regulatory sequences sufficient to direct expression in the same pattern as the endogenous characterizing gene).
- Cross- species hybridization may be carried out by methods routine in the art to identify a genomic sequence from all species when the genomic or cDNA sequence of the corresponding gene in another species is known.
- the characterizing gene sequences are on BAC clones from a BAC mouse genomic library, for example, but not limited to the CITB Bac Resources (ResGen, an Invitrogen Corporation, Huntsville AL) or RPCI-23 (BACPAC Resources, Children's Hospital Oakland Research Institute, Oakland, California) libraries, or any other BAC library.
- the subset of cells of the transgenic animal that express the key gene also expresses an additional "marker gene” that encodes a detectable or selectable marker, or expresses a protein product that specifically induces or suppresses the expression of the detectable or selectable marker.
- the transgene also contains a nucleotide sequence encoding a detectable or selectable marker also operatively linked to the characterizing gene sequences or activated by the key gene protein product such that the marker gene is expressed in the same cells as the key gene.
- the invention provides collections of transgenic animal lines for use in the drug validation methods of the invention.
- a collection of such transgenic animal lines comprises at least two individual lines, more preferably at least three individual lines, and most preferably, at least five individual lines.
- a collection of transgenic animal lines comprises at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 75, 100, 200, 500, 1000, or 2000 individual lines.
- a collection of transgenic animal lines comprises between 2 to 10, 10 to 20, 10 to 50, 10 to 100, 100 to 500, 100 to 1000, or 100 to 2000 individual lines.
- each line of transgenic animals has a different characterizing gene and may or may not have different key gene coding sequences.
- each transgenic animal line of a collection of the invention has the same key gene coding sequences and in other embodiments, each transgenic animal line has a different key gene coding sequence.
- the invention provides a collection of vectors for producing transgenic animal lines of the invention comprising at least two vectors, more preferably at least three vectors, and most preferably, at least five vectors.
- a collection of vectors comprises at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 75, 100, 200, 500, 1000, or 2000 vectors.
- a collection of vectors comprises between 2 to 10, 10 to 20, 10 to 50, 10 to 100, 100 to 500, 100 to 1000, or 100 to 2000 individual vectors.
- the characterizing gene for each vector is different and each vector may or may not have different key gene coding sequences.
- each vector has the same key gene coding sequences and in other embodiments, each vector has a different key gene coding sequence.
- each individual line or vector is selected for the collection of transgenic animals lines and/or vectors based on the identity of the subset of cells in which the key gene is expressed.
- the characterizing genes for the lines of transgenic animals in such a collection consist of (or comprise), for example but not by way of limitation, a group of functionally related genes (i.e., genes encoding proteins that serve analogous functions in the cells in which they are expressed such as proteins that function in the cell as biosynthetic and/or degradative enzymes for a cellular component, transporters, intracellular or extracellular receptors, and signal transduction molecules), a group of genes . in the same signal transduction pathway, or a group of genes implicated in a particular physiological or disease state, or expressed in the same or related tissue types.
- a group of functionally related genes i.e., genes encoding proteins that serve analogous functions in the cells in which they are expressed such as proteins that function in the cell as biosynthetic and/or degradative enzymes for a cellular component
- the collection may consist of lines of transgenic animals in which the characterizing genes represent a battery of genes having a variety of cell functions, are expressed in a variety of tissue or cell types (e.g., different neuronal cell types, different immune system cell types, different tumor cell types, etc.), or are implicated in a variety of physiological or disease states (in particular, related disease states such as a group of different neurodegenerative diseases, cancers, autoimmune diseases or disorders of immune system function, heart diseases, etc.).
- the collection may also consist of lines of transgenic animals in which the characterizing genes represent a battery of genes expressed in particular neuronal cell types and circuits that control particular behaviors and underlie specific neurological or psychiatric diseases.
- the characterizing genes of the collection are a group of functionally related genes that encode the cellular components associated with a particular neurotransmitter signaling and/or synthetic pathway or with a particular signal transduction pathway, or the proteins that serve analogous functions in the cells in which they are expressed such as proteins that function in the cell as biosynthetic and/or degradative enzymes for a cellular component, transporters, intracellular or extracellular receptors, signal transduction molecules, transcriptional or translational regulators, cell cycle regulators, etc.
- the group of functionally related genes that are characterizing genes can be associated with or implicated in known neuronal circuitry or in a particular physiological, behavioral or disease state. Such states or responses include pain, sleeping, feeding, fasting, sexual behavior, aggression, depression, cognition, emotion, etc.
- the characterizing genes can represent a battery of genes having a variety of cell functions, are expressed in a variety of tissue or cell types (e.g., different neuronal cell types, different immune system cell types, different tumor cell types, etc.), or are implicated in a variety of physiological or disease states.
- a group of characterizing genes is a group of functionally related genes that encode a neurotransmitter, its receptors, and associated biosynthetic and/or degradative enzymes for the neurotransmitter.
- the characterizing genes are groups of genes that are expressed in cells of the same or different neurotransmitter phenotypes, in cells known to be anatomically or physiologically connected, cells underlying a particular behavior, cells in a particular anatomical locus (e.g., the dorsal root ganglia, a motor pathway), cells active or quiescent in a particular physiological state, cells affected or spared in a particular disease state, etc.
- the characterizing genes are groups of genes that are : expressed in cells underlying a neuropsychiatric disorder such as a disorder of thought and/or mood, including thought disorders such as schizophrenia, schizotypal personality disorder; psychosis; mood disorders, such as schizoaffective disorders (e.g., schizoaffective disorder manic type (SAD-M); bipolar affective (mood) disorders, such as severe bipolar affective (mood) disorder (BP-I), bipolar affective (mood) disorder with hypomania and major depression (BP-II); unipolar affective disorders, such as unipolar major depressive disorder (MDD), dysthymic disorder; obsessive-compulsive disorders; phobias, e.g., agoraphobia; panic disorders; generalized anxiety disorders; somatization disorders and hypochondriasis; and attention deficit disorders.
- a neuropsychiatric disorder such as a disorder of thought and/or mood, including thought disorders such as schizophrenia,
- the characterizing genes are groups of genes that are expressed in cells underlying a malignancy, cancer or hyperproliferation disorder, including but not limited to the following: Leukemias such as but not limited to, acute leukemia, acute lymphocytic leukemia, acute niyelocytic leukemias such as myeloblastic, promyelocytic, myelomonocytic, monocytic, erythroleukemia leukemias and myelodysplastic syndrome, chronic leukemias such as but not limited to, chronic myelocytic (granulocytic) leukemia, chronic lymphocytic leukemia, hairy cell leukemia; polycythemia vera; lymphomas such as but not limited to Hodgkin's disease, non-Hodgkin's disease; multiple myelomas such as but not limited to smoldering multiple myeloma, nonsecretory myeloma, osteosclerotic myel
- adenocarcinoma mucoepidermoid carcinoma, and adenoidcystic carcinoma
- pharynx cancers such as but not limited to squamous cell cancer, and verrucous
- skin cancers such as but not limited to, basal cell carcinoma, squamous cell carcinoma and melanoma, superficial spreading melanoma, nodular melanoma, lentigo malignant melanoma, acral lentiginous melanoma
- kidney cancers such as but not limited to renal cell cancer, adenocarcinoma, hypernephroma, fibrosarcoma, transitional cell cancer (renal, pelvic and/ or ureter); Wilms' tumor
- bladder cancers such as but not limited to transitional cell carcinoma, squamous cell cancer, adenocarcinoma, carcinosarcoma; and cancers including myxosarcoma, osteogenic sarcoma, endothelios
- the characterizing genes are groups of genes that are expressed in cells underlying a malignancy, cancer or hyperproliferation disorder, including but not limited to the following: carcinoma, including that of the bladder, breast, colon, kidney, liver, lung, ovary, pancreas, stomach, cervix, thyroid and skin; including squamous cell carcinoma; hematopoietic tumors of lymphoid lineage, including leukemia, acute lymphocytic leukemia, acute lymphoblastic leukemia, B-cell lymphoma, T-cell lymphoma, Burkitt's lymphoma; hematopoietic tumors of myeloid lineage, including acute and chronic .
- carcinoma including that of the bladder, breast, colon, kidney, liver, lung, ovary, pancreas, stomach, cervix, thyroid and skin
- hematopoietic tumors of lymphoid lineage including leukemia, acute lymphocytic leukemia, acute lymphoblastic leukemia, B
- tumors of mesenchymal origin including fibrosarcoma and rhabdomyosarcoma; other tumors, including melanoma, seminoma, teratocarcinoma, neuroblastoma and glioma; tumors of the central and peripheral nervous system, including astrocytoma, neuroblastoma, glioma, and schwannomas; tumors of mesenchymal origin, including fibrosarcoma, rhabdomyosarcoma, and osteosarcoma; and other tumors, including melanoma, xenoderma pigmentosum, keratoactanthoma, seminoma, thyroid follicular cancer and teratocarcinoma; cancers caused by aberrations in apoptosis, including but not limited to follicular lymphomas, carcinomas with p53 mutations, hormone dependent tumors of the breast, prostate and
- the characterizing genes of the collection are all expressed in the same population of cells, e.g., motorneurons of the spinal cord, amacrine cells, astroglia, etc.
- the characterizing genes of the collection are expressed in different populations of cells.
- the characterizing genes of the collection are all expressed within a particular anatomical region, tissue, or organ of the body, e.g., nucleus within the brain or spinal cord, cerebral cortex, cerebellum, retina, spinal cord, bone marrow, skeletal muscles, smooth muscles, pancreas, thymus, etc.
- the characterizing genes of the collection are each expressed in a different anatomical region, tissue, or organ of the body. In another embodiment, the characterizing genes of the collection are all listed in only one of Tables 1-15, above.
- the characterizing genes of the collection are a group of genes where at least two, three, four, five, eight, ten or twelve genes are each from a different one of Tables 1-15, above. In another embodiment, at least one characterizing gene in the collection is listed in one of Tables 1-15, above.
- the characterizing genes of the collection comprise at least one gene from each of one, two, three, four or more of Tables 1-15, above.
- the characterizing genes of the collection are all expressed temporally in a particular expression pattern during an organism's development.
- the characterizing genes of the collection are all expressed during the display of a temporally rhythmic behavior, such as a circadian behavior, a monthly behavior, an annual behavior, a seasonal behavior, an estrous or other mating behavior, or other periodic or episodic behavior.
- the characterizing genes of the collection are all expressed in cells of the nervous system that underlie feeding behavior.
- the characterizing genes of the collection are all expressed in neuronal circuits that function as positive and negative regulators of feeding behavior and, preferably, that are located in the hypothalamus.
- the invention provides vectors and lines of transgenic animals in which the characterizing gene is one of the genes listed in any of Tables 1-15, above.
- the invention provides lines of transgenic animals, wherein each transgenic animal contains two, four, five, six, seven, eight, ten, twelve, fifteen, twenty or more transgenes of the invention (i.e., containing key gene coding sequences operably linked to characterizing gene regulatory sequences). Each of the transgenes has a different characterizing gene. In a specific embodiment, all of the transgenes in the line of transgenic animals contain the same key gene coding sequences. In another embodiment, the transgenes in the line of transgenic animals have different key gene coding sequences (i.e., cells expressing differing characterizing genes express different key genes).
- Such lines of transgenic animals may be generated by introducing a transgene into an animal that is already transgenic for a transgene of the invention or by breeding two animals transgenic for a transgene of the invention. Once a line of transgenic animals containing two transgenes of the invention is established, additional transgenes can be introduced into that line, for example, by pronuclear injection or by breeding, to generate a line of transgenic animals transgenic for three transgenes of the invention, and so on.
- a “key gene” encodes a key protein.
- a key protein is a protein that can activate or inhibit expression of a gene in another gene construct, which gene is under the control of an expression element that is turned off or on by the key protein (for example, but not limited to, promoters and/or enhancers whereby transcription is turned on or off by a specific transactivator; recombinase target sites for which recombination is effected by a recombinase and recombination positions the target gene for expression or inhibition of expression).
- the key protein specifically activates or represses expression of a gene in the modulating construct.
- the gene activated or repressed by the key gene protein product encodes a potential drug target.
- the key gene encodes an RNA product that is an inhibitor such as a catalytic nucleic acid (e.g., a ribozyme or deoxyribozyme), an antisense RNA or double-stranded RNA that causes RNA interference (RNAi).
- a catalytic nucleic acid e.g., a ribozyme or deoxyribozyme
- RNAi RNA interference
- the key gene product (and in certain embodiments, additionally, a marker gene turned on or repressed by the characterizing or key gene product) is not present in any cells of the animal (or ancestor thereof) prior to its being made transgenic.
- the key gene product (and, in certain embodiments, a marker turned on or repressed by the characterizing or key gene product) is not present in a tissue in the animal (or ancestor thereof) prior to its being made transgenic, which tissue contains the subpopulation of cells to be isolated by virtue of the expression of the key gene coding sequences in the subpopulation and which can be cleanly dissected from any other tissues that may express the key gene product (and/or marker) in the animal (or ancestor thereof) prior to its being made transgenic.
- the key gene product (and/or a marker turned on or repressed by the characterizing or key gene product) is expressed in the animal or in tissues neighboring and/or containing the subpopulation of cells to be isolated prior to the animal (or ancestor thereof) being made transgenic but is expressed at much lower levels, e.g., 2- fold, 5-fold, 10-fold, 50-fold, 100-fold, 200-fold, 500-fold, 1000-fold lower levels, than the key gene product (or marker transactivated thereby), i.e., than expression driven by the transgene.
- the key gene coding sequences encode a fusion protein comprising or consisting of all or a portion of the key gene product that confers transcriptional activation or suppression properties on the fusion protein.
- a key gene polypeptide, fragment, analog, or derivative may be expressed as a chimeric, or fusion, protein product (comprising a key gene encoded peptide joined at its amino- or carboxy-terminus via a peptide bond to an amino acid sequence of a different protein). Sequences encoding such a chimeric product can be made by ligating the appropriate nucleotide sequences encoding the desired amino acid sequences to each other by methods known in the art, in the proper coding frame, and expressing the chimeric product as part of the transgene as discussed herein.
- the chimeric gene comprises or consists of all or a portion of the characterizing gene coding sequence fused in frame to a key gene coding sequence.
- the key gene coding sequences can be present at a low gene dose, such as one copy of the key gene per cell. In other embodiments, at least two, three, four, five, seven, ten or more copies of the key gene coding sequences are present per cell, e.g., multiple copies of the key gene coding sequences are present in the same transgene or are present in one copy in the transgene and more than one transgene is present in the cell. In a specific embodiment in which BACs are used to generate and introduce the transgene into the animal, the gene dosage is one copy of the key gene per BAC and at least two, three, four, five, seven, ten or more copies of the BAC per cell.
- More then one copy of the key gene coding sequences may be preferable, in some instances, to achieve levels of the key gene protein product capable of activating or suppressing target expression from the modulating construct.
- coding sequences other than the key gene coding sequences for example, the characterizing gene coding sequence, if present, and/or any other protein coding sequences (for example, from other genes proximal to the characterizing gene in the genomic DNA) are inactivated to avoid over- or mis-expression of these other gene products.
- the key gene is expressed selectively in neural cells.
- the key gene encodes a transactivator, preferably a transcription factor that specifically activates or inhibits transcription, preferably by binding to a specific nucleotide sequence (which sequence maybe operably linked to the target gene sequence). Any transactivator (or transinhibitor) paired with its corresponding promoter or enhancer element may be used.
- the transactivator and transcriptional expression element are heterologous to the transgenic animal, such that the transactivator only activates expression (or a particularly level of expression, in certain embodiments) of the target gene, but are compatible with the transgenic animal.
- the transactivator is a viral, bacterial or yeast transcription factor, for example, but not limited, Lac operator, VP16, gal 4, etc.
- the key gene encodes a component of a conditional transcriptional regulation system.
- a gene encoding a potential drug target may be expressed conditionally by operably linking all or a portion of the regulatory sequences from the characterizing gene to at least the coding region for the key gene, wherein the key gene encodes a conditional regulatory element which in turn induces or represses the expression of the gene encoding the potential drug target.
- Transactivators in these inducible or repressible transcriptional regulation systems are designed to interact specifically with sequences engineered into a vector.
- Such systems include those regulated by tetracycline ("tet systems"), interferon, estrogen, ecdysone, Lac operator, progesterone antagonist RU486, and rapamycin (FK506) with tet systems being particularly preferred (see, e.g., Gingrich and Roder, 1998, Annu. Rev. Neurosci. 21 : 377- 405; incorporated herein by reference in its entirety).
- tet systems tetracycline
- interferon regulated by tetracycline
- estrogen ecdysone
- Lac operator ecdysone
- FK506 rapamycin
- These drugs or hormones act on modular transactivators composed of natural or mutant ligand-binding domains and intrinsic or extrinsic DNA
- the inducible or repressible genetic system can restrict the expression of the potential drug target either temporally, spatially, or both temporally and spatially.
- control elements of the tetracycline-resistance operon of E. coli is used as an inducible or repressible transactivator or transcriptional regulation system ("tet system") for conditional expression of the potential drug target.
- tet system transcriptional regulation system
- a tetracycline-controlled transactivator can require either the presence or absence of the antibiotic tetracycline, or one of its derivatives, e.g., doxycycline (dox), for binding to the tet operator of the tet system, and thus for the activation of the tet system promoter (Ptet).
- dox doxycycline
- Such an inducible or repressible tet system is preferably used in a mammalian cell.
- a tetracycline-repressed regulatable system (TrRS) is used (Agha-Mohammadi and Lotze, 2000, J. Clin. Invest. 105(9): 1177-83; incorporated herein by reference in its entirety).
- This system exploits the specificity of the tet repressor (tetR) for the tet operator sequence (tetO), the sensitivity of tetR to tetracycline, and the activity of the potent herpes simplex virus transactivator (VP16) in eukaryotic cells.
- tetR tet repressor
- tetO tet operator sequence
- VP16 potent herpes simplex virus transactivator
- the TrRS uses a conditionally active chimeric tetracycline-repressed transactivator (tTA) created by fusing the COOH-terminal 127 amino acids of vision protein 16 (VP16) to the COOH terminus of the tetR protein (which may be the key gene).
- tTA conditionally active chimeric tetracycline-repressed transactivator
- VP16 COOH-terminal 127 amino acids of vision protein 16
- tetR protein which may be the key gene.
- tetR moiety of tTA binds with high affinity and specificity to a tetracycline-regulated promoter (tRP), a regulatory region comprising seven repeats of tetO placed upstream of a minimal human cytomegalovirus (CMV) promoter or ⁇ -actin promoter ( ⁇ -actin is preferable for neural expression).
- CMV minimal human cytomegalovirus
- ⁇ -actin is preferable for neural expression
- the VP16 moiety of tTA transactivates the gene encoding the potential drug target by promoting assembly of a transcriptional initiation complex.
- binding of tetracycline to tetR leads to a conformational change in tetR accompanied with loss of tetR affinity for tetO, allowing expression of the potential drug target gene to be silenced by administering tetracycline.
- Activity can be regulated over a range of orders of magnitude in response to tetracycline.
- a tetracycline-induced regulatable system is used to regulate expression of a potential drug target, e.g., the tetracycline transactivator (tTA) element of Gossen and Bujard (1992, Proc. Natl. Acad. Sci. USA 89: 5547-51; incorporated herein by reference in its entirety).
- a potential drug target e.g., the tetracycline transactivator (tTA) element of Gossen and Bujard (1992, Proc. Natl. Acad. Sci. USA 89: 5547-51; incorporated herein by reference in its entirety).
- the improved tTA system of Shockett et al. (1995, Proc. Natl. Acad. Sci. USA 92: 6522-26, incorporated herein by reference in its entirety) is used to drive expression of a potential drug target.
- This improved tTA system places the tTA gene under control of the inducible promoter to which tTA binds, making expression of tTA itself inducible and autoregulatory.
- a reverse tetracycline-controlled transactivator e.g., rtTA2 S-M2
- rtTA2 S-M2 transactivator has reduced basal activity in the absence doxycycline, increased stability in eukaryotic cells, and increased doxycycline sensitivity (Urlinger et al, 2000, Proc. Natl. Acad. Sci. USA 97(14): 7963-68; incorporated herein by . reference in its entirety).
- the tet-repressible system described by Wells et al. (1999, Transgenic Res. 8(5): 371-81; incorporated herein by reference in its entirety) is used.
- a single plasmid Tet-repressible system is used.
- a "mammalianized" TetR gene, rather than a wild-type TetR gene (tetR) is used (Wells et al, 1999, Transgenic Res. 8(5): 371-81).
- the GAL4-UAS system (Ornitz et al, 1991, Proc. Natl. Acad. Sci. USA 88:698-702; Rowitch et al, 1999, J. Neuroscience 19(20):8954-8965; Wang et al, 1999, Proc. Natl. Acad. Sci. USA 96:8483-8488; Lewandoski, 2001, Nature Reviews (Genetics) 2:743-755) is used.
- the key gene encodes a GAL4-VP16 fusion protein (Wang et al, 1999, Proc. Natl. Acad. Sci. USA 96:8483-8488) , and the expression of a GAL4-VP16 fusion protein is driven by characterizing gene sequences.
- This fusion protein contains the DNA binding domain of GAL4 fused to the transcription activation domain of VP-16.
- Animals expressing the GAL4-VP16 fusion protein in a specific population of cells are crossed to a transgenic line of mice that contains a modulating construct containing a potential drug target, wherein the potential drug target is under the control of multiple tandem copies of GAL4 UAS.
- conditional expression of a gene encoding a potential drug target is regulated by using a recombinase system that is used to turn on or off the gene's expression by recombination in the appropriate region of the genome in which the potential drug target gene is inserted.
- the gene encoding a potential drug target is flanked by recombinase sites, e.g., FRT sites.
- a recombinase system in which the key gene encodes the recombinase
- can be used to turn on or off expression of a potential drug target for review of temporal genetic switches and "tissue scissors" using recombinases, see Hennighausen & Furth, 1999, Nature Biotechnol.
- Exclusive recombination in a selected cell type may be mediated by use of a site-specific recombinase such as Cre, FLP- wild type (wt), FLP-L or FLPe.
- a site-specific recombinase such as Cre, FLP- wild type (wt), FLP-L or FLPe.
- the target to be validated is under the regulatory control of an inactive promoter that is activated by site-
- the promoter may be tissue specific or a constitutively active promoter.
- the ⁇ -actin promoter is preferred for constitutive expression in neural tissue.
- Recombination may be effected by any art-known method, e.g., the method of Doetschman et al. (1987, Nature 330: 576-78; incorporated herein by reference in its entirety); the method of Thomas et al, (1986, Cell 44: 419-28; incorporated herein by
- Cre-loxP recombination system (Sternberg and Hamilton, 1981, J. Mol. Biol. 150: 467-86; Lakso et al, 1992, Proc. Natl. Acad. Sci. USA 89: 6232- 36; which are both incorporated herein by reference in their entireties); the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman etal, 1991, Science 251: 1351-55); the Cre-loxP-tetracycline control switch (Gossen and Bujard, 1992, Proc. Natl.
- the recombinase is highly active, e.g., the Cre-loxP or the FLPe system, and has enhanced thermostability (Rodriguez et al, 2000, Nature Genetics 25: 139-40; incorporated herein by reference in its entirety).
- conditional expression element is composed of target sites for recombination positioned such that in the presence of an appropriate recombinase, . the orientation of the target is reversed, thereby operably linking the first nucleotide sequence to a promoter such that the potential drug target sequence is expressed, wherein the key gene encodes the appropriate recombinase.
- a recombinase system can be linked to a second inducible or repressible transcriptional regulation system.
- a cell-specific Cre-loxP mediated recombination system (Gossen and Bujard, 1992, Proc. Natl. Acad. Sci. USA 89: 5547-51) can be linked to a cell-specific tetracycline-dependent time switch detailed above (Ewald et al, 1996, Science 273: 1384-1386; Furth et al Proc. Natl. Acad. Sci. U.S.A. 91:
- an altered cre gene with enhanced expression in mammalian cells is used (Gorski and Jones, 1999, Nucleic Acids Research 27(9): 2059-61; incorporated herein by reference in its entirety).
- the ligand-regulated recombinase system of Kellendonk et al (1999, J. Mol. Biol. 285: 175-82; incorporated herein by reference in its entirety) can be used.
- the ligand-binding domain (LBD) of a receptor e.g., the progesterone or estrogen receptor
- the expression key gene is also conditionally expressed.
- the key gene sequence encodes a catalytic nucleic acid.
- a 10 gene encoding the potential drug target may be expressed conditionally by operably linking all or a portion of the regulatory sequences from the characterizing gene to at least the coding region for the key gene, wherein the key gene encodes a catalytic nucleic acid, e.g., a ribozyme or deoxyribozyme, which in turn induces or represses the expression of the gene encoding a potential drug target.
- a catalytic nucleic acid e.g., a ribozyme or deoxyribozyme
- a transgene encoding a RNA-cleaving RNA enzyme or ribozyme operably linked to a characterizing gene regulatory sequence is introduced.
- the ribozyme is a "hammerhead” or a "hairpin” ribozyme, and is used to induce specific RNA cleavage from a small catalytic domain.
- the catalytic nucleic acid is a DNA enzyme or deoxyribozyme (Sun et al, 2000, Pharmacol. Rev. 52: 325-47; incorporated herein by reference in its entirety).
- the deoxyribozyme is used to induce specific RNA or DNA 5 cleavage and to induce or suppress the expression of the potential drug target gene.
- a catalytic nucleic acid can be designed to cleave a specific target RNA, e.g., an RNA encoding a potential drug target (for methods of catalytic nucleic acid design, including ribozyme design, see, Welch et al, 1998, Curr. Opin. Biotechnol. 9: 486-96; Sun et al. 2000, Pharmacol. Rev. 52: 325-47; both of which are incorporated herein by reference 0 in their entireties) .
- a catalytic nucleic acid is used to induce or suppress the expression of the potential drug target such that the potential drug target is expressed in cells other than those expressing the characterizing gene.
- Localization of the catalytic nucleic acid product of a key gene is controlled by the 5 regulatory sequences of the characterizing gene, e.g., a promoter (Welch et al, 1998, Curr. Opin. Biotechnol. 9: 486-96).
- the U6 promoter is used to confer nuclear localization.
- tRNA-driven ribozyme expression is directed towards the nucleus or the cytoplasm depending on whether the tRNA-ribozyme transcript is spliced.
- the adenovirus VA1 promoter targets ribozyme transcript specifically to the cytoplasm.
- cytoplasmic transcription and localization of ribozymes is achieved using, e.g., the Semliki Forest virus 26S RNA- dependent RNA promoter/viral replicase system or the bacteriophage T7 RNA polymerase/promoter system.
- catalytic nucleic acids For use of catalytic nucleic acids in vivo, the catalytic nucleic acids must be fully functional in the intracellular environment. Not all catalytic nucleic acids (e.g., ribozymes) . selected in vitro are expected to work in vivo, whereas catalytic nucleic acids selected in the intracellular environment should retain their function in vivo.
- the key gene encodes an antisense RNA that is antisense to a sequence that encodes a potential drug target.
- a potential drug target is expressed conditionally by operably linking all or a portion of the regulatory sequences from the characterizing gene to at least the coding region for the key gene, wherein the key gene encodes an antisense RNA that suppresses the expression of the gene encoding the potential drug target (see, e.g., Gudkov et al, 1994, Proc. Natl. Acad. Sci.
- the key gene encodes a sequence that produces RNA interference (RNAi).
- RNAi RNA interference
- a potential drug target may be expressed conditionally by operably linking all or a portion of the regulatory sequences from the characterizing gene to at least the coding region for the key gene, wherein the key gene encodes a sequence that produces RNAi, which in turn induces or represses the expression of the gene encoding the potential drug target.
- RNA interference is defined as the ability of double-stranded RNA (dsRNA) to suppress the expression of a gene corresponding to its own sequence. RNAi is also called post-transcriptional gene silencing or PTGS. Since the only RNA molecules normally found in the cytoplasm of a cell are molecules of single-stranded mRNA, the cell has enzymes that recognize and cut dsRNA into fragments containing 21-25 base pairs (approximately two turns of a double helix). The antisense strand of the fragment separates enough from the sense strand so that it hybridizes with the complementary sense sequence on a molecule of endogenous cellular mRNA.
- This hybridization triggers cutting of the mRNA in the double-stranded region, thus destroying its ability to be translated into a polypeptide.
- Introducing dsRNA corresponding to a particular gene thus knocks out the cell's own expression of that gene in particular tissues and/or at a chosen time.
- Double-stranded (ds) RNA can be used to interfere with gene expression in mammals (Wianny & Zernicka-Goetz, 2000, Nature Cell Biology 2: 70-75; incorporated herein by reference in its entirety).
- dsRNA is used as inhibitory RNA (RNAi) of the function of a potential drug target gene to produce a phenotype that is the same as that of a null mutant of the potential drug target gene (Wianny & Zernicka-Goetz, 2000, Nature Cell Biology 2: 70-75).
- RNAi inhibitory RNA
- the transgene construct comprising the characterizing gene and key gene sequences also comprises one or more sequences encoding selectable markers that, once the transgene is introduced into a vector, enables identification and/or selection of the recombinant vector.
- the selectable marker may be the key gene product itself or an additional selectable marker not necessarily tied to the expression of the characterizing gene.
- the additional detectable or selectable marker can encode such proteins as a signal- producing protein, epitope, fluorescent or enzymatic marker, or inhibitor of cellular function or, in specific embodiments, encodes a protein product that specifically activates or represses expression of a detectable or selectable marker.
- the marker sequences may code for any protein that allows cells expressing that protein to be detected or selected (or O 02/072017
- the marker gene product (and in certain embodiments, a marker turned on or repressed by the characterizing or key gene) is not present in any cells of the animal (or ancestor thereof) prior to its being made transgenic; in other embodiments, the marker gene product (and, in certain embodiments, a marker turned on or repressed by the characterizing or key gene product) is not present in a tissue in the animal (or ancestor thereof) prior to its being made transgenic, which tissue contains the subpopulation of cells to be isolated by virtue of the expression of the marker gene coding sequences in the subpopulation and which can be cleanly dissected from any other tissues that may express the marker gene product in the animal (or ancestor thereof) prior to its being made transgenic.
- the marker gene product is expressed in the animal or in tissues neighboring and/or containing the subpopulation of cells to be isolated prior to the animal (or ancestor thereof) being made transgenic but is expressed at much lower levels, e.g., 2-fold, 5-fold, 10-fold, 50-fold, 100-fold, 200-fold, 500-fold, 1000-fold lower levels, than the key gene product, i.e., than expression driven by the transgene.
- the marker coding sequences encode a fusion protein comprising or consisting of all or a portion of the key gene product that confer the detectable or selectable property on the fusion protein, for example, where the marker sequence encodes an epitope that is not detected elsewhere in the transgenic animal or that is not detected in or neighboring the tissue that contains the subpopulation of cells to be isolated.
- the detectable or selectable marker is expressed everywhere in the transgenic animal except where the key gene is expressed, for example, where the key gene codes for a repressor that represses the expression of the detectable or selectable marker which is otherwise constitutively expressed (e.g., is under the regulatory control of the ⁇ -actin promoter (preferred for neural tissue) or CMV promoter).
- expression of the marker gene coding sequences in a subpopulation of cells of the transgenic animal permits detection, isolation and/or selection of the subpopulation.
- the marker gene encodes a marker enzyme, such as lacZ or ⁇ -lactamase, or a reporter or signal-producing protein such as luciferase or GFP.
- the marker gene encodes a protein-containing epitope not normally detected in the tissue of interest by immunohistological techniques.
- the marker gene could encode CD4 (a protein normally expressed in the immune system) and be expressed and detected in non-immune cells.
- the marker gene encodes a tract-tracing protein such as a lectin (e.g., wheat germ agglutinin (WGA)).
- a tract-tracing protein such as a lectin (e.g., wheat germ agglutinin (WGA)).
- the marker gene encodes a toxin.
- a marker gene polypeptide, fragment, analog, or derivative may be expressed as a chimeric, or fusion, protein product (comprising a marker gene encoded peptide joined at its amino- or carboxy-terminus via a peptide bond to an amino acid sequence of a different protein). Sequences encoding such a chimeric product can be made by ligating the appropriate nucleotide sequences encoding the desired amino acid sequences to each other by methods known in the art, in the proper coding frame, and expressing the chimeric product as part of the transgene as discussed herein.
- the chimeric gene comprises or consists of all or a portion of the characterizing gene and/or the key gene coding sequence fused in frame to an epitope tag.
- the marker gene coding sequences can be present at a low gene dose, such as one copy of the marker gene per cell. In other embodiments, at least two, three, four, five, seven, ten or more copies of the marker gene coding sequences are present per cell, e.g. , multiple copies of the marker gene coding sequences are present in the same transgene or are present in one copy in the transgene and more than one transgene is present in the cell. In a specific embodiment in which BACs are used to generate and introduce the transgene into the animal, the gene dosage is one copy of the marker gene per BAC and at least two, three, four, five, seven, ten or more copies of the BAC per cell.
- More then one copy of the marker gene coding sequences may be preferable, in some instances, to achieve detectable or selectable levels of the marker gene.
- coding sequences other than the marker gene coding sequences for example, the characterizing gene coding sequence, if present, and/or any other protein coding sequences (for example, from other genes proximal to the characterizing gene in the genomic DNA) are inactivated to avoid over- or mis-expression of these other gene products.
- a gene that encodes a marker enzyme is preferably selected for use as a marker gene.
- the marker enzyme is selected so that it produces a detectable signal when a particular chemical reaction is conducted.
- Such enzymatic markers are advantageous, particularly when used in vivo, because detection of enzymatic expression is highly accurate and sensitive.
- a marker enzyme is selected that can be used in vivo, without the need to kill and/or fix cells in order to detect the marker or enzymatic activity of the marker.
- the marker gene encodes ⁇ -lactamase (e.g., GeneBLAzerTM Reporter System, Aurora Biosciences), E. coli ⁇ -galactosidase (lacZ, InvivoGen), human placental alkaline phosphatase (PLAP, InvivoGen) (Kam et al. , 1985, Proc. Natl. Acad. Sci. USA 82: 8715-19), E. coli ⁇ -glucuronidase (gus, Sigma) (Jefferson et al, 1986, Proc. Natl. Acad. Sci.
- ⁇ -lactamase e.g., GeneBLAzerTM Reporter System, Aurora Biosciences
- E. coli ⁇ -galactosidase lacZ, InvivoGen
- human placental alkaline phosphatase PLAP, InvivoGen
- E. coli ⁇ -glucuronidase gus, Sigma
- the marker gene encodes a chemiluminescent enzyme marker such as luciferase (Danilov et al. , ⁇ 1989, Bacterial luciferase as a biosensor of biologically active compounds. Biotechnology, 11 :39-78; Gould et al, 1988, Firefly luciferase as a tool in molecular and cell biology, Anal.
- luciferase Dilov et al. , ⁇ 1989, Bacterial luciferase as a biosensor of biologically active compounds. Biotechnology, 11 :39-78; Gould et al, 1988, Firefly luciferase as a tool in molecular and cell biology, Anal.
- Cells expressing PLAP an enzyme that resides on the outer surface of the cell membrane, can be labeled using the method of Gustincich et al. (1997, Neuron 18: 723-36; incorporated herein by reference in its entirety).
- Cells expressing ⁇ -glucuronidase can be assayed using the method of Lorincz et al, 1996, Cytometry 24(4): 321 -29, which is hereby incorporated by reference in its entirety.
- the marker gene can encode a marker that produces a detectable signal.
- the marker gene encodes a reporter or signal-producing protein.
- the marker gene encodes a signal-producing protein that is used to monitor a physiological state.
- the reporter is a fluorescent protein such as green fluorescent protein (GFP), including particular mutant or engineered forms of GFP such as BFP, CFP O 02/072017
- GFP green fluorescent protein
- the marker gene encodes a red, green, yellow, or cyan fluorescent protein (an "XFP"), such as one of those disclosed in Feng et al. (2000, Neuron, 28: 41-51; incorporated herein by reference in its entirety).
- XFP red, green, yellow, or cyan fluorescent protein
- the marker gene encodes E. coli ⁇ -glucuronidase (gus), and intracellular fluorescence is generated by activity of ⁇ -glucuronidase (Lorincz et al, 1996, Cytometry 24(4): 321-29; incorporated herein by reference in its entirety).
- a fluorescence-activated cell sorter FACS is used to detect the activity of the E. coli ⁇ -glucuronidase (gus) gene (Lorincz et al, 1996, Cytometry 24(4): 321-29).
- each reporter gene When loaded with the Gus substrate fluorescein-di-beta-D-glucuronide (FDGlcu), individual mammalian cells expressing and translating gus mRNA liberate sufficient levels of intracellular fluorescein for quantitative analysis by flow cytometry.
- This assay can be used to FACS-sort viable cells based on Gus enzymatic activity (see Section 5.7, infra), and the efficacy of the assay can be measured independently by using a fluorometric lysate assay.
- the intracellular fluorescence generated by the activity of both ⁇ -glucuronidase and E. coli ⁇ -galactosidase enzymes are detected by FACS independently. Because each enzyme has high specificity for its cognate substrate, each reporter gene can be measured by FACS independently.
- the marker gene encodes a fusion protein of one or more different detectable or selectable markers and any other protein or fragment thereof.
- the fusion protein consists of or comprises two different detectable or selectable markers or epitopes, for example a lacZ-GFP fusion protein or GFP fused to an epitope not normally expressed in the cell of interest.
- the markers or epitopes are not normally expressed in the transformed cell population or tissue of interest.
- the marker gene encodes a "measurement protein” such as a protein that signals cell state, e.g., a protein that signals intracellular membrane voltage, such as SBFI and PBFI (Molecular Probes, Eugene, OR).
- a "measurement protein” such as a protein that signals cell state, e.g., a protein that signals intracellular membrane voltage, such as SBFI and PBFI (Molecular Probes, Eugene, OR).
- the invention relates to a method of validating potential drug targets, i.e., a gene or protein product of a gene that is potentially related to a particular indication (e.g. , a particular disease or disorder) and that potentially serves as target for drug development, for example where the inhibition, altered expression, or increase in activity of the gene or protein product thereof treats, prevents or ameliorates the indication or symptom thereof.
- a particular indication e.g. , a particular disease or disorder
- a human gene is validated in a transgenic mouse.
- an ortholog of an endogenous gene in a transgenic animal is validated.
- the potential drug target is the product of an endogenous gene, the expression of which has been observed to increase or decrease in a particular disease state.
- the potential drug target is the product of an endogenous gene, the expression of which has been observed to increase or decrease during the activation of a particular neurotransmitter pathway, a cell signaling pathway, a disease state, known neuronal circuitry, or a physiological or behavioral state or response.
- states or responses include pain, sleeping, feeding, fasting, sexual behavior, aggression, depression, cognition, emotion, etc.
- a potential drug target-encoding gene encodes a receptor, transporter or uptake molecule, synthetic enzyme or degradative enzyme of a diffusible intercellular signaling molecule such as a neurotransmitter, e.g., 5HT, dopamine, acetylcholine, norepinephrine, GABA, glutamate/ AMP A/NMD A, glycine, or histamine; an intercellular signaling peptide, e.g.,opioid peptide, neurokinin, CCK, CRF, galanin, GRH, interferon, interleukin, motilin, neuroimmunophilin, neurotensin, NPY, angiotensin, bradykinin, Substance P, TRH, or vasopressin; an intercellular signaling fatty acid, e.g., prostaglandin, Cox-2, or anandamide; a small intercellular signaling molecule, e.g., adeno
- a potential drug target-encoding gene encodes a cell surface receptor or protein that interacts with the extracellular matrix or with another cell surface protein such as ICAM, myelin basic protein, or receptor tyrosine kinase.
- a potential drug target-encoding gene encodes an ion channel such as a sodium, potassium, or calcium channel.
- a potential drug target-encoding gene encodes an ion-binding protein such as a calcium-binding protein or an iron-binding protein.
- a potential drug target-encoding gene encodes a molecule that is a component of a second messenger or other signal transduction system such as a signaling system using a lipase, a cyclic nucleotide, e.g., cAMP, a phospholipase, a phosphatase, a kinase, PKC, a SH2/SH3 -containing protein, or NO.
- a potential drug target-encoding gene encodes a trophic factor, e.g., a cytokine or NT4/5.
- a potential drug target-encoding gene encodes an intracellular receptor such as a steroid receptor, e.g., an epalon, vomeropherin, or estrogen receptor.
- a steroid receptor e.g., an epalon, vomeropherin, or estrogen receptor.
- a potential drug target-encoding gene encodes an enzyme or a by-product such as a protease, an ATPase, aldose reductase or an enzyme that has a free radical substrate.
- a potential drug target-encoding gene encodes a component of an amyloid processing system such as amyloid or a presenilin.
- a potential drug target-encoding gene encodes a component of a system for blood clotting or for blood-clotting metabolism such as glycoprotein lib, thrombin, or a platelet aggregation mediator.
- a potential drug target-encoding gene encodes a component of a vesicle cycling system such as a tetanus target.
- a potential drug target-encoding gene encodes a cytoskeletal protein.
- the potential drug target is not a bacterial gene.
- a potential drug target is regulated (either activated or inhibited) by the presence of the key protein.
- the key protein is a transcriptional activator
- the potential drug target is operably linked to a promoter activated by the key protein transcriptional activator.
- a "modulating construct" containing a nucleotide sequence encoding the potential drug target, or a product that modulates (e.g., inhibits) the expression of the potential drug target, is introduced into the cells of an appropriate transgenic mouse line.
- the sequence encoding the potential drug target can be a nucleotide sequence that is homologous to a selected endogenous gene sequence in the transgenic animal line or that is orthologously related to the endogenous gene sequence.
- it can encode an inhibitor, including, but not limited to, inhibitory RNA (RNAi) or an inhibitor protein of an endogenous gene sequence encoding a potential drug target.
- RNAi inhibitory RNA
- the gene sequence encoding the potential drug target is expressed conditionally, using any type of inducible or repressible system available for conditional expression of genes known in the art, e.g., a system inducible or repressible by tetracycline ("tet system”); interferon; estrogen, ecdysone, or other steroid inducible system; Lac operator, progesterone antagonist RU486, or rapamycin (FK506).
- t system system inducible or repressible by tetracycline
- interferon estrogen, ecdysone, or other steroid inducible system
- Lac operator progesterone antagonist RU486, or rapamycin
- FK506 rapamycin
- the key gene product is the conditional enhancer or suppressor which, upon expression, enhances or suppresses expression of a gene encoding a potential drug target present either in a modulating construct or elsewhere in the genome of the transgenic animal.
- Two separate plasmids can be introduced sequentially that contain the genetic sequences that allow reversible induction of expression of the potential drug target on the modulating construct in response to tetracycline (tet) (Gossen and Bujard, 1992, Proc. Natl. Acad. Sci. USA 89, 5547-51).
- tetracycline tet
- a single autoregulatory cassette can be used that allows reversible induction of expression of the potential drug target in the modulating construct in response to tetracycline (tet) (Hofmann et al, 1996, Proc. Natl. Acad. Sci. USA 93, 5185-90, incorporated herein by reference in its entirety).
- the target under control of the inducible or repressible conditional regulatory elements is introduced using a retrovirus.
- the vector can be self-inactivating, eliminating transcription from the long terminal repeat after infection of target cells (Hofmann et al, 1996, Proc. Natl. Acad. Sci. USA 93, 5185-90).
- Tandem tet operator sequences and the CMV minimal promoter can be used to drive expression of a bicistronic mRNA, leading to transcription of the gene of interest (e.g., the drug target gene) and the internal ribosome entry site (IRES)-controlled transactivator (e.g., Tet repressor-VP16 fusion protein).
- the gene of interest e.g., the drug target gene
- the internal ribosome entry site (IRES)-controlled transactivator e.g., Tet repressor-VP16 fusion protein.
- IVS internal ribosome entry site
- an inducible lentiviral vector system can be used to conditionally express the potential gene target (Kafri et al, 2000, Molecular Therapy 1(6), 516-21, incorporated herein by reference in its entirety).
- the inducible lentiviral vector system contains the entire tet-regulated system developed by Gossen and Bujard (1992, Proc. Natl. Acad. Sci. USA 89, 5547-51).
- the lentiviral vector comprises a potential drug target gene and the tetracycline transactivator under the control of the tetracycline-inducible promoter and the human CMV promoter, respectively.
- the recombinant lentiviral vector is used to transform neurons, and doxycycline is used to regulate potential drug target gene expression in the neurons (Kafri et al, 2000, Molecular Therapy 1(6), 516-21; incorporated herein by reference in its entirety).
- doxycycline is used to regulate potential drug target gene expression in the neurons (Kafri et al, 2000, Molecular Therapy 1(6), 516-21; incorporated herein by reference in its entirety).
- terminally differentiated neurons can be made to express the drug target gene.
- a reverse tetracycline-controlled transactivator (rtTA) system can be combined with a promoter (Mansuy et al, 1998, Neuron 21, 257-65, incorporated herein by reference in its entirety). Expression can be reversed by removal of doxycycline.
- the Cre-loxP recombination system is combined with a tetracycline-dependent genetic switch and tissue-specific control elements (Utomo et al. , 1999, Nat. Biotechnol. 17, 1091-96; incorporated herein by reference in its entirety). Using the methods of Utomo et al, a gene in a specific tissue can be targeted.
- the characterizing gene sequence drives the expression of the reverse tetracycline-controlled transactivator (rtTA). Placed in cis configuration to the rtTA transcription unit, the rtTA-inducible ' promoter directs expression of Cre recombinase. In another specific embodiment, the Cre recombinase gene is under control of a tet gene switch.
- rtTA reverse tetracycline-controlled transactivator
- a mouse strain is generated in which regulatory sequences from a characterizing gene drive FLPe expression (Rodriguez et al , 2000, Nature Genetics 25, 139-40; incorporated herein by reference in its entirety).
- a FLP indicator strain is generated in which cells that have undergone a site- specific recombination event, or their daughter cells, are marked by a gain of ⁇ - galactosidase ( ⁇ -gal) activity.
- the indicator transgene (Hmgc ⁇ FRTZ) is composed of an FRT-disrupted lacZ reporter gene driven by mouse Hmgcr (encoding hydroxymethylglutaryl- coenzyme A reductase) promoter/enhancer sequences.
- Hmgcr encoding hydroxymethylglutaryl- coenzyme A reductase promoter/enhancer sequences.
- To profile FLP activity recombinase mice are crossed to this indicator strain. Offspring carrying both the recombinase and the indicator transgenes are analyzed for FLP -mediated lacZ activation by histochemical detection of ⁇ -gal in tissue sections.
- a nuclear localization signal may be appended to the amino terminus of ⁇ -gal to enable visualization of individual cells and to increase sensitivity by concentrating ⁇ -gal activity in the nucleus.
- Hmgcr.FRTZ indicator strain can be evaluated by generating a fully recombined derivative strain.
- Hmgcr:FRTZ mice may be crossed to produce F2 Hmgcr:FRTZ-A mice that are fully transgenic for the recombined indicator, making lacZ expression dependent only on the combined activity of the Hmgcr promoter and surrounding chromosomal DNA.
- the FLP recombinase can be expressed as the key gene and used to regulate expression of the target gene using site specific recombination.
- an altered cre gene with enhanced expression in mammalian cells is used as the key gene (Gorski and Jones, 1999, Nucleic Acids Research 27(9), 2059- 61; incorporated herein by reference in its entirety).
- a cre gene having a mutated splice acceptor site is preferably used to reduce the risk of undesired mRNA splicing event.
- a conditionally expressible transgene can be site-specifically inserted into an untranslated region (UTR) of genomic DNA of the gene encoding the potential drug target, e.g., the 3' UTR or the 5' region, so that expression of the transgene via the conditional expression system is induced or abolished by administration of the inducing or repressing substance, e.g., administration of tetracycline or doxycycline, ecdysone, estrogen, etc., without interfering with the normal profile of gene expression (see, e.g., Bond et al, 2000, Science 289: 1942-46; incorporated herein by reference in its entirety).
- UTR untranslated region
- the modulating constructs (constructs containing a potential drug target regulated by the key gene protein product) of the invention are preferably introduced into a transgenic animal of the invention (i.e., an animal expressing a key gene under the control of characterizing gene regulatory sequences) in a viral vector.
- the viral vector can be any viral vector known to be useful to introduce nucleic acid into the species of transgenic animal being used.
- the vector is a retroviral vector. They provide high efficiency infection, stable integration and stable expression (Friedmann, 1989, Science 244: 1275-81).
- Sequences of a gene of interest e.g., a gene encoding a potential drug target, or portions thereof, can be cloned into a retroviral vector. Delivery of the virus can be accomplished by direct injection or implantation of virus into the desired tissue of the adult animal, a fertilized egg, or an early stage or later stage embryo.
- the modulating construct is introduced using viral vectors and transduction methods described in Deglon et al. (2000, Human Gene Therapy 11 -.179-190; incorporated herein by reference in its entirety).
- Deglon et al describe methods for producing and introducing a self-inactivating (non-reproducing) lentiviral vector with enhanced transgene expression into a selected cell population, e.g., neurons in a particular brain region.
- the self-inactivating vector is used to transduce, and to localize delivery of a potential drug target to, a select population of neurons.
- the self-inactivating (SIN) lentiviral vector is modified using the methods of Deglon et al by insertion of the posttranscriptional regulatory element of the woodchuck hepatitis virus, and particles are produced with a multiply attenuated packaging system.
- the lentiviral vector comprising the modulating construct may also modified so that it has an improved ability to transduce the cells into which it is introduced.
- the methods of Zennou et al are used to incorporate a central DNA flap into the vector (2000, Cell 101, 173-85; incorporated herein by reference in its entirety).
- Lentiviruses have the unique property among retroviruses of replicating in nondividing cells. This property relies on the use of a nuclear import pathway enabling the viral DNA to cross the nuclear membrane of the host cell.
- HIV-1 reverse transcription, a central strand displacement event consecutive to central initiation and termination of plus strand synthesis, creates a plus strand overlap: the central DNA flap.
- a key determinant for nuclear import of lentiviral genomes is therefore the central DNA flap: the central DNA flap acts as a cis-determinant of HIV-1 DNA nuclear import.
- a self-inactivating or non-reproducing lentiviral vector comprising the modulating construct is designed using the methods of Zennou et al. The vector comprises a reinsertion of the DNA flap sequence, thereby restoring nuclear import of the vector to wild-type levels.
- a replication-defective lentiviral vector such as the one described by Naldini et al
- the lentiviral vector may be injected into a specific tissue, e.g., the brain.
- a lentivirus-based vector capable of infecting both mitotic and postmitotic cells is used to introduce the modulating construct.
- Postmitotic cells in particular postmitotic neurons, are generally refractory to stable infection by retroviral vectors, which require the breakdown of the nuclear membrane during cell division in order to insert the transgene into the host cell genome. Therefore, in a preferred embodiment, a lentivirus vector based on the human immunodeficiency virus (HIV) (Bl ⁇ mer et al, 1997, J. Virol., Vol.
- HIV human immunodeficiency virus
- Retroviral vectors are preferable because they permit stable integration of the transgene into a dividing host cell genome, and because the absence of any viral gene expression reduces the chance of an immune response in the transgenic animal.
- retroviruses can be easily pseudo-typed with a variety of envelope proteins to broaden or restrict host cell tropism, thus adding an additional level of cellular targeting for transgene delivery (Welch etal, 1998, Curr. Opin. Biotechnol. 9: 486-96).
- Adenoviral vectors can be used to provide efficient transduction, but they do not integrate into the host genome and, consequently, expression is only transient in actively dividing cells. In animals, a further complication arises in that the most commonly used recombinant adenoviral vectors still contain viral late genes that are expressed at low levels and can lead to a host immune response against the transduced cells (Welch et al. , 1998, Curr. Opin. Biotechnol. 9: 486-96). In one embodiment, a 'gutless' adenoviral vector can be used that lacks all viral coding sequences (Parks et al, 1996, Proc. Natl. Acad. Sci.
- AAV adeno-associated virus
- lentivirus lentivirus
- alpha virus vaccinia virus
- bovine papilloma virus members of the herpes virus group such as Epstein-Barr virus, baculovirus, yeast vectors, bacteriophage vectors (e.g., lambda), and plasmid and cosmid DNA vectors.
- viruses with tropism to central nervous system (CNS) tissue also can be used.
- Adeno-associated virus is attractive because it is a small, non-pathogenic virus that can stably integrate a transgene expression cassette without any viral gene expression (Welch et al, 1998, Curr. Opin. Biotechnol. 9: 486-96).
- An alpha virus system using recombinant Semliki Forest virus, provides high transduction efficiencies of mammalian cells along with high cytoplasmic transgene, e.g., ribozyme, expression (Welch et al, 1998, Curr. Opin. Biotechnol. 9: 486-96).
- lentiviruses such as HIV and feline immunodeficiency virus
- HIV and feline immunodeficiency virus are attractive as gene delivery vehicles due to their ability to integrate into non-dividing cells (Welch et al, 1998, Curr. Opin. Biotechnol. 9: 486-96).
- Site-specific integration of a transgene can be mediated by an adeno-associated virus (AAV) vector derived from a nonpathogenic and defective human parvovirus.
- AAV adeno-associated virus
- rAAV recombinant adeno-associated virus
- the nondividing cells are neurons.
- a recombinant (non-wildtype) AAV is used, such as one of those disclosed by Xiao et al. (1997, Exper. Neurol. 144: 113-24; incorporated herein by reference in its entirety).
- rAAV vector has biosafety features, a high titer, broad host range, lacks cytotoxicity, does not evoke a cellular immune response in the target tissue, and transduces quiescent or non-dividing cells. It is preferably used to transduce cells in the central nervous system (CNS).
- rAAV plasmid DNA is used in a nonviral gene delivery system as disclosed by Xiao et al. (1997, Exper. Neurol. 144: 113-24).
- Nondividing cells can be infected by human immunodeficiency virus type 1 (HIV- l)-based vectors, which results in transgene expression that is stable over several months.
- HIV- l human immunodeficiency virus type 1
- an HIV-1 vector with biosafety features e.g., a self-inactivating HIV-1 vector is used.
- a self-inactivating HIV-1 vector with a 400-nucleotide deletion in the 3' long terminal repeat (LTR) is used (Zufferey et al, 1998, J. Virol. 72(12): 9873-80 ' ; incorporated herein by reference in its entirety).
- LTR 3' long terminal repeat
- the deletion which includes the TATA box, abolishes the LTR promoter activity but does not affect vector titers or transgene expression in vitro.
- the self-inactivating vector may be used to transduce neurons in vivo.
- a retroviral vector that is rendered replication incompetent, stably integrates into the host cell genome, and does not express any viral proteins, such as a vector based on the Moloney murine leukemia virus (MMLV), is used for gene transfer into the host cell genome (Bl ⁇ mer et al, 1997, J. Virol., Vol. 71(9): 6641-49).
- MMLV Moloney murine leukemia virus
- Pseudorabies virus can also be used as a viral vector for introducing nucleic acid.
- a pseudorabies virus is used to introduce the modulating construct.
- a strain of pseudorabies virus PRV 152 is used to introduce a modulating construct, thereby permitting expression of a potential drug target from an inducible or repressible conditional transcription element (see, e.g., Smith et al., 2000, Proc. Natl. Acad. Sci. USA 97(16), 9264-9269).
- injection of a pseudorabies vector comprising a transgene into a specific group of neurons transsynaptically infects their postsynaptic targets.
- the modulating construct is packaged in a viral vector that is used to infect a general type or population of cells (for example, to infect the cells of a mouse in a global fashion) expressing the key protein in a select subpopulation of the general type or population of cells.
- the viral vector comprising the modulating construct is directly injected into a particular tissue region, e.g., a brain region.
- the transgene comprising the characterizing and key gene sequences are inserted into an appropriate vector.
- a vector is a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked, preferably, the other nucleic acid is incorporated into the vector via a covalent linkage, more preferably via a nucleotide bond such that the other nucleic acid can be replicated along with the vector sequences.
- One type of vector is a plasmid, which is a circular double stranded DNA loop into which additional DNA segments can be ligated.
- Another type of vector is a viral vector, wherein additional DNA segments can be ligated into a viral genome or derivative thereof.
- vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., episomal mammalian vectors).
- Other vectors e.g., non-episomal mammalian vectors
- the invention includes viral vectors, e.g., replication defective retroviruses, adeno viruses and adeno-associated viruses, which serve equivalent functions.
- vectors include, but are not limited to, plasmids or modified viruses, but the vector system must be compatible with the host cell used.
- vectors include, but are not limited to, bacteriophages such as lambda derivatives, or plasmids such as pBR322 or pUC plasmid derivatives or the Bluescript vector (Stratagene).
- vectors can replicate (i.e., have a bacterial origin of replication) and be manipulated in bacteria (or yeast) and can then be introduced into mammalian cells.
- the vector comprises a selectable or detectable marker such as Amp r , tef, LacZ, etc.
- the recombinant vectors of the invention comprise a transgene of the invention in a form suitable for expression of the nucleic acid in a transformed cell or transgenic animal.
- such vectors can accommodate ( . e.
- the vector can be used to introduce into cells and replicate) large pieces of DNA such as genomic sequences, for example, large pieces of DNA consisting of at least 25 kb, 50 kb, 75 kb, 100 kb, 150 kb, 200 kb or 250 kb, such as BACs, YACs, cosmids, etc.
- the vector is a BAC.
- the insertion of a DNA fragment into a vector can, for example, be accomplished by ligating the DNA fragment into a vector that has complementary cohesive termini. However, if the complementary restriction sites used to fragment the DNA are not present in the vector, the ends of the DNA molecules may be enzymatically modified.
- any site desired may be produced by ligating nucleotide sequences (linkers) onto the DNA termini; these ligated linkers may comprise specific chemically synthesized oligonucleotides encoding restriction endonuclease recognition sequences.
- the cleaved vector and the transgene may be modified by homopolymeric tailing.
- Vectors can be cloned using methods known in the art, e.g., by the methods disclosed in Sambrook et al, 2001, Molecular Cloning, A Laboratory Manual, Third Edition, Cold Spring Harbor Laboratory Press, N.Y.; Ausubel et al, 1989, Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y., both of which are incorporated herein by reference in their entireties.
- Vectors have replication origins and other selectable or detectable markers to allow selection of cells with vectors and vector maintenance.
- the vectors contain cloning sites, for example, restriction enzyme sites that are unique in the sequence of the vector and insertion of a sequence at that site would not disrupt an essential vector function, such as replication.
- a collection of vectors for making transgenic animals comprises two or more vectors wherein each vectors comprises a transgene containing a key gene operably linked to regulatory sequences of a characterizing gene corresponding to an endogenous gene or ortholog of an endogenous gene such that said key gene is expressed in said transgenic animal with an expression pattern that is substantially the same as the expression pattern of said endogenous gene in a non-transgenic animal or anatomical region or tissue thereof containing the population of cells of interest.
- vectors used in the methods of the invention preferably can accommodate, and in certain embodiments comprise, large pieces of heterologous DNA such as genomic sequences.
- Such vectors can contain an entire genomic locus, or at least sufficient sequence to confer endogenous regulatory expression pattern and to insulate the expression of coding sequences from the effect of regulatory sequences surrounding the site of integration of the transgene in the genome to mimic better wild type expression.
- entire genomic loci or significant portions thereof are used, few, if any, site-specific expression problems of a transgene are encountered, unlike insertions of transgenes into smaller sequences.
- the vector into which the transgene comprising the characterizing and key gene sequences is a BAC containing genomic sequences into which key gene coding sequences have been inserted by directed homologous recombination in bacteria, e.g., the methods of Heintz WO 98/59060; Heintz et al, WO 01/05962; Yang et al, 1997, Nature Biotechnol. 15: 859-865; Yang et al, 1999, Nature Genetics 22: 327-35; which are all incorporated herein by reference in their entireties.
- a BAC can be modified directly in a recombination-deficient E. coli host strain by homologous recombination.
- homologous recombination in bacteria is used for target- directed insertion of the key gene coding sequence into the genomic DNA encoding the characterizing gene and sufficient regulatory sequences to promote expression of the characterizing gene in its endogenous expression pattern, which sequences have been inserted into the BAC.
- the BAC comprising the key gene coding sequences under the regulation of the characterizing gene sequences is then recovered and introduced into the genome of a potential founder animal for a line of transgenic animals.
- the key gene is inserted into the 3' UTR of the characterizing gene and, preferably, has its own IRES.
- the key gene is inserted into the characterizing gene sequences using 5' direct fusion without the use of an IRES, i.e., such that the key gene coding sequences are fused directly in frame to the nucleotide sequence encoding at least the first codon of the characterizing gene coding sequence and even the first two, four, five, six, eight, ten or twelve codons.
- the key gene is inserted into the 5' UTR of the characterizing gene with an IRES controlling the expression of the key gene.
- the key gene sequence is introduced into the BAC containing the characterizing gene by the methods of Heintz et al WO 98/59060 and Heintz et al, WO 01/05962, both of which are incorporated herein by reference in their entireties.
- the key gene is introduced by performing selective homologous recombination on a particular nucleotide sequence contained in a recombination deficient host cell, . e., a cell that cannot independently support homologous recombination, e.g., Rec A " .
- the method preferably employs a recombination cassette that contains a nucleic acid containing the key gene coding sequence that selectively integrates into a specific site in the characterizing gene by virtue of sequences homologous to the characterizing gene flanking the key gene coding sequences on the shuttle vector when the recombination deficient host cell is induced to support homologous recombination (for example by providing a functional Rec A gene on the shuttle vector used to introduce the recombination cassette).
- the particular nucleotide sequence that has been selected to undergo homologous recombination is contained in an independent origin based cloning vector introduced into or contained within the host cell, and neither the independent origin based cloning vector alone, nor the independent origin based cloning vector in combination with the host cell, can independently support homologous recombination (e.g., is RecA " ).
- the independent origin based cloning vector is a BAC or a bacteriophage-derived artificial chromosome (BBPAC) and the host cell is a host bacterium, preferably E. coli.
- sufficient characterizing gene sequences flank the key gene coding sequences to accomplish homologous recombination and target the insertion of the key gene coding sequences to a particular location in the characterizing gene.
- the key gene coding sequence and the homologous characterizing gene sequences are preferably present on a shuttle vector containing appropriate selectable markers and the RecA gene, optionally with a temperature sensitive origin of replication (see Heintz et al. WO 98/59060 and Heintz et al, WO 01/05962 such that the shuttle vector only replicates at the permissive temperature and can be diluted out of the host cell population at the non-permissive temperature.
- the RecA gene When the shuttle vector is introduced into the host cell containing the BAC the RecA gene is expressed and recombination of the homologous shuttle vector and BAC sequences can occur thus targeting the key gene coding sequences (along with the shuttle vector sequences and flanking characterizing gene sequences) to the characterizing gene sequences in the BAC.
- the BACs can be selected and screened for integration of the key gene coding sequences into the selected site in the characterizing gene sequences using methods well known in the art (e.g., methods described in Section 6, infra, and in Heintz et al. WO 98/59060 and Heintz et al, WO01/05962).
- the shuttle vector sequences not containing the key gene coding sequences can be removed from the BAC by resolution as described in Section 6 and in Heintz et al. WO 98/59060 and Heintz et al, WO 01/05962. If the shuttle vector contains a negative selectable marker, cells ' can be selected for loss of the shuttle vector sequences.
- the functional RecA gene is provided on a second vector and removed after recombination, e.g., by dilution of the vector or by any method known in the art.
- the exact method used to introduce the key gene coding sequences and to remove (or not) the RecA (or other appropriate recombination enzyme) will depend upon the nature of the BAC library used (for example the selectable markers present on the BAC vectors) and such modifications are within the skill in the art.
- the BAC containing the characterizing gene regulatory sequences and key gene coding sequences in the desired configuration is identified, it can be isolated from the host E. coli cells using routine methods and used to make transgenic animals as described, infra).
- BACs to be used in the methods of the invention are selected and/or screened using the methods described in Section 5.3, supra, and Section 6, infra.
- the BAC can also be engineered or modified by " ⁇ -T cloning," as described by Muyrers et al. (1999, Nucleic Acids Res. 27(6): 1555-57, incorporated herein by reference in its entirety).
- ⁇ -T cloning a method for modifying specific DNA into a BAC independently of the presence of suitable restriction sites. This method is based on homologous recombination mediated by the rec ⁇ and recT proteins (" ⁇ T-cloning”) (Zhang et al, 1998, Nat. Genet. 20(2): 123-28; incorporated herein by reference in its entirety).
- Homologous recombination can be performed between a PCR fragment flanked by short homology arms and an endogenous intact recipient such as a BAC. Using this method, homologous recombination is not limited by the disposition of restriction endonuclease cleavage sites or the size of the target DNA.
- a BAC can be modified in its host strain using a plasmid, e.g., pBAD- ⁇ , in which recE and recT have been replaced by their respective functional counterparts of phage lambda (Muyrers et al, 1999, Nucleic Acids Res. 27(6): 1555-57).
- a BAC is modified by recombination with a PCR product containing homology arms ranging from 27-60 bp. In a specific embodiment, homology arms are 50 bp in length.
- a transgene is inserted into a yeast artificial chromosome (YAC) (Burke et al, 1987 Science 236: 806-12; and Peterson et al, 1997, Trends Genet. 13: 61).
- YAC yeast artificial chromosome
- the transgene is inserted into another vector developed for the cloning of large segments of mammalian DNA, such as a cosmid or bacteriophage PI (Sternberg et al, 1990, Proc. Natl. Acad. Sci. USA 87: 103-07).
- the approximate maximum insert size is 30-35 kb for cosmids and 100 kb for bacteriophage PI.
- the transgene is inserted into a P-l derived artificial chromosome (PAC) (Mejia et al, 1997, Genome Res 7:179-186).
- PAC P-l derived artificial chromosome
- Vectors containing the appropriate characterizing and key gene sequences may be identified by any method well known in the art, for example, by sequencing, restriction mapping, hybridization, PCR amplification, etc.
- a vector containing the transgene comprising the key and/or characterizing gene is introduced into the genome of a host cell, and the host cell is then used to create a transgenic animal.
- host cell and “recombinant host cell” are used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
- a host cell can be any prokaryotic (e.g., E. coli) or eukaryotic cell (e.g., insect cells, yeast or mammalian cells), preferably a mammalian cell, and most preferably a mouse cell.
- Host cells intended to be part of the invention include ones that comprise a system and/or characterizing gene sequence that has been engineered to be present within the host cell (e.g., as part of a vector), and ones that comprise nucleic acid regulatory sequences that have been engineered to be present in the host cell such that a nucleic acid molecule of the invention is expressed within the host cell.
- the invention encompasses genetically engineered host cells that contain any of the foregoing system and/or characterizing gene sequences operatively associated with a regulatory element (preferably from a characterizing gene, as described above) that directs the expression of the coding sequences in the host cell. Both cDNA and genomic sequences can be cloned and expressed.
- the host cell is recombination deficient, i.e., Rec " , and used for BAC recombination.
- a vector containing a transgene can be introduced into the desired host cell by methods known in the art, e.g., transfection, transformation, transduction, electroporation, infection, microinjection, cell fusion, DEAE dextran, calcium phosphate precipitation, liposomes, LIPOFECTINTM (Bethesda Research Laboratories, Gaithersburg, MD), lysosome fusion, synthetic cationic lipids, use of a gene gun or a DNA vector transporter, such that the transgene is transmitted to offspring in the line.
- methods known in the art e.g., transfection, transformation, transduction, electroporation, infection, microinjection, cell fusion, DEAE dextran, calcium phosphate precipitation, liposomes, LIPOFECTINTM (Bethesda Research Laboratories, Gaithersburg, MD), lysosome fusion, synthetic cationic lipids, use of a gene gun or a DNA vector transporter, such that the transgene is transmitted
- Particularly preferred embodiments of the invention encompass methods of introduction of the vector containing the transgene using pronuclear injection of a transgenic construct into the mononucleus of a mouse embryo and infection with a viral vector comprising the construct.
- Methods of pronuclear injection into mouse embryos are well-known in the art and described in Hogan et al 1986, Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, New York, NY and Wagner et al, U.S. Patent No. 4,873,191, issued October 10, 1989, herein incorporate by reference in their entireties.
- a vector containing the transgene is introduced into any nucleic genetic material which ultimately forms a part of the nucleus of the zygote of the animal to be made transgenic, including the zygote nucleus.
- the transgene can be introduced in the nucleus of a primordial germ cell which is diploid, e.g., a spermatogonium or oogonium. The primordial germ cell is then allowed to mature to a gamete which is then united with another gamete or source of a haploid set of chromosomes to form a zygote.
- the vector containing the transgene is introduced in the nucleus of one of the gametes, e.g., a mature sperm, egg or polar body, which forms a part of the zygote.
- the vector containing the transgene is introduced in either the male or female pronucleus of the zygote. More preferably, it is introduced in either the male or the female pronucleus as soon as possible after the sperm enters the egg. In other words, right after the formation of the male pronucleus when the pronuclei are clearly defined and are well separated, each being located near the zygote membrane.
- the vector containing the transgene is added to the male DNA complement, or a DNA complement other than the DNA complement of the female pronucleus, of the zygote prior to its being processed by the ovum nucleus or the zygote female pronucleus.
- the vector containing the transgene could be added to the nucleus of the sperm after it has been induced to undergo decondensation.
- the vector containing the transgene may be mixed with sperm and then the mixture injected into the cytoplasm of an unfertilized egg.
- Perry et al. 1999, Science 284:1180-1183.
- the vector maybe injected into the vas deferens of a male mouse and the male mouse mated with normal estrus females. Huguet et al, 2000, Mol. Reprod. Dev. 56:243-247.
- the transgene is introduced using any technique so long as it is not destructive to the cell, nuclear membrane or other existing cellular or genetic structures.
- the transgene is preferentially inserted into the nucleic genetic material by microinjection. Microinjection of cells and cellular structures is known and is used in the art. Also known in the art are methods of transplanting the embryo or zygote into a pseudopregnant female where the embryo is developed to term and the transgene is integrated and expressed. See, e.g., Hogan et al. 1986, Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, New York, NY. Viral methods of inserting a transgene are known in the art and have been described, supra.
- a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is generally introduced into the host cells along with the gene sequence of interest, e.g., the key gene sequence.
- selectable markers include those which confer resistance to drugs, such as G418, hygromycin and methotrexate.
- Cells stably transfected with the introduced nucleic acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other cells die). Such methods are particularly useful in methods involving homologous recombination in mammalian cells (e.g., in murine ES cells) prior to introducing the recombinant cells into mouse embryos to generate chimeras.
- a number of selection systems may be used to select transformed host cells.
- the vector may contain certain detectable or selectable markers.
- Other methods of selection include but are not limited to selecting for another marker such as: the herpes simplex virus thymidine kinase (Wigler et al, 1977, Cell 11: 223), hypoxanthine-guanine phosphoribosyltransferase (Szybalska and Szybalski, 1962, Proc. Natl. Acad. Sci.
- adenine phosphoribosyltransferase genes can be employed in tk-, hgprt- or aprt- cells, respectively.
- antimetabolite resistance can be used as the basis of selection for the following genes: dhfr, which confers resistance to methotrexate (Wigler et al, 1980, Natl. Acad. Sci. USA 77: 3567; O'Hare et al, 1981, Proc. Natl. Acad. Sci. USA 78: 1527); gpt, which confers resistance to mycophenolic acid (Mulligan and Berg, 1981, Proc. Natl.
- the transgene may integrate into the genome of the founder animal (or an oocyte or embryo that gives rise to the founder animal), preferably by random integration. In other embodiments the transgene may integrate by a directed method, e.g., by directed homologous recombination ("knock-in"), Chappel, U.S. Patent No. 5,272,071; and PCT publication No.
- the construct will comprise at least a portion of the characterizing gene with a desired genetic modification, e.g., insertion of the key gene coding sequences and will include regions of homology to the target locus, i.e., the endogenous copy of the characterizing gene in the host's genome.
- DNA constructs for random integration need not include regions of homology to mediate recombination.
- Markers can be included for performing positive and negative selection for insertion of the transgene.
- a homologous recombination vector is prepared in which the key gene is flanked at its 5' and 3' ends by characterizing gene sequences to allow for homologous recombination to occur between the exogenous gene carried by the vector and the endogenous characterizing gene in an embryonic stem cell.
- the additional flanking nucleic acid sequences are of sufficient length for successful homologous recombination with the endogenous characterizing gene.
- flanking DNA both at the 5' and 3' ends
- the drug validation method of the invention does not involve the production of transgenic lines for each potential drug target to be validated but, rather, involves introduction of the potential drug target (or an inhibitor thereof) into existing transgenic animal lines such that the potential drug target is either expressed or inhibited only in a particular subset of cells (i.e., expression is spatially or temporally restricted).
- a coding region for a potential drug target is operably linked to an inducible or repressible conditional transcription element.
- the modulating construct is cloned into a viral vector that is used to infect a general type or population of cells (for example, the cells of a mouse in a global fashion) expressing the key protein in a select subpopulation of the general type or population of cells using any method know in the art.
- the viral vector comprising the modulating construct is directly injected into a particular tissue region, e.g., a brain region.
- the invention provides a method of expressing a potential drug target protein (or inhibitor thereof) in a specific subset of cells in a non-human animal.
- the method comprises introducing into cells of the transgenic non-human animal a vector comprising a first nucleotide sequence encoding the potential drug target protein (or inhibitor thereof), the expression of the potential drug target protein or inhibitor thereof being under the control of a conditional expression element.
- the transgenic non-human animal comprises a transgene containing a key gene that encodes an inducer or suppressor of the conditional expression element.
- the key gene is operably linked to regulatory sequences of a characterizing gene corresponding to an endogenous gene or homolog of an endogenous gene such that the key gene is expressed in the transgenic non-human animal with an expression pattern that is substantially the same as the expression pattern of the endogenous gene in a non-transgenic animal of the same species as the transgenic non- human animal.
- the transgene is located at a site in the mouse genome other than the site of the endogenous characterizing gene.
- the potential drug target protein (or inhibitor thereof) is thereby selectively expressed in the cells expressing the key gene.
- the invention provides a method of determining whether the modulation of expression of a potential target gene in a particular cell type is causally linked to a desired effect, for example, expression of the potential target causes the expression of a certain cell or tissue phenotype associated with a particular disease or disorder or with the treatment, prevention or amelioration of that disease or disorder.
- homogeneous populations of cells expressing a particular key gene or group of key genes are isolated and purified from a transgenic animal line of the collection.
- a modulating construct comprising a gene encoding a selected potential drug target is introduced into the genomes of the homogeneous cell populations.
- the expression of the potential target gene is then modulated to determine whether expression of the potential target causes the expression of a certain cell or tissue phenotype associated with a particular disease or disorder or with the treatment, prevention or amelioration of that disease or disorder.
- the modulating construct is introduced into the genomes of cells in vivo.
- the drug validation system of the invention is more flexible, convenient and efficient than other existing drug validation systems because it uses one of a limited set of transgenic mouse lines instead of requiring the production of a transgenic mouse line for each target to be validated.
- the subject methods are advantageous because they enable the validation of drug targets to proceed rapidly and efficiently, limited only by the rate at which modulating constructs and viral vectors containing those modulating constructs can be produced, and not by the rate at which a transgenic animal line can be produced.
- a collection of transgenic animal lines expressing key proteins can be used repeatedly to validate many potential drug targets introduced via modulating constructs.
- a transgenic animal is a non-human animal, preferably a mammal, more preferably a rodent such as a rat or mouse, in which one or more of the cells of the animal includes a transgene, i.e., has a non-endogenous (i.e., heterologous) nucleic acid sequence present as an extrachromosomal element in a portion of its cell or stably integrated into its germline DNA (i.e., in the genomic sequence of most or all of its cells).
- Other examples of transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens, amphibians, etc.
- transgenic animal comprises stable changes to the germline sequence.
- Heterologous nucleic acid is introduced into the germ line of such a transgenic animal by genetic manipulation of, for example, embryos or embryonic stem cells of the host animal.
- Methods for producing transgenic animal lines and collection of transgenic animal lines are described in Serafmi, U.S. Patent Application Serial No. (to be assigned) (Attorney Docket Number 10239-010-999) entitled “Collections of Transgenic Animal Lines (Living Library)" filed February 14, 2001, which is incorporated herein by reference in its entirety.
- the transgenic animals of the invention are preferably generated by random integration of a vector containing a transgene of the invention into the genome of the animal, for example, by pronuclear injection in the animal zygote, or injection of sperm mixed with vector DNA as described above.
- Other methods involve introducing the vector into cultured embryonic cells, for example ES cells, and then introducing the transformed cells into animal blastocysts, thereby generating a "chimeras" or "chimeric animals", in which only a subset of cells have the altered genome.
- Chimeras are primarily used for breeding purposes in order to generate the desired transgenic animal. Animals having a heterozygous alteration are generated by breeding of chimeras. Male and female heterozygotes are typically bred to generate homozygous animals.
- a homologous recombinant animal is a non-human animal, preferably a mammal, more preferably a mouse, in which an endogenous gene has been altered by homologous recombination between the endogenous gene and an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to development of the animal.
- a transgenic animal of the invention is created by introducing a transgene of the invention, encoding the characterizing gene regulatory sequences operably linked to the key gene sequence, into the male pronuclei of a fertilized oocyte, e.g., by microinjection or retroviral infection, and allowing the egg to develop in a pseudopregnant female foster animal.
- Methods for generating transgenic animals via embryo manipulation and microinjection, particularly animals such as mice have become conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. Patent No.
- transgenic founder animal can be identified based upon the presence of the transgene in its genome and/or expression of mRNA encoding the transgene in tissues or cells of the animals. A transgenic founder animal can then be used to breed additional animals carrying the transgene as described supra.
- transgenic animals carrying the transgene can further be bred to other transgenic animals carrying other transgenes, animals of the same species that are disease models, etc.
- the transgene encoding the characterizing gene regulatory sequences operably linked to the key gene sequence is inserted into the genome of an embryonic stem (ES) cell, followed by injection of the modified ES cell into a blastocyst- stage embryo that subsequently develops to maturity and serves as the founder animal for a line of transgenic animals.
- ES embryonic stem
- a vector bearing a transgene encoding the characterizing gene regulatory sequences operably linked to the key gene sequence is introduced into ES cells (e.g., by electroporation) and cells in which the introduced gene has homologously recombined with the endogenous gene are selected.
- ES cells e.g., Li et al, 1992, Cell 69:915.
- embryonic stem (ES) cells an ES cell line may be employed, or embryonic cells may be obtained freshly from a host, e.g. mouse, rat, guinea pig, etc.
- ES cells are grown on an appropriate feeder layer, e.g., a fibroblast-feeder layer, in an appropriate medium and in the presence of appropriate growth factors, such as leukemia inhibiting factory (LIF). Cells that contain the construct of interest may be detected by employing a selective medium. Transformed ES cells may then be used to produce transgenic animals via embryo manipulation and blastocyst injection. (See, e.g., U.S. Pat. Nos. 5,387,742, 4,736,866 and 5,565,186 for methods of making transgenic animals.)
- LIF leukemia inhibiting factory
- ES cells that stably express a key gene product may be engineered.
- ES host cells can be transformed with DNA, e.g., a plasmid, controlled by appropriate expression control elements (e.g., promoter, enhancer, sequences, transcription terminators, polyadenylation sites, etc.), and a selectable marker.
- appropriate expression control elements e.g., promoter, enhancer, sequences, transcription terminators, polyadenylation sites, etc.
- engineered ES cells may be allowed to grow for 1-2 days in an enriched media, and then are switched to a selective media.
- the selectable marker in the recombinant plasmid confers resistance to the selection and allows cells to stably integrate the plasmid into their chromosomes and expanded into cell lines. This method may advantageously be used to engineer ES cell lines that express the key gene product.
- the selected ES cells are then injected into a blastocyst of an animal (e.g., a mouse) to form aggregation chimeras.
- an animal e.g., a mouse
- Blastocysts are obtained from 4 to 6 week old superovulated females.
- the ES cells are trypsinized, and the modified cells are injected into the blastocoel of the blastocyst. After injection, the blastocysts are implanted into the uterine horns of suitable pseudopregnant female foster animal.
- the ES cells may be incorporated into a morula to form a morula aggregate which is then implanted into a suitable pseudopregnant female foster animal.
- Females are then allowed to go to term and the resulting litters screened for mutant cells having the construct encoding the characterizing gene regulatory sequences operably linked to the key gene sequence, .
- the chimeric animals are screened for the presence of the characterizing gene regulatory sequences operably linked to the key gene sequence.
- Males and female chimeras having the modification are mated to produce homozygous progeny. Only chimeras with transformed germline cells will generate homozygous progeny. If the gene alterations cause lethality at some point in development, tissues or organs can be maintained as allergenic or congenic grafts or transplants, or in in vitro culture.
- Progeny harboring homologously recombined or integrated DNA in their germline cells can be used to breed animals in which all cells of the animal contain the homologously recombined DNA or randomly integrated transgene by germline transmission of the transgene.
- mice of the non-human transgenic animals described herein can also be produced according to the methods described in Wilmut et al, 1997, Nature 385: 810-13 and PCT Publication NOS. WO 97/07668 and WO 97/07669.
- the transgenic mice may be bred and maintained using methods well known in the art.
- the mice may be housed in an environmentally controlled facility maintained on a 10 hour dark: 14 hour light cycle or other appropriate light cycle. Mice are mated when they are sexually mature (6 to 8 weeks old).
- the transgenic founders or chimeras are mated to an unmodified animal (i.e., an animal having no cells containing the transgene).
- the transgenic founder or chimera is mated to C57BL/6 mice (Jackson Laboratories).
- the transgene encoding the characterizing gene regulatory sequences operably linked to the key gene sequence is introduced into ES cells and a chimeric mouse is generated
- the chimera is mated to 129/Sv mice, which have the same genotype as the embryonic stem cells.
- Protocols for successful breeding are known in the art (see also Section 6).
- Commercial breeding services e.g., Tosk, Inc. (Santa Cruz, CA) are also well known in the art and may be used to breed transgenic animals.
- a founder male is mated with two females and a founder female is mated with one male.
- Preferably two females are rotated through a male's cage every 1-2 weeks.
- Pregnant females are generally housed 1 or 2 per cage.
- pups are ear tagged, genotyped, and weaned at approximately 21 days.
- Males and females are housed separately.
- log sheets are kept for any mated animal, by example and not limitation, information should include pedigree, birth date, sex, ear tag number, source of mother and father, genotype, dates mated and generation.
- founder animals heterozygous for the transgene encoding the characterizing gene regulatory sequences operably linked to the key gene sequence may be mated to generate a homozygous line as follows: A heterozygous founder animal, designated as the P j generation, is mated with an offspring designated as the F, generation from a mating of a non-transgenic mouse with a transgenic mouse heterozygous for the transgene (backcross). Based on classical genetics, one fourth of the results of this backcross are homozygous for the transgene.
- transgenic founders are individually backcrossed to an inbred or outbred strain of choice. Different founders should not be intercrossed, since different expression patterns may result from separate transgene integration events.
- transgenic mouse is homozygous or heterozygous for the transgene.
- an offspring of the above described breeding cross is mated to a normal control non-transgenic animal.
- the offspring of this second mating are analyzed for the presence of the transgene by the methods described below. If all offspring of this cross test positive for the transgene, the mouse in question is homozygous for the transgene. If, on the other hand, some of the offspring test positive for the transgene and others test negative, the mouse in question is heterozygous for the transgene.
- An alternative method for distinguishing between a transgenic animal which is heterozygous and one which is homozygous for the transgene is to measure the intensity with radioactive probes following Southern blot analysis of the DNA of the animal. Animals homozygous for the transgene would be expected to produce higher intensity signals from probes specific for the transgene than would heterozygote transgenic animals.
- the transgenic mice are so highly inbred to be genetically identical except for sexual differences. The homozygotes are tested using backcross and intercross analysis to ensure homozygosity. Homozygous lines for each integration site in founders with multiple integrations are also established. Brother/sister matings for 20 or more generations define an inbred strain.
- the transgenic lines are maintained as hemizygotes.
- individual genetically altered mouse strains are also cryopreserved rather than propagated. Methods for freezing embryos for maintenance of founder animals and transgenic lines are known in the art. Gestational day 2.5 embryos are isolated and cryopreserved in straws and stored in liquid nitrogen. The first and last straws are subsequently thawed and transferred to foster females to demonstrate viability of the line with the assumption that all embryos frozen between the first and last straws will behave similarly. If viable progeny are not observed a second embryo transfer will be performed. Methods for reconstituting frozen embryos and bringing the embryos to term are known in the art.
- Transgenic animals that exhibit appropriate expression are selected as transgenic animal lines.
- in situ hybridization using probes specific for the key gene coding sequences may also be used to detect expression of the key gene product.
- immunohistochemistry using an antibody specific for the key gene product or associated marker is used to detect expression of the key gene product.
- expression of the key gene may be detected by in situ hybridization to detect the key gene mRNA.
- marker gene expression is visualized in single living mammalian cells.
- the method of Zlokarnik et al, (1998, Science 279: 84-88; incorporated herein by reference in its entirety) is used to visualize marker gene expression.
- the marker gene encodes an enzyme, e.g., ⁇ -lactamase.
- an enzyme assay is performed in which ⁇ -lactamase hydrolyzes a substrate loaded intracellularly as a membrane-permeant ester. Each molecule of ⁇ -lactamase changes the fluorescence of many substrate molecules from green to blue by disrupting resonance energy transfer. This wavelength shift can be detected by eye or photographically (either on film or digitally) in individual cells containing less than 100 ⁇ -lactamase molecules.
- the non-invasive method of Contag et al. is used to detect and localize light originating from a mammal in vivo (Contag et al. , U.S. Patent No. 5,650,135, issued July 22, 1997; incorporated herein by reference in its entirety) .
- Light- emitting conjugates are used that contain a biocompatible entity and a light-generating moiety.
- Biocompatible entities include, but are not limited to, small molecules such as cyclic organic molecules; macromolecules such as proteins; microorganisms such as viruses, bacteria, yeast and fungi; eukaryotic cells; all types of pathogens and pathogenic substances; and particles such as beads and liposomes.
- biocompatible entities may be all or some of the cells that constitute the mammalian subject being imaged.
- Light-emitting capability is conferred on the entities by the conjugation of a light- generating moiety.
- moieties include fluorescent molecules, fluorescent proteins, enzymatic reactions giving off photons and luminescent substances, such as bioluminescent proteins.
- the conjugation may involve a chemical coupling step, genetic engineering of a fusion protein, or the transformation of a cell, microorganism or animal to express a bioluminescent protein.
- the light-generating moiety may be a bioluminescent or fluorescent protein "conjugated" to the cells through localized, promoter-controlled expression from a vector construct introduced into the cells by having made a transgenic or chimeric animal.
- Light-emitting conjugates are typically administered to a subject by any of a variety of methods, allowed to localize within the subject, and imaged. Since the imaging, or measuring photon emission from the subject, may last up to tens of minutes, the subject is usually, but not always, immobilized during the imaging process.
- Imaging of the light-emitting entities involves the use of a photodetector capable of detecting extremely low levels of light (typically single photon events) and integrating photon emission until an image can be constructed.
- sensitive photodetectors include devices that intensify the single photon events before the events are detected by a camera, and cameras (cooled, for example, with liquid nitrogen) that are capable of detecting single photons over the background noise inherent in a detection system.
- a photon emission image is generated, it is typically superimposed on a "normal" reflected light image of the subject to provide a frame of reference for the source of the emitted photons (i.e. localize the light-emitting conjugates with respect to the subject).
- a "composite” image is then analyzed to determine the location and/or amount of a target in the subject.
- Homogeneous populations of cells that express a particular key gene can be isolated and purified from transgenic animals of the invention.
- Methods for cell isolation include, but are not limited to, surgical excision or dissection, dissociation, fluorescence-activated cell sorting (FACS), panning, and laser capture microdissection (LCM).
- FACS fluorescence-activated cell sorting
- LCD laser capture microdissection
- cells expressing a particular key gene are isolated using surgical excision or dissection. Before dissection, the transgenic animal may be perfused. Perfusion is preferably accomplished using a perfusion solution that contains ⁇ -amanitin or other transcriptional blockers to prevent changes in gene expression from occurring during cell isolation. In other embodiments, cells expressing a particular key gene are isolated from adult rodent brain tissue which is dissected and dissociated. Methods for such dissection and dissociation are well-known in the art. See, e.g., Brewer, 1997, J. Neurosci. Methods 71(2):143-55; Nakajima et al, 1996, Neurosci. Res.
- cells expressing a particular key gene are dissected from tissue slices based on their morphology as seen by transmittance light direct visualization and cultured, using, e.g., the methods of Nakajima et al, 1996, Neurosci. Res. 26(2):195-203; Masuko et al, 1992, Neuroscience 49(2):347-64; which are incorporated herein by reference in their entireties.
- Tissue slices are made of a particular tissue region and a particular subregion, e.g., a brain nucleus, is isolated under direct visualization using a dissecting microscope.
- cells expressing a particular key gene can be dissociated using a protease such as papain (Brewer, 1997, J. Neurosci. Methods 71(2):143-55; Nakajima et al, 1996, Neurosci. Res. 26(2):195-203;) or trypsin (Baranes, 1996, Proc. Natl. Acad. Sci. USA 93(10):4706-11; Emerling et al, 1994, Development 120(10):2811-22; Gilbert, 1997, J. Neurosci. Methods 71(2):191-98; Ninomiya, 1994, Int. J. Dev. Neurosci. 12(2): 99-106; Huber, 2000, J.
- a protease such as papain (Brewer, 1997, J. Neurosci. Methods 71(2):143-55; Nakajima et al, 1996, Neurosci. Res. 26(2):195-203;) or trypsin (
- Cells can also be dissociated using collagenase (Delree, 1989, J. Neurosci. Res. 23(2):198-206; incorporated herein by reference in its entirety).
- the dissociated cells are then grown in cultures over a feeder layer.
- the dissociated cells are neurons that are grown over a glial feeder layer.
- tissue that is labeled with a fluorescent marker can be microdissected and dissociated using the methods of Martinou (1989, J. Neurosci. 9(10):3645-56; incorporated herein by reference in its entirety). Microdissection of the labeled cells is followed by density-gradient centrifugation. The cells are then purified by fluorescence-activated cell sorting (FACS) (see infra). In other embodiments, cells can be purified by a cell-sorting procedure that only uses light-scatter parameters and does not necessitate labeling (Martinou, 1989, J. Neurosci. 9(10):3645-56).
- FACS fluorescence-activated cell sorting
- a subset of cells within a heterogeneous cell population derived from a transgenic animal in the collection of transgenic animals lines is recognized by expression of a key gene and/or marker gene.
- the regulatory sequences of the characterizing gene are used to express a key gene and/or a marker gene protein in transgenic cells, and the targeted population of cells is isolated based on expression of the key gene and/or marker gene.
- Selection and/or separation of the target subpopulation of cells may be effected by any convenient method. For example, where the marker is an externally accessible, cell-surface associated protein or other epitope-containing molecule, immuno-adsorption panning techniques or fluorescent immuno-labeling coupled with fluorescence activated cell sorting (FACS) are conveniently applied.
- FACS fluorescence activated cell sorting
- Cells that express a marker gene product can be detected using flow cytometric methods such as the one described by Mouawad et al, 1997, J. Immunol. Methods, 204(1), 51-56; incorporated herein by reference in its entirety).
- the method is based on an indirect immunofluorescence staining procedure using a monoclonal antibody that binds specifically to the marker enzyme encoded by the marker gene sequence, e.g. , ⁇ - galactosidase or a ⁇ -galactosidase fusion protein.
- the method can be used for both quantification in vitro and in vivo of enzyme expression in mammalian cells.
- the method is preferably used with a construct containing a lacZ selectable marker.
- cells expressing a key gene and/or marker gene can be quantified and gene regulation, including transfection modality, promoter efficacy, enhancer activity, and other regulatory factors studied (Mouawad et al, 1997, J. Immunol. Methods 204(1): 51-56).
- a FACS-enzyme assay e.g., a FACS-Gal assay
- the FACS-Gal assay measures E. coli lacZ-encoded ⁇ -galactosidase activity in individual cells. Enzyme activity is measured by flow cytometry, using a fluorogenic substrate that is hydrolyzed and retained intracellularly.
- lacZ serves both as a reporter gene to quantitate gene expression and as a selectable marker for the fluorescence-activated cell sorting based on their lacZ expression level.
- phenylethyl-beta-D-thiogalactoside (PETG) is used as a competitive inhibitor in the reaction, to inhibit ⁇ -galactosidase activity and slow reaction with the substrate.
- interfering endogenous host e.g., mammalian
- ⁇ -galactosidases are inhibited by the weak base chloroquine.
- a fluorescence-activated cell sorter (FACS) is used to detect the activity of a marker gene encoding E. coli ⁇ -glucuronidase (gus) (Lorincz et al, 1996, Cytometry 24(4): 321-9).
- FACS fluorescence-activated cell sorter
- gus E. coli ⁇ -glucuronidase
- FDGlcu Gus substrate fluorescein-di-beta- D- glucuronide
- This assay can be used to FACS-sort viable cells based on Gus enzymatic activity, and the efficacy of the assay can be measured independently by using a fluorometric lysate assay.
- the intracellular fluorescence generated by the activity of both beta-glucuronidase and E. coli ⁇ -galactosidase enzymes are detected by FACS independently. Because each enzyme has high specificity for its cognate substrate, each reporter gene can be measured by FACS independently.
- the invention provides methods for isolating individual cells harboring a fluorescent protein reporter from tissues of transgenic mice by FACS. See Hadjaantonakis and Naki, 2000, Genesis, 27(3):95-8, which is incorporated herein by reference it its entirety.
- the reporter is a autofluorescent (AFP) reporter such as, but not limited to, wild type Green Fluorescent Protein (wtGFP) and its variants, including enhanced green fluorescent protein (EGFP) and enhanced yellow fluorescent protein (EYFP).
- wtGFP wild type Green Fluorescent Protein
- EGFP enhanced green fluorescent protein
- EYFP enhanced yellow fluorescent protein
- cells are isolated by FACS using fluorescent antibody staining of cell surface proteins.
- the cells are isolated using methods known in the art as described by Barrett et al. , 1998, Neuroscience, 85(4): 1321 -8, incorporated herein in its entirety.
- cells are isolated by FACS using fluorogenic substrates of an enzyme transgenically expressed in a particular cell-type.
- the cells are isolated using methods known in the art as described by Blass-Kampmann et al, 1994, J. Neurosci. Res., 37(3):359-73, which is incorporated herein by reference in its entirety.
- the invention also provides methods for isolating cells from primary culture cells.
- WACS whole animal sorting
- cells are isolated by FACS using fluorescent, vital dyes to retrograde label cells with fluorescent tracers.
- Cells are isolated using the methods described by St. John and Stephens, 1992, Dev. Biol. 151(l):154-65, Martinou et al, 1992, Neuron 8(4):737-44. Clendening and Hume, 1990, J Neurosci. 10(12):3992-4005 and Martinou et al, 1989, J Neurosci, 9(10):3645-56, which are all incorporated herein by reference in their entireties.
- cells are isolated by FACS using fluorescent-conjugated lectins in retrograde labeled cells.
- the cells are isolated using the methods described in Schaffner et al, 1987, J Neurosci, 7(10):3088-104 and Armson and Bennett, 1983, Neurosci. Eett., 38(2):181-6, which are all incorporated herein by reference in their entireties.
- cells are isolated by panning on antibodies against cell surface markers.
- the antibody is a monoclonal antibody.
- Cells are isolated and characterized using methods known in the art described by Camu and Henderson, 1992, J Neurosci. Methods 44(l):59-79, Kashiwagi et al, 2000, 41(l):2373-7, Brocco and Panzetta, 1997, 75(l):15-20, Tanaka et al, 1997, Dev. Neurosci. 19(1):106-11, and Barres et al, 1988, Neuron l(9):791-803, which are all incorporated herein by reference in their entireties.
- cells are isolated using laser capture microdissection (LCM).
- LCM laser capture microdissection
- a collection of transgenic mouse lines of the invention is used to isolate neurons expressing the key gene that are located in the arcuate nucleus of the hypothalamus and that regulate feeding behavior.
- transgenic animal lines of the invention and cells isolated from the transgenic animal lines of the invention may be used for target validation, drug discovery, pharmacological, behavioral, electrophysiological, and gene expression assays, etc. but, preferably target validation.
- cells expressing the key gene and/or marker gene coding sequences are detected in vivo in the transgenic animal, or in explanted tissue or tissue slices from the transgenic animal, to analyze the population of cells marked by the expression of the key gene and/or marker gene coding sequences.
- the population of cells can be examined in transgenic animals in which a modulating construct comprising a particular drug target has or has not been introduced.
- the cells are detected by methods known in the art depending upon the marker gene . used (see Sections 5.1.3 and 5.6, above).
- the marker gene coding sequences encode or promote the production of an agent that enhances the contrast of the cells expressing the key gene coding sequences and such cells are detected by MRI.
- the transgenic animals may be bred to existing disease model animals or treated pharmacologically or surgically, or by any other means, to create a disease state in the transgenic animal. The animals can then be compared to such animals in which a modulating construct comprising a particular drug target has been introduced, e.g. for phenotypic changes, particularly changes in symptoms, indicators of the particular disease or disorder.
- treatments for the disease may be evaluated by administering a treatment (e.g. a candidate compound) to the transgenic mice of the invention expressing the target protein and, preferably that have been bred to a disease state or a disease model otherwise induced in the transgenic mice.
- a treatment e.g. a candidate compound
- the mice are then evaluated for morphological, physiological or electrophysiological changes, changes in gene expression, protein-protein interactions, protein profile in response to the treatment is an indication of efficacy or toxicity, etc. of the treatment.
- cells expressing the key gene and a potential drug target are isolated from the transgenic animal using methods known in the art, preferably, for analysis or for culture of the cells and subsequent analysis.
- the transgenic animal expressing the key gene and a potential drug target in a select population of cells may be subjected to a treatment (for example a surgical treatment or administered a candidate compound of interest) prior to isolation of the cells.
- the transgenic animal may be bred to a disease model or a disease state induced in the transgenic animal, for example by surgical or pharmacological manipulation, prior to isolation of the cells.
- the populations of cells expressing a key gene and a potential drug target of interest can be analyzed by any method known in the art.
- the gene expression profile of the cells is analyzed using any number of methods known in the art, for example but not by way of limitation, by isolating the mRNA from the isolated cells and then hybridizing the mRNA to a microarray to identify the genes which are or are not expressed in the isolated cells.
- Gene expression in cells treated and not treated with a compound of interest or in cells from animals treated or untreated with a particular treatment, e.g., surgical treatment, may be compared.
- mRNA from the isolated cells may also be analyzed, for example by northern blot analysis, PCR, RNase protection, etc., for the presence of mRNAs encoding certain protein products and for changes in the presence or levels of these mRNAs depending on the treatment of the cells.
- mRNA from the isolated cells may be used to produce a cDNA library and, in fact, a collection of such cell type specific cDNA libraries may be generated from different populations of isolated cells. Such cDNA libraries are useful to analyze gene expression, isolate and identify cell type-specific genes, splice variants and non-coding RNAs.
- such cell type specific libraries prepared from cells isolated from treated and untreated transgenic animals of the invention or from transgenic animals of the invention having and not having a disease state can be used, for example in subtractive hybridization procedures, to identify genes expressed at higher or lower levels in response to a particular treatment or in a disease state as compared to untreated transgenic animals.
- Data from such analyses may be used to generate a database of gene expression analysis for different populations of cells in the animal or in particular tissues or anatomical regions, for example, in the brain. Using such a database together with bioinformatics tools, such as
- specific cells or cell populations that express a potential drug target are isolated from the collection and analyzed for specific protein-protein interactions or an entire protein profile using proteomics methods known in the art, for example, chromatography, mass spectroscopy, 2D gel analysis, etc.
- assays may be used to analyze the cell population expressing the potential drug target, either in vivo, in explanted or sectioned tissue or in the isolated cells, for example, to monitor the response of the cells to a certain treatment or candidate compound or to compare the response of the animals, tissue or cells to expression of the target or inhibitor thereof, with animals, tissue or cells from animals not expressing the target or inhibitor thereof.
- the cells may be monitored, for example, but not by way of limitation, for changes in electrophysiology, physiology (for example, changes in physiological parameters of cells, such as intracellular or extracellular calcium or other ion concentration, change in pH, change in the presence or amount of second messengers, cell morphology, cell viability, indicators of apoptosis, secretion of secreted factors, cell replication, contact inhibition, etc.), morphology, etc.
- changes in electrophysiology for example, changes in physiological parameters of cells, such as intracellular or extracellular calcium or other ion concentration, change in pH, change in the presence or amount of second messengers, cell morphology, cell viability, indicators of apoptosis, secretion of secreted factors, cell replication, contact inhibition, etc.
- a subpopulation of cells in the isolated cells is identified and/or gene expression analyzed using the methods of Serafmi et al. (PCT Publication WO 99/29877, entitled Methods for Defining Cell Types, published June 17, 1999) which is hereby incorporated by reference in its entirety.
- This example describes the methods used for creation of a transgenic animal line for use in the drug validation methods of the invention.
- BAC libraries for various species (in the form of high density BAC colony DNA membrane)
- the BAC library is screened and positive clones are obtained, and the BACs for specific genes of interest are confirmed and mapped, as described in detail below.
- Overlapping oligonucleotide (“overgo") probes are highly useful for large-scale physical mapping and whenever sequence is available from which to design a probe for hybridization purposes.
- the short length of the overgo probe is advantageous when there is limited available sequence known from which to design the probe.
- overgo probes obviate the need to clone and characterize cDNA fragments, which traditionally have been used as hybridization probes.
- Overgo probes can be used for identifying homologous sequences on DNA macroarrays printed on nylon membranes (i.e., BAC DNA macroarrays) or for Southern blot analysis. This technique can be extended to any hybridization-based gene screening approach.
- the following protocol describes a method for generating hybridization probes of high specific activity and specificity when sequence data is available. The method is used for identifying homologous DNA sequences in arrays of BAC library clones.
- Overgo probes are designed through a multistep process designed to ensure several important qualities:
- Probes are designed with similar GC contents. This allows probes to be labeled to ⁇ similar specific activities and to hybridize with similar efficiencies, thus enabling a probe pooling strategy that is essential for high throughput screening of BAC library macroarrays.
- the starting point for overgo design is to obtain sequence information for the gene of interest.
- the software packages required for overgo design require this sequence to be in FASTA format.
- a sequence in FASTA format begins with a single-line description, followed by lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column. It is recommended that all lines of text be shorter than 80 characters in length. Sequences are expected to be represented in the standard IUB/IUPAC amino acid and nucleic acid codes, with these exceptions: lower-case letters are accepted and are mapped into upper-case; a single hyphen or dash can be used to represent a gap of indeterminate length; and in amino acid sequences, U and * are acceptable letters (see below).
- any numerical digits in the query sequence should either be removed or replaced by appropriate letter codes (e.g., N for unknown nucleic acid residue or X for unknown amino acid residue).
- appropriate letter codes e.g., N for unknown nucleic acid residue or X for unknown amino acid residue.
- the nucleic acid codes supported are: A --> adenosine M — > A C (amino)
- the sequence used for overgo design is preferably genomic, but cDNA sequences have been used successfully.
- programs known in the art such as OvergoMaker (John D. McPherson, Ph.D., Genome Sequencing Center/Department of Genetics, Washington University School of Medicine, Box 8501,4444 Forest Park Blvd., St. Louis, MO 63108) may be used.
- OvergoMaker John D. McPherson, Ph.D., Genome Sequencing Center/Department of Genetics, Washington University School of Medicine, Box 8501,4444 Forest Park Blvd., St. Louis, MO 63108
- ATG start codon
- the overgo design program scans sequences and identifies two overlapping 24mers that have a balanced GC content, and an overall GC content between 40-60%. Once gene specific overgos have been designed, they are checked for uniqueness by using the BLAST program (NCBI) to compare them to the nr nucleic acid database (NCBI). Overgos that have significant BLAST scores for genes other than the gene of interest, i.e., could hybridize to genes other than the gene of interest, are redesigned.
- NCBI BLAST program
- NCBI nr nucleic acid database
- an overgo probe a pair of 24mer oligonucleotides overlapping at the 3' ends by 8 base pairs are annealed to create double stranded DNA with 16 base pair overhangs. The resulting overhangs are filled in using Klenow fragment. Radionucleotides are incorporated during the fill-in process to label the resulting 40mer as it is synthesized.
- the overgo probe is then hybridized to immobilized BAC DNA. Following hybridization, the filter is washed to remove nonspecifically bound probe. Hybridization of specifically bound probe is visualized through autoradiography or phosphoimaging.
- Target BAC clone DNA immobilized on nylon filters for example, a macroarray of a BAC library, e.g., the CITB BAC library (Research Genetics) or the RPCI-23 library (BACPAC Resources, Children's Hospital Oakland Research Institute, Oakland, CA).
- a BAC library e.g., the CITB BAC library (Research Genetics) or the RPCI-23 library (BACPAC Resources, Children's Hospital Oakland Research Institute, Oakland, CA).
- 10 ⁇ Ci/ ⁇ l [ 32 P]dATP -3000 Ci/mmol, 1 OmCi/ml
- 10 ⁇ Ci/ ⁇ l [ 32 P]dCTP -3000 Ci/mmol, lOmCi/ml
- Wash Buffer B 1% SDS, 40 mM NaPO 4 , ImM EDTA, pH 8.0
- Wash Buffer 2 1.5x SSC, 0.1% SDS
- wash Buffer 3 0.5x SSC, 0.1% SDS
- Solution B 2 M HEPES-NaOH, pH 6.6 2.6 g HEPES to 5 ml ddH 2 O pH to 6.6 with approximately 2 drops 6M NaOH
- Annealing oligonucleotides to generate a overhang Step 1 combine 1.0 ⁇ l of partially complementary 10 ⁇ M oligos (1.0 ⁇ l forward primer + 1.0 ⁇ l reverse primer) with 3.5 ⁇ l ddH 2 O (10 pmol each oligo/reaction) to either a tube or microtiter plate well.
- Step 2 Cap each tube or microtiter well and heat the paired oligonucleotides for 5 min at 80 °C to denature the oligonucleotides.
- Step 3 Incubate the labeling reactions for 10 min at 37°C to form overhangs.
- Step 4 Store the annealed oligonucleotides on ice until they are labeled. If the labeling step is not done within 1 hour of annealing the oligonucleotides, repeat steps 2 and 3 before proceeding.
- thermocycler can be programmed to perform steps 2 through 4.
- Overgo probes can be labeled and hybridized using methods well-known in the art, for example, using the protocols described in Ross et al, 1999, Screening Large-Insert Libraries by Hybridization, In Current Protocols in Human Genetics, eds. N.C. Dracopoli, J.L. Haines, B.R. Korf, D.T. Moir, C.C. Morton, C.E. Seidman, J.G. Seidman, D.R. Smith. pp. 5.6.1-5.6.52 John Wiley and Sons, New York; incorporated herein by reference in its entirety.
- This protocol uses both [ 32 P]dATP and [ 32 P]dCTP for labeling. This is recommended; however, the composition of the dNTP mix in the overgo labeling buffer can be altered to allow different labeled deoxynucleotides to be used.
- the following method can be used as a quick measure of the success of the labeling reaction. Dilute the probes 1 : 100 (1 ⁇ l probe + 99 ⁇ l H 2 O), and use 1 ⁇ l of diluted probe for scintillation counting. For optimal hybridization, the probe specific activity should be approximately 5 x 10 5 cpm/ml. 6.1.1. BAC SCREENING
- BACs containing specific characterizing genes of interest are identified by using 32 P labeled overgo probes, as described above, to probe nylon membranes onto which BAC- containing bacterial colonies have been spotted.
- overgo probes is accomplished by hybridizing a single probe to BAC library filters, and identifying positive clones for that single characterizing gene of interest.
- the use of overgo probes makes it possible to adopt a probe pooling strategy that permits higher throughput while using fewer library filters.
- probes are arrayed into a two- dimensional matrix (i.e., 5x5 or 6x6). Then probes are combined into row and column pools (e.g., 10 pools total for a 5x5 array).
- Each probe pool is hybridized to a single copy of the BAC library filters (10 separate hybridizations) e.g., the CITB or RPCI-23 BAC library filters.
- clones hybridizing to each probe pool are manually identified. Assignment of positive clones to individual probes is done by pairwise comparisons between each row and each column. The intersection of each row pool and column pool defines a single probe within the probe array. Thus, all positive clones that are shared in common by a specific row pool and a specific column pool are known to hybridize to the probe defined by the unique intersection between the row and column. Deconvolution of hybridization data to assign positive clones to specific probes in the probe array is done manually, or by using an MICROSOFT EXCELTM-based Visual Basic program.
- the nylon filters are prehybridized by wetting with 60 °C Church's hybridization buffer and rolling the filters into a hybridization bottle filled halfway or approximately 150 ml of 60 °C Church's hybridization buffer. All of the filters are rolled in the same direction (DNA and writing side up), with a nylon mesh spacer in between each and on top, and the bottle is placed in the oven to keep them rolled. The rotation speed is set to 8-9 speed. The filter is incubated at 60 °C for at least 4 hours the first time (1-2 hours for subsequent prehybridizations of the same filters).
- labeled probes are denatured by heating to 100°C for 10 min and then placed on slushy ice for 2 min or longer. 5 The Church's hybridization buffer is replaced before adding probes if the filter is used for the first time. Filters are incubated with the probe at 60°C overnight. The rotation speed is set to 8-9 speed.
- the Church's hybridization buffer is drained from the bottle and 100 ml Washing Buffer B pre-heated to 60 °C is added.
- the hybridization bottle is returned to 10 the incubation oven for 30 min.
- the rotation speed is set to 8-9 speed.
- Church's hybridization buffer and Washing Buffer B are radioactive and must be disposed of in a liquid radioactive waste container.
- Washing Buffer B is drained from the bottle and 80 ml Washing Buffer 2 pre-heated to 60 °C is added.
- the hybridization bottle is returned to the incubation oven for 20 min. 15
- the rotation speed is set to 8-9 speed.
- Washing Buffer 2 is drained from the bottle and 80 ml Washing Buffer 2 pre-heated to 60 °C is added.
- the hybridization bottle is returned to the incubation oven for 20 min.
- the rotation speed is set to 8-9 speed.
- Filters are removed from the hybridization bottles and washed in a shaking bath for 20 5 min. at 60°C with 2.5 L Washing Buffer 3, shaking slowly, without overwashing.
- Filters are removed from the bath, spacers are set aside, and placed in individual Kapak, 10" x 12," Sealpak pouches. All air bubbles are removed by rolling with a glass pipette. The pouches are sealed and checked for leaks. A damp tissue removes any 25 remaining solution on the outside of the bag.
- Each filter is placed in an autoradiograph cassette at room temperature with an intensifying screen. An overnight exposure at room temperature is usually adequate. Alternatively, the data can be collected using a phosphorimager if available.
- Probes may be stripped from the filters (not routinely done) by washing in 1.5 L 30 70 °C Stripping Buffer for 30 min. Counts are checked with a survey meter to verify the efficacy of stripping procedure. This is repeated for an additional 10 minutes, if necessary. Filters should not be overstripped. Overstripping removes BAC DNA and reduces the life of the filters.
- Stripping may be incomplete, so it is preferable to autoradiograph the stripped filter 35 if residual probe may confuse subsequent hybridization results. Identification and confirmation of clones
- the CTIB and RPCI-23 BAC library filters come as sets of 5-10 filters that have 30- 50,000 clones spotted in duplicate on each filter. Following autoradiography, positive clones appear as small dark spots. Because clones are spotted in duplicate, true positives always appear as twin spots within a subdivision of the macroarray. Using templates and positioning aids provided by the filter manufacturer, unique clone identities are obtained for each positive clone. Once the identities of clones for each probe have been identified, they are ordered from BACPAC Resources (Children's Hospital Oakland - Bacpac Resources 747 52nd St., Oakland, CA 94609) or Research Genetics (ResGen, an Invitrogen Corporation, 2130 Memorial Parkway, Huntsville, AL 35801).
- each clones is rescreened by PCR using gene specific primers that amplify a portion of the 5' or the 3' end of the gene. In some cases, clones are tested for the presence of both 5' and 3' end amplicons.
- Other BAC libraries including those from noncommercial sources may be used. Clones may be identified using the hybridization method described above to filters with arrayed clones having an identifiable location on the filter so that the corresponding BAC of any positive spots can be obtained.
- Dispense 20 ⁇ l of reaction mix to PCR tubes Use a 20 ⁇ l thin tip to transfer a colony from plate to the PCR tube. Pipet up and down a couple of times to dispense the colony into the PCR mixture. Include positive control (genomic DNA) and negative control (no DNA template).
- TPF TIGR PROCIPITATETM FILTER METHOD BAC ISOLATION PROTOCOL
- the position of the gene within the BAC must be determined.
- the BAC contain the transcriptional control elements required for wild-type expression.
- the characterizing gene lies near the center of a 5 BAC that is 150-200 kb in length, then the BAC will likely contain the control elements required to reproduce the wild type expression pattern.
- Fingerprinting methods rely on genome mapping technology to assemble BACs containing the characterizing gene of interest into a contig, i.e., a continuous set of overlapping clones. Once a contig has been assembled, it is straightforward to identify 1 or 2 center clones in the contig. Since all clones in the contig hybridize to the 5' end of the gene (because the probe sequence is designed to hybridize at or near the start codon of the
- the center clones of the contig should have the gene in the central- most position.
- a mouse BAC library e.g., a RPCI-23 BAC library
- Soderlund et al. 2000, Genome Res. 10(11): 1772-87; incorporated herein by reference in its entirety.
- BACs are fingerprinted using Hindlll digestion digests. Digests are run out on 1% agarose gels, stained with sybr green (Molecular Probes) and then visualized on a Typhoon fluoroimager (Amersham Pharmacia).
- BAC fingerprint information has been generated by the University of British Columbia Genome Mapping Project (Genome Sequence Centre, BC Cancer Agency, 600 West 10th Avenue, Vancouver, British Columbia, V5Z 4E6) and can be used for assembling BAC contigs.
- contig information from publicly available databases is used to select clones for BAC modification as described above.
- the shuttle vector (containing the homology region and the key gene coding sequences) integrates into the BAC to form the cointegrate.
- This process introduces a unique Asc-1 restriction site into the BAC at the site of cointegration. It is possible to map the position of this site, by first cutting the cointegrate with Not-1, which releases the BAC insert (approx 150-200 kb) from the BAC vector. Subsequent digestion with Asc-1 (which cuts very rarely in mammalian genomes), should cleave the BAC insert once, yielding two fragments. The fragment sizes can be accurately resolved using the CHEF gel mapping system (Bio-Rad).
- the insert should be cleaved into 2 nearly equal fragments of large size (-75-100 kb each). If the Asc-1 site is located asymmetrically, then the homology region is not centered in the BAC, and thus is not a good candidate for transgenesis. Alternatively, if the size of the smaller fragment falls below a predetermined size (for example 50 kb), then that BAC should be ruled out as a candidate.
- a predetermined size for example 50 kb
- the fingerprinting method described above can also be used to generate additional fingerprint data.
- This data is used to generate contigs of currently uncontigged BACs from which center clones can be selected.
- this data can be combined with data from publicly available databases to generate novel contig information.
- the following alternative mapping method is used to roughly localize a gene within a BAC clone.
- This method takes advantage of the fact that one end of the BAC genomic insert is linked to the SP6 promoter while the other end is linked to the T7 promoter.
- the alternative mapping method involves the following steps: a) digestion with notl to release the BAC insert b) digestion with another enzyme that cuts no more than 4-7 times in the BAC (in practice, we usually use several different enzymes). Digests are run out on a 0.7% agarose gel. c) The gel is transferred to nylon, hybridized to alkaline phosphatase conjugated T7 oligo probe-develop and the blot is exposed according to the alternative mapping protocol described below.
- This step identifies that fragment containing the T7 end of the BAC insert.
- d) Hybridization to alkaline phosphatase conjugated SP6 oligo probe.
- the blot is developed and exposed according to the alternative mapping protocol described below. This identifies fragment containing the SP6 end of the BAC insert.
- e) Finally, the blot is hybridized to a gene specific probe. This identifies which fragment contains the gene.
- Loading dye is added (orange dye preferred for Typhoon fluoroimager) to the above entire reaction, and the reactions are loaded into a 0.7% agarose gel. The gel is run . at 80V (for a 7x11 inch large gel) overnight.
- the gel is stained with Vista green (1 :10,000 dilution in TAE buffer) for 10-20 min and imaged on a Typhoon fluoroimager (Amersham Pharmacia) using the
- Fluorescence mode 526 SP/Green (532nm) setting. The gain and sensitivity are varied until the bands look dark but not saturated. Alternatively, bands can usually be visualized using standard ethidium bromide stain and visualized on a UV lightbox. 4.
- the gel is transferred into a large container and depurinated with 0.125M HCI for 10 min, rinsed with ddH 2 0 once, then neutralized with 1.5M NaCI and 0.5M Tris-HCl (pH 7.5) for 30 min, and denatured with 0.5M NaOH and 1.5M NaCI for 30 min. 5.
- a capillary wet transfer in 0.5M NaOH and 1.5M NaCI is set up, following the instructions that come with the H+ nylon membrane, and the transfer runs overnight. 6.
- T7 and SP7 hybridizations and exposures are done sequentially and are not to be performed together.
- Wash buffer #1 and wash buffer #2 are prewarmed at 37 °C.
- the membrane is prewet with ddH 2 O.
- the membrane is prehybridized in hybridization buffer at 37 °C for 10 min.
- exact 50 ⁇ l of buffer is used per 1.0 cm 2 of membrane.
- the probe is diluted to a 2 nM final concentration 10 in hybridization buffer.
- the volume is calculated as done in step 8.
- the correct probe concentration is crucial.
- the tubes containing these solutions are incubated at 37 °C during the prehybridization step.
- the membrane should not dry out during the following wash, detection and film exposure.
- Buffer 1 is removed and prewarmed buffer 2 is added. Washes are done as in step 11 for another 10 min.
- the substrate buffer is prepared and 50 ⁇ l is used per 1.0 cm 2 of membrane.
- the membrane is rinsed 2 times for 5 min. each in assay buffer.
- the membrane is 30 incubated in substrate buffer inside heat-sealable bags at RT for 10 min. while manually agitating the bag to ensure that the membranes are covered with substrate buffer.
- the membrane is removed from the substrate buffer and placed into a seal bag and exposed to KODAK® film (Eastman Kodak Co.) immediately.
- Probes are labeled using purified PCR product as a template with the Ready-Prime kit. The prehybridization and hybridization steps are carried out as in standard Southern blot hybridization. The membranes are exposed at room temperature or at 37°C. Alternatively, one can probe with a gene-specific overgo probe using the
- the two blots are aligned with the original DNA gel. Positive bands are identified for T7/SP6 and the gene-specific probe.
- Wash buffer 1 2x SSC 1% (w/v) SDS 2. Wash buffer 2:
- the construct comprises at least a portion of the characterizing gene with a desired genetic modification, e.g., insertion of the key gene coding sequences and will include regions of homology to the target locus, i.e., the endogenous copy of the characterizing gene in the host's genome.
- a homologous recombination shuttle vector is prepared in which the key gene is positioned next to characterizing gene sequences to allow for homologous recombination to occur between the exogenous gene carried by the shuttle vector and the characterizing gene sequences on the BAC.
- the additional flanking nucleic acid sequences are of sufficient length for successful homologous recombination with the characterizing gene on the BAC.
- Homology boxes are these regions of DNA and are used to direct site specific recombination between a shuttle vector and a BAC of interest.
- the homologous regions comprise the 3' portion of the characterizing gene.
- the homologous regions comprise the 5' portion of the characterizing gene, more preferably to target integration of the key gene coding sequences in frame with the ATC of the characterizing gene sequences.
- PCR is used for cloning a homology box from genomic DNA or BAC DNA. The homology box is cloned into the shuttle vector that is used for BAC recombination, as described below.
- Primer3 program Massachusetts Institute of Technology Cambridge, MA; Steve Rozen, Helen J. Skaletsky, 1998, Primer3
- a Ascl site is added in the 5' forward primer and a Smal site is added in the 3' reverse primer.
- primers are designed so that they have T m s of 57-60°C and so that the amplicons are between 300 and 500 bp in length. If a 5' UTR sequence of the characterizing gene sequence is available, amplicons are designed against this sequence. If the 5' UTR sequence is not available, then homology boxes are designed to include the 3' UTR or the 3' stop codon, or any other desired region of the characterizing gene.
- PCR reactions are performed with the following reagents: 1.0 ⁇ l Mouse genomic DNA or BAC having characterizing gene insert (500ng/ ⁇ l)
- DNA template for PCR should be from the BAC to be modified, or genomic DNA from the same strain of mouse from which the BAC library was constructed.
- the homology boxes must be cloned from the same mouse strain as the BACs to be modified.
- Pfu DNA polymerase (Stratagene) is used. This reduces errors introduced into the amplified sequence via PCR with Taq polymerase. Total volume is 25 ⁇ l.
- PCR reactions are run on a thermal cycler using the following program:
- a TOPO-TA cloning kit (Invitrogen) may be used to clone the PCR product. Ligation reactions are carried out at room temperature for 3 min with the following reagents:
- a blue-white selection is used (spreading IPTG and X-gal solutions on the LB-Amp plates prior to plating the transformation mixture).
- Preparation of cointegrates of the BAC and a shuttle vector may be prepared as follows.
- a shuttle vector containing IRES, GFP and the homology box (FIGS. 12 and 13; see PCT publication WO 01/05962), containing the key gene of interest is transformed into competent cells containing the BAC of interest by electroporation using the following protocol.
- a 40- ⁇ l aliquot of the BAC-containing competent cells is thawed on ice, the aliquot is mixed with 2 ⁇ l of DNA(0.5 ⁇ g / ⁇ l), and the mixture is placed on ice for 1 minute. Each sample is transferred to a cold 0.1 cm cuvette.
- a Gene Pulser apparatus (Bio-Rad) is used to carry out the electroporation.
- the Gene Pulser apparatus is set to 25 ⁇ f, the voltage to 1.8KV and pulse controller to 200 ⁇ . lml SOC is added to each cuvette immediately after conducting the electroporation. The cells are resuspended. The cell suspension is transferred to a 17x100mm polypropylene tube and incubated at 37° C for one hour with shaking at 225 RPM.
- the 1 ml culture is spun off and plated onto one chloramphenicol (Chi) (12.5 ⁇ g/ml) and ampicillin (Amp) (50 ⁇ g/ml) plate and incubated at 37 °C for 16-20 hours.
- the colonies are picked and inoculated with 5ml LB supplemented with Chi (12.5 ⁇ g/ml) and Amp (50 ⁇ g/ml), and incubated at 37°C overnight.
- Miniprep DNA from 3 ml of culture by alkaline lysis method described supra.
- Cointegrates for each clone are identified by Southern blot. Using a homology box as a probe in Southern blot analysis, the cointegrate can be identified by the appearance of an additional homology box that is introduced via the recombination process.
- the resolved clones i.e., clones in which the shuttle vector sequences have been removed, leaving the key gene sequences
- the resolved clones i.e., clones in which the shuttle vector sequences have been removed, leaving the key gene sequences
- each colony of cointegrate from the Chi/ Amp plates is picked and used to innoculate 5ml of LB + Chi (12.5 ⁇ g/ml) and 6% sucrose, and incubated at 37 °C for 8 hours.
- the culture is diluted 1 :5000 and plated on the agar plate with Chi (12.5 ⁇ g/ml) and 6% sucrose and incubated at 37 °C overnight. Five colonies per plate are picked and inoculated with 5ml of LB + Chl(12.5 ⁇ g/ml) only and incubated at 37 °C overnight. DNA from those cultures are miniprepped by alkaline lysis method known in the art. The resolved BACs are screened by Southern blot.
- preparation of cointegrates of the BAC and a shuttle vector may be prepared as follows.
- PCR amplify using an enzyme that does not leave an overhang, such as Pfu DNA polymerase
- a 300-500 bp "A box" homology regions from C57bl/6J genomic DNA using primers to the gene of interest (see Section 6.2, cloning homology boxes).
- Use of the 5' primer results in incorporation of an Ascl site.
- the A box should not contain an internal Asc I site. If the A box contains an Ascl site, then incorporate an Mlul site using the 5' primer and use that enzyme for cloning.
- this shuttle vector contains a R6kr DNA replication origin, which can only replicate in bacteria expressing the pir replication protein, use of pir2 cells (Invitrogen) is preferable.
- PLD53PA Transform pLD 53 -modified shuttle vector (PLD53PA) containing the gene of interest into BAC competent cells by electroporation: Thaw 40 ⁇ l of the BAC containing competent cells on ice, mix it with 2 ⁇ l of DNA (0.5 ⁇ g/ ⁇ l), and place the mixture on ice for 1 minute. Transfer each sample to a cold 0.1cm cuvette. Use a Gene Pulser apparatus to carry out the electroporation. Set the Gene Pulser apparatus at 25 ⁇ F, the voltage to 1.8KV and pulse controller to 200 ⁇ .
- PCR or Southern blotting is performed to ensure that the first step of recombination has occurred properly.
- this step may be verified to determine that the key gene sequences have been juxtaposed adjacent to the characterizing gene sequences.
- the vector sequences are removed in a resolution step, as described in WO 01/05962, herein incorporated by reference in its entirety. After cointegrates are resolved, Southern blotting and PCR are used to confirm that resolution products are correct, i.e., the only modification to the BAC is that the reporter has been inserted at the homology box. 6.4. CHEF MAPPING
- the protocol describes the CHEF gel mapping system (Bio-Rad). The protocol is run according to the manufacturer's instructions in the Bio-Rad CHEF gel mapping system reference manual. Restriction mapping is described in general in Section 6.1.5.
- Unmodified BAC from 3ml prep total 50ul: 3ul in three digests (Notl, Ascl, Notl/ Ascl double)
- Col BAC (from 96 prep total 30ul): 5ul in three digests (Notl, Ascl, Notl/ Ascl double) NEB low range PFG marker: small piece of agar to put into the well
- Hybridization with AP-T7 or AP-SP6 probe Prehybridization: in small roller bottle, at 37°C for lhr, 50 ul of buffer/ 1 cm 2 of membrane.
- Hybridization buffer 1 X SSC, 1% SDS, 0.5% BSA, 0.5% PVP, 0.01% NaN3
- Hybridization add fresh, warmed hybridization buffer (50 ul of buffer/1 cm 2 of membrane), and add in the probe at 2 nM final concentration. Run the hybridization at 37 °C overnight. Wash in: 2XSSC/1% SDS, 37 °C, 30 min
- AP reaction prepare CSPD substrate (Roche) in substrate buffer (50 ul of buffer/ 1 cm 2 of membrane). Dilute it 1 :100 to use.
- BAC DNA is preferably purified using one of the two following alternative methods and is then used for pronuclear injection or other methods known in the art to create transgenic mice.
- the injection concentration is preferably 1 ng/ ⁇ l.
- MAXIPREP BY ALKALINE LYSIS FOR BACS (ALTERNATIVE 1 1. 250 ml cultures are centrifuged to pellet bacteria.
- the pellet is resuspended in PI buffer (RNase-free, Qiagen), 20 ml, by pipetting.
- PI buffer RNase-free, Qiagen
- Cells are lysed for 4-5 min in P2 buffer (Qiagen), 40 ml, by inversion or swirling.
- the pellet is spun down on a swing bucket rotor at maximum speed for 20 min. 6.
- the supernatant is filtered through four layers of cheesecloth into clean 250 ml tubes.
- DNA is precipitated with 5ml 5M LiCI (final cone. 2.5M), on ice for 10 min. 10. Precipitate is spun at 4000 rpm for 20 min by a Sorval tabletop centrifuge.
- the supernatant is transferred to fresh 50 ml Falcon tubes.
- the precipitate is spun at 4000 rpm for 20 min on Sorval tabletop centrifuge.
- the DNA is resuspended in 500 ⁇ TE.
- RNase A is added to a final concentration of 25 ⁇ g/ml. (Qiagen).
- the DNA is incubated for 1 hr at 37°C.
- the DNA is phenol extracted 10 min on ADAMSTM Nutator Mixer (BD Diagnostic Systems).
- the pellet is resuspended in 50 ⁇ l TE.
- the DNA is purified for injection by either treatment with plasmid safe endonuclease (Epicenter Technologies) or by gel filtration using Sephacryl S-500 column or CL4b Sepharose column (both from Amersham Pharmacia Biotech).
- Step 8 Decant the supernatant into a clean 250 ml centrifuge bottle. If supernatant is 15 cloudy or contains floating material, repeat centrifugation (Step 8) before proceeding.
- Ethidium bromide will form a complex with the remaining protein to form a deep red flocculent precipitate. Centrifuge 5 minutes at 2000 x g. This will cause to the complex to form a disc at the top of the solution. Carefully transfer the- solution
- Phenol/chloroform extract (no vortex, gentle agitation)
- the FVB female egg donors are checked for copulation plugs (8:00AM), sacrificed via cervical dislocation, the oviducts harvested and the embryos are isolated from the oviducts for subsequent microinjection. Microinjection generally takes place between 10:00 AM and 2:00PM.
- the injection concentration is preferably lng/ ⁇ l.
- Injected embryos are transferred into the oviducts of ICR outbred strain pseudopregnant female mice. 20-25 eggs are transferred unilaterally into an oviduct. 19 days later the pups are bom.
- DNA is extracted from the tail biopsy (see tail biopsy protocol disclosed hereinbelow in Section 6.7).
- Lysis buffer 100 mM Tris HCl pH 8.5 5 mM EDTA 0.2% SDS 200 mM NaCI
- Resuspend pellets in 300 ⁇ Lo TE Briefly vortex and place in a 65°C incubator with agitation to aid in resuspension. The length of time needed to completely resuspend pellets may vary but usually falls within the range of 20 min - 1.5 hrs. Periodically check the samples until the desired suspension is attained.
- Total reaction volume is 50ul in the above example. If the total volume of the DNA required for the reaction is not lul then adjust the amount of H 2 O accordingly.
- GFP primers egfpl32F CCTGAAGTTCATCTGCACCA (SEQ ID NO:2)
- Amount of source DNA 100 ng
- Amount of fragment used in one copy control 0.7 pg
- Step 1 3 min at 95° C (hot start) Denaturing Temperature: 95 °C Denaturing Time: 30 sec Annealing Temperature: 58 °C Annealing Time: 30 sec Extension Temperature: 74 °C Extension Time: 45 sec Number of Cycles: 30
- the presence of positive GFP PCR product indicates that the transgenic mouse test carries the gene of interest.
- transgenic mouse line expressing the 5HT6 receptor BAC, according to the methods of the invention disclosed hereinabove.
- a transgenic mouse line expressing the 5HT6 receptor BAC was constructed as follows.
- BAC clones were identified using the overgo probe in a screen of CITB filters (see Section 6.1). PCR (Section 6.8) was used to verify BACs as containing the 5HT6 gene.
- the A box was cloned into a shuttle vector such that recombination with the 5HT6 gene in a BAC would place an IRES-EGFP sequence downstream of the stop codon in the
- a DNA fingerprint (performed as disclosed in Section 6.1.5) is shown in FIG. 1A.
- FIG. IB A corresponding Southern blot, shown in FIG. IB, was used to verify duplication of A boxes in cointegrate clones.
- CHEF mapping (see Section 6.4) was used to determine that one of the BACs was constructed such that one of the BAC clones had a sufficiently large DNA fragment upstream of the 5HT6 start site (FIG. 2).
- Sections of brain tissue showed that the transgene was indeed expressed in subsets of neurons in the transgenic animals (FIGS. 4 and 5).
- a transgenic mouse line expressing the 5HT2A receptor BAC was constructed as follows. An overgo probe was made for the 5HT6 gene as described in Section 6.1 using the following oligos.
- 5HT2A-5'SmaRl GTCTCCCGGGAAAAGCCGGAAGTTGTAGCAGA (SEQ ID NO: 12)
- the A box was cloned into a shuttle vector such that recombination with the 5HT2A gene in a BAC would place an Emerald sequence at the 5' end of the 5HT2A gene such that expression of the gene would result in only Emerald production, and not 5HT2A production.
- FIG. 6 A DNA fingerprint (performed as disclosed in Section 6.1.5) is shown in FIG. 6.
- FIG. 7 A corresponding Southern blot, shown in FIG. 7, was used to verify duplication of A boxes in cointegrate clones.
- CHEF mapping (see Sections 6.1.5 and 6.4) was used to determine that one of the
- BACs was constructed such that one of the BAC clones had a sufficiently large DNA fragment upstream of the 5HT6 start site (FIG. 8).
- transgenic animals were constructed (Section 6.6), and genotyped for the presence of GFP sequences (Sections 6.7 and 6.8). Founders were bred in order to obtain progeny containing the transgene (and verify that a line had indeed been established). Again, PCR (Section 6.8) was used to genotype FI animals. Sections of brain tissue showed that the transgene was indeed expressed in subsets of neurons in the transgenic animals (FIG. 11, arrows point to two fluorescent cells).
- useable BACs comprising a gene of interest in approximately 96% of cases.
- useable BACs typically all can be can be converted to recombinant BACs and used to create transgenic founder animals according to the methods of the invention.
- a transgenic construct is designed that contains two expression modules: (1) a Cre recombinase (key protein)-encoding sequence under the regulation of the rtTA-responsive hybrid promoter consisting of a tetO heptad repeat and a characterizing gene regulatory element (e.g., a hCMV minimal promoter), and (2) a rtTA cassette containing rtTA encoding sequence and SV40 polyadenylation site.
- Cre recombinase key protein
- rtTA-responsive hybrid promoter consisting of a tetO heptad repeat and a characterizing gene regulatory element (e.g., a hCMV minimal promoter)
- a rtTA cassette containing rtTA encoding sequence and SV40 polyadenylation site.
- conditional Cre-loxP-mediated recombination is as follows. Without doxycycline, rtTA is inert and unable to activate transcription of the key protein, Cre recombinase. In the presence of doxycycline, rtTA binds to the tetO-characterizing gene promoter leading to Cre expression. Cre-mediated DNA recombination is assayed as follows. In the absence of Cre, expression of the potential drug target gene from the modulating construct is prevented by the intervening transcriptional STOP sequence flanked by loxP sites. Cre-mediated DNA recombination results in removal of the STOP sequence followed by potential drug target expression.
- the modulating construct is introduced using transduction methods described in Deglon et al. (2000, Human Gene Therapy 11 :179-190; inco ⁇ orated herein by reference in its entirety).
- Deglon et al. describe methods for producing and introducing a self-inactivating (non-reproducing) lentiviral vector with enhanced transgene expression into a selected cell population, e.g., neurons in a particular brain region.
- the self-inactivating vector is used to transduce, and localize delivery of a potential drug target to, a select population of neurons.
- the self-inactivating (SIN) lentiviral vector is modified using the methods of Deglon et al. by insertion of the posttranscriptional regulatory element
- the lentiviral vector comprising the modulating construct is also modified so that it has an improved ability to transduce the cells into which it is introduced.
- the methods of Zennou et al. are used to inco ⁇ orate a central DNA flap into the vector (2000, Cell 101,
- Lentiviruses have the unique property among retroviruses of replicating in nondividing cells. This property relies on the use of a nuclear import pathway enabling the viral DNA to cross the nuclear membrane of the host cell.
- HIV-1 reverse transcription, a central strand displacement event consecutive to central initiation and termination of plus strand synthesis, creates a plus
- a key determinant for nuclear import of lentiviral genomes, e.g., HIV-1 genome, is therefore the central DNA flap: the central DNA flap acts as a cis-determinant of HIV-1 DNA nuclear import.
- a self-inactivating or non-reproducing lentiviral vector comprising the modulating construct is designed using the methods of Zennou et al. The vector comprises a reinsertion of the DNA flap sequence, thereby
- 2 ml of the modified, potential drug target-expressing lentiviral vector is injected into the cell population or region of interest, e.g., a select population of neurons.
Abstract
Description
Claims
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2002306684A AU2002306684A1 (en) | 2001-03-12 | 2002-03-12 | Method of drug target validation |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US27507301P | 2001-03-12 | 2001-03-12 | |
US60/275,073 | 2001-03-12 |
Publications (3)
Publication Number | Publication Date |
---|---|
WO2002072017A2 true WO2002072017A2 (en) | 2002-09-19 |
WO2002072017A3 WO2002072017A3 (en) | 2003-02-27 |
WO2002072017A9 WO2002072017A9 (en) | 2004-05-06 |
Family
ID=23050776
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2002/007294 WO2002072017A2 (en) | 2001-03-12 | 2002-03-12 | Method of drug target validation |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU2002306684A1 (en) |
WO (1) | WO2002072017A2 (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5859311A (en) * | 1995-11-27 | 1999-01-12 | University Of Kentucky Research Foundation | Transgenic mice which overexpress neurotrophin-3 (NT-3) and methods of use |
US5912411A (en) * | 1993-06-14 | 1999-06-15 | University Of Heidelberg | Mice transgenic for a tetracycline-inducible transcriptional activator |
-
2002
- 2002-03-12 AU AU2002306684A patent/AU2002306684A1/en not_active Abandoned
- 2002-03-12 WO PCT/US2002/007294 patent/WO2002072017A2/en not_active Application Discontinuation
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5912411A (en) * | 1993-06-14 | 1999-06-15 | University Of Heidelberg | Mice transgenic for a tetracycline-inducible transcriptional activator |
US5859311A (en) * | 1995-11-27 | 1999-01-12 | University Of Kentucky Research Foundation | Transgenic mice which overexpress neurotrophin-3 (NT-3) and methods of use |
Also Published As
Publication number | Publication date |
---|---|
AU2002306684A1 (en) | 2002-09-24 |
WO2002072017A9 (en) | 2004-05-06 |
WO2002072017A3 (en) | 2003-02-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zinyk et al. | Fate mapping of the mouse midbrain–hindbrain constriction using a site-specific recombination system | |
US7098031B2 (en) | Random integration of a polynucleotide by in vivo linearization | |
US20030106074A1 (en) | Collections of transgenic animal lines (living library) | |
Schultze et al. | Efficient control of gene expression by single step integration of the tetracycline system in transgenic mice | |
Sprengel et al. | Tetracycline-controlled genetic switches | |
WO2017104404A1 (en) | Genetic modification non-human organism, egg cells, fertilized eggs, and method for modifying target genes | |
MX2007014139A (en) | Piggybac as a tool for genetic manipulation and analysis in vertebrates. | |
JP2004033209A (en) | Artificial chromosome, use of the chromosome, and method for producing the artificial chromosome | |
US20040234504A1 (en) | Methods of inhibiting gene expression by RNA interference | |
AU2002333832A1 (en) | Random integration of a polynucleotide after in vivo linearization | |
Chen et al. | Transgenic technology in marine organisms | |
US5589392A (en) | Nucleic acid construct encoding a nuclear transport peptide operatively linked to an inducible promoter | |
Boyd et al. | Molecular biology of transgenic animals | |
CN106521638A (en) | Resource library of rat with gene mutation and preparation method thereof | |
JP5093776B2 (en) | A model animal capable of observing the state of a disease state in real time, a genetic construct enabling it, and use thereof | |
WO2005054463A1 (en) | Development of mammalian genome modification technique using retrotransposon | |
Furuta et al. | Recent innovations in tissue‐specific gene modifications in the mouse | |
Brandt et al. | Defining the functional boundaries of the Gata2 locus by rescue with a linked bacterial artificial chromosome transgene | |
Twyman | Gene transfer to animal cells | |
WO2002072017A2 (en) | Method of drug target validation | |
Kim | Improvement and establishment of the tTA-dependent inducible system in the mouse brain | |
ES2338971B1 (en) | ANIMAL MODEL FOR THE STUDY OF LIVING ANGIOGENESIS AND LYMPHANGIOGENESIS. | |
CN113444722A (en) | Application of single base editing mediated splicing repair in preparation of drugs for treating spinal muscular atrophy | |
Gadwe | AN OVERVIEW OF METHODS USED FOR PRODUCTION OF TRANSGENIC ANIMALS AND TRANSGENE DIAGNOSTICS | |
WO2009103978A2 (en) | Biological materials and uses thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
AK | Designated states |
Kind code of ref document: A3 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A3 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase | ||
COP | Corrected version of pamphlet |
Free format text: PAGES 1/13-13/13, DRAWINGS, REPLACED BY NEW PAGES 1/13-13/13; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE |
|
NENP | Non-entry into the national phase in: |
Ref country code: JP |
|
WWW | Wipo information: withdrawn in national office |
Country of ref document: JP |