US20120058078A1 - Assay for the detection of factors that modulate the expression of ingap - Google Patents

Assay for the detection of factors that modulate the expression of ingap Download PDF

Info

Publication number
US20120058078A1
US20120058078A1 US13/195,596 US201113195596A US2012058078A1 US 20120058078 A1 US20120058078 A1 US 20120058078A1 US 201113195596 A US201113195596 A US 201113195596A US 2012058078 A1 US2012058078 A1 US 2012058078A1
Authority
US
United States
Prior art keywords
seq
factor
ingap
expression
binding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/195,596
Inventor
David A. Taylor-Fishwick
Aaron I. Vinik
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
GMP Endotherapeutics Inc
Original Assignee
GMP Endotherapeutics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by GMP Endotherapeutics Inc filed Critical GMP Endotherapeutics Inc
Priority to US13/195,596 priority Critical patent/US20120058078A1/en
Publication of US20120058078A1 publication Critical patent/US20120058078A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/111General methods applicable to biologically active non-coding nucleic acids
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P3/00Drugs for disorders of the metabolism
    • A61P3/04Anorexiants; Antiobesity agents
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P3/00Drugs for disorders of the metabolism
    • A61P3/08Drugs for disorders of the metabolism for glucose homeostasis
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P3/00Drugs for disorders of the metabolism
    • A61P3/08Drugs for disorders of the metabolism for glucose homeostasis
    • A61P3/10Drugs for disorders of the metabolism for glucose homeostasis for hyperglycaemia, e.g. antidiabetics
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P43/00Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P5/00Drugs for disorders of the endocrine system
    • A61P5/48Drugs for disorders of the endocrine system of the pancreatic hormones
    • A61P5/50Drugs for disorders of the endocrine system of the pancreatic hormones for increasing or potentiating the activity of insulin
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • C07K14/4701Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
    • C07K14/4733Acute pancreatitis-associated protein
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • C07K14/4701Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
    • C07K14/474Pancreatic thread protein; Reg protein
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6811Selection methods for production or design of target specific oligonucleotides or binding molecules
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6897Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids involving reporter genes operably linked to promoters
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2333/00Assays involving biological materials from specific organisms or of a specific nature
    • G01N2333/435Assays involving biological materials from specific organisms or of a specific nature from animals; from humans
    • G01N2333/46Assays involving biological materials from specific organisms or of a specific nature from animals; from humans from vertebrates
    • G01N2333/47Assays involving proteins of known structure or function as defined in the subgroups
    • G01N2333/4701Details
    • G01N2333/4733Acute pancreatitis-associated protein
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2333/00Assays involving biological materials from specific organisms or of a specific nature
    • G01N2333/435Assays involving biological materials from specific organisms or of a specific nature from animals; from humans
    • G01N2333/46Assays involving biological materials from specific organisms or of a specific nature from animals; from humans from vertebrates
    • G01N2333/47Assays involving proteins of known structure or function as defined in the subgroups
    • G01N2333/4701Details
    • G01N2333/474Pancreatic thread protein; Reg protein

Definitions

  • the invention relates to the field of assays for the detection of factors that modulate gene expression. Specifically, the invention relates to reporter constructs and methods for identifying agents that modulate the expression of the INGAP gene.
  • Islet neogenesis gene associated protein has been identified as a pancreatic acinar cell protein that can induce islet cell neogenesis from progenitor cells resident in the pancreas in a manner that recapitulates islet development during normal embryogenesis.
  • INGAP is unique in its ability to stimulate growth and differentiation of islets of Langerhans from precursor cells associated with pancreas. These islets evolve a mature insulin secretory profile capable of responding to perturbations in blood glucose in a physiologic manner. This potential anti-diabetic therapeutic has been shown to demonstrate homology across several species and to exert a biological response.
  • pancreatic islet cell mass is lost in type 1 diabetes mellitus, a disease in which a progressive autoimmune reaction results in the selective destruction of insulin-producing ⁇ -cells.
  • type 2 diabetes mellitus so-called adult-onset disease, but also increasingly a condition in young overweight people, the ⁇ -cell mass may be reduced by as much as 60% of normal.
  • the number of functioning ⁇ -cells in the pancreas is of critical significance for the development, course, and outcome of diabetes.
  • type I diabetes there is a reduction of ⁇ -cell mass to less than 2% of normal. Even in the face of severe insulin resistance as occurs in type II diabetes, the development of diabetes only occurs if there is inadequate compensatory increase in ⁇ -cell mass.
  • a reporter construct comprises a regulatory region nucleotide sequence and a nucleotide sequence encoding a detectable product.
  • the reporter construct is provided in a vector.
  • the regulatory region nucleotide sequence is linked to the nucleotide sequence encoding a detectable product.
  • the regulatory region nucleotide sequence may comprise one or more fragments of 5′ regulatory region of the INGAP genomic sequence, SEQ ID NO: 23, or it may comprise the entire length of the 5′ regulatory region.
  • a promoter element is interposed between the regulatory region nucleotide sequence and the nucleotide sequence encoding a detectable product.
  • the promoter element may be selected from the promoter elements present in the INGAP regulatory sequence. Alternatively, the promoter element present in the vector comprising the reporter construct may be used.
  • the detectable product encoded by the said nucleotide sequence encoding a detectable product could be either a nucleic acid or a protein. The detectable product need not be the INGAP gene nucleic acid or protein.
  • a method identifying agents that modulate INGAP expression comprises contacting a cell with a test agent, wherein the cell comprises a reporter construct of the present invention. Expression of the detectable nucleic acid or protein product in the cell is determined. A test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the detectable product in the cell.
  • an isolated nucleic acid comprising the genomic sequence of the hamster INGAP gene (SEQ ID NO: 2), or a fragment thereof is provided.
  • an in vitro method for identifying agents that modulate INGAP expression comprises contacting a test agent with a reporter construct of the present invention in a cell-free system that allows for transcription and translation of a nucleotide sequence. Expression of the detectable product is determined. The substance is identified as a modulator of INGAP expression if the test substance modulates expression of the detectable product.
  • an in vitro method for identifying an agent that modulate INGAP expression comprises contacting a test agent with a nucleic acid of the invention. Binding of the test agent to the nucleic acid is determined. The test agent is identified as a modulator of INGAP expression if the test agent binds to the nucleic acid.
  • a method for increasing INGAP expression is provided.
  • An effective amount of a factor that stimulates INGAP expression directly or indirectly, for example cytokines, chemokines, growth factors, or pharmacological agents, is administered to a mammal in need of increased INGAP expression.
  • kits for modulating INGAP expression comprises a modulator of INGAP expression and instructions for using the modulator of INGAP expression to modulate INGAP expression.
  • a method for modulating INGAP expression in a mammal to treat a disease state related to reduced islet cell function comprises the step of administering to the mammal an effective amount of a modulator of INGAP expression whereby the level of INGAP expression in the mammal is modified.
  • FIG. 1 shows the annotation of the hamster INGAP gene structure.
  • the boundaries of introns 1-5 are listed in Table 1.
  • FIG. 2 shows an overview of the 5′-regulatory region of the hamster INGAP gene (nucleotides 1-3137 of SEQ ID NO: 2) showing many well known and well-characterized transcription factor binding sites.
  • the minimal promoter element contains the regions noted with an underline (CAAT-box, TATA-box, and GC-box).
  • FIG. 3 shows a schematic of many well known and well-characterized transcription factor-binding sites for nucleotides 1-3123 of the 5′-regulatory region (SEQ ID NO: 1) of the hamster INGAP gene. Table 3 further describes these transcription factor-binding sites.
  • FIG. 4 shows the predicted transcription start sites within the 5′-regulatory region (SEQ ID NO: 1) of the hamster INGAP gene (SEQ ID NO: 2).
  • the predicted start site is indicated by a boldface nucleotide.
  • the start and end nucleotide numbers are indicated for the promoter sequence. The numbers refer to nucleotide numbers of the hamster INGAP gene (SEQ ID NO: 2)
  • FIG. 5 shows the adapter primer structure and sequence used in gene walking.
  • Adapter primer 1 (AP 1 ) and adapter primer 2 (AP 2 ) are shown.
  • FIGS. 6 and 7 show the strategy for reconstructing the hamster INGAP gene.
  • the hamster INGAP gene was reconstructed using the technique of gene walking. Shown are the fragments and the gene specific primers (GSP 1 and GSP 2 ) used in PCR amplification for gene walking. Fragments were joined together using unique restriction enzyme sites within each fragment. The nucleotide sequences of the individual primers are listed in Table 2.
  • FIG. 8 shows the fragments of INGAP 5′-regulatory region, which were cloned into p ⁇ Gal-basic upstream of a ⁇ -galactosidase reporter gene.
  • the labels on the left refer to the nucleotide fragments of SEQ ID NO: 23 which were cloned upstream of p ⁇ Gal-basic.
  • FIG. 9A shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains various fragments of the 5′-regulatory region (SEQ ID NO: 23) of hamster INGAP DNA cloned upstream of a ⁇ -galactosidase reporter gene (p ⁇ Gal-basic), or in a reporter construct which contains no INGAP DNA.
  • the cells are stimulated with phorbol myristate acetate. Promoter activity is assessed by determining the level of ⁇ -galactosidase present in the cell using a ⁇ -galactosidase luminescent assay.
  • FIG. 9B shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains nucleotides 2030 to 3137 of the 5′-regulatory region (SEQ ID NO: 23) of hamster INGAP cloned upstream of a ⁇ -galactosidase reporter gene, or in a reporter construct which contains no INGAP DNA.
  • the cells are stimulated with leukemia inhibitory factor.
  • Promoter activity is assessed by determining the level of ⁇ -galactosidase present in the cell using a ⁇ -galactosidase luminescent assay.
  • FIG. 10 shows the reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains different fragments (see FIG. 8 ) of the 5′-regulatory region of hamster INGAP cloned upstream of a ⁇ -galactosidase reporter gene.
  • the cells are stimulated with phorbol myristate acetate.
  • Concentrations of PMA used are 6 ng/ml, 17 ng/ml, 50 ng/ml, 100 ng/ml, or 300 ng/ml.
  • Promoter activity is assessed by determining the level of ⁇ -galactosidase present in the cell using a ⁇ -galactosidase luminescent assay.
  • FIG. 11 shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains different fragments (see FIG. 8 ) of the 5′-regulatory region of hamster INGAP cloned upstream of a ⁇ -galactosidase reporter gene.
  • the cells are stimulated with human leukemia inhibitory factor (hLIF). Concentrations of hLIF used are 1 ng/ml, 10 ng/ml, or 30 ng/ml. Promoter activity was assessed by determining the level of ⁇ -galactosidase present in the cell using a ⁇ -galactosidase luminescent assay.
  • hLIF human leukemia inhibitory factor
  • FIG. 12 shows RNA analysis for INGAP gene upregulation in rat amphicrine pancreatic cells, AR42J, treated with cytokine IL-6 or untreated. Total RNA is probed by Northern analysis for INGAP gene.
  • promoter is used to define the region of a gene at which initiation and rate of transcription are controlled. It contains the site at which RNA polymerase binds and also sites for the binding of regulatory proteins, e.g. transcription factors, repressors, etc. In order to differentiate between the transcription initiation site and other sites that modulate rate of transcription, promoter region is generally subdivided into “minimal promoter element” and “regulatory region”.
  • minimal promoter element or sometimes simply referred to as “promoter” therefore may include TATA box, GC-rich sequence and CAAT box; while “regulatory region” is usually a long stretch of nucleotide sequence where transcription factors and other factors bind. Most eukaryotic genes have long regulatory regions where many different transcription factors bind. The expression or the lack of expression of a given gene in a given cell type, tissue, organ, or an organism is governed by the interactions that take place on its regulatory region.
  • transcription factor is used to describe the proteins that bind short stretches of DNA in the regulatory regions of a gene. Transcription factors may interact with each other as well as RNA polymerase. Thus, transcription factors may bind hormones or second messengers, DNA, RNA, other transcription factors, or other proteins. They may activate or inhibit transcription of a given gene. Transcription factors are also sometimes referred to as “enhancers” or “repressors”. Transcription factor binding sites can be used to identify agents that bind to the 5′-regulatory region of the gene and modulate the gene's expression.
  • reporter is used to describe a coding sequence attached to a heterologous promoter or enhancer elements and whose product, either nucleic acid or protein, is easily detected and is quantifiable.
  • Some common reporter genes include ⁇ -galactosidase (lacZ), chloramphenicol acetyltransferase (cat), ⁇ -glucuronidase (GUS), and green fluorescent protein (GFP).
  • a “reporter construct” is a piece of nucleic acid that includes a promoter element and a reporter gene housed in a suitable vector plasmid DNA. Regulatory region nucleotide sequences may be cloned 5′ of the promoter element to determine if they contain transcription factor binding sites.
  • the reporter construct-containing vector is introduced into a cell that contains many transcription factors. Activation of the reporter gene by transcription factors may be monitored by detection and quantification of the product of the reporter gene.
  • agent is used here to essentially describe any means to modulate INGAP expression.
  • Agent may be a chemical compound, a biological agent, or a physical force, a mechanical contraption, or any combinations thereof.
  • INGAP gene is regulated by a 5′-regulatory region that is susceptible to modulation by many known transcription factors, including PMA and LIF.
  • the 5′-regulatory region nucleotide sequence of the INGAP gene may be used in screening assays to identify agents capable of modulating the INGAP gene expression.
  • modulating agents have potential as therapeutic agents for treating pathological conditions including, but not limited to, diabetes mellitus, both type 1 and type 2, endocrine and non-endocrine hypoplasia, hypertrophy, adenoma, neoplasia, and nesidioblastosis.
  • Mammalian INGAP like most genes, has a 5′-regulatory region followed by introns and exons.
  • the sequence of a mammalian (Hamster sp.) INGAP gene is provided as SEQ ID NO: 2.
  • FIG. 1 details the relative location of the 5′-regulatory region, the introns and the exons of the hamster INGAP gene.
  • the boundaries of introns 1-5 and the location of the TATA-box and the poly-A signal are listed in Table 1.
  • nucleotide sequence of the 5′-regulatory region including the promoter elements of mammalian INGAP is shown partially in SEQ ID NO: 1, and completely in SEQ ID NO: 2 and 23 (nucleotides 1-3137 of SEQ ID NO: 2).
  • Nucleotides 1-3120 of SEQ ID NO: 1 are identical to nucleotides 1-3120 of SEQ ID NO: 2 and SEQ ID NO: 23.
  • An overview of the 5′-regulatory region is shown in FIG. 2 .
  • Representative transcription enhancer/repressor binding sites are shown also in FIG. 2 .
  • Predicted transcription enhancer/repressor binding sites for nucleotides 1-3123 of the 5′-regulatory region are shown in FIG. 3 .
  • Table 3 at the end of the specification details these transcription factors and their binding sites, and their locations in the regulatory region.
  • Potential transcription factor binding analysis was done using MatInspector ProfessionalTM, which is a bioinformatics software that utilizes a library of matrix descriptions for transcription factor binding sites to locate matches in sequences of unlimited length (Quandt, K., Frech, K., Karas, H., Wingender, E., Werner, T. (1995) Nucleic Acids Res. 23, 4878-4884).
  • Table 3 lists predicted binding proteins (Further Information) based upon their classification into functionally similar matrix families (Family/matrix).
  • the DNA sequence predicted to bind the protein (Sequence), whether sense or antisense DNA (Str) and location of the sequence in SEQ ID NO: 2, (Position) are listed. Further the similarity to the consecutive highest conserved nucleotides of a matrix (Core sim.) and similarity to all nucleotides in that matrix (Matrix sim.) along with the optimized value (Opt) defined in a way that a minimum number of matches is found in non-regulatory test sequences are also listed. Details to the algorithms used in MatInspector ProfessionalTM is referenced:
  • This matrix similarity is the optimized value defined in a way that a minimum number of matches are found in non-regulatory test sequences (i.e. with this matrix similarity the number of false positive matches is minimized). This matrix similarity is used when the user checks “Optimized” as the matrix similarity threshold for MatInspector ProfessionalTM.
  • V$NFKB includes 5 similar matrices for NFkappaB (V$NFKAPPAB.01, V$NFKAPPAB 0.02, V$NFKAPPAB 0.03, V$NFKAPPAB50.01, V$NFKAPPAB65.01) as well as 1 matrix for the NFkappaB related factor c-Rel (V$CREL.01).
  • Matrix The MatInspector ProfessionalTM matrices have an identifier that indicates one of the following seven groups: vertebrates (V$), insects (I$), plants (P$), fungi (F$), nematodes (N$), bacteria (B$), and other functional elements (O$); followed by an acronym for the factor the matrix refers to, and a consecutive number discriminating between different matrices for the same factor.
  • V$OCT1.02 indicates the second matrix for vertebral Oct-1 factor.
  • Core Sim The “core sequence” of a matrix is defined as the (usually 4) consecutive highest conserved positions of the matrix.
  • the core similarity is calculated as described here.
  • the maximum core similarity of 1.0 is only reached when the highest conserved bases of a matrix match exactly in the sequence. More important than the core similarity is the matrix similarity which takes into account all bases over the whole matrix length.
  • Matrix Sim The matrix similarity is calculated as described here. A perfect match to the matrix gets a score of 1.00 (each sequence position corresponds to the highest conserved nucleotide at that position in the matrix), a “good” match to the matrix usually has a similarity of >0.80. Mismatches in highly conserved positions of the matrix decrease the matrix similarity more than mismatches in less conserved regions.
  • Reporter constructs contain a 5′ regulatory region nucleotide sequence fragment of SEQ ID NO: 23 (e.g., an enhancer and/or repressor binding site containing region), a promoter element (which may or may not be from INGAP regulatory region nucleotide sequence, SEQ ID NO: 23), and a reporter gene.
  • the 5′-regulatory region nucleotide sequence is positioned upstream of the reporter gene.
  • the region may be mapped using deletion analysis.
  • One or more fragments of the regulatory region nucleotide sequence may be initially analyzed for their responses to various transcription factor activators. Once, a region of interest is determined, further fine mapping may be carried out where DNA from different locations within the regulatory region could be combined to make a more robust, and responsive reporter construct.
  • DNA sequences such as INGAP 5′-regulatory region DNA or a fragment thereof, can be manipulated by methods well known in the art. Examples of such techniques include, but are not limited to, polymerase chain reaction (PCR), restriction enzyme endonuclease digestion, ligation, and gene walking. Cloning fragments of DNA, such as 5′-regulatory regions is well known in the art.
  • PCR-ELISA may be used to capture transcripts onto a solid phase using biotin or digoxigenin-labelled primers, oligonucleotide probes (oligoprobes) or directly after incorporation of the digoxigenin into the transcripts (Watzinger, F. and Lion, T. (2001) Nucleic Acids Res., 29, e52). Once captured, the transcripts can be detected using an enzyme-labeled avidin or anti-digoxigenin reporter molecule similar to a standard ELISA format.
  • Another approach is to employ real-time PCR to detect the transcript of the reporter gene (Mackay, I. M. and Nitsche, A., Nucleic Acids Res. 2002 Mar. 15; 30 (6), 1292-305). In real-time PCR fluorogenic nucleotides are used and progress of the transcript is monitored in real-time as the polymerase transcribes the reporter gene.
  • the promoter element in the reporter construct may or may not be from the same gene as the 5′-regulatory region.
  • the enhancer/repressor region from the INGAP 5′-regulatory region, or a fragment of the enhancer/repressor region from the INGAP 5′-regulatory region may be cloned upstream of a heterologous minimal promoter element, e.g., the minimal CMV promoter (Boshart et al., 1985) and the promoters for TK (Nordeen, 1988), IL-2, and MMTV.
  • FIG. 4 shows the predicted transcription start sites for mammalian INGAP gene (SEQ ID NO: 2).
  • SEQ ID NO: 2 was analyzed using “Neural Network Promoter Prediction” program designed by Martin Reese to identify eukaryotic promoter recognition elements such as TATA-box, GC-box, CAAT-box, and the transcription start site. These promoter elements are present in various combinations separated by various distances in sequence. The program is available on the Internet and is located at http://www.fruitfly.org/seq_tools/promoter.html.
  • the reporter construct can be used to identify agents that modulate, either alone or in combination, the expression of INGAP. Some such agents may modulate expression of INGAP by binding to the regulatory region directly while others may regulate expression of transcription factors that bind to the INGAP regulatory region.
  • the reporter construct can be transfected into a host cell in vitro, or in vivo through the pancreatic duct, either transiently or stably, and a test agent introduced to the assay system.
  • test agents include, but are not limited to organic and inorganic chemical agents, carbohydrates, proteins, oligonucleotides, cholecystokinin, mechanically induced pressure, and agents which cause a pancreatic duct obstruction.
  • Expression of the reporter gene product can be determined by an assay appropriate for the reporter gene employed.
  • test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the reporter gene product.
  • the level of increase or decrease is at least 50%, 100%, 200%, 500%, or 1000%, but any statistically significant change can be an indicator of modulatory activity.
  • a skilled artisan may also determine reporter gene product expression in untreated cells, and in treated and untreated cells transfected with a promoter-less reporter gene only. Such determinations can be used to determine background levels of expression.
  • Test agents can also be obtained by fractionating pancreatic secretion fluids.
  • a pancreatic duct obstruction can be used as an exemplary method of harvesting pancreatic secretion fluids.
  • the pancreatic secretion fluids can be fractionated by methods well known in the art. Examples include high-pressure liquid chromatography (HPLC), size exclusion chromatography, hydrophobic interacting columns, and density gradient centrifugation.
  • HPLC high-pressure liquid chromatography
  • Size exclusion chromatography size exclusion chromatography
  • hydrophobic interacting columns hydrophobic interacting columns
  • density gradient centrifugation Individual fractions can be tested for agents that modulate reporter gene expression using a method described herein.
  • the individual fractions can be further fractionated to identify agents that modulate reporter gene expression.
  • the identified test agents can be used to modulate the expression of INGAP.
  • a host cell can be any cell suitable for transfection and maintenance in a suitable assay system.
  • suitable cells include, but are not limited to, mammalian cells, human cells, mouse cells, rat cells, monkey cells, dog cells, bovine cells, and porcine cells.
  • the cells used will be human cells.
  • the cells could be either transformed cells line or primary cells.
  • Whole organ explants may also be used where the regulation may be monitored over many different cell types.
  • Many methods exist in the art for transfecting or infecting cells with reporter construct DNA include, but are not limited to, lipofection, electroporation, calcium phosphate precipitation, DEAE dextran, gene guns, and modified viral techniques (e.g., recombinant adenovirus or recombinant retrovirus). The skilled artisan can readily choose a method suitable for use with a given cell type and assay system.
  • the reporter construct can also be introduced in vivo directly into cells of the pancreas.
  • methods to introduce the reporter construct into pancreatic cells in vivo include pancreatic duct retrograde perfusion and in vivo electroporation (Mir, 2001).
  • the reporter construct encodes a reporter gene product that is readily measured in vivo.
  • a test agent can be administered systemically or locally, and expression of the reporter gene in vivo can be determined by an assay appropriate for the particular reporter employed. Examples of such include a fluorescence assay for green fluorescent protein.
  • Methods for identifying agents that modulate INGAP expression can also be accomplished in vitro.
  • the reporter construct can be contacted with a test agent in vitro under conditions sufficient for transcription and/or translation of the reporter gene.
  • Components such as rabbit reticulocyte lysates or wheat germ extracts can be utilized for such a method.
  • the expression level of the reporter gene can be determined as described above utilizing an appropriate assay for a given reporter gene.
  • a test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the reporter gene. Threshold levels of change can be set by the practitioner as discussed above.
  • test agent can alternatively be contacted with an isolated and purified INGAP 5′-regulatory DNA molecule and one can determine if the test agent binds to the DNA molecule.
  • Test agents can be a chemical agent, a protein, or a nucleic acid.
  • Appropriate INGAP 5′-regulatory DNA molecules would include nucleotides 1-6586 of SEQ ID NO: 2, the 5′-regulatory region DNA (SEQ ID NO: 1, or SEQ ID NO: 23), or any fragment of the 5′-regulatory region, preferably a fragment which contains one or more enhancer/repressor binding sites.
  • Methods to determine binding of the test agent to the fragment of DNA are well known in the art, e.g., electrophoretic mobility shift assay (EMSA).
  • ESA electrophoretic mobility shift assay
  • Fragments of the 5′-regulatory region can be obtained by methods well known in the art using the disclosed sequence (SEQ ID NO: 2). Examples of such methods include, PCR, restriction enzyme digestion, and chemical synthesis. Any fragment of DNA within the 5′-regulatory region (SEQ ID NO: 1, or 23) can be used. The exact location that an agent binds can be determined for example by utilizing smaller fragments to map precisely the binding site for the test agent. Test agents that bind in the assay can be further tested in other assays that require modulatory activity.
  • An agent that causes an increase or decrease in reporter gene expression can be used as a modulator of INGAP expression.
  • the modulator can be administered to a mammal in need of such modulation.
  • mammals that may need INGAP expression modulation are those with reduced pancreatic function, in particular reduced islet cell function.
  • Such mammals include those who have diabetes mellitus, impaired glucose tolerance, impaired fasting glucose, hyperglycemia, obesity, and pancreatic insufficiency.
  • An agent that is identified as a modulator of INGAP expression can be supplied in a kit to treat diseases associated with reduced islet cell function.
  • the kit would comprise in single or divided containers, in single or divided doses a modulator of INGAP expression.
  • Written instructions may be included for using the modulator of INGAP expression. The instructions may simply refer a reader to another location such as a website or other information source.
  • Agents that cause an increase in reporter gene expression can be used to increase INGAP expression to treat a disease state related to reduced islet cell function.
  • Agents that cause a decrease in reporter gene expression can be used to decrease INGAP expression to treat a disease state related to hyperactivity of islet cells or a disease where reduced INGAP expression is desirable.
  • agents include, but are not limited to, PMA, LIF, interleukin-6, Oncostatin M, and ciliary neurotropic factor.
  • Agents can be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, parenteral, topical, sublingual, rectal, or pancreatic duct retrograde perfusion.
  • Agents for oral administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the mammal.
  • Agents for intravenous, intramuscular, intra-arterial, transdermal, and subcutaneous injections can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for injection into the mammal.
  • Agents for intranasal, topical, and rectal administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for surface administration to the mammal.
  • Mammals in need of an increase in INGAP expression include for example, mammals with diabetes mellitus, impaired glucose tolerance, impaired fasting glucose, hyperglycemia, obesity, and pancreatic insufficiency.
  • Mammals in need of a decrease in INGAP expression include for example, mammals with hypoglycemia.
  • the hamster INGAP genomic sequence and structure was determined by gene walking (Clontech) and DNA sequencing.
  • Gene walking is a method for walking upstream toward a promoter or downstream in genomic DNA from a known sequence, such as cDNA. This method utilizes four uncloned, adapter-ligated genomic fragment libraries. The manufacturer's recommended protocol is followed with one notable exception; hamster genomic DNA was used to create the uncloned, adapter-ligated genomic fragment libraries.
  • genomic DNA was purified from hamster cells. Four separate aliquots were thoroughly digested with PvuII, StuI, DraI, or EcoRV. Following digestion, inactivation of the restriction enzymes, and dephosphorylation, each separate pool of DNA fragments was ligated to an adapter AP 1 SEQ ID NO: 489 or AP 2 SEQ ID NO: 490, see FIG. 5 .
  • the adapter was phosphorylated to provide the requisite phosphate group for a ligation reaction.
  • the 3-prime side of the short adapter contains an amine group to prevent the adapters from forming concatamers.
  • GSP 1 and GSP 2 Two gene specific primers (GSP 1 and GSP 2 ) were designed for each region of known sequence (i.e., the exons of the INGAP gene). See FIG. 6 for fragment location and GSP 1 and GSP 2 location.
  • the gene specific primers were designed as reverse PCR primers for all fragments except fragments 1 _ 2 and 14 _ 5 .
  • the gene specific primers for fragments 1 _ 2 and 14 _ 5 were designed as forward primers.
  • Adapter primer 1 (AP 1 ) and adapter primer 2 (AP 2 ) ( FIG. 5 ) were forward PCR primers for all fragments except fragments 1 _ 2 and 14 _ 5 , which were reverse PCR primers.
  • the outer gene specific primer (GSP 1 ) was used with adapter primer 1 in a PCR reaction.
  • a second, nested PCR was set up using the inner gene specific primer (GSP 2 ) and adapter primer 2 .
  • a small aliquot of the first reaction served as template for the second reaction.
  • Gene specific PCR primers utilized for gene walking are listed in Table 2 and the strategy used to build the INGAP genomic sequence is shown in FIGS. 6 and 7 .
  • the arrowheads in FIG. 6 represent the adapter primers (AP 1 and AP 2 ), while the circles represent the gene specific primers (GSP 1 and GSP 2 ).
  • the PCR fragments were sequenced to determine the nucleotide sequence of the INGAP 5′-regulatory region, the introns, the intron/exon junctions, and the 3-prime polyadenylation regions.
  • the nucleotide sequence of hamster INGAP genomic DNA is shown in SEQ ID NO: 2.
  • FIGS. 6 and 7 detail the strategy used to piece the INGAP 5′-regulatory region together.
  • Fragments 8 _ 3 and 2 _ 3 were joined at a unique SphI site; 14 _ 3 and 8 _ 3 were joined at a unique BbsI site; 16 _ 3 and 14 _ 3 were joined at a unique PstI site.
  • the nucleotide sequence of hamster INGAP 5′-regulatory region DNA is shown in SEQ ID NO: 1 and 23 in the sequence listing.
  • the hamster INGAP 5′-regulatory region or a fragment of the 5′-regulatory region was cloned into a reporter plasmid, p ⁇ Gal-Basic (Clontech).
  • the 5′-regulatory region or fragments were cloned utilizing the unique XmaI site from the gene walking adapter primer and a unique BgIII site located at the 3-prime side of the regulatory region.
  • FIG. 8 details the fragments cloned into p ⁇ Gal-Basic. The sizes of the fragments are indicated to the right of the fragments and are expressed as the number of nucleotides of the fragment.
  • Promoter analysis of INGAP identified a number of potential promoter-proximal regulatory sites including the consensus transcription factor binding sites; cAMP response element (CRE), AP-1 and STAT. Promoter-fragment reporter-gene constructs were transiently transfected into 293T cells and co-transfection of secretory alkaline phosphatase was used to normalize for transfection efficiency.
  • CRE cAMP response element
  • Reporter constructs containing INGAP 5′-regulatory region fragments 2 _ 3 s P (SEQ ID NO: 37), 2 _ 3 d P (SEQ ID NO: 38), 2 _ 3 p P (SEQ ID NO: 36), 14 _ 3 P (SEQ ID NO: 34), 16 _ 3 P (SEQ ID NO: 31), or 19 _ 3 P (SEQ ID NO: 23) were transfected into human cells.
  • the p ⁇ Gal-Basic plasmid without the hamster INGAP DNA was also transfected into human cells as a control to measure the level of endogenous reporter activity. Two days following transfection, the cells were treated with PMA for 24 hours or were untreated.
  • FIG. 9A shows that construct 14 _ 3 P activated the INGAP expression the most, followed by 2 _ 3 p P, and 16 _ 3 P.
  • Reporter construct containing INGAP 5′-regulatory region DNA nucleotides 2030 to 3120 was transfected into human cells.
  • the p ⁇ Gal-Basic plasmid without the hamster INGAP DNA was also transfected into human cells as a control to measure the level of endogenous reporter activity.
  • Two days following transfection, the cells were treated with LIF for 24 hours or were untreated.
  • the amount of ⁇ -galactosidase gene product was determined using a luminescent assay for ⁇ -galactosidase.
  • FIG. 9B shows the results.
  • LIF was determined to increase the activity of the 5′-regulatory region of mammalian INGAP.
  • Forskolin an activator of cAMP/CREB/CRE did not modulate gene expression (data not shown).
  • the hamster INGAP 5′-regulatory region is transactivated by the human transcription factors.
  • the 5′-regulatory region of hamster INGAP creates a sensitive assay system to screen for factors that modulate the expression of INGAP.
  • the fragments cloned into the reporter construct were 2 _ 3 s P (SEQ ID NO: 37), 2 _ 3 d P (SEQ ID NO: 38), 2 _ 3 p P (SEQ ID NO: 36), 14 _ 3 P (SEQ ID NO: 34), 16 _ 3 P (SEQ ID NO: 31), or 19 _ 3 P (SEQ ID NO: 23).
  • the reporter constructs were transfected into human cells.
  • the concentrations of PMA used were 6 ng/ml, 17 ng/ml, 50 ng/ml, 100 ng/ml, or 300 ng/ml.
  • the concentrations of LIF used were 1 ng/ml, 10 ng/ml, or 30 ng/ml.
  • the amount of ⁇ -galactosidase gene product was determined using a luminescent assay for ⁇ -galactosidase.
  • FIGS. 10 and 11 show the results for PMA and LIF treatment, respectively. Both PMA and LIF activated the cell reporter constructs. The exact location of the DNA contact sites can be narrowed further by cloning smaller fragments of the hamster INGAP 5′-regulatory region and by site directed mutations or deletions.
  • RNA (10 ⁇ g) Equal amounts of total RNA (10 ⁇ g) were loaded in 2.5% formaldehyde gel and electrophoresed for 4 hours at 70V with a constant circulation of the buffer using a circulating pump. The gel was photographed and washed with water twice at room temperature and soaked in 20 ⁇ SSC. The gel was transferred to a nylon membrane (Amersham) in 20 ⁇ SSC overnight following a standard procedure. The membrane was washed with 20 ⁇ SSC to remove any agar that might have attached to the membrane and baked for 4 hours at 80° C.
  • hamster INGAP cDNA was labeled using Random Prime Labeling kit (Roche-BMB) and alpha-P 32 dCTP (ICN). Approximately 20 million counts were used for hybridization in 20 ml hybridization buffer following the standard procedure at 42° C. for overnight. The blot was washed as follows: 2-times at room temperature with 2 ⁇ SSC for 10 minutes each; 2-times at 42° C. with 2 ⁇ SSC for 10 minutes each; 2-times at 55° C. with 1 ⁇ SSC for 10 minutes each. The membrane was exposed to the film (XOMAT-Kodak) and kept at ⁇ 80° C. overnight before developing.
  • Random Prime Labeling kit Roche-BMB
  • ICN alpha-P 32 dCTP
  • CAR constitutitutive androstane receptor
  • RXR Retinoid 0.81 1934-1952 1943 ID receptor-related NO: testis-associated 331 receptor (GCNF/RTR) SEQ V$HOXF/EN1.01 Homeobox 0.77 1936-1952 1944 ID protein engrailed NO: (en-1) 332 SEQ V$NKXH/NKX25.01 homeo domain 1.00 1939-1951 1945 ID factor Nkx- NO: 2.5/Csx, tinman 333 homolog, high affinity sites SEQ V$GATA/GATA3.02 GATA-binding 0.91 1953-1965 1959 ID factor 3 NO: 334 SEQ V$TBPF/TATA.01 cellular and viral 0.90 1968-1984 1976 ID TATA box NO: elements 335 SEQ V$SRFF/SRF.01 serum response 0.66 1969-1987 1978 ID factor NO: 336 SEQ V$CLOX/CDPCR3.01 cut-like 0.75 1972-1988 1980

Landscapes

  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • Wood Science & Technology (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biotechnology (AREA)
  • Medicinal Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Diabetes (AREA)
  • Physics & Mathematics (AREA)
  • Microbiology (AREA)
  • Toxicology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Veterinary Medicine (AREA)
  • Public Health (AREA)
  • General Chemical & Material Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Plant Pathology (AREA)
  • Analytical Chemistry (AREA)
  • Endocrinology (AREA)
  • Hematology (AREA)
  • Obesity (AREA)
  • Immunology (AREA)
  • Emergency Medicine (AREA)
  • Child & Adolescent Psychology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

A reporter construct contains mammalian INGAP 5′-regulatory region or a fragment thereof, a minimal promoter element from mammalian INGAP or a heterologous promoter, and a reporter gene. The reporter construct can be used to screen for agents which alone or in combination up-regulate or down-regulate reporter gene expression. Alternatively, the reporter construct can be used to screen for agents that bind to the hamster INGAP 5′-regulatory region or a fragment thereof.

Description

  • This application is a continuation of U.S. patent application Ser. No. 12/062,740, filed on which is a continuation of U.S. patent application Ser. No. 10/339,767 filed on Jan. 9, 2003, now U.S. Pat. No. 7,355,024, which claims priority to provisional applications: 60/388,315 filed on Jun. 14, 2002, provisional application 60/361,073 filed on Mar. 1, 2002 and provisional application 60/346,898 filed on Jan. 11, 2002, the contents of which are incorporated by reference.
  • REFERENCE TO SEQ ID
  • The Sequence listing in “1247090060.tx” created on Aug. 1, 2011, being 108 KB in size, is incorporated by reference.
  • FIELD OF THE INVENTION
  • The invention relates to the field of assays for the detection of factors that modulate gene expression. Specifically, the invention relates to reporter constructs and methods for identifying agents that modulate the expression of the INGAP gene.
  • BACKGROUND OF THE INVENTION
  • Islet neogenesis gene associated protein (INGAP protein) has been identified as a pancreatic acinar cell protein that can induce islet cell neogenesis from progenitor cells resident in the pancreas in a manner that recapitulates islet development during normal embryogenesis. INGAP is unique in its ability to stimulate growth and differentiation of islets of Langerhans from precursor cells associated with pancreas. These islets evolve a mature insulin secretory profile capable of responding to perturbations in blood glucose in a physiologic manner. This potential anti-diabetic therapeutic has been shown to demonstrate homology across several species and to exert a biological response.
  • Pancreatic islet cell mass is lost in type 1 diabetes mellitus, a disease in which a progressive autoimmune reaction results in the selective destruction of insulin-producing β-cells. In type 2 diabetes mellitus, so-called adult-onset disease, but also increasingly a condition in young overweight people, the β-cell mass may be reduced by as much as 60% of normal. The number of functioning β-cells in the pancreas is of critical significance for the development, course, and outcome of diabetes. In type I diabetes, there is a reduction of β-cell mass to less than 2% of normal. Even in the face of severe insulin resistance as occurs in type II diabetes, the development of diabetes only occurs if there is inadequate compensatory increase in β-cell mass. Thus, the development of either of the major forms of diabetes can be regarded as a failure of adaptive β-cell growth and a subsequent deficiency in insulin secretion. Stimulating the growth of islets and β-cells from precursor cells, known as islet neogenesis, is an attractive approach to the amelioration of diabetes. There is need in the art for methods to identify agents that can modulate the expression of INGAP, whether in animals or in cultured cells.
  • BRIEF SUMMARY OF THE INVENTION
  • It is an object of the invention to provide a reporter construct containing the 5′-regulatory region from mammalian INGAP gene.
  • It is another object of the invention to provide methods for identifying agents which modulate INGAP expression.
  • It is another object of the invention to provide a nucleic acid or fragment of INGAP 5′-regulatory region.
  • It is another object of the invention to provide methods for increasing INGAP expression.
  • It is another object of the invention to provide a kit for modulating INGAP expression.
  • These and other objects of the invention are provided by one or more of the embodiments described below.
  • In one aspect of the invention a reporter construct is provided. The reporter construct comprises a regulatory region nucleotide sequence and a nucleotide sequence encoding a detectable product. In one aspect of the invention, the reporter construct is provided in a vector. The regulatory region nucleotide sequence is linked to the nucleotide sequence encoding a detectable product. The regulatory region nucleotide sequence may comprise one or more fragments of 5′ regulatory region of the INGAP genomic sequence, SEQ ID NO: 23, or it may comprise the entire length of the 5′ regulatory region. In one embodiment of the reporter construct, a promoter element is interposed between the regulatory region nucleotide sequence and the nucleotide sequence encoding a detectable product. The promoter element may be selected from the promoter elements present in the INGAP regulatory sequence. Alternatively, the promoter element present in the vector comprising the reporter construct may be used. The detectable product encoded by the said nucleotide sequence encoding a detectable product could be either a nucleic acid or a protein. The detectable product need not be the INGAP gene nucleic acid or protein.
  • In another embodiment of the invention, a method identifying agents that modulate INGAP expression is provided. The method comprises contacting a cell with a test agent, wherein the cell comprises a reporter construct of the present invention. Expression of the detectable nucleic acid or protein product in the cell is determined. A test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the detectable product in the cell.
  • In another embodiment of the invention, an isolated nucleic acid comprising the genomic sequence of the hamster INGAP gene (SEQ ID NO: 2), or a fragment thereof is provided.
  • According to another embodiment of the invention, an in vitro method for identifying agents that modulate INGAP expression is provided. The method comprises contacting a test agent with a reporter construct of the present invention in a cell-free system that allows for transcription and translation of a nucleotide sequence. Expression of the detectable product is determined. The substance is identified as a modulator of INGAP expression if the test substance modulates expression of the detectable product.
  • According to another embodiment of the invention, an in vitro method for identifying an agent that modulate INGAP expression is provided. The method comprises contacting a test agent with a nucleic acid of the invention. Binding of the test agent to the nucleic acid is determined. The test agent is identified as a modulator of INGAP expression if the test agent binds to the nucleic acid.
  • According to another embodiment of the invention a method for increasing INGAP expression is provided. An effective amount of a factor that stimulates INGAP expression directly or indirectly, for example cytokines, chemokines, growth factors, or pharmacological agents, is administered to a mammal in need of increased INGAP expression.
  • According to another embodiment of the invention a kit for modulating INGAP expression is provided. The kit comprises a modulator of INGAP expression and instructions for using the modulator of INGAP expression to modulate INGAP expression.
  • According to another embodiment of the invention a method for modulating INGAP expression in a mammal to treat a disease state related to reduced islet cell function is provided. The method comprises the step of administering to the mammal an effective amount of a modulator of INGAP expression whereby the level of INGAP expression in the mammal is modified.
  • All documents cited are, in relevant part, incorporated herein by reference; the citation of any document is not to be construed as an admission that it is prior art with respect to the present invention.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 shows the annotation of the hamster INGAP gene structure. The boundaries of introns 1-5 are listed in Table 1.
  • FIG. 2 shows an overview of the 5′-regulatory region of the hamster INGAP gene (nucleotides 1-3137 of SEQ ID NO: 2) showing many well known and well-characterized transcription factor binding sites. The minimal promoter element contains the regions noted with an underline (CAAT-box, TATA-box, and GC-box).
  • FIG. 3 shows a schematic of many well known and well-characterized transcription factor-binding sites for nucleotides 1-3123 of the 5′-regulatory region (SEQ ID NO: 1) of the hamster INGAP gene. Table 3 further describes these transcription factor-binding sites.
  • FIG. 4 shows the predicted transcription start sites within the 5′-regulatory region (SEQ ID NO: 1) of the hamster INGAP gene (SEQ ID NO: 2). The predicted start site is indicated by a boldface nucleotide. The start and end nucleotide numbers are indicated for the promoter sequence. The numbers refer to nucleotide numbers of the hamster INGAP gene (SEQ ID NO: 2)
  • FIG. 5 shows the adapter primer structure and sequence used in gene walking. Adapter primer 1 (AP1) and adapter primer 2 (AP2) are shown.
  • FIGS. 6 and 7 show the strategy for reconstructing the hamster INGAP gene. The hamster INGAP gene was reconstructed using the technique of gene walking. Shown are the fragments and the gene specific primers (GSP1 and GSP2) used in PCR amplification for gene walking. Fragments were joined together using unique restriction enzyme sites within each fragment. The nucleotide sequences of the individual primers are listed in Table 2.
  • FIG. 8 shows the fragments of INGAP 5′-regulatory region, which were cloned into pβGal-basic upstream of a β-galactosidase reporter gene. The labels on the left refer to the nucleotide fragments of SEQ ID NO: 23 which were cloned upstream of pβGal-basic.
  • FIG. 9A shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains various fragments of the 5′-regulatory region (SEQ ID NO: 23) of hamster INGAP DNA cloned upstream of a β-galactosidase reporter gene (pβGal-basic), or in a reporter construct which contains no INGAP DNA. The cells are stimulated with phorbol myristate acetate. Promoter activity is assessed by determining the level of β-galactosidase present in the cell using a β-galactosidase luminescent assay.
  • FIG. 9B shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains nucleotides 2030 to 3137 of the 5′-regulatory region (SEQ ID NO: 23) of hamster INGAP cloned upstream of a β-galactosidase reporter gene, or in a reporter construct which contains no INGAP DNA. The cells are stimulated with leukemia inhibitory factor. Promoter activity is assessed by determining the level of β-galactosidase present in the cell using a β-galactosidase luminescent assay.
  • FIG. 10 shows the reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains different fragments (see FIG. 8) of the 5′-regulatory region of hamster INGAP cloned upstream of a β-galactosidase reporter gene. The cells are stimulated with phorbol myristate acetate. Concentrations of PMA used are 6 ng/ml, 17 ng/ml, 50 ng/ml, 100 ng/ml, or 300 ng/ml. Promoter activity is assessed by determining the level of β-galactosidase present in the cell using a β-galactosidase luminescent assay.
  • FIG. 11 shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains different fragments (see FIG. 8) of the 5′-regulatory region of hamster INGAP cloned upstream of a β-galactosidase reporter gene. The cells are stimulated with human leukemia inhibitory factor (hLIF). Concentrations of hLIF used are 1 ng/ml, 10 ng/ml, or 30 ng/ml. Promoter activity was assessed by determining the level of β-galactosidase present in the cell using a β-galactosidase luminescent assay.
  • FIG. 12 shows RNA analysis for INGAP gene upregulation in rat amphicrine pancreatic cells, AR42J, treated with cytokine IL-6 or untreated. Total RNA is probed by Northern analysis for INGAP gene.
  • DETAILED DESCRIPTION OF THE INVENTION Definitions
  • It must be noted that as used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural references unless the context clearly dictates otherwise.
  • The term “promoter” is used to define the region of a gene at which initiation and rate of transcription are controlled. It contains the site at which RNA polymerase binds and also sites for the binding of regulatory proteins, e.g. transcription factors, repressors, etc. In order to differentiate between the transcription initiation site and other sites that modulate rate of transcription, promoter region is generally subdivided into “minimal promoter element” and “regulatory region”. The term “minimal promoter element” or sometimes simply referred to as “promoter” therefore may include TATA box, GC-rich sequence and CAAT box; while “regulatory region” is usually a long stretch of nucleotide sequence where transcription factors and other factors bind. Most eukaryotic genes have long regulatory regions where many different transcription factors bind. The expression or the lack of expression of a given gene in a given cell type, tissue, organ, or an organism is governed by the interactions that take place on its regulatory region.
  • The term “transcription factor” is used to describe the proteins that bind short stretches of DNA in the regulatory regions of a gene. Transcription factors may interact with each other as well as RNA polymerase. Thus, transcription factors may bind hormones or second messengers, DNA, RNA, other transcription factors, or other proteins. They may activate or inhibit transcription of a given gene. Transcription factors are also sometimes referred to as “enhancers” or “repressors”. Transcription factor binding sites can be used to identify agents that bind to the 5′-regulatory region of the gene and modulate the gene's expression.
  • The term “reporter” is used to describe a coding sequence attached to a heterologous promoter or enhancer elements and whose product, either nucleic acid or protein, is easily detected and is quantifiable. Some common reporter genes include β-galactosidase (lacZ), chloramphenicol acetyltransferase (cat), β-glucuronidase (GUS), and green fluorescent protein (GFP).
  • A “reporter construct” is a piece of nucleic acid that includes a promoter element and a reporter gene housed in a suitable vector plasmid DNA. Regulatory region nucleotide sequences may be cloned 5′ of the promoter element to determine if they contain transcription factor binding sites. The reporter construct-containing vector is introduced into a cell that contains many transcription factors. Activation of the reporter gene by transcription factors may be monitored by detection and quantification of the product of the reporter gene.
  • The term “agent” is used here to essentially describe any means to modulate INGAP expression. Agent may be a chemical compound, a biological agent, or a physical force, a mechanical contraption, or any combinations thereof.
  • INGAP Promoter and Regulatory Region
  • It is a discovery of the present inventors that INGAP gene is regulated by a 5′-regulatory region that is susceptible to modulation by many known transcription factors, including PMA and LIF.
  • It is a further discovery of the present invention that the 5′-regulatory region nucleotide sequence of the INGAP gene may be used in screening assays to identify agents capable of modulating the INGAP gene expression. These modulating agents have potential as therapeutic agents for treating pathological conditions including, but not limited to, diabetes mellitus, both type 1 and type 2, endocrine and non-endocrine hypoplasia, hypertrophy, adenoma, neoplasia, and nesidioblastosis.
  • Mammalian INGAP, like most genes, has a 5′-regulatory region followed by introns and exons. The sequence of a mammalian (Hamster sp.) INGAP gene is provided as SEQ ID NO: 2. FIG. 1 details the relative location of the 5′-regulatory region, the introns and the exons of the hamster INGAP gene. The boundaries of introns 1-5 and the location of the TATA-box and the poly-A signal are listed in Table 1.
  • TABLE 1
    Position In INGAP
    Description Gene (SEQ ID NO: 2)
    TATA-Box 3094
    INTRON 1 3150-3426
    INTRON 2 3508-4442
    INTRON 3 4562-4735
    INTRON 4 4874-5459
    INTRON 5 5587-5843
    Poly-A Signal 6098-6103
  • The nucleotide sequence of the 5′-regulatory region including the promoter elements of mammalian INGAP, is shown partially in SEQ ID NO: 1, and completely in SEQ ID NO: 2 and 23 (nucleotides 1-3137 of SEQ ID NO: 2). Nucleotides 1-3120 of SEQ ID NO: 1 are identical to nucleotides 1-3120 of SEQ ID NO: 2 and SEQ ID NO: 23. An overview of the 5′-regulatory region is shown in FIG. 2. Representative transcription enhancer/repressor binding sites are shown also in FIG. 2. Predicted transcription enhancer/repressor binding sites for nucleotides 1-3123 of the 5′-regulatory region are shown in FIG. 3. Table 3 at the end of the specification details these transcription factors and their binding sites, and their locations in the regulatory region. Potential transcription factor binding analysis was done using MatInspector Professional™, which is a bioinformatics software that utilizes a library of matrix descriptions for transcription factor binding sites to locate matches in sequences of unlimited length (Quandt, K., Frech, K., Karas, H., Wingender, E., Werner, T. (1995) Nucleic Acids Res. 23, 4878-4884).
  • Table 3 lists predicted binding proteins (Further Information) based upon their classification into functionally similar matrix families (Family/matrix). The DNA sequence predicted to bind the protein (Sequence), whether sense or antisense DNA (Str) and location of the sequence in SEQ ID NO: 2, (Position) are listed. Further the similarity to the consecutive highest conserved nucleotides of a matrix (Core sim.) and similarity to all nucleotides in that matrix (Matrix sim.) along with the optimized value (Opt) defined in a way that a minimum number of matches is found in non-regulatory test sequences are also listed. Details to the algorithms used in MatInspector Professional™ is referenced:
  • OPT: This matrix similarity is the optimized value defined in a way that a minimum number of matches are found in non-regulatory test sequences (i.e. with this matrix similarity the number of false positive matches is minimized). This matrix similarity is used when the user checks “Optimized” as the matrix similarity threshold for MatInspector Professional™.
  • Family: Each matrix belongs to a so-called matrix family, where functionally similar matrices are grouped together, eliminating redundant matches by MatInspector Professional™ professional (if the family option was selected). E.g. the matrix family V$NFKB includes 5 similar matrices for NFkappaB (V$NFKAPPAB.01, V$NFKAPPAB 0.02, V$NFKAPPAB 0.03, V$NFKAPPAB50.01, V$NFKAPPAB65.01) as well as 1 matrix for the NFkappaB related factor c-Rel (V$CREL.01).
  • Matrix: The MatInspector Professional™ matrices have an identifier that indicates one of the following seven groups: vertebrates (V$), insects (I$), plants (P$), fungi (F$), nematodes (N$), bacteria (B$), and other functional elements (O$); followed by an acronym for the factor the matrix refers to, and a consecutive number discriminating between different matrices for the same factor. Thus, V$OCT1.02 indicates the second matrix for vertebral Oct-1 factor.
  • Core Sim: The “core sequence” of a matrix is defined as the (usually 4) consecutive highest conserved positions of the matrix. The core similarity is calculated as described here. The maximum core similarity of 1.0 is only reached when the highest conserved bases of a matrix match exactly in the sequence. More important than the core similarity is the matrix similarity which takes into account all bases over the whole matrix length.
  • Matrix Sim: The matrix similarity is calculated as described here. A perfect match to the matrix gets a score of 1.00 (each sequence position corresponds to the highest conserved nucleotide at that position in the matrix), a “good” match to the matrix usually has a similarity of >0.80. Mismatches in highly conserved positions of the matrix decrease the matrix similarity more than mismatches in less conserved regions.
  • Another aspect of the invention provides for a reporter construct. Reporter constructs contain a 5′ regulatory region nucleotide sequence fragment of SEQ ID NO: 23 (e.g., an enhancer and/or repressor binding site containing region), a promoter element (which may or may not be from INGAP regulatory region nucleotide sequence, SEQ ID NO: 23), and a reporter gene. The 5′-regulatory region nucleotide sequence is positioned upstream of the reporter gene. In order to determine the identity of various transcription factors that bind the 5′ regulatory region nucleotide sequence and to elucidate their binding locations within the 5′ regulatory nucleotide sequence of the INGAP gene, the region may be mapped using deletion analysis. One or more fragments of the regulatory region nucleotide sequence may be initially analyzed for their responses to various transcription factor activators. Once, a region of interest is determined, further fine mapping may be carried out where DNA from different locations within the regulatory region could be combined to make a more robust, and responsive reporter construct. DNA sequences, such as INGAP 5′-regulatory region DNA or a fragment thereof, can be manipulated by methods well known in the art. Examples of such techniques include, but are not limited to, polymerase chain reaction (PCR), restriction enzyme endonuclease digestion, ligation, and gene walking. Cloning fragments of DNA, such as 5′-regulatory regions is well known in the art.
  • Another approach to quantify the expression levels of a gene is to measure transcription of the gene. PCR-ELISA may be used to capture transcripts onto a solid phase using biotin or digoxigenin-labelled primers, oligonucleotide probes (oligoprobes) or directly after incorporation of the digoxigenin into the transcripts (Watzinger, F. and Lion, T. (2001) Nucleic Acids Res., 29, e52). Once captured, the transcripts can be detected using an enzyme-labeled avidin or anti-digoxigenin reporter molecule similar to a standard ELISA format. Another approach is to employ real-time PCR to detect the transcript of the reporter gene (Mackay, I. M. and Nitsche, A., Nucleic Acids Res. 2002 Mar. 15; 30 (6), 1292-305). In real-time PCR fluorogenic nucleotides are used and progress of the transcript is monitored in real-time as the polymerase transcribes the reporter gene.
  • The promoter element in the reporter construct may or may not be from the same gene as the 5′-regulatory region. As an example, the enhancer/repressor region from the INGAP 5′-regulatory region, or a fragment of the enhancer/repressor region from the INGAP 5′-regulatory region, may be cloned upstream of a heterologous minimal promoter element, e.g., the minimal CMV promoter (Boshart et al., 1985) and the promoters for TK (Nordeen, 1988), IL-2, and MMTV.
  • Transcription of a gene begins around the minimal promoter. FIG. 4 shows the predicted transcription start sites for mammalian INGAP gene (SEQ ID NO: 2). SEQ ID NO: 2 was analyzed using “Neural Network Promoter Prediction” program designed by Martin Reese to identify eukaryotic promoter recognition elements such as TATA-box, GC-box, CAAT-box, and the transcription start site. These promoter elements are present in various combinations separated by various distances in sequence. The program is available on the Internet and is located at http://www.fruitfly.org/seq_tools/promoter.html.
  • The reporter construct can be used to identify agents that modulate, either alone or in combination, the expression of INGAP. Some such agents may modulate expression of INGAP by binding to the regulatory region directly while others may regulate expression of transcription factors that bind to the INGAP regulatory region.
  • The reporter construct can be transfected into a host cell in vitro, or in vivo through the pancreatic duct, either transiently or stably, and a test agent introduced to the assay system. Examples of test agents include, but are not limited to organic and inorganic chemical agents, carbohydrates, proteins, oligonucleotides, cholecystokinin, mechanically induced pressure, and agents which cause a pancreatic duct obstruction. Expression of the reporter gene product can be determined by an assay appropriate for the reporter gene employed. Examples of such assays include, but are not limited to a luminescent assay for β-galactosidase or luciferase, an enzymatic assay for chloramphenicol acetyl transferase, and fluorescence detection for fluorescent proteins. Such assays are well known in the art, and a skilled artisan will be able to select an appropriate assay for the chosen reporter. A test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the reporter gene product. Preferably the level of increase or decrease is at least 50%, 100%, 200%, 500%, or 1000%, but any statistically significant change can be an indicator of modulatory activity. A skilled artisan may also determine reporter gene product expression in untreated cells, and in treated and untreated cells transfected with a promoter-less reporter gene only. Such determinations can be used to determine background levels of expression.
  • Test agents can also be obtained by fractionating pancreatic secretion fluids. A pancreatic duct obstruction can be used as an exemplary method of harvesting pancreatic secretion fluids. The pancreatic secretion fluids can be fractionated by methods well known in the art. Examples include high-pressure liquid chromatography (HPLC), size exclusion chromatography, hydrophobic interacting columns, and density gradient centrifugation. Individual fractions can be tested for agents that modulate reporter gene expression using a method described herein. The individual fractions can be further fractionated to identify agents that modulate reporter gene expression. The identified test agents can be used to modulate the expression of INGAP.
  • A host cell can be any cell suitable for transfection and maintenance in a suitable assay system. Examples of suitable cells include, but are not limited to, mammalian cells, human cells, mouse cells, rat cells, monkey cells, dog cells, bovine cells, and porcine cells. Preferably the cells used will be human cells. The cells could be either transformed cells line or primary cells. Whole organ explants may also be used where the regulation may be monitored over many different cell types. Many methods exist in the art for transfecting or infecting cells with reporter construct DNA. Such methods include, but are not limited to, lipofection, electroporation, calcium phosphate precipitation, DEAE dextran, gene guns, and modified viral techniques (e.g., recombinant adenovirus or recombinant retrovirus). The skilled artisan can readily choose a method suitable for use with a given cell type and assay system.
  • The reporter construct can also be introduced in vivo directly into cells of the pancreas. Examples of methods to introduce the reporter construct into pancreatic cells in vivo include pancreatic duct retrograde perfusion and in vivo electroporation (Mir, 2001). The reporter construct encodes a reporter gene product that is readily measured in vivo. A test agent can be administered systemically or locally, and expression of the reporter gene in vivo can be determined by an assay appropriate for the particular reporter employed. Examples of such include a fluorescence assay for green fluorescent protein.
  • Methods for identifying agents that modulate INGAP expression can also be accomplished in vitro. The reporter construct can be contacted with a test agent in vitro under conditions sufficient for transcription and/or translation of the reporter gene. Components such as rabbit reticulocyte lysates or wheat germ extracts can be utilized for such a method. Subsequently, the expression level of the reporter gene can be determined as described above utilizing an appropriate assay for a given reporter gene. A test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the reporter gene. Threshold levels of change can be set by the practitioner as discussed above.
  • A test agent can alternatively be contacted with an isolated and purified INGAP 5′-regulatory DNA molecule and one can determine if the test agent binds to the DNA molecule. Test agents can be a chemical agent, a protein, or a nucleic acid. Appropriate INGAP 5′-regulatory DNA molecules would include nucleotides 1-6586 of SEQ ID NO: 2, the 5′-regulatory region DNA (SEQ ID NO: 1, or SEQ ID NO: 23), or any fragment of the 5′-regulatory region, preferably a fragment which contains one or more enhancer/repressor binding sites. Methods to determine binding of the test agent to the fragment of DNA are well known in the art, e.g., electrophoretic mobility shift assay (EMSA). See for example Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2d ed., 1989, at pages 9.50-9.51. Fragments of the 5′-regulatory region can be obtained by methods well known in the art using the disclosed sequence (SEQ ID NO: 2). Examples of such methods include, PCR, restriction enzyme digestion, and chemical synthesis. Any fragment of DNA within the 5′-regulatory region (SEQ ID NO: 1, or 23) can be used. The exact location that an agent binds can be determined for example by utilizing smaller fragments to map precisely the binding site for the test agent. Test agents that bind in the assay can be further tested in other assays that require modulatory activity.
  • An agent that causes an increase or decrease in reporter gene expression can be used as a modulator of INGAP expression. The modulator can be administered to a mammal in need of such modulation. Examples of mammals that may need INGAP expression modulation are those with reduced pancreatic function, in particular reduced islet cell function. Such mammals include those who have diabetes mellitus, impaired glucose tolerance, impaired fasting glucose, hyperglycemia, obesity, and pancreatic insufficiency.
  • An agent that is identified as a modulator of INGAP expression can be supplied in a kit to treat diseases associated with reduced islet cell function. The kit would comprise in single or divided containers, in single or divided doses a modulator of INGAP expression. Written instructions may be included for using the modulator of INGAP expression. The instructions may simply refer a reader to another location such as a website or other information source.
  • Agents that cause an increase in reporter gene expression can be used to increase INGAP expression to treat a disease state related to reduced islet cell function. Agents that cause a decrease in reporter gene expression can be used to decrease INGAP expression to treat a disease state related to hyperactivity of islet cells or a disease where reduced INGAP expression is desirable. Examples of such agents include, but are not limited to, PMA, LIF, interleukin-6, Oncostatin M, and ciliary neurotropic factor. Agents can be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, parenteral, topical, sublingual, rectal, or pancreatic duct retrograde perfusion. Agents for oral administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the mammal.
  • Agents for intravenous, intramuscular, intra-arterial, transdermal, and subcutaneous injections can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for injection into the mammal. Agents for intranasal, topical, and rectal administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for surface administration to the mammal. Mammals in need of an increase in INGAP expression include for example, mammals with diabetes mellitus, impaired glucose tolerance, impaired fasting glucose, hyperglycemia, obesity, and pancreatic insufficiency. Mammals in need of a decrease in INGAP expression include for example, mammals with hypoglycemia.
  • The following examples are offered by way of illustration and do not limit the invention disclosed herein.
  • EXAMPLES Example 1 Hamster INGAP Genomic Sequence and Structure
  • The hamster INGAP genomic sequence and structure was determined by gene walking (Clontech) and DNA sequencing. Gene walking is a method for walking upstream toward a promoter or downstream in genomic DNA from a known sequence, such as cDNA. This method utilizes four uncloned, adapter-ligated genomic fragment libraries. The manufacturer's recommended protocol is followed with one notable exception; hamster genomic DNA was used to create the uncloned, adapter-ligated genomic fragment libraries.
  • To create uncloned, adapter ligated genomic fragment libraries, genomic DNA was purified from hamster cells. Four separate aliquots were thoroughly digested with PvuII, StuI, DraI, or EcoRV. Following digestion, inactivation of the restriction enzymes, and dephosphorylation, each separate pool of DNA fragments was ligated to an adapter AP1 SEQ ID NO: 489 or AP2 SEQ ID NO: 490, see FIG. 5. The adapter was phosphorylated to provide the requisite phosphate group for a ligation reaction.
  • Also note that the 3-prime side of the short adapter contains an amine group to prevent the adapters from forming concatamers.
  • Two gene specific primers (GSP1 and GSP2) were designed for each region of known sequence (i.e., the exons of the INGAP gene). See FIG. 6 for fragment location and GSP1 and GSP2 location. The gene specific primers were designed as reverse PCR primers for all fragments except fragments 1_2 and 14_5. The gene specific primers for fragments 1_2 and 14_5 were designed as forward primers. Adapter primer 1 (AP1) and adapter primer 2 (AP2) (FIG. 5) were forward PCR primers for all fragments except fragments 1_2 and 14_5, which were reverse PCR primers. The outer gene specific primer (GSP1) was used with adapter primer 1 in a PCR reaction. To increase specificity, a second, nested PCR was set up using the inner gene specific primer (GSP2) and adapter primer 2. A small aliquot of the first reaction served as template for the second reaction. Gene specific PCR primers utilized for gene walking are listed in Table 2 and the strategy used to build the INGAP genomic sequence is shown in FIGS. 6 and 7. The arrowheads in FIG. 6 represent the adapter primers (AP1 and AP2), while the circles represent the gene specific primers (GSP1 and GSP2).
  • TABLE 2
    NAME (LOCATION) SEQUENCE
    INGEN 21_3 (1464, 1482) 5′-ACAAGCAATCTAGAGATGG-3′ (SEQ ID NO: 3)
    INGEN 19_3 (1401, 1423) 5′-GTTCAGCTATGTTCATAGCAGGG-3′ (SEQ ID NO: 4)
    INGEN 16_3 (1855, 1876) 5′-GTCTGTATGACTGTGTGGGAAG-3′ (SEQ ID NO: 5)
    INGEN 15_3 (1929, 1948) 5′-GCACTTGAACTCAATGGCTC-3′ (SEQ ID NO: 6)
    INGEN 14_3 (2147, 2168) 5′-GAACCACCTGACATGGGTGATG-3′ (SEQ ID NO: 7)
    INGEN 13_3 (2177, 2200) 5′-GGGCATCGTATCATCTGGTTACAG-3′ (SEQ ID NO: 8)
    INGEN 8_3 (2544, 2565) 5′-GGTTCAAAAAAGCTGCTTCAAC-3′ (SEQ ID NO: 9)
    INGEN 7_3 (2666, 2689) 5′-GGAATAGCTGCAATTTATGCCCAT-3′ (SEQ ID NO: 10)
    INGEN 4_3 (2833, 2858) 5′-CTTAGGAACATTCAGGCAGCCTCCTG-3′ (SEQ ID NO: 11)
    INGEN 3_3 (2866, 2891) 5′-GTTGCCCTCTGCCACGTGTCAAGTTC-3′ (SEQ ID NO: 12)
    INGEN 2_3 (3444, 3470) 5′-CATCCAAGACATCCTACAGAGGGTCAT-3′ (SEQ ID NO: 13)
    INGEN 1_3 (3475, 3501) 5′-CCCAAGAAAGGAACATCAGGCAGGAAA-3′ (SEQ ID NO: 14)
    INGEN 2_2 (3330, 3350) 5′-CCAAATGAGTGCTTCCCTGAA-3′ (SEQ ID NO: 15)
    INGEN 1_2 (3241, 3266) 5′-GCAGCACTCTGAAACTCAGTAGAGTT-3′ (SEQ ID NO: 16)
    INGEN 14_5 (5544, 5563) 5′-GCTGCTGACCGTGGTTATTG-3′ (SEQ ID NO: 17)
    INGEN 13_5 (5463, 5485) 5′-ACACTACCCAACGGAAGTGGATG-3′ (SEQ ID NO: 18)
    INGAP1_1L (3475, 3492) 5′-TTTCCTGCCTGATGTTCC-3′ (SEQ ID NO: 19)
    INGAP1_1R (5957, 5976) 5′-TCATACTTGCTTCCTTGTCC-3′ (SEQ ID NO: 20)
    INGAP2_1L (4470, 4488) 5′-CTTCACGTATAACCTGTCC-3′ (SEQ ID NO: 21)
    INGAP2_1R (5905, 5923) 5′-ATTAGAACTGCCCTAGACC-3′ (SEQ ID NO: 22)
  • The PCR fragments were sequenced to determine the nucleotide sequence of the INGAP 5′-regulatory region, the introns, the intron/exon junctions, and the 3-prime polyadenylation regions. The nucleotide sequence of hamster INGAP genomic DNA is shown in SEQ ID NO: 2.
  • Example 2 Cloning Hamster INGAP 5′-Regulatory Region Fragment into a Reporter Construct
  • To construct the INGAP 5′-regulatory region, individual PCR fragments were joined together at unique restriction sites located within two adjoining fragments. FIGS. 6 and 7 detail the strategy used to piece the INGAP 5′-regulatory region together. Fragments 8_3 and 2_3 were joined at a unique SphI site; 14_3 and 8_3 were joined at a unique BbsI site; 16_3 and 14_3 were joined at a unique PstI site. The nucleotide sequence of hamster INGAP 5′-regulatory region DNA is shown in SEQ ID NO: 1 and 23 in the sequence listing.
  • The hamster INGAP 5′-regulatory region or a fragment of the 5′-regulatory region was cloned into a reporter plasmid, pβGal-Basic (Clontech). The 5′-regulatory region or fragments were cloned utilizing the unique XmaI site from the gene walking adapter primer and a unique BgIII site located at the 3-prime side of the regulatory region. FIG. 8 details the fragments cloned into pβGal-Basic. The sizes of the fragments are indicated to the right of the fragments and are expressed as the number of nucleotides of the fragment.
  • Example 3 Assay System to Screen for Factors that Modulate the Expression of INGAP
  • Promoter analysis of INGAP identified a number of potential promoter-proximal regulatory sites including the consensus transcription factor binding sites; cAMP response element (CRE), AP-1 and STAT. Promoter-fragment reporter-gene constructs were transiently transfected into 293T cells and co-transfection of secretory alkaline phosphatase was used to normalize for transfection efficiency.
  • Reporter constructs containing INGAP 5′-regulatory region fragments 2_3 sP (SEQ ID NO: 37), 2_3 dP (SEQ ID NO: 38), 2_3 pP (SEQ ID NO: 36), 14_3P (SEQ ID NO: 34), 16_3P (SEQ ID NO: 31), or 19_3P (SEQ ID NO: 23) were transfected into human cells. The pβGal-Basic plasmid without the hamster INGAP DNA was also transfected into human cells as a control to measure the level of endogenous reporter activity. Two days following transfection, the cells were treated with PMA for 24 hours or were untreated. To determine the level of promoter activity, the amount of β-galactosidase gene product was determined using a luminescent assay for β-galactosidase. FIG. 9A shows that construct 14_3P activated the INGAP expression the most, followed by 2_3 pP, and 16_3P.
  • Reporter construct containing INGAP 5′-regulatory region DNA nucleotides 2030 to 3120 was transfected into human cells. The pβGal-Basic plasmid without the hamster INGAP DNA was also transfected into human cells as a control to measure the level of endogenous reporter activity. Two days following transfection, the cells were treated with LIF for 24 hours or were untreated. To determine the level of promoter activity, the amount of β-galactosidase gene product was determined using a luminescent assay for β-galactosidase. FIG. 9B shows the results. LIF was determined to increase the activity of the 5′-regulatory region of mammalian INGAP. Forskolin (an activator of cAMP/CREB/CRE) did not modulate gene expression (data not shown).
  • It is important to note that when present in human cells, the hamster INGAP 5′-regulatory region is transactivated by the human transcription factors. Thus, linked to a reporter gene, the 5′-regulatory region of hamster INGAP creates a sensitive assay system to screen for factors that modulate the expression of INGAP.
  • Example 4 Determination of Approximate Location of PMA and LIF-Mediated Transcription Factor Binding in the 5′-Regulatory Region
  • To map the approximate location of PMA-initiated or LIF-initiated transcription factor binding different fragments of the hamster INGAP 5′-regulatory region were cloned into pβGal-Basic. See FIG. 8. The fragments cloned into the reporter construct were 2_3 sP (SEQ ID NO: 37), 2_3 dP (SEQ ID NO: 38), 2_3 pP (SEQ ID NO: 36), 14_3P (SEQ ID NO: 34), 16_3P (SEQ ID NO: 31), or 19_3P (SEQ ID NO: 23). The reporter constructs were transfected into human cells. Two days following transfection, the cells were treated with different concentrations of PMA or LIF for 24 hours. The concentrations of PMA used were 6 ng/ml, 17 ng/ml, 50 ng/ml, 100 ng/ml, or 300 ng/ml. The concentrations of LIF used were 1 ng/ml, 10 ng/ml, or 30 ng/ml. To determine the level of promoter activity, the amount of β-galactosidase gene product was determined using a luminescent assay for β-galactosidase. FIGS. 10 and 11 show the results for PMA and LIF treatment, respectively. Both PMA and LIF activated the cell reporter constructs. The exact location of the DNA contact sites can be narrowed further by cloning smaller fragments of the hamster INGAP 5′-regulatory region and by site directed mutations or deletions.
  • Example 5 RNA Analysis of INGAP Gene Upregulation
  • To determine if INGAP RNA levels increase after stimulation with a cytokine that signals through STAT, rat amphocrine pancreatic cells, AR42J were treated with IL-6 (1000 U/ml) for 24 hours. Total RNA was extracted from the treated and untreated cells using techniques well known in the art, e.g., using TRIZOL® reagent.
  • Equal amounts of total RNA (10 μg) were loaded in 2.5% formaldehyde gel and electrophoresed for 4 hours at 70V with a constant circulation of the buffer using a circulating pump. The gel was photographed and washed with water twice at room temperature and soaked in 20×SSC. The gel was transferred to a nylon membrane (Amersham) in 20×SSC overnight following a standard procedure. The membrane was washed with 20×SSC to remove any agar that might have attached to the membrane and baked for 4 hours at 80° C.
  • One hundred nanograms of hamster INGAP cDNA was labeled using Random Prime Labeling kit (Roche-BMB) and alpha-P32 dCTP (ICN). Approximately 20 million counts were used for hybridization in 20 ml hybridization buffer following the standard procedure at 42° C. for overnight. The blot was washed as follows: 2-times at room temperature with 2×SSC for 10 minutes each; 2-times at 42° C. with 2×SSC for 10 minutes each; 2-times at 55° C. with 1×SSC for 10 minutes each. The membrane was exposed to the film (XOMAT-Kodak) and kept at −80° C. overnight before developing.
  • Treatment with IL-6 caused an increase in INGAP gene expression (FIG. 12). These data demonstrate that extracellular factors that elevate AP-1-binding transcription factors and STAT-binding transcription factors are involved in the regulation of INGAP gene expression. These studies suggest that it is feasible to enhance INGAP expression as a means of inducing islet neogenesis.
  • While particular embodiments of the present invention have been illustrated and described, it would be obvious to those skilled in the art that various other changes and modifications can be made without departing from the spirit and scope of the invention. It is therefore intended to cover in the appended claims all such changes and modifications that are within the scope of this invention.
  • TABLE 3
    SEQ
    ID Further Position
    NO: Family/matrix Information Opt. from-to anchor
    SEQ V$LEFF/LEF1.01 TCF/LEF-1, 0.86 12-28 20
    ID involved in the
    NO: Wnt signal
    41 transduction
    pathway
    SEQ V$MITF/MIT.01 MIT 0.81 22-40 31
    ID (microphthalmia
    NO: transcription
    42 factor) and TFE3
    SEQ V$OCT1/OCT1.05 octamer-binding 0.90 27-41 34
    ID factor 1
    NO:
    43
    SEQ V$TCFF/TCF11.01 TCF11/KCR- 1.00 32-38 35
    ID F1/Nrf1
    NO: homodimers
    44
    SEQ V$MYOF/MYOGNF1.01 Myogenin/ 0.71 25-53 39
    ID nuclear factor 1
    NO: or related factors
    45
    SEQ V$ZBPF/ZBP89.01 Zinc finger 0.93 36-48 42
    ID transcription
    NO: factor ZBP-89
    46
    SEQ V$SP1F/GC.01 GC box elements 0.88 38-52 45
    ID
    NO:
    47
    SEQ V$PERO/PPARA.01 PPAR/RXR 0.70 44-64 54
    ID heterodimers
    NO:
    48
    SEQ V$PAX5/PAX9.01 zebrafish PAX9 0.78 43-71 57
    ID binding sites
    NO:
    49
    SEQ V$TBPF/ATATA.01 Avian C-type LTR 0.81 68-84 76
    ID TATA box
    NO:
    50
    SEQ V$HMTB/MTBF.01 muscle-specific 0.90 76-84 80
    ID Mt binding site
    NO:
    51
    SEQ V$OCT1/OCT1.06 octamer-binding 0.80 74-88 81
    ID factor 1
    NO:
    52
    NO: (en-1)
    65
    SEQ V$BARB/BARBIE.01 barbiturate- 0.88 230-244 237
    ID inducible element
    NO:
    66
    SEQ V$TBPF/TATA.01 cellular and viral 0.90 230-246 238
    ID TATA box
    NO: elements
    67
    SEQ V$BARB/BARBIE.01 barbiturate- 0.88 252-266 259
    ID inducible element
    NO:
    68
    SEQ V$MYT1/MYT1.01 MyT1 zinc finger 0.75 272-284 278
    ID transcription
    NO: factor involved in
    69 primary
    neurogenesis
    SEQ V$SMAD/SMAD4.01 Smad4 0.94 304-312 308
    ID transcription
    NO: factor involved in
    70 TGF-beta
    signaling
    SEQ V$HOXF/CRX.01 Cone-rod 0.94 312-328 320
    ID homeobox-
    NO: containing
    71 transcription
    factor/otx-like
    homeobox gene
    SEQ V$ECAT/NFY.01 nuclear factor Y 0.90 337-351 344
    ID (Y-box binding
    NO: factor)
    72
    SEQ V$HOXF/PTX1.01 Pituitary 0.79 337-353 345
    ID Homeobox 1
    NO: (Ptx1)
    73
    SEQ V$FKHD/FREAC2.01 Fork head 0.84 362-378 370
    ID RElated
    NO: ACtivator-2
    74
    SEQ V$MINI/MUSCLE_INI.02 Muscle Initiator 0.86 401-419 410
    ID Sequence
    NO:
    75
    SEQ V$MOKF/MOK2.01 Ribonucleoprotein 0.74 409-429 419
    ID associated zinc
    NO: finger protein
    76 MOK-2 (mouse)
    SEQ V$ZFIA/ZID.01 zinc finger with 0.85 414-426 420
    ID interaction
    NO: domain
    77
    SEQ V$CART/XVENT2.01 Xenopus 0.82 418-434 426
    ID homeodomain
    NO: factor Xvent-2;
    78 early BMP
    signaling
    response
    SEQ V$OCT1/OCT1.04 octamer-binding 0.80 421-435 428
    ID factor 1
    NO:
    79
    SEQ V$HOMS/S8.01 Binding site for 0.97 426-434 430
    ID S8 type
    NO: homeodomains
    80
    SEQ V$NKXH/NKX25.02 homeo domain 0.88 424-436 430
    ID factor Nkx-
    NO: 2.5/Csx, tinman
    81 homolog low
    affinity sites
    SEQ V$CREB/CREBP1.01 cAMP-responsive 0.80 425-445 435
    ID element binding
    NO: protein 1
    82
    SEQ V$COMP/COMP1.01 COMP1, 0.76 434-454 444
    ID cooperates with
    NO: myogenic
    83 proteins in
    multicomponent
    complex
    SEQ V$HOXF/HOX1-3.01 Hox-1.3, 0.83 444-460 452
    ID vertebrate
    NO: homeobox
    84 protein
    SEQ V$ETSF/GABP.01 GABP: GA 0.85 454-470 462
    ID binding protein
    NO:
    85
    SEQ V$LEFF/LEF1.01 TCF/LEF-1, 0.86 463-479 471
    ID involved in the
    NO: Wnt signal
    86 transduction
    pathway
    SEQ V$STAT/STAT6.01 STAT6: signal 0.84 464-482 473
    ID transducer and
    NO: activator of
    87 transcription 6
    SEQ V$GATA/GATA1.03 GATA-binding 0.95 490-502 496
    ID factor 1
    NO:
    88
    SEQ V$SRFF/SRF.01 serum response 0.66 487-505 496
    ID factor
    NO:
    89
    SEQ V$EVI1/EVI1.04 Ecotropic viral 0.77 493-509 501
    ID integration site 1
    NO: encoded factor
    90
    SEQ V$AP4R/TH1E47.01 Thing1/E47 0.93 509-525 517
    ID heterodimer, TH1
    NO: bHLH member
    91 specific
    expression in a
    variety of
    embryonic
    tissues
    SEQ V$AP4R/TAL1BETAITF2.01 Tal-1beta/ITF-2 0.85 512-528 520
    ID heterodimer
    NO:
    92
    SEQ V$NEUR/NEUROD1.01 DNA binding site 0.83 514-526 520
    ID for NEUROD1
    NO: (BETA-2/E47
    93 dimer)
    SEQ V$MEF2/MEF2.05 MEF2 0.96 518-540 529
    ID
    NO:
    94
    SEQ V$EVI1/EVI1.04 Ecotropic viral 0.77 523-539 531
    ID integration site 1
    NO: encoded factor
    95
    SEQ V$MEF2/AMEF2.01 myocyte 0.80 521-543 532
    ID enhancer factor
    NO:
    96
    SEQ V$TBPF/MTATA.01 Muscle TATA box 0.84 524-540 532
    ID
    NO:
    97
    SEQ V$HOXF/HOX1-3.01 Hox-1.3, 0.83 543-559 551
    ID vertebrate
    NO: homeobox
    98 protein
    SEQ V$PDX1/ISL1.01 Pancreatic and 0.82 543-563 553
    ID intestinal lim-
    NO: homeodomain
    99 factor
    SEQ V$OCT1/OCT1.05 octamer-binding 0.90 556-570 563
    ID factor 1
    NO:
    100
    SEQ V$CIZF/NMP4.01 NMP4 (nuclear 0.97 562-572 567
    ID matrix protein 4)/
    NO: CIZ (Cas-
    101 interacting zinc
    finger protein)
    SEQ V$EVI1/EVI1.01 Ecotropic viral 0.72 569-585 577
    ID integration site 1
    NO: encoded factor
    102
    SEQ V$AP1F/AP1.01 AP1 binding site 0.95 582-602 592
    ID
    NO:
    103
    SEQ V$PIT1/PIT1.01 Pit1, GHF-1 0.86 589-599 594
    ID pituitary specific
    NO: pou domain
    104 transcription
    factor
    SEQ V$AP1F/AP1.01 AP1 binding site 0.95 586-606 596
    ID
    NO:
    105
    SEQ V$VMYB/VMYB.01 v-Myb 0.90 593-603 598
    ID
    NO:
    106
    SEQ V$CIZF/NMP4.01 NMP4 (nuclear 0.97 595-605 600
    ID matrix protein 4)/
    NO: CIZ (Cas-
    107 interacting zinc
    finger protein)
    SEQ V$GREF/PRE.01 Progesterone 0.84 604-622 613
    ID receptor binding
    NO: site
    108
    SEQ V$GKLF/GKLF.01 Gut-enriched 0.91 632-646 639
    ID Krueppel-like
    NO: factor
    109
    SEQ V$CIZF/NMP4.01 NMP4 (nuclear 0.97 637-647 642
    ID matrix protein 4)/
    NO: CIZ (Cas-
    110 interacting zinc
    finger protein)
    SEQ V$NFAT/NFAT.01 Nuclear factor of 0.97 640-650 645
    ID activated T-cells
    NO:
    111
    SEQ V$MAZF/MAZ.01 Myc associated 0.90 649-661 655
    ID zinc finger
    NO: protein (MAZ)
    112
    SEQ V$EGRF/WT1.01 Wilms Tumor 0.88 658-672 665
    ID Suppressor
    NO:
    113
    SEQ V$ZBPF/ZBP89.01 Zinc finger 0.93 663-675 669
    ID transcription
    NO: factor ZBP-89
    114
    SEQ V$IRFF/IRF2.01 interferon 0.80 702-716 709
    ID regulatory factor 2
    NO:
    115
    SEQ V$BRNF/BRN2.01 POU factor Brn-2 0.91 746-762 754
    ID (N-Oct 3)
    NO:
    116
    SEQ V$ETSF/PU1.01 Pu.1 (Pu120) Ets- 0.86 746-762 754
    ID like transcription
    NO: factor identified
    117 in lymphoid B-
    cells
    SEQ V$EVI1/EVI1.04 Ecotropic viral 0.77 750-766 758
    ID integration site 1
    NO: encoded factor
    118
    SEQ V$EVI1/EVI1.05 Ecotropic viral 0.80 755-771 763
    ID integration site 1
    NO: encoded factor
    119
    SEQ V$ZBPF/ZBP89.01 Zinc finger 0.93 764-776 770
    ID transcription
    NO: factor ZBP-89
    120
    SEQ V$FAST/FAST1.01 FAST-1 SMAD 0.81 769-783 776
    ID interacting
    NO: protein
    121
    SEQ V$TBPF/TATA.02 Mammalian C- 0.89 771-787 779
    ID type LTR TATA
    NO: box
    122
    SEQ V$PAX5/PAX9.01 zebrafish PAX9 0.78 781-809 795
    ID binding sites
    NO:
    123
    SEQ V$OCT1/OCT.01 Octamer binding 0.79 793-807 800
    ID site (OCT1/OCT2
    NO: consensus)
    124
    SEQ V$OCTP/OCT1P.01 octamer-binding 0.86 798-810 804
    ID factor 1, POU-
    NO: specific domain
    125
    SEQ V$SRFF/SRF.01 serum response 0.66 797-815 806
    ID factor
    NO:
    126
    SEQ V$EVI1/EVI1.05 Ecotropic viral 0.80 802-818 810
    ID integration site 1
    NO: encoded factor
    127
    SEQ V$CLOX/CDP.01 cut-like 0.75 803-819 811
    ID homeodomain
    NO: protein
    128
    SEQ V$EVI1/EVI1.02 Ecotropic viral 0.83 807-823 815
    ID integration site 1
    NO: encoded factor
    129
    SEQ V$ECAT/NFY.02 nuclear factor Y 0.91 810-824 817
    ID (Y-box binding
    NO: factor)
    130
    SEQ V$HAML/AML3.01 Runt-related 0.84 811-825 818
    ID transcription
    NO: factor 2/CBFA1
    131 (core-binding
    factor, runt
    domain, alpha
    subunit 1)
    SEQ V$PCAT/CAAT.01 cellular and viral 0.90 813-823 818
    ID CCAAT box
    NO:
    132
    SEQ V$GATA/GATA.01 GATA binding site 0.95 818-830 824
    ID (consensus)
    NO:
    133
    SEQ V$HNF1/HNF1.02 Hepatic nuclear 0.76 818-834 826
    ID factor 1
    NO:
    134
    SEQ V$HOXT/MEIS1_HOXA9.01 Homeobox 0.79 823-835 829
    ID protein MEIS1
    NO: binding site
    135
    SEQ V$ECAT/NFY.01 nuclear factor Y 0.90 837-851 844
    ID (Y-box binding
    NO: factor)
    136
    SEQ V$FKHD/FREAC2.01 Fork head 0.84 844-860 852
    ID RElated
    NO: ACtivator-2
    137
    SEQ V$EVI1/EVI1.06 Ecotropic viral 0.83 846-862 854
    ID integration site 1
    NO: encoded factor
    138
    SEQ V$GATA/GATA1.01 GATA-binding 0.96 853-865 859
    ID factor 1
    NO:
    139
    SEQ V$PCAT/ACAAT.01 Avian C-type LTR 0.86 856-866 861
    ID CCAAT box
    NO:
    140
    SEQ V$XBBF/RFX1.01 X-box binding 0.89 909-927 918
    ID protein RFX1
    NO:
    141
    SEQ V$EBOX/MYCMAX.02 c-Myc/Max 0.92 912-928 920
    ID heterodimer
    NO:
    142
    SEQ V$MITF/MIT.01 MIT 0.81 911-929 920
    ID (microphthalmia
    NO: transcription
    143 factor) and TFE3
    SEQ V$ETSF/PU1.01 Pu.1 (Pu120) Ets- 0.86 927-943 935
    ID like transcription
    NO: factor identified
    144 in lymphoid B-
    cells
    SEQ V$OCT1/OCT1.06 octamer-binding 0.80 932-946 939
    ID factor 1
    NO:
    145
    SEQ V$TALE/TGIF.01 TG-interacting 1.00 936-942 939
    ID factor belonging
    NO: to TALE class of
    146 homeodomain
    factors
    SEQ V$MITF/MIT.01 MIT 0.81 935-953 944
    ID (microphthalmia
    NO: transcription
    147 factor) and TFE3
    SEQ V$OCT1/OCT1.04 octamer-binding 0.80 941-955 948
    ID factor 1
    NO:
    148
    SEQ V$GATA/GATA.01 GATA binding site 0.95 962-974 968
    ID (consensus)
    NO:
    149
    SEQ V$SRFF/SRF.01 serum response 0.66 968-986 977
    ID factor
    NO:
    150
    SEQ V$CDXF/CDX2.01 Cdx-2 0.84 970-988 979
    ID mammalian
    NO: caudal related
    151 intestinal transcr.
    factor
    SEQ V$FKHD/XFD2.01 Xenopus fork 0.89 972-988 980
    ID head domain
    NO: factor 2
    152
    SEQ V$MEF2/MEF2.01 myogenic 0.74 970-992 981
    ID enhancer factor 2
    NO:
    153
    SEQ V$TBPF/TATA.01 cellular and viral 0.90 973-989 981
    ID TATA box
    NO: elements
    154
    SEQ V$CART/CART1.01 Cart-1 (cartilage 0.84 978-994 986
    ID homeoprotein 1)
    NO:
    155
    SEQ V$CART/CART1.01 Cart-1 (cartilage 0.84  985-1001 993
    ID homeoprotein 1)
    NO:
    156
    SEQ V$SATB/SATB1.01 Special AT-rich 0.93  985-1001 993
    ID sequence-binding
    NO: protein 1,
    157 predominantly
    expressed in
    thymocytes,
    binds to matrix
    attachment
    regions (MARs)
    SEQ V$BRNF/BRN3.01 POU transcription 0.78  987-1003 995
    ID factor Brn-3
    NO:
    158
    SEQ V$CLOX/CDP.01 cut-like 0.75  987-1003 995
    ID homeodomain
    NO: protein
    159
    SEQ V$HOMS/S8.01 Binding site for 0.97  992-1000 996
    ID S8 type
    NO: homeodomains
    160
    SEQ V$NKXH/DLX1.01 DLX-1, -2, and -5 0.91  990-1002 996
    ID binding sites
    NO:
    161
    SEQ V$HOXF/HOX1-3.01 Hox-1.3, 0.83  989-1005 997
    ID vertebrate
    NO: homeobox
    162 protein
    SEQ V$PDX1/PDX1.01 Pdx1 (IDX1/IPF1) 0.74  988-1008 998
    ID pancreatic and
    NO: intestinal
    163 homeodomain TF
    SEQ V$FKHD/XFD3.01 Xenopus fork 0.82  998-1014 1006
    ID head domain
    NO: factor 3
    164
    SEQ V$HNF1/HNF1.01 hepatic nuclear 0.78 1000-1016 1008
    ID factor 1
    NO:
    165
    SEQ V$HNF1/HNF1.01 hepatic nuclear 0.78 1002-1018 1010
    ID factor 1
    NO:
    166
    SEQ V$PAX4/PAX4.01 Pax-4 paired 0.97 1005-1015 1010
    ID domain protein,
    NO: together with
    167 PAX-6 involved in
    pancreatic
    development
    SEQ V$HOMS/S8.01 Binding site for 0.97 1007-1015 1011
    ID S8 type
    NO: homeodomains
    168
    SEQ V$HOXF/HOX1-3.01 Hox-1.3, 0.83 1003-1019 1011
    ID vertebrate
    NO: homeobox
    169 protein
    SEQ V$NKXH/DLX1.01 DLX-1, -2, and -5 0.91 1005-1017 1011
    ID binding sites
    NO:
    170
    SEQ V$RBIT/BRIGHT.01 Bright, B cell 0.92 1005-1017 1011
    ID regulator of IgH
    NO: transcription
    171
    SEQ V$TBPF/ATATA.01 Avian C-type LTR 0.81 1005-1021 1013
    ID TATA box
    NO:
    172
    SEQ V$CREB/CREBP1.01 cAMP-responsive 0.80 1004-1024 1014
    ID element binding
    NO: protein 1
    173
    SEQ V$RORA/RORA2.01 RAR-related 0.82 1007-1023 1015
    ID orphan receptor
    NO: alpha2
    174
    SEQ V$PCAT/CAAT.01 cellular and viral 0.90 1022-1032 1027
    ID CCAAT box
    NO:
    175
    SEQ V$NKXH/NKX25.02 homeo domain 0.88 1022-1034 1028
    ID factor Nkx-
    NO: 2.5/Csx, tinman
    176 homolog low
    affinity sites
    SEQ V$CREB/HLF.01 hepatic leukemia 0.84 1022-1042 1032
    ID factor
    NO:
    177
    SEQ V$HOXF/HOX1-3.01 Hox-1.3, 0.83 1056-1072 1064
    ID vertebrate
    NO: homeobox
    178 protein
    SEQ V$HOMS/S8.01 Binding site for 0.97 1061-1069 1065
    ID S8 type
    NO: homeodomains
    179
    SEQ V$NKXH/DLX1.01 DLX-1, -2, and -5 0.91 1059-1071 1065
    ID binding sites
    NO:
    180
    SEQ V$RBIT/BRIGHT.01 Bright, B cell 0.92 1059-1071 1065
    ID regulator of IgH
    NO: transcription
    181
    SEQ V$BRNF/BRN2.01 POU factor Brn-2 0.91 1058-1074 1066
    ID (N-Oct 3)
    NO:
    182
    SEQ V$OCT1/OCT1.06 octamer-binding 0.80 1060-1074 1067
    ID factor 1
    NO:
    183
    SEQ V$HOXF/HOX1-3.01 Hox-1.3, 0.83 1061-1077 1069
    ID vertebrate
    NO: homeobox
    184 protein
    SEQ V$OCT1/OCT1.06 octamer-binding 0.80 1079-1093 1086
    ID factor 1
    NO:
    185
    SEQ V$FAST/FAST1.01 FAST-1 SMAD 0.81 1080-1094 1087
    ID interacting
    NO: protein
    186
    SEQ V$RREB/RREB1.01 Ras-responsive 0.79 1081-1095 1088
    ID element binding
    NO: protein 1
    187
    SEQ V$E2FF/E2F.02 E2F, involved in 0.84 1085-1099 1092
    ID cell cycle
    NO: regulation,
    188 interacts with Rb
    p107 protein
    SEQ V$CREB/TAXCREB.01 Tax/CREB 0.81 1091-1111 1101
    ID complex
    NO:
    189
    SEQ V$AP1F/VMAF.01 v-Maf 0.82 1092-1112 1102
    ID
    NO:
    190
    SEQ V$MYT1/MYT1.01 MyT1 zinc finger 0.75 1123-1135 1129
    ID transcription
    NO: factor involved in
    191 primary
    neurogenesis
    SEQ V$CLOX/CLOX.01 Clox 0.81 1136-1152 1144
    ID
    NO:
    192
    SEQ V$HNF4/HNF4.01 Hepatic nuclear 0.82 1156-1172 1164
    ID factor 4
    NO:
    193
    SEQ V$LEFF/LEF1.01 TCF/LEF-1, 0.86 1157-1173 1165
    ID involved in the
    NO: Wnt signal
    194 transduction
    pathway
    SEQ V$PERO/PPARA.01 PPAR/RXR 0.70 1157-1177 1167
    ID heterodimers
    NO:
    195
    SEQ V$CLOX/CLOX.01 Clox 0.81 1173-1189 1181
    ID
    NO:
    196
    SEQ V$HNF6/HNF6.01 Liver enriched 0.82 1175-1189 1182
    ID Cut -
    NO: Homeodomain
    197 transcription
    factor HNF6
    (ONECUT)
    SEQ V$SRFF/SRF.02 serum response 0.83 1177-1195 1186
    ID factor
    NO:
    198
    SEQ V$CLOX/CDPCR3.01 cut-like 0.75 1180-1196 1188
    ID homeodomain
    NO: protein
    199
    SEQ V$PIT1/PIT1.01 Pit1, GHF-1 0.86 1186-1196 1191
    ID pituitary specific
    NO: pou domain
    200 transcription
    factor
    SEQ V$HMTB/MTBF.01 muscle-specific 0.90 1196-1204 1200
    ID Mt binding site
    NO:
    201
    SEQ V$FKHD/HFH8.01 HNF-3/Fkh 0.92 1200-1216 1208
    ID Homolog-8
    NO:
    202
    SEQ V$E4FF/E4F.01 GLI-Krueppel- 0.82 1223-1235 1229
    ID related
    NO: transcription
    203 factor, regulator
    of adenovirus E4
    promoter
    SEQ V$CREB/HLF.01 hepatic leukemia 0.84 1221-1241 1231
    ID factor
    NO:
    204
    SEQ V$VBPF/VBP.01 PAR-type chicken 0.86 1226-1236 1231
    ID vitellogenin
    NO: promoter-binding
    205 protein
    SEQ V$OCT1/OCT.01 Octamer binding 0.79 1259-1273 1266
    ID site (OCT1/OCT2
    NO: consensus)
    206
    SEQ V$STAT/STAT6.01 STAT6: signal 0.84 1261-1279 1270
    ID transducer and
    NO: activator of
    207 transcription 6
    SEQ V$CDXF/CDX2.01 Cdx-2 0.84 1270-1288 1279
    ID mammalian
    NO: caudal related
    208 intestinal transcr.
    factor
    SEQ V$SORY/SOX9.01 SOX (SRY-related 0.90 1280-1296 1288
    ID HMG box)
    NO:
    209
    SEQ V$FKHD/HFH2.01 HNF-3/Fkh 0.93 1285-1301 1293
    ID Homolog 2
    NO:
    210
    SEQ V$CDXF/CDX2.01 Cdx-2 0.84 1286-1304 1295
    ID mammalian
    NO: caudal related
    211 intestinal transcr.
    factor
    SEQ V$OCTB/TST1.01 POU-factor Tst- 0.87 1288-1302 1295
    ID 1/Oct-6
    NO:
    212
    SEQ V$PDX1/ISL1.01 Pancreatic and 0.82 1298-1318 1308
    ID intestinal lim-
    NO: homeodomain
    213 factor
    SEQ V$SORY/SOX9.01 SOX (SRY-related 0.90 1308-1324 1316
    ID HMG box)
    NO:
    214
    SEQ V$CREB/HLF.01 hepatic leukemia 0.84 1310-1330 1320
    ID factor
    NO:
    215
    SEQ V$VBPF/VBP.01 PAR-type chicken 0.86 1315-1325 1320
    ID vitellogenin
    NO: promoter-binding
    216 protein
    SEQ V$CEBP/CEBPB.01 CCAAT/enhancer 0.94 1313-1331 1322
    ID binding protein
    NO: beta
    217
    SEQ V$PDX1/ISL1.01 Pancreatic and 0.82 1313-1333 1323
    ID intestinal lim-
    NO: homeodomain
    218 factor
    SEQ V$HAML/AML1.01 runt-factor AML-1 1.00 1323-1337 1330
    ID
    NO:
    219
    SEQ V$GREF/ARE.01 Androgene 0.80 1323-1341 1332
    ID receptor binding
    NO: site
    220
    SEQ V$TEAF/TEF1.01 TEF-1 related 0.84 1343-1355 1349
    ID muscle factor
    NO:
    221
    SEQ V$CMYB/CMYB.01 c-Myb, important 0.99 1352-1360 1356
    ID in hematopoesis,
    NO: cellular
    222 equivalent to
    avian
    myoblastosis
    virus oncogene v-
    myb
    SEQ V$AP4R/TH1E47.01 Thing1/E47 0.93 1378-1394 1386
    ID heterodimer, TH1
    NO: bHLH member
    223 specific
    expression in a
    variety of
    embryonic
    tissues
    SEQ V$CP2F/CP2.01 CP2 0.90 1384-1394 1389
    ID
    NO:
    224
    SEQ V$CHOP/CHOP.01 heterodimers of 0.90 1386-1398 1392
    ID CHOP and
    NO: C/EBPalpha
    225
    SEQ V$CEBP/CEBP.02 C/EBP binding 0.85 1385-1403 1394
    ID site
    NO:
    226
    SEQ V$MEF2/HMEF2.01 myocyte 0.76 1384-1406 1395
    ID enhancer factor
    NO:
    227
    SEQ V$OCT1/OCT1.03 octamer-binding 0.85 1388-1402 1395
    ID factor 1
    NO:
    228
    SEQ V$HMTB/MTBF.01 muscle-specific 0.90 1394-1402 1398
    ID Mt binding site
    NO:
    229
    SEQ V$CLOX/CDPCR3.01 cut-like 0.75 1422-1438 1430
    ID homeodomain
    NO: protein
    230
    SEQ V$OCT1/OCT1.05 octamer-binding 0.90 1423-1437 1430
    ID factor 1
    NO:
    231
    SEQ V$HOXF/HOX1-3.01 Hox-1.3, 0.83 1423-1439 1431
    ID vertebrate
    NO: homeobox
    232 protein
    SEQ V$PDX1/PDX1.01 Pdx1 (IDX1/IPF1) 0.74 1423-1443 1433
    ID pancreatic and
    NO: intestinal
    233 homeodomain TF
    SEQ V$SORY/SOX5.01 Sox-5 0.87 1426-1442 1434
    ID
    NO:
    234
    SEQ V$OCT1/OCT1.05 octamer-binding 0.90 1444-1458 1451
    ID factor 1
    NO:
    235
    SEQ V$CREB/E4BP4.01 E4BP4, bZIP 0.80 1443-1463 1453
    ID domain,
    NO: transcriptional
    236 repressor
    SEQ V$VBPF/VBP.01 PAR-type chicken 0.86 1449-1459 1454
    ID vitellogenin
    NO: promoter-binding
    237 protein
    SEQ V$TBPF/MTATA.01 Muscle TATA box 0.84 1455-1471 1463
    ID
    NO:
    238
    SEQ V$PBXF/PBX1.01 homeo domain 0.78 1469-1481 1475
    ID factor Pbx-1
    NO:
    239
    SEQ V$COMP/COMP1.01 COMP1, 0.76 1467-1487 1477
    ID cooperates with
    NO: myogenic
    240 proteins in
    multicomponent
    complex
    SEQ V$SORY/S0X5.01 Sox-5 0.87 1478-1494 1486
    ID
    NO:
    241
    SEQ V$FKHD/FREAC2.01 Fork head 0.84 1485-1501 1493
    ID RElated
    NO: ACtivator-2
    242
    SEQ V$PDX1/ISL1.01 Pancreatic and 0.82 1495-1515 1505
    ID intestinal lim-
    NO: homeodomain
    243 factor
    SEQ V$HOXF/HOX1-3.01 Hox-1.3, 0.83 1499-1515 1507
    ID vertebrate
    NO: homeobox
    244 protein
    SEQ V$PDX1/PDX1.01 Pdx1 (IDX1/IPF1) 0.74 1498-1518 1508
    ID pancreatic and
    NO: intestinal
    245 homeodomain TF
    SEQ V$CART/XVENT2.01 Xenopus 0.82 1502-1518 1510
    ID homeodomain
    NO: factor Xvent-2;
    246 early BMP
    signaling
    response
    SEQ V$CDXF/CDX2.01 Cdx-2 0.84 1507-1525 1516
    ID mammalian
    NO: caudal related
    247 intestinal transcr.
    factor
    SEQ V$MEF2/MEF2.05 MEF2 0.96 1505-1527 1516
    ID
    NO:
    248
    SEQ V$HNF1/HNF1.01 hepatic nuclear 0.78 1510-1526 1518
    ID factor 1
    NO:
    249
    SEQ V$OCT1/OCT1.06 octamer-binding 0.80 1511-1525 1518
    ID factor 1
    NO:
    250
    SEQ V$TBPF/TATA.02 Mammalian C- 0.89 1510-1526 1518
    ID type LTR TATA
    NO: box
    251
    SEQ V$NKXH/MSX.01 Homeodomain 0.97 1514-1526 1520
    ID proteins MSX-1
    NO: and MSX-2
    252
    SEQ V$RBIT/BRIGHT.01 Bright, B cell 0.92 1515-1527 1521
    ID regulator of IgH
    NO: transcription
    253
    SEQ V$MEF2/AMEF2.01 myocyte 0.80 1514-1536 1525
    ID enhancer factor
    NO:
    254
    SEQ V$EVI1/EVI1.02 Ecotropic viral 0.83 1526-1542 1534
    ID integration site 1
    NO: encoded factor
    255
    SEQ V$GATA/GATA1.02 GATA-binding 0.99 1528-1540 1534
    ID factor 1
    NO:
    256
    SEQ V$GATA/GATA3.02 GATA-binding 0.91 1537-1549 1543
    ID factor 3
    NO:
    257
    SEQ V$GATA/GATA3.02 GATA-binding 0.91 1559-1571 1565
    ID factor 3
    NO:
    258
    SEQ V$OCT1/OCT1.02 octamer-binding 0.82 1561-1575 1568
    ID factor 1
    NO:
    259
    SEQ V$CEBP/CEBPB.01 CCAAT/enhancer 0.94 1567-1585 1576
    ID binding protein
    NO: beta
    260
    SEQ V$PLZF/PLZF.01 Promyelocytic 0.86 1574-1588 1581
    ID leukemia zink
    NO: finger (TF with
    261 nine Krueppel-
    like zink fingers)
    SEQ V$PAX3/PAX3.01 Pax-3 paired 0.76 1587-1599 1593
    ID domain protein,
    NO: expressed in
    262 embryogenesis,
    mutations
    correlate to
    Waardenburg
    Syndrome
    SEQ V$CREB/ATF.01 activating 0.90 1588-1608 1598
    ID transcription
    NO: factor
    263
    SEQ V$AP4R/TH1E47.01 Thing1/E47 0.93 1614-1630 1622
    ID heterodimer, TH1
    NO: bHLH member
    264 specific
    expression in a
    variety of
    embryonic
    tissues
    SEQ V$NKXH/MSX.01 Homeodomain 0.97 1619-1631 1625
    ID proteins MSX-1
    NO: and MSX-2
    265
    SEQ V$RBIT/BRIGHT.01 Bright, B cell 0.92 1620-1632 1626
    ID regulator of IgH
    NO: transcription
    266
    SEQ V$OCTB/TST1.01 POU-factor Tst- 0.87 1620-1634 1627
    ID 1/Oct-6
    NO:
    267
    SEQ V$NKXH/DLX3.01 Distal-less 3 0.91 1628-1640 1634
    ID homeodomain
    NO: transcription
    268 factor
    SEQ V$GREF/PRE.01 Progesterone 0.84 1628-1646 1637
    ID receptor binding
    NO: site
    269
    SEQ V$TBPF/TATA.01 cellular and viral 0.90 1636-1652 1644
    ID TATA box
    NO: elements
    270
    SEQ V$FKHD/XFD2.01 Xenopus fork 0.89 1637-1653 1645
    ID head domain
    NO: factor 2
    271
    SEQ V$TBPF/TATA.01 cellular and viral 0.90 1638-1654 1646
    ID TATA box
    NO: elements
    272
    SEQ V$CREB/E4BP4.01 E4BP4, bZIP 0.80 1638-1658 1648
    ID domain,
    NO: transcriptional
    273 repressor
    SEQ V$PDX1/ISL1.01 Pancreatic and 0.82 1644-1664 1654
    ID intestinal lim-
    NO: homeodomain
    274 factor
    SEQ V$COMP/COMP1.01 COMP1, 0.76 1648-1668 1658
    ID cooperates with
    NO: myogenic
    275 proteins in
    multicomponent
    complex
    SEQ V$TBPF/TATA.02 Mammalian C- 0.89 1658-1674 1666
    ID type LTR TATA
    NO: box
    276
    SEQ V$IRFF/ISRE.01 interferon- 0.81 1662-1676 1669
    ID stimulated
    NO: response element
    277
    SEQ V$XBBF/RFX1.01 X-box binding 0.89 1660-1678 1669
    ID protein RFX1
    NO:
    278
    SEQ V$MYT1/MYT1.02 MyT1 zinc finger 0.88 1667-1679 1673
    ID transcription
    NO: factor involved in
    279 primary
    neurogenesis
    SEQ V$OCT1/OCT1.06 octamer-binding 0.80 1683-1697 1690
    ID factor 1
    NO:
    280
    SEQ V$AP1F/TCF11MAFG.01 TCF11/MafG 0.81 1681-1701 1691
    ID heterodimers,
    NO: binding to
    281 subclass of AP1
    sites
    SEQ V$NKXH/MSX2.01 Muscle segment 0.95 1687-1699 1693
    ID homeo box 2,
    NO: homologue of
    282 Drosophila (HOX
    8)
    SEQ V$FAST/FAST1.01 FAST-1 SMAD 0.81 1687-1701 1694
    ID interacting
    NO: protein
    283
    SEQ V$PBXC/PBX1_MEIS1.03 Binding site for a 0.76 1686-1702 1694
    ID Pbx1/Meis1
    NO: heterodimer
    284
    SEQ V$CIZF/NMP4.01 NMP4 (nuclear 0.97 1699-1709 1704
    ID matrix protein 4)/
    NO: CIZ (Cas-
    285 interacting zinc
    finger protein)
    SEQ V$STAT/STAT6.01 STAT6: signal 0.84 1702-1720 1711
    ID transducer and
    NO: activator of
    286 transcription 6
    SEQ V$AP4R/TAL1BETAE47.01 Tal-1beta/E47 0.87 1710-1726 1718
    ID heterodimer
    NO:
    287
    SEQ V$SORY/HMGIY.01 HMGI(Y) high- 0.92 1720-1736 1728
    ID mobility-group
    NO: protein I (Y),
    288 architectural
    transcription
    factor organizing
    the framework of
    a nuclear protein-
    DNA
    transcriptional
    complex
    SEQ V$MYT1/MYT1.01 MyT1 zinc finger 0.75 1723-1735 1729
    ID transcription
    NO: factor involved in
    289 primary
    neurogenesis
    SEQ V$SRFF/SRF.01 serum response 0.66 1728-1746 1737
    ID factor
    NO:
    290
    SEQ V$HOXF/HOXA9.01 Member of the 0.87 1731-1747 1739
    ID vertebrate HOX -
    NO: cluster of
    291 homeobox factors
    SEQ V$HOXT/MEIS1_HOXA9.01 Homeobox 0.79 1734-1746 1740
    ID protein MEIS1
    NO: binding site
    292
    SEQ V$PIT1/PIT1.01 Pit1, GHF-1 0.86 1737-1747 1742
    ID pituitary specific
    NO: pou domain
    293 transcription
    factor
    SEQ V$AP1F/AP1.01 AP1 binding site 0.95 1734-1754 1744
    ID
    NO:
    294
    SEQ V$VBPF/VBP.01 PAR-type chicken 0.86 1746-1756 1751
    ID vitellogenin
    NO: promoter-binding
    295 protein
    SEQ V$FAST/FAST1.01 FAST-1 SMAD 0.81 1757-1771 1764
    ID interacting
    NO: protein
    296
    SEQ V$HOXF/EN1.01 Homeobox 0.77 1759-1775 1767
    ID protein engrailed
    NO: (en-1)
    297
    SEQ V$TBPF/MTATA.01 Muscle TATA box 0.84 1763-1779 1771
    ID
    NO:
    298
    SEQ V$ETSF/ETS2.01 c-Ets-2 binding 0.86 1774-1790 1782
    ID site
    NO:
    299
    SEQ V$MYT1/MYT1.02 MyT1 zinc finger 0.88 1780-1792 1786
    ID transcription
    NO: factor involved in
    300 primary
    neurogenesis
    SEQ V$GFI1/GFI1.01 Growth factor 0.97 1782-1796 1789
    ID independence 1
    NO: zinc finger
    301 protein acts as
    transcriptional
    repressor
    SEQ V$TBPF/TATA.01 cellular and viral 0.90 1784-1800 1792
    ID TATA box
    NO: elements
    302
    SEQ V$BRNF/BRN2.01 POU factor Brn-2 0.91 1786-1802 1794
    ID (N-Oct 3)
    NO:
    303
    SEQ V$HOXT/MEIS1_HOXA9.01 Homeobox 0.79 1788-1800 1794
    ID protein MEIS1
    NO: binding site
    304
    SEQ V$MEF2/AMEF2.01 myocyte 0.80 1783-1805 1794
    ID enhancer factor
    NO:
    305
    SEQ V$OCTB/TST1.01 POU-factor Tst- 0.87 1787-1801 1794
    ID 1/Oct-6
    NO:
    306
    SEQ V$HOXF/HOXA9.01 Member of the 0.87 1787-1803 1795
    ID vertebrate HOX-
    NO: cluster of
    307 homeobox factors
    SEQ V$BRNF/BRN2.01 POU factor Brn-2 0.91 1788-1804 1796
    ID (N-Oct 3)
    NO:
    308
    SEQ V$PARF/DBP.01 Albumin D-box 0.84 1791-1805 1798
    ID binding protein
    NO:
    309
    SEQ V$OCT1/OCT1.02 octamer-binding 0.82 1795-1809 1802
    ID factor 1
    NO:
    310
    SEQ V$FKHD/FREAC2.01 Fork head 0.84 1816-1832 1824
    ID RElated
    NO: ACtivator-2
    311
    SEQ V$SORY/SOX5.01 Sox-5 0.87 1821-1837 1829
    ID
    NO:
    312
    SEQ V$AREB/AREB6.04 AREB6 (Atp1a1 0.98 1837-1849 1843
    ID regulatory
    NO: element binding
    313 factor 6)
    SEQ V$MYT1/MYT1.02 MyT1 zinc finger 0.88 1848-1860 1854
    ID transcription
    NO: factor involved in
    314 primary
    neurogenesis
    SEQ V$RBPF/RBPJK.01 Mammalian 0.84 1851-1865 1858
    ID transcriptional
    NO: repressor RBP-
    315 Jkappa/CBF1
    SEQ V$OCT1/OCT1.02 octamer-binding 0.82 1875-1889 1882
    ID factor 1
    NO:
    316
    SEQ V$FKHD/FREAC4.01 Fork head 0.78 1875-1891 1883
    ID RElated
    NO: ACtivator-4
    317
    SEQ V$EBOX/MYCMAX.02 c-Myc/Max 0.92 1880-1896 1888
    ID heterodimer
    NO:
    318
    SEQ V$PAX6/PAX6.01 Pax-6 paired 0.75 1880-1898 1889
    ID domain protein
    NO:
    319
    SEQ V$IRFF/IRF3.01 Interferon 0.86 1891-1905 1898
    ID regulatory factor
    NO: 3 (IRF-3)
    320
    SEQ V$HNF1/HNF1.02 Hepatic nuclear 0.76 1895-1911 1903
    ID factor 1
    NO:
    321
    SEQ V$FKHD/FREAC2.01 Fork head 0.84 1898-1914 1906
    ID RElated
    NO: ACtivator-2
    322
    SEQ V$E4FF/E4F.01 GLI-Krueppel- 0.82 1902-1914 1908
    ID related
    NO: transcription
    323 factor, regulator
    of adenovirus E4
    promoter
    SEQ V$CREB/CREBP1.01 cAMP-responsive 0.80 1900-1920 1910
    ID element binding
    NO: protein 1
    324
    SEQ V$VBPF/VBP.01 PAR-type chicken 0.86 1905-1915 1910
    ID vitellogenin
    NO: promoter-binding
    325 protein
    SEQ V$MYT1/MYT1.01 MyT1 zinc finger 0.75 1912-1924 1918
    ID transcription
    NO: factor involved in
    326 primary
    neurogenesis
    SEQ V$HNF1/HNF1.01 hepatic nuclear 0.78 1913-1929 1921
    ID factor 1
    NO:
    327
    SEQ V$PCAT/CAAT.01 cellular and viral 0.90 1928-1938 1933
    ID CCAAT box
    NO:
    328
    SEQ V$HNF6/HNF6.01 Liver enriched 0.82 1929-1943 1936
    ID Cut-
    NO: Homeodomain
    329 transcription
    factor HNF6
    (ONECUT)
    SEQ V$PXRF/PXRCAR.01 Halfsite of PXR 0.98 1935-1945 1940
    ID (pregnane X
    NO: receptor)/RXR
    330 resp. CAR
    (constitutive
    androstane
    receptor)/RXR
    heterodimer
    binding site
    SEQ V$RARF/RTR.01 Retinoid 0.81 1934-1952 1943
    ID receptor-related
    NO: testis-associated
    331 receptor
    (GCNF/RTR)
    SEQ V$HOXF/EN1.01 Homeobox 0.77 1936-1952 1944
    ID protein engrailed
    NO: (en-1)
    332
    SEQ V$NKXH/NKX25.01 homeo domain 1.00 1939-1951 1945
    ID factor Nkx-
    NO: 2.5/Csx, tinman
    333 homolog, high
    affinity sites
    SEQ V$GATA/GATA3.02 GATA-binding 0.91 1953-1965 1959
    ID factor 3
    NO:
    334
    SEQ V$TBPF/TATA.01 cellular and viral 0.90 1968-1984 1976
    ID TATA box
    NO: elements
    335
    SEQ V$SRFF/SRF.01 serum response 0.66 1969-1987 1978
    ID factor
    NO:
    336
    SEQ V$CLOX/CDPCR3.01 cut-like 0.75 1972-1988 1980
    ID homeodomain
    NO: protein
    337
    SEQ V$PAX1/PAX1.01 Pax1 paired 0.61 2016-2034 2025
    ID domain protein,
    NO: expressed in the
    338 developing
    vertebral column
    of mouse
    embryos
    SEQ V$TBPF/ATATA.01 Avian C-type LTR 0.81 2019-2035 2027
    ID TATA box
    NO:
    339
    SEQ V$GFI1/GfI1B.01 Growth factor 0.82 2021-2035 2028
    ID independence 1
    NO: zinc finger
    340 protein Gfi-1B
    SEQ V$NRSF/NRSF.01 neuron-restrictive 0.69 2025-2045 2035
    ID silencer factor
    NO:
    341
    SEQ V$NFAT/NFAT.01 Nuclear factor of 0.97 2033-2043 2038
    ID activated T-cells
    NO:
    342
    SEQ V$AREB/AREB6.04 AREB6 (Atp1a1 0.98 2034-2046 2040
    ID regulatory
    NO: element binding
    343 factor 6)
    SEQ V$HNF1/HNF1.01 hepatic nuclear 0.78 2036-2052 2044
    ID factor 1
    NO:
    344
    SEQ V$FKHD/XFD3.01 Xenopus fork 0.82 2038-2054 2046
    ID head domain
    NO: factor 3
    345
    SEQ V$PDX1/PDX1.01 Pdx1 (IDX1/IPF1) 0.74 2036-2056 2046
    ID pancreatic and
    NO: intestinal
    346 homeodomain TF
    SEQ V$OCT1/OCT1.01 octamer-binding 0.77 2050-2064 2057
    ID factor 1
    NO:
    347
    SEQ V$TBPF/TATA.01 cellular and viral 0.90 2053-2069 2061
    ID TATA box
    NO: elements
    348
    SEQ V$ETSF/GABP.01 GABP: GA 0.85 2080-2096 2088
    ID binding protein
    NO:
    349
    SEQ V$BEL1/BEL1.01 Bel-1 similar 0.78 2083-2105 2094
    ID region (defined in
    NO: Lentivirus LTRs)
    350
    SEQ V$VMYB/VMYB.01 v-Myb 0.90 2097-2107 2102
    ID
    NO:
    351
    SEQ V$GREF/ARE.01 Androgene 0.80 2106-2124 2115
    ID receptor binding
    NO: site
    352
    SEQ V$PDX1/PDX1.01 Pdx1 (IDX1/IPF1) 0.74 2137-2157 2147
    ID pancreatic and
    NO: intestinal
    353 homeodomain TF
    SEQ V$MYOD/MYOD.02 myoblast 0.98 2154-2168 2161
    ID determining
    NO: factor
    354
    SEQ V$GATA/GATA1.03 GATA-binding 0.95 2169-2181 2175
    ID factor 1
    NO:
    355
    SEQ V$AP4R/TAL1BETAE47.01 Tal-1beta/E47 0.87 2179-2195 2187
    ID heterodimer
    NO:
    356
    SEQ V$OAZF/ROAZ.01 Rat C2H2 Zn 0.73 2204-2220 2212
    ID finger protein
    NO: involved in
    357 olfactory
    neuronal
    differentiation
    SEQ V$GATA/GATA1.01 GATA-binding 0.96 2217-2229 2223
    ID factor 1
    NO:
    358
    SEQ V$MYOD/E47.02 TAL1/E47 dimers 0.93 2220-2234 2227
    ID
    NO:
    359
    SEQ V$LTUP/TAACC.01 Lentiviral TATA 0.71 2225-2247 2236
    ID upstream
    NO: element
    360
    SEQ V$RREB/RREB1.01 Ras-responsive 0.79 2239-2253 2246
    ID element binding
    NO: protein 1
    361
    SEQ V$OCT1/OCT1.05 octamer-binding 0.90 2251-2265 2258
    ID factor 1
    NO:
    362
    SEQ V$OCT1/OCT1.02 octamer-binding 0.82 2282-2296 2289
    ID factor 1
    NO:
    363
    SEQ V$COUP/COUP.01 COUP 0.81 2284-2298 2291
    ID antagonizes HNF-
    NO: 4 by binding site
    364 competition or
    synergizes by
    direct protein -
    protein
    interaction with
    HNF-4
    SEQ V$MEF2/MEF2.01 myogenic 0.74 2290-2312 2301
    ID enhancer factor 2
    NO:
    365
    SEQ V$CDXF/CDX2.01 Cdx-2 0.84 2296-2314 2305
    ID mammalian
    NO: caudal related
    366 intestinal transcr.
    factor
    SEQ V$MYT1/MYT1.01 MyT1 zinc finger 0.75 2301-2313 2307
    ID transcription
    NO: factor involved in
    367 primary
    neurogenesis
    SEQ V$NFAT/NFAT.01 Nuclear factor of 0.97 2314-2324 2319
    ID activated T-cells
    NO:
    368
    SEQ V$CIZF/NMP4.01 NMP4 (nuclear 0.97 2317-2327 2322
    ID matrix protein 4)/
    NO: CIZ (Cas-
    369 interacting zinc
    finger protein)
    SEQ V$GATA/GATA3.02 GATA-binding 0.91 2326-2338 2332
    ID factor 3
    NO:
    370
    SEQ V$HMTB/MTBF.01 muscle-specific 0.90 2351-2359 2355
    ID Mt binding site
    NO:
    371
    SEQ V$NOLF/OLF1.01 olfactory neuron- 0.82 2350-2372 2361
    ID specific factor
    NO:
    372
    SEQ V$PDX1/PDX1.01 Pdx1 (IDX1/IPF1) 0.74 2363-2383 2373
    ID pancreatic and
    NO: intestinal
    373 homeodomain TF
    SEQ V$GATA/GATA3.02 GATA-binding 0.91 2395-2407 2401
    ID factor 3
    NO:
    374
    SEQ V$NFAT/NFAT.01 Nuclear factor of 0.97 2406-2416 2411
    ID activated T-cells
    NO:
    375
    SEQ V$OCTP/OCT1P.01 octamer-binding 0.86 2433-2445 2439
    ID factor 1, POU-
    NO: specific domain
    376
    SEQ V$MITF/MIT.01 MIT 0.81 2438-2456 2447
    ID (microphthalmia
    NO: transcription
    377 factor) and TFE3
    SEQ V$PAX8/PAX8.01 PAX 2/5/8 0.88 2441-2453 2447
    ID binding site
    NO:
    378
    SEQ V$TBPF/ATATA.01 Avian C-type LTR 0.81 2451-2467 2459
    ID TATA box
    NO:
    379
    SEQ V$GATA/GATA3.02 GATA-binding 0.91 2462-2474 2468
    ID factor 3
    NO:
    380
    SEQ V$CLOX/CLOX.01 Clox 0.81 2462-2478 2470
    ID
    NO:
    381
    SEQ V$HNF6/HNF6.01 Liver enriched 0.82 2464-2478 2471
    ID Cut -
    NO: Homeodomain
    382 transcription
    factor HNF6
    (ONECUT)
    SEQ V$PIT1/PIT1.01 Pit1, GHF-1 0.86 2468-2478 2473
    ID pituitary specific
    NO: pou domain
    383 transcription
    factor
    SEQ V$AP4R/TAL1BETAITF2.01 Tal-1beta/ITF-2 0.85 2469-2485 2477
    ID heterodimer
    NO:
    384
    SEQ V$CIZF/NMP4.01 NMP4 (nuclear 0.97 2477-2487 2482
    ID matrix protein 4)/
    NO: CIZ (Cas-
    385 interacting zinc
    finger protein)
    SEQ V$NFAT/NFAT.01 Nuclear factor of 0.97 2480-2490 2485
    ID activated T-cells
    NO:
    386
    SEQ V$STAT/STAT.01 signal 0.87 2479-2497 2488
    ID transducers and
    NO: activators of
    387 transcription
    SEQ V$TBPF/TATA.02 Mammalian C- 0.89 2484-2500 2492
    ID type LTR TATA
    NO: box
    388
    SEQ V$FKHD/XFD3.01 Xenopus fork 0.82 2501-2517 2509
    ID head domain
    NO: factor 3
    389
    SEQ V$AP1F/AP1.01 AP1 binding site 0.95 2500-2520 2510
    ID
    NO:
    390
    SEQ V$AP1F/AP1.01 AP1 binding site 0.95 2504-2524 2514
    ID
    NO:
    391
    SEQ V$PCAT/CAAT.01 cellular and viral 0.90 2513-2523 2518
    ID CCAAT box
    NO:
    392
    SEQ V$CDXF/CDX2.01 Cdx-2 0.84 2524-2542 2533
    ID mammalian
    NO: caudal related
    393 intestinal transcr.
    factor
    SEQ V$MYT1/MYT1.02 MyT1 zinc finger 0.88 2539-2551 2545
    ID transcription
    NO: factor involved in
    394 primary
    neurogenesis
    SEQ V$ETSF/FLI.01 ETS family 0.81 2560-2576 2568
    ID member FLI
    NO:
    395
    SEQ V$MYT1/MYT1.01 MyT1 zinc finger 0.75 2569-2581 2575
    ID transcription
    NO: factor involved in
    396 primary
    neurogenesis
    SEQ V$TBPF/ATATA.01 Avian C-type LTR 0.81 2576-2592 2584
    ID TATA box
    NO:
    397
    SEQ V$SATB/SATB1.01 Special AT-rich 0.93 2578-2594 2586
    ID sequence-binding
    NO: protein 1,
    398 predominantly
    expressed in
    thymocytes,
    binds to matrix
    attachment
    regions (MARs)
    SEQ V$NKXH/NKX31.01 prostate-specific 0.84 2584-2596 2590
    ID homeodomain
    NO: protein NKX3.1
    399
    SEQ V$PARF/DBP.01 Albumin D-box 0.84 2589-2603 2596
    ID binding protein
    NO:
    400
    SEQ V$PAX5/PAX5.02 B-cell-specific 0.75 2591-2619 2605
    ID activating protein
    NO:
    401
    SEQ V$ECAT/NFY.03 nuclear factor Y 0.80 2604-2618 2611
    ID (Y-box binding
    NO: factor)
    402
    SEQ V$GFI1/GFI1.01 Growth factor 0.97 2608-2622 2615
    ID independence 1
    NO: zinc finger
    403 protein acts as
    transcriptional
    repressor
    SEQ V$HNF6/HNF6.01 Liver enriched 0.82 2608-2622 2615
    ID Cut -
    NO: Homeodomain
    404 transcription
    factor HNF6
    (ONECUT)
    SEQ V$MYT1/MYT1.01 MyT1 zinc finger 0.75 2610-2622 2616
    ID transcription
    NO: factor involved in
    405 primary
    neurogenesis
    SEQ V$PAX8/PAX8.01 PAX 2/5/8 0.88 2610-2622 2616
    ID binding site
    NO:
    406
    SEQ V$TTFF/TTF1.01 Thyroid 0.92 2609-2623 2616
    ID transcription
    NO: factor-1 (TTF1)
    407 binding site
    SEQ V$MYT1/MYT1.02 MyT1 zinc finger 0.88 2612-2624 2618
    ID transcription
    NO: factor involved in
    408 primary
    neurogenesis
    SEQ V$CDXF/CDX2.01 Cdx-2 0.84 2612-2630 2621
    ID mammalian
    NO: caudal related
    409 intestinal transcr.
    factor
    SEQ V$SORY/HMGIY.01 HMGI(Y) high- 0.92 2649-2665 2657
    ID mobility-group
    NO: protein I (Y),
    410 architectural
    transcription
    factor organizing
    the framework of
    a nuclear protein-
    DNA
    transcriptional
    complex
    SEQ V$HOXF/EN1.01 Homeobox 0.77 2657-2673 2665
    ID protein engrailed
    NO: (en-1)
    411
    SEQ V$OCT1/OCT1.06 octamer-binding 0.80 2662-2676 2669
    ID factor 1
    NO:
    412
    SEQ V$BCL6/BCL6.01 POZ/zinc finger 0.76 2683-2699 2691
    ID protein,
    NO: transcriptional
    413 repressor,
    translocations
    observed in
    diffuse large cell
    lymphoma
    SEQ V$OCTP/OCT1P.01 octamer-binding 0.86 2715-2727 2721
    ID factor 1, POU-
    NO: specific domain
    414
    SEQ V$TEAF/TEF1.01 TEF-1 related 0.84 2722-2734 2728
    ID muscle factor
    NO:
    415
    SEQ V$GFI1/GFI1.01 Growth factor 0.97 2723-2737 2730
    ID independence 1
    NO: zinc finger
    416 protein acts as
    transcriptional
    repressor
    SEQ V$HOXT/MEIS1_HOXA9.01 Homeobox 0.79 2729-2741 2735
    ID protein MEIS1
    NO: binding site
    417
    SEQ V$HOXF/HOXA9.01 Member of the 0.87 2728-2744 2736
    ID vertebrate HOX -
    NO: cluster of
    418 homeobox factors
    SEQ V$PARF/DBP.01 Albumin D-box 0.84 2729-2743 2736
    ID binding protein
    NO:
    419
    SEQ V$VBPF/VBP.01 PAR-type chicken 0.86 2732-2742 2737
    ID vitellogenin
    NO: promoter-binding
    420 protein
    SEQ V$CREB/E4BP4.01 E4BP4, bZIP 0.80 2728-2748 2738
    ID domain,
    NO: transcriptional
    421 repressor
    SEQ V$OCT1/OCT1.01 octamer-binding 0.77 2733-2747 2740
    ID factor 1
    NO:
    422
    SEQ V$FKHD/XFD1.01 Xenopus fork 0.90 2733-2749 2741
    ID head domain
    NO: factor 1
    423
    SEQ V$SRFF/SRF.01 serum response 0.66 2736-2754 2745
    ID factor
    NO:
    424
    SEQ V$OCTP/OCT1P.01 octamer-binding 0.86 2746-2758 2752
    ID factor 1, POU-
    NO: specific domain
    425
    SEQ V$CLOX/CDPCR3.01 cut-like 0.75 2748-2764 2756
    ID homeodomain
    NO: protein
    426
    SEQ V$TBPF/TATA.01 cellular and viral 0.90 2749-2765 2757
    ID TATA box
    NO: elements
    427
    SEQ V$SRFF/SRF.01 serum response 0.66 2750-2768 2759
    ID factor
    NO:
    428
    SEQ V$TBPF/ATATA.01 Avian C-type LTR 0.81 2759-2775 2767
    ID TATA box
    NO:
    429
    SEQ V$TBPF/TATA.02 Mammalian C- 0.89 2762-2778 2770
    ID type LTR TATA
    NO: box
    430
    SEQ V$CABL/CABL.01 Multifunctional c- 0.97 2769-2779 2774
    ID Abl src type
    NO: tyrosine kinase
    431
    SEQ V$LEFF/LEF1.01 TCF/LEF-1, 0.86 2766-2782 2774
    ID involved in the
    NO: Wnt signal
    432 transduction
    pathway
    SEQ V$OCT1/OCT1.06 octamer-binding 0.80 2775-2789 2782
    ID factor 1
    NO:
    433
    SEQ V$MEF2/MMEF2.01 myocyte 0.90 2776-2798 2787
    ID enhancer factor
    NO:
    434
    SEQ V$OCT1/OCT1.06 octamer-binding 0.80 2780-2794 2787
    ID factor 1
    NO:
    435
    SEQ V$TBPF/TATA.01 cellular and viral 0.90 2779-2795 2787
    ID TATA box
    NO: elements
    436
    SEQ V$CART/CART1.01 Cart-1 (cartilage 0.84 2780-2796 2788
    ID homeoprotein 1)
    NO:
    437
    SEQ V$FKHD/XFD2.01 Xenopus fork- 0.89 2780-2796 2788
    ID head domain
    NO: factor 2
    438
    SEQ V$MEF2/MEF2.05 MEF2 0.96 2778-2800 2789
    ID
    NO:
    439
    SEQ V$BRNF/BRN3.01 POU transcription 0.78 2785-2801 2793
    ID factor Brn-3
    NO:
    440
    SEQ V$TBPF/TATA.01 cellular and viral 0.90 2786-2802 2794
    ID TATA box
    NO: elements
    441
    SEQ V$GFI1/GFI1.01 Growth factor 0.97 2791-2805 2798
    ID independence 1
    NO: zinc finger
    442 protein acts as
    transcriptional
    repressor
    SEQ V$HOXT/MEIS1_HOXA9.01 Homeobox 0.79 2797-2809 2803
    ID protein MEIS1
    NO: binding site
    443
    SEQ V$MEF2/MMEF2.01 myocyte 0.90 2792-2814 2803
    ID enhancer factor
    NO:
    444
    SEQ V$MEF2/MEF2.05 MEF2 0.96 2795-2817 2806
    ID
    NO:
    445
    SEQ V$MEF2/MMEF2.01 myocyte 0.90 2797-2819 2808
    ID enhancer factor
    NO:
    446
    SEQ V$HNF1/HNF1.01 hepatic nuclear 0.78 2802-2818 2810
    ID factor 1
    NO:
    447
    SEQ V$MEF2/MEF2.01 myogenic 0.74 2799-2821 2810
    ID enhancer factor 2
    NO:
    448
    SEQ V$HOXF/HOX1-3.01 Hox-1.3, 0.83 2814-2830 2822
    ID vertebrate
    NO: homeobox
    449 protein
    SEQ V$PARF/DBP.01 Albumin D-box 0.84 2816-2830 2823
    ID binding protein
    NO:
    450
    SEQ V$PDX1/ISL1.01 Pancreatic and 0.82 2814-2834 2824
    ID intestinal lim-
    NO: homeodomain
    451 factor
    SEQ V$GATA/GATA1.02 GATA-binding 0.99 2819-2831 2825
    ID factor 1
    NO:
    452
    SEQ V$HEAT/HSF1.01 heat shock factor 1 0.93 2845-2855 2850
    ID
    NO:
    453
    SEQ V$MYT1/MYT1.02 MyT1 zinc finger 0.88 2853-2865 2859
    ID transcription
    NO: factor involved in
    454 primary
    neurogenesis
    SEQ V$BCL6/BCL6.02 POZ/zinc finger 0.77 2857-2873 2865
    ID protein,
    NO: transcriptional
    455 repressor,
    translocations
    observed in
    diffuse large cell
    lymphoma
    SEQ V$TTFF/TTF1.01 Thyroid 0.92 2863-2877 2870
    ID transcription
    NO: factor-1 (TTF1)
    456 binding site
    SEQ V$EBOX/USF.02 upstream 0.94 2868-2884 2876
    ID stimulating factor
    NO:
    457
    SEQ V$HOXF/PTX1.01 Pituitary 0.79 2892-2908 2900
    ID Homeobox 1
    NO: (Ptx1)
    458
    SEQ V$MYOD/LMO2COM.01 complex of Lmo2 0.98 2901-2915 2908
    ID bound to Tal-1,
    NO: E2A proteins, and
    459 GATA-1, half-site 1
    SEQ V$REBV/EBVR.01 Epstein-Barr 0.81 2904-2924 2914
    ID virus
    NO: transcription
    460 factor R
    SEQ V$ETSF/PU1.01 Pu.1 (Pu120) Ets- 0.86 2932-2948 2940
    ID like transcription
    NO: factor identified
    461 in lymphoid B-
    cells
    SEQ V$MITF/MIT.01 MIT 0.81 2943-2961 2952
    ID (microphthalmia
    NO: transcription
    462 factor) and TFE3
    SEQ V$HAML/AML1.01 runt-factor AML-1 1.00 2950-2964 2957
    ID
    NO:
    463
    SEQ V$NFKB/CREL.01 c-Rel 0.91 2954-2968 2961
    ID
    NO:
    464
    SEQ V$IKRS/IK3.01 Ikaros 3, 0.84 2958-2970 2964
    ID potential
    NO: regulator of
    465 lymphocyte
    differentiation
    SEQ V$RBPF/RBPJK.01 Mammalian 0.84 2957-2971 2964
    ID transcriptional
    NO: repressor RBP-
    466 Jkappa/CBF1
    SEQ V$E2FF/E2F.01 E2F, involved in 0.74 2966-2980 2973
    ID cell cycle
    NO: regulation,
    467 interacts with Rb
    p107 protein
    SEQ V$E4FF/E4F.01 GLI-Krueppel- 0.82 2968-2980 2974
    ID related
    NO: transcription
    468 factor, regulator
    of adenovirus E4
    promoter
    SEQ V$CREB/ATF6.02 Activating 0.85 2966-2986 2976
    ID transcription
    NO: factor 6, member
    469 of b-zip family,
    induced by ER
    stress
    SEQ V$EBOX/ARNT.01 AhR nuclear 0.89 2968-2984 2976
    ID translocator
    NO: homodimers
    470
    SEQ V$E4FF/E4F.01 GLI-Krueppel- 0.82 2971-2983 2977
    ID related
    NO: transcription
    471 factor, regulator
    of adenovirus E4
    promoter
    SEQ V$EBOR/XBP1.01 X-box-binding 0.86 2970-2984 2977
    ID protein 1
    NO:
    472
    SEQ V$E2FF/E2F.01 E2F, involved in 0.74 2971-2985 2978
    ID cell cycle
    NO: regulation,
    473 interacts with Rb
    p107 protein
    SEQ V$STAT/STAT.01 signal 0.87 2989-3007 2998
    ID transducers and
    NO: activators of
    474 transcription
    SEQ V$BCL6/BCL6.02 POZ/zinc finger 0.77 2991-3007 2999
    ID protein,
    NO: transcriptional
    475 repressor,
    translocations
    observed in
    diffuse large cell
    lymphoma
    SEQ V$XSEC/STAF.01 Se-Cys tRNA 0.77 3003-3025 3014
    ID gene
    NO: transcription
    476 activating factor
    SEQ V$NF1F/NF1.01 Nuclear factor 1 0.94 3007-3025 3016
    ID
    NO:
    477
    SEQ V$OCT1/OCT1.02 octamer-binding 0.82 3014-3028 3021
    ID factor 1
    NO:
    478
    SEQ V$RCAT/CLTR_CAAT.01 Mammalian C- 0.75 3019-3043 3031
    ID type LTR CCAAT
    NO: box
    479
    SEQ V$SF1F/SF1.01 SF1 steroidogenic 0.95 3033-3045 3039
    ID factor 1
    NO:
    480
    SEQ V$OCT1/OCT1.01 octamer-binding 0.77 3038-3052 3045
    ID factor 1
    NO:
    481
    SEQ V$PARF/DBP.01 Albumin D-box 0.84 3042-3056 3049
    ID binding protein
    NO:
    482
    SEQ V$ETSF/ETS1.01 c-Ets-1 binding 0.92 3057-3073 3065
    ID site
    NO:
    483
    SEQ V$LEFF/LEF1.01 TCF/LEF-1, 0.86 3062-3078 3070
    ID involved in the
    NO: Wnt signal
    484 transduction
    pathway
    SEQ V$MAZF/MAZ.01 Myc associated 0.90 3072-3084 3078
    ID zinc finger
    NO: protein (MAZ)
    485
    SEQ V$SP1F/GC.01 GC box elements 0.88 3071-3085 3078
    ID
    NO:
    486
    SEQ V$TBPF/TATA.01 cellular and viral 0.90 3091-3107 3099
    ID TATA box
    NO:
    487 elements
    SEQ V$SEF1/SEF1.01 SEF1 binding site 0.69 3099-3117 3108
    ID
    NO:
    488
    SEQ
    ID Core Matrix
    NO: Str. sim. sim. Sequence
    SEQ (+) 1.000 0.900 ggaccatCAAAgtctgt
    ID
    NO:
    41
    SEQ (+) 1.000 0.823 agtctgtCATGtcatttgg
    ID
    NO:
    42
    SEQ (+) 0.833 0.904 gTCATgtcatttggg
    ID
    NO:
    43
    SEQ (+) 1.000 1.000 GTCAttt
    ID
    NO:
    44
    SEQ (+) 1.000 0.735 ctgtcatgtcatTTGGgggagggcctatg
    ID
    NO:
    45
    SEQ (−) 1.000 0.982 gccctCCCCcaaa
    ID
    NO:
    46
    SEQ (+) 0.876 0.898 tgggGGAGggcctat
    ID
    NO:
    47
    SEQ (−) 0.884 0.708 acagaggagggcATAGgccct
    ID
    NO:
    48
    SEQ (−) 0.800 0.811 cagataCACAgaggagggcataggccctc
    ID
    NO:
    49
    SEQ (−) 1.000 0.987 tgctattTAAGcccaga
    ID
    NO:
    50
    SEQ (−) 1.000 0.932 tgctATTTa
    ID
    NO:
    51
    SEQ (−) 0.750 0.865 ggtatgctATTTaag
    ID
    NO:
    52
    NO:
    65
    SEQ (−) 1.000 0.894 ttatAAAGctgagga
    ID
    NO:
    66
    SEQ (−) 1.000 0.910 agttaTAAAgctgagga
    ID
    NO:
    67
    SEQ (−) 1.000 0.902 agtgAAAGcagagag
    ID
    NO:
    68
    SEQ (−) 0.750 0.756 craCAGTtgacct
    ID
    NO:
    69
    SEQ (+) 1.000 0.940 GTCTtgact
    ID
    NO:
    70
    SEQ (−) 1.000 0.960 gagggATTAgaaaagga
    ID
    NO:
    71
    SEQ (−) 1.000 0.906 ggaatCCAAtygtag
    ID
    NO:
    72
    SEQ (+) 0.789 0.802 ctacraTTGGattccat
    ID
    NO:
    73
    SEQ (−) 1.000 0.897 tacagcTAAAcactgag
    ID
    NO:
    74
    SEQ (−) 0.840 0.865 gagcctTCATccagtagct
    ID
    NO:
    75
    SEQ (−) 1.000 0.746 tgtcatcttagagCCTTcatc
    ID
    NO:
    76
    SEQ (+) 1.000 0.861 agGCTCtaagatg
    ID
    NO:
    77
    SEQ (+) 0.750 0.837 tcTAAGatgacaattaa
    ID
    NO:
    78
    SEQ (+) 0.807 0.840 aaGATGacaattaag
    ID
    NO:
    79
    SEQ (+) 1.000 0.994 gacaATTAa
    ID
    NO:
    80
    SEQ (−) 1.000 1.000 cctTAATtgtcat
    ID
    NO:
    81
    SEQ (−) 0.766 0.808 cgacgattACCTtaattgtca
    ID
    NO:
    82
    SEQ (−) 0.750 0.768 aatgaggATCGacgattacct
    ID
    NO:
    83
    SEQ (+) 1.000 0.886 cgatcctcATTAtagtg
    ID
    NO:
    84
    SEQ (+) 1.000 0.868 tatagtGGAAgggcttc
    ID
    NO:
    85
    SEQ (+) 1.000 0.904 agggcttCAAAggcagt
    ID
    NO:
    86
    SEQ (−) 0.758 0.867 gagacTGCCtttgaagccc
    ID
    NO:
    87
    SEQ (−) 1.000 0.971 ttcaGATAggcag
    ID
    NO:
    88
    SEQ (−) 0.757 0.672 atgttcaGATAggcagtag
    ID
    NO:
    89
    SEQ (−) 0.800 0.824 gGAAAtgttcagatagg
    ID
    NO:
    90
    SEQ (+) 1.000 0.951 cctaatgCCAGatgtct
    ID
    NO:
    91
    SEQ (+) 1.000 0.852 aatgcCAGAtgtctctt
    ID
    NO:
    92
    SEQ (−) 1.000 0.851 gagaCATCtggca
    ID
    NO:
    93
    SEQ (−) 1.000 0.984 aggataggttTAAAgagacatct
    ID
    NO:
    94
    SEQ (−) 1.000 0.774 gGATAggtttaaagaga
    ID
    NO:
    95
    SEQ (+) 1.000 0.813 tgtctcttTAAAcctatcctggc
    ID
    NO:
    96
    SEQ (+) 1.000 0.877 ctcttTAAAcctatcct
    ID
    NO:
    97
    SEQ (+) 1.000 0.845 ctcccttcATTAaggta
    ID
    NO:
    98
    SEQ (−) 1.000 0.834 gagatacctTAATgaagggag
    ID
    NO:
    99
    SEQ (+) 0.944 0.926 gGTATctcatttttt
    ID
    NO:
    100
    SEQ (−) 1.000 0.972 gcAAAAaatga
    ID
    NO:
    101
    SEQ (−) 0.764 0.720 ggaaCAGAggagagcaa
    ID
    NO:
    102
    SEQ (−) 0.881 0.964 aaaactgaATCAgtggnggaa
    ID
    NO:
    103
    SEQ (+) 1.000 0.886 actgATTCagt
    ID
    NO:
    104
    SEQ (+) 0.850 0.956 nccactgaTTCAgtttttctg
    ID
    NO:
    105
    SEQ (−) 0.876 0.910 aaaAACTgaat
    ID
    NO:
    106
    SEQ (−) 1.000 0.975 agAAAAactga
    ID
    NO:
    107
    SEQ (+) 1.000 0.875 ctgatccctctTGTTctcc
    ID
    NO:
    108
    SEQ (−) 1.000 0.971 gaaaaagagaAGGGa
    ID
    NO:
    109
    SEQ (−) 1.000 0.987 ggAAAAagaga
    ID
    NO:
    110
    SEQ (−) 1.000 0.982 ggagGAAAaag
    ID
    NO:
    111
    SEQ (−) 1.000 0.910 ggtgGAGGgaagg
    ID
    NO:
    112
    SEQ (−) 1.000 0.932 gggggTGGGagggtg
    ID
    NO:
    113
    SEQ (+) 1.000 0.972 tcccaCCCCcatg
    ID
    NO:
    114
    SEQ (−) 1.000 0.815 aggaagggGAAAggg
    ID
    NO:
    115
    SEQ (−) 1.000 0.911 aaaataggAAATaagga
    ID
    NO:
    116
    SEQ (−) 1.000 0.883 aaaataGGAAataagga
    ID
    NO:
    117
    SEQ (−) 0.760 0.792 aGAGAaaataggaaata
    ID
    NO:
    118
    SEQ (−) 0.763 0.817 cccccagagaaAATAgg
    ID
    NO:
    119
    SEQ (−) 1.000 0.934 ccacaCCCCcaga
    ID
    NO:
    120
    SEQ (+) 0.983 0.894 gggtgtgGATTttat
    ID
    NO:
    121
    SEQ (−) 1.000 0.942 caccaTAAAatccacac
    ID
    NO:
    122
    SEQ (−) 0.866 0.813 aacataTGCAcagaagggcttccaccata
    ID
    NO:
    123
    SEQ (−) 1.000 0.790 catATGCacagaagg
    ID
    NO:
    124
    SEQ (−) 1.000 0.910 caacatATGCaca
    ID
    NO:
    125
    SEQ (+) 0.757 0.666 ctgtgcaTATGttgtctta
    ID
    NO:
    126
    SEQ (−) 0.750 0.828 caataagacaaCATAtg
    ID
    NO:
    127
    SEQ (−) 1.000 0.776 ccAATAagacaacatat
    ID
    NO:
    128
    SEQ (−) 1.000 0.836 tcaaccaatAAGAcaac
    ID
    NO:
    129
    SEQ (−) 1.000 0.960 atcaaCCAAtaagac
    ID
    NO:
    130
    SEQ (+) 1.000 0.844 tcttatTGGTtgata
    ID
    NO:
    131
    SEQ (−) 1.000 0.943 tcaaCCAAtaa
    ID
    NO:
    132
    SEQ (+) 1.000 0.956 ggttGATAaataa
    ID
    NO:
    133
    SEQ (+) 0.757 0.791 gGTTGataaataaagca
    ID
    NO:
    134
    SEQ (−) 0.750 0.797 gTGCTttatttat
    ID
    NO:
    135
    SEQ (+) 1.000 0.912 gttgtCCAAtaggga
    ID
    NO:
    136
    SEQ (+) 0.750 0.843 aataggGAAAcaagata
    ID
    NO:
    137
    SEQ (+) 1.000 0.960 tagggaaacaAGATagg
    ID
    NO:
    138
    SEQ (+) 1.000 0.970 acaaGATAggtgg
    ID
    NO:
    139
    SEQ (−) 0.750 0.867 cccaCCTAtct
    ID
    NO:
    140
    SEQ (−) 1.000 0.929 ggatcacatgGCAAccctc
    ID
    NO:
    141
    SEQ (−) 0.895 0.936 ggatCACAtggcaacc
    ID
    NO:
    142
    SEQ (+) 1.000 0.863 gggttgcCATGtgatccta
    ID
    NO:
    143
    SEQ (+) 1.000 0.950 ctaggaGGAAttgacac
    ID
    NO:
    144
    SEQ (−) 1.000 0.800 catgtgtcAATTcct
    ID
    NO:
    145
    SEQ (−) 1.000 1.000 tGTCAat
    ID
    NO:
    146
    SEQ (−) 1.000 0.835 ccattctCATGtgtcaatt
    ID
    NO:
    147
    SEQ (+) 0.846 0.800 caCATGagaatgggg
    ID
    NO:
    148
    SEQ (+) 1.000 0.998 gaaaGATAagtcc
    ID
    NO:
    149
    SEQ (−) 1.000 0.672 atattttTATAaggactta
    ID
    NO:
    150
    SEQ (−) 1.000 0.867 atatattTTTAtaaggact
    ID
    NO:
    151
    SEQ (+) 1.000 0.894 tccttaTAAAaatatat
    ID
    NO:
    152
    SEQ (+) 1.000 0.740 agtccttaTAAAaatatatatta
    ID
    NO:
    153
    SEQ (+) 1.000 0.963 ccttaTAAAaatatata
    ID
    NO:
    154
    SEQ (−) 1.000 0.870 acTAATatatattttta
    ID
    NO:
    155
    SEQ (−) 1.000 0.855 caTAATtactaatatat
    ID
    NO:
    156
    SEQ (−) 1.000 0.943 cataattacTAATatat
    ID
    NO:
    157
    SEQ (−) 1.000 0.816 cccATAAttactaatat
    ID
    NO:
    158
    SEQ (−) 0.757 0.765 ccCATAattactaatat
    ID
    NO:
    159
    SEQ (+) 1.000 0.989 agtaATTAt
    ID
    NO:
    160
    SEQ (−) 1.000 0.976 ccatAATTactaa
    ID
    NO:
    161
    SEQ (−) 1.000 0.886 aacccataATTActaat
    ID
    NO:
    162
    SEQ (−) 1.000 0.775 attaacccaTAATtactaata
    ID
    NO:
    163
    SEQ (+) 0.826 0.844 tatgggttAATAattaa
    ID
    NO:
    164
    SEQ (−) 0.755 0.857 aCTTAattattaaccca
    ID
    NO:
    165
    SEQ (+) 1.000 0.966 gGTTAataattaagtca
    ID
    NO:
    166
    SEQ (+) 1.000 0.972 taatAATTaag
    ID
    NO:
    167
    SEQ (−) 1.000 0.995 cttaATTAt
    ID
    NO:
    168
    SEQ (−) 1.000 0.873 ctgacttaATTAttaac
    ID
    NO:
    169
    SEQ (+) 1.000 0.988 taatAATaagtc
    ID
    NO:
    170
    SEQ (+) 1.000 0.931 taataATTAagtc
    ID
    NO:
    171
    SEQ (+) 1.000 0.881 taataatTAAGtcagag
    ID
    NO:
    172
    SEQ (−) 0.766 0.819 tagctctgACTTaattattaa
    ID
    NO:
    173
    SEQ (+) 0.750 0.874 ataattaAGTCagagct
    ID
    NO:
    174
    SEQ (+) 0.856 0.928 ctagCCATtaa
    ID
    NO:
    175
    SEQ (−) 1.000 0.903 tctTAATggctag
    ID
    NO:
    176
    SEQ (−) 0.770 0.842 ctagtGTTTcttaatggctag
    ID
    NO:
    177
    SEQ (+) 1.000 0.891 gcttcataATTAatata
    ID
    NO:
    178
    SEQ (−) 1.000 0.995 attaATTAt
    ID
    NO:
    179
    SEQ (+) 1.000 0.988 tcatAATTaatat
    ID
    NO:
    180
    SEQ (+) 1.000 0.952 tcataATTAatat
    ID
    NO:
    181
    SEQ (+) 1.000 0.945 ttcataatTAATatagt
    ID
    NO:
    182
    SEQ (−) 1.000 0.885 actatattAATTatg
    ID
    NO:
    183
    SEQ (−) 1.000 0.854 gatactatATTAattat
    ID
    NO:
    184
    SEQ (+) 0.750 0.875 tgtatgttCATTtgg
    ID
    NO:
    185
    SEQ (+) 0.850 0.887 gtatgttCATTtggg
    ID
    NO:
    186
    SEQ (−) 1.000 0.816 cCCCAaatgaacata
    ID
    NO:
    187
    SEQ (−) 1.000 0.849 tcagcccCAAAtgaa
    ID
    NO:
    188
    SEQ (+) 1.000 0.828 tggggcTGACacagttctggg
    ID
    NO:
    189
    SEQ (+) 1.000 0.833 ggggcTGACacagttctggga
    ID
    NO:
    190
    SEQ (+) 0.750 0.791 aggAAGAytactt
    ID
    NO:
    191
    SEQ (−) 0.804 0.820 cctacaATCCatgtacc
    ID
    NO:
    192
    SEQ (−) 1.000 0.864 atagagCAAAggactac
    ID
    NO:
    193
    SEQ (−) 1.000 0.907 catagagCAAAggacta
    ID
    NO:
    194
    SEQ (−) 1.000 0.700 tagacatagagcAAAGgacta
    ID
    NO:
    195
    SEQ (+) 0.804 0.831 gtctaaATCCatatatg
    ID
    NO:
    196
    SEQ (+) 0.833 0.929 ctaaaTCCAtatatg
    ID
    NO:
    197
    SEQ (+) 1.000 0.851 aaatCCATatatgaatgag
    ID
    NO:
    198
    SEQ (−) 1.000 0.761 actcattcatatATGGa
    ID
    NO:
    199
    SEQ (−) 1.000 0.919 actcATTCata
    ID
    NO:
    200
    SEQ (−) 0.807 0.901 tggtATGTa
    ID
    NO:
    201
    SEQ (−) 1.000 0.922 gaaagayAAACatggta
    ID
    NO:
    202
    SEQ (−) 0.789 0.898 gtgAGGTaacccc
    ID
    NO:
    203
    SEQ (+) 1.000 0.854 atgggGTTAcctcactcagga
    ID
    NO:
    204
    SEQ (+) 1.000 0.903 gTTACctcact
    ID
    NO:
    205
    SEQ (−) 0.758 0.870 cgcAGGCaaatgaat
    ID
    NO:
    206
    SEQ (+) 0.758 0.850 tcattTGCCtgcgaatttt
    ID
    NO:
    207
    SEQ (+) 1.000 0.869 tgcgaatTTTAagattcca
    ID
    NO:
    208
    SEQ (−) 1.000 0.990 taaaaCAATggaatctt
    ID
    NO:
    209
    SEQ (−) 1.000 0.931 aggaataaAACAatgga
    ID
    NO:
    210
    SEQ (+) 1.000 0.865 ccattgtTTTAttcctctg
    ID
    NO:
    211
    SEQ (−) 0.894 0.876 gaggAATAaaacaat
    ID
    NO:
    212
    SEQ (+) 1.000 0.824 tcctctgagTAATactccatt
    ID
    NO:
    213
    SEQ (−) 1.000 0.925 ttacaCAATggagtatt
    ID
    NO:
    214
    SEQ (−) 0.901 0.920 ggtacATTAcacaatggagta
    ID
    NO:
    215
    SEQ (−) 1.000 0.871 aTTACacaatg
    ID
    NO:
    216
    SEQ (+) 0.929 0.955 tccattgtGTAAtgtacca
    ID
    NO:
    217
    SEQ (+) 1.000 0.859 tccattgtgTAATgtaccaca
    ID
    NO:
    218
    SEQ (−) 1.000 1.000 aaaatgTGGTacatt
    ID
    NO:
    219
    SEQ (+) 0.750 0.819 aatgtaccacaTTTTctcc
    ID
    NO:
    220
    SEQ (+) 1.000 0.896 taCATTcttcagt
    ID
    NO:
    221
    SEQ (+) 1.000 0.990 caGTTGagg
    ID
    NO:
    222
    SEQ (−) 1.000 0.932 gcaatagCCAGaacctg
    ID
    NO:
    223
    SEQ (−) 1.000 0.945 gcaatagCCAG
    ID
    NO:
    224
    SEQ (−) 1.000 0.951 attTGCAatagcc
    ID
    NO:
    225
    SEQ (+) 1.000 0.853 tggctattGCAAataaccc
    ID
    NO:
    226
    SEQ (+) 1.000 0.809 ctggctattgcAAATaaccctgc
    ID
    NO:
    227
    SEQ (+) 1.000 0.889 ctattgcAAATaacc
    ID
    NO:
    228
    SEQ (−) 1.000 0.900 ggttATTTg
    ID
    NO:
    229
    SEQ (+) 0.975 0.761 acatatgtcattATTGt
    ID
    NO:
    230
    SEQ (+) 0.944 0.938 cATATgtcattattg
    ID
    NO:
    231
    SEQ (+) 1.000 0.836 catatgtcATTAttgta
    ID
    NO:
    232
    SEQ (−) 1.000 0.889 ttcatacaaTAATgacatatg
    ID
    NO:
    233
    SEQ (−) 1.000 0.870 tcataCAATaatgacat
    ID
    NO:
    234
    SEQ (−) 0.944 0.914 aATATgtaaaacaga
    ID
    NO:
    235
    SEQ (−) 1.000 0.856 tttaaaatatGTAAaacagat
    ID
    NO:
    236
    SEQ (+) 1.000 0.886 tTTACatattt
    ID
    NO:
    237
    SEQ (+) 1.000 0.841 tatttTAAAccatctct
    ID
    NO:
    238
    SEQ (−) 1.000 0.783 caagCAATctaga
    ID
    NO:
    239
    SEQ (+) 1.000 0.765 tctctagATTGcttgtaatat
    ID
    NO:
    240
    SEQ (−) 1.000 0.997 tttaaCAATattacaag
    ID
    NO:
    241
    SEQ (+) 1.000 0.885 tattgtTAAAcatagag
    ID
    NO:
    242
    SEQ (+) 1.000 0.839 catagagagTAATaatgctat
    ID
    NO:
    243
    SEQ (−) 1.000 0.872 atagcattATTActctc
    ID
    NO:
    244
    SEQ (−) 0.826 0.843 tttatagcaTTATtactctct
    ID
    NO:
    245
    SEQ (+) 1.000 0.829 agTAATaatgctataaa
    ID
    NO:
    246
    SEQ (−) 1.000 0.906 tttaattTTTAtagcatta
    ID
    NO:
    247
    SEQ (+) 1.000 0.983 aataatgctaTAAAaattaaaaa
    ID
    NO:
    248
    SEQ (−) 0.755 0.805 tTTTAatttttatagca
    ID
    NO:
    249
    SEQ (+) 1.000 0.832 gctataaaAATTaaa
    ID
    NO:
    250
    SEQ (+) 1.000 0.991 tgctaTAAAaattaaaa
    ID
    NO:
    251
    SEQ (−) 1.000 0.989 tttTAATttttat
    ID
    NO:
    252
    SEQ (+) 1.000 0.944 taaaaATTAaaaa
    ID
    NO:
    253
    SEQ (+) 1.000 0.807 ataaaaatTAAAaataatgataa
    ID
    NO:
    254
    SEQ (+) 1.000 0.872 aataatgatAAGAaaga
    ID
    NO:
    255
    SEQ (+) 1.000 0.993 taatGATAagaaa
    ID
    NO:
    256
    SEQ (+) 1.000 0.931 gaaAGATcctata
    ID
    NO:
    257
    SEQ (+) 1.000 0.915 tacAGATgaaaat
    ID
    NO:
    258
    SEQ (+) 0.763 0.867 cagATGAaaatttag
    ID
    NO:
    259
    SEQ (+) 0.985 0.964 aaaatttaGAAAtacttta
    ID
    NO:
    260
    SEQ (−) 0.958 0.866 agcTAAAgtatttct
    ID
    NO:
    261
    SEQ (−) 1.000 0.763 TCGTcagtggtag
    ID
    NO:
    262
    SEQ (+) 1.000 0.923 taccacTGACgaaatttgtat
    ID
    NO:
    263
    SEQ (−) 1.000 0.959 tttaattCCAGacattc
    ID
    NO:
    264
    SEQ (−) 1.000 0.977 cttTAATtccaga
    ID
    NO:
    265
    SEQ (+) 1.000 0.923 ctggaATTAaaga
    ID
    NO:
    266
    SEQ (+) 1.000 0.898 ctggAATTaaagaaa
    ID
    NO:
    267
    SEQ (−) 1.000 0.915 cagTAATttcttt
    ID
    NO:
    268
    SEQ (+) 1.000 0.922 aaagaaattacTGTTcttt
    ID
    NO:
    269
    SEQ (−) 1.000 0.934 ttataTAAAgaacagta
    ID
    NO:
    270
    SEQ (−) 1.000 0.890 attataTAAAgaacagt
    ID
    NO:
    271
    SEQ (−) 0.891 0.923 tattaTATAaagaacag
    ID
    NO:
    272
    SEQ (−) 0.769 0.856 ctattattatATAAagaacag
    ID
    NO:
    273
    SEQ (+) 1.000 0.836 tttatataaTAATagactgta
    ID
    NO:
    274
    SEQ (+) 0.791 0.760 tataataATAGactgtaaaat
    ID
    NO:
    275
    SEQ (+) 1.000 0.912 gactgTAAAatggcaac
    ID
    NO:
    276
    SEQ (+) 0.750 0.817 gtaaaatgGCAActt
    ID
    NO:
    277
    SEQ (+) 1.000 0.907 ctgtaaaatgGCAActttt
    ID
    NO:
    278
    SEQ (−) 1.000 0.882 taaAAGTtgccat
    ID
    NO:
    279
    SEQ (+) 1.000 0.878 tatttgctAATTcac
    ID
    NO:
    280
    SEQ (−) 0.777 0.865 tcctgTGAAttagcaaatatt
    ID
    NO:
    281
    SEQ (+) 1.000 0.969 tgCTAAttcacag
    ID
    NO:
    282
    SEQ (−) 0.850 0.866 tcctgtgAATTagca
    ID
    NO:
    283
    SEQ (+) 0.750 0.788 ttgctaatTCACaggat
    ID
    NO:
    284
    SEQ (−) 1.000 0.973 agAAAAaatcc
    ID
    NO:
    285
    SEQ (−) 1.000 0.908 agatgTTCCaaagaaaaaa
    ID
    NO:
    286
    SEQ (−) 1.000 0.919 ttgttCAGAtgttccaa
    ID
    NO:
    287
    SEQ (+) 1.000 0.953 tgaacaAATTtccctta
    ID
    NO:
    288
    SEQ (+) 0.750 0.757 acaAATTtccctt
    ID
    NO:
    289
    SEQ (+) 1.000 0.771 tttccctTATAtgaatcac
    ID
    NO:
    290
    SEQ (−) 1.000 0.908 agtGATTcatataaggg
    ID
    NO:
    291
    SEQ (−) 1.000 0.797 gTGATtcatataa
    ID
    NO:
    292
    SEQ (−) 1.000 0.912 agtgATTCata
    ID
    NO:
    293
    SEQ (+) 0.881 0.958 ttatatgaATCActtacattt
    ID
    NO:
    294
    SEQ (+) 1.000 0.860 cTTACattttt
    ID
    NO:
    295
    SEQ (+) 0.850 0.829 gcctgttCATTtaaa
    ID
    NO:
    296
    SEQ (−) 1.000 0.832 gtttTTTAaatgaacag
    ID
    NO:
    297
    SEQ (+) 1.000 0.853 tcattTAAAaaactgca
    ID
    NO:
    298
    SEQ (+) 1.000 0.866 actgcAGGAaagttgtg
    ID
    NO:
    299
    SEQ (+) 1.000 0.891 ggaAAGTtgtgat
    ID
    NO:
    300
    SEQ (−) 1.000 1.000 ataAATCacaacttt
    ID
    NO:
    301
    SEQ (−) 1.000 0.931 cattaTAAAtcacaact
    ID
    NO:
    302
    SEQ (−) 1.000 0.933 tgcattatAAATcacaa
    ID
    NO:
    303
    SEQ (+) 1.000 0.924 gTGATttataatg
    ID
    NO:
    304
    SEQ (−) 0.866 0.827 agttgcatTATAaatcacaactt
    ID
    NO:
    305
    SEQ (+) 0.894 0.898 tgtgATTTataatgc
    ID
    NO:
    306
    SEQ (+) 1.000 0.971 tgtGATTtataatgcaa
    ID
    NO:
    307
    SEQ (+) 1.000 0.916 gtgatttaTAATgcaac
    ID
    NO:
    308
    SEQ (+) 0.884 0.891 atttaTAATgcaact
    ID
    NO:
    309
    SEQ (+) 1.000 0.861 ataATGCaactgcac
    ID
    NO:
    310
    SEQ (+) 1.000 0.910 cagtctTAAAcaatgct
    ID
    NO:
    311
    SEQ (+) 1.000 0.992 ttaaaCAATgctaacca
    ID
    NO:
    312
    SEQ (+) 1.000 0.981 actgtGTTTcagc
    ID
    NO:
    313
    SEQ (−) 1.000 0.889 gggAAGTttatgc
    ID
    NO:
    314
    SEQ (−) 1.000 0.878 tgtgTGGGaagttta
    ID
    NO:
    315
    SEQ (+) 0.763 0.826 actATGAaaacacat
    ID
    NO:
    316
    SEQ (+) 1.000 0.786 actatgaaAACAcatgc
    ID
    NO:
    317
    SEQ (+) 0.895 0.920 gaaaaCACAtgcttaaa
    ID
    NO:
    318
    SEQ (−) 0.773 0.791 cctttAAGCatgtgttttc
    ID
    NO:
    319
    SEQ (+) 1.000 0.874 cttaaaggCAAAtct
    ID
    NO:
    320
    SEQ (−) 0.858 0.782 aGGTAaagatttgcctt
    ID
    NO:
    321
    SEQ (−) 1.000 0.853 ctgaggTAAAgatttgc
    ID
    NO:
    322
    SEQ (−) 0.789 0.830 ctgAGGTaaagat
    ID
    NO:
    323
    SEQ (+) 0.766 0.820 aaatctttACCTcagttaact
    ID
    NO:
    324
    SEQ (+) 1.000 0.862 tTTACctcagt
    ID
    NO:
    325
    SEQ (−) 0.750 0.775 gaaTAGTtaactg
    ID
    NO:
    326
    SEQ (+) 1.000 0.811 aGTTAactattccatag
    ID
    NO:
    327
    SEQ (+) 0.856 0.925 agagCCATtga
    ID
    NO:
    328
    SEQ (−) 1.000 0.873 tgaacTCAAtggctc
    ID
    NO:
    329
    SEQ (−) 1.000 0.980 ctTGAActcaa
    ID
    NO:
    330
    SEQ (+) 1.000 0.854 attgagtTCAAgtgcattt
    ID
    NO:
    331
    SEQ (+) 0.782 0.813 tgagTTCAagtgcattt
    ID
    NO:
    332
    SEQ (+) 1.000 1.000 gttcAAGTgcatt
    ID
    NO:
    333
    SEQ (+) 1.000 0.928 agaAGATataatg
    ID
    NO:
    334
    SEQ (−) 0.891 0.912 atataTATAtggccata
    ID
    NO:
    335
    SEQ (+) 1.000 0.777 atggccaTATAtatatata
    ID
    NO:
    336
    SEQ (−) 1.000 0.806 atatatatatatATGGc
    ID
    NO:
    337
    SEQ (−) 0.750 0.675 CTGTgctgatatatatata
    ID
    NO:
    338
    SEQ (+) 0.750 0.827 atatataTCAGcacagt
    ID
    NO:
    339
    SEQ (+) 1.000 0.904 ataTATCagcacagt
    ID
    NO:
    340
    SEQ (+) 1.000 0.704 atcAGCAcagtggaaacagtt
    ID
    NO:
    341
    SEQ (+) 1.000 0.970 agtgGAAAcag
    ID
    NO:
    342
    SEQ (−) 1.000 0.991 taactGTTTccac
    ID
    NO:
    343
    SEQ (−) 1.000 0.798 tGTTAttaactgtttcc
    ID
    NO:
    344
    SEQ (+) 0.826 0.824 aaacagttAATAacatt
    ID
    NO:
    345
    SEQ (+) 1.000 0.749 ggaaacagtTAATaacatttt
    ID
    NO:
    346
    SEQ (−) 1.000 0.863 taTATGctaaaatgt
    ID
    NO:
    347
    SEQ (−) 0.891 0.908 tagtaTATAtgctaaaa
    ID
    NO:
    348
    SEQ (+) 1.000 0.897 gaggctGGAAgggggct
    ID
    NO:
    349
    SEQ (+) 1.000 0.787 gctggaagggggcTCAGcagtta
    ID
    NO:
    350
    SEQ (−) 0.876 0.901 attAACTgctg
    ID
    NO:
    351
    SEQ (+) 0.750 0.840 atagcacatacTATTcttc
    ID
    NO:
    352
    SEQ (+) 0.782 0.747 gtttggtttTCATcacccatg
    ID
    NO:
    353
    SEQ (−) 1.000 0.988 gaacCACCtgacatg
    ID
    NO:
    354
    SEQ (−) 1.000 0.958 tacaGATAgaaat
    ID
    NO:
    355
    SEQ (+) 1.000 0.924 gtaacCAGAtgatacga
    ID
    NO:
    356
    SEQ (−) 0.750 0.762 agGTACccaaggggact
    ID
    NO:
    357
    SEQ (−) 1.000 0.960 aggtGATAgaggt
    ID
    NO:
    358
    SEQ (−) 1.000 0.939 atagCAGGtgataga
    ID
    NO:
    359
    SEQ (+) 0.759 0.710 cacctgctattctCACCcaaaga
    ID
    NO:
    360
    SEQ (+) 1.000 0.805 aCCCAaagacacaca
    ID
    NO:
    361
    SEQ (−) 0.944 0.904 tGTATgtgagtgtgt
    ID
    NO:
    362
    SEQ (+) 1.000 0.854 tgcATGCacatagtt
    ID
    NO:
    363
    SEQ (−) 0.977 0.855 tGAACtatgtgcatg
    ID
    NO:
    364
    SEQ (+) 0.750 0.767 catagttcAAAAaataaaatttt
    ID
    NO:
    365
    SEQ (−) 1.000 0.896 ttaaaatTTTAttttttga
    ID
    NO:
    366
    SEQ (−) 0.750 0.798 taaAATTttattt
    ID
    NO:
    367
    SEQ (+) 1.000 0.991 aaagGAAAaaa
    ID
    NO:
    368
    SEQ (+) 1.000 0.977 ggAAAAaaagc
    ID
    NO:
    369
    SEQ (−) 1.000 0.946 aaaAGATttgagc
    ID
    NO:
    370
    SEQ (−) 1.000 0.901 aggaATTTt
    ID
    NO:
    371
    SEQ (+) 0.806 0.820 taaaatTCCTatgagtgtgtgat
    ID
    NO:
    372
    SEQ (−) 0.782 0.753 tactgacttTGATcacacact
    ID
    NO:
    373
    SEQ (−) 1.000 0.942 cacAGATtatacc
    ID
    NO:
    374
    SEQ (+) 1.000 0.971 tgtgGAAAaca
    ID
    NO:
    375
    SEQ (+) 0.980 0.879 ctcagtATTCaca
    ID
    NO:
    376
    SEQ (−) 1.000 0.827 ctactttCATGtgtgaata
    ID
    NO:
    377
    SEQ (−) 0.850 0.952 cttTCATgtgtga
    ID
    NO:
    378
    SEQ (+) 1.000 0.838 aagtagcTAAGaataaa
    ID
    NO:
    379
    SEQ (−) 1.000 0.960 aatAGATtttatt
    ID
    NO:
    380
    SEQ (+) 0.806 0.819 aataaaATCTattcatc
    ID
    NO:
    381
    SEQ (+) 0.785 0.846 taaaaTCTAttcatc
    ID
    NO:
    382
    SEQ (+) 1.000 0.890 atctATTCatc
    ID
    NO:
    383
    SEQ (−) 1.000 0.881 aaaaaCAGAtgaataga
    ID
    NO:
    384
    SEQ (−) 1.000 0.981 ggAAAAacaga
    ID
    NO:
    385
    SEQ (−) 1.000 0.976 taagGAAAaac
    ID
    NO:
    386
    SEQ (−) 1.000 0.872 aggattttaaGGAAaaaca
    ID
    NO:
    387
    SEQ (+) 1.000 0.897 ttcctTAAAatcctggc
    ID
    NO:
    388
    SEQ (−) 1.000 0.880 actgagtcAACActgta
    ID
    NO:
    389
    SEQ (−) 1.000 0.984 accactgaGTCAacactgtag
    ID
    NO:
    390
    SEQ (+) 0.964 0.984 agtgttgaCTCAgtggttgct
    ID
    NO:
    391
    SEQ (−) 0.826 0.904 gcaaCCACtga
    ID
    NO:
    392
    SEQ (+) 1.000 0.883 tttaaatTTTAtgctcaaa
    ID
    NO:
    393
    SEQ (+) 1.000 0.891 caaAAGTtgaagc
    ID
    NO:
    394
    SEQ (+) 1.000 0.829 tgaaCCGGtaattctac
    ID
    NO:
    395
    SEQ (−) 1.000 0.757 acaAAGTagaatt
    ID
    NO:
    396
    SEQ (−) 0.750 0.816 aagtattTAATacaaag
    ID
    NO:
    397
    SEQ (−) 1.000 0.939 acaagtattTAATacaa
    ID
    NO:
    398
    SEQ (−) 1.000 0.865 taacAAGTattta
    ID
    NO:
    399
    SEQ (+) 1.000 0.882 acttgTTATgcatcg
    ID
    NO:
    400
    SEQ (−) 1.000 0.758 aacttgatttgttgAGCGatgcataacaa
    ID
    NO:
    401
    SEQ (+) 0.750 0.809 ctcaaCAAAtcaagt
    ID
    NO:
    402
    SEQ (+) 1.000 0.976 acaAATCaagtttta
    ID
    NO:
    403
    SEQ (+) 1.000 0.830 acaaaTCAAgtttta
    ID
    NO:
    404
    SEQ (−) 0.750 0.756 taaAACTtgattt
    ID
    NO:
    405
    SEQ (+) 1.000 0.907 aaaTCAAgtttta
    ID
    NO:
    406
    SEQ (+) 1.000 0.936 caaatCAAGttttaa
    ID
    NO:
    407
    SEQ (+) 1.000 0.887 atcAAGTtttaac
    ID
    NO:
    408
    SEQ (+) 1.000 0.883 atcaagtTTTAacacacca
    ID
    NO:
    409
    SEQ (−) 1.000 0.925 ttaaaaAATTtaagata
    ID
    NO:
    410
    SEQ (+) 1.000 0.780 atttTTTAaatgggcat
    ID
    NO:
    411
    SEQ (−) 0.750 0.818 tttatgccCATTtaa
    ID
    NO:
    412
    SEQ (+) 1.000 0.796 ctaTTCCtacagaagtc
    ID
    NO:
    413
    SEQ (+) 1.000 0.860 ctgaaaATGCatt
    ID
    NO:
    414
    SEQ (+) 1.000 0.898 tgCATTcctgatt
    ID
    NO:
    415
    SEQ (−) 1.000 0.981 ataAATCaggaatgc
    ID
    NO:
    416
    SEQ (+) 1.000 0.929 cTGATttatgtaa
    ID
    NO:
    417
    SEQ (+) 1.000 0.964 cctGATTtatgtaaata
    ID
    NO:
    418
    SEQ (+) 1.000 0.861 ctgatTTATgtaaat
    ID
    NO:
    419
    SEQ (−) 1.000 0.929 tTTACataaat
    ID
    NO:
    420
    SEQ (+) 1.000 0.943 cctgatttatGTAAatatatg
    ID
    NO:
    421
    SEQ (+) 1.000 0.895 ttTATGtaaatatat
    ID
    NO:
    422
    SEQ (+) 1.000 0.940 tttatgTAAAtatatgt
    ID
    NO:
    423
    SEQ (+) 1.000 0.691 atgtaaaTATAtgtatata
    ID
    NO:
    424
    SEQ (+) 0.849 0.883 atgtatATACata
    ID
    NO:
    425
    SEQ (+) 0.888 0.755 gtatatacatatATAGc
    ID
    NO:
    426
    SEQ (−) 0.891 0.903 ggctaTATAtgtatata
    ID
    NO:
    427
    SEQ (+) 1.000 0.709 atatacaTATAtagcctta
    ID
    NO:
    428
    SEQ (−) 1.000 0.816 ttgttttTAAGgctata
    ID
    NO:
    429
    SEQ (+) 1.000 0.899 agcctTAAAaacaaaga
    ID
    NO:
    430
    SEQ (+) 1.000 0.973 aaAACAaagat
    ID
    NO:
    431
    SEQ (+) 1.000 0.863 ttaaaaaCAAAgattgt
    ID
    NO:
    432
    SEQ (+) 1.000 0.811 aagattgtAATTttt
    ID
    NO:
    433
    SEQ (−) 1.000 0.900 acaatttaTAAAaattacaatct
    ID
    NO:
    434
    SEQ (−) 1.000 0.844 tttataaaAATTaca
    ID
    NO:
    435
    SEQ (−) 1.000 0.956 atttaTAAAaattacaa
    ID
    NO:
    436
    SEQ (+) 1.000 0.875 tgTAATttttataaatt
    ID
    NO:
    437
    SEQ (−) 1.000 0.903 aatttaTAAAaattaca
    ID
    NO:
    438
    SEQ (−) 1.000 0.973 tcacaatttaTAAAaattacaat
    ID
    NO:
    439
    SEQ (−) 0.750 0.798 atcACAAtttataaaaa
    ID
    NO:
    440
    SEQ (+) 1.000 0.927 ttttaTAAAttgtgatt
    ID
    NO:
    441
    SEQ (−) 1.000 0.997 aaaAATCacaattta
    ID
    NO:
    442
    SEQ (+) 1.000 0.806 gTGATttttaaaa
    ID
    NO:
    443
    SEQ (−) 1.000 0.923 tattttttTAAAaatcacaattt
    ID
    NO:
    444
    SEQ (+) 1.000 0.990 ttgtgattttTAAAaaaataaac
    ID
    NO:
    445
    SEQ (+) 1.000 0.905 gtgattttTAAAaaaataaacct
    ID
    NO:
    446
    SEQ (−) 0.755 0.796 gGTTTatttttttaaaa
    ID
    NO:
    447
    SEQ (+) 0.750 0.775 gatttttaAAAAaataaacctgc
    ID
    NO:
    448
    SEQ (+) 1.000 0.848 aaacctgcATTAtcttc
    ID
    NO:
    449
    SEQ (−) 0.884 0.851 gaagaTAATgcaggt
    ID
    NO:
    450
    SEQ (−) 1.000 0.853 tgctgaagaTAATgcaggttt
    ID
    NO:
    451
    SEQ (−) 1.000 0.993 tgaaGATAatgca
    ID
    NO:
    452
    SEQ (+) 0.867 0.951 TGAAtgttcct
    ID
    NO:
    453
    SEQ (+) 1.000 0.893 cctAAGTtttgta
    ID
    NO:
    454
    SEQ (+) 1.000 0.772 agttttgTAGAacttga
    ID
    NO:
    455
    SEQ (−) 1.000 0.927 cgtgtCAAGttctac
    ID
    NO:
    456
    SEQ (−) 1.000 0.997 tctgccaCGTGtcaagt
    ID
    NO:
    457
    SEQ (+) 1.000 0.795 aggattTTAGtctacac
    ID
    NO:
    458
    SEQ (−) 1.000 0.981 gatgCAGGtgtagac
    ID
    NO:
    459
    SEQ (−) 1.000 0.832 ctgtcctcagatgcaGGTGta
    ID
    NO:
    460
    SEQ (+) 1.000 0.873 ctaacaGGAAaggagac
    ID
    NO:
    461
    SEQ (+) 1.000 0.829 ggagacaCATGtgtggtag
    ID
    NO:
    462
    SEQ (+) 1.000 1.000 catgtgTGGTagttc
    ID
    NO:
    463
    SEQ (+) 1.000 0.919 tgtggtagTTCCcag
    ID
    NO:
    464
    SEQ (−) 1.000 0.841 aactgGGAActac
    ID
    NO:
    465
    SEQ (−) 1.000 0.842 aaacTGGGaactacc
    ID
    NO:
    466
    SEQ (−) 0.750 0.784 ttcacgtCAAAactg
    ID
    NO:
    467
    SEQ (−) 1.000 0.830 ttcACGTcaaaac
    ID
    NO:
    468
    SEQ (+) 1.000 0.985 cagttttGACGtgaaaagtcc
    ID
    NO:
    469
    SEQ (+) 1.000 0.891 gttttgaCGTGaaaagt
    ID
    NO:
    470
    SEQ (+) 1.000 0.909 ttgACGTgaaaag
    ID
    NO:
    471
    SEQ (+) 1.000 0.890 tttgACGTgaaaagt
    ID
    NO:
    472
    SEQ (+) 1.000 0.837 ttgacgtGAAAagtc
    ID
    NO:
    473
    SEQ (+) 1.000 0.937 cattcttactGGAAacctc
    ID
    NO:
    474
    SEQ (+) 0.800 0.805 ttcttacTGGAaacctc
    ID
    NO:
    475
    SEQ (+) 0.782 0.791 acctCCCTgaatccatgccaagc
    ID
    NO:
    476
    SEQ (−) 1.000 0.964 gctTGGCatggattcaggg
    ID
    NO:
    477
    SEQ (+) 1.000 0.820 tccATGCcaagcact
    ID
    NO:
    478
    SEQ (+) 1.000 0.787 gCCAAgcactacccatcaccttgac
    ID
    NO:
    479
    SEQ (−) 1.000 0.954 cagtCAAGgtgat
    ID
    NO:
    480
    SEQ (−) 1.000 0.800 ctTATGccagtcaag
    ID
    NO:
    481
    SEQ (−) 1.000 0.862 agtgcTTATgccagt
    ID
    NO:
    482
    SEQ (−) 1.000 0.920 atcaaAGGAaatgagtg
    ID
    NO:
    483
    SEQ (−) 1.000 0.969 ggggcatCAAAggaaat
    ID
    NO:
    484
    SEQ (−) 1.000 0.912 gaggGAGGggcat
    ID
    NO:
    485
    SEQ (−) 0.876 0.920 tgagGGAGgggcatc
    ID
    NO:
    486
    SEQ (+) 1.000 0.973 tattaTAAAagcacagt
    ID
    NO:
    487
    SEQ (−) 1.000 0.700 gaaagagacgaCTGTgctt
    ID
    NO:
    488

Claims (13)

We claim:
1. A method for identifying agents which modulate INGAP expression comprising:
contacting a host cell comprising a reporter construct having at least one INGAP regulatory region selected from nucleotides from the group consisting of SEQ ID NO: 1, 2, 23, 32, 35, 37, 28, 24, 25, 26, 27, 29, 30, 31, 33, 34, 38, and 36 and a region encoding a detectable product with a test agent;
determining expression of the detectable product in the cell; and identifying the test agent as a modulator of INGAP expression if the test agent modulates expression of the detectable product in the cell.
2. The method of claim 5 wherein the regulatory sequence comprises nucleotides 1-3137 of SEQ ID NO: 2.
3. The method of claim 1 wherein the reporter construct further comprises: a promoter element interposed between the regulatory region nucleotide sequence and the nucleotide sequence encoding the detectable product.
4. The method of claim 3 wherein the promoter element is selected from SEQ ID NO: 2.
5. An in vitro method for identifying agents which modulate INGAP expression, comprising: a. contacting a reporter construct having at least one INGAP regulatory region and a nucleotide sequence encoding a detectable product with a test substance under conditions sufficient for transcription and translation of said nucleotide sequence; determining expression of the detectable protein or nucleic acid product; and identifying the test substance as a modulator of INGAP expression if the test substance modulates expression of the detectable product.
6. The method of claim 5 wherein the regulatory region nucleotide sequence comprises of one or more regions chosen from nucleotides from the group consisting of SEQ ID NO: 1, -2, 23, 32, 35, 37, 28, 24, 25, 26, 27, 29, 30, 31, 33, 34, 38, and 36.
7. The method of claim 6 wherein the regulatory sequence comprises nucleotides 1-3137 of SEQ ID NO: 2.
8. The method of claim 5 wherein the reporter construct further comprises: a promoter element interposed between the regulatory region nucleotide sequence and the nucleotide sequence encoding the detectable product.
9. The method of claim 8 wherein the promoter element is selected from SEQ ID NO: 2.
10. A method for inducing INGAP expression in a mammal in need thereof, comprising administering to the mammal an effective amount of a factor that stimulates INGAP expression in the said mammal.
11. The method of claim 10 wherein the factor that stimulates INGAP expression was identified by:
contacting a host cell comprising a reporter construct having at least one INGAP regulatory region and a region encoding a detectable product with a test agent;
determining expression of the detectable product in the cell; and
identifying the test agent as a modulator of INGAP expression if the test agent modulates expression of the detectable product in the cell; wherein the regulatory region nucleotide sequence comprises of one or more regions chosen from nucleotides from the group consisting of SEQ ID NO: 1, -2, 23, 32, 35, 37, 28, 24, 25, 26, 27, 29, 30, 31, 33, 34, 38, and 36.
12. The method of claim 11 wherein the factor that stimulates INGAP expression was identified by:
contacting the SEQ ID NOS: 1, -2, 23, 32, 35, 37, 28, 24, 25, 26, 27, 29, 30, 31, 33, 34, 38, and 36 or fragments thereof with a test agent;
determining binding of the test agent to the nucleic acid; and
identifying the test agent as a potential modulator of INGAP expression if the test agent binds to the nucleic acid
15. The method of claim 10 wherein the factor that stimulates INGAP expression is selected from hLIF or PMA.
US13/195,596 2002-01-11 2011-08-01 Assay for the detection of factors that modulate the expression of ingap Abandoned US20120058078A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/195,596 US20120058078A1 (en) 2002-01-11 2011-08-01 Assay for the detection of factors that modulate the expression of ingap

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US34689802P 2002-01-11 2002-01-11
US36107302P 2002-03-01 2002-03-01
US38831502P 2002-06-14 2002-06-14
US10/339,767 US7355024B2 (en) 2002-01-11 2003-01-09 Regulatory sequences for modulation of INGAP expression and reporter constructs
US12/062,740 US20090156458A1 (en) 2002-01-11 2008-04-04 Assay for the detection of factors that modulate the expression of inagp
US13/195,596 US20120058078A1 (en) 2002-01-11 2011-08-01 Assay for the detection of factors that modulate the expression of ingap

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US12/062,740 Continuation US20090156458A1 (en) 2002-01-11 2008-04-04 Assay for the detection of factors that modulate the expression of inagp

Publications (1)

Publication Number Publication Date
US20120058078A1 true US20120058078A1 (en) 2012-03-08

Family

ID=27407757

Family Applications (3)

Application Number Title Priority Date Filing Date
US10/339,767 Expired - Fee Related US7355024B2 (en) 2002-01-11 2003-01-09 Regulatory sequences for modulation of INGAP expression and reporter constructs
US12/062,740 Abandoned US20090156458A1 (en) 2002-01-11 2008-04-04 Assay for the detection of factors that modulate the expression of inagp
US13/195,596 Abandoned US20120058078A1 (en) 2002-01-11 2011-08-01 Assay for the detection of factors that modulate the expression of ingap

Family Applications Before (2)

Application Number Title Priority Date Filing Date
US10/339,767 Expired - Fee Related US7355024B2 (en) 2002-01-11 2003-01-09 Regulatory sequences for modulation of INGAP expression and reporter constructs
US12/062,740 Abandoned US20090156458A1 (en) 2002-01-11 2008-04-04 Assay for the detection of factors that modulate the expression of inagp

Country Status (13)

Country Link
US (3) US7355024B2 (en)
EP (1) EP1463517A4 (en)
JP (1) JP2005514930A (en)
KR (2) KR20070043057A (en)
CN (1) CN1615150A (en)
BR (1) BR0306784A (en)
CA (1) CA2470359A1 (en)
IL (1) IL162493A0 (en)
MA (1) MA27241A1 (en)
MX (1) MXPA04006708A (en)
NO (1) NO20043330L (en)
PL (1) PL371485A1 (en)
WO (1) WO2003060096A2 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103305457B (en) * 2013-06-06 2015-07-08 浙江省医学科学院 Method for expanding pancreatic beta cell in vitro
CN118146379A (en) * 2016-04-26 2024-06-07 科济生物医药(上海)有限公司 Method for improving immune response cell function
CN110906869A (en) * 2019-11-22 2020-03-24 深圳市太赫兹科技创新研究院有限公司 OCT probe adjusting device

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5840531A (en) * 1995-02-22 1998-11-24 Mogill University Ingap protein involved in pancreatic islet neogenesis

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5834590A (en) * 1995-02-22 1998-11-10 Eastern Virginia Medical School Of The Medical College Of Hampton Roads Ingap protein involved in pancreatic islet neogenesis
US6090620A (en) * 1995-12-29 2000-07-18 University Of Washington Genes and gene products related to Werner's syndrome

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5840531A (en) * 1995-02-22 1998-11-24 Mogill University Ingap protein involved in pancreatic islet neogenesis

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Del Zotto et al. Possible relationship between changes in islet neogenesis and islet neogenesis-associated protein-postivie cell mass induced by sucrose administration to normal hamsters. Journal of Endocrinology, Vol. 165, pages 725-733, 2000. *
Rafaeloff et al. Cloning and sequencing of the pancreatic islet neogenesis associated protein (INGAP) gene and its expression in islet neogenesis in hamsters. Journal of Clinical Investigation, Vol. 99, No. 9, pages 2100-2109, May 1997. *

Also Published As

Publication number Publication date
WO2003060096A2 (en) 2003-07-24
WO2003060096A3 (en) 2003-12-04
US7355024B2 (en) 2008-04-08
PL371485A1 (en) 2005-06-13
MXPA04006708A (en) 2005-03-31
CN1615150A (en) 2005-05-11
JP2005514930A (en) 2005-05-26
KR20040072700A (en) 2004-08-18
AU2003202273A1 (en) 2003-07-30
NO20043330L (en) 2004-10-05
IL162493A0 (en) 2005-11-20
MA27241A1 (en) 2005-03-01
EP1463517A2 (en) 2004-10-06
KR100766759B1 (en) 2007-10-15
EP1463517A4 (en) 2006-01-04
KR20070043057A (en) 2007-04-24
US20030207301A1 (en) 2003-11-06
BR0306784A (en) 2005-04-26
US20090156458A1 (en) 2009-06-18
CA2470359A1 (en) 2003-07-24

Similar Documents

Publication Publication Date Title
Fang et al. The early growth response gene Egr2 (Alias Krox20) is a novel transcriptional target of transforming growth factor-β that is up-regulated in systemic sclerosis and mediates profibrotic responses
De Santa Barbara et al. Direct interaction of SRY-related protein SOX9 and steroidogenic factor 1 regulates transcription of the human anti-Mullerian hormone gene
Michaud et al. ARNT2 acts as the dimerization partner of SIM1 for the development of the hypothalamus
Furusawa et al. Identification of CtBP1 and CtBP2 as corepressors of zinc finger-homeodomain factor δEF1
Annicotte et al. Pancreatic-duodenal homeobox 1 regulates expression of liver receptor homolog 1 during pancreas development
Western et al. Temperature-dependent sex determination in the American alligator: expression of SF1, WT1 and DAX1 during gonadogenesis
Robert et al. Friend of GATA (FOG)-1 and FOG-2 differentially repress the GATA-dependent activity of multiple gonadal promoters
Tanaka et al. A zinc finger transcription factor, αA-crystallin binding protein 1, is a negative regulator of the chondrocyte-specific enhancer of the α1 (II) collagen gene
Fogel et al. A statistical analysis of the TRANSFAC database
US20120058078A1 (en) Assay for the detection of factors that modulate the expression of ingap
US7625874B2 (en) Compositions and methods for treating diseases associated with T-box and N-Myc
Goto et al. Sp1 Family of Transcription Factors Regulates the Human α2 (XI) Collagen Gene (COL11A2) in Saos‐2 Osteoblastic Cells
Haines et al. The NLRR gene family and mouse development: Modified differential display PCR identifies NLRR-1 as a gene expressed in early somitic myoblasts
Mizusawa et al. Differentiation phenotypes of pancreatic islet β-and α-cells are closely related with homeotic genes and a group of differentially expressed genes
Yuan et al. Pax-2 interacts with RB and reverses its repression on the promoter of Rig-1, a Robo member
US20070105101A1 (en) Gene expression associated with osteobla st differentiation
Braquart-Varnier et al. A subtractive approach to characterize genes with regionalized expression in the gliogenic ventral neuroepithelium: identification of chick sulfatase 1 as a new oligodendrocyte lineage gene
Supakar et al. Role of transcription factors in the age-dependent regulation of the androgen receptor gene in rat liver
ZA200404784B (en) Assay for the detection of factors that modulate the expression of ingap.
Aniello et al. First evidence of prothymosin α in a non-mammalian vertebrate and its involvement in the spermatogenesis of the frog Rana esculenta
Tanaka Transcriptional regulation of the hypocretin/orexin gene
Yoshida et al. Molecular cloning of rat and porcine retina-derived POU domain factor 1 (POU6F2) from a pituitary cDNA library
AU2007203333A1 (en) Assay for the detection of factors that modulate the expression of INGAP
Gareiß et al. Emerin expression in early development of Xenopus laevis
AU2003202273B2 (en) Assay for the detection of factors that modulate the expression of INGAP

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION