AU2007240161A1 - Nucleic acid generating the ABCA7 gene, molecular modulating its activity and therapeutic applications - Google Patents

Nucleic acid generating the ABCA7 gene, molecular modulating its activity and therapeutic applications Download PDF

Info

Publication number
AU2007240161A1
AU2007240161A1 AU2007240161A AU2007240161A AU2007240161A1 AU 2007240161 A1 AU2007240161 A1 AU 2007240161A1 AU 2007240161 A AU2007240161 A AU 2007240161A AU 2007240161 A AU2007240161 A AU 2007240161A AU 2007240161 A1 AU2007240161 A1 AU 2007240161A1
Authority
AU
Australia
Prior art keywords
nucleic acid
transcription
abca7
polynucleotide
sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU2007240161A
Inventor
Isabelle Arnould-Reguigne
Giovanna Chimini
Patrice Denefle
Nicolas Duverger
Jose Osorio Y Fortea
Catherine Prades
Marie-Francoise Rosier
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institut National de la Sante et de la Recherche Medicale INSERM
Aventis Pharma SA
Original Assignee
Institut National de la Sante et de la Recherche Medicale INSERM
Aventis Pharma SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institut National de la Sante et de la Recherche Medicale INSERM, Aventis Pharma SA filed Critical Institut National de la Sante et de la Recherche Medicale INSERM
Publication of AU2007240161A1 publication Critical patent/AU2007240161A1/en
Abandoned legal-status Critical Current

Links

Landscapes

  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Description

P001 Section 29 Regulation 3.2(2)
\O
AUSTRALIA
Patents Act 1990 COMPLETE SPECIFICATION STANDARD PATENT Application Number: Lodged: Invention Title: Nucleic acid for regulating the ABCA7 gene, molecules modulating its activity and therapeutic applications The following statement is a full description of this invention, including the best method of performing it known to us: P111AHAU/1107 1 NUCLEIC ACID FOR REGULATING THE ABCA7 GENE, MOLECULES
U
SMODULATING ITS ACTIVITY AND THERAPEUTIC APPLICATIONS The present invention relates to a nucleic 5 acid capable of regulating the transcription of the ABCA7 gene, which is a gene which may be involved in c the metabolism of lipids in the hematopoietic tissues, Sas well as in cell signaling mechanisms linked to the
(N
immune reaction and to inflammation.
The present invention also describes polypeptides and polynucleotides, the impairment of whose sequence or expression is potentially involved in diseases associated with the genetic locus q13 of chromosome 19.
The present invention also relates to nucleotide constructs comprising a polynucleotide encoding a polypeptide or producing a nucleic acid of interest, placed under the control of a nucleic acid for regulating the human or murine ABCA7 gene.
The invention also relates to recombinant vectors, transformed host cells, and nonhuman transgenic mammals comprising a nucleic acid for regulating the transcription of the human and mouse ABCA7 gene or an abovementioned nucleotide construct, as well as methods for screening molecules or substances capable of modulating the activity of the nucleic acid for regulating the ABCA7 gene.
The invention in addition relates to methods 2 which make it possible to detect an impairment of the
U
Stranscription of the ABCA7 gene and thus to diagnose a \O possible dysfunction in lipid metabolism at the level of hematopoietic tissues and in the cell signaling \0 5 mechanisms of immunity.
Its subject is also substances or molecules c modulating the activity of the nucleic acid for regulating the transcription of the ABCA7 gene as well as pharmaceutical compositions containing such substances or such molecules.
The ABC (ATP-Binding Cassette) transport proteins constitute a superfamily which is extremely well conserved during evolution, from bacteria to humans. These proteins are involved in membrane transport of various substrates, for example ions, amino acids, peptides, sugars, vitamins or steroid hormones (Higgins et al., Annu Rev. Cell Biol, 8, (1992) 67-113).
The characterization of the complete amino acid sequence of some ABC transporters has made it possible to define a common general structure comprising in particular two nucleotide binding folds (NBF) with Walker A and B type units as well as two transmembrane domains, each of the transmembrane domains consisting of six helices (Klein et al., BBA, 1461 (1999), 237-262). The specificity of the ABC transporters for the various transported molecules appears to be determined by the structure of the 3 transmembrane domains, whereas the energy necessary for
O
Sthe transport activity is provided by the degradation INO of ATP at the level of the NBF fold (Dean et al., Curr.
Opin. Genet. Dev, 5 (1995) 779-785).
5 Several ABC transport proteins have been 0 identified in humans and a number of them have been associated with various diseases.
SFor example, cystic fibrosis is caused by mutations in the CFTR (cystic fibrosis transmembrane conductance regulator) gene, also designated ABCC7, Moreover, some multidrug resistance phenotypes in tumor cells have been associated with mutations in the genes encoding MDR (multidrug resistance) proteins, also designated ABCB, which also have an ABC transporter structure.
Other ABC transporters have been associated with neuronal and tumor conditions (patent US No. 5,858,719) or are potentially involved in diseases caused by impairment of the homeostasis of metals, in particular the ABC-3 protein.
Likewise, another ABC transporter, designated PFIC2 or ABCB11, appears to be involved in a form of progressive familial intrahepatic cholestasia, this protein being potentially responsible, in humans, for the export of bile salts.
A subfamily A of ABC transporters, designated ABCA, has also been identified. It is characterized by the presence of a highly hydrophobic segment (HH1: 4 highly hydrophobic) between the two transmembrane domains, bound to the two NBF units (Broccardo et al., IN BBA 1461 (1999) 395-404). Four members of this subfamily have so far been characterized. They are the M\ 5 transporters ABCA1 and ABCA2, both located on 0 chromosome 9, at the loci 9q22-9q31 and 9q34, respectively, as well as the transporter ABCA3 located Son chromosome 16p13.3, and finally the transporter
(N
ABCA4 or ABCR located on chromosome 1p22 (Broccardo et al., 1999). The members of this subfamily are also highly conserved during evolution of multicellular eukaryotes. By way of examples, the transporters ABCA1 and ABCA4, which are the best known, exhibit 95% and 88% identity, respectively, with their murine orthologs. Members of this subfamily are in addition closely related since, for example, the transporters ABCAl and ABCA4 exhibit a protein sequence identity of 50.9%, as well as a very similar genomic organization (Allikmets et al., Nat. Genet. (1997) 15, 236-246; Broccardo et al., Biochim. Biophys. Acta (1999) 1461, 395-404; Luciani et al., Genomics (1994) 21(1), 150-9; Remaley et al., Proc. Natl. Acad. Sci. USA (1999) 96(22), 12685-90).
Moreover, members of the subfamily A appear to exhibit a similar functional specialization at the level of the transport of membrane lipids and phospholipids. It has indeed been shown that the loss of the function of these transporters affect the renewal of the phospholipids of the cell membrane
O
bilayer. In the case of ABCA4, there is observed, in a \D first instance, a normal renewal of phosphatidylethanolamine (PE) in the rod cell of the membrane 5 portion, which leads, via a succession of events, to a 0 total loss of visual acuity (Weng et al., Cell (1999) S98(1), 13-23. In the case of ABCA1, an abnormal Sdistribution of the membrane phospholipids in plasma membrane layers is observed, which results more precisely in the presence of a larger quantity of phosphatidylserine in the outer layer, and in a disruption of the Ca2' concentration.
The transporters ABCAl and ABCA4 have been particularly studied. The ABCAl gene indeed appears to be involved in pathologies linked to a cholesterol metabolism dysfunction which induces diseases such as atherosclerosis, or familial HDL deficiencies (FHD) such a Tangier disease (FR 99/7684000; Rust et al., Nat. Genet., 22 (1999) 352-355; Brooks-Wilson et al., Nat. Genet., 22 (1999) 336-345; Bodzioch et al., Nat.
Genet. 22 (1999) 347-351; Orso et al., Nat. Genet, 24 (2000) 192-196). Tangier disease would appear to be linked to a cellular defect in the translocation of cellular cholesterol which causes degradation of the HDLs, and thereby a disruption in lipoprotein metabolism. Thus, it would appear that the HDL particles which do not incorporate cholesterol from peripheral cells are not metabolized correctly but are 6 on the contrary rapidly eliminated from the body. The
O
Splasma HDL concentration in these patients is therefore ND extremely reduced and the HDLs no longer ensure the return of cholesterol to the liver. This cholesterol 5 accumulates in these peripheral cells and causes 0 characteristic clinical manifestations such as the c formation of orange-colored tonsils. Furthermore, other Slipoprotein disruptions such as overproduction of
(N
triglycerides as well as increased intracellular synthesis and catabolism of phospholipids are observed.
The ABCA4 transporter has moreover been associated with degenerative and inflammatory eye diseases such as Stargardt's recessive disease (Allikmets et al., 1997) and degeneration of the macular region of the retina linked to age (AMD) (Allikmets et al., Nat. Genet. 15 (1997) 236-246; Allikmets et al., Science, 277 (1997) 1805-1807; Cremers et al., Hum. Mol. Genet. (1998), 355-62; Martinez-Mir et al., Nat. Genet. 18 (1998) 11-12; Weng et al., Cell (1999) 98(1), 13-23).
In humans, a cDNA comprising the entire open reading frame of a new member of the A subfamily of ABC (ATP-Binding Cassette) transporters was recently cloned from human macrophage RNA, and designated ABCA7 (Kaminski et al., BBR, 273(2000), 532-538).
The characterization of the complete amino acid sequence of ABCA7 indicates that the protein product has the general structure characteristic of 7 ABCA transporters in that it comprises the symmetrical Sstructure comprising the two transmembrane domains and two NBF units. In addition to these characteristic units the ABCA7 protein has other units which were 5 recently identified as being characteristic of the ABCA transporters, namely the HH1 region and the hot spot region (Broccardo et al., Biochim. Biophys. Acta (1999) 0 1461, 395-404).
(N
Like the other members of the A subfamily of ABC transporters, the sequence of the ABCA7 protein is highly conserved in mice and in humans, with an interspecies identity of 79%. The ABCA7 protein exhibits furthermore an intron-exon organization characteristic of the members of the ABCA subfamily, as well as a high sequence homology in particular with the human transporters ABCA1 and ABCA4, of 54% and 49%, respectively.
Moreover, the protein transporter ABCA7 appears to exhibit a regulatory profile dependent on the flows of sterol, similar to that of the other members of the A subfamily, and in particular the ABCA1 transporter (Langman et al., BBR Com; 257(1999), 29-33; Laucken et al., PNAS, 97(2000) 817-822). There has indeed been observed by Kaminski et al. (supra) an increase in the expression of ABCA7 after incubation of human macrophages in the presence of acetylated lowdensity lipoproteins (AcLDL) which induce a sterol load, as well as a decrease in expression in the presence of the HDL3 cholesterol acceptor which causes Sa decrease in the sterol load.
O Moreover, ABCA7 exhibits, like the other ABCA members, a degree of specialization of its tissue O 5 expression, the ABCA7 messenger being predominantly present in the hematopoietic tissues consisting of the c lymphocytes, granulocytes, thymus, spleen, bone marrow or fetal tissues, whereas the expression of ABCA1 is predominant in the macrophages and the placenta, and that of ABCA4 is restricted in the retina (Rust et al., Nat. Genet, 22, (1999) 352-355) All the data disclosed above, relating to the identity of the protein sequences, to the regulatory mechanism and the specificity of expression suggests that the ABCA7 gene constitutes another transporter of the A subfamily, and that it has a similar, or even redundant, function to that of the other transporters and in particular to that of the ABCAl transporter.
This transporter could therefore presumably act as mediator in the metabolism of lipids, and it is highly possible that it is, in the same way as the ABCA1 transporter, responsible for certain metabolic dysfunctions or deficiencies. Moreover, the specialization of the expression of the ABCA7 transporter presumably indicates that the latter plays a role in the transmembrane transport (export) of lipids in the hematopoietic tissues, and possibly in the lymphocyte signaling mechanisms of immunity, for 9 example in the case of the pathogenesis of Satherosclerosis as indicated by Kaminski et al. (Supra) ND Although the expression of the human ABCA7 gene appears to be regulated according to the type of 5 cell or the metabolic situation of a given cell type, the sequence(s) making it possible to regulate this gene were not known.
However, a need exists in the state of the art to identify these regulatory sequences for the following reasons: a) These sequences are capable of being mutated in patients suffering from a pathology linked to a defect in the transport of lipids, possible substrates of the ABCA7 protein, or in patients who are likely to develop such pathologies.
The characterization of the regulatory sequences of the human ABCA7 gene would make it possible to detect mutations in patients, in particular to diagnose the individuals belonging to at-risk family groups. In addition, the isolation of these regulatory sequences would allow the complementation of the mutated sequence with a functional sequence capable of compensating for the metabolic dysfunctions induced by the mutation(s) diagnosed, by virtue of the construction of targeted therapeutic means, such as means intended for gene therapy.
b) The characterization of the regulatory sequences of the ABCA7 gene would make available to 010 persons skilled in the art means capable of allowing Sthe construction, by genetic engineering, and then the IN expression of defined genes in the cell types in which the ABCA7 gene is preferably expressed.
0 5 c) Moreover, some parts of the regulatory sequences of the ABCA7 gene could constitute constitutive promoter sequences with a high level of Sexpression, of the type which will allow the
(N
construction of novel means for the expression of defined sequences in the cells, supplementing a range of means which already exist.
It has to be noted that despite the efforts undertaken, the regulatory sequences of the ABCA7 gene had so far remained completely unknown.
The inventors have now isolated and then analyzed a human genomic DNA of 33.5 kb comprising the 46 exons of the open reading frame of the ABCA7 gene as well as the nontranscribed region of about 1.1 kb located on the 5' side of exon 1, upstream of the transcriptional site and comprising signals for regulating the human ABCA7 gene.
The inventors have also isolated and then analyzed a murine genomic DNA of 20Kb comprising the exons of the open reading frame of the ABCA7 gene as well as the nontranscribed region of about 1.2Kb in mice located on the 5' side of exon 1, upstream of the transcription site and comprising signals for regulating the murine ABCA7 gene.
11
U
GENERAL DEFINITIONS The term "isolated" for the purposes of the 5 present invention designates a biological material (nucleic acid or protein) which has been removed from its original environment (the environment in which it is naturally present).
For example, a polynucleotide present in the natural state in a plant or an animal is not isolated.
The same polynucleotide separated from the adjacent nucleic acids in which it is naturally inserted in the genome of the plant or animal is considered as being "isolated".
Such a polynucleotide may be included in a vector and/or such a polynucleotide may be included in a composition and remain nevertheless in the isolated state because of the fact that the vector or the composition does not constitute its natural environment.
The term "purified" does not require the material to be present in a form of absolute purity, exclusive of the presence of other compounds. It is rather a relative definition.
A polynucleotide is in the "purified" state after purification of the starting material or of the natural material by at least one order of magnitude, 12 preferably 2 or 3 and preferably 4 or 5 orders of magnitude.
NO For the purposes of the present description, the expression "nucleotide sequence" may be used to 5 designate either a polynucleotide or a nucleic acid.
The expression "nucleotide sequence" covers the genetic material itself and is therefore not restricted to the 0 information relating to its sequence.
(N
The terms "nucleic acid", "polynucleotide", "oligonucleotide" or "nucleotide sequence" cover RNA, DNA or cDNA sequences or alternatively RNA/DNA hybrid sequences of more than one nucleotide, either in the single-chain form or in the duplex form.
The term "nucleotide" designates both the natural nucleotides T, G, C) as well as the modified nucleotides which comprise at least one modification such as an analog of a purine, an analog of a pyrimidine, or an analogous sugar, examples of such modified nucleotides being described, for example, in the PCT application No. WO 95/04 064.
For the purposes of the present invention, a first polynucleotide is considered as being "complementary" to a second polynucleotide when each base of the first nucleotide is paired with the complementary base of the second polynucleotide whose orientation is reversed. The complementary bases are A and T (or A and or C and G.
13 "Variant" of a nucleic acid according to the invention will be understood to mean a nucleic acid IND which differs by one or more bases relative to the reference polynucleotide. A variant nucleic acid may be 5 of natural origin, such as an allelic variant which exists naturally, or may also be a nonnatural variant obtained, for example, by mutagenic techniques.
In general, the differences between the reference nucleic acid and the variant nucleic acid are small such that the nucleotide sequences of the reference nucleic acid and of the variant nucleic acid are very similar and, in many regions, identical. The nucleotide modifications present in a variant nucleic acid may be silent, which means that they do not alter the amino acid sequences encoded by said variant nucleic acid.
However, the changes in nucleotides in a variant nucleic acid may also result in substitutions, additions or deletions in the polypeptide encoded by the variant nucleic acid in relation to the peptides encoded by the reference nucleic acid. In addition, such nucleotide modifications in the coding regions may produce conservative or nonconservative substitutions in the amino acid sequence.
Preferably, the variant nucleic acids according to the invention encode polypeptides which substantially conserve the same function or biological activity as the polypeptide of the reference nucleic acid or alternatively the capacity to be recognized by
U
Santibodies directed against the polypeptides encoded by INO the initial nucleic acid.
Some variant nucleic acids will thus encode 5 mutated forms of the polypeptides whose systematic
INO
study will make it possible to deduce structureactivity relationships of the proteins in question.
Knowledge of these variants in relation to the disease studied is essential since it makes it possible to understand the molecular cause of the pathology.
"Fragment" will be understood to mean a reference nucleic acid according to the invention, a nucleotide sequence of reduced length relative to the reference nucleic acid and comprising, over the common portion, a nucleotide sequence identical to the reference nucleic acid.
Such a nucleic acid "fragment" according to the invention may be, where appropriate, included in a larger polynucleotide of which it is a constituent.
Such fragments comprise, or alternatively consist of, oligonucleotides ranging in length from to 25, 30, 40, 50, 70, 80, 100, 200, 500, 1 000 or 1 500 consecutive nucleotides of a nucleic acid according to the invention.
"Biologically active fragment" of an acid for regulating transcription according to the invention is understood to mean a nucleic acid capable of modulating the transcription of a DNA sequence placed under its lb control. Such a biologically active fragment comprises
O
Sa core promoter and/or a regulatory element, as defined INO in the present description.
"Regulatory nucleic acid" according to the 5 invention is understood to mean a nucleic acid which
INO
activates and/or regulates the expression of a DNA sequence selected and placed under its control.
"Promoter" is understood to mean a DNA sequence recognized by the proteins of the cell which are involved in the initiation of the transcription of a gene. The core promoter is the minimum regulatory nucleic acid capable of initiating the transcription of a defined DNA sequence which is placed under its control. In general, the core promoter consists of a genomic DNA region upstream of the site for initiation of transcription where there is very often present a CAAT sequence (where one or more transcriptional protein factors bind) as well as, except in rare cases such as in some housekeeping genes, the TATA or "TATA box" sequence or a related box. It is at the level of this box that RNA polymerase binds as well as one or more transcription factors, such as the "TATA" box binding proteins (TBPs).
A nucleotide sequence is "placed under the control" of a regulatory nucleic acid when this regulatory nucleic is located, relative to the nucleotide sequence, in such a manner as to control the initiation of transcription of the nucleotide sequence 16 by an RNA polymerase.
"Regulatory element" or "regulatory sequence" IN for the purposes of the invention, is understood to mean a nucleic acid comprising elements capable of IN 5 modulating transcription initiated by a core promoter, such as binding sites for various transcription factors, "enhancer" sequences for increasing Ctranscription or "silencer" sequences for inhibiting transcription.
"Enhancer" sequence is understood to mean a DNA sequence included in a regulatory nucleic acid capable of increasing or stimulating transcription initiated by a core promoter.
"Silencer" sequence is understood to mean a DNA sequence included in a regulatory acid capable of decreasing or inhibiting transcription initiated by a core promoter.
Regulatory elements may be present outside the sequence located on the 5' side of the site for initiation of transcription, for example in the introns and exons, including in the coding sequences.
The core promoter and the regulatory element may be "specific to one or more tissues" if they allow transcription of a defined DNA sequence placed under their control, preferably in certain cells (for example tissue-specific cells), that is to say either exclusively in the cells of certain tissues, or at different levels of transcription according to the 17 tissues.
"Transcription factor" is understood to mean N proteins which preferably interact with elements for regulating a regulatory nucleic acid according to the invention, and which stimulate or on the contrary repress transcription. Some transcription factors are active in the form of monomers, others being active in Sthe form of homo- or heterodimers.
The term "modulation" refers to either a positive regulation (increase, stimulation) of transcription, or a negative regulation (decrease, inhibition, blockage) of transcription.
The "percentage identity" between two nucleotide or amino acid sequences, for the purposes of the present invention, may be determined by comparing two sequences aligned optimally, through a window for comparison.
The portion of the nucleotide or polypeptide sequence in the window for comparison may thus comprise additions or deletions (for example "gaps") relative to the reference sequence (which does not comprise these additions or these deletions) so as to obtain an optimum alignment of the two sequences.
The percentage is calculated by determining the number of positions at which an identical nucleic base or an identical amino acid residue is observed for the two sequences (nucleic or peptide) compared, and then by dividing the number of positions at which there is identity between the two bases or amino acid
U
Sresidues by the total number of positions in the window INO for comparison, and then multiplying the result by 100 in order to obtain the percentage sequence identity.
5 The optimum sequence alignment for the comparison may be achieved using a computer with the aid of known algorithms contained in the package from the company WISCONSIN GENETICS SOFTWARE PACKAGE, GENETICS COMPUTER GROUP (GCG), 575 Science Doctor Madison, WISCONSIN.
By way of illustration, it will be possible to produce the percentage sequence identity with the aid of the BLAST software (versions BLAST 1.4.9 of March 1996, BLAST 2.0.4 of February 1998 and BLAST 2.0.6 of September 1998), using exclusively the default parameters (Altschul et al, J. Mol. Biol. (1990) 215: 403-410; Altschul et al, Nucleic Acids Res. (1997) 3389-3402). Blast searches for sequences similar/homologous to a reference "request" sequence, with the aid of the Altschul et al. (Supra) algorithm.
The request sequence and the databases used may be of the peptide or nucleic type, any combination being possible.
"High stringency hybridization conditions" for the purposes of the present invention will be understood to mean the following conditions: 1- Membrane competition and PREHYBRIDIZATION:
\O
19 Mix: 40 pl salmon sperm DNA (10 mg/ml) 40 p1 human placental DNA (10 mg/ml) 5 Denature for 5 min at 960C, then immerse the mixture in ice.
Remove the 2X SSC buffer and pour 4 ml of formamide mix into the hybridization tube containing the membranes.
Add the mixture of the two denatured DNAs.
Incubation at 420C for 5 to 6 hours, with rotation.
2- Labeled probe competition: Add to the labeled and purified probe 10 to 50 ul Cot I DNA, depending on the quantity of nonspecific hybridizations.
Denature for 7 to 10 min at 950C.
Incubate at 65 0 C for 2 to 5 hours.
3- HYBRIDIZATION: Remove the prehybridization mix.
U
Mix 40 41 salmon sperm DNA 40 41 human placental INO DNA; denature for 5 min at 960C, then immerse in ice.
5 Add to the hybridization tube 4 ml of formamide mix, the mixture of the two DNAs and the denatured labeled probe/Cot I DNA Incubate 15 to 20 hours at 420C, with rotation.
4- Washes: One wash at room temperature in 2X SSC, to rinse.
Twice 5 minutes at room temperature 2X SSC and 0.1%
SDS.
Twice 15 minutes 0.1X SSC and 0.1% SDS at 650C.
Envelope the membranes in Saran and expose.
The hybridization conditions described above are adapted to hybridization, under high stringency conditions, of a molecule of nucleic acid of varying length from 20 nucleotides to several hundreds of nucleotides.
It goes without saying that the hybridization conditions described above may be adjusted as a function of the length of the nucleic acid whose hybridization is sought or of the type of labeling 21 chosen, according to techniques known to persons skilled in the art.
ND Suitable hybridization conditions may for example be adjusted to the teaching contained in the \0 5 manual by HAMES and HIGGINS (1985) (Nucleic acid Hybridization iapractical Approach, Hames and Higgins Ed., IRL Press, Oxford) or in the manual by F. AUSUBEL et al (1999) (Currents Protocols in Molecular Biology,
(N
Green Publishing Associates and Wiley Interscience,
N.Y).
"Transformation" for the purposes of the invention is understood to mean the introduction of a nucleic acid (or of a recombinant vector) into a host cell. The term "transformation" also covers a situation in which the genotype of a cell has been modified by an exogenous nucleic acid, and that this cell thus transformed expresses said exogenous nucleic acid, for example in the form of a recombinant polypeptide or in the form of a sense or antisense nucleic acid.
"Transgenic animal" for the purposes of the invention is understood to mean a nonhuman animal, preferably a mammal, in which one or more cells contain a heterologous nucleic acid introduced by virtue of human intervention, such as by transgenesis techniques well known to persons skilled in the art. The heterologous nucleic acid is introduced directly or indirectly into the cell or the precursor of the cell, by genetic engineering such as microinjection or 22 infection with a recombinant virus. The heterologous
O
Snucleic acid may be integrated into the chromosome or NO may be provided in the form of DNA replicating extrachromosomally.
\O 0 NUCLEIC ACID FOR REGULATING THE ABCA7 GENE SUsing BAC-type vector libraries prepared from
(N
human and murine genomic material, the inventors succeeded in isolating a nucleic acid for regulating the human and murine ABCA7 genes.
The inventors determined, by comparative analysis of the human and murine genomic sequences, a regulatory nucleic acid comprising in particular two regulatory modules conserved in humans and mice. The inventors therefore determined that the nucleic acid for regulating transcription of the ABCA7 gene, when it is most broadly defined, consists of a polynucleotide comprising, from the 5' end to the 3' end: a nontranscribed region of about 1.2 kb located upstream of the site for initiation of transcription of the ABCA7 gene, and the partial sequence of the first exon of the ABCA7 gene.
In its broadest definition, the nucleic acid for regulating transcription of the ABCA7 gene comprises all the nucleotide regions as defined above 23 and is identified in the sequence SEQ ID No. 1
O
according to the invention.
IN Thus, a first subject of the invention consists of a nucleic acid comprising a polynucleotide \0 5 having at least 20 consecutive nucleotides of the nucleotide sequence SEQ ID No. 1, or a nucleic acid c having a complementary sequence.
The region of about 1.1 Kb located upstream
(N
of the site for initiation of transcription of the ABCA7 gene, and comprising the core promoter and multiple elements for regulating transcription is also included in the sequence identified as SEQ ID No. 2 according to the invention.
More precisely, the nucleotide at position 1 of the sequence SEQ ID No. 2 is the nucleotide at position -1111, relative to the site for initiation of transcription of the ABCA7 gene.
According to a second aspect, the invention relates to a nucleic acid comprising a polynucleotide having at least 20 consecutive nucleotides having the nucleotide sequence SEQ ID No. 2, or a nucleic acid having a complementary sequence.
As already specified above, the nucleic acid for regulating the transcription of the ABCA7 gene having the sequence SEQ ID No. 1 also comprises, in addition to a nontranscribed 5' regulatory region, the part of the first exon of the human ABCA7 gene.
24 The partial sequence of the first exon of the
O
SABCA7 gene is defined as the sequence SEQ ID No. 3.
\O According to a third aspect, the invention relates to a nucleic acid comprising a polynucleotide q\ 5 having at least 20 consecutive nucleotides having the nucleotide sequence SEQ ID No. 3, or a nucleic acid tq having a complementary sequence.
Preferably, a nucleic acid according to the
(N
invention will be in isolated and/or purified form.
Also forming part of the invention is any "biologically active" fragment of a nucleic acid as defined above.
According to yet another aspect, the invention relates to a nucleic acid having at least nucleotide identity with a nucleic acid as defined above.
In particular, this nucleic acid may be of murine origin, and consists of a polynucleotide having the nucleotide sequence SEQ ID NO: 4 comprising from the 5' to the 3' end: a nontranscribed region of about 1.2 Kb located upstream of the site for initiation of transcription of the murine ABCA7 gene, and the partial sequence of the first exon of the ABCA7 gene.
The region of about 1.2 Kb located upstream of the site for initiation of transcription of the ABCA7 gene, and comprising the core promoter and multiple elements for regulating transcription, is also Sincluded in the sequence identified as SEQ ID NO: INO according to the invention.
The invention also includes a nucleic acid 5 characterized in that it hybridizes, under high stringency conditions, with any of the nucleic acids according to the invention.
The invention also relates to a nucleic acid
(N
having at least 80%, advantageously 90%, preferably and most preferably 98% nucleotide identity with a nucleic acid comprising at least 20 consecutive nucleotides of a polynucleotide chosen from the group consisting of the nucleotide sequences SEQ ID No. 1 to SEQ ID No. DETAILED ANALYSIS OF THE SEQUENCES SEQ ID No. 2 AND SEQ ID No. According to a principal characteristic, the nucleic acid having the sequence SEQ ID No. 2, included in the nucleic acid for regulating the human ABCA7 gene having the sequence SEQ ID No. 1, comprises the constituent elements of a core promoter, respectively a degenerate "TATA" box (TTAAG) located 30 bp upstream of the site of initiation of transcription. Likewise, a degenerate "TATA" box (TTAAA) is located 30 bp upstream of the site of initiation of transcription, on the murine nucleic acid having the sequence SEQ ID NO: included in the nucleic acid for regulating the murine 26 ABCA7 gene having the sequence SEQ ID NO: 4. The "TATA"
O
boxes on the promoters of the human and murine ABCA7 O genes as well as the position of the sites of initiation of transcription are represented in 5 Figure 1.
The regulatory sequences SEQ ID No. 2 and SEQ ID No. 5 also comprise numerous binding sites for various transcription factors capable of positively or negatively regulating the activity of the core promoter.
Thus, the various sequences characteristic of the sites for the binding of various transcription factors in the sequences SEQ ID No. 2 and SEQ ID No. were identified by the inventors in the manner detailed below.
The sequences SEQ ID No. 2 and SEQ ID No. were used as reference sequences and treated according to the algorithms of the MatInspector software packages (Quandt et al., Nucl Acid Res (1995) 23(23), 4878-4884) and compared with the data stored in several databases such as Transfac and the presence as well as the location of the various sites characteristic of the sequences SEQ ID No. 2 and 5, and particularly the sites for the binding of the transcription factors were determined according to methods well known to persons skilled in the art.
More particularly, a detailed analysis was carried out using the software packages NNPP (Reese et 27 al. J. Comput Biol. (1997) 4(3) 311-23), TSSG and TSSW
U
S(Soloryev et al., Ismb (1995), 5, 294-302), on the 1.1 kb and 1.2 Kb upstream of the site of initiation of the sequences SEQ ID No. 2 and 5, respectively, made it 5 possible to identify 193 and 233 putative sites for binding to the transcription factors, in humans and mice during the first stage of the search. These are collated in Tables 1 and 2. After compiling and filtering as described above, and comparing the human and murine regulatory sequences, two modules common to the human and murine regulatory sequences were determined, and 5 and 3 possible sites for binding of various transcription factors were selected in the modules 1 and 2, respectively, on the human and murine sequences. The position with the filtration scores for the sites for binding to the transcription factors identified in the 1111 bp of the sequence SEQ ID No. 2 according to the invention, as well as in the 1220 bp of the sequence SEQ ID NO: 5 according to the invention, are presented in Table 3. The various binding sites were also schematically represented in Figure i.
The positions of the starting nucleotides in each of the sites for binding to the transcription factors are designated with reference to the numbering of the nucleotides of the sequences SEQ ID No. 2 and NO: 5 relative to the site of initiation of 28 transcription contained in the sequences SEQ ID
O
No. 1 and No. 4, as represented in Figure 1.
IND Figure 2 represents the sequence SEQ ID NO: 1, which contains the sequence SEQ ID No. 2. The O 5 first nucleotide at position 5' of the sequence of Figure 2 is also the first nucleotide at position 5' of one of the nucleotide sequences SEQ ID No. 1 and SEQ ID No. 2. In Figure 2, the sites for binding to the transcription factors are illustrated in bold characters which delimit their respective positions, and their respective designations are indicated above each of the corresponding boxes. The numbering of the nucleotides of the sequence represented in Figure 2 was carried out relative to the site of initiation of transcription, numbered the nucleotide in 5' of the nucleotide +1 being itself numbered Figure 3 represents the sequence SEQ ID NO: 4, which contains the sequence SEQ ID No. 5. The first nucleotide at position 5' of the sequence of Figure 3 is also the first nucleotide at position 5' of one of the nucleic sequences SEQ ID No. 4 and SEQ ID No. 5. In Figure 3, the sites for binding to the transcription factors are illustrated in bold characters which delimit their respective positions, and their respective designations are indicated above each of the corresponding boxes. The numbering of the nucleotides of the sequence represented in Figure 3 was carried out relative to the site of initiation of 29 transcription, numbered the nucleotide in 5' of the nucleotide +1 being itself numbered IND The genomic analysis of the nucleic acids regulating the human and murine sequences SEQ ID NO: 2 O 5 and 5, revealed two regulatory modules which are noted module 1 and module 2, and are particularly conserved in humans and mice. These two regulatory modules comprise ubiquitous transcription factor binding sites, such as NFl, NFY and AP4, as well as sites for binding of transcription factors specific to the liver such as CEBP and HNF3B. This is compatible with the experimental expression data presented in Example 3 below, and provided by Kaminski et al. (Supra), which show expression of the ABCA7 gene in human fetal hepatic tissues.
The two regulatory modules conserved in mice and humans also comprise sites for binding of transcription factors such as GF11 and NFkappaB (NFkB), which are essentially present in the lymphatic organs.
The description of the characteristics of the sites for binding to each of the transcription factors designated in Figures 2 and 3 as well as in Table 3 can be easily found by persons skilled in the art. A short description of some of them is made below.
NF1 factor: The binding characteristics of the NFl factor can be found in particular in the following entries of the Medline database: 88319941, 91219459, 86140112, S87237877, 90174951, 89282387, 90151633, 892618136, O 86274639, 87064414, 89263791. The NF1 factor recognizes the following palindromic sequence: "TGGCANNNTGCCA S 5 (NNTTGGCNNNNNNNNCCNN)" which is present in viral and Scellular promoters and at the level of the origin of replication of type 2 adenoviruses. These proteins are Scapable of activating transcription and replication.
(N
They bind to DNA in the form of a homodimer.
NFY factor: The NFY factor is in particular described in entry No. P25.208 of the Swissprot database. It is a factor which recognizes a "CCAAT" unit in the promoter sequences such as those of the gene encoding type 1 collagen, albumin and 3-actin. It is a stimulator of transcription.
AP4 factor: Persons skilled in the art will be able to advantageously refer to the articles corresponding to the following entries of the Medline database: 2123466, 2833704, 8530024. The AP4 factor has a domain for binding to DNA of the "helix loop helix" (bHLH) type as well as two dimerization domains. The consensus site of the AP4 factor is the following "CWCAGCTGGN", and the latter generally overlaps with a binding site for the AP1 factor.
CEBP
ND The characteristics for binding to the CEBP factor may be found in particular in the following MD 5 entries of the Medline database: 93315489, 91248826, 0 94193722, 93211931, 92390404, 90258863, 94088523, 90269225 and 96133958. It is an important transcription Sactivator in the regulation of genes involved in the
(N
immune and inflammatory responses. It binds specifically to an IL-1 response element in the gene for IL-6. It presumably plays a role in the regulation of the acute phase of the inflammation and in hemapoiesis. The consensus recognition site is the following: NNGNAA(T/G)".
HNF3B factor: Persons skilled in the art will be able to advantageously refer to the article by OVERDIER et al.
(1994, Mol. Cell Biol. Vol. 14: 2755-2766), as well as to the following entries of the Medline database: 91352065, 91032994, 92345837, 89160814, 91187609, 91160974, 91029477, 94301798 and 94218249. This transcription factor acts as activator of numerous genes in the liver such as the AFT gene and the genes for albumin and tyrosin aminotransferase and interacts with cis-acting regulatory regions of these genes.
GFI1 32 The characteristics for binding to the GFIl Sfactor may be found in particular in the following NO entries of the Medline database: 10762661, 9931446, 9571157, 9285685, 9070650 and 7789186. The GFI1 gene 5 encodes a zinc finger protein involved in the Stranscriptional regulation and more particularly in the Sinterleukin-2 signaling pathway. The consensus Srecognition site is the following:
"NNNNNAAATCANNGNNNNNNN"
NFkappa-B factor: Persons skilled in the art will be able to advantageously refer to the articles corresponding to the following entries of the Medline database: 95369245, 91204058, 94280766, 89345587, 93024383, 88248039, 94173892, 91088538, 91239561, 91218850, 92390404, 90156535, 93377072, 92097536, 93309429, 93267517, 92037544, 914266911, 91105848 and 95073993.
The NFkappa-B factor is a heterodimer consisting of a first subunit of 50 kDa and a second subunit of 65 kDa.
Two heterodimers may form a labile tetramer. Its binding to DNA depends on the presence of zinc It may be induced by numerous agents such as TNF, PKA or PKC. It is a key regulator of genes involved in responses to infection, inflammation and stress.
An essential characteristic of the regulatory nucleic acid according to the invention, and more particularly of the sequence located upstream of the site of initiation of transcription included both in
U
Sthe sequence SEQ ID No. 2 and in the sequence SEQ ID IN No. 5 is the presence of motifs characteristic of putative sites for binding to transcription factors 0 5 involved in the gene expression of the T lymphocytes, 0 such as the transcription factors CEBP, NFKB and GFIl.
GFI1 is a protooncogene which encodes a zinc Sfinger nuclear protein involved in the cytokine signaling pathway and in the clonal amplification of the T cells (Zweidler-McKay, et al., Mol. Cell. Biol.
(1966), 16(8), 4024-4034). The transcription factor GFIl which acts as a transcriptional repressor of the genes which inhibit the activation of the T cells and oncogenesis. It is specifically present in the thymus, the spleen and the T lymphocytes.
The transcription factors CEBP and NFkappaB which are expressed in the thymus, the spleen and the T lymphocytes are well known to persons skilled in the art and act in cooperation in the mediation of the induction of the expression of the genes of the T lymphocytes (Runch et al., 1994) and of the HepB3 cells (Shimizu et al., Gene, (1994) 149, 305-310).
The positions of the starting nucleotides, relative to the site of initiation of transcription which are at -498 and -469 for the CEBP sites, and at -260 for the NFkB site, on the human regulatory module, and at -787 and -760, for the CEBP sites, and at -301 for the NFKB site, show that the two regulatory sites 34 are more distant in the mouse promoter. However, it is
O
probable that the two sites are closer in a three- IND dimensional structure so as to allow coactivation by the two factors CEBP and NFkB.
S 5 The presence of these potential sites for binding to CEBP and to NFkB in a manner conserved in humans and mice on the regulatory nucleic acids according to the invention is compatible with the observation according to which the expression of the gene encoding the human ABCA7 protein is predominant in the hematopoietic tissues and the T lymphocytes, and is thought to be most probably involved in cellular mediation of immunity, in particular in the pathogenesis of atherosclerosis (Kaminski et al., Supra) As already mentioned above, the invention relates to a nucleic acid comprising a polynucleotide having at least 20 consecutive nucleotides of one of the nucleotide sequences SEQ ID No. 1 or 2, and SEQ ID No. 4 or 5, as well as a nucleic acid having a complementary sequence.
Included in the above definition are the nucleic acids comprising one or more "biologically active" fragments of one of the sequences SEQ ID No. 1 or 2, and SEQ ID No. 4 or 5. Persons skilled in the art can easily obtain biologically active fragments of these sequences, by referring in particular to Table 3 above as well as to Figures 2 and 3 in which the various characteristic units of the sequence for
U
Sregulating the ABCA7 gene are present. Persons skilled O in the art can thus obtain such biological active fragments by total or partial chemical synthesis of the 5 corresponding polynucleotides or by causing restriction endonucleases to act in order to obtain desired DNA fragments, it being possible for the restriction sites present on the sequences SEQ ID No. 1 to SEQ ID No. to be easily found from the sequence information, with the aid of common software packages for restriction mapping such as GCG version 9.1 map module.
The production of defined nucleic acid fragments with the aid of restriction endonucleases is for example described in the manual by SAMBROOK et al., (Molecular cloning: a laboratory manual, 2ed. Cold Spring Harbor Laboratory, Cold spring Harbor, New York (1989).
The invention therefore also relates to a nucleic acid as defined above, which is capable of modulating the transcription of a polynucleotide placed under its control.
According to a first preferred embodiment, a biologically active fragment of a nucleic acid for regulating transcription according to the invention comprises a first conserved module (module 2) which comprises the core promoter (TATA box) ranging from the nucleotide at position -1 to the nucleotide at position -390, relative to the site of initiation of transcription, the first nucleotide transcribed being Sthe nucleotide at position 1112 of the nucleotide IND sequence SEQ ID No. 1, or the nucleotide at position 1221 of the nucleotide sequence SEQ ID NO: 4.
S 5 According to a second embodiment, a biologically active fragment of a nucleic acid for regulating transcription according to the invention comprises the conserved modules 1 and 2 (Figure 1) from the nucleotide at position -1 to the nucleotide at position -860, relative to the site of initiation of transcription, the first nucleotide transcribed being the nucleotide at position 1112 of the nucleotide sequence SEQ ID No. 1, or the nucleotide at position 1221 of the nucleotide sequence SE ID NO: 4.
According to a third embodiment, such a biologically active fragment of an acid for regulating transcription according to the invention comprises, in addition to.the core promoter and the proximal regulatory elements, also other regulatory elements such as the various sites GFII, HNF3B, CEBPB, NFl and extends from the nucleotide at position -1 to the nucleotide at position -1111, relative to the site of initiation of transcription, the first nucleotide transcribed being the nucleotide at position 1112 of the nucleotide sequence SEQ ID No. i, and to the nucleotide at position -1220, relative to the site of initiation of transcription, the first nucleotide 37 transcribed being the nucleotide at position 1221 of Q the nucleotide sequence SEQ ID No. 4.
\O ID ANALYSIS OF EXON 1 5 The applicant has also identified the nucleotide sequences located downstream of the site of initiation of transcription and corresponding to the Send of exon 1, human and murine genes encoding the
(N
ABCA7 protein.
More precisely, the 5' end of exon 1, having a size of 1210 nucleotides, starts with the nucleotide at position 1112 of the sequence SEQ ID No. 1 and ends with the nucleotide at position 2322 of the sequence SEQ ID No. 1. The 5' end of exon 1 is identified as the sequence SEQ ID No. 3 and the complete sequence of exon 1 is identified as the sequence SEQ ID No. 6.
Exon 1 contains the beginning of the open reading phase of the human ABCA7 gene, the nucleotide A of the ATG codon being located at position 1208 of the sequences SEQ ID No. 3 and 6. Exon 1 encodes the polypeptide having the sequence SEQ ID No. 7.
Exon 1 is likely to contain elements for regulating the expression of the ABCA7 gene, in particular elements of the amplifying enhancer type and/or elements of the silencer or repressor type.
Consequently, a nucleic acid for regulating transcription according to the invention may also contain, in addition to biologically active fragments .38 of the sequence SEQ ID No. 1, also nucleotide
U
fragments, or even the entire sequences SEQ ID No. 2 to ND SEQ ID No. 3 and 6.
The nucleotide sequences SEQ ID No. 1 to 5 SEQ ID No. 3 and 6, as well as their fragments, may in particular be used as nucleotide probes or primers for c detecting the presence of at least one copy of the ABCA7 gene in a sample, or for amplifying a defined target sequence in the sequence for regulating the ABCA7 gene.
The subject of the invention is therefore also a nucleic acid having at least 80% nucleotide identity with a nucleic acid as defined above, in particular obtained from one of the sequences SEQ ID No. 1 to SEQ ID No. 3 and 6.
The invention also relates to a nucleic acid which hybridizes, under high stringency conditions, with any one of the nucleic acids according to the invention, in particular a nucleic acid obtained from a sequence chosen from the sequences SEQ ID No. 1 to SEQ ID No. 3 and 6.
The invention also relates to a nucleic acid as defined above and characterized, in addition, in that it is capable of modulating the transcription of a polynucleotide of interest placed under its control.
According to a first aspect, such a nucleic acid is capable of activating the transcription of the polynucleotide of interest placed under its control.
39 According to a second aspect, a regulatory
O
Snucleic acid according to the invention may be IND characterized in that it is capable of inhibiting the transcription of the polynucleotide of interest placed 5 under its control.
Preferably, a nucleic acid for regulating transcription according to the invention, when it is suitably located relative to a polynucleotide of interest whose expression is sought, will allow the transcription of said polynucleotide of interest, either constitutively or inducibly.
The inducible character of the transcription initiated by a regulatory nucleic acid according to the invention may be conferred by one or more of the regulatory elements which it contains, for example the presence of one or more sites as defined above in the sequence SEQ ID No. 1 or SEQ ID No. 2.
Furthermore, a tissue-specific expression of the polynucleotide of interest may be sought by placing this polynucleotide of interest under the control of a regulatory nucleic acid according to the invention which is capable, for example, of initiating the transcription of this polynucleotide of interest specifically in certain categories of cells, for example cells of the hematopoietic tissue, such as the peripheral leukocytes, thymus cells, spleen cells and bone marrow.
Preferably, a regulatory nucleic acid
O
according to the invention may comprise one or more IND "discrete" regulatory elements such as enhancer and silencer elements. In particular, such a regulatory 5 nucleic acid may comprise one or more potential sites for binding to the transcription factors as defined in Figure 2.
A regulatory acid according to the invention also includes a sequence which does not comprise the core promoter, that is to say the sequence ranging from the nucleotide at position -1 to the nucleotide at position -25, relative to the site of initiation of transcription.
Such a regulatory nucleic acid will then preferably comprise a so-called "heterologous" core promoter, that is to say a polynucleotide comprising a "TATA" box and a "homeobox" not derived from the nucleic acid for regulating the ABCA7 gene.
Also forming part of the invention is a nucleic acid for regulating transcription comprising all or part of the sequence SEQ ID No. 1 which has been modified, for example, by addition, deletion or substitution of one or more nucleotides. Such modifications may modulate the transcriptional activity by causing an increase or on the contrary a decrease in the activity of the promoter or of the regulatory element.
41 Such a modification may also affect the tissue specificity of the promoter or of the regulatory NO element. Thus, for example, a regulatory nucleic acid according to the invention may be modified so as to 5 stimulate transcription in only one of the tissues in which it is naturally expressed.
An acid for regulating transcription according to the invention may also be modified and be rendered inducible by a particular compound, for example by creating in the sequence an inducible site by a given therapeutic compound.
The modifications in a sequence comprising all or part of the sequence SEQ ID No. 1 and comprising the promoter or a regulatory element may be carried out with methods well known to persons skilled in the art, such as mutagenesis. The activity of the modified promoter or regulatory element may then be tested, for example by cloning the modified promoter upstream of a reporter gene, by transfecting the resulting DNA construct into a host cell and by measuring the level of expression of the reporter gene in the transfected host cell. The activity of the modified promoter can also be analyzed in vivo in transgenic animals. It is also possible to construct libraries of modified fragments which may be screened using functional tests in which, for example, only the promoters or regulatory elements having the desired activity will be selected.
42 Such tests may be based, for example on the
U
use of reporter genes conferring resistance to defined ND compounds, for example to antibiotics. The selection of cells having a regulatory nucleic acid/reporter gene 5 construct and containing a promoter or a regulatory element having a desired modification may then be c isolated by culturing host cells transformed with such a construct in the presence of the defined compound, for example of the defined antibiotic.
The reporter gene may also encode any protein which is easily detectable, for example an optically detectable protein such as luciferase.
Consequently, the subject of the invention is also a nucleic acid comprising: a) a nucleic acid for regulating transcription as defined above; and b) a polynucleotide of interest encoding a polypeptide or a nucleic acid of interest.
According to a first aspect, the polynucleotide of interest whose transcription is desired encodes a protein or a peptide. The protein may be of any type, for example a protein of therapeutic interest, including cytokines, structural proteins, receptors or transcription factors. For example, in the case where transcription specifically in certain tissues is desired, such as for example in cells of the hematopoietic tissue, that is to say of the spleen, of the bone marrow, or in the peripheral leukocytes, the nucleic acid regulating transcription will advantageously comprise a nucleic acid ranging from the nucleotide at position -1 to the nucleotide at position -1111, relative to the site of initiation of transcription of the sequence SEQ ID No. 1 or 2, and ranging from the nucleotide at position -1 to the nucleotide at position -1220 SEQ ID No. 4 or In this case, the polynucleotide of interest will encode a gene involved in combating inflammation, such as a receptor for cytokines or for a superoxide dismutase. If an antitumor effect is desired, it will then be sought to stimulate the number and the activation of the cytotoxic T lymphocytes specific for a given tumor antigen.
In another embodiment, a regulatory nucleic acid according to the invention will be used in combination with a polynucleotide of interest encoding the ABCA7 protein.
Also, the polynucleotide of interest may also be an oligonucleotide of the sense type.
As already mentioned, the polynucleotide of interest may also produce a nucleic acid, such as an antisense nucleic acid specific for a gene, the inhibition of whose translation is sought.
According to another aspect, the polynucleotide of interest whose transcription is regulated by the regulatory nucleic acid is a reporter gene, such as any gene encoding a detectable protein.
44 Among the preferred reporter genes, there may Sbe mentioned in particular the gene for luciferase, for IN P-galactosidase (LacZ), for chloramphenicol acetyl transferase (CAT) or any gene encoding a protein 5 conferring resistance to a particular compound, in particular to an antibiotic.
RECOMBINANT VECTORS The term "vector" for the purpose of the present invention will be understood to mean a circular or linear DNA or RNA molecule which is either in single-stranded or double-stranded form.
According to a first embodiment, a recombinant vector according to the invention is used in order to amplify the regulatory nucleic acid according to the invention which is inserted therein after transformation or transfection of the desired cellular host.
According to a second embodiment, it corresponds to expression vectors comprising, in addition to a regulatory nucleic acid in accordance with the invention, sequences whose expression is sought in a host cell or in a defined multicellular organism.
According to an advantageous embodiment, a recombinant vector according to the invention will comprise in particular the following elements: 4b a regulatory nucleic acid according to the invention; IN a polynucleotide of interest comprising a coding sequence included in the nucleic acid to be 5 inserted into such a vector, said coding sequence being 0 placed in phase with the regulatory signals described in and appropriate sequences for initiation and
(N
termination of transcription.
In addition, the recombinant vectors according to the invention may include one or more origins for replication in the cellular hosts in which their amplification or their expression is sought, markers or selectable markers.
By way of example, the bacterial promoters may be the LacI or LacZ promoters, the T3 or T7 bacteriophage RNA polymerase promoters, the lambda phage PR or PL promoters.
The promoters for eukaryotic cells will comprise the HSV virus thymidine kinase promoter or alternatively the mouse metallothionein-L promoter.
Generally, for the choice of a suitable promoter, persons skilled in the art can advantageously refer to the book by Sambrook et al. (1989) cited above or to the techniques described by Fuller et al. (1996; Immunology in Current Protocols in Molecular Biology).
When the expression of the genomic sequence of the ABCA7 gene is sought, use will preferably be 46 made of the vectors capable of containing large
O
Sinsertion sequences. In this particular embodiment, NO bacteriophage vectors such as the P1 bacteriophage vectors such as the vector p158 or the vector p158/neo8 5 described by Sternberg (Trends Genet., (1992) 8: 1-16; 0 Mamm. Genome (1994) 5: 397-404) will be preferably used.
SThe preferred bacterial vectors according to
(N
the invention are for example the vectors pBR322(ATCC37017) or alternatively vectors such as pAA223-3 (Pharmacia, Uppsala, Sweden), and pGEM1 (Promega Biotech, Madison, WI, UNITED STATES).
There may also be cited other commercially available vectors such as the vectors pQE70, pQE9 (Qiagen), psiX174, pBluescript SA, pNH8A, pNH16A, pNH18A, pNH46A, pWLNEO, pSV2CAT, pOG44, pXTI, pSG(Stratagene).
These may also include the recombinant vector PXP1 described by Nordeen SK et al. (Bio Techniques, (1988), 6: 454-457).
They may also be vectors of the Baculovirus type such as the vector pVL1392/1393 (Pharmingen) used to transfect cells of the Sf9 line (ATCC No. CRL 1711) derived from Spodoptera frugiperda.
They may also be adenoviral vectors such as the human adenovirus of type 2 or A recombinant vector according to the invention may also be a retroviral vector or an adeno- 47
(<N
associated vector (AAV). Such adeno-associated vectors Sare for example described by Flotte et al., Am. J.
Respir., Cell Mol. Biol. (1992) 7: 349-356; Samulski et al., J. Virol. (1989) 63: 3822-3828; or McLaughlin et 5 al., Am. J. Hum. Genet. (1996) 59: 561-569).
To allow the expression of a polynucleotide of interest under the control of a regulatory nucleic Sacid according to the invention, the polynucleotide construct comprising the regulatory sequence and the coding sequence must be introduced into a host cell.
The introduction of such a polynucleotide construct according to the invention into a host cell may be carried out in vitro, according to the techniques well known to persons skilled in the art for transforming or transfecting cells, either in primary culture, or in the form of cell lines. It is also possible to carry out the introduction of the polynucleotides according to the invention in vivo or ex vivo, for the prevention or treatment of diseases linked to a deficiency in the transport of the ABCA7 protein.
To introduce the polynucleotides or the vectors into a host cell, persons skilled in the art can advantageously refer to various techniques, such as the technique for precipitation with calcium phosphate (Graham et al., Virology (1973) 52: 456-457; Chen et al., Mol. Cell. Biol.(1987) 7: 2745-2752), DEAE Dextran (Gopal et al., Mol. Cell. Biol., (1985) 5: 1188-1190), electroporation (Tur-Kaspa et al., Mol. Cel. Biol, 00 c (1986) 6: 716-718.; Potter et al., Proc. Natl. Acad.
SSci. USA (1984), 81(22), 7161-5), direct microinjection (Harland et al., J. Cell Biol (1985) 101: 1094-1095), liposomes charged with DNA (Nicolau et al., Methods Enzymol (1987) 149: 157-76; Fraley et al., Proc. Natl.
Acad. Sci. USA (1979) 76: 3348-3352).
Once the polynucleotide has been introduced Sinto the host cell, it may be stably integrated into
(N
the genome of the cell. The intregration may be achieved at a precise site of the genome, by homologous recombination, or it may be randomly integrated. In some embodiments, the polynucleotide may be stably maintained in the host cell in the form of an episome fragment, the episome comprising sequences allowing the retention and the replication of the latter, either independently, or in a synchronized manner with the cell cycle.
According to a specific embodiment, a method of introducing a polynucleotide according to the invention into a host cell, in particular a host cell obtained from a mammal, in vivo, comprises a step during which a preparation comprising a pharmaceutically compatible vector and a "naked" polynucleotide according to the invention, placed under the control of appropriate regulatory sequences, is introduced by local injection at the level of the chosen tissue, for example a smooth muscle tissue, the 49 "naked" polynucleotide being absorbed by the cells of Sthis tissue.
Compositions for use in vitro and in vivo comprising "naked" polynucleotides are for example 5 described in PCT Application No. WO 95/11307 as well as in the articles by Tacson et al. (Nature Med. (1996) 888-892) and by Huygen et al., (Nat. Med. (1996) S2(8), 893-898).
According to a specific embodiment of the invention, a composition is provided for the in vivo production of a protein of interest. This composition comprises a polynucleotide encoding the polypeptide of interest placed under the control of a regulatory sequence according to the invention, in solution in a physiologically acceptable vector.
The quantity of vector which is injected into the host organism chosen varies according to the site of injection. As a guide, there may be injected between about 0.1 and about 100 pg of regulatory sequence/coding sequence polynucleotide construct into the body of an animal.
When the regulatory nucleic acid according to the invention is located on the polynucleotide construct (or vector), in such a manner as to control the transcription of a sequence comprising an open reading frame encoding the ABCA7 protein, the vector is preferably injected into the body of a patient likely b1U to develop a disease linked to a deficiency in the
O
SABCA7 protein.
\O Consequently, the invention also relates to a pharmaceutical composition intended for the prevention O 5 of or treatment of subjects affected by an ABCA7 protein dysfunction, comprising a regulatory nucleic acid according to the invention and a polynucleotide of interest encoding the ABCA7 protein, in combination with one or more physiologically compatible excipients.
Advantageously, such a composition will comprise the regulatory nucleic acid defined by one of the sequences SEQ ID No. 1 or 2, and SEQ ID No. 4 or or a biologically active fragment of this regulatory nucleic acid.
The subject of the invention is, in addition, a pharmaceutical composition intended for the prevention of or treatment of subjects affected by a dysfunction in the metabolism of lipids, comprising a recombinant vector as defined above, in combination with one or more physiologically compatible excipients.
The subject of the invention is also a pharmaceutical composition intended for the prevention of or treatment of subjects affected by a dysfunction in the processes involving the immune system and inflammation, comprising a recombinant vector as defined above, in combination with one or more physiologically compatible excipients.
The invention also relates to the use of a
U
Spolynucleotide construct in accordance with the IND invention and comprising a nucleic acid for regulating the ABCA7 gene as well as a sequence encoding the ABCA7 5 protein, for the manufacture of a medicament intended for the prevention of or treatment of subjects affected C by a dysfunction in the metabolism of lipids or by a problem of immunological origin or of inflammatory origin.
The invention also relates to the use of a recombinant vector according to the invention, comprising, in addition to a regulatory nucleic acid of the invention, a nucleic acid encoding the ABCA7 protein, for the manufacture of a medicament intended for the prevention of or treatment of subjects affected by a dysfunction in the processes involving the immune system and inflammation.
Vectors useful in methods of somatic gene therapy and compositions containing such vectors The present invention also relates to a new therapeutic approach for the treatment and/or prevention of pathologies linked to the metabolism of lipids as well as for the treatment and/or prevention of pathologies linked to the dysfunction in the mechanisms of lymphocyte mediation of inflammation. It provides an advantageous solution to the disadvantages of the prior art, by demonstrating the possibility of 52 treating pathologies, in particular pathologies linked Sto a dysfunction in the metabolism of lipids in myelo- INO lymphatic tissues, by gene therapy, by the transfer and the expression in vivo of a polynucleotide construct k\ 5 comprising, in addition to a regulatory nucleic acid according to the invention, a sequence encoding an ABCA7 protein which is highly presumed to be involved in the transport and/or metabolism of lipids. The invention thus offers a simple means allowing a specific and effective treatment of subjects affected by a dysfunction in the processes involving the immune system and inflammation.
Gene therapy consists in correcting a deficiency or an abnormality (mutation, aberrant expression and the like) or in bringing about the expression of a protein of therapeutic interest by introducing genetic information into the affected cell or organ. This genetic information may be introduced either ex vivo into a cell extracted from the organ, the modified cell then being reintroduced into the body, or directly in vivo into the appropriate tissue.
In this second case, various techniques exist, among which various transfection techniques involving complexes of DNA and DEAE-dextran (Pagano et al., J.Virol. 1 (1967) 891), of DNA and nuclear proteins (Kaneda et al., Science 243 (1989) 375), of DNA and lipids (Felgner et al., PNAS 84 (1987) 7413), the use of liposomes (Fraley et al., J.Biol.Chem. 255 (1980) 53 10431), and the like. More recently, the use of viruses as vectors for the transfer of genes has appeared as a NO promising alternative to these physical transfection techniques. In this regard, various viruses have been 5 tested for their capacity to infect certain cell populations. In particular, the retroviruses (RSV, HMS, MMS, and the like), the HSV virus, the adeno-associated viruses and the adenoviruses.
The present invention therefore also relates to a new therapeutic approach for the treatment of pathologies linked to the transport of lipids, consisting in transferring and expressing in vivo genes encoding ABCA7 placed under the control of a regulatory acid according to the invention. It is particularly advantageous to construct recombinant viruses containing a DNA sequence comprising a regulatory nucleic acid according to the invention and a sequence encoding an ABCA7 protein involved in the metabolism of lipids, to administer these recombinant viruses in vivo, and that this administration allows a stable and effective expression of a biologically active ABCA7 protein in vivo and with no cytopathological effect.
The adenoviruses constitute particularly efficient vectors for the transfer and expression of the ABCA7 gene. In particular, the use of recombinant adenoviruses as vectors makes it possible to obtain sufficiently high levels of expression of the gene of interest to produce the desired therapeutic effect.
b4 Other viral vectors, such as retroviruses or adeno-
O
Sassociated viruses (AAV), allowing a stable expression IND of the gene are also claimed.
The present invention thus offers a new 5 approach for the treatment and prevention of C pathologies linked to dysfunctions in the metabolism of lipids and in the signaling pathways for inflammation by the lymphocytes.
The subject of the invention is therefore also a defective recombinant virus comprising a regulatory nucleic acid according to the invention and a nucleic sequence encoding an ABCA7 protein involved in the metabolism of lipids or in processes involving the immune system and inflammation.
The invention also relates to the use of such a defective recombinant virus for the preparation of a pharmaceutical composition intended for the treatment and/or prevention of dysfunctions in the signaling of inflammation by the lymphocytes.
The present invention also relates to the use of cells genetically modified ex vivo with a virus as described above, or of cells producing such as viruses, implanted in the body, allowing a prolonged and effective expression in vivo of a biologically active ABCA7 protein.
The present invention shows that it is possible to incorporate a DNA sequence encoding ABCA7 under the control of a regulatory nucleic acid as defined above into a viral vector, and that these vectors make it possible to effectively express a INO biologically active mature form. More particularly, the invention shows that the in vivo expression of ABCA7 5 may be obtained by direct administration of an adenovirus or by implantation of a producing cell or of a cell genetically modified by an adenovirus or by a retrovirus incorporating such a DNA.
The present invention is particularly advantageous because it makes it possible to induce a controlled expression, and with no harmful effect, of ABCA7 in organs which are not normally involved in the expression of this protein. In particular, a significant release of the ABCA7 protein is obtained by implantation of cells producing vectors of the invention, or infected ex vivo with vectors of the invention.
The mediator activity in the metabolism of lipids produced in the context of the present invention may be of the human or animal ABCA7 type. The nucleic sequence used in the context of the present invention may be a cDNA, a genomic DNA (gDNA), an RNA (in the case of retroviruses) or a hybrid construct consisting, for example, of a cDNA into which one or more introns would be inserted. It may also involve synthetic or semisynthetic sequences. In a particularly advantageous manner, a cDNA or a gDNA is used. In particular, the use of a gDNA allows a better expression in human bb cells. To allow their incorporation into a viral vector
U
according to the invention, these sequences are IND advantageously modified, for example by site-directed mutagenesis, in particular for the insertion of 5 appropriate restriction sites. The sequences described in the prior art are indeed not constructed for use according to the invention, and prior adaptations may prove necessary, in order to obtain substantial expressions. In the context of the present invention, the use of a nucleic sequence encoding a human ABCA7 protein is preferred. Moreover, it is also possible to use a construct encoding a derivative of these ABCA7 proteins. A derivative of these ABCA7 proteins comprises, for example, any sequence obtained by mutation, deletion and/or addition relative to the native sequence, and encoding a product retaining the activity of mediator of the metabolism of lipids. These modifications may be made by techniques known to a person skilled in the art (see general molecular biological techniques below). The biological activity of the derivatives thus obtained can then be easily determined, as indicated in particular in the examples describing the measurement of the efflux of lipids from cells. The derivatives for the purposes of the invention may also be obtained by hybridization from nucleic acid libraries, using as probe the native sequence or a fragment thereof.
57 These derivatives are in particular molecules
O
Shaving a higher affinity for their binding sites, INO molecules exhibiting greater resistance to proteases, molecules having a higher therapeutic efficacy or fewer 5 side effects, or optionally having new biological properties. The derivatives also include the modified DNA sequences allowing improved expression in vivo.
In a first embodiment, the present invention relates to a defective recombinant virus comprising a regulatory nucleic acid according to the invention and a cDNA sequence encoding an ABCA7 protein involved in the transport and metabolism of cholesterol. In another preferred embodiment of the invention, the DNA sequence is a gDNA sequence. The cDNA sequence encoding the ABCA7 protein, and which can be used in a vector according to the invention, is advantageously the sequence SEQ ID No. 8.
The vectors of the invention may be prepared from various types of viruses. Preferably, vectors derived from adenoviruses, adeno-associated viruses (AAV), herpesviruses (HSV) or retroviruses are used. It is most particularly advantageous to use an adenovirus, for direct administration or for the ex vivo modification of cells intended to be implanted, or a retrovirus, for the implantation of producing cells.
The viruses according to the invention are defective, that is to say that they are incapable of autonomously replicating in the target cell. Generally, 58 the genome of the defective viruses used in the context
O
of the present invention therefore lacks at least the ND sequences necessary for the replication of said virus in the infected cell. These regions may be either 5 eliminated (completely or partially), or made 0 nonfunctional, or substituted with other sequences and in particular with the nucleic sequence encoding the SABCA7 protein. Preferably, the defective virus retains,
(N
nevertheless, the sequences of its genome which are necessary for the encapsidation of the viral particles.
As regards more particularly adenoviruses, various serotypes, whose structure and properties vary somewhat, have been characterized. Among these serotypes, human adenoviruses of type 2 or 5 (Ad 2 or Ad 5) or adenoviruses of animal origin (see Application WO 94/26914) are preferably used in the context of the present invention. Among the adenoviruses of animal origin which can be used in the context of the present invention, there may be mentioned adenoviruses of canine, bovine, murine (example: Mavl, Beard et al., Virology 75 (1990) 81), ovine, porcine, avian or simian (example: SAV) origin. Preferably, the adenovirus of animal origin is a canine adenovirus, more preferably a CAV2 adenovirus [Manhattan or A26/61 strain (ATCC VR- 800) for example]. Preferably, adenoviruses of human or canine or mixed origin are used in the context of the invention. Preferably, the defective adenoviruses of the invention comprise the ITRs, a sequence allowing the encapsidation and the sequence encoding the ABCA7
O
protein placed under the control of a nucleic acid OD according to the invention. Advantageously, in the genome of the adenoviruses of the invention, the El region at least is made nonfunctional. Still more Spreferably, in the genome of the adenoviruses of the N invention, the El gene and at least one of the E2, E4 Sand L1-L5 genes are nonfunctional. The viral gene
(N
considered may be made nonfunctional by any technique known to a person skilled in the art, and in particular by total suppression, by substitution, by partial deletion or by addition of one or more bases in the gene(s) considered. Such modifications may be obtained in vitro (on the isolated DNA) or in situ, for example, by means of genetic engineering techniques, or by treatment by means of mutagenic agents. Other regions may also be modified, and in particular the E3 (W095/02697), E2 (W094/28938), E4 (W094/28152, W094/12649, W095/02697) and L5 (W095/02697) region.
According to a preferred embodiment, the adenovirus according to the invention comprises a deletion in the El and E4 regions and the sequence encoding ABCA7 is inserted at the level of the inactivated El region.
According to another preferred embodiment, it comprises a deletion in the El region at the level of which the E4 region and the sequence encoding ABCA7 (French Patent Application FR94 13355) are inserted.
0U The defective recombinant adenoviruses
O
Saccording to the invention may be prepared by any NO technique known to persons skilled in the art (Levrero et al., Gene (1991) 101: 195, EP 185 573; Graham, EMBO k\ 5 J. (1984) 3: 2917). In particular, they may be prepared 0 by homologous recombination between an adenovirus and a plasmid carrying, inter alia, the DNA sequence encoding Sthe ABCA7 protein. The homologous recombination occurs
(N
after cotransfection of said adenoviruses and plasmid into an appropriate cell line. The cell line used must preferably be transformable by said elements, and contain the sequences capable of complementing the part of the defective adenovirus genome, preferably in integrated form in order to avoid the risks of recombination. By way of example of a line, there may be mentioned the human embryonic kidney line 293 (Graham et al., J. Gen. Virol. (1977) 36: 59) which contains in particular, integrated into its genome, the left part of the genome of an Ad5 adenovirus or lines capable of complementing the El and E4 functions as described in particular in Applications No.
WO 94/26914 and W095/02697.
Next, the adenoviruses which have multiplied are recovered and purified according to conventional molecular biological techniques, as illustrated in the examples.
As regards the adeno-associated viruses (AAV), they are DNA viruses of a relatively small size, which 61 integrate into the genome of the cells which they Sinfect, in a stable and site-specific manner. They are IND capable of infecting a broad spectrum of cells, without inducing any effect on cellular growth, morphology or differentiation. Moreover, they do not appear to be involved in pathologies in humans. The genome of AAVs has been cloned, sequenced and characterized. It comprises about 4700 bases, and contains at each end an inverted repeat region (ITR) of about 145 bases, serving as replication origin for the virus. The remainder of the genome is divided into 2 essential regions carrying the encapsidation functions: the left hand part of the genome, which contains the rep gene involved in the viral replication and the expression of the viral genes; the right hand part of the genome, which contains the cap gene encoding the virus capsid proteins.
The use of vectors derived from AAVs for the transfer of genes in vitro and in vivo has been described in the literature (see in particular WO 91/18088; WO 93/09239; US 4,797,368, US 5,139,941, EP 488 528). These documents describe various constructs derived from AAVs, in which the rep and/or cap genes are deleted and replaced by a gene of interest, and their use for transferring in vitro (on cells in culture) or in vivo (directly into an organism) said gene of interest. However, none of these documents either describes or suggests the use of a recombinant AAV for the transfer and expression in vivo 62 or ex vivo of an ABCA7 protein, or the advantages of such a transfer. The defective recombinant AAVs IN according to the invention may be prepared by cotransfection, into a cell line infected with a human S 5 helper virus (for example an adenovirus), of a plasmid containing the sequence encoding the ABCA7 protein bordered by two AAV inverted repeat regions (ITR), and Sof a plasmid carrying the AAV encapsidation genes (rep and cap genes). The recombinant AAVs produced are then purified by conventional techniques.
As regards the herpesviruses and the retroviruses, the construction of recombinant vectors has been widely described in the literature: see in particular Breakfield et al., (New Biologist 3 (1991) 203); EP 453242, EP 178220, Bernstein et al. (Genet.
Eng. 7 (1985) 235); McCormick, (BioTechnology 3 (1985) 689), and the like.
In particular, the retroviruses are integrating viruses, infecting dividing cells. The genome of the retroviruses essentially comprises two LTRs, an encapsidation sequence and three coding regions (gag, pol and env). In the recombinant vectors derived from retroviruses, the gag, pol and env genes are generally deleted, completely or partially, and replaced with a heterologous nucleic acid sequence of interest. These vectors may be produced from various types of retroviruses such as in particular MoMuLV ("murine moloney leukemia virus"; also called MoMLV), 1- 63 MSV ("murine moloney sarcoma virus"), HaSV ("harvey
O
sarcoma virus"); SNV ("spleen necrosis virus"); RSV ND ("rous sarcoma virus") or Friend's virus.
To construct recombinant retroviruses 5 containing a sequence encoding the ABCA7 protein placed under the control of a regulatory nucleic acid according to the invention, a plasmid containing in Sparticular the LTRs, the encapsidation sequence and
(N
said coding sequence is generally constructed, and then used to transfect a so-called encapsidation cell line, capable of providing in trans the retroviral functions deficient in the plasmid. Generally, the encapsidation lines are therefore capable of expressing the gag, pol and env genes. Such encapsidation lines have been described in the prior art, and in particular the PA317 line (US 4,861,719), the PsiCRIP line (WO 90/02806) and the GP+envAm-12 line (WO 89/07150). Moreover, the recombinant retroviruses may contain modifications at the level of the LTRs in order to suppress the transcriptional activity, as well as extended encapsidation sequences, containing a portion of the gag gene (Bender et al., J. Virol. 61 (1987) 1639). The recombinant retroviruses produced are then purified by conventional techniques.
To carry out the present invention, it is most particularly advantageous to use a defective recombinant adenovirus. The results given below indeed demonstrate the particularly advantageous properties of 64 adenoviruses for the in vivo expression of a protein
O
Shaving a lipid metabolism mediator activity. The IO adenoviral vectors according to the invention are particularly advantageous for a direct administration \O 5 in vivo of a purified suspension, or for the ex vivo transformation of cells, in particular autologous cells, in view of their implantation. Furthermore, the adenoviral vectors according to the invention exhibit,
(N
in addition, considerable advantages, such as in particular their very high infection efficiency, which makes it possible to carry out infections using small volumes of viral suspension.
According to another particularly advantageous embodiment of the invention, a line producing retroviral vectors containing a regulatory nucleic acid according to the invention and the sequence encoding the ABCA7 protein is used for implantation in vivo. The lines which can be used to this end are in particular the PA317 (US 4,861,719), PsiCrip (WO 90/02806) and GP+envAm-12 (US 5,278,056) cells modified so as to allow the production of a retrovirus containing a nucleic sequence encoding an ABCA7 protein according to the invention. For example, totipotent stem cells, precursors of blood cell lines, may be collected and isolated from the subject. These cells, when cultured, may then be transfected with the retroviral vector containing the sequence encoding the ABCA7 protein under the control of its own promoter.
These cells are then reintroduced into the subject. The Sdifferentiation of these cells will be responsible for ID cells of the hematopoietic tissue expressing the ABCA7 protein, in particular T lymphocytes which participate 5 in the signaling of inflammation.
Advantageously, in the vectors of the invention, the sequence encoding the ABCA7 protein is placed under the control of a regulatory acid according to the invention comprising the regulatory elements allowing its expression in the infected cells, and most particularly the regulatory elements of the NFkappaB, CEBP and GFIl type.
As indicated above, the present invention also relates to any use of a virus as described above for the preparation of a pharmaceutical composition intended for the treatment and/or prevention of pathologies linked to the metabolism of lipids or to the dysfunction linked to the processes involving the immune system and inflammation.
The present invention also relates to a pharmaceutical composition comprising one or more defective recombinant viruses as described above. These pharmaceutical compositions may be formulated for administration by the topical, oral, parenteral, intranasal, intravenous, intramuscular, subcutaneous, intraocular or transdermal route and the like.
Preferably, the pharmaceutical compositions of the invention contain a pharmaceutically acceptable vehicle 17~b b for an injectable formulation, in particular for an
U
intravenous injection, such as for example into the NO patient's portal vein. They may relate in particular to isotonic sterile solutions or dry, in particular 5 freeze-dried, compositions which, upon addition depending on the case of sterilized water or physiological saline, allow the preparation of injectable solutions. Direct injection into the patient's portal vein is advantageous because it makes it possible to target the infection at the level of the liver and thus to concentrate the therapeutic effect at the level of this organ.
The doses of defective recombinant virus used for the injection may be adjusted as a function of various parameters, and in particular as a function of the viral vector, of the mode of administration used, of the relevant pathology or of the desired duration of treatment. In general, the recombinant adenoviruses according to the invention are formulated and administered in the form of doses of between 104 and 1014 pfu/ml, and preferably 106 to 1010 pfu/ml. The term pfu ("plaque forming unit") corresponds to the infectivity of a virus solution, and is determined by infecting an appropriate cell culture and measuring, generally after 48 hours, the number of plaques of infected cells. The techniques for determining the pfu titer of a viral solution are well documented in the literature.
67 As regards retroviruses, the compositions
U
according to the invention may directly contain the IN producing cells, with a view to their implantation.
In this regard, another subject of the 5 invention relates to any mammalian cell infected with one or more defective recombinant viruses as described above. More particularly, the invention relates to any population of human cells infected with these viruses.
These may be in particular cells of blood origin (totipotent stem cells or precursors), fibroblasts, myoblasts, hepatocytes, keratinocytes, smooth muscle and endothelial cells, glial cells and the like.
The cells according to the invention may be derived from primary cultures. These may be collected by any technique known to persons skilled in the art and then cultured under conditions allowing their proliferation. As regards more particularly fibroblasts, these may be easily obtained from biopsies, for example according to the technique described by Ham (Methods Cell Biol (1980) 21a: 255) These cells may be used directly for infection with the viruses, or stored, for example by freezing, for the establishment of autologous libraries, in view of a subsequent use. The cells according to the invention may be secondary cultures, obtained for example from preestablished libraries (see for example EP 228458, EP 289034, EP 400047, EP 456640) 68 The cells in culture are then infected with Sthe recombinant viruses, in order to confer on them the IND capacity to produce a biologically active ABCA7 protein. The infection is carried out in vitro 5 according to techniques known to persons skilled in the art. In particular, depending on the type of cells used and the desired number of copies of virus per cell, persons skilled in the art can adjust the multiplicity of infections and optionally the number of infectious cycles produced. It is clearly understood that these steps must be carried out under appropriate conditions of sterility when the cells are intended for administration in vivo. The doses of recombinant virus used for the infection of the cells may be adjusted by persons skilled in the art according to the desired aim. The conditions described above for the administration in vivo may be applied to the infection in vitro. For the infection with retroviruses, it is also possible to coculture the cells which it is desired to infect with cells producing the recombinant retroviruses according to the invention. This makes it possible to dispense with the purification of the retroviruses.
Another subject of the invention relates to an implant comprising mammalian cells infected with one or more defective recombinant viruses as described above or cells producing recombinant viruses, and an extracellular matrix. Preferably, the implants -69 according to the invention comprise 10 5 to 1010 cells.
O
More preferably, they comprise 106 to 108 cells.
NO More particularly, in the implants of the invention, the extracellular matrix comprises a gelling O 5 compound and optionally a support allowing the anchorage of the cells.
For the preparation of the implants according to the invention, various types of gelling agents may be used. The gelling agents are used for the inclusion of the cells in a matrix having the constitution of a gel, and for promoting the anchorage of the cells on the support, where appropriate. Various cell adhesion agents can therefore be used as gelling agents, such as in particular collagen, gelatin, glycosaminoglycans, fibronectin, lectins and the like. Preferably, collagen is used in the context of the present invention. This may be collagen of human, bovine or murine origin. More preferably, type I collagen is used.
As indicated above, the compositions according to the invention advantageously comprise a support allowing the anchorage of the cells. The term anchorage designates any form of biological and/or chemical and/or physical interaction causing the adhesion and/or the attachment of the cells to the support. Moreover, the cells may either cover the support used, or penetrate inside this support, or both. It is preferable to use in the context of the invention a solid, nontoxic and/or biocompatible
S/U
support. In particular, it is possible to use Q polytetrafluoroethylene (PTFE) fibers or a support of ND biological origin.
The present invention thus offers a very 5 effective means for the treatment or prevention of pathologies linked to the transport of cholesterol, in particular obesity, hypertriglyceridemia, or, in the Sfield of cardiovascular conditions, myocardial
(N
infarction, angina, sudden death, cardiac decompensation and cerebrovascular accidents.
In addition, this treatment may be applied to both humans and any animals such as ovines, bovines, domestic animals (dogs, cats and the like), horses, fish and the like.
RECOMBINANT HOST CELLS The invention also relates to a recombinant host cell comprising any one of the nucleic acids of the invention having the sequence SEQ ID No. 1 to SEQ ID No. 6, and more particularly a nucleic acid having the sequence SEQ ID NO 1 to SEQ ID No. 3.
According to another aspect, the invention also relates to a recombinant host cell comprising a recombinant vector as described above.
The preferred host cells according to the invention are for example the following: a) prokaryotic host cells: strains of Escherichia coli (strain DH5-a), of Bacillus subtilis, 71 of Salmonella typhimurium, or strains of species such Q) as Pseudomonas, Streptomyces and Staphylococus IN b) eukaryotic host cells: HeLa cells (ATCC No. CCL2), Cv 1 cells (ATCC No. CCL70), COS cells (ATCC 5 No. CRL 1650), Sf-9 cells (ATCC No. CRL 1711), CHO cells (ATCC No. CCL-61) or 3T3 cells (ATCC No. CRL- 6361), or cells of the Hepa 1-6 line referenced at the SAmerican Type Culture Collection (ATCC, Rockville, MD, United States of America).
c) primary culture cells obtained from an individual in whom the expression of a nucleic acid of interest, placed under the control of a regulatory nucleic acid according to the invention, is sought.
d) cells multiplying indefinitely (cell lines) obtained from the primary culture cells of c) above, according to techniques well known to persons skilled in the art.
METHODS OF SCREENING Method of screening in vitro The invention provides methods for treating a subject suffering from a pathology linked to the level of expression of the ABCA7 protein. In particular, such a method of treatment consists in administering to the subject a compound modulating the expression of the ABCA7 gene, which may be identified by various methods of screening in vitro as defined below.
A first method consists in identifying
O
Scompounds modulating the expression of the ABCA7 gene.
\O According to such a method, cells expressing the ABCA7 gene are incubated with a candidate substance or molecule to be tested and the level of expression of the messenger RNA for ABCA7 or the level of production C of the ABCA7 protein is then determined.
The levels of messenger RNA for ABCA7 may be determined by gel hybridization of the Northern type, well known to persons skilled in the art. The levels of messenger RNA for ABCA7 may also be determined by methods using PCR or the technique described by WEBB et al. (Journal of Biomolecular Screening (1996), vol. 1: 119).
The levels of production of the ABCA7 protein may be determined by immunoprecipitation or immunochemistry using an antibody which specifically recognizes the ABCA7 protein.
According to another method of screening a candidate molecule or substance modulating the activity of a regulatory nucleic acid according to the invention, a nucleotide construct as defined above, comprising a regulatory nucleic acid according to the invention as well as a reporter polynucleotide placed under the control of the regulatory nucleic acid, is used, said regulatory nucleic acid comprising at least one core promoter and at least one element for regulating one of the sequences SEQ ID No. 1 to SEQ ID No. 3. The reporter polynucleotide may be a gene
U
Sencoding a detectable protein, such as a gene encoding ND a luciferase.
According to such a screening method, the 5 cells are transfected with the polynucleotide construct containing the regulatory nucleic acid according to the invention and the reporter polynucleotide, in a stable and transient manner.
The transformed cells are then incubated in the presence or in the absence of the candidate molecule or substance to be tested for a sufficient time, and then the level of expression of the reporter gene is determined. The compounds which induce a statistically significant change in the expression of the reporter gene (either an increase, or on the contrary a decrease in the expression of the reporter gene) are then identified and, where appropriate, selected.
Thus, the subject of the invention is also a method for the in vitro screening of a molecule or substance modulating the activity of a regulatory nucleic acid according to the invention, in particular modulating the transcription of the constitutive reporter polynucleotide of a polynucleotide construct according to the invention, characterized in that it comprises the steps consisting in: a) culturing a recombinant host cell comprising a polynucleotide of interest placed under 74 the control of a regulatory nucleic acid according to Sthe invention; IN b) incubating the recombinant host cell with the substance or molecule to be tested; 5 c) detecting the expression of the polynucleotide of interest; d) comparing the results obtained in step c) with the results obtained when the recombinant host
(N
cell is cultured in the absence of the candidate molecule or substance to be tested.
The invention also relates to a kit or box for the in vitro screening of a candidate molecule or substance capable of modulating the activity of a regulatory nucleic acid according to the invention, comprising: a) a host cell transformed with a polynucleotide construct as defined above, comprising a reporter polynucleotide of interest placed under the control of a regulatory nucleic acid according to the invention; and b) where appropriate, means for detecting the expression of the reporter polynucleotide of interest.
Preferably, the reporter polynucleotide of interest is the luciferase coding sequence. In this case, the regulatory nucleic acid according to the invention is inserted into a vector, upstream of the sequence encoding luciferase. This may be for example the vector pGL3-basic (pGL3-b) marketed by the company PROMEGA (Madison, Wisconsin, United States).
N
D In this case, the recombinant vector comprising the luciferase coding sequence placed under 5 the control of a regulatory nucleic acid according to 0 the invention is transfected into the recombinant host cells whose luciferase activity is then determined Safter culturing in the presence or in the absence of
(N
the candidate substance or molecule to be tested.
It is possible in this case to use as controls pGL3-b vectors containing either the cytomegalovirus (CMV) promoter, the ApoAI promoter or no promoter. To test for the luciferase activity, the transfected cells are washed with a PBS buffer and lyzed with 500 ul of lysis buffer (50 mM tris, 150 mM NaC1, 0.02% sodium azide, 1% of NP-40, 100 pg/ml of AEBSF and 5 pg/ml of leupeptin).
pl of the cell lysate obtained are then added to 100 ul of the luciferase substrate (Promega) and the measurements of activity are carried out on a spectrophotometric microplate reader, 5 minutes after the addition of the cell lysate.
The data are expressed as relative units of luciferase activity. The polynucleotide constructs producing high levels of luciferase activity in the transfected cells are those which contain a regulatory nucleic acid according to the invention contained in 16 the sequence SEQ ID No. 1 which is capable of
U
Sstimulating transcription.
IND For the measurements of the levels of expression of messenger RNA in a screening method O 5 according to the invention, probes specific for the messenger RNA for the reporter polynucleotide of interest are first of all prepared, for example with the aid of the multiprime labeling kit (marketed by the company Amersham Life Sciences, Cleveland, Ohio, United States) METHOD OF SCREENING IN VIVO According to another aspect of the invention, compositions modulating the activity of a regulatory nucleic acid according to the invention may be identified in vivo, in nonhuman transgenic animals.
According to such a method, a nonhuman transgenic animal, for example a mouse, is treated with a candidate molecule or substance to be tested, for example a candidate substance or molecule previously selected by an in vitro screening method as defined above.
After a defined period, the level of activity of the regulatory nucleic acid according to the invention is determined and compared with the activity of an identical nonhuman transgenic animal, for example an identical transgenic mouse, which has not received the candidate molecule or substance.
77 The activity of the regulatory nucleic acid
U
according to the invention which is functional in the IN transgenic animal may be determined by various methods, for example the measurement of the levels of messenger 5 RNA corresponding to the reporter polynucleotides of 0 interest placed under the control of said regulatory nucleic acid by Northern-type hybridization, or by in Ssitu hybridization or by noninvasive biophotonic
(N
imaging (Xenogen Corporation).
Alternatively, the activity of the regulatory nucleic acid according to the invention may be determined by measuring the levels of expression of the protein encoded by the reporter polynucleotides of interest, for example by immunohistochemistry, in the case where the reporter polynucleotide of interest comprises an open reading frame encoding a protein detectable by such a technique.
To carry out an in vivo method of screening a candidate substance or molecule modulating the activity of a regulatory nucleic acid according to the invention, nonhuman mammals such as mice, rats, guinea pigs or rabbits whose genome has been modified by the insertion of a polynucleotide construct comprising a reporter polynucleotide of interest placed under the control of a regulatory nucleic acid according to the invention, will be preferred.
The transgenic animals according to the invention comprise the transgene, that is to say the 78 abovementioned polynucleotide construct, in a plurality Sof their somatic and/or germ cells.
NO The construction of transgenic animals according to the invention may be carried out according 0 5 to conventional techniques well known to persons 0 skilled in the art. Persons skilled in the art will in particular be able to refer to the production of Stransgenic animals, and particularly to the production
(N
of transgenic mice, as described in patents US No. 4,873,191 (granted on 10 October 1989), US No. 5,464,764 (granted on 7 November 1995) and US 5,789,215 (granted on 4 August 1998), the content of these documents being incorporated herein by reference.
In brief, a polynucleotide construct comprising a regulatory nucleic acid according to the invention and a reporter polynucleotide of interest placed under the control of the latter is inserted into an ES-type stem cell line. The insertion of the polynucleotide construct is preferably carried out by electroporation, as described by Thomas et al. (1987, Cell, Vol. 51:503-512).
The cells which have been subjected to the electroporation step are then screened for the presence of the polynucleotide construct (for example by selection with the aid of markers, or by PCR or by Southern-type analysis of DNA on an electrophoresis gel) in order to select the positive cells which have integrated the exogenous polynucleotide construct into their genome, where appropriate following a homologous
U
recombination event. Such a technique is for example described by MANSOUR et al. (Nature (1988) 336: 348-352) 5 Next, the positively selected cells are isolated, cloned and injected into 3.5-day old mouse blastocysts, as described by BRADLEY (1987, Production Sand Analysis of Chimaeric mice. In: E.J. ROBERTSON teratocarcinomas and embryonic stem cells: A practical approach. IRL press, Oxford, page 113). The blastocysts are then introduced into a female host animal and the development of the embryo is continued to term.
Alternatively, positively selected ES-type cells are brought into contact with 2.5-day old embryos at an 8-16 cell stage (morulae) as described by WOOD et al. (1993, Proc. Natl. Acad. Sci. USA, vol.90: 4582- 4585) or by NAGY et al. (1993, Proc. Natl. Acad. Sci.
USA, vol. 90: 8424-8428), the ES cells being internalized so as to extensively colonize the blastocyst, including the cells producing the germ line.
The progeny is then tested in order to determine those which have integrated the polynucleotide construct (the transgene).
The subject of the invention is therefore also a nonhuman transgenic animal whose somatic and/or germ cells have been transformed with a nucleic acid or a polynucleotide construct according to the invention.
ND The invention also relates to recombinant host cells obtained from a transgenic animal as 5 described above. Recombinant cell lines obtained from a transgenic animal according to the invention may be established in a long-term culture from any tissue of such a transgenic animal, for example by transfection
(N
of the primary cell cultures with vectors expressing oncogenes such as the SV40 large T antigen, as described for example by CHOU (1989, Mol. Endocrinol.
Vol. 3: 1511-1514) and SCHAY et al. (1991, Biochem.
Biophys. Acta, vol. 1072: 1-7).
The invention also relates to a method for the in vivo screening of a candidate molecule or substance modulating the activity of a regulatory nucleic acid according to the invention, comprising the steps consisting in a) administering the candidate substance or molecule to a transgenic animal as defined above; b) detecting the level of expression of a reporter polynucleotide of interest placed under the control of the regulatory nucleic acid; c) comparing the results obtained in b) with the results obtained with a transgenic animal which has not received the candidate substance or molecule.
The invention in addition relates to a method for the in vivo screening of a substance or molecule modulating the transcription of a polynucleotide of 81 interest constituting a nucleic acid according to the invention, characterized in that it comprises the steps consisting in administering the candidate substance or molecule to a nonhuman transgenic mammal as defined above; detecting the expression of the polynucleotide of interest in the transgenic mammal as treated in step and comparing the detection results of step with the results observed in a nonhuman transgenic mammal as defined above which has not received the administration of the candidate substance or molecule.
The invention also relates to a kit or box for the in vivo screening of a candidate molecule or substance modulating the activity of a regulatory nucleic acid according to the invention, comprising a) a transgenic animal as defined above; b) where appropriate, the means for detecting the level of expression of the reporter polynucleotide of interest.
PHARMACEUTICAL COMPOUNDS AND COMPOSITIONS The invention also relates to pharmaceutical compositions intended for the prevention or treatment of a deficiency in the metabolism of lipids, or of a dysfunction in the processes involving the immune system and inflammation.
Firstly, the subject of the invention is also a candidate substance or molecule modulating the activity of a regulatory nucleic acid according to the invention.
82 Most preferably, the invention also relates
U
Sto a candidate substance or molecule characterized in IND that it increases the activity of a regulatory nucleic acid according to the invention, and most particularly 5 of a regulatory nucleic acid comprising the sequence SEQ ID No. i, 2, 4 or Preferably, such a substance or molecule capable of modulating the activity of a regulatory nucleic acid according to the invention was selected according to one of the in vitro or in vivo screening methods defined above.
Thus, a subject impaired in the metabolism of lipids or in immunity signaling is treated by the administration to this subject of an effective quantity of a compound modulating the activity of a regulatory nucleic acid according to the invention.
Thus, a patient having a weak ABCA7 promoter activity may be treated with an abovementioned molecule or substance in order to increase the activity of the ABCA7 promoter.
Alternatively, a patient having an abnormally high ABCA7 promoter activity may be treated with a compound capable of reducing or blocking the activity of the ABCA7 promoter.
Thus, the present invention also relates to a substance or molecule used as active ingredient of a medicament.
83 Such a compound may be a compound which
U
modulates the interaction of at least one transcription O factor with the ABCA7 promoter or a regulatory element of a regulatory nucleic acid according to the invention.
S 5 For example, the compound may inhibit the interaction of one of the transcription factors listed in Table 1 with a regulatory nucleic acid according to the invention.
The compound may also be a compound which modulates the activity of a transcription factor which binds to the ABCA7 promoter or a regulatory element present on the latter.
A compound of therapeutic interest according to the invention may also be a compound which modulates the interaction of a first transcription factor with a second transcription factor.
As detailed in the analysis of the various transcription factors capable of binding to the sequence SEQ ID No. i, 2, 4 or 5, some transcription factors are active only if they are combined with another transcription factor.
A compound of therapeutic interest according to the invention is preferably chosen from nucleic acids, peptides and small molecules.
For example, such a compound may be an antisense nucleic acid which specifically binds to one region of the ABCA7 promoter or to a regulatory element 84 of a nucleic acid for regulating ABCA7 and inhibiting
U
Sor reducing the activity of the promoter.
This compound of therapeutic interest may also be an antisense nucleic acid which interacts k 5 specifically with a gene encoding a transcription factor modulating the activity of the ABCA7 promoter, in a manner such that the interaction of the antisense nucleic acid with the gene encoding the transcription factor binding to the ABCA7 promoter reduces the production of this transcription factor, resulting in an increase or a decrease in the activity of the ABCA7 promoter, depending on whether the transcription factor increases or on the contrary reduces the activity of the ABCA7 promoter.
The toxicity and the therapeutic efficacy of the therapeutic compounds according to the invention may be determined according to standard pharmaceutical protocols in cells in culture or in experimental animals, for example in order to determine the lethal dose LD 5 0 (that is to say the dose which is lethal for of the population tested) as well as the effective dose ED50 (that is to say the dose which is therapeutically effective in 50% of the population tested) For all the compounds of therapeutic interest according to the invention, the therapeutically effective dose may be initially estimated from tests carried out in cell cultures in vitro.
The subject of the invention is also pharmaceutical compositions comprising a therapeutically IND effective quantity of a substance or molecule of therapeutic interest according to the invention.
The pharmaceutical compositions according to the invention are more particularly intended for the treatment and/or prevention of deficiencies in the metabolism of lipids, or in the mechanisms involving the immune system and inflammation.
Such pharmaceutical compositions may be formulated in a conventional manner using one or more physiologically acceptable vectors or excipients.
Thus, the compounds of therapeutic interest according to the invention, as well as their physiologically acceptable salts and solvates, may be formulated for administration by injection, inhalation or by oral, buccal, parenteral or rectal administration.
Techniques for the preparation of pharmaceutical compositions according to the invention can be easily found by persons skilled in the art, for example in the manual Remmington's Pharmaceutical Sciences, Mead Publishing Co., Easton, PA, United States.
For a systemic administration, injection will be preferred, including intramuscular, intravenous, intraperitoneal and subcutaneous injections. In this case, the pharmaceutical compositions according to the invention may be formulated in the form of liquid 86 solutions, preferably in physiologically compatible Ssolutions or buffers.
ID
^-i 0q METHOD FOR THE DETECTION OF AN IMPAIRMENT IN THE
U
STRANSCRIPTION OF THE HUMAN ABCA7 GENE IND The subject of the invention is in addition methods for determining if a subject is at risk of developing a pathology linked to a deficiency in the metabolism of lipids, or in the processes involving the C immune system and inflammation.
Such methods comprise the detection, in cells of a biological sample obtained from the subject to be tested, of the presence or of the absence of a genetic impairment characterized by impairment of the expression of a gene whose expression is regulated by the ABCA7 promoter.
By way of illustration, such genetic impairments may be detected in order to determine the existence of a deletion of one or more nucleotides in the sequence of a nucleic acid for regulating ABCA7, of the addition of one or more nucleotides or of the substitution of one or more nucleotides in said sequence SEQ ID No. 1, 2, 3 or 6.
According to a specific embodiment of a method for the detection of an impairment of the transcription of the ABCA7 gene in a subject, the genetic impairment is identified according to a method comprising the sequencing of all or part of the sequence SEQ ID No. i, or alternatively of all or part of at least the sequence SEQ ID No. 2.
88 Sequencing primers may be constructed so as
O
Sto hybridize with a defined region of the sequence SEQ INO ID No. 1. Such sequencing primers are preferably constructed so as to amplify fragments of about 300 to 5 about 500 nucleotides of the sequence SEQ ID No. 1 or of a complementary sequence.
The fragments amplified, for example by the PCR method, are then sequenced and the sequence obtained is compared with the reference sequence SEQ ID No. 1 in order to determine if one or more deletions, additions or substitutions of nucleotides are found in the sequence amplified from the DNA contained in the biological sample obtained from the subject tested.
The invention therefore also relates to a method of detecting an impairment of the transcription of the ABCA7 gene in a subject, comprising the following steps: a) sequencing of a nucleic acid fragment amplifiable with the aid of at least one nucleotide primer hybridizing with the sequence SEQ ID No. 1 or SEQ ID No. 2, according to the invention; b) aligning the sequence obtained in a) with the sequence SEQ ID No. 1 or SEQ ID No. 2; c) determining the presence of one or more deletions, additions or substitutions of at least one nucleotide in the sequence of the nucleic acid fragment, relative to the reference sequence SEQ ID No. 1 or SEQ ID No. 2.
89 The invention also relates to a kit or box for the detection of an impairment of the transcription D of the ABCA7 gene in a subject, comprising one or more sequencing primers capable of hybridizing with a region 5 of the sequence SEQ ID No: i, and thus of allowing the sequencing of a polynucleotide located upstream of the site of initiation of the transcription of the ABCA7 gene in the subject to be tested.
In addition, also forming part of the invention are oligonucleotide probes hybridizing with a region of the sequence SEQ ID No. 1 or of the sequence SEQ ID No. 2 in which an impairment in the sequence has been determined during the implementation of the method of detection described above.
Alternatively, also forming part of the invention are oligonucleotide probes hybridizing specifically with a corresponding region of the sequence SEQ ID No. 1 or of the sequence SEQ ID No. 2 for which one or more deletions, additions or substitutions of at least one nucleotide has been determined in a subject.
Such oligonucleotide probes constitute means of detecting impairments in the sequence for regulating the ABCA7 gene and therefore also means for detecting a predisposition to the development of a pathology linked to a deficiency in the metabolism of lipids or to dysfunction in the processes involving the immune system and inflammation.
The subject of the invention is therefore
U
Salso a kit or box for the detection of an impairment of O the transcription of the ABCA7 gene in a subject, comprising: O 5 a) one or more primers hybridizing with a region of the sequence SEQ ID No. 1 or of the sequence C SEQ ID No. 2; b) where appropriate, the means necessary for carrying out an amplification reaction.
The subject of the invention is also a kit or box for the detection of an impairment of the transcription of the ABCA7 gene in a subject, comprising: a) one or more oligonucleotide probes as defined above; b) where appropriate, the reagents necessary for carrying out a hybridization reaction.
The subject of the invention is also a kit or box for the detection of an impairment of the transcription of the ABCA7 gene in a subject, comprising one or more probes hybridizing with a region of the sequence SEQ ID No.: 1 or of the sequence SEQ ID No.: 2 which make it possible to quantify the messenger RNA for ABCA7 in a biological material obtained from the said subject to be tested.
The nucleic acid fragments derived from any one of the nucleotide sequences SEQ ID No. 1-6 are therefore useful for the detection of the presence of at least one copy of a nucleotide sequence for regulating 91 the ABCA7 gene or a fragment or a variant (containing a
O
Smutation or a polymorphism) thereof in a sample.
O The nucleotide probes or primers according to the invention comprise at least 8 consecutive k 5 nucleotides of a nucleic acid chosen from the group consisting of the sequences SEQ ID NO 1-5, or of a nucleic acid having a complementary sequence.
Preferably, nucleotide probes or primers according to the invention will have a length of 12, 15, 18 or 20 to 25, 35, 40, 50, 70, 80, 100, 200, 500, 1000, 1500 consecutive nucleotides of a nucleic acid according to the invention, in particular a nucleic acid having a nucleotide sequence chosen from the sequences SEQ ID NO. Alternatively, a nucleotide probe or primer according to the invention will consist of and/or comprise the fragments having a length of 12, 15, 18, 25, 35, 40, 50, 100, 200, 500, 1000, 1500 consecutive nucleotides of a nucleic acid according to the invention, more particularly of a nucleic acid chosen from the sequences SEQ ID No. 1-5, or of a nucleic acid having a complementary sequence.
The definition of a nucleotide probe or primer according to the invention therefore covers oligonucleotides which hybridize, under the high stringency hybridization conditions defined above, with a nucleic acid chosen from the sequences SEQ ID NO 6 or 8 or with a sequence complementary thereto.
Examples of primers and pairs of primers Swhich make it possible to amplify various regions of the ABCA7 gene are represented below.
This includes for example the pair of primers represented by the primer having the sequence SEQ ID 0 No. 9: AGCCAGCAACGCAATCCTCC and the primer having the sequence SEQ ID No. 10: CGCACCATGTCAATGAGCCC.
SA nucleotide primer or probe according to the
(N
invention may be prepared by any suitable method well known to persons skilled in the art, including by cloning and action of restriction enzymes or by direct chemical synthesis according to techniques such as the phosphodiester method by Narang et al., (Methods Enzymol (1979) 68:90-98) or by Brown et al. (Methods Enzymol (1979) 68:109-151), the diethylphosphoramidite method by Beaucage et al. (Tetrahedron Lett (1980) 22: 1859-1862) or the technique on a solid support described in patent EP 0,707,592.
Each of the nucleic acids according to the invention, including the oligonucleotide probes and primers described above, may be labeled, if desired, by incorporating a marker which can be detected by spectroscopic, photochemical, biochemical, immunochemical or chemical means.
For example, such markers may consist of radioactive isotopes (32P, 33P, 3H, 35S), fluorescent molecules (5-bromodeoxyuridine, fluorescein, acetylaminofluorene, digoxigenin) or ligands such as biotin.
j The labeling of the probes is preferably
U
carried out by incorporating labeled molecules into the IN polynucleotides by primer extension, or alternatively by addition to the 5' or 3' ends or by "nick translation".
0 5 Examples of nonradioactive labeling of 0 nucleic acid fragments are described in particular in French patent No. 78 109 75 or in the articles by SUrdea et al. (Nucleic Acid Res (1988) 11: 4937-4957) or Sanchez-pescador et al. Clin Mircrobiol (1988) 26(10) 1934-1938).
Advantageously, the probes according to the invention may have structural characteristics of the type to allow amplification of the signal, such as the probes described by Urdea et al.(Mol. Cell. Biol., (1991) 6:716-718), or alternatively in European patent No. EP-0,225,807 (CHIRON).
The oligonucleotide probes according to the invention may be used in particular in Southern-type hybridizations with genomic DNA or alternatively Northern-type hybridizations with RNA.
The probes according to the invention may also be used for the detection of products of PCR amplification or alternatively for the detection of mismatches.
Nucleotide probes or primers according to the invention may be immobilized on a solid support. Such solid supports are well known to persons skilled in the art and comprise surfaces of wells of microtiter 94 plates, polystyrene beds, magnetic beds, nitrocellulose bands or microparticles such as latex particles.
O Consequently, the present invention also relates to a method of detecting the presence of a O 5 nucleic acid as described above in a sample, said method comprising the steps of: S1) bringing one or more nucleotide probes according to the invention into contact with the sample to be tested; 2) detecting the complex which may have formed between the probe(s) and the nucleic acid present in the sample.
According to a specific embodiment of the method of detection according to the invention, the oligonucleotide probe(s) are immobilized on a support.
According to another aspect, the oligonucleotide probes comprise a detectable marker.
The invention relates, in addition, to a box or kit for detecting the presence of a nucleic acid according to the invention in a sample, said box comprising: a) one or more nucleotide probes as described above; b) where appropriate, the reagents necessary for the hybridization reaction.
According to a first aspect, the detection box or kit is characterized in that the probe(s) are immobilized on a support.
According to a second aspect, the detection Sbox or kit is characterized in that the oligonucleotide ND probes comprise a detectable marker.
According to a specific embodiment of the 5 detection kit described above, such a kit will comprise C a plurality of oligonucleotide probes in accordance with the invention which may be used to detect target sequences of interest or alternatively to detect mutations in the coding regions or the noncoding regions of the nucleic acids according to the invention, more particularly of the nucleic acids having the sequences SEQ ID NO 1-5, 6 and 8 or the nucleic acids having a complementary sequence.
Thus, the probes according to the invention, immobilized on a support, may be ordered into matrices such as "DNA chips". Such ordered matrices have in particular been described in US patent No. 5,143,854, in PCT applications No. WO 90/150 70 and 92/10092.
Support matrices on which oligonucleotide probes have been immobilized at a high density are for example described in US patents No. 5,412,087 and in PCT application No. WO 95/11995.
The nucleotide primers according to the invention may be used to amplify any one of the nucleic acids according to the invention, and more particularly all or part of a nucleic acid having the sequences SEQ ID NO 1-5, or alternatively a variant thereof.
Another subject of the invention relates to a method of amplifying a nucleic acid according to the IN invention, and more particularly a nucleic acid having the sequences SEQ ID NO 1-5 or a fragment or a variant thereof contained in a sample, said method comprising C the steps consisting in: a) bringing the sample in which the presence Sof the target nucleic acid is suspected into contact with a pair of nucleotide primers whose hybridization position is located respectively on the 5' side and on the 3' side of the region of the target nucleic acid whose amplification is sought, in the presence of the reagents necessary for the amplification reaction; and b) detecting the amplified nucleic acids.
To carry out the amplification method as defined above, use will be advantageously made of any one of the nucleotide primers described above.
The subject of the invention is, in addition, a box or kit for amplifying a nucleic acid according to the invention, and more particularly all or part of a nucleic acid having the sequences SEQ ID NO 1-5, said box or kit comprising: a) a pair of nucleotide primers in accordance with the invention, whose hybridization position is located respectively on the 5' side and on the 3' side of the target nucleic acid whose amplification is sought; b) where appropriate, the reagents necessary
U
Sfor the amplification reaction.
IND Such an amplification box or kit will advantageously comprise at least one pair of nucleotide 5 primers as described above.
The invention is in addition illustrated, (Ni without however being limited, by the figures and examples below.
Figure 1 is a schematic representation of the sites for transcription factors found in humans and mice in the promoter region of the ABCA7 genes.
Figure 2 illustrates the sequence SEQ ID No. 1 and the position of each of the characteristic units for binding to various transcription factors is represented in bold characters, the designation of the transcription factor specific for the corresponding sequence being indicated above the nucleotide sequence.
Figure 3 illustrates the sequence SEQ ID No. 4 and the position of each of the characteristic units for binding to various transcription factors is represented in bold characters, the designation of the transcription factor specific for the corresponding sequence being indicated above the nucleotide sequence.
Figure 4 illustrates the pattern of expression of the human ABCA7 gene on Northern blots of various adult and fetal tissues (Clontech) hybridized with an amplimer produced with the primers SEQ ID No. 9 and 10 (Table 4).
98 Figure 5 illustrates the pattern of expression of the murine ABCA7 gene on a Northern blot of various adult tissues hybridized with an amplimer produced with primers specific for the murine k\ 5 transcript.
Figure 6 represents the profile of expression of the gene encoding the ABCA7 protein on a cross Ssection of an inflamed artery, by in situ hybridization
(N
with an ABCA7 antisense probe.
Figure 7 represents the profile of expression of the gene encoding the ABCA7 protein on a cross section of bronchial tubes of an asthma patient, by in situ hybridization with an ABCA7 antisense probe.
Figure 8 represents the profile of expression of the gene encoding the ABCA7 protein on a cross section of colon of a patient suffering from Crohn's disease, by in situ hybridization with an ABCA7 antisense probe.
Figure 9 represents the profile of expression of the gene encoding the ABCA7 protein on a cross section of lymph node, by in situ hybridization with an ABCA7 antisense probe.
Figure 10 represents the profile of expression of the gene encoding the ABCA7 protein on a cross section of synovia of a patient suffering from rheumatoid arthritis, by in situ hybridization with an ABCA7 antisense probe.
Figure 11 represents the profile of Sexpression of the gene encoding the ABCA7 protein on a cross section of skin of a patient suffering from psoriasis, by in situ hybridization with an ABCA7 I 5 antisense probe.
C EXAMPLES: EXAMPLE 1: Determination of the 5' end of the cDNA for ABCA7 Amplification of the end of the mRNA by RT-PCR (RACE) was carried out using the SMART RACE cDNA amplification kit (Clontech, Palo Alto, CA). (PolyA) mRNAs extracted from human spleen tissues were used as template in order to produce a SMART 5' cDNA library according to the manufacturer's instructions. The first amplification primers and the internal primers were chosen from the cDNA sequence. The amplifications carried out with the internal primers for PCR amplification were cloned. Specific clones were then amplified using primers whose sequences are respectively (CAGGAAACAGCTATGAC) and (GCCAGTGTGATGGATAT) and sequenced on the two strands. Finally, the primers ABCA7 L1 GCGGAAAGCAGGTGTTGTTCAC (SEQ ID No. 11) and ABCA7L2 CGATGGCAGTGGCTTGTTTGG (SEQ ID No. 12) were used to identify the end of the human ABCA7 cDNA.
100 EXAMPLE 2: Analysis of the promoter of the human and
U
Smurine ABCA7 genes The site of initiation of transcription was located on the promoters of the human and murine genes 5 for ABCA7 using the following three software packages: C TSSG and TSSW (Solovyev et al., Ismb (1997) 5, 294-302) and NNPP (Reese MG, et al., 1999). A prediction of the Sbinding sites for the human and murine transcription factors was made using the MatInspector program for searching for motifs (Quandt et al., Nucl. Acid Research (1995) 23(23) 4878-84). The calculation of the scores for each binding site for the transcription factors is made using the following formula: (Of-Tf)/(Tf) in which Of is the frequency of observation of a motif and Tf is the calculated frequency of a consensus motif. In order to separate the motifs which are not considered to be relevant, a first filtration step was performed by adjusting the MatInspector program "template similarity" score above 0.85 and the "core similarity" score above 0.99.
Finally, a comparative analysis of the inter-species promoters was made as described by Werner T (Models for prediction and recognition of eukaryotic promoters, Mammalian Genome (1999) 10: 168-175) in order to define the transcription modules comprising sites having a similar motif and present both on the human and murine sequences of the sequence upstream of the ABCA7 gene.
101 EXAMPLE 3: Preferential expression of human and murine
U
SABCA7 genes in hematopoietic tissues O The profile of expression of the polynucleotides according to the present invention is O 5 determined according to the protocols for PCR-coupled 0 reverse transcription and Northern blot analysis c described in particular by Sambrook et al. (ref. CSH SSambrook, Fritsch, and Maniatis, T. (1989).
(N
"Molecular Cloning: A Laboratory Manual," 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
NY.).
For example, in the case of an analysis by reverse transcription, a pair of primers synthesized from each of the complete cDNAs of the human and murine ABCA7 genes in order to detect the corresponding cDNAs.
The sequences of these primers are presented in Table 4.
The polymerase chain reaction (PCR) is carried out on CDNA templates corresponding to retrotranscribed polyA mRNAs. The reverse transcription to cDNA is carried out with the enzyme SUPERSCRIPT II (GibcoBRL, Life Technologies) according to the conditions described by the manufacturer.
The polymerase chain reaction is carried out according to standard conditions, in 20 Ip of reaction mixture with 25 ng of cDNA preparation. The reaction mixture is composed of 400 M of each of the dNTPs, 2 units of Thermus aquaticus (Taq) DNA polymerase (Ampli Taq Gold; Perkin Elmer), 0.5 uM of each primer, 1 102 mM MgC12, and PCR buffer. Thirty PCR cycles (denaturing 30 s at 940C, annealing of 30 s divided up INO as follows during the 30 cycles: 640C 2 cycles, 610C 2 cycles, 58°C 2 cycles and 550C 28 cycles and an 5 extension of one minute per kilobase at 720C) are carried out after a first step of denaturing at 94°C for 10 min in a Perkin Elmer 9700 thermocycler. The PCR Sreactions are visualized on agarose gel by
(N
electrophoresis. The cDNA fragments obtained may be used as probes for a Northern blot analysis and may also be used for the exact determination of the polynucleotide sequence.
In the case of a Northern Blot analysis, a cDNA probe produced as described above is labeled with 32 P by means of the DNA labeling system High Prime (Boehringer) according to the instructions indicated by the manufacturer. After labeling, the probe is purified on a Sephadex G50 microcolumn (Pharmacia) according to the instructions indicated by the manufacturer. The labeled and purified probe is then used for the detection of the expression of the mRNAs in various tissues.
The Northern blot containing samples of RNA of various human tissues (Multiple Tissue Northern or MTN; references (Human II, 7759-1, Human 7760-1, and Human Fetal II 7756-1, Clontech) is hybridized with the designated specific labeled probe for ABCA7 (2637- 4881 bp).
103 The protocol followed for the hybridizations and
O
Swashes may be either directly that described by the IO manufacturer (Instruction manual PT1200-1) or an adaptation of this protocol using methods known to persons skilled in the art and described for example in F. Ausubel et al. (Currents Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience NY (1989). It is thus possible to vary,
(N
for example, the prehybridization and hybridization temperatures in the presence of formamide.
For example, it may be possible to use the following protocol: 1- Membrane competition and PREHYBRIDIZATION: Mix: 40 ul salmon sperm DNA (10 mg/ml) 40 ul human placental DNA (10 mg/ml) Denature for 5 min at 960C, then immerse the mixture in ice.
Remove the 2X SSC and pour 4 ml of formamide mix in the hybridization tube containing the membranes.
Add the mixture of the two denatured DNAs.
Incubation at 42 0 C for 5 to 6 hours, with rotation.
2- Labeled probe competition: 104 Add to the labeled and purified probe 10 to 50 ul Cot I DNA, depending on the quantity of repeat sequences.
Denature for 7 to 10 min at 95 0
C.
Incubate at 65 0 C for 2 to 5 hours.
3- HYBRIDIZATION: Remove the prehybridization mix.
Mix 40 ul salmon sperm DNA 40 pl human placental DNA; denature for 5 min at 96 0 C, then immerse in ice.
Add to the hybridization tube 4 ml of formamide mix, the mixture of the two DNAs and the denatured labeled probe/Cot I DNA.
Incubate 15 to 20 hours at 42 0 C, with rotation.
4- Washes: One wash at room temperature in 2X SSC, to rinse.
Twice 5 minutes at room temperature 2X SSC and 0.1%
SDS.
105 S- Twice 15 minutes 0.1X SSC and 0.1% SDS at 65 0
C.
NO
After hybridization and washing, the blot is 5 analyzed after overnight exposure in contact with a phosphorus screen revealed with the aid of Storm c (Molecular Dynamics, Sunnyvale, CA).
SThe results presented in Figure 5 show that
(N
the mouse ABCA7 gene is expressed in the adult tissues.
A larger quantity of murine ABCA7 mRNA is detected in the hematopoietic tissues such as the spleen and thymus, which is coherent with the expression of ABCA7 which was observed in the myelomonocytic and lymphocytic lines. No expression of the ABCA7 gene was detected in the fibroblastic cell lines.
Figure 4 shows a similar pattern of expression of the human ABCA7 gene with however a strong hybridization signal in the fetal liver.
EXAMPLE 4: Analysis of the gene expression profile for dysfunctions in the metabolism of lipids, or in inflammation signaling The verification of the impairment of the level of expression of the ABCA7 gene may be determined by hybridization of these sequences with probes corresponding to the mRNAs obtained from hematopoietic tissues from subjects who are affected or otherwise, according to the methods described below: 106 S1. Preparation of the total RNAs, of the poly(A) mRNAs NO and of cDNA probes The total RNAs are obtained from 5 hematopoietic tissues from normal or highly affected 0 subjects by the guanidine isothiocyanate method S(Chomczynski et al., Anal Biochem (1987) 162:156-159).
SThe poly(A) mRNAs are obtained by affinity
(N
chromatography on oligo(dT)-cellulose columns (Sambrook et al., (1989) Molecular cloning: a laboratory manual.
2ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, New York) and the cDNAs used as probes are obtained by RT-PCR (DeRisi et al., Science (1997) 278:680-686) with oligonucleotides labeled with a fluorescent product (Amersham Pharmacia Biotech; CyDye
TM
2. Hybridation and detection of the expression levels The glass slides containing the sequences according to the present invention corresponding to the ABCA7 gene are hydridized with the nucleotide probes prepared from the messenger RNA of the cell to be analyzed. The use of the Amersham/molecular Dynamics system (Avalanche Microscanner
TM
allows the differential quantification of the expressions of the products of sequences on healthy or affected cell type.
1 107 EXAMPLE 5: Test intended for the screening of molecules Sactivating or inhibiting the expression of the ABCA7 gene The screening test makes it possible to 5 identify a sequence capable of modulating the activity 0 of synthesis of the ABCA7 protein.
S5.1 Construction of the expression plasmids containing a nucleic acid for regulating the human ABCA7 gene The region of the acid for regulating the human ABCA7 gene ranging from the nucleotide at position -1111 up to the nucleotide at position -1, relative to the site of initiation of transcription, may be amplified by the PCR technique with the aid of the pair of primers specific for the region described above from human genomic DNA present in a BAC vector of a human BAC vector collection.
The amplified DNA fragment is digested with restriction endonuclease Sall, then inserted into the vector PXPl described by Nordeen et al. (Bio Techniques, (1988) 6:454-457), at the level of the Sal 1 restriction site of this vector. The insert was then sequenced.
5.2 Cell culture and transfection Cells of the CHO or HELA line (ATCC, Rockville, MD, USA) were cultured in the E-MEM (Minimum Essential Medium with Earle's Salts) medium 108 supplemented with 10% fetal calf serum S(BioWhittaker, Walkersville, MD). Approximately INO 1.5 X 10 5 cells were distributed into each of the wells of a 12-well culture plate (2.5 cm), and were cultured 5 up to about 50-70% confluence, and then cotransformed 0 with 1 ig of plasmid Sal-Lucif and 0.5 pg of the 'control vector pBetagal (CloneTech Laboratories Inc., SPalo Alto, CA, USA) using the Superfectin Reagent Kit
(N
(QIAGEN Inc., Valencia, CA, USA). Two hours after the addition of the DNA, the culture medium is removed and replaced with complete AMEM (Minimum Essential Medium Eagle's Alpha Modification) medium. After a period of twenty hours, the cells are placed in fresh medium of the DMEM (Dulbecco's Minimum Essential Medium) type supplemented with 2 pg/ml of glutamin, 100 units/ml of streptomycin and 0.1% of bovine serumalbumin (BSA, Fraction in the presence or otherwise of molecules at various concentrations.
The cells are recovered 16 hours after the last change of medium using a Lysis Solution obtained from the Tropix Luciferase Assay Kit (Tropix Inc., Bedford, MA, USA). The cellular lysate is divided into aliquot fractions which are used to quantify the proteins using the MicroBCA Kit (Pierce, Rockford, IL, USA) as well as to quantify the production of luciferase and beta-galactosidase using the Tropix Luciferase Assay Kit and Galacto-Light Plus Kit, respectively. The tests are carried out according to 109 the manufacturer's recommendations. The molecules
O
Sactive on the ABCA7 promoter are then selected according to the ratio "luciferase activity/betagalactosidase activity".
O 0 EXAMPLE 6: In situ hybridization experiment c Relevant tissue samples in paraffin were Shybridized with radioactive labelled complementary RNA probes. More precisely, a fragment of the ABCA7 gene corresponding to the nucleotide sequence ranging from nucleotide 594 to nucleotide 1055 of the GenBank sequence designated NM-019112 was subcloned into the plasmid pCRII (Invitrogen).
Antisense RNA probes labelled with 35 S Uridine triphosphate were then generated with the RNA polymerases SP6 and T7, and then hybridized with the various tissue sections.
The various tissue sections were digested with proteinase K and hybridized with the probes described above at a concentration equal to about x 10 7 dpm/ml for 18 hours at 650C. The slides were then treated with RNAase A and washed in 0.1X SSC at 700C for 2 hours, and were coated with a Kodak NTB-2 photographic emulsion, exposed for 7 days at 40C, and then visualized using a Kodak D-19 solution.
They were finally stained with haematoxylin and eosin and the images were made using a DVC digital photo camera coupled to a Nikon microscope.
110 Figure 6 is a section of the artery of a 92year-old male who has undergone an amputation below the IO knee, and who has arteriosclerosis and acute inflammation. A weak specific labelling of the \O 5 macrophages in the thrombi and at the site of inflammatory infiltration into the tunica adventitia.
Figure 7, which is a section of bronchi collected during autopsy on a 63-year-old asthmatic female, shows a weak labelling of the lymphocytes and macrophages in the submucous inflammatory infiltrate.
Figure 8 is a section of colon collected during operation on an 81-year-old female exhibiting a clinical diagnosis of Crohn's disease. Labelling of the macrophages and of a subfamily of lymphocytes in the lamina propria is observed.
Figure 9 corresponds to a section of a lymph node collected during operation on a 48-year-old male.
In the reactive germinative centre, the ganglionic cells are weakly labelled, and isolated macrophages are also labelled in the lymph node.
Figure 10, which represents a section of the synovial membrane of a 25-year-old female exhibiting a clinical diagnosis of rheumatoid arthritis, shows a strong labelling of the subsynovial histiocytes and of the macrophages.
Figure 11, which represents a section of skin obtained following biopsy on a 55-year-old female suffering from psoriasis, shows moderate labelling of p 111 the macrophages in the perivascular inflammatory
U
infiltrate. Perivascular isolated lymphocytes are also labelled.
OD
2007240161 06 Dec 2007 112 Table 1: Sites, scores, consensus and positions relative to the site of initiation of transcription (TSS) predicted by the NNPP, TSSG and TSSW software packages in humans Filtatin Sie Cnsenus equece. score IPositioiVTSS (Uip) Core similarit Template similarity GF11 l NNNNNNAAMrCANNIGNNNN gc(-;ccatAArCgga~y 1.6602 .0 .88 IINF3B_01 144NFIRTTIYTY qaaTGlTgyccc. 3.978804 -547 0.09 Comparative CEP-G RRTK.WGMAAYWN tgiqfGA~. .857c139 analyisbeel N15 NflGf'frJ4.,JCJNJ TGcrcc~q 1 651312 1 -402 1.00 0.86 AN Q6 CWCAGcr'GN tfjCPAG(Cqg 1. 3.I13 -340 10Q OF.
NFAPA~t GGGAM 0YC (Gc.qu285691 -201.00 0.0 NFYQ6 TRRCCAfiTSRN cOA62034*0 O 0.89 AHRARNT_01 KNNKN 1TYGCGTGCMAS c )aoaU%)CGTGc I 104021 ,5 1.030 087 10 RIIDW IA r Y GA SSS gATCaiij ?2.A74 t20 _106'1 1.00) 0.a/ scaje Z >~1.96 IK1 OI r-Jt4TGcGAA-rRCC c aacGGGAaaaf 1-1 '184154 -9Z 1.3Q NFY_06 I Rf~cCA/%TSRN acaT;gj 6203 9h1.00 0.87 LYF1O 01G&AG PLGGGAq~y I..20 197 1.00 0.91 BARBIF 01 ATNNAAAGCNG-RNGG aqraAAA Gcdyaaqt; 32.3396 -363 1.00 0.91 E470D2 WNNIRCAGGIOT1)MINN 0g*aA~y~c 5340.6 .0(.88 MYOD 0i1 SRACAGGTKY cc; .CAGG~gat 32.90,982 -949 1.00 u W LMO2COMCli S((NCAGGV(GNwTl cr CA3Gtggi 2.2i32388 -9-19 1.00 0.95 rH1E47_01 NNNNGN~RrCTGGM\NrrF ;h39(cJa3tCTGG&3(y§j 1t6752005 0.9 I GF11 NNNNNAAATCANNGNNN ggtgqa gaa [1GATT lgaaqg 1692 93 .009 MN t4Ng j.
NRF2 01 ACCGGAAGI'S J4CT roCt 9 0 6. 109080 -86.3 1.1( 0(yi l1GGKAM flC6I 9oyo.ayITCCCI. 1230 -186510( 083 CE-i SGRN. ~C aqjciTTCC ).943267 -Bf80( 1.0(0 0.9( NFKAPPABO 01 GGAMTTYCC jpa 9 *r cc.C c92856911 -859 1.09 0.91 IKI .01 WNTOrGGAAMiCC 1:97Cci (1.45418(.0.0 STAT-Oi T'TCCCRKAA 11 rCCutcaa .219 841.00 0.91 RARBIE-01 1\JNNAA68GCNGR14C0cGGc91~(~ 3.t36 8.38 1.00 0.90 US 0 YCCGCNG1r(CG1*GEc 5.300268 .60G2 1.00 0.9.1 AP2_06 MKCCCSCP*GOCO cCChrGc .613 -777 1.00 08 VMYBO01 AAYAACOGN(O ccr;CGT ryqjr. 4.360548 -61(0 (.8 2007240161 06 Dec 2007 113 Filtration Site Consensus Sequence Z score Position! Core Template TSS (bp) similarity similarity TATAOI STATAAAWRNNNNI'iN L aacluctalTI*Atcc 7.166360 -765 1.00 0.817 GATAC NGATAAGNMNN cclatTTATCc 2.004465 -761 1.00 0.93 GATAI_03 NNNNNGATAANNGN ctattTATCctcaa 2.776154 -760 1.00 0.94 VMYB-01 AAYAACGG7NN ccrAACGgca 4.360548 -743 1.00 0.91 AP2 Q6 M;KCCCSCNGGCG ct9gcciCGGGag 7.064136 -723 1.00 0.87 AHRART01 KNNKNNTYGCGTGCMS cccCACGcctclcact 10.450429- -7607- 1.00 0.85 BARBIE_01 ArTNNAAAGCNc3RNGG ctIcAA.AGctgtgga 32.363969 -680 1.00 0.8 AP4_08 CWCAGCTGGN caaaGCTGtg 12.133646 -677 1.00 0.67 AHRARNT-01 KNNI(NNTYG:CGTGCMS cZraC.ACGrdccaTtL( 10.450429 -6641 1.00 0.87 IIFH1-01 MAWTGTTTATWT aaqaGTTTaLU 5 1 812065 -629 1.00o 0.88 IKi 01 NNNTGGG.-AATRCC 9gaglGGGAaacgg 14.4 84 15- -603 1.00 0.89 VMYB_01 AAYAACGGNW g~aAACGggt 4.360548 -598 1.00 0.89 CREL-01 SGGRNWVTTCC c9ygIITTCC 6.194143 -131.00 0I..9 01 GGGRATrTTCC cgcjgttTTCC 28.315415 -593 1.00 0.95 GFII1 O- -NNNNNKAAATCANNGNNNNI ttCdCtaaAATCagggtacjccaCt 3.669729 -587 1.00 0.9.5 ~~NNNN STAT 011 TTCCCRKAA TTCci(aa F.847-8 .006 s-rAY 01 TTCCCRKAA tc9jGGAA 6.281497 -406 100 0.92 BARBIEMM01 ATNNAAAGCNGRNGG acccctaC-TTTacag 32.3635%9 -459 1.006.6 THI E47-01 NNNNGNRTCTGGMWTT agtcC(CAGaglctgga 1 6.6-052 1 -432 1.00 0.88 TM1E47-01 -NNN14GNRTCTGGMWTT cccayagtCTGGacta 16.677521 -429 1.00 0.90 AN4 Q6 CWCAGCTGGN gaCAGC.-gogg 12.133646 -385 1.00 01.0 IKI-01 NNNTGGGAATRCC c~agaGGG-Aa1cIcc 14.484154 -374 1.0 0.90 CHOP-01 NNRTGCAATMC IccTGCAattcgg 2.836-364 1.00 0.8 AP4 Q6 CWCAGCTGGN cyrjaGCTGcg 24.429042 35 4 1.00 0.88 AP4 M0(5 NNACGNcggaGCT~cg 3.791000 -354 1.00 0.92 CHOP_01 N14RTGCAATKICCC cgglatTGCAgcC 22.32838,- -3416100.3 I4LF-O RfK-Y-- GfTracaac 1.81423 -332 1.00 (1.45 0(1 01 NNNTGGGAATRCC clcgtTCCCggag 14A84154 -285 1.00 0088 SPI 06 NGGGGGCGGGGYN goagGGCGoccIg li.11914.i -276 1.00 0.6 NFKB 06 NGGGGAMiTTTCCNN cIGGGAtuctgccgg 26.126380 -262 1.00 0.817 AN4 0 6 CVVCAGCTGG14 cgCAGCtccg 24.429942 -146 1.00 0.8 AP4 OS NU-JCAGCTGNN ccjCAGCtccg 3.791000 -146 1.00 0.92 ATF-01 CNSTGACGINNNYVC gagI'GACgggcagg 8.675151 -121 1.00 0.96 APIFJ-02 RSTGACTNMNW a9TGACggca 5.905604 -120 1.00 0.91 2007240161 06 Dec 2007 114 Filtration Site Consensus Sequence Z score Position! Core 1 Template jTSS (bp) similarity Jsimilarity APi Q2 RSTGACTNMNW aigTGACqgqco 5.905fi04 -A20 .0 0.89 CAAT-01 NNNRRCCI\ATSA gt(cycCCAA~afy 4.415584 -108 1.00 0.86 AHRARNT 01 KtINK100 YGCGI GCMS cajtaycagCGTGcag 10.4504129 -102 1.00( 0.0 AH-RARNT_01 K14NKNN1TYGCGTGCMS aggcaggggCGTGccC 10.4504129 -88 1.00 0.92 GC_01 N GGGGCGGGGCNK aaggGGCGgcgcgc 15.033816 -28 1.00 0.92 SP1-.Q6 NGGGGGCGGGGYN aaggGGCGgcgcg 11.1191441 -28 1.00 0.03 AP4-Q6 CVICAGCTGGN gcctGCTGci 12.133646 -9 1.00 0.8 SPi-Qe N'GGGGGCGGGGYN gclg G-GGgaggg 11.119144 1 -2 1.00 0.90 GC-01 NRGGGGCGGGGCNK gctgGGCGgaggua 1 5.933816 -2 1.00 0.87 IKi 01 NNNTGGGAAYTRC cggaGG'GAaggrcg 14484154 4 1.00 0.87 AP4 06 CWCAGCTG'GN aagaGCTGcg( 12A133646 19 1.00 0.89 IKI_01 NNNTGGGAATRCC gacjaGGGAa~aag 14.484 15-1 56 1.00 0.87 IKiI0 NNNTGGGAATRCC caacjtTCCClggy 14.4 8415.; 110 1.0 0.87 8(1 01 NNNTGGGAATRCC Me GGat aq 4445 1 .009 TSTIDi___ NNKGAWTWANANTNN tgggAATTag~gggl 6.882911 119 1.00 (0.87 NKX25-02 CWTAAT-rG OaATTAgg 5.675005 122 1.00 0.91 API_02 i RSTGACTNMNW tcTGACcc 5.905504 140 1 1.00 0.06 APIFJO2. RfSTG ACTNMNW tcTGACctcci 5.905504 1.10 1.00 0.40 RORAlI-O NWAWANI\GGTCAN c1GA(Clccttcc 15.361241 141 1.00 0.04 RORA2-01 NWAWNTAGGTCAN clGAC~ccItcc, 3'.905118 141 1.00 0.95 NRF201i ACCGGAAGNS fccTTCCggt 6.109180 147 1.00 0.96 ATF 01 CNSTGACGTNNNYC Ig1TGAC~jcgg1 8655 1.00 0.91 CR EB -44 NSITGACGTMANN g1TGACgacggc 5.543014 161 1.00 0.87 API-02 RSTGACTNM14W gtTfGACgacgg 5.90)550-1 161 1.00 0.119 APIFJ-Q2 RSTGACTNtANW gITGACgacgg 5.905504 161 1.00 0.00 GMI101 t1NrJNINMAATC;ANNGNNNN gaalfgatcac(GATcaaggg 3.669729 174 1.00 0.02 CfOPCR3H-1_ NATYGATSSS aa11GATCac 2.74120 7 .009 T~i-E470 NNNGNRTCTGGMWVTT 1cggacatCT1GGgacc 16.677521 203 1 1.00 0.89 TAL1ALPHAE47 NNNAACAGATGKTtNN tcggarcaTCTGgacr. 43.162108 203 1.00 0.80 rA1EAE70 NNNAACAGATGKTNNN tcgacaTC rGggacr 43.16210B 203 1.00 01.87 E47 02 NNNMvRCAGGTGTTfvNN 1caicacaCCTGcagcc 15.631450 239 1.00 0.90 E47 01 NSNGCAGGTGKNCNN (cacacacCTGCagc 6.708124 239 1.00 0.97 LM02COMO01 SNNCAGGTG14NN acacaCCTGcag 2.232288 2411 1.00 0.07 2007240161 06 Dec 2007 115 Filtration Site Consensus Sequence Z score Position!/ Core Template (bp) jsimilarity similarity MYOOl SRAGAGGTGKYG acacaCCTGca9 32.908282 241 t.00 0.87 VMYB-01 AAYAACGG:NN UccCG TTaga 4.360548 259 1.00 0.92 SRY-02 NVYWAACAAWANN IcMAkAatgy 8.473458 293 1.00 01.86 TH1E47-01 NNNNGNRTCTGGMIWTT tuccCCAGatcclaaq 16.677521 320 1.00 0.88a E4BP4_01 NRTTAYGTAAYN ClIlgtGTAAag 12.678534 341 -1.00 0.86 VBP-01 GTTACRTNAN ItgaIGTAAa 5.053244 3412 1.00 0.92 CREL-01 SGGRNWTTCC GGAAagaac 2.943267 352 1.00 0.85 VBP-01 GTTACRTNAI4 cIggcGTAgc .034 362 1.00 0.88 THIE47_01 NNNN'GNRTCTGGMiWT-T gtaaggg[CTG3GgtcI 16.677521 367 1.00) 0.92 APIFJO2 RSTGAC-TNNMNW Ugcct9GTCAcc 9.018855 3983 1.00 o.92 APNQ2 RSTG3ACTNNNW qc-ctgGTCAcc 9.018855 39B 1.00 0.89 ANi 04 RSTGACTMANN yrcIgGTCAtcC 13.1485B6 398 1.00 0.88) ER-Q 06 NARGNNAfANNTGACCYN~N ccyGTCAccIllagcaac 11 677290 399 1.00 0.8R ELMi 01 -NN14ACMGGAAGTNCNN agcaacT rCClgcccrc 1.5.1645251- 412 1.00 0.88 AP1FJ 02 RSTGACTNMNWVCI ct'C~g 90854610 0.03 API 02 RSTGACTNMNW c-c1CGTCA9c g9018855 426 1.00 09 APi 04 RSTGACTMAN-N c(,tctGTCA(gc 3.486426 .008 GF11 01 (NNNNNAAATCANN'GNNNN .N ct3cagc9IraGATTctccak1i 3.660729 429 1.00 0.86 AT 1 CNSTGACGTNNNY- IgcJIagcGTCAgal 8.675151 430 1.0009 CREB 04 NSTGACGTMANN 9tca~cGTCAga 11.262690 131 1.00 0'18$ CREB Q2 NSTGACGTAA14N gtcagcGI CAcya 17.782892j .1311.008 APi FJ 02 RSTGACTNMNW IcagcGTCAga 5.905504 432 1.00.90 API_02 RST-GACI-NMNW (cagcGTCAga 5.905504 4132 1.00 0.68 TrALl BETAE470 NNNA, AGAT(;KTNNN ItctcraTCTGtglca 64.766306 443 A.00 0.80 TALl 8ETAITF2 0 NN4NAACAGATGKI*NNN thWtcciTCTGI9Ica 64.766306 443 1.00 0117 TAL1ALPHAE47_ NNNAACAGATc3KI 1N llctccaTCTGtgtca 64.766306 443 '1.00 0.R9 APIFJ 02 RSTGACTNMNW tctgtGTCAga 9.018855 450 1.00 0.93 APi 04 RSTGACTMANN Iclg1GTCA-Oa 13.148586 450 1.00 0."8 AP 2RSTGACTNMNW IgIGTCAga 9.018B55 450 1.00 (.91 CDPCR3H-D 01 NATYGATSSS aataGAI'Caq 2.474120 480 1.00 0.95 GFIIO1 14NNNrINAAATCANI4GNNNN agatcaygAATCgctyaQdcrag 3.669729481.008 1 NNNN 2007240161 06 Dec 2007 116 Filtration Site Consensus Sequence Z score Position/ Core Template (bp) similarity similarity APi 02 RSTGACTNMNW qcTGACK~cag 9018B855 495 1.00 0.94 AP1IFJ 02 RSTGACI*NMWWV gcTGACICCag 9.018855 495 1.00 0.q94 AP1Q04 RSTGACTNIANN gclGACtccag 13.148586 495 1.00 0.91 GATAI_03 1NNNNGrATAANNGN gtctcTATCccagc 2.77G354 508 1.00 (0.89 APIFJ rQ2 -RSI*GACTNM3IW ccTGACIcIII 9.018855 529 1.00 0.92 APIQ2 RSTGACTNAMW cacTGACIdtt 9.018855 529 1.00 0.901 BARBIE_01 ATNNAAAGCNGRNGG cgctCTrTcT4ct 3.399529 1.00 0.06, APi C34 RSTGACTMANN ccTGA(-tc(It 13.1,18586 529 1.0 0.88 THIE47_01 NNNNGNRTCTG>GMWTT ctMIItdcIGGctrc 16.677521 534 1.00 0.85 CP2 :.01 GCNIMNAMCMAG CTGGctrccyc 3245733 542 1.00 0.90 AP2-Q6 MKrCCSCNGGCG cdCCCGc~g~cc 7.004136 540 1.00 0.88 GUFl 01 NNNNN0NAAATCANNGNNNN gtccEtclgagcGATTaalgctac 3.669-729 554 100 01.8-,
NNNN____
A Q6CWCAGCTGGN cagaGCTGyg 12.133G46 590 1.00 087 AP4-01 WGARYCAG3ZTG'GGNCtL( glgcc;tcuaGCTGggtuaa 178.524321) 603 1.00 0.6 NNGTNRCtNNRGYAACNN4 (4ccagctggGCAActg 7.226828 607 1.00 89 IccVA--TGN1;aGCTGgg 18.2817q4 608 1.00 0.97 A-4P4_Q5 N-14NCAGCTGN N LcraGCTGgg 2.628244 608 1 (0O 0.96 AP4_06 CVVCAG7GGN IcCAGc1 9 cjg 18.281704 008 1.00 0.93 AP4Q 5 NNCAGCTGNN ICCAGCIgyg 2.628244 608 .0 0.91 E47 01 NSNGCAGGTGKNCNN ctgggc.aaCTGCc~g 6-708124 613 10 0 0.01- SREBP1 .02 KATCACCCCAC gIgggGTGAta 30.499284 682 1.001.) GATAI =03 1.JNNNNGATAANNGN gggg(GATAgtcca 2.776354 68'3 1.00 0.93 OLFi_01 NNCNANTCCCYNGRGARNN agractTCCCctggg9gya 64.601270 697 1.00 0.89 IKI1_01 NNNTGGGAATRCC gj.acITCCCcIg9 141.404154 696 1.00 081/ NRF2O01 ACCGGAAGNS c-acTl'CCcct 6.109080 699 1.00 0.86 AHRARNT-01 KNNKNNrYGCGTGCM4S icucwy99CjGTGlga 10.450-129 703 1.00 0.85 NFY_06 TRRCCAATSR~N c3UCCAA~alt 6.200634 7W0 1.00 0.817 COP 01 CCAATAAT(CGAT ucAATAttrgIt 147.430729 733 1.00 0.6 GATA C NGATAAGNMNN tgctgTTATCI 2.004465 7441 1.00 0.93 GATAl -03 N'NNNNGATAANNGN gctgtTATCItcgg 2.77635A 745 1.00 0.98 GF101 NNNNNNAAATCANf'JGNNN'N gggaaaggAATCCtt9CMccgCt 3.669729 770 1.00 0.89 NNNN CP2 01 GCNM14AMCMAG CTG-gc19gg 3.245733 787 1.00 0.90 RORAI 01 NWAWNNAGGTCAN ggctgggGG.TCa9 7.616064 807 100 0.85 2007240161 06 Dec 2007 117 Filtration Site Consensus Sequence Z score Position! Core Template (bp) similarity similarity APi_02 RSTGACTNMNW IggggGlCAgg 5.905504 810 1.00 0.88 APIFJ- 02 RSTGACTNMNW tggggGTCAgg 5.905504 8t0 1.00 0.91 NF01ACCGGAAGNS ccIGGAAgag 6.109080 822 1.00 0.06 E47_02 1NNNMRCAGGI'GTTMNN gctlcCAGGIgaggcI 15.631450 832 1.00 0.86 ATF_01 CNSTGACGTNNNYC tggTGAr-gaaagcg 8.675151 860 1.00 0.92 CREB-04 NSTGACGTMANN ggTGACgaaagc 16.981467 861 1.00 0.94 AP_2RSTGACTNMtNW ggTGACgaaag 9.018955 861 1.00 0.92 CREB-02 NSTGACGTAANN ggTGACgaaagc 26.730221 861 1.00 0.95 APiFJ_02 RSTGACTNMNW ggTGA~gaaag 9.018855 861 1.00 0.95 CREBPL-Q2 NSTGACOTMASN ggTGA~gaaagc 22.127714 861 1.00 0.89 AP1...Q4 RSTGACTMANIA UgTGACgaaag 13.148586 861 1.00 0.91 CREB..1 GlOM TGA~gaaa 4.176203 863 1.00 0.86 AP4 01 WGARYCAGCTGYGGNCNK aag~ccccaGCTGtcagr, 178.524329 917 1.00 0.88 AP4 08 CWCAGCTGGN ccaG8tci.281794 922 1.00 0.97 AP4_-06 CWCAGCTGGN ccCAGCtgtc 18.281794 922 1.00 0.94 AP4_05 NNCAGCTGNN cccaGCTGlc 2.628244 922 1.00 0.98 AP4_05 NNCAGCTGNN ccCAGCtgtc 2.628244 922 1.00 0.96 AP1FJ 02 RSTGACTNMNW cagctGTCAgc 5.905504 924 1.00 0.91 APl-02 RSTGACTNMNW CagclGTCAgc 5.905504 924 1.00 0.89 GFI101 NNNNNNAAATCANNGNNNN IggcagccAAT~agatgcgagga 3.669729 955 1.00 0.90 NNNN CAAT-01 NNNRRCCAATSA ggcagCCAAlca 4.415584 956 1.00 0.98 NFYC NCTGATTGGYTASY ggcagCC .AATcaga 69.836703 956 1.00 0.96 NFY_06 TRRCCAATSRN cagCCAAtcag 6.200634 958 1.00 0.98 AP4 00 CWCAGCTGGN gacgGClGcg 12.133646 976 1.00 0.86 AP2 06 MKCCCSCNGGCG cggctgCGGGtt 7.064136 978 1.00 0.91 NFY _06 TRRCCAATSBN eccaTTGGII( 6.200634 995 1.00 0.95 CAAT-01 NI'JNRRCCAATSA ccaTTGGIllac 4.4 15584 996 1.00 0.9t TATA-01 STATAAAWRNNNNNN ggagcclcTTTAtcg 7.166360 1025 1.00 0.86 GATA~C NGATAAGNhMN-N rccITTATCg 2.004465 1029 1.G0 0.92 GATAI_03 NNNNNGATAANNGN ctcttTAlCgagtg 2.776354 L 1030 1.00 0.93 API_02 RSTGACTNMtNW agTGACtacig 91851040 1.00 0.93 APl_04 RSTGACTMANN agTGACtactg 13.148586 1040 1.00 0.03 _PFJ02 RSTGACTNM14W aGAtcg9.018855 1040 1.00 09 GFI1..01 NNNNNNAAATCANNGNNNNlclcgctclAATCagagcltccaggI 3.669729 1056 1.00 0.94 2007240161 06 Dec 2007 118 Filtration Site Consensus Sequence Z score Position! Core Template TSS (bp) similarity similarity BTARBIE01 fTNNGNRG LcagacAcna c g 33399 11 NFKB_06 I TGGTCCCCNN f CagGGgac 6.21430 10722 1.00 0.88 NFKAPPA01 TGTYCA ttGGGAg4 .286697 112410 09 XBP1I 01 -NNrGATAIGKNNW gta~gcACTcg 2.3243 -19 11410.9 ER.Q6~~~"3 1NRNANTAGYN gcgttcGAcc 167 10 te GA1FJQ NSGATNMNW9 cGAaga 20-16 19 10 09 BAMYI_01 Nt'JN FkGNC GNGG ggcagAAAgraggc 32.363 7 108310 09 NFB_06 NN--iZGGCNNNNNCCNN aGITGCtgtgcg 16.T5132 112 -10074.
MZFI 01'C NONGOGG -9.25691 -10350 ZF _01 -02260 10164 AP2..O1 MKCCNYGGGN iGgGtgg 01385 1-10 1.0 MZFIOI NWCGGGG ~gGcgGGa 1013540 -1008.8 XB101 NNNGGTGKWNNWN gcggAGGtca 1.301385 100810 0 VMY 0 AYAAGGWNN gc.CagGta 4.658-11.0195 -999 DAF_01 RSTNGACTNN c-ACCctc .830664 -97.9 01 NNNNGGAN cttGAggg 7.106945 -76 NFf NrGCNNCNN aaaag gCCAa 1.30531 -960.0 09 DELTAEF101 NINCACCTNAN aAGGTgg 0.830664 -09048.00..
IK-iNNNYGGGAWNN'N iggcGGGAtg 019855 -13 O08 2007240161 06 Dec 2007 119 Filtration Site CnessSequence }Z score Position! Core Template (bp) similarity similarity 1(2_01 NNNYG GA NNN lagaGGG 39ag -195-8 .0.8 NFI 06 NNTTGGCNNNNNNCCtNNM IyggcttccIgGCCAlII 1.651312) -885 1.00 0.86 11(2 01 NNNYGGG.AWNNN cagtTCCCtcaa 1.019855 -8157 1.00 g NFl Q6 NNI1TGGCNNNNNCCNNN4 cttTGGCtgcaclctacc 1.651312 -831 1.00 0.133 DELTAEFI-01 NNN'CACCTNAN dAGGTt.ac 0.8306641 -820 1.00 088 NMYC_01 NNNCACGTGNNN a-.jcACGTGgcc 0.303606 -803 1.00 0.88 CMVBO NNNNNIIJGHCNGTTGNN atgldccccGlTGgc;ga 0.187475 -782 1.0 9 1K2 0-1 NI' YGGGAVNNN igicVrceccgit -1.0-19855 -8 1.00- 0.88 MZF1 =01 NGNGGGc3A tCC~t-0.225601 -777 -1.0)0 0.96 VMYBO02 NYAG ccCGTTggc 0.32709B -775 1.0f) 0.9 Nfl_0 NNT*TG-GCNNNNNNCCNNN cgtTGGCgaacrctatt 1.651312 -773 1.00 (1.94 GATAl-04 NNC'NGATARNNNN cOaUTATCMca i. 128924 -16 100 0.92 GATAI-02 NNNNNGATANKGNN claWfATCclcaa 1.132907 -760 1.00 0.89 L%1O2COM 02 14MGATANSG 0.679593c -/58 CMYB 01 NNNN~NGrTGNN gccMgcagc .8475- 1.00 0.94 VMYB-02 NSYAACGGN cvccAACGgc 0.327008I -743 1.00 0.99 DELTAEF 1= 1 NNNCACCTNAN tgrcACCrcct 0.830664 -732 1.00 0.93 12-01 NNNYGGGAVVNNN c-rgcGGGAgccg -1.019855 -720 1.00 0.89 11(2 0% NNNYGGGAWNNNN gccgTCCCcacg -1.019855 -712 1.00 0.91 MZFIIll G tCC(CCcg 1 0.66794l0 -708 1.00 0.90) 31(2_01 NNNYGGGAkNNN I actrlCCCcagc -1.019855 -69-4 I.Ofl 0.0( Mi01NGNGGGGA fCCCMag; n.667940 -600.0 0.9/1 WACGI-c.ACgct 0.302731 -606 1.00 0.06 AP405Q NNCAGCTGNN caaaGCTGtg 1.465487 -6771.0.9 11(2_01 NNNYGGGAWNNN acyLcTCCCatt -1.019655 I -660 1.00 091 NNG-AGCTIGNN ttGAGCttca 0.302731 .650 1.00 0.87 DEL1IAEFI 01 NNNCAGCTNAN cltrACCTccra 0 830664 -645 1.00) 11201 NNNYGGG3AWNNN gagtGGG'\aacg -1.019855 J -603 1.00 0.96 VMYB 02 NISYGG gcjAACGgg 0.327U98 -596 1.00o 0.90 S81NN1j4NYAA.TTN aclaTAATcggagaut -0.676808 -566 1.00 0.86 NFl Q6 f4NITGGCNNNNNNCCNNN tgtTGGCcccrtccccct 1.651312 -544 1.00 0.95 11(2 01 NNNYGGGAWNNN ccccTCCcct -10985-537 1.00 0.87 MZF1_01 NGNGGGGA ICctc -0.225601 -533 1.00 0.96 1K2_01 NNNYGGGAWNNN gtagTcCayaq 1.019855 -434 1.0)0 0.98 2007240161 06 Dec 2007 120 Filtration Site J Consensus Sequence [Z score Position/ Core Template TSS (bp) similarity similarity 1K2- 01 NNNqYGGGAWNNN tagaGGGAgccl -1.019855 1 -410 1.00 0.88 NF1_Q6 NNTTGGGCNNNNNN1C(:WNt 9JR09a9CCI9GCCAgcc 1.651312 -40P 1.00 0.6 M _I0 NNGG cccGGGGa -0.225601 1 -391 1.0 .9 11(2 01NNNYGGGAW __ccuGGaa ,ca c -1T019855 -39 1.0 -0.9 NNCAGCTG-NN gaCAGCcI~gg I1.408487 T -38 1.00 0.032 IK201O NNNYGGGAWVNNN agcgGGGAcaga -1.019855 -382 1.00 0.88 _Z--01 _GG agc;GGGGa -0.2256D1 -382 1.00 0.9- IK2 01 t'JNNYGGGAVVNNH cacgdGGGAaclc 1.0T185 -3 74 -100.9 C P01 RRTIK NNf-G i K N 1- a cltc t GC AAtt c 85 748 9- 1.0 0.-8 NNCAGCTGNN t----gCqAG C c ggt 1.006- -30To6 0.90 ARNT_01 NNNN-cKcGTGi-NNNN tatacaaCGTGggga 0.305357 -330 1.00 (0.88 MZF1 01 NG14JGGGGA c:CjtGGGGa 0.r67910 -323 1.00 0.(99 1K2_01 NNYGANNcgtgGGGAggca 1098 32 .008 11(2_01 NNNYGGGAWVNNN tggcTrCCCcaaa -1.019855 -308 1.00 0.89 M ZF1O01 NGNGGGGA fccccala -0.225601 -304 -1.00 -o 0b- AP4 5 t NNc-A ZGCTGN t4 A1-ca 0.30273T -296 1.00 0.86 11( 01NN~c~3GWNN tgtTCCgay -1.0-19855 -241.00 0.94 CETS1P54_01- NCMGGAWGYN ccCGGAgggr, 0327M -279 1.006- -OR9- 1K20 NNYGANN 6GAcg-198555 -264 1.00o9 NFl_06 N-KNTTGGZCNNNNNCCNNN cogalcCCcc 1532-22.00 03 CERl RTKNNGIVAA(NIN tgalgcaGCAAga9g 1.857489 -229 1.00 0.90 1K2 01 NNNYGGG-ANN gu9yyTCCCwa~ -1.019855 -211 1.00 0.91 NNNYG-GWNNN- glgaTCC~jc 1.0TO19855 -172 1.00 .9 (1(2 01 NNNYGGGAWONNN cIccJCCCttg 109 12iO08 NFI Q6 NNTTGGCNNN NNCCNNN -tTGGCccr cgt 1.651312 -156 1.00 092 NHl0 NNI TGGCN(4NNNNCCIIJNN cgar9giigcagGCCAgtg 1.651312 -138 1.00 0.86 R BO CE02 NNGNTGACGYNN -Tg-agrFGACgg{-. 7.972F 34 -122 1(0 018 ANQ51NCAGCTGNN cggcGCTGct 0.302731 701.0 0.116 DFL-TAEF 01 NNRNCACCTN4AN I jA-CCTycq -0.830664 -41 8 NNCAGCTGNN ctCAGCgcac 0.302731 -415 1.00 (3IV, 01 cIT- g 1.05-1 -38 1.00 -0.93 NF1 Q6 NN1TGGCfINNNNNCCNNN acfTG(3CfIaa9ggqcog 1.651312 -37 1.00 (1.92 1K2..1 -NNNYGGGAWqNNN 9grT6CF9tgc- -1-6019855 -18 1.00 c9 AQ5 NNCAGUTGNN gcZ1GCTGc1 1.4654187 1 -9 -1.00 0.92 2007240161 06 Dec 2007 121 Filtration Site Consensus Sequence Z score Position! Core Template (bp) similarity similarity AP4 Q5 NNCA(3TGNN tgctGC;T~gq 0.302731 -6 1.00 f 0.90 11(2_01 NNNYGGGAWNNN cggaGGGAaggc -1.019855 4 .0.3 AP4 Q5 -I NNCAGCTGNN aaqaGOTGcg 1.4654P7 19 1.0o( 0.93 DELTAEFi 01 N1NNCACCTNAN ggaAGGTgaqa 0. M 3O 6 64 37 1.00 09 11(2_01 N1'YGGG1MNNN gacjaGGGAagaa -1.019855 56 .00.93 11(2_01 NNNYGGGAWN14N gqccGGGAggga 18596 1.00 09 IK20..1 NNNYGGGAWNNN I (jqaGGGAtra -1.01985510 1W(201 NNNYGGGAWNNNV aatC-gy SR 01 NNI.NN~yAiN cciAgqaATTAqggg -0.67'30 116 0.
12 01 NNNYGGGAWNINN cccIGGGAalta CETS1P54_0 NCMGGAWGYN tcctTCCGgt -1.03277 17 1.00___ CMY13_01 NNNNNGNCNGT1G NN tccgtqaatlGTGacqa 0177 CREB_ 02 fNNGNTGACGYNN atgtTCA qacg 09?4 AP 5NMCAGCTG1,N gacgGOTGaa 0.302731 167100U8 1(2 01 NNNYGGGAWVNN -atctGGGAcet -1.019655 1.00.9 DELTAEF1 01 NNNCACCINAI4 acacACC'Tgca 0.8306G4 24 1 MYOD-06 NNCANCTGNY caCACCtgc-a -0.1 "580524 VMYB_02 NSYACG ccCGTTaga 0.32709826 11(201O NNNYGGG WNN IccC~ct -1 .019855 275 -MZF1 01 NGNGGGGf\ -0.225601 279 1-00 0.9 11(2 01 NNNYGGGAWNANN ccacTCC.C-cag -4-019855 316 1.00U.8 MZF1 01 NGGGAICCCCcag -0.22560130 )K 1201 NNNYGGGAWVNNN faagTCCCgctt -1.019855 332 1.00 n. 11(2 01 NNNYGGGAWNNN gajqTCCCagtt -1.019855 383 10 CETSiP54-01 NCMGGAWGYN cagtrCCGgc 1.032772 390 10 In 1ELAEF1 0.1 NNRNCAccTrNAt, ggie-ACCT11a 0.830664 4021.) 09 CEBPB Gi RI4RTKHNGMAA.KM'N acctfilaGGAAcII 41685.0890.93 DELTAEF1_Di '1NNCACCTNAN cagAGGTyyac 0.830664 45 10 09 2007240161 06 Dec 2007 122 Filtration Site Consensus Sequence Z score Position! Core Template TSS (bp)l similrt similarity SB 61 N!'NI'NYAATI N tclgacjcgAT-TAatyc -0.6768BE 559 I 1.00 (1.86 DELTAEF1_0,1 NN14CACCINAN alaAGGTgtgg 61.683O664-7 1.00 AN4_05 NNCAGCTGNN cagaGCTGgg 1.46i5487 590 1.00 0.91 CMYBO NNNNNNCNT-TGNN LcggCAACtqcd;gIctc 0.187475 6141.0.4 01 TYAAGTG cATO1.05547 670 1.00 0.88 GATAI-02 NNNN'NGATANI GNN gggtCATAytcca 1.132907 684 1.00 0.93 GATA104 N.NCWVGATAFRNNNN U ggtGATAgIcca A1182 8510 .2 LM02COM_02 NMC3AIANSG gGTic0 679593 687 1100.9 IK2_01 NNNYGGGAWNNN cwAtTCCCctgg -1.0855 690 1.01) 0.89 NKX25-01 TYAT ATFc1.905547 699 .008 MZF1 1 ICC3tg -0.225601 -70Y 0 0.95 N- 6 NTGGCNN;,NNNCCNN'N tgtucagcactGCCAa j 1.65131 721 1.00 09 GATA1O4 NCWGATARNNNN gcdgtTATC tcr 1.128024 74~5 1.00 0.95 GATA1__2 NNNGTNKGNN t gctgtTATCttrxjg I.12007 745 1.00 0.9 2 LM02COM-02 NMGATANSC3tGtAC 0.679593 7.17 1.00 0.94 MMZ0 NNGGG IgaGGGGa -0.225'60-1 766 1.00 0.97 1K2 01 NNN'YGGGAWNNN tga9GGGaoagg -1.019855 766 1.00 0.90 NFI 06 NIOTTGGC14NNNNNCCNNNI- gcctg(g CCAgqc -1.651312 7 85 1.00 0.95 LM02COM 01 SNNCAGGTGNNNq ftrCAGGtgagg 0.773414 84 1.00 0.94 DELTAEF1O01 NNNCACCTNAN tccAGGT~agy 0.030664 8-i5 1 00 0.918 MYOQ-Q6 NNCANCTGNY tcc~aGGTGa9 0.7410149 835610009 CREB-02 NNGI4TGACGYNN LAygTGACgaaa 0.972541 859 1.00.94 1K_1NNNYGGGAWNNI' IcggTCCCgga -1.019855 8036 1.0 .9 NKX250 I TYAAG1 G MA 1.905541t 9.15 1.00 0.806 1K2_ 01 NN14YGGGAWNNN taagTCCGCca~c -1 .019855 916 1.(00 0.0 MZFI".Ol NGNGGGGA fCCCCayc 01.667940 920 1.00o 0.97 AP4_015 1NCCGNIglCAGCcct 0.03 g2 1.00 0.86 NFI_06 NNTTGGCfiNNNNN~CC1N.I' ua9cdcggcaGCCAatc 1.65-1312 949 1.00 09 NFl Q6 NNTTGGC10INNNNCCNNN tccTlGG~agccaatca- 1.651312 952 1008 AN NNCAGCTG14N gacgGGTGcg 1.465487 9*76 1.00 0.91 1K2_01 NNNYGGGAWNNN gcgrcTCCCaftq -1.019855 g00 1.00 0.94 GATAIO-4 NNCWGATARNNNN cttlATCgayt 1.126924 1030) 1.01) 0.92 GATAI 02 WNNNNGATANKGNN cIvIITAI'CgagtU 1.132907 1030 1.00.0 1-1O2COM 02 NN1GATANSG c0TATOga 0.679593 1032 1 .00.05 2007240161 06 Dec 2007 123 Filtration Site Consensus Sequence Z score Position! Core Template TSS (bp) similarity similarity S_1NNNNNYAMATH1 I 5cTAATcagagctt -0.076808 109 '1.00 0.85 NNCAGCTG;-N ctgcGCTGtg 0.302731 10al1 1 000.1 11(2 01 NNNYGC$GAVVNNN ctgtGGGAaaa -1.01-9855 1--089 1,-i.00o 0.95 GATA1 _02 NNNNNG-ATANKGt.JN tglggGATAaagqja 1.132907 1090 1.00 0.93 GATA1 NWCWGXT-ARNNNN gtggGA~g -1.128924 1091 -1.0 0.03- I-MO2COM 02 NMGATANSG gEGA7VAaag 0.67959i3 1093 1.00 0.93 NMNBN01 NNCNNNGNGNGTG NN ggqgcagggaGTTGcccg 0.187475 1118 1.00 0.0 1(2-01 NNNYGGGAVWNNN ggcaGGGAgttg -1.019855 1120 1.00 0.89 NNCAGCTGNN c9CAGCLgca .l51I1f'100.9 ARNT-01 NNNWNCACGTGNNNNN c-accgCACGtcttcag 0.-305357 1 142 1. 00 (16- CMYBI NNN N N dNC-N F 1G NN CragYcccgacCTTGCCI 0.187475 1155 1.00 0.93 V MYB_02 NS-YAACGGN auOGTT(jtc (.20811532 1.00 0.9)7 (2 01 NNNYGGGAWINNN tctgTCCCgtcr, -1.0198-55 1179 1.00 IK2 01 NNNYGGGAWNNN GccgTCC-CGC 1.105 10 1.00 0.87 MZFIO NGGGAICCCCtgc 0.225601188100,9 HNF313 0 NNNTfRTTTRY(--- ct .;TGTTiItac 3.978004-110 0.99 0.84 COPCR3HD 01 NATYGATSSS gGAag.712-00093.5 USF Q15 GY7C-Ac-GfGT 2.1710_16809__.B None___ G;cCCggg -5.3§90268 -950u -0.86 -0.87 NoeE47 01 NStNGCA.GGTGKNCNN gccACAGqga9rct 6.708124 -950 0.83- (6 (Mtnseto efut USF Q6 GYCAGGTt3NU cacaGGTGag 10.9606075 -948 .208 parameters) LiPCNAG'N AG[a -031 97 .60.82 CETS1 P54_01 NCMGGAWGYN ggctTCCTgg 1.032772 -883 0.93 -0 iaAT-O 01 NN'NRRCCAT-SA C-cIU9 CCArltg 4.154-88 86- 06 CEBPB_01 RNRTKNNGMAN cagTTCCr.aa-t 1.857489 -857 0.87 00 AP2 .Q6 MKCCCSCNGGCG gCCCCCcatgCg 7.0641.6 -8-i43 0.08 0.86 USF-C NCACGTGN OCTCGTg 0.301857 -801 0.81 0.86 CES1P4 1 CGGWGNccTGGAtglc 1.032772 -787 0.85 0 02 CETS1 P54 01 NCMGG?\WGYN caor;TCCTgc 1.03277 2- -729 .93 CETSIP54_01 NC-MGGAWGyN carcTCCAgc 1.032772- 642- 0.85(i9 CETSIP54-01 NCMGGAWGYN UtdTCCAga 1.032772 -611 0.83509 CEBPB_01 RNRTKlNNGMAAKNN ggtTTTCctuaaaa -1-.657489 -9 09 0.88 CEBPB_01 RNRTfKNNGMAAKNN -gttTTCCtcaaaat 1.857489 -590 (1.87 09 GC0 NGGGG~~~ ggc, ccCTCCcccl 15.933816 -54(0 .88 0.91 SP..6 NGGGCGGNgccrCCTC~ccct 11.1191-14 5 3 0.(-84 0 2007240161 06 Dec 2007 124 Filtration Site GEYIP54 0- CGAW3N.c.CC AP2 (16 MKCCSCNGGCG a~q CCCgd AI2 QG MA(CtC5NGGCG lc;,G RFXI 02 NNGTNR NNRGYAACNN cgyggacaroigG NFKAPPAI365-1 J- GGGHATI TC
G~
Core Template milarity similarity 0.988 0.90 0.03 0.05 FBI 03 8r) CETSIP54-i 1 rjrlGGAWGYN 0wYB-1- AAYAACGGNtJ TATA 0% II LYFI 01 1 TTTGGGAGR CEBPII 01 fRNR['KNNG1 KNN cdtaAA 1.5748 -25 0.99 0iW0 pigDS C NGTGGTCTG 1267 173 _0 Ti CAAT 01 Ni'JRRCAATSA gc-.aggCCGt 4.454 -131 0.85 00 i$SF Q6 GYA;GGCgc(u'TGag 028-2 0.50 0.A'I) APNC VNASTN grGAGTGAc 1.751881 -123 085 0OPl ANI NTc3ASTCAN 9-1 GZGTGA(; 1.751881 -123 0.86 0.86 01APAO GGGAMTTYCC GGfJOC9.289691 -91 9040 6SFQ GYAcTN gCACTtggc 5.90b -40 0.86 0.80 USFC I ICACGTII4 kjCACTTgjg (1.057 033 -CETS1P54O 01 NCMGGAWGYN c'TGGAaggt 1 37234 0.85 00 RFXI 01 l4lNGTI'JP.Cl4NRGYAACNlN aaqTt'CEtgg(;aatta .2312 i 0.18 CLOX_01 lNNTATCGATTANYNkW tg-aATTicatcaclga 89796173 0.07 39 -CDP-02 J Nr'NtATCGA31 TANYOAN tgaAl*TGatcac;tga 37.346724 173 0.85 01.80q L-M02CO!A_01 NNAGMTINI rjg;vaTC-TGq~u 0.77341-1 201 0* .82 0.90 -K-6 d NNCANCTGNY qaCATCtqg -0.175805 2060.2.8 USF C WCCTG CACCIg 0.39t;5 243 0.13 0.9 AP2_06 MI<CCCSCrNGGCG aj(iCCC',tcjCCC 7.06,1136 2510.9 -~CMYB_01 -NNqNNNNGNCNGrr' L3NN -ccct-.ojccc;GTTAgaaC 0A 8I.V75 25-3 CETS11?S4_01 NUMGGAWGYN gziAP-CTqc 2. 67 4616GT- 207 T 0 AP2Q MKCCCSCNGGCG cItCCCCCItgCC 7.064136 27 8 0.98(06 MY.1AAYAACGGNN agaAACTgag 41.360548 3050.8.7 CERiP16 R I riflTk4NGlMAAKI'IN gctutaG'TAag 1.89,748934 CEBPBO01 RNRTKNNGAAAKNIN aaa(CGAA-iga 1.8748 015 CEBPBO RIJTNGMA('N Flg AAgg99 1.857489 3600.158 2007240161 06 Dec 2007 125 Filtration Site Consensus Sequence Z score Position! Core Template I ITSS (bp) similarity similarity -CETSIP5.4_01 ICMGGAVVGYi4 aactTCC fo 1.032712 415 0 93 0.06 LM02COM 0-1 SNCAGTGNIN ctcoaTCTGtgt 0.7734 445 (.200 MYIDQ NNCNCTNY cQ'TCtglg 75805 446 0.92 0.91 API..C 14TGASTCAN oTGTGTC;,Ag 1.7518e81 4151 0 86 I.0, -CLOX_01 NNTATCGATTANYNW aaaATAGatcaggaa 81.9789313 478 0.81 0.85 CDP_(02 NWNATCGATTANYNN aaAA-t-ga 7-.346724 4--786 0111 0.6 CET51P54 01 NCMGC-AWGYN tcAGGAalng 102772- 486 0.83 0,811 GATA.C NGATAAGNMI2IN agtcICTATCc; 2.004465 507 0.9 0.92 GC 01 NR GGGGGGGGCNK- !otgGGCAqW-g 15.9.33816-.- 58--T18f' CEI*SiP54_01 NGMGGAWGYN tgccTCCAgc 1.032772 604 0.85 0 89 LM02COIA 01 SNNCAGG1*GNNN t0cuiGCT Gggo 0.7734141 607 088 0.94 LM02COM_01 SNNCAGGTGNNN vlcCAGCtgggc 0.73414 607 0.88- MYOD 06MCN~MJ 1. 6561 02i 608 09200 MYOD_06 flNCAN4CTGNY tCCaGCTGcjg 1.656102- 668 0.0TT2 016t LM02COM_01 SNNqCAGGGN gcaACTG~CC 0.734 14 61-5 0 80 0 il1 VNIYBO1P AAYAACGGNII ggcAACTg(,c 4.Tr006548160806 VMYB 2 NSYAACGGN ggoAACTgc 0.327098 616 0.812 0.89 MYODOB6 NNCANCTGNY ggCAACtycc 0.175805 616 0.87 0.Y7 GATA C NGATAAGNfVNN tGATAGtocag 2.004465 M8 .3 8 AP2-Q MKCCCSCNGCG ttCCCCtgggCg 770-64136-- 702 .98- -0.08 USF 06GYCACGTGNC ggcgq1GT*Gaa 5.3002C-8 710 0.86 0.117 CHOP_01 NNRT'GGAATMCGC gtgTGAAa(gtcc 22.328306 713 0.80 0.08 CETS1P54 101 NCMGGAVVlGYN aayTCCAcjc 1072- 79- 0.85 0.86 OCTI_02 NNGAATATI(CANNN14 gc.caaIATTCgttgr 11.885447 732 03 00A CDPCR37 0i KACCR TrATNG CAATattcg13gctg 92.:378088 734 0.97 0.86 VMYB_01- AAY-AACGG--NN tgcTGTTatc; 4 M.6548 7441 0.82 0.39 STAT 01 TTCCCRKAA tcggAGA -6.211497 754 0.81 0.M8 CETIS1P54_01 I4CMGGAGYN yg.AGGAgct 1.032772 801 0.93 0.90 AP2_06 M KCCCSCNGGCG aggctgGGGGtc 7.064136 806 0.80.8 CE1SIP54_01 NCMGGAWvGYN tcAGGAcc~g i03-2772 816 0.87 CETS1PS4_01 NCMGGAWGYN ccl'GGAaoa-g .32772 822 0. 85 08 6-otCA4 NCMGGAWG;YN gg 99Cg 1.032772- 831 -0.85 0.02 USF 06 C3YCACG.:TGNC IccaGGTGag 10.9b0075 835 0.82 0.66 USCNCAC-GTG(N ccAGGTGa 0.301857 1- 836 0. 86 -0 PT2 2007240161 06 Dec 2007 126 Filtration Site Consensus Sequence Z score Position! Core Template (bp) similarity similarity SP1-Q6 NGGGGGCG(GcYN ttgQGGTGgagczc H.119144 B847 0.82 0.87 G C_01 NRGGGGCGGGGCNK Itg9GGTGgagcct -15.53816 847 0.87 0.91 USF_Qb GYGACGTGNC gcncfGGITG-c 5.92G 57 0.02 0.911 CEBPB_01 RNRTI(NNGiMAAKNN tgqtgacGAAAgcg 1.857489 860 0.99 0.91 MYOD_0i SRACAGGTGKY(3 ccr.caGCTGtca 32.908282 921 0.8308 L-MO2COM_01 SNNCAGGTGNtAN ccc~caGCTGtca 2.232288 921 0.08 n.92 LM02COM-01 -SNNCAGGTGNNN cccCAGCtgtca 0.744910.83 0.93 MYOD Q6 NNCANGTGNY ccA~~;1.6561(12 922 0 09 MYOD Q6 NNCANCTGNY rccaGCTGtc 1.656102 622 0.92 0.89 S-NNCAGG VGNNN aatCAGAlgcga 0.773414 9633 0.82 08 MYODQ6 NNCANCTGNY atcaGATGcg -0.175805 061 0.92 0.94 CEBPB-01 RlNRTKNNC3MAAKNN ytTTACtccaccc 1.85M489 1001 0.93 0.90 GC_01- NRGGC.GCG--GGGCNK ttarctcCACCcctg 15.93381% 1004 0.B7 0~ SPiQO NGGGGGCGGGSYN tactcCACCcctg 11.119144 1005 0.82 0.85 USF66 G'(CACGGGC aI(c9AGTGaE; 5.390268 1636 860.688 lINF3B_01 NNNTRTTTRYTY lac;TGTTtgcct 3.978804 1046 0.09 0.92 GETSIP54-01 NCMGGAWGYN gctTrcCAgg 1.032772 10005 .9 CETSIP54-0-1 NCrOGGAWGYN tc:AGGAacrc 1.0M2772 1075 0.93 0.811 GEBPB-01 RNRTKNNGMAAKNN ggataaaGGAAtga 1.857489 1094 0.87 0.80 .CEBPB_-:01 IRNRTI(NNGMAJd'.NN a9jgIIcciGAM~ggjg 1.857489 0 0.990.1 GC-01 NRGGGGCGGGGCNK aaggGGCAgggagt 15.933816 1116 0.81 0.8 NFKBC- t iGGGACTTTC.C.A iGGGAGttgocx 42.313772 -1123 06.83__- 2007240161 06 Dec 2007 127 Table 2: Sites, scores, consensus and positions relative to the site of initiation of transcription (TSS) predicted by the NNPP, TSSG and TSSW software packages in mice Filtration Site Consensus Sequence Z score Position! Core Template TSS (bp) similarity similarity GFIl_01 NNN ttgccl,,cAATrCcaogcaact(att 2.393233 -842 1.00 0.8 NOMNNNAAATCANNGNNNNNNNN Ht4F3B-Oi tNNITRTn*RYTY aacTATTyattc 2.FZ29849 -8251.0.8 Comparative CEBPB_01 RNRTKNNGMAAI(NN tgalttctGAAAttg 1.4608.16 -787 09 anlssbtenCEHPB 01 RNRYKfNMAKNN atgTl0Caaaalg 1.460836 -760 1.00 0.01 seesNFi_06 N~NTTGGCINNNNNCCNNN tt,.TGGCtggt~gcagga 2.19920 2 6M 1.00 080 AP4_06 CWCAGCTGGN eaCAGCagtg 1T4.14306 -386 1.00 0.07 NFKAPPABO 01G3GGAGCTGCC 0UUGCTGcc 11.12 -301 1000 NFY-Q6 -tRRCCAATSRN ccICCAAtggc 5.1873G9 -150 1.00 0.89 HFH2 01 NAWIGrrlTRrTr aaaaaAACAaaa 56.365713 -1211 1.00 0.04 ShY-02 N'NWAACAAWANN aaaaACAAaaac T 464442 120q Q.0 .94 Z score 1.96 HFI 12_01 NAWTGTTIRTTT acaaaAACAaaa 56.365713 -1.205 1.00 0.87 SRY-02 tIWAACAAWAAN aaaa.A~k~aaca 3.86-0390- -203- -1-.00 0 HFH2_01__ NAWTGTFTTRTTT aacaaAACAaaa 28.1 651 26 -1200 1.00 0.119 NV902 WAACAAWANf- caaaACAAaaac 3.8603,311 -1198 1.00 0.94 HFH2_01 1.JAWTGI'TTRTTT aicaaAACAaaa 56.3657*13 -11i.1 1.0r) 0.1 SRY_02 NWAbCAAWAN14 a aaaACA~aaac 1 7.964442 -1192 1.00 0.941 HFH2_01 1NAW\TGTTI'RTTT acaaaAACAata 26.165126 -11118 1.00 09 1*1-1101 NAWTGTTTATWT acaaAAACaala _28,079407 -1188 1.00 0.87 SRY-02 IOJV'WAACAAWAN1J aaaasACAA1aaa j3.860390 -11B 1.0.D8 TATA.81 STATAAAWRNNNNNN caaTAAAaaCclctg 3.065815 -1181 1.00 0801 NFI_06 I4NTTGGCNNNNt4NCCNNN gtiTGGCcgtgatggagg 2.199282 -1140 1.00 03- CHOP_01 NNRTGCAArmccc aggTGCAaq(cccl 20.43263A -1 1011.1 0.85 SRYO02 NWWVAACAAWANN clgcACAAaagt 3.860390 -1093 1.00 0.85 LM i01 TTTGGGAGR UaiGGGAga 1.842719 -1082 1.00 U E2F 02 TTTSGCGC grgaGAAA 3,546219 -10./i 1.00 0.91 -GATAl-03 NNNNNGAI*AANNGN 1 tgtgaGATAgalog 2.031644 -10412 1.00 0.8 CDPCR3H8 01l NATYGATSSS gataGAI'Qgg 2.349950 -1037 1.00 0.97 NFE2_01 TGCTGASTCAY ggCTGAgtctc 21-950203 -1009 1.00 0.87 CHOP 01 1NNRTGCAATMCCC atcTGCAaaaccc 20T432681 -960 -1.00 0.86 NFl 06 NNTTGGCNNNNNCCNNN aiactcargcttGCCAgqg 2.199282 -938 1.00 0.85 2007240161 06 Dec 2007 128 Filtration Site 1 Consensus Sequence Z score Position/ Core Template TSS (bp) similarity similarity SRY 02 P'.NVVAACAAWAIJIJ tg~tTTGTgaa :5.86090 -9 .008 oI~1(iNAWTGTT*VATA'T aai3IAAAC at 28.079407 -OMRg POLY C CAATAA/ANCCNYYYIF aAATAAAcc g t-ttft 177.269419 -18 1.00 0. U88-- ISRE_01 CA T Ic\CtiC raGTTIIGt(r M4.173196 00 1TALl 8ETAE47_01 Nr-INAACAGATGKTNNN 0 ggicaTCTGaga~at 1 5.99A313 -A59 8 E~ 0 NNNIJATANGNNNIIN altgag TCllgccacaah; 2:393233 b5 1.0 t::SRY 12F RWJ-AACAAWVV-ANN Cj(.CIAcAc(a 3860390 1,00 RX1 O' NN4GTNFRC TRGYAA;CtN, taaatccagGCAActa 7.172878 -83 008 G~hCi I)I.t-I~AJA CN1tGN1!i IN r:af-tT lhig 2.393233 -820 GF1 1 NNNNNNAAATCANNGNNN14NNNN tfgattCtAATCagi9alg 2.333-2 GATAI_03 NI.JNNI JGATAANINGN cttagGA IAttggy 2.031644 -ti09 1.00 NF 6TR.RCCAATSRN Qt~TTGGrt 5 187369 -8110 .1 GFM~ -0 NT.INNNNAPJJCAN4NGNNN NNNN gcjgctgc ;awcGATThtga1It 2.-391333 -79 0.9 E47 6:1 t4NMCAGGTCG TMAN( 1yC;TOi', IS3CAO LM2OMO 0 SNNCAGG3TGNNN tgccaCCTGatt 3.041567 -79 41.0.9 IAO 1SRAOAGGTGIG 4raC~0--0 06E8075 -794 1.0 0. (92 02 NWI/JAACAAWANN g~j~aTTGTcl~jc 1.860390 .7801.0.8 TH1E47-0 NN...GNRTGTGGAWT gtacaatMCAGGcigg 16.434630 -694 too) 0.85 CP2 0 (-GCNN'ANAMCMAG CTGGctqg(9.g IA721 -I (0 GAi13 NNNArNGNcacagGATAcaag 2.031644 5 SR 021ACAWN g~jatACAAagicu 3.860390 61.JO NF01ACGGGAAGNS accTTCCgar 6.701850 -600.07 ER_0Q6 NNARGNI4ANNNTGACCYNN aaat 0 gtccIcTGAC;GtCc 102,A054 0.8 _PF-Q RS9TG A T N M I A -V cTGAC, tcra 5.14'2253 -5410 -SGGNNW tGAcca.142253 1.00.86 NNNGATAA 4 NNG N r.-,1caGATAtg(rca 2.031644 -5610 0.87 OC11 6CWNAWT,(W.rSATRYN a~Iag~uATGGa ::&:4383 6-I 0.82 CTNATTTGCATAY a!*naqCAAAI aa 70.2, P 1 0!2 1.00 OCTATT aATrMat 3.983418 511.00.8(; 0 87 62-TTC3 atTAATta 3.983418 51 .00.87 NK2-2CWTIAA17 G IaA I TAaa 3.9B34 18 -A10 1.00 0.93 _R 0 NVWWIAACAAWANN asaaACAAatjgt 3.860390 -1910 2007240161 06 Dec 2007 129 Filtration Site Consensus Sequence Z score Position! Core Template jTSS (bp) similarity similarity NFl 06 NN'ITGGCNNNiNNOCNNN F tggTGGCac.acgcdtta 2.199282 -481 1.00 0.85 AHRARNT_01 Kt4NKNrYGCGTGCMS F 9 caCACGcctII~aI(. 14.600483 176 1 .Of1 0 86 GF101 NNNNNNAAATCANNGNNNNNNNN IacgcctfLAATCccagczictcagg 2.393233 -472 1.00 0.91 GFIIO1 -4NNNNAAAITCANNlrGNIJNNNNNN ggtctaaycagtGATltccaggcc 2.393233 -413 1.00 0.97 NK202 CWrAATTG aaATTAaa 3.98M118 -363 1.010 O6 LYF1_01 T1 TGGGAGR ttgGGC3Aga 7.842719 -333 1.00 0.89 NFI QG NNTTGGCrNNNNI4CCNNN 'g'ygagcIGC('Att 2.A99282 -305 1.00 0.8 NFKB Q6 NGGCGAMTTTCCNN tgGGGAgctgccat 30.067903 -303 1.00 f).87 WFKAPPAB 1)1 GGGA~iTYCC GGGAyctgcc 10.361187 -301 1. 00 __8 ER_06 NNARGNNAN14NTGACCYNN 9aactcacagqTGACccgI 10 .37.1054 -279 1,00 0,66 E47_02 NfINMRCAGGIGI IMNN actcaCAGGtgacccg 15.432640 T 277 1.00 0.90 LM02COM-01 SNIICAGGTGNNN 1tcaCAGCtgicu 3.041567 j -275 1 1.00 0.94I MYOD-01 SRACAGGTGK(YG tcaCAGGtgacc 40.698075 -275 1.00 0.09 SRUBP1_-01 NATCACI3TGA'( cZa-gGTG(Ac 15.355630 -274 1.0 .8 API_04 RSTGACTMANN 9gTGACccgtt 11.241163 -7 0 1.0 0.8c API 02 RSTGACTNMN'W- ggTGACccgIt 785015 -270 f91 AP1FJ Q2 RSTGAC TNMt.W I TAc.gt 7.80501 -270 1.00N VMYB-01 MAiAACGGNN accCGTTgf(7 3.427439 -268T 1.00 0.93 NFi 06 Ni4TTfGGCNN NNNN CCNNM ccogwCA 2.199282 -24-2 to0 (0.90 PADS C NGTGG.TCTC IGTGGTccc 5.230232 -169 1.00 0.89 GC_01 NRGCGGCGGGGCNi< tgqtccCGCCt.cct 3.031-167 1.00 0.87 SPiQ6 NGGGGGCGGGGYN grjtccCG~cC1t 25.529462 166 1.00 NFl_06 NNVTGGCNN[NNNGCNNN caal*GGCaaagtcgctg 2.199#282 -152 1.00 0.08 E47 02 NNNMRCAGGTGTTMN4N agtigrCAGGtg(Aaa '15.S 4320 *13410009 E47_01 NSNGCA-GGTGKNCNN gtaGCAGgtgcsiata 9.748242 -133 1.00 LM02COM-01 SNNCAGGTGNr4N OigCAGGtgcaa 3.0-11567 -132 1.00 0.06 MYOD_-01 SRACAGGTGKYG tagCAGGtglcaa 40.698075 -132 1.0000 CHOP_01 4NRTGCAATMGCC aggTGCAatatcc 20.432681-1810 -09 CAAT.01 NN'NRRCCAATSA aatatCCA~tag 113.434507 122 1.03 0.90 NFY Q6 TRRCCAATSRN tatCCAAIagt 5.187369 -120 1.00 0.92 GCO 01 NRGGGGCGGGGCNK aygggGGCEGggg-Ctg- 35151 -01.0 1.00 SPi_06 NGGGGGCGGG.-GYN agggGGCGgggct 25.529462 -103 1.00 0.99 BARBIE 01 ATNNAACCNGRNGG agcgAAGtggatgg 29.452018 6 1.00 0.91 NKX25_01 TYAAGTG gaAAGTt; 3.53-1570 9 1110 0.Mi 2007240161 06 Dec 2007 130 Filtration Site Consensus Sequence Z Position/ Core Template score TSS (bp) similarity similarity VMYB_01 AAYAACGGNN cagAACGgtg 3.427439 34 1.00 G FM 01 NNNNNNAAAI*CANNGNNNNNNNNI ggtgagaatw\TCccrcgaaggtq 2.303233 40 1.00 0.90 NI:KB Q6 NGGGGAMTTTCCNN tgagaaaaTCCCrn( 30.067903 42 1.00 0.86 ZInO 0 NGGC*TCYATCfYC g-gtGA c 1.9516 64 U01 TI1E47061- NNNNGt4RTCTGGMWTr ctggaqaICT'GGgyat 16.43,1630 75 1.00 0~ -S-RiEBP1-O2 IACCC gtyggGTGAgg 27.71080294.0(.9 NFE2-O1 TGCTGASTCY gUCTGA~cctc. 21.950203 108 -1.00 0.67 USP-06 GYCACGTrGNC gcCACG(Icc k6.857788 114 1.00 (1.87 NtNTGGGAATRCC cacytTGCCtg~a 14.836 1 .000 GATAI 03 NNNNNGATAANNGN tcccdGATAatIt9 2.031 644 1211.0 NKI X2 5_0 2 C(WT-A AI TG -ya TA A Tht 39 83-418 126 -1.06- 0.86 E47-02 IThNMRCAGGTGTTMNtN gtcCAGGlgc-ctac -~5.432640 136 100) (,.8T LM02COM_01 SNNCAGG'rGNNN ttcCAGgcct 3.041567 138 -1.00 0.06 WOD 01 S RACA-GGTGK-YG 1IL;CAGGIgccI 40.698075 138 1.001 0.06 GATA1_03 NNNNNGATAANNGN ttlctTATrCctjc,7. 2.031 644 -164 fAl 0.95 NFO0 ACCGGAAGNS h~cTTCCggg 6.701850 171 1.00) 0.91 STAF_(02 fM.NTTCCCAKMATKCMW,/NGC:NN ctccggcqgtUTGGGaaaa 3.11,255924 173 1.0000 IK1_01 I'4NNTGGGAATRCC gtgtGGGAaaaat 14.853568 183 100.92 LYFlI 0 TTTGGGAGR IqIGGGAaa 7.8,127 19 184 0.86 AP4_Q6 CWCAGCTGGN caCAGC99tc 14.114396 210 1 00 090 API 62- RSTGACT-NMNa cayc-gGlCAtc 5.142253 .212 .00.88 APi FJ Q2 RSTGACTNMMW cagegG TCAtc 5.142253 212 1.001(3 1 AU BETAE47-01 NNNAACAGATGKT NNN gcgqtcaTGTGgtcac 48.119467 '214 1.00 0.89 TALIALPHAE47_0 -'NNAACAGATGKTNNIl gc9yglcaTCTGgtcac 49.110467 2141.0.8 TAL1BETAITF2_0 NNNAACAGATGKTNNII4 gc~igtcaTCTG9Icac iS.1,19467 214 1.1) 0.88 TH1E47-01 NNNNGNRTCTGGMWTT- gc 4Ij~catCTGGteac 16.434030 214110 AP1FJ_02 RSTGACTNMNW atctgGlTCAc,. 7.895015 220 1.00 093 API-04 RSTGAC'TMAfWI a(IytGtTAcc 11.246163 2201.0U8 APIWQ2 RSTGACTNMNW atctgGTCAc(. 7.895015 220 1.00 0.90 ER Q6 NNARGNHANNNTGACCI-N tctgGTCAcclcgagggao 10.374054 221 1.00 0.86 NFl _Q6 NNTTGGCNNNNNNCCNtNN gagggacccLGCCAacc 2.199282 233 1..09 NKX25-01 TYAAGTG cACTTtc 3.534570 :267 1.00 0.66 APIFJ_02 RSTGACTNMIN 99ccIG1TCAcc 5.142253 281 1.1) 0.1 2007240161 06 Dec 2007 131 Filtration Site Consensus Sequence Z score Position! Core Template TSS similarity similarity APi 02 RSTGACTNMNW ggcctGTCAcc 5.142253 281 1.00 0.88 SREBPA1 02 KATCACCCCAC IgTCACc;cc~c 27.71.0802 285 1.00 O.1-1( TI401 NNNNGNRTCTGGMWTT ccccCCAGatclcaaa 16..134630 297 1.00 .i IRFK_01 SNAAAGY'GAAACC aaTTTCactttat 81.006772 311 1.00 0.137 IRF2_01 GAAAGYGAAASY aaTTI~actat 59.61531 I.00 I- N Kl 06 NTGCKNNNNNCCNNN gagt gaagccGCCAatI 2.A99282 341 1.00 0.93 NFY_06 YRRMCVTSRN cc9CCAA~tc 5.187369 350 to- OCTI_06 NNNNATGC'A/AFNAN e;caATTThccufgtag 1308,12 353 1.00 0417 0016 -C WIAW TK W SATRY N -1 a caU-IccAT1*GTa .4383641 353 -1.00 0 02 OCTI_07 TNTATGNTAtkTT AATrTccafgta 27.046261 355 1.00 0.60 CEBP-C NGWNTKNKGYAAKNNAYA aaactttgGCAATttccc 230615437 374 1.00 0116 NF1 06 NNTTGGCr4NNNNNCCNNN cttTGGCaa~lrcult 2.1990202 3771 1.00 0.95 NFKAPPAB65_01 GGGRATTTCC ggcaatTTCC 30.6691104 381 1.00 01.80 -CREL-01 SGGR14VVT' .CC gg(:-aTTCC 7.203414 381 1.00 0.86 NFKAPPARJOI GGGAMTTYCC gCaattT'CCC 03617382 1.00 0.05s IK 1NN.NTGGGAATrRCC' caaltTCC~tclc 14.853568 383 1.00 0.06 APIQ4 RSTGACTM.ANN tct,-GTGAgrc 11.246 1633 392 1.011 0.90 AP-IFJQ2 RSTGACTNMNWV tctctGTCAgc 7.895015 392 1 .00 0.95 API_02 RSTGACTIOMV tcO:-IGTCAgc 7.895015 392 r.009 ISRE-01 CAGTT-TCWCMTYCC caGTTTccrtatcgg 3811.173196 40V, 1.00 0.115 IKi-OF 01 -NINTGGGAATRCC cgTC ta 14.853568 405 1.00 0.87 GATA1 03 NNNNNGATAANNGN Itcc:,TATCggtat 2.031 644 400 1.00 0.9 1 GATAl_03 NNNNNGATAANNGN alcggTATCalgaa 2.031644 4-15 1.00 0.88 NFl_06 NNT'rGGCNNNNNNCCNNf4 IcatgaagcagGCCAc;ag 2.-199282 422 1.00 0.60 TATA-01 STATAAAWRNNNNNN aaaTAAafaacgaa 3965815 458 1.00 0U95 GFI11 NNNbJNNAAATCANNGNNNNNNWN aataiicgaA/%1Cagaatggcg9 2.3932:3 6 .009 VMYB 01 AAYAACGGNN aaLAACGaaa 3.427439 '16.1 1.00 0 1 AHRARNT-01 KNNKNNTYGCGTGCMS caggaalggCGTGctc 14.600483 475 1.00 0.93 API 04 RSTGACTNvIANNJ cc:TGACIcd.± 11.246163 503 1.00 APiQ2 RSTGACTNMNW crI'GACtc-ctc 7.895015 503 1.00 0.90 APIFJ_02 RSTGACTNION* ccTGAClcctc 7.895015 603 1.00 0.93 BARBIE-01 ATNNAAAGCNGRNGG tac.tlcCTTtgac '29452M1 532 1.00 00) AP1FJ 02RSGCMNI ttTGACteegg 7.805015 5'11 1.00 0.00 API _Q2 RST(3CSATfqKNW I UfGACtcegg 7.89%5015 541 1.00 0.110 2007240161 06 Dec 2007 132 Filtration Site Consensus Sequence Z score Position! Core Template (bp) similarity similarity AP I Q4 K3TGACTMANN ttTGACtccqg 1266 4 0 GCO1~~~. 11CGGGC~ic~GC~yc 3.2413 541 1.00 0.88 SPlOS6 NG(.GGGCGGGGYN ggagGGCGqccc 25.5290~2 55 .00.9.3 THTIE 70 NNNGNRTC1C(GGMWTT kIGctc(TGttcL 16.4346,10 565 1.00 0.08i AHRARNT, 01 I(NNKNNTYGCGTGCIS clgggagCGTGacj 14.600483 580 1 00o.n LYFi_01 TTTGGGAGR cItGGGAcjc 1747y _5811 1.00 0146 AP1FJ_02 R,;TGACTNMNVV (:gTGAcIt 1 C 17 051 501.0 A1 API-02 RSTGACTMN cgTGACIqc -7.895S01 i 589 1.00 08 AP1-Q4 RSTGACIMANN cgI GACullc It.246103 589 1.0 0 -0.V,9 IKI 01 NI.I.NTGGGAATRCC, tcagiTfCCc( ct 14 853568 613 '100 _0.
E47 01 NSNGCAGGTGKNCNN aaggc;cagCTGCaaai 9§.-74842-. 638- .9 AP4 06 CWCACCTGGN cCACGCIgca 21.2-11746 641- 100)-I AN4 05 NNCAGCTGNN j gcCAGCtnc 31080778 641 1.00 0.94 AP14 Q CVrjl(,GN gcc.aGCTGca 21.2417-16 641 -1.0 1 NNCAGCTGNN gccaGCTG ca 3.080778 641~ 1.0 -F OCI06 NNNNAI*GCAAATNAN uglcA~a 140.262- 1.00- U ANi Q2 tSTGA0TWMNW aaG~~a7.605011F 651 -1.00 0L.
APIFJQ2 RSTGACTNMNW aaTUiACac-aga 7.8 95-015 651 1.0 094 API 04 RSTGACI'MATh aa iGACacaga 1.24 61 FF 65-1 1.thi 0 E4_12 __~677 1.00 08 LMO2COM 01 S(-NAGGTGNNN ygcCTGggg 3016 7 .0-U~ MYOD 01 51RACAGGTGKYG ggcuiiCCTG99g 40.698075 670 it 0 00 AAYAACGGNN..........a 4274.39 69o 1.0 091 THI1F47.01 rNIMAGI4R TTGGM111VTT- 1':cgJCG 1 3.1 6 3 703: 1.00 O.W1 APIFJ 02 RSTGACTNMNW gcTGACrgIgg 5.142253 723 100 0.89 AN___Q2 P.STGACTNMAW gcl*GACcglgg 5125 721 1.06U 0.88 ZID 0*1 NGGCTrCYArCAYC gaccgtgGIGCr,, 41.225196 726 .001 BARBIE_01 AT[-INAAAGCNGRNcsG ccccaa9CT1iTaaac 29.452618 7§S0 1.00 (9 Gr 01 tJRGGG(;CGGGGCNK I ICaccCCC(;dI 35.P0311 767 1.00 0.9 (17 SP1-Q6 NGGGGGCGGGGYN 1 agctcCGCCcccl 25.529462 V68 1.00 0.95 CRL.0 SGRNW1CCaycTC 3.46/H58 79.3 j 1.0 Oq-. 02 THIE47 01 NNNNGNRTC-TGGMNTT ICtICCAGaccCza 0 16436079-7 10 00 CDP_02 NVWNATCGATTANYNN gccttualCGATagc____ 24.123980 811 -I.00 0.91 CLOX_01 PlNTATCGAIfANYN'VW 0qCC.IcatCGATa1Or 3 0.2405438 811 1.0 9 2007240161 06 Dec 2007 133 Filtration Site Consensus Sequence Z score Position! Core Template I (bp) similarity similarity GATAI 03 NNNN[JGATAANNGN tcaicGA IAgcccA 2.031644 815 1.00 0.88 NFI_06 NNTTG(.CNNA--NNNCCNNN t;gccctccaGC'CAa',C 2.199282 032 1.0 0J.93 NRF2_01 ACCGGAAGNS cc-,cTTCCagc 6180825 1.610 0 GMFiQ 14NNlNNNJ/VAT'CA\I~l.GNNNNNWiiNNI IIucagCcAAT~agctacgaggac 2.393233 828 1.00 0.80.- NFYC NCTGATTGGYTASY tccagCCAATcagc 65.593286 029 1.00 0.W) CAAT-0I NNNRRCCAATSA tccagCCAAtca 3.434507 829 1.00 0.99 NFY-0.6 TRRCCAATSRN cagjCCAAtcag 5.1873S9 B31 1.0,1 0.96 AP4_0Q6 CWCAGCTGGN gacgGCTGc 0 1i4.114396- 049 1.00 0.8f, 1110 NNT GGGAATRCC cgggtTCCCaIlg 14.853568 862 1 CI0 06 91 NFY 06 TRRCCAATSRN cccaTTGGlca 5.1 I/369 868 1 010 03.95 CAATO01 NNNRRCCAATSA cc;aTTGGtcact 3.434507 869 1.00 0.91 APIFJQ2 IRSTGACTNMJVV c~,i9GTCAtAt 7.8915 170.00 0.911 AP1-Q4 RSTGACTMANN cattgGTCAct 11.246163 870 1.00 0.91 AI 2RSTGACTNMNW cattgGTCAct 7.895015 870 1.00 0.91 01L17 01 NNCNANTCCCYNGRG,\RNtNKGN tjtcac3,TCCClagtqaatt'rt -77.977123 8275 1.00 I.6 IKI 1 WlNNN[GGGAATRCC tcactTCCCtaqt 14.853568' -876 10 0.87 SRY_02 NWWAAGAAWANN tiqcTT-GTtt9e R.P660390 905100.8 GFII_01 NNNNbJNAAArCANNGNNNN14NNN vtctgcg9aGATTIattgagg 2.393233 921 1.00 0.36 TATA-01 STATAAAVVRNrNNNNW C~jgga~aTTTAUg 3.9U5815 927 1.003 0.85 AP2_0.6 MKCCCSCNGGCG gaCCCGcagaca 12.284970 978 1.01) R TN1E47 01 NNNNG lRTCTGGlMWTT acalttglClGGd9CC 16.Z4630T9716 0089 NFI Q6 NNTTGGCNNNNNNCCNNN att16 Frtgg 7.GCCAcac 2.199(282 1 909 100(- 06 AP4 06 CWCAGCTGGN raCAGClcac 14.114396 '1004 1.00 0.08 AP40.6 CWCAGCTGGN ctcc(3CTGtt 14.114396 1040 1.00 0.80i TH1E47_01 NNNNGNRTCTGGMWTT cggiCCAGagtcatca 16.434630 -1052 1.00.086 VfAA*F1- NNlTGCTGACTCAGCANI-JN c;ggl cagaGTCAt-;atgg 168.-513881 1052 1. 00 0.87 APi 0Q2 RTANMWccagaGTCA4c 7.K0501.5 1056 1-6 09 APIQ4 RSTGAGTMANN ccagaGTCAtc 11.2416163 1056 1.00 0.90 Crsiiaiy>0.9 API FJ 02 RS'lGPCTlNMNWq ccacaGT~I 7.8)95015 1 056 10 09 CorsiXl5iy>.9 NNAACAATNN aaaCATaa 0.681190 -185 1.00 0.99 emltePmi4_tr> O 5 NNCAGCTGNN gggcGCTGtr; 0.508566 -1122 1.00 0.05 DELTAEFI_-01 NNNCACCTNAN gcaAGGTg.aa 0.538360 -1107 1.00 G.96 gilaGGGAgaag -0.854A442 -1083 1.00 0.91 CMBi(.JNNNNNGWCNGTTGf-IN aacagacauaGTT(;aalg 0.594660 -1065-' 1.00 j 0Y 2007240161 06 Dec 2007 134 Filtration Site Consensus Sequence Z score 1Position!/ Core Template J TSS (bp) Jsimilarity similarity GATAIO02 NNNNNGATANKGNN tgtgaGATga Cg 0.327-102 j GATAI 04 NNONGATARN4NN 9 i, GA-TAgatcg 0. r.5 31 60 -0113 9 L-[MO2COM_02 -___-NMGATrANSG gGATAgat 0.569272 .1039 0.91 1 2_01 tIINNY G GAWNN acjIrICCCICaC -0.85444 -1004 1.00 0.09 NNCAGCTGNN acCAGCtIce. 0,508566 -994 1.00.6 1K2-6- 01 NNYGGGAWNIIN catcTCCifcta -0.854442 -909 1.00 08 SOX5_01 NHAACAATNN cctaiC(ATcc 0.6511 o0 -8,19 1.00 0.86 GATAl-02 NNNNNGATANKGNN C tgatgg 0.90 0 1.0 0.9 GATAI-04 NUt-CWGATARNNWN ItagGATA~tggq 0.653180 -0 .000 LM02COM-02 NMGATANSG agGATAttg 0.569272 -806 1.00 0.92 DELTAEF-i_0 NNNCACCTNAN tgocACCTgat 0.538360 -794 1.00 0.97 MYO-DQ6 MICANCTGMY gc.CACCtgat 0.781061 -7931.0.9 SOUXS -01 -NNAAAATNN____ a aATGcta 0.681190 -779 1.00 0.06 1NNAACAATNN____ 991gaCAATIc 0.661190 691000.86 GATAl-02 NNNNNGATANKGNN 1 cacagGATAcavag 0.930257 -G65 1.00 0.00) GATAl-04 trINCVNGATARNNW-N ar-agGATAcaaag 0.653180 -664 1.00 0.8B LM 02CGM 02 NNMGATrANSG aqGATAcaa 0.5&J272 -(.828 -CESPB_01 BNRTKNNGMAAKNN accttgtGCAAac;c 1.460836 -n51 1.00 0.94 DELTAFFI01 NI0tNCACCTt4AN IccgACCTaaa 0.538360 0636 lAI)D 0.87 CEBBOIRNRTKNNGMAAKNN tc~Gcggt 1.460836.j -r320 1.00 0.87 iK2 01 fNN-YGGGAW-NI'N gacjgr'GCacat -0.854442 -1 .009 DELTAEF1_01 NNNCACCTNAN tcdgACCTcca 0.538360 -564 10 6F00! GATAI-02 t4NNNNGATANKGNN cwacaGATAtg(cca 0.930257 -6i56 1.00 0.93 -GATAI04 NNCWGATARIINNN cacaGA~gcca 0.653 1 k0 -55 9 LM02C0M_02 NMGATANSG (,aGATAtgc T.560272 j553 1.00 0.96 seO 01 NIMNYAATTN acccTAATaagcaiat -1.397287 -526 1.00 0.86 SB 01 NNNNNYAATTN ataagcaaATTAalita I-1.397287 -520 1.00 0.95 S8_01 NN[-NNYAATTN gcaaataATlTAaatt -1.397287 5b16 1. 0.97 N~NYATNaaatTAAkT~aaattta -1.972-87 -51 4 1.00 0.99 1K2 01 NNNYGGGAWNNN OaalCCagca-0542-6610.9 AP4_05 NNCAGCTONN cr;CAGCaCdc 0.086 411.00 0.85 NNCAGCTGNN (:aCAGCagtg 1.794672 -386 1.10 0.93 S8 01 NNNNNYAATrN ctctcaaaATlAaaaa -1.397287 7369 l 0.93 GNGGGGA M~tGGGGa 03.437162 -3-34 1.001 0.96) 2007240161 06 Dec 2007 135 Filtration Site Consensus Sequence Z score 1Position! Core Template (bp) -similarity similarity IK2_01 NNNYGGGAWNNN ctlgGGGAgagg -0.854442 -334 1.00 0.89 1K2_01 NNIIYGGGAWNNN tgIgjGCGAgc~tg 40 8541442 -305 060) 0.81T MZFOINGGGGAtgtGGGGa 0.437162 -305 1 00.99 AP4 Q5 NNACG UogaGC-TGcc 1.79,1672 -301 1.00O 0.91 MYOD 06 NNCANCTGNY cacaGGTGac 0.781061 -274 100 (.To DEaTAE-1 01 NNICACTIANccAGGTgacc 0530-74- 10-- CMYB.OI NNNNNNGFICNGTTGTNN cagjIgacccGTTGccc 0.504680 -272 1003 0.90 VMYB_02 NSYAACGGN ccCG"Tgtc 0.465812 -265 1.00 0.95 11(2_01 NN1IYGGGAWNN'N 9ltgTCCCcctc 542-- -6 .009 M Z F 6T NGNGGGGA tcIC-Ctc 0.437162 -258 1.00 00 IK(0 1WNN'G GAW1JNN cgtgTCCCagIg 0.854442 245 1.0 -j 0.93 NNCAGCTGNN !yCAGCaq~ r.016 -221 1.00 CMBI NNNNN NGNCNGTTGNN- CaggaatcctGTTGtccc 0 594660 -216 1.008 11(2_01 NNNYGG()ANNN gIIgTCCCIIIa -0.854442 -2-06' 1.00 0.91 AP4_05 -NNCAGCTGNN gogcjGCTGtg 0.508566 -175 1.00 0.00 11(2_01 WqNNYGG(SAWINNN gtygTCCCg.;t -0.854442 -168 1.00 09 DELTAEFI 01 NNNCACCTN'AN agc-AGGTgcaa 0.538360 131 110000 MAYODQ6 NNCANCTGNY agcaGTGva -0.147777 -131 I( GATA1 02 NNNNNGATAN'KGNIJ VIycaiTNCvata -0.01667-1 15 .0Os' GATAl-04 NNCWGATARNNNN tgcaaTATCcaat 0.653180 -125 1.00 0.87 LIA02COM_02 N.AASGcaaTATi Mca 0-123 1.00 0.92 NKX25_01 TYAAGTG cACITaa 1.619 33100oo 11(201 NNNYGGGAW14NN gCgcTCCCc(;gc -D.854442 -19 1.00 0.89 MZF1O01 NGNGGiGGA LCCCCCy9C 0.437162 -15 1100 0.96 VMYB 02 NSYAiNCGGN -aA~ 0.4582 4 .0 aAA ot 064.1 341.0.2 11(2 01 NNMYGGGAWVNN'N uaaaTCCCcga9 -0.8544_ 461.0.0 MZF1_01 NGNGGGGA tCCCCga9 1.679353 50 1.006.0 DELTAEFl-01 NNNCACGTNAN ggaAGGTggag 0.538360 63 1.00 0. .1 11(2_01 NNNYGGGAWNNN [(clg(-3GGAtgct -0.8541442 82 li) .0 MZF1 01 I4GNGGGGA tctGGGGa 0.437102 82 I.00 0.961 1ThCAGCTGNN gtgjgGCTdao 0.500566 105 5.008 ARNT-01 NNNNNCACGTGNNNNN tgWigcCACGttrcctg 0.51281 ill -100 0.00 1(2-01 NNI-NYGGGAWIJINN a.cgIt"CCClgat -0.854442 117 1.00 0.92? GATAI-02 NNNNNGATANI<GN(1 Ie(cIGATA;4Iltg 0.9)-30256Y 121 1.010 01 2007240161 06 Dec 2007 136 Filtration Site Consensus Sequence Z score Position/ Core Template (bp) similarity similarity GATA1 04 NNCWGATARNNNN ccclGAI'Aafttg 0.653180 122 1.00 0.95 SB_01___ NNIflNYWIN cIgaTAATtgggitt -1.397287 124 lAX) 0.95 LCM62COM_02 NMSGAIANSG cIGA-matt 0.669272 124 1.00 0.91 G AA C NGATAAGNMNN IGAIAAtttgg 1.411,097 125 1.00 0.90 MYOD Q6 NICANCTGH' 1 tLG(aGGTGc(; -0.14-7777 139 1.01) .0.91 DELTAEFI_01 NNNCACCTNAN tccAGGTgccI 0.538360 139 1.00 0.95 I120 NNNYGGGAV4NNN aclcTICCCtIg(c -0.854442 150 P :1.(o10- 0.0 GATAC -NGATAAGNMNN cltccTTATCs'. 1.4-11097 163 1.00 0.97 GATAI_04 I4NCWGATARNNNN ttcctTATCcttrc 0.653180 J164 1.0)0 0.94 GATAl_02 NNMNNGATANKGNN ttcctTAT(-cdL~c 0.930.5716 LmO2-~I002 NMGATANSG WTATCct 0.569272 160 1.00 0.96 CETSIP54 01 1ICEGGAVIGYN t(:otrCCGytj 1.244.487 .171 1.0)0 0.94 IK2-01 NNNYGGGAWHNN tccgGGGAgtjt -0.854442 il75 1.00 0086 fAZFi -61-'-G-W;NGGG tcCGGa 0.437162171.0.9 1K2 01 NNNYGG GAWNNWN 9IgtGGGAaaaa -0.65-1442 i31.00 0.91 NNGAGCTGNN -caCAGCgg; 1.794572 210 1 001.9 1JELTAEFI 01 NW.NCACCTNAN 9g(ACTci)fl 0.538360 224 1.00 .1 iK2_01 NNNYGGGAWNNN tcgaGGGAcfGIc -0.854442 231 1.00 0.90 CMYB-01 NZJ1NN14GNC1'JGTTGNN clgcCAACctacacctcc 0.594660 242 1.00 0.85 ELTAEFIOIi NNNCACCTMAN r'1acArCCTcca 0.538360 250 1110 0.94 IK2 01 NNNYGGGAWINNN agtgTGCCaclt -0.854442 260 1.00 0.93 MZF1 1 IGNGGGGA (-,CCCCacc 0,1137162 1 291 1.00 0.05 01 TYAAGTG cACTrta 1.519193 316 1 00 0.94 I- rINNYGGGAWNN-N a-,agTCCC(;gag -0.854442 332 1.00 0.88 MZFI_01 NGNGGGGA tCCCCgag 1.679353 -36 1(0009 CEBPB3_01 RN-T-KNNGMK4NN -aaICtttgGCAAt -1.460836' 375 1.00 0.90 1K2-01 NNNYGGGAVI'NNN aaQTCCCtCIr -0.85442 3B4 1.00 0.02 K-0 NNNYGGGAW1NN agttTCCCtatc -0.854442 406 1.00 0.94 GATAI-04 N1NCWRG :T\RNHNIA4 I 1kccTATCggta 0.653104950 0.93 GATAI_02 NNN14NGATANKGNN (t,-ccTATCgqIal 0.930257 4109 1.010 0.07 LM02COM 02 NAGATANSG cccTFATCg,3 0.569272 4111.0.9 GATAl-02 NNNNNGATANKGNN :ktc,3gTATCatga 0,9301257 415 1.00 0.92 GATA1 04 NNCWGATARNNNN atcggTATCatga I0.653180 415T 1.6000.91 LM2CM_0 NGATANSG eggTATCal 0.569272 41/ 2007240161 06 Dec 2007 137 Filtration Site Consensus Sequence Z score Position! Core Template ITSS (bp) similarity similarity CETSIPS4_01 NCMGGAWGYN cagillCCGgg 1,244487 445 1.00 0.92 MZE-1_01 NGNGGGGA cqgiGGa 0.13716245 0(.0 1K2 01 NNNYGGGAWNNN cgggGGGAaata -0.854442 451 1.00 0.01 1K2_01 N4NNY(.GGAVONN rcIy'TCCCgac -0135-442 4110 0.90 CETS iP S4 01 NCMG3GAWGYN tgacTCCGga 1.244487 543 1.00 08 CETS1 P54101 NCMGGAWGYN tcCGGAgggc 1248 4 .009 I K2 01 NNNYGGGAWNNIJ ccttGGGAgcgt -0 B544142 5811.011 0 92 I K2_ N NYGGGAWNNN cagtT'CCCatct -0O.854442 614 1.00 0.94 -CETSIP54OI1 NCMOGGAMIGYN, agacTCGC69-g 1248 659 '1.00 00 DELTAEFI_01 NNNCACCTNAN ggc:-ACCTggg 0.538360 679 1.00 0.95 -Y -1.-00 09 VMYB_02 NSYAACGGN gcgAAC(-ga 0.4-1581-2 639U 1.100 0.93 GAT4l_0-2- NN!-NNGATANKGNN tcatcGATAgccct 0.930257 815 1.00 09 GATAI-04 NCWGAT'ARI4NNN ca~cGATAgcI 06(53180 .816 100) 0.18 L-MO2COM_02 NMGATANSG tCGA~ 0.622 881.001 AP4 05 l0'JCAGCTGNN atCAGCtaeg 0_505_2 837__ 1.00 0.039 AP4_05 NNACGNgacgGCT~rcg 1.794672 840 1.00 0.91 1K2_01 NINNYGGGAWNNN gggt1'CCCattg -0.854412 863 1.00 0.97 IKY 01 r4NNYGGGAWNNN Cad-$ CCC:t;Igt F54442 8,77 1.00 0Yi2 NKX2S 01 TAG cACTTC 1.519193 877 1.00 0.08 IK2_01 NNt4YGI3GAWNNN (lgiGGGAgatt -0.8544/12 921- 1.00 0.139 NNCAGCTGNN ctCAGCccga 0.508566 946 1.0 0.87 [JNCAGG'NTN- caCAGCtcac 1.794672 .10 1.00 0.02 AP4 Q5 NNCAGCIGNM &0.cGC1'Gtt 1.7941672 1010 T.00 09 CETSIP54_01 NCMGGAWGYN tgttTCCGgt 1.244487 1046 1.00 0.95 HFI-01 NfAWTGTTTR-TTT aaacaAAAAaaa 55.365713 -1219 0.82 0.89 HNF3BOI1 NNNTRTTTRYTY aaaraAAAAaaa G. 1684/71 -1219 0.85 0.90() None HFI 12_01- NAWTGI'TTRTTT aaaaaAAAAaac 28.165126 -1215 0.82 0.08 (MtispctrHNF3B 01 NNNTRTTTRYTY a3?aaAAC;Aaaa 6.641-1211 0.901 008 default parameters) I4N~FRT a--AAca .(79 VMFP3B_01 NNNTRTT IRYI Y aaacaAMAcaa 9.407093 -11%6 0.85 0.89 HNF3B_01 NNNTRITTRYTY aaacaAAAAcaa 0.407093 -11q0 0.8.5 0.8q HF112Ui I-JAWTGI"TTRI*T- aaaacAATAaaa 28.165126 -1185 0.90 0.85 TATAC NCTATAAAAR ac.AtTAAAAa 8.1117712 -118 l98 0.93T 2007240161 06 Dec 2007 138 Filtration Site Consensus 1 Sequence Z score Position! Core Template J_ TSS (bp) similarity similarity VMYB 01 AAYAACGGNN ctcTG-Fict T.249 171 0.82 8 VMYB-0I AAYAACGG('II ayaAACAgac :.4274-39 -10;8 0.0 6.86 LM02COM_01 SNNGAGG'TGNNN araCAGTtgaal 1.242813 A(060 083 087 MYODOQG NIACGYcaraGTI'Gaa -0.147777 -1059 0.07 0.89 OCT1-06 CWNAWiTKVVSATRYN car-.agt~gaATGAa 8.438364 1059 08 8 VMTEO2 -NSYAACGGN -acAGTTgaa 0.Y465812 -1058 0.82- -0.88 GATA_C NGATAAG-NMI-IN a(3ATAGatc 99 1A I13097 -10380.9.0 CEBPB 01 RNRTKNNGMAAKNN gggtggaGAAAgag 1.460836 -1022 0.93 LM02COM_01 StZCAGG VGr'NI a',ic~aTCTGcaa 1.242813 -973 0.82 0.91 MYOD 06 NNCANCTGNY I tygCATCtgca -0.147777 -972 -0.92 (1.9 API-_; Nl'GASTrCAN ci*AACTCAc 1.430304 -940 0.86 0.87 API _C NTGASTCAN c(T-AACI'CAc 1.4303034 .940 0.85 0.87 PADS C NGTGGTCTC gGTGATcta 5.230232 -922 0.90 0.0 CEBp*C NGWNTKNKGYAAKNF4,AA (9gct[IgGAAA Fa~acc 23.6-1,r37 -,197 0.89 0.89~ CEBPB_-01 RNRTI(NNGMAAKNN g'rttgIGAAAtaa 1.460836 -896 0.093 VMAYB 01 AAYAACGGNN accAGTTI 3.427439 -802 0.838 (0.88 I IFH2 -01 NAATG'TTRTTT caO.TTTTt(M 2 9.1 Gi12 -680 0.02 86 CETSI P54_01 NOMGGAWGYN ttTCCAga 1.244487 -872 0.05 0.86 LMO2COMO01 SNWCACGGTGNNN -g ag(uaTCTGagja 1.2428313 -857 0.82 0.89 MYOD_06 NNCANCTGNY gaCATCtgag -0.147777 -856 0.02 (.1.9 SRYO02 f'JVWWAACAAWANN actaTrGAt~ct 3.860390 -824 0.81 0.85 CDPCR3HO_01 NATYGATSSS tattGATTct 2.349950 -822 0.89 0.93 OCTI-02 NNGAATATKCANNNN tcttaGGATaltggg 8.039815 -810 0.86 (0.86 GATA C NC;ATAAGNMNN' OGATATt9y 0 c 1.411097 -805 0087 0.066 USFQ6 GYCACGTGNC gcCACCtgat 13.858419 -793 0.82- 0.89 USFC NCACGTGN cCACCTga 0 OCO7-662 792 0. 66 0.93 O USF_06 GYCACGTGNC gu!ct(-TGgc 6.867788 -684 01 U2 .8 CETSiP54_01 NUAGGAWGYN gcAGGAgatg 1.447-676 0.93 0.88 USF_06 GYCACGTGNC yCAC,\ggaI 6.578B -670(6.
CETS1P54_01 NCIVGGAWGYN acAGGAtaci, 1.244487 -664 093 0.91 G&TAC NC3ATAAGNIONN gGATACaaaga 1.41 i097 -661 o.fo8 0.09 CETSIP54_-01 NCMGGAWGYN coAGGAaaIgj 1.24,1487 -578 0.83 0.92 GATAC NGATAAGNMNN aGATATgccat 1.411097 -552 0.87 0.94 OCTi 06 CWMAWTKWSATRYN gATATyccattocat B.438364 -551 0 94 2007240161 06 Dec 2007 139 Filtration Site Consensus Sequence Z score Position! Core Template (bp) similarity similarity LMO2COM_01 SNNCAGGTGNNN atyCATGtgtcc 1.242813 -539 0.82 0.89 11JSF_6 GY7 CTGf IgraITGIru 6.B57768 -538 0.86 .8 USFC__ NCACGTGN I gcATGTGt 0.507662 -537 0.88 0.93 USFO NAC(3GN CATGTgt 0-.-50 766-2 -5370.2.8 OCTIO.6 CWNAVWTKWSATRYN attattaaATT--a 8.4383634 512 0.89 0.903 TiA-TAC -N b TATAAAA-RR- aaTTTAAAAi 8.111772 -504 0.93 0.87 MYCMAX-02 ACCTNINtaAG~~ 3.4b.1-391 -480100.86 USF_06 GYCACGTGN'C ggCACAcgcrl. 666.8-577 -477 0.6c8 0.861 CETSI____ P50 NC(AGGAWGYl I Ic:AGGAggca A 1.24487 430309 PADS-c NGTGGTCTC ziGTGATtc. 5.230232. -'404 0w 900.1 CETSIP54-01 NCIAGGAWGYN' gjatTC gg 1 j447- 0 0.81508 CAAT 01 NN14RRCCAATSA cjtgagCCACiI 3.434507 -377 1 0.803 0.85 NFY_0Q6 TRRCCAATSRN 9a9CCACtctc 5.187369 -375 0.81 0.05 TrATAC NCTA AJAIAR ItTTTAAAAa 16.3.152 350- 0.1d3 0.8 TATAC NCTATAAAAR tTTT-TAAAaa 16.345002 -350 0.9300 AP2 06 MKCCCSCNGG-CG gcctIGGGGag 1 12.284970 -337 0,98 0.85 CETS1P54_=01 NCNIGGAWGYN acAGGAatgt 1.;.24,187 20.13.8 OCT1_06 CWNAWTKWSATRYN gCCATttcaagatg 18.439364 -294 0.83 0.8F0 CEBPBO1D R5NR TKNNGMAAK.NN ecaTl TCaaayItgI 1.460836 -293 .0.99 0.92 E4701 NSNGCAGGTGKNCNt- cicACAGgtgarccg 9.748242 -276 0.83 0.805 USF d6 GYCACfGTGNC ctCAGAggig 6.857788 -706 -0.86 08 6 USF 06 GYCACGTGN-,C cacaGGTGa 13.858419 -274 0 USFC rNCACGTC-N acAGGTGa 0.507662 -273 0.86 0.92 ARPI (11 TGtNGCYTTGA.JCOYW igaccGTfTGtcCCCI; 123.979855I -268 0 113 (.87 CETSIP54 01 NCMGGAWGYN 1 icAGGAatrc 1.244487 -217 0.93 0.00 CETS1P54 DI NCMGGAWGYN ggaaTC;CTgt 1.244487 -214 0.93 0.883 VMYBO- 1 AAYA.CGG1NN tccTGTTrjic 3.427439 -210 01 02 0.116 CEBPB 01 RNRTKNNGMAfNKNN cttaaGAAAccc 1.460836 -200099.9 USF -C NOACGTGN gcAGGTGr 0.507662 -130 0 060.9? -GATA C NGATAAGNMNN gtgcATATCc 1.t4-11097 -126 0.87 0.8n OCTI-02 N14GAATATKCAU,.NNN lgcaatTC Caatag 8.039015 -125 0.86 0.92 NFKBC NG.GGACTTTCCA gaaaATCCCc 4.4014 0 931 0r, NFKAPPABO GGGAMiTTYCC gaaaatCCCC 10.361187 45 0.90 0.8 r CETs P56:0, FICMGGAWGYN neTGGAga', 1.24.1487 740.5 I 00 2007240161 06 Dec 2007 140 Filtration Site f Consensus Sequence Z score Position/ Core Template TSS (bp) similarity similarity RFX1_01 NNGf M- UIN;RGYAACNN acqTTlC cgataaMl 7.1-i2878 117 UJ.88 CLES154111NCGGWGN ggICCjg1.244467 13.9 0.85 0.0( UF NC AC GTN tcAGGTGrc. 0.80662-E 140 086092 cEBB_0 I~RTNNGAAKN t~qGAA~a.,.1836 181 0.817 0 NI1 ,a2CM01- 216) 0.82 0 IM SNNCA6TGNN fIqcTC~l 1.24213 mAOQ (:CNCTGW)Y 0 ICAI C1OgI -0.141i777 217 0.02 0..33 CET31P54 01 tiItMGGAW&VV)N ;acc ICCAjI 1.244487 253 0,85 j 0 USF Q6 GYCACGTGNC 1 1ct;AGTGIC, 6.857788 __256 o. 0.86) -C EBPOI RNRT KNNMIMAAKNN cafr;1Gcaaaga 1.1608636 :?67 099 01 9.1 SRF Q6 GNCCA;VVATAVVGGMM lCAag~c 0030210709 GC 01 f'lRGGGGC;GCGGCNK ac~tCcc 35.L'05311 289 0.87 09 SP16 OGGGGGGY 'x:cCA~rcc 25.529462 290 0 62 0. 9 2 OCTI_06 CWNAWrKWSArRYN cAAATltcacttla 8.438364 309 0.89 0.05 -C-EBPBO1RRKN(1AN (l:IAA~a I403 1 09904 OCTI 05 MI(IATTTGC-AIAYY c-.caaM(CCA-Tqta 4.692353 0.5 CBP_0 R'RKNIGAA~mC~g1fTc~iicy 1-68,6 405 0.99 0 1(1 GAI'AC ljGATAAGN(MNN ttlc(;CTA*I C9 1I4 IOW09 408 0.89 0,93 GATA C NGATAAGNIANN ___LtcgGTATCajA1944(88.0 -US F =Q G F1 TGC-q. -CA C-A-jc.-a 68778 433 0(16 085 AP2 Q6 N4KCCCSCNGGCG t(lcygGGGGaa 12.284970 4468.6 OCTI 06 WHAWTKWSATRYIJ gkArA, aaeataacq k133C4 5 0.89 00 CERB 01 RfRTM NGGMAAKII 114 uGA~c 46-638 AP2 Q6 MKCCCSC14GGCG ctccggAGGGcg 12.284910 546 0.86 1 0.9 RFXI 02 NNGT81RCNI.IN1RGYAACNN tgq1-T7C(tggqagq 114!-15 574 0.88 18 FI 0 NNGTNRCNNRGCYAACNN tqOgflTCctlg.)ga~jc0 7.1728378 574 0.0 LM02C~OMi0 SNNCAGGI'Gr\NN (1cCAGCtqcaa 1.242W13 64 b.( LMO2COVI 01 SNNCAcGTGNN14 ggcc;aGCTrGcaa 1.2412813 6401.8)00 MYOD 06NNCANJCTG14Y gccaGC IGca 1.709098 641 0.92 0.91 MYOO Q6 N'NGAN I GNY gcCAGG~gca 1.70OP98 6J41 0.92 0.00 API_1 NTGASTCAN aTGACACAg 1.430304 652 0.86 005 USFQ6 GYCACGTGNC c CACCyjggg M(78 670. ,6 b USFCIB6 GYCACGTGNC, gf:cAccigg 13.B58,119 060.08 USFI 1;CTG' CCAC T 0762 681 0.80 0.92 RFI 2 Nl.JGTNR(Th1I-tJNRG7YAAOC 414 cilgg ;goacGC7AA;r9g A.741 Z05 1 2007240161 06 Dec 2007 141 Filtration Site Consensus Sequence Z score Position! Core Template TSS (bp) similarity similarity AP2_06 IAKCGCSCIAGGCGG agCCCCgagCCrc 12.284070 734 0.983 0.85 OCTI (6 HNNNATGCAAAT:4AN agaatgcAAACag,3 11 430842 781 0.B0 00 IANF38_01 NNNTRTTTRYT tgcaAACAggg 2T9 29 49 79 0.99 09-2 CETSIPS4 0 NC.MGGAkNGYN gic;ITCCAga -1.241-187 796 0.85 0.89 CDPCR3HD 01 NATYGATSSS ttCATCgata 2.349950 R 14 0.03 0.93 CDPCR3I-01 NATYGATSSS catcGATAgc 2.349950 8,16 0.84 0.95 GATAC NGATrAAO-NMNN cGATAG4cAI 11)97810.9.8 CETSIP54-01 NCMGGAWGYNJ ccctTCCAgc 1.244487 825 0.85 0-89 H-*F3B 61 NNNTRTI'TRYTY cd tTGTTtgccqj 2.929849 970.9 V) 0.88 CETS1 P5401 NCMGGAWGYN IttgTCCTgt 1.244487 1027 09q3 08 2007240161 06 Dec 2007 142 Table 3 Pomoter 54 u rmtscription Factor Cor Sntyi~ evNORR~riiianty 4_scr Posit orv I b(P flint corise1SL Ai0 O, 4,4 -569 GOUXPTAAA I IGLW.GAC I U I A(A NNN'g~ w-U .i '4i'ii B HNtF3B_03 0,9 0,85 'S 547 GATIGTIGIGCCC.A NNNTRTTTRYTY ABWA C- G;EBw01 u 0.85 -498 GRI cAG RTh(w#MI<IE1 Hman GEBP_01 -Ug 0.85 -469~ ATCTAGTGGAAC RN'RTN'qfJGMMI<J'I' NFl-06 1,00,86 zoo- -467- Gu7Gxr: CCGG NNTTOCNNNNNCNN AP4_05 0.90 1,68- -340 TCAG3(XGGT NNCAGWXTGNN -FP1 01 7F T-96260 GGGAIX1 (30 GGG'UMl1WC 1- W-Y. I 1,0 0,89 -106 CC(XOAAIN3C ]T1XCAATS'J A CIV01 1,00.88 -842 TTGQXJINAIATC)(JGAAI AI I Pq~NAA U -F'W3B 03 0,9 0,85 3.5-825 AACrAI I CATC N'4NfIIXYIY ABA C UF P_0.9 0.94 T -787 Ga]TcTGAaATTG Mouse 0 CB_10.91 1,7 -76 AITG CTAATG FM&TNAAKM' i 06 0.88 -6a88 w TTGTGGGC2JUUIGC'tJCCN11 0.93 -ZT -386 CACGAGTG t'5CJ.Ll(3'I' 1FKA00A 01-08 TTT2f7 GGGAGCUTGCCG rwy mi,. -156 I ATGGC II ULATS 143 Table 4 Oligonucleotides specific for the human ABCA7 gene Name Sequence Orientation ABCA7 U2 CTTCAGCCCGACCGTTG Sense ABCA7_AJ AGAATTTCATGTATCCCC Sense ABCA7 L12 CGATGGCAGTGGCTTGT-TTG- Antisense ABCA7 Ll GCGGAA.AGCA-GTG7:TGTTC.AC Antisense ABCA7 AL CTCGAGTTGCTGTCAGAG Sense ABCA7 AK GGGTAAAAGGTGTATCTGG Antisense ABCA7 AN 7CFACAGGACC2AATAAGAT-- Sense AB'- A~ AM T-GTCAGTGTCACGGAGTAG Antisense ABCA7 AP Ct-TO-SAAGCTGTGTGC Sense ABCA7 AC ACGGAGACGCCAGGAC Antisense ABCA7 AR GTCCTGGCGTCTCCc-TTC Sense ABCA7 CTICGTCC"AGGATAACAAC Antisense ABCA7_AT GTGCTc-CCCTPACACGOr7 Sense ABCA7 AS CAGTG"CCCAGCCCTGTAC Antisense AECA? 4V ACCICCAGAOTCTCCATiCC Sense A3CA; -A"J G A-A C CT C CG T AT C:G AC Antisense A~A X CTGCTCTCCGC.TGTT-C Sense ABC-t7 AWT 21OA CCATGTAPCAC Antisense I- c CTCPICTG)ACT Sense i AY CTTGCSTT vTOTTCCTC Antisense b AACCArCGGCTTCTCTCC- Antisense ~3CbT -rrCAGCAACGCA-AT'C CCes n TCCATZGTCCC Antisense T A± jT G A COATACT~ Antisense T C~C- CG AC AC TC CTC Antisense -CT G'CAA C Sentsens T. 4 C T SGTCA,; PO T GcAo Sense F ;::2CC:CATCTCAC: Sense

Claims (17)

1. Nucleic acid comprising a polynucleotide sD having at least 20 consecutive nucleotides having the nucleotide sequence chosen from the sequences SEQ ID No. 1-5, or a nucleic acid having a complementary sequence. (C 2. Nucleic acid having at least nucleotide identity with a nucleic acid according to Claim 1.
3. Nucleic acid hybridizing, under high stringency hybridization conditions, with a nucleic acid according to Claim 1 or 2.
4. Nucleic acid according to one of Claims 1 to 3, capable of modulating the transcription of a polynucleotide placed under its control. Nucleic acid according to Claim 4, comprising a polynucleotide ranging from the nucleotide at position -1 to the nucleotide at position -1111 relative to the first nucleotide transcribed, located at position 1112 of the nucleotide sequence SEQ ID No. 1.
6. Nucleic acid according to Claim 4, capable of activating the transcription of a polynucleotide of interest placed under its control. 7: Nucleic acid according to Claim 4, capable of inhibiting the transcription of a polynucleotide of interest placed under its control. 145 S8. Nucleic acid comprising: O a) a nucleic acid according to one of O Claims 1 to 7; and b) a polynucleotide encoding a polypeptide or 5 a nucleic acid of interest. \O
9. Nucleic acid according to Claim 8, CI characterized in that the nucleic acid of interest is an oligonucleotide of the sense or antisense type. Recombinant cloning and/or expression vector comprising a nucleic acid according to one of Claims 1 to 9.
11. Host cell transformed with a nucleic acid according to one of Claims 1 to 9 or with a recombinant vector according to Claim
12. Nonhuman transgenic mammal whose somatic cells and/or germ cells have been transformed with a nucleic acid according to one of Claims 1 to 9 or with a recombinant vector according to Claim
13. Method for screening a substance or a molecule modulating the transcription of the constitutive polynucleotide of the nucleic acid according to Claim 8, characterized in that it comprises the following steps: a) culturing a host cell transformed according to Claim 11; b) incubating the transformed host cell in the presence of the candidate substance or molecule; 146 c) detecting the expression of the O Spolynucleotide of interest; NO a) comparing the results of the detection obtained in step c) with the results of the detection 5 obtained by culturing the transformed host cell in the \O absence of the candidate molecule or substance. CI 14. Kit or box for the in vitro screening of a candidate molecule or substance modulating the transcription of the polypeptide of interest encoded by a constitutive polynucleotide of the nucleic acid according to Claim 8, comprising: a) a host cell transformed according to Claim 11; b) where appropriate, the means necessary for the detection of the transcription of the constitutive polynucleotide of interest of the nucleic acid according to Claim 8. Method of in vivo screening of a substance or molecule modulating the transcription of a constitutive polynucleotide of interest of the nucleic acid according to Claim 8, characterized in that it comprises the following steps: a) administering the candidate substance or molecule to a nonhuman transgenic mammal according to Claim 12; 147 b) detecting the expression of the U polynucleotide of interest in the transgenic mammal as IND treated in step a); c) comparing the results of detection of step 5 b) to the results observed with a nonhuman transgenic \O according to Claim 12 which has not received the CI administration of the candidate substance or molecule.
16. Kit or box for the in vivo screening of a candidate molecule or substance modulating the transcription of the constitutive polynucleotide of interest of the nucleic acid according to Claim 8, comprising: a) a nonhuman transgenic mammal according to Claim 12; b) where appropriate, the means necessary for the detection of the transcription of said polynucleotide of interest.
17. Substance or molecule modulating the transcription of a constitutive polynucleotide of interest of the nucleic acid according to Claim 8.
18. Substance or molecule according to Claim 17, characterized in that it is selected according to the method of Claim 13 or of Claim
19. Pharmaceutical composition comprising, as active ingredient, a substance or a molecule according to either of Claims 17 and 18. 148 C 20. Pharmaceutical composition according to O Claim 19, characterized in that it is intended for the O treatment and/or prevention of deficiencies in the metabolism of lipids, or in the mechanisms involving 5 the immune system and inflammation. \O
21. Substance or molecule according to CI either of Claims 17 and 18, as active ingredient for a medicament.
22. Method of detecting an impairment of the transcription of the ABCA7 gene in a subject, comprising the following steps: a) extracting the total messenger RNA from a biological material obtained from the subject to be tested; b) quantifying the messenger RNA for ABCA7 present in said biological material; c) comparing the quantity of messenger RNA for ABCA7 obtained in step b) with the quantity of messenger RNA for ABCA7 expected in a normal subject.
23. Method of detecting an impairment of the transcription of the ABCA7 gene in a subject, comprising the following steps: a) sequencing, from a biological material obtained from the subject to be tested, a polynucleotide located upstream of the site of initiation of transcription of the ABCA7 gene; 149 b) aligning the nucleotide sequence obtained O Sin a) with the sequence SEQ ID No. 1; IND c) determining the various nucleotides between the sequenced polynucleotide obtained from the 5 biological material of the subject to be tested and the \O reference sequence SEQ ID No. i. Cl 24. Kit or box for the detection of an Simpairment of the transcription of the ABCA7 gene in a subject, comprising the means necessary for quantifying the messenger RNA for ABCA7 in a biological material obtained from said subject to be tested. Kit or box for the detection of an impairment of the transcription of the ABCA7 gene in a subject, comprising the means necessary for the sequencing of a polynucleotide located upstream of the site of initiation of transcription of the ABCA7 gene in the subject to be tested.
26. Method of screening a molecule or substance modulating the transcription of the constitutive polynucleotide of interest of the nucleic acid according to Claim 8, comprising the following steps: a) incubating a nucleic acid according to one of Claims 1 to 9 or a recombinant vector according to Claim 10 with a candidate molecule or substance to be tested; S150 C b) detecting the complex formed between the Scandidate molecule or substance and the candidate ND molecule or substance.
27. Kit or box for the screening of a candidate molecule or substance modulating the transcription of the constitutive polynucleotide of C interest of the nucleic acid according to Claim 8 comprising: a) a nucleic acid according to one of Claims 1 to 9 or a recombinant vector according to Claim b) where appropriate, the means necessary for the detection of the complex formed between the candidate molecule or substance and said nucleic acid AVENTIS PHARMA S.A. AND INSERM WATERMARK PATENT AND TRADE MARK ATTORNEYS P22572AU01
AU2007240161A 2000-10-24 2007-12-06 Nucleic acid generating the ABCA7 gene, molecular modulating its activity and therapeutic applications Abandoned AU2007240161A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
FR00/13649 2000-10-24
US60/253,141 2000-11-28
AU2002210661 2001-10-17

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
AU2002210661 Division 2000-10-24 2001-10-17

Publications (1)

Publication Number Publication Date
AU2007240161A1 true AU2007240161A1 (en) 2008-01-03

Family

ID=38903078

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2007240161A Abandoned AU2007240161A1 (en) 2000-10-24 2007-12-06 Nucleic acid generating the ABCA7 gene, molecular modulating its activity and therapeutic applications

Country Status (1)

Country Link
AU (1) AU2007240161A1 (en)

Similar Documents

Publication Publication Date Title
AU738651B2 (en) Promoter for smooth muscle cell expression
Yoo et al. Amelioration of pancreatic fibrosis in mice with defective TGF-β signaling
JP5328603B2 (en) KCNQ2 and KCNQ3-potassium channel genes mutated in benign familial neonatal convulsions (BFNC) and other epilepsy
WO1999009049A1 (en) Hypoxia-regulated genes
KR20210131370A (en) Recombinant adeno-associated virus for the treatment of GRN-associated adult-onset neurodegeneration
CN112449605A (en) Trans-splicing molecules
EP1403367A1 (en) Method of testing drug for treating or preventing diseases such as hyperlipemia
US20030211533A1 (en) Method for testing a therapeutic or preventive agent for hyperlipidemia
AU721105B2 (en) Mammalian regulator of nonsense-mediated RNA decay
US7378274B2 (en) Regulatory nucleic acid for the ABC1 gene, molecules modifying its activity and therapeutic uses
AU2007240161A1 (en) Nucleic acid generating the ABCA7 gene, molecular modulating its activity and therapeutic applications
US20030077591A1 (en) Nucleic acid for regulating the ABCA7 gene, molecules modulating its activity and therapeutic applications
JP2004512045A (en) Nucleic acids that regulate the ABCA7 gene, molecules that modulate its activity, and therapeutic uses
AU5540800A (en) Nucleic and proteinic acids corresponding to human gene abc1
Kobayashi et al. GP2/THP gene family of self-binding, GPI-anchored proteins forms a cluster at chromosome 7F1 region in mouse genome
US20090036662A1 (en) Hypoxia-regulated genes
US7169764B1 (en) Promoter for smooth muscle cell expression
FR2822165A1 (en) New promoter of the ABCA7 gene, useful for identifying modulators of transcription and in gene therapy of e.g. disorders of lipid metabolism
TW202246499A (en) Modulation of aav-based gene expression
ZA200110545B (en) Nucleic and proteinic acids corresponding to human gene ABC1.
Rodriguez Characterization of the effects of reduced mutant huntingtin in the R6/1 model of Huntington's disease
US20050130916A1 (en) Nucleic acid coding for the cgl1 polypeptide and diagnostic and therapeutic application of said nucleic acid and of the cgl1 polypeptide
WO2006014798A2 (en) Methods and compositions for using sax2

Legal Events

Date Code Title Description
MK1 Application lapsed section 142(2)(a) - no request for examination in relevant period