AU767647B2 - htfIIIA human gene and coded hTFIIIA protein - Google Patents

htfIIIA human gene and coded hTFIIIA protein Download PDF

Info

Publication number
AU767647B2
AU767647B2 AU11645/00A AU1164500A AU767647B2 AU 767647 B2 AU767647 B2 AU 767647B2 AU 11645/00 A AU11645/00 A AU 11645/00A AU 1164500 A AU1164500 A AU 1164500A AU 767647 B2 AU767647 B2 AU 767647B2
Authority
AU
Australia
Prior art keywords
htfiiia
sequence
gene
protein
seq
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU11645/00A
Other versions
AU1164500A (en
Inventor
Florence Bordon-Pallier
Corinne Rocher
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Aventis Pharma SA
Original Assignee
Aventis Pharma SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Aventis Pharma SA filed Critical Aventis Pharma SA
Publication of AU1164500A publication Critical patent/AU1164500A/en
Application granted granted Critical
Publication of AU767647B2 publication Critical patent/AU767647B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • C07K14/4701Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
    • C07K14/4702Regulators; Modulating activity
    • C07K14/4705Regulators; Modulating activity stimulating, promoting or activating activity
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P35/00Antineoplastic agents
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy

Landscapes

  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Veterinary Medicine (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Animal Behavior & Ethology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Public Health (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Toxicology (AREA)
  • Zoology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Description

Human htFIIIA gene and coded htfIIIA protein The present invention relates to the gene coding for the human transcription factor hereafter called htFIIIA (or htfC2) gene and the coded htfIIIA protein, as well as the use of this htFIIIA gene and that of the coded htfIIIA protein in the diagnosis and identification of certain diseases related to the transcription mechanism.
Hereafter the gene coding for the transcription factor TFIIIA will be called tfIIIA (or tfC2) and the gene coding for the human transcription factor htfIIIA will be called htfIIIA.
The human htFIIIA gene codes therefore for the corresponding htFIIIA protein.
We will also use the following abbreviations below: AA for amino acids, NA for nucleic acids, bp for base pairs, DNA for deoxyribonucleic acid, cDNA for complementary DNA, RNA for ribonucleic acid, RNase for ribonuclease and C for deoxycytidine.
The term screening which indicates a specific screening technique and the term primer which indicates an oligonecleotide used as primer will also be used.
The tfIIIA gene and the corresponding tfIIIA protein are involved in the regulation of the biological transcription mechanism as indicated below.
Since the tfIIIA protein was purified as transcription factor for the first time in 1980 from Xenopus ovocytes [Segall et al, J. Biol. Chem., 255, 11986-11991 (1980)], work has been carried out in vivo and in vitro within the Xenopus to study the mechanism of transcription control exercised by TFIIIA. It has thus been shown that Xenopus TFIIIA is necessary for the initiation of the transcription of 5S RNA gene [Sakonji et al, Cell 19, 13-25 (1980)] and binds to a internal control region of the 5S RNA gene [Bogenhagen et al, Cell, 19,27-35 (1980)].
The nucleotide sequence of the cDNA of Xenopus TFIIIA and the corresponding amino acid sequence have already been published [Ginberg et al, Cell 39,479-489 (1984)]. It can be noted that this gene codes for a structure of 9 zinc fingers, a zinc finger corresponding to the repetition of the CYS2 HIS2 (C2H2) moiety. This zinc finger structure is considered an essential domain for a group of proteins which bind themselves to the DNA (DNA binding proteins) [Miller et al, Embo 4, 1607-1614 (1985)].
In this way transcription factors in human beings, binding to the DNA which also have this zinc finger structure such as for example XT1 of the Wilms human tumor gene, [Gessier et al, Nature, 343, 774-778 (1990)], the YY1 human transcription repressor [SHI et al, Cell, 67, 377-388 (1991)], the MAZ protein combined with the human MYC gene [Bossone et al, Proc. Natl. Acad. Sci., USA, 89, 7452-7456 (1992)] or also spl [Kuwahara et al, J.Biol. Chem., 29, 8627-8631 (1990)] are known.
Studies have been carried out in order to isolate the human htFIIIA gene, but until now none have led to discovery of the true sequence of the htFIIIA gene.
On one hand the studies described in the European Application EP 0704526 (Fujisawa et al), can thus be mentioned and are examined in the article: Arakawa et al (1995), Cytogenet Cell Genet 70, 235-238, which have led to a sequence that we will call Arakawa htfIIIA and on the other hand the studies described in the article: DREW et al (1995), Gene 159, 215-218, which have led to a sequence that we will call DREW htfIIIA.
These DREW and ARAKAWA htfIIIA sequences are represented in Figures 4 and respectively below. The documents indicated above therefore each describe a sequence of the htflIIA gene but these two sequences differ from one another by a few nucleotides and differ from the htflIIA gene of the present Application as indicated below.
The present invention has made it possible to isolate the gene coding for the human transcription factor hTFIIIA.
The present invention has also made it possible to reveal the nucleic acid sequence S 25 of the htfIIIA gene and also the amino acid sequence of the hTFIIIA protein coded by this gene.
Therefore a subject of the present invention is an DNA isolated S ee *oe 03/06/03,mc 12020.speci.doc,2 -3sequence of the htflIIA gene coding for a protein having the biological function of human transcription factor htfIIA.
A precise subject of the present invention is the isolated DNA sequence of the htfIIIA gene of human transcription factor hTFIIIA as defined above, coding for the amino acid sequence SEQ ID No2.
Such a SEQ ID n°2 sequence of the present invention therefore comprises 365 amino acids.
A subject of the present invention is also the DNA sequence of the htflIA gene as defined above, containing the nucleotide sequence SEQ ID No3.
A subject of the present invention is the DNA sequence of the htflIIA gene as defined above, containing the nucleotide sequence SEQ ID N°4.
A subject of the present invention is also the DNA sequence of the htflIIA gene as defined above, corresponding to the nucleotide sequence SEQ ID N°3.
The sequence SEQ ID No3 therefore comprises 1273 nucleotides.
A particular subject of the present invention is the DNA sequence of the htfIIA gene as defined above, corresponding to the nucleotide sequence SEQ ID N°4. The sequence SEQ ID N 0 4 therefore comprises 1213 nucleotides.
The sequence SEQ ID No1 represents the nucleotide sequence of the htFIIIA gene on the upper line according to the present invention i.e. SEQ ID N°3, and the corresponding amino acid sequence (AA) of this nucleotide sequence i.e. SEQ ID No2 on the lower line.
Figures 1 and 2 below represent the AA sequence coded by htflIIA of the present invention SEQ ID N 0 2 on the upper line, and the AA sequences coded by the DREW htflIIA genes, in Figure 1, and ARAKAWA genes in Figure 2 on the lower line respectively, these DREW and ARAKAWA sequences are as published in the documents S 25 referred to above.
Figure 3 below represents the comparison of AA sequences coded by the DREW and ARAKAWA htfIIA genes respectively with the AA sequence coded by Arakawa htfIIIA on the upper line and the AA sequence coded by DREW htflIIA on the lower line.
Figure 2 therefore shows, that the corresponding AA 03/06/03,mc 12020.speci.doc,3 sequence of htflIIA according to the present invention comprises differences from the AA sequence published in the ARAKAWA article or EP 0704 526, in particular in the corresponding positions 105 and 163, 156 and 214, 320 to 329 and 378 to 387 respectively, these positions being given in relation to the numbering indicated in Figure 2.
Figure 2 also shows that the AA sequence coded by htfIIIA of the present invention begins at position 59 of the AA sequence of Arakawa htfIIIA.
Figure 3 shows that the AA sequences coded by Arakawa and DREW htfIIIA comprise differences at the corresponding positions 214 and 154, 378-387 and 318-327 respectively, these positions being given in relation to the numbering indicated in Figure 3.
Figure 5 shows that the Arakawa htfIIIA sequence codes for a protein, the amino acid sequence of which, indicated in EP 0704 526, begins with the AA methionine specified by the ATG codon which is found in position 20-22 and the translation stops at a TAA codon. If the nucleotide sequence of htfIIIA according to the present invention SEQ ID N 0 3 is compared with the nucleotide sequence of EP 0704 526 i.e.
Arakawa htfIIIA shown in Figure 5 (sequence pl-12-13 of EP 0704 526), it can be noted that it lacks a C nucleotide in position 127 of the EP 0704 526 sequence. This additional C nucleotide results in a shift in the translation of amino acids of this nucleotide sequence: in fact, the ATG which is found in position 20-22 of the ARAKAWA sequence shown in Figure 5 and which is considered to be a start codon of proteinic synthesis by ARAKAWA, is therefore no longer in the same reading frame because of this shift. By taking into consideration this additional C nucleotide, the translation of AA reveals a TGA stop codon in position 57-59 of the ARAKAWA sequence shown in Figure 5. Consequently, the start codon of proteinic synthesis according to the present invention is located downstream of this stop codon.
Translation experiments in vitro of SEQ ID N 0 4 and expression tests in mammalian cells such as Cos cells have made it possible to identify the start codon of the proteinic synthesis of hTFIIIA according to the present invention.
This start codon of proteinic synthesis of hTFIIIA according to the present invention is the CTG codon in position 176-178 of SEQ ID N 0 3 (which would correspond to position 194-196 of the ARAKAWA sequence shown in Figure The coding section of the htFIIIA gene of the present invention begins therefore with this CTG codon which is found in position 176-178 of SEQ ID N 0 3 which should correspond to the AA Leucine and which in fact corresponds to the AA Methionine as this codon is recognised as a start codon (ref: David S. Peabody The Journal of Biological Chemistry, vol.
264, n°9, pp. 5031-5035, 1989).
Consequently, as Figure 2 shows, the ARAKAWA hTFIIIA protein is longer than the hTFIIIA protein of the present invention.
Furthermore, if the hTFIIIA protein of the present invention and the DREW hTFIIIA protein are compared (comparison shown in Figure it is noticed that the amino acid threonine in position 105 of the hTFIIIA protein of the present invention corresponds to an asparagine residue in position 103 in the DREW hTfIIIA sequence and that the two first AA, M and D of the hTFIIIA protein of the present invention have not been determined for the DREW hTFIIIA protein. The absence of codons specifying these AA and in particular the absence of the start codon of proteinic synthesis, does not permit the expression of this protein.
The DREW htfIIIA sequence shown in Figure 4 is therefore incomplete, and this is recognised by the authors of the publication referred to above (DREW et al on page 216 lines 39-41). It can be noted moreover, that the authors of this article also think that the start codon of the DREW htfIIIA sequence should correspond to a methionine coded by ATG as in the ARAKAWA sequence.
The htfIIIA gene according to the present invention is therefore different from the DREW and ARAKAWA htfIIIA genes (EP 0704526) and codes for a hTFIIIA protein, the AA sequence of which is different from that of the DREW and ARAKAWA hTFIIIA proteins.
Therefore a particular subject of the present invention is the DNA sequence of the htfIIIA gene as defined above containing the nucleotide sequence SEQ ID N 0 3.
A more particular subject of the present invention is the DNA sequence as defined above having the sequence beginning at nucleotide 176 and finishing at nucleotide 1270 of SEQ ID N 0 3.
One such sequence of the present invention therefore begins at a CTG codon and thus comprises 1095 nucleotides.
A subject of the present invention is the DNA sequence coding for the human transcription factor hTFIIIA as defined above as well as the DNA sequences which hybridize with it and/or show a significant homology with this sequence or fragments of it and coding for proteins having the same function.
By sequences which hybridize are included DNA sequences which hybridize with one of the DNA sequences above under standard conditions of high, medium or low stringency. By proteins with the same function are included polypeptides with the same transcription factor function. The stringency conditions are those carried out in conditions known to a person skilled in the art, such as those described by Sambrook et al (1989) Molecular cloning, Cold Spring Harbor Laboratory Press, 1989. Such stringency conditions are for example hybridization at 65 0 C, for 18 hours in a 5 x SSPE; x Denhardt's; 100pg/ml ssDNA; 1 SDS solution followed by washing 3 times for 5 minutes with 2 x SSC; 0.05 SDS, then washing 3 times for 15 minutes at 65 0 C in 1 x SSC; 0.1 SDS.
The high stringency conditions for example include hybridization for 18 hours at 65 0 C in a 5 x SSPE; 10 x Denhardt's; 100pg/ml ssDNA; 1 SDS solution, followed by washing twice for 20 minutes with a 2 x SSC; 0.05 SDS solution at 65 0 C followed by a final wash for 45 minutes in a 0.1 x SSC; 0.1 SDS solution at 65°C. Medium stringency conditions for example include a final washing for 20 minutes in a 0.2 x SSC, 0.1 SDS solution at By sequences which show a significant homology are included sequences with a nucleotide sequence with a similarity of at least 50 with one of the DNA sequences above and which codes for a protein having the same transcription factor function.
A subject of the present invention is also the DNA sequence as defined above comprising modifications introduced by suppression deletion), insertion and/or substitution of at least one nucleotide coding for a protein having the same biological activity as the human transcription factor htfilIA.
A particular subject of the present invention is the DNA sequence as defined above as well as similar DNA sequences which have a nucleotide sequence homology of at least or at least 60 and preferably at least 70 with the said DNA sequence.
Therefore a subject of the present invention is also the DNA sequence as defined above as well as the DNA sequences which code for a protein, the AA sequence of which has a homology of at least 40 and in particular of 45 or at least 50 rather at least and preferably at least 70 with the AA sequence coded by the said DNA sequence.
The gene of the present invention is represented as a single strand DNA sequence but it is understood that the present invention includes the complementary DNA sequence of this single strand DNA sequence, and also includes the so-called double strand DNA sequence constituted by these two DNA sequences complementary to each other.
The DNA sequence of the present invention is an example of the combination of codons coding for the amino acids corresponding to the amino acid sequence SEQ ID N'2, but it is also understood that the present invention includes any other arbitrary combination of codons coding for this same amino acid sequence SEQ ID N 0 2.
.The DNA sequence as defined above or this modified DNA sequence as indicated above, can be prepared according to techniques known to a person skilled in the art and in particular those described in the book by Sambrook, J. Fritsh, E. F. Maniatis, T.
25 (1989) entitled: "Molecular cloning: a laboratory manual Laboratory, Cold Spring Harbor NY. In particular the DNA sequence above can be a oo •.oo *oo.
.oooo 23/ 0 9/03,swl 2020spa,7 cDNA sequence obtained by identification of the 3' and parts of the coding sequence, then amplification of these parts using a DNA polymerase such as pfu polymerase or other DNA polymerases. The introduction, into the oligonucleotide sequence used for PCR, of restriction sites such as Hind III or SmaI allow the cloning of these fragments in appropriate vectors and then the restoration of the sought complete sequence. A detailed description of the operating conditions in which the present invention was carried out is given below.
A quite particular subject of the invention is the polypeptide having the function of human transcription factor hTFIIIA and having the amino acid sequence SEQ ID N 0 2 coded by the DNA sequence as defined above and the analogues of this polypeptide.
By analogues is understood the polypeptides the amino acid sequence of which has been modified by substitution, suppression or addition of one or more amino acids but which retain the same biological function. Such polypeptide analogues can be produced spontaneously or can be produced by post-transcriptional modification or also by modification of the DNA sequence of the present invention as indicated above, by using techniques known to a person skilled in the art: amongst these techniques, the directed mutagenesis technique 25 (Kramer, et al., Nucl. Acids Res., 12, 9441 (1984); Kramer, W. and Fritz, Methods in Enzymology, 154 350 (1987); Zoller, M.J. and Smith, M. Methods in Enzymology, 100.468 (1983)) can in particular be mentioned.
Modified DNA synthesis can also be carried out by using well-known chemical synthesis techniques such as the phosphotriester method for example [Letsinger, R.L and Ogilvie, K. Am. CHEM. Soc., 91.3350 (1969); Merrifield, Sciences, 150, 178 (1968)] or the phosphoamidite method [Beaucage, S.L and Caruthers, M Tetrahedron Lett., 22, 1859 (1981); McBRIDE, L.J. and Caruthers, M.H. Tetrahedron Lett., 24 245 (1983)] or also by the combination of these methods.
The polypeptides of the present invention can therefore be prepared by techniques known to a person skilled in the art, in particular partially by chemical synthesis or also by cDNA synthesis by expression in a procaryotic or eucaryotic host cell as indicated below.
A particular subject of the present invention is the process for the preparation of the recombinant htFIIIA protein having the amino acid sequence SEQ ID N 0 2. This process includes the expression of the DNA sequence as defined above in an appropriate host, then isolation and purification of the said recombinant protein.
To produce the polypeptide of the present invention, recombinant DNA techniques using genetic engineering and cell culture methods known to a person skilled in the art can in particular be used. The following stages can therefore be carried out: firstly preparation of the appropriate gene, then incorporation of this gene into a vector, transfer of the gene carrier vector into an appropriate host cell, expression of the gene and finally purification of the protein coded by this gene.
The DNA sequences according to the present invention and in particular SEQ ID N°3 or SEQ ID N 0 4 can be prepared according to techniques known to a person skilled in the art, in particular by chemical synthesis, by screening of a gene bank or a cDNA bank using oligonucleotide synthesis probes using S 25 known hybridization techniques or also by reverse transcriptase from messenger RNA (mRNA) The advantage of the technique comprising firstly the isolation of mRNA by extraction of the total RNA then the synthesis of cDNA from this mRNA by reverse transcriptase particularly rests on the fact that the mRNA does not contain introns even though these non-coding sequences are present in the genomic DNA.
The following procedure can in particular be carried out.
Firstly the total RNA originating from a cell line such as for example the Raji cell line (RNA Plus, BIOPROBE) is extracted, and from this RNA, synthesis of the sought cDNA is then carried out, in particular by using a kit such as the RNA PCR kit (Perkin Elmer) It can be noted that within the scope of the present invention, two oligonucleotides located at the ends of the htfIIIA coding sequence published by ARAKAWA (Figure 5) were synthesized i.e. OLT5 and OLT3 and are defined as follows: OLT5: 5' CGGGGTACCAAAA ATG CGC AGC AGC GGC GCC GAC 3' i.e.
SEQ ID N°5 and OLT3: 5' CGGTCTAGA TTA GCC AAG GGT AAG TAC TGC 3' i.e. SEQ ID N 0 9 but these two oligonucleotides have not made it possible to obtain an amplification product by PCR.
Thus, within the scope of the present invention, the hTFIIIA coding sequence was isolated in two stages: firstly identification of the 3' part then identification of the part.
After identification of the 3' and 5' parts, a HindIII restriction site located on each of these fragments then made it possible to restore the complete sought sequence as indicated below in the experimental part.
The following process was then carried out: The 3' part was amplified using pfu polymerase (STRATAGENE) using the OLT5.2 and TFIIIA 3'SmaI oligonucleotides as primer i.e.: OLT5.2: 5'TCCTTCCCTGACTGCAGCGCC 3' or SEQ ID N 0 6 and TFIIIA3'SmaI: 5'CCT CCC GGG GCC AAG GGT AAG TAC TGC AAC 3' or SEQ ID N 0 The amplification primers are chosen as a function of the part to be amplified according to the usual criteria of a person skilled in the art.
The primers used in the present invention were chosen in the Arakawa htfIIIA sequence shown in Figure The sequences SEQ ID N 0 6, SEQ ID N 0 7 and SEQ ID N 0 8 are located in positions 320-340 361-380 (reverse and complementary sequence) and 391-410 (reverse and complementary sequence) respectively of this Arakawa htfIIIA sequence.
The sequences SEQ ID N 0 5, SEQ ID N 0 9 and SEQ ID N 0 10 are located in positions 20-40 1271-1291 (reverse and complementary sequence) and 1268-1288 (reverse and complementary sequence) respectively of this Arakawa htfIIIA sequence.
It can be noted that sequences SEQ ID N°5, SEQ ID N 0 9 and SEQ ID N 0 10 contain sequences corresponding to the restriction enzyme sites i.e. KpnI, XbaI and SmaI respectively.
The oligonucleotide TFIIIA 3' SmaI introduces a restriction site SmaI downstream of the coding sequence.
This site permits, if necessary and if required, the fusion of the coding sequence for hTFIIIA with a coding sequence for a hemaglutinin epitope peptide designated TAG HA The expression of the coding sequence for TFIIIA can therefore be combined with that of the coding sequence for TAG HA which can be detected by Western blot analysis, if the fusion gene is expressed.
For identification of the 5' part, this region was isolated by the 5' anchored PCR (5 race System, GIBCO BRL; pfu polymerase, STRATAGENE) technique. Two successive PCR's were carried out using the following oligonucleotides as primer: UAP and TFIIIAPCR5' for the first PCR and UAP and TFIIIA SEQ2 for the second PCR.
UAP is an oligonucleotide provided in the kit.
These oligonucleotides have the following sequences: TFIIIAPCR5': 5' CACAAACAAATGGTCTCC 3' or SEQ ID N 0 8 TFIIIA SEQ2: 5' TGCACAGGTGCGCGTCAAGC 3' or SEQ ID N 0 7.
S 25 The products of these PCR's i.e. the amplified 5' and 3' fragments are then purified on agarose gel and cloned using the TA cloning kit (INVITROGEN). Sequencing is then carried out: the plasmid DNA of several independent clones is prepared (QIAGEN Plasmids KIT) and the fragments corresponding to the coding sequence of hTFIIIA are sequenced on the two strands (ABI 377XL sequencer, PERKIN ELMER).
The following process can then be carried out according to usual cloning techniques known to a person skilled in the art and in particular cloning by insertion of each fragment into a plasmid provided with the commercial kit (TA cloning Kit Invitrogen), then transformation of a bacterial strain by the plasmid thus obtained is then carried out. The XLl Blue E. coli strain can in particular be used.
The clones are then cultured in order to extract the plasmid DNA according to standard techniques known to a person skilled in the art referred to above (Sambrook, Fritsh and Maniatis).
Sequencing of the DNA of the amplified fragment contained in the plasmid DNA is carried out.
The compilation of the sequence data thus obtained reveals that in the main part of the isolated sequence corresponds to the htfIIIA sequence of DREW et al.
In the longer sequence starts in position 80 of the htfIIIA sequence of Arakawa et al., shown in Figure 2F, and reveals the insertion of a C nucleotide in position 127 in relation to this sequence. If it can be supposed that the synthesis of the cDNA in the application of the technique described above is not complete, the insertion of a nucleotide nevertheless creates a major problem. In fact, the addition of a nucleotide in the coding sequence creates a shift in the reading frame. In order to verify the presence of this nucleotide in the htFIIIA gene, human genomic DNA was analysed by PCR. This DNA was subjected to a PCR reaction using pfu polymerase (STRATAGENE) or Taq polymerase (Perkin Elmer) using the oligonucleotides OLT5 and TFIIIA SEQ2 called SEQ ID N 0 5 and SEQ ID N 0 7 respectively as primer. The two PCR products were cloned (TA cloning Kit) then sequenced.
Analysis of the sequence data confirms the presence of this additional C nucleotide in relation to the Arakawa htfIIIA sequence for these two amplifications. The ATG initially described as start codon of proteinic synthesis for Arakawa htfIIIA can therefore no longer be considered as such.
The assembly of 5' and 3' sequences is then carried out and a unique plasmid containing the sought hTFIIIA sequence of the present invention is obtained. The complete hTFIIIA coding sequence is restored in the following manner. A clone originating from the amplification of the genomic DNA is digested using the restriction enzymes EcoRI and HindIII, and after purification, a fragment of approximately 350 bp is obtained. Furthermore, a clone originating from the amplification of the 3' part using the restriction enzymes HindIII and SmaI is digested and after purification, a fragment of approximately 930 bp is obtained.
The ligation of these fragments in the plasmid pYX223 (expression vector for the yeast R§D) previously digested by EcoRI and SmaI is then carried out.
A detailed account of the conditions under which the operations indicated above can be carried out is given below in the experimental part. A plasmid is thus obtained in which the gene of the present invention is inserted and this plasmid introduced into a host cell is also thus obtained by operating according to the usual techniques known to a person skilled in the art.
The polypeptide of the present invention can be obtained by expression in a host cell containing the DNA sequence coding for the polypeptide of the invention preceded by a suitable promoter sequence. The host cell can be a procaryotic cell, for example E. coli or a eucaryotic cell such as yeasts, such as for example ascomycetes amongst which are Saccharomyces cerevisiae or also mammalian cells such as Cos. cells A particular subject of the present invention is an expression vector containing a DNA sequence as defined above.
Thus, such an expression vector according to the present S 25 invention contains a DNA sequence which can be the nucleotide sequence SEQ ID N 0 3 or the sequence beginning at nucleotide 176 and terminating at nucleotide 1270 of SEQ ID N 0 3.
Such an expression vector according to the present invention can also contain the DNA sequences which hybridize with the sequences defined above, and/or show a significant homology with these sequences or fragments of them.
Such an expression vector according to the present invention can also contain DNA sequences which comprise modifications introduced by suppression, insertion and/or substitution of at least one nucleotide coding for a protein with the same biological activity as the human transcription factor hTFIIIA.
Expression vectors are vectors allowing the expression of the protein under the control of an appropriate promoter.
Such a vector can be a plasmid, a cosmid or viral DNA. For the procaryotic cells, the promoter can be for example the lac promoter, trp promoter, tac promoter, P-lactamase promoter or PL promoter. For yeast cells, the promoter can be for example PGK promoter or GAL promoter. For mammalian cells, the promoter can for example be SV40 promoter or adenovirus promoters. Baculovirus type vectors can also be used for expression in insect cells.
The host cells are for example procaryotic cells or eucaryotic cells. The procaryotic cells are for example E.
coli, Bacillus or Streptomyces. The eucaryotic host cells comprise yeasts as well as cells of higher organisms, for example mammalian or insect cells. The mammalian cells are for example fibroblasts such as CHO or BHK hamster cells and Cos monkey cells. The insect cells are for example SF9 cells.
The present invention therefore relates to a process which comprises the expression of the htFIIIA protein in a host cell transformed by a DNA coding for the polypeptide sequence corresponding to sequence SEQ ID N 0 2.
For the implementation of the present invention, the vectors used can for example be pGEX or bpAD and the host cell can be E. coli or for example the vector pYX223, and the host cell can also be S. cerevisiae.
A particular subject of the present invention is a host cell transformed with a vector as defined above, containing the htfIIIA gene according to the present invention.
A very precise subject of the present invention is the plasmid deposited at the CNCM under the number 1-2071.
It thus concerns the XL1-Blue/bpShtfc2LHA strain containing the htfIIIA gene according to the present invention.
The operating conditions in which the present invention was carried .out are described below in the experimental part.
The hTFIIIA protein coded by the htfIIIA gene is therefore a transcription regulation factor. In fact, the hTFIIIA protein coded by the gene of the present invention has a biological role as a protein binding to the DNA and the product of this gene is useful as transcription regulation factor.
In particular, the gene of the present invention is expressed in different tissues and probably plays an important role in the initiation of the transcription of the 5S ribosomal RNA gene, and in maintaining the stability of the transcription of other genes in particular involved in control functions.
A very large number of diseases accompanying a transcription control disorder have recently been brought to light. It has therefore been noted that certain oncogenic products act as transcription regulation factors and can lead to canceration of cells such as for example in certain leukaemias or also that production of the regulation factor Hox2-4 in too great a quantity induces leukaemia in mice.
Furthermore, in some hereditary diseases, the protein concerned can in itself be normal, the pathogenicity results from the transcription mechanism of the gene coding for this protein. In particular, many hereditary diseases show an abnormality in the quantity of proteins synthesized which is probably due to a disorder in proteinic synthesis which can in particular bring into play the htfIIIA gene and the coded protein as factors involved in the control of the transcription of 5S RNA.
The gene of the present invention can thus be used for the research into abnormalities in the transcription of genes, and in particular in the identification of hereditary diseases for the study of diseases implicating regulation factors and in particular the protein coded by htfIIIA.
The gene of the present invention can also be used for the treatment of certain diseases through transcription control or in the analysis of the pathogenics of these diseases.
The present invention therefore envisages the use of the htfIIIA gene of the present invention and the hTFIIIA protein of the present invention to contribute in particular to the understanding of the transcription mechanism in human beings and also to contribute to the understanding, in the diagnosis and treatment of diseases linked to a disturbance in the transcription mechanism. Thus hTfIIIA and the htFIIIA protein could be used in the diagnosis or identification of hereditary diseases such as certain cancers or of other diseases resulting from abnormal transcription control.
These factors can also be useful in the analysis of the transcription regulation mechanisms.
Therefore a subject of the present invention is the use of the DNA sequence of the gene of the human transcription factor htfIIIA or of the polypeptide having the function of human transcription factor coded by the said DNA sequence as it is defined above, for the preparation of compositions useful in the diagnosis or treatment of diseases linked to a disorder in transcription control.
Such compositions are prepared under the usual conditions known to a person skilled in the art.
A more precise subject of the present invention is the use as defined above in which the disease concerned is cancer. Figures 1 to 5 below show the following illustrations. Figure 1 represents the comparison of the hTFIIIA protein of the present invention with the DREW hTFIIIA protein.
Figure 2 represents the comparison of the hTFIIIA protein of the present invention with the ARAKAWA hTFIIIA protein.
Figure 3 represents the comparison of the DREW hTFIIIA protein with the ARAKAWA hTFIIIA protein.
Figure 4 represents the DREW htfIIIA sequence and the corresponding hTFIIIA protein.
Figure 5 represents the ARAKAWA htfIIIA sequence and the corresponding hTFIIIA protein.
The sequences indicated in the present invention SEQ ID N°1 to SEQ ID N'10 are described below.
The experimental part below allows the description of the present invention without however limiting it.
Experimental part Example i: cloning and sequencing of the hTFIIIA gene I) Extraction of total RNA originating from the RAJI human cell line (RNA Plus, BIOPROBE) The RAJI human cell line was chosen as a source of total RNA.
17 The RAJI cells used were cultured under the usual culture conditions for this line known to a person skilled in the art.
To extract the total RNA of these cells a standard protocol is carried out using RNA Plus (BIOPROBE SYSTEMS) commercial extraction solution.
Then the following is carried out: a) homogenization: The cells cultured in suspension are pelleted without being washed beforehand in order to avoid the risk of degradation of the mRNA then are lysed by adding the extraction solution of the RNA Plus kit at a rate of 6 ml per 107 cells. The samples of homogenate obtained can be stored at 70 0C.
b) extraction of the RNA: After homogenization, the homogenate obtained in a) above is left at 4 0 C for 5 minutes in order to allow the complete disassociation of the nucleoproteinic complexes then 0.2ml of chloroform per Iml of the RNA Plus solution is added, as above in the medium is agitated vigorously for 15 seconds and left to rest in ice for 5 minutes, followed by centrifuging at 12000 g and at 4 0 C, for 15 minutes.
Two clearly visible phases then form: the DNA and the proteins are found in the organic phase (lower phase) and at the interface. The RNA is in the aqueous phase (upper phase) which represents approximately 40 to 50 of the total volume.
c) Precipitation of the RNA: The aqueous phase obtained in b) is transferred into a new tube, a volume of isopropanol is added and the sample is placed at 40C for 15 minutes, followed by centrifuging for minutes at 40C and at 1200 g. A precipitate is obtained which forms a yellow-white pellet at the bottom of the tube.
d) Washing the RNA: The supernatant of the solution obtained in c) is eliminated then the pellet is washed with a 75 ethanol solution using at least 0.8 ml of ethanol per 50 to 100 micrograms of RNA.
The medium is mixed (vortex), centrifuged for 10 minutes at 7500 g at 40C and dried under vacuum. The RNA obtained is then taken up in 60 microlitres of Tris 10 mM EDTA 1 mM II) Synthesis of cDNA a) Reagents used: The commercial kit Gene Amp® RNA PCR Kit (Perken Elmer) was used for this cDNA synthesis.
By using this kit, the reverse transcription of RNA to cDNA is firstly obtained by reverse transcriptase MuLV (Murine Leukaemia Virus). An RNase inhibitor isolated from human placenta is included in order to inhibit certain mammalian RNases. The fragments of cDNA are amplified by polymerase chain reaction (PCR). The enzyme used for this reaction is S pfu polymerase (Stratagene).
The term dNTP designates the dGTP, dATP, dTTP and dCTP nucleotides.
The term PCR Buffer designates the solution containing 500 mM KC1 and 100 mM HC1 at pH 8.3.
The term Oligod(T)16 designates a nucleotide sequence constituted by 16 dTTP nucleotides.
Oligonucleotides are used as primers in the technique described below.
The concentrations indicated below represent the final concentrations in the reaction medium.
b) Synthesis of the cDNA by reverse transcription: S 25 2 microlitres of the total RNA (1 microgram) obtained above in l)d) are pre-incubated at 65°C for 5 minutes, then 8 microlitres of the following reaction solution: 5mM MgC12, IxPCR buffer, 1 mM of each dNTP, 5 of DMSO, 1 U/microlitres of RNase inhibitor, 2.5 U/microlitres of reverse transcriptase MuLV, 2.5 microlitres of oligo(dT)16 is added.
The solution is then incubated at 42 0 C for one hour, then at 99 0 C for 5 minutes then at 50C for 5 minutes.
III) Amplification by PCR, cloning and sequencing of the 3' and 5' nucleotide sequences a) Reaction conditions: Escherichia coli coli) XL1- Blue type K12 (Stratagene) bacteria was used for the preparation of the plasmids of the present invention.
Growth of this bacteria was carried out according to the usual conditions in LB liquid medium which contains 10 g of bactotryptone, 5 g of yeast extract and 10 g of NaCl per litre of water and which also contains 100 microg/ml of ampicillin (SIGMA).
The colony was removed onto a solid LB agar ampicillin medium then cultured in 100 ml of LB medium and incubated to OD (600nm) 0.8.
The incubation was carried out at 37 0 C under a normal atmosphere and agitation at 225 rpm.
The viability of the strain is verified when the strain grows on LB ampicillin medium at 100 microg/ml, the insert S containing a gene for resistance to ampicillin bla.
It can be noted that a gene for resistance to ampicillin bla is part of the vector of the kit (TA cloning Kit Invitrogen) in which the fragments of htfIIIA are cloned.
Thus, selection of strains containing the plasmids containing the htfIIIA gene of the present invention can be carried out by culture of the strains in this medium which contains ampicillin (100 microg/ml), such a medium allowing the survival only of strains which contain the gene for resistance to ampicillin and therefore only strains which contain the htfIIIA gene of the present invention.
For the preservation of the strains obtained, 15 glycerol 25 is added to the culture medium: the cultures are therefore preserved in the LB 100 micrograms/ml of ampicillin 15 of glycerol at the bacterial concentration of OD (600nm) 0.8 suspension medium in the form of aliquots in cryotubes of 1 ml per tube.
For the sequencing, the plasmid DNA of several bacteria originating from each of the cloning procedures indicated below is prepared using a commercial kit (Qiagen Plasmids kit). The fragments corresponding to the htfIII coding sequence are sequenced on the two strands according to standard techniques known to a person skilled in the art (use of the sequencer ABI 377 XL, Perkin Elmer) b) Amplification by PCR, and cloning of the 3' and nucleotide sequences: 1) Amplification and cloning of the 3' nucleotide sequence Two amplification primers (primers) were chosen according to the published ARAKAWA HTfIIIA sequence. These OLT3 or TFIIIA3'SmaI and OLT5.2 primers are called SEQ ID N 0 10 and SEQ ID N 0 6 respectively.
These oligonucleotides are chosen from the hTFIIIA sequence published by ARAKAWA (Figure 5) and are synthesized according to standard methods known to a person skilled in the art.
The TFIIIA3'SmaI oligonucleotide introduces a restriction site SmaI downstream of the coding sequence. This site will allow the fusion of the htfIIIA 3' nucleotide sequence with a coding sequence for the hemaglutinine TAG peptide.
Thus, the peptide resulting from the expression of the cloned sequence will therefore consist of both the htfIIIA sequence of the present invention and that of TAG HA and can therefore be detected by Western analysis according to usual techniques known to a person skilled in the art.
The following process is then carried out: 2 microlitres of cDNA obtained above in II) b) is added to 50 microlitres of the following reaction solution: 2mM MgC12, IxPCR buffer, 200 nanograms/ml of each dNTP, the TFIIIA3'SMAI and OLT5.2 primers at a rate of 0.15 micromoles/l for each, 5 DMSO and U AmpliTaq DNA polymerase.
The cDNA is thus subjected to 30 PCR cycles firstly at 940C for one minute then at 65 0 C for 1 minute then at 72 0 C for 1 minute.
The products amplified by PCR thus obtained are therefore 3' fragments of approximately 970 base pairs.
The 3' fragments obtained above are cloned in the pCRII vector using the TA cloning Kit (Invitrogen) The plasmid thus obtained is called 5.2 Raji 2.9.
This plasmid is transferred into the XL1 Blue E. coli strain.
The E. coli strain transformed by the plasmid 5.2 Raji 2.9 is thus obtained.
2) Amplification and cloning of the 5' nucleotide sequence The 5' portion of the htfIIIA gene of the present invention was isolated using the said 5' anchored PCR technique using a commercial kit (5'RACE System, Rapid Amplification of cDNA Ends, GIBCO BRL).
Two amplification primers (primers) were chosen from the published ARAKAWA htfIIIA sequence (cf. Figure These TFIIIAPCR5' and TFIIIA SEQ2 primers are called SEQ ID
N
0 8 and SEQ ID N 0 7 respectively.
A homopolymeric chain is added to the 3' end of the cDNA using dATP and terminal deoxynucleotidyl transferase (TdT): microlitres of cDNA obtained above in II) b) are incubated at 37°C for 10 minutes in the 1 X tailing buffer reaction solution (Commercial kit solution) and 0.2 mM of dATP and TdT. The TdT is deactivated for 10 minutes at 65 0 C and the reaction is then brought to The reaction is then directly amplified by PCR: microlitres of the TdT reaction are added to 50 microlitres of PCR reaction solution i.e. 1.5 mM of MgC12, IxPCR buffer, 200 nanomoles/ml of each dNTP, UAP and TFIIIA PCR5' primers at a rate of 0.2 micromoles/l for each, 5 DMSO and 2.5 U AmpliTaq DNApolymerase.
The UAP primer is an oligonucleotide provided with the commercial kit.
The cDNA is thus subjected to 30 PCR cycles, firstly at 94°C for one minute, then at 65°C for 1 minute then at 72°C for 1 minute.
The products amplified by this first PCR i.e. PCR1 are subjected to a second amplification reaction by PCR using the UAP primer and a specific TFIIIASEQ 2 primer. The following process is carried out: 5 microlitres of PCR1 are added to microlitres of the PCR reaction solution indicated below mM of MgC12, IxPCR buffer, 200 micromoles/l of each dNTP, the UAP and TFIIIA SEQ2 primers at a rate of 0.2 micromoles/l for each, 5 DMSO and 2.5 U AmpliTaq DNA polymerase.
The DNA is then subjected to 30 PCR cycles, firstly at 94°C for one minute, then at 65°C for 1 minute then at 720C for 1 minute.
The products amplified by this second PCR i.e. PCR2 are purified on agarose gel. The 5' fragments of approximately 380 base pairs are thus isolated.
The 5' fragments obtained above are thus cloned in the pCRII vector using the TA cloning Kit (Invitrogen).
The plasmid thus obtained is called cDNA-DMSO-3 This plasmid is transferred into the XL1 Blue E. coli strain.
The E. coli strain transformed by the plasmid cDNA-DMSO-3 is thus obtained.
3) Verification of the 5' sequence by amplification of the genomic DNA Construction of the 5 geno-3 plasmid Human genomic DNA is extracted from human liver cells according to the usual methods known to a person skilled in the art.
Amplification by PCR of the human genomic DNA is carried out in the following manner: 2 micrograms of human genomic DNA obtained as indicated above is added to 100 microlitres of the following PCR reaction solution: 2mM MgC12, 1 x native Pfu DNA polymerase buffer, 200 nanograms/ml of each dNTP, the OLT5 and TFIIIA SEQ2 primers at a rate of 0.15 micromoles/l for each, 5 DMSO and U pfu polymerase.
OLT5 and TFIIIA SEQ2 are called SEQ ID N 0 5 and SEQ ID N 0 7 respectively.
The reaction medium is thus subjected to 30 PCR cycles, firstly at 94 0 C for one minute, then at 60 0 C for 1 minute, then at 72 0 C for 1 minute.
S 25 The products amplified by PCR thus obtained are fragments of DNA of approximately 360 base pairs.
The fragments thus obtained are cloned in the pCR-Script vector using the pCR-Script Cloning kit (Stratagene).
The plasmid thus obtained is called 5 geno-3.
This plasmid is transferred into the XL1 Blue E. coli strain.
The E. coli strain transformed by the plasmid 5 geno-3 is thus obtained.
4) Cloning of the htfIIIA gene according to the present invention.
Construction of the pYX TFIIIALHA plasmid The complete htfIIIA coding sequence is restored by assembly of the two 3' and 5' fragments obtained above in III) b)l) and III) b)3).
A Hind III restriction site located on each of the 3' and fragments obtained above makes it possible to restore the complete sequence.
The 5 geno-3 plasmid obtained above in III) b)3) is digested by the EcoRl and HindIII restriction enzymes.
The EcoRl site is located 11 nucleotides upstream of the coding sequence.
Fragments of approximately 350 base pairs are obtained after purification on agarose gel.
Ligation with the vector pYX/EcoRI HindIII is then carried out and the vector pYXTFIIIA5' is obtained.
The addition of the 3' fragment to the 5' fragment is then carried out: the 5.2 Raji 2.9 plasmid obtained above in III) is digested by the restriction enzymes HindIII and SmaI.
After purification on agarose gel, a fragment of approximately 930 base pairs is obtained. This fragment is inserted into the pYXTFIIIA5' plasmid obtained above, previously digested by the restriction enzymes SmaI and HindIII.
The pYXTFIIIALHA plasmid is thus obtained which therefore contains the hTFIIIA gene of the present invention.
Example 2: Construction of the XL1 Blue/pYX TFIIIALHA strain The preparation of the XL1-Blue/ pYX TFIIIALHA strain, is S 25 carried out according to techniques known to a person skilled in the art (ref above: Sambrook, Fritsh and Maniatis) from the XL1- Blue type K12 E. coli strain (Stratagene), and the pYX TFIIIALHA plasmid obtained above in Example 1 is introduced.
Example 3: Construction of the bpS-tfC2LHA plasmid The vector bpS-SK+ (Stratagene) is used, in which an insert coding for the htFIIIA gene of the present invention is integrated. The following process is carried out: the pYXTFIIIALHA plasmid obtained above in Example 1 is digested by the restriction enzyme EcoRI, this end is filled using DNA Polymerase (Klenow fragment) in the presence of dNTP. This plasmid is then digested by Nhe I and the fragment corresponding to the htfIIIA sequence according to the present invention is purified. This fragment is inserted into the bpS-SK+ vector prepared as follows: the vector is digested by EcoRI, this site is filled using DNA polymerase then digested by XbaI.
The plasmid bpS-tfC2LHA is thus obtained.
Example 4: Construction of the XL1-Blue/bpS-tfC2LHA strain For the preparation of the XL1-Blue/bpS-tfC2LHA strain, techniques known to a person skilled in the art, using XL1- Blue type K12 E. coli strain (Stratagene) are carried out, and the bpS-tfC2LHA plasmid obtained above in Example 3 is introduced.
A sample of the strain obtained i.e. XL1- Blue type K12 E.
S Coli (Stratagene) containing the bpS-SK+ vector (Stratagene) with an insert coding for tfC2 (cDNA coding part containing the htfIIIA coding region) i.e. XL1-Blue/bps-tfC2LHA coding region was deposited at L'Institut Pasteur 25, rue du Docteur ROUX Paris 75015 at the CNCM on the 15th September 1998 under the number 1-2071.
Example 5: Identification of the start codon of proteinic synthesis.
Purification of the hTFIII protein was described by Moorefield et al (1994) [reference: the Journal of Biological Chemistry, Vol. 269, N° 33, pp. 20857-20865, 1994, Purification and Characterization of Human Transcription S 25 Factor IIIA, B. Moorefield and R. G. Roeder].
The hTFIIIA protein identified by Moorefield has a molecular weight of 42 kDa. It can be noted that the theoretical molecular weight of the htFIIIA protein coded by the Arakawa htfIIIA sequence is 47 kDa.
Proteinic synthesis is generally started at an ATG codon.
However the htfIIIA coding sequence of the present invention does not contain the ATG codon in phase.
It has been demonstrated that the different ATG codons, in particular the CTG or GTG codons are start codons of translation in natural cellular transcripts.
With techniques known to a person skilled in the art such as translation experiments in vitro with the htfIIIA sequence according to the invention obtained above in Example 1, and by expression tests in mammalian cells such as Cos cells, the start codon of hTFIIIA proteinic synthesis according to the present invention was demonstrated.
Within the scope of the present invention, it has thus been demonstrated that the start codon of htfIIIA proteinic synthesis according to the present invention is the CTG codon which is found in position 176-178 of SEQ ID N 0 3.
Analysis of the results Analysis of the results obtained by the preparations of the examples indicated above reveal the following points relating to the htfIIIA coding sequence: in 3' (above in III) the main part of the sequence isolated in the present Application corresponds to the DREW htfIIIA sequence 15 in 5' (above in III) the longest sequence of fragments obtained by the preparation described above in III) b)3) begins in position 20 of the ARAKAWA htfIIIA sequence and reveals the insertion of a nucleotide in position 127 of the ARAKAWA htfIIIA sequence.
The results obtained by the preparations of htfIIIA described above according to the present invention confirm that omission of a nucleotide in position 127 in the ARAKAWA sequence really does exist in the human htfIIIA gene.
Where the terms "comprise", "comprises", "comprised" or "comprising" are used S* in this specification, they are to be interpreted as specifying the presence of the stated features, integers, steps or components referred to, but not to preclude the presence or addition of one or more other feature, integer, step, component or group thereof group thereof.
EDITORIAL NOTE APPLICATION NUMBER 11645/00 The following Sequence Listing pages 1 to 6 are part of the description. The claims pages follow on pages 26 to 27.
1 SEQUENCE LISTING <110> Hoechst Marion Roussel <120> Human htF'TTIA gene and <130> 9823seq <140> <141> <160> <170> Patentln Vers. <210> 1 <211> 1273 <212> DNA <213> Human <220> <221> CDS <222> (176) .(1270) <400> 1 atgogcagoa goggcgccga ogcggggcgg gtgccggcgt cgcgcgaagg ttcagcaggg aogtgtctcg goacgtggca gcgcgcotgg coded htfTTTA protein tgcctggtga agccgtgggc occtgggott ccgogcgcgc cgggcgcgcc ggaggcgccg tcccggaagt ggttcoggc gcgcc ctg Met gat Asp gao Asp 000 Pro aat As n ggg Gly ato Ile aag Lys cog Pro ttc Phe ott Leu ago Ser aga Arg gao Asp ttt Phe 100 gtg Val1 goa Ala agg Arg goo Ala ttt Phe oat His t gt Cys goo Al a ggo Gly tto Phe aag Lys tgt Cys ago Ser goo Al a tog Ser ago Ser t go Cys gao Asp tat T yr oao His ggo Gly gtg Val1 t oa Ser too Ser gog Al a gaa Glu 75 att Ile t gt Cys tog S er got Ala tto Phe oao His 60 ggg Gly ot g Leu gat Asp too Ser oog Pro oot Pro ot g Leu t gt Cys acot Thr oaa Gln ttg Leu aoo Thr gao Asp tgo Cys ggo Gly cac His aaa Lys 110 at o Ile oog Pro ago Ser oao His goo Al a gga Gly aao Asn goo Ala ogo Arg goo Ala a og Thr tt 0 Phe gaa Glu aoa Thr 120 178 226 274 322 370 418 466 514 aaa Lys aaa Lys 130 cat His ttc Phe aag Lys gga Gly aga Arg 210 ttt Phe gaa Giu act Thr agc Ser atg Met 290 aag Lys gcc Al a ggc Gly aag Lys t ca Ser 115 caa Gin cag Gin aag Lys ctg Leu t gt Cys 195 gaa Glu aaa Lys agg Arg act Thr cgc Arg 275 aaa Lys aaa Lys t ct Ser tta Leu atg Met 355 aac Asn tat Tyr cag Gin tgt Cys aaa Lys 180 tcc Ser acc Thr cgc Arg gat Asp gt g Val1 260 cct Pro caa Gin atg Met cat His t ct Ser 340 ct c Leu ttg Leu ata Ile ctg Leu acc Thr 165 cga Arg ttt Phe cat His aaa Lys gt a Val1 245 ttt Phe ttt Phe agt Ser aag Lys ct c Leu 325 ttg Leu aag Lys tgc Cys aaa Lys 150 cag Gin cat His gt g Vai aaa Lys gat Asp 230 t gt Cys aat Asn gt g Val1 ct c Leu ct c Leu 310 agt Ser t gt Cys aaa Lys agt Ser 135 at c Ile gaa Giu gcc Al a gca Al a gag Glu 215 tac Tyr cgc Arg ctc Leu t gt Cys act Thr 295 aaa Lys gga Gly caa Gin cat His 120 ttt Phe cat His gga Giy aag Lys aaa Lys 200 gaa Glu ctt Leu tgt Cys caa Gin ga a Glu 280 agg Arg gt c Val1 tat Tyr aac Asn ttt Phe gaa Giu cag Gin t gt Cys gcc Al a 185 aca Thr ata Ile aag Lys cca Pro agc Ser 265 cat His cat His aaa Lys atc Ile gga Gly 345 gaa Giu gac Asp t gc Cys ggg Giy 170 cac His tgg Trp ct a Leu caa Gin aga Arg 250 cat His gct Ala gct Ala aaa Lys cct Pro 330 gag Giu cgc aaa cat Arg Lys His tgt Cys cag Gin 155 aaa Lys gag Giu acg Thr tgt Cys cac His 235 gaa Giu atc Ile Gi y gt t Val1 t ct Ser 315 ccc Pro tca Ser 125 aag Lys 140 cat His cac His ggc Gly gaa Glu gaa Glu 220 at g Met ggc Gly ct c Leu t gt Cys gt a Val1 300 cgt Arg aaa Lys ccc Pro aag Lys acc Thr ttt Phe tat Tyr ctt Leu 205 gt a Val1 aaa Lys tgt Cys tcc Ser ggc Gly 285 cat His gaa Giu a gg Arg aac Asn gaa Glu acc Thr aat Asn gca Ala gt a Val 190 ct g Leu tgc Cys act Thr gga Gly ttc Phe 270 aaa Lys gat Asp aaa Lys aaa Lys t gt Cys 350 aat caa caa Asn Gin Gin ttt Phe gaa Giu t ca Ser 175 t gt Cys aaa Lys cgg Arg cat His aga Arg 255 cat His aca Thr cct Pro cgg Arg caa Gin 335 gt g Val aag Lys cct Pro 160 ccc Pro caa Gin cat His aaa Lys gcc Ala 240 acc Thr aaa Lys 145 ct a Leu agc Ser aaa Lys gtg Val aca Thr 225 cca Pro tat Tyr 562 610 658 706 754 802 850 898 946 994 1042 1090 1138 1186 1234 1273 gag gaa Giu Glu ttt gca Phe Ala gac aag Asp Lys 305 agt ttg Ser Leu 320 ggg caa Gly Gin gaa gac Glu Asp tcg aca gtt gca gta ctt acc ctt ggc taa Ser Thr Val Ala Val Leu Thr Leu Gly <210> 2 <211> 365 <212> PRT <213> Human <400> 2 Met Asp Pro Pro Ala Val Val Ala Glu Ser Val Ser Ser Leu Thr Ile 1 5 10 Ala Asp Ala Phe Ile Ala Ala Gly Glu Ser Ser Ala Pro Thr Pro Pro 25 Arg Pro Ala Leu Pro Arg Arg Phe Ile Cys Ser Phe Pro Asp Cys Ser 40 Ala Asn Tyr Ser Lys Ala Trp Lys Leu Asp Ala His Leu Cys Lys His 55 Thr Gly Glu Arg Pro Phe Val Cys Asp Tyr Glu Gly Cys Gly Lys Ala 70 75 Phe Ile Arg Asp Tyr His Leu Ser Arg His Ile Leu Thr His Thr Gly 90 Glu Lys Pro Phe Val Cys Ala Ala Thr Gly Cys Asp Gin Lys Phe Asn 100 105 110 Thr Lys Ser Asn Leu Lys Lys His Phe Glu Arg Lys His Glu Asn Gin 115 120 125 Gin Lys Gin Tyr Ile Cys Ser Phe Glu Asp Cys Lys Lys Thr Phe Lys 130 135 140 Lys His Gin Gin Leu Lys Ile His Gin Cys Gin His Thr Asn Glu Pro 145 150 155 160 Leu Phe Lys Cys Thr Gin Glu Gly Cys Gly Lys His Phe Ala Ser Pro 165 170 175 Ser Lys Leu Lys Arg His Ala Lys Ala His Glu Gly Tyr Val Cys Gln 180 185 190 Lys Gly Cys Ser Phe Val Ala Lys Thr Trp Thr Glu Leu Leu Lys His 195 200 205 Val Arg Glu Thr His Lys Glu Glu Ile Leu Cys Glu Val Cys Arg Lys 210 215 220 Thr Phe Lys Arg Lys Asp Tyr Leu Lys Gin His Met Lys Thr His Ala 225 230 235 240 Pro Glu Arg Asp Val Cys Arg Cys Pro Arg Glu Gly Cys Gly Arg Thr 245 250 255 Tyr Thr Thr Val Phe Asn Leu Gin Ser His Ile Leu Ser Phe His Glu 260 265 270 Glu Ser Arg Pro Phe Val Cys Glu His Ala Gly Cys Gly Lys Thr Phe 275 280 285 Ala Met Lys Gin Ser Leu Thr Arg His Ala Val Val His Asp Pro Asp 290 295 300 Lys Lys Lys Met Lys Leu Lys Val Lys Lys Ser Arg Glu Lys Arg Ser 305 310 315 320 Leu Ala Ser His Leu Ser Gly Tyr Ile Pro Pro Lys Arg Lys Gin Gly 325 330 335 Gin Gly Leu Ser Leu Cys Gin Asn Gly Glu Ser Pro Asn Cys Val Glu 340 345 350 Asp Lys Met Leu Ser Thr Val Ala Val Leu Thr Leu Gly 355 360 365 <210> 3 <211> 1273 <212> DNA <213> Human <400> 3 atgcgcagca gtgccgqcgt acgtgtctcg tccgccggcc agccggcgag ctccttccct caagcacacg cagggactac tqcagccact acqcaaacat ctttaagaaa caagtgtacc tgccaaggcc gacggaactt ccggaaaaca aagggatgta tctccaaagc tggctgtggc tcctgacaag ctctcatctc tcaaaacgga tacccttggc qcggcgccqa cgcgcgaagg gcacgtggca gtggtcqccg agct caqct c gactgcagcg ggggagagac cat ctgagcc ggctgtgatc gaaaatcaac catcagcagc caggaagqat cacgagggct ctgaaacatg tttaaacgca tgtcgctgtc catatcctct aaaacatttg aagaaaatga agtggatata gagtcaccca taa cgcgggqcgg ttcagcaggg gcgcgcctgg agtcggtgtc cgaccccgcc ccaattacaq catttgtttq gccacattct aaaaattcaa aaaaacaata tgaaaatcca gtgggaaaca atgtatgtca tqagagaaac aagattacct caagagaag ccttccatga caatgaaaca agctcaaagt tccctcccaa actgtgtgga tgcctggtqa agccgtgggc ccctqggctt gtccttgacc gcgccccgcg caaaqcctgg tgactatgaa gactcacaca cacaaaatca tatatgcagt tcagtgccag ctttgcatca aaaaggatgt ccataaagag taagcaacac ctgtggaaga qqaaagccgc aagtctcact caaaaaatct aaggaaacaa agacaagatg ccgcgcgcgc cgggcgcgcc ggaggcgccg at cgccgacg cttcccagga aagcttgacg gggtqggca ggagaaaagc aacttgaaga tttgaaqact cataccaatg cccagcaagc tcctttgtgg gaaatactat atgaaaactc acctatacta ccttttgtgt aggcatgctg cgtgaaaaac gggcaaggct ctctcgacag tcccggaagt ggttCccggc gcgccctgga cgttcattgc ggttcatctg cgcacctgtg aggccttcat cgt t tgt t t aacattttga gtaagaagac aacctctatt tgaaacgaca caaaaacatg gtgaagtatg atgccccaga ctgtgtttaa gtgaacatgc ttgtacatga ggagtttggc tatctttgtg ttgcagtact 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1273 <210> 4 <211> 1213 <212> DNA <213> Human <400> 4 j gtgccggcqc acqtqtctcg tccgccggcc agccggcgag ctccttccct caaqcacacg cagqgactac tgcagccact acgcaaacat ctttaagaaa caagtgtacc tgccaaggcc gacggaactt ccggaaaaca aagggatgta tctccaaagc tggctgtggc tcctgacaag ct ctcat ctc tcaaaacgga tacccttggc cgcgcgaagq gcacgtggca gtqgt cgccg aqctcagctc gactgcagcg ggggagagac catctgagcc ggctqtgatc gaaaatcaac catcagcagc caggaaggat cacgaggqct ctgaaacatg tttaaacgca t gt cgctqt c catatcctct aaaacatttg aagaaaatga agtggatata gaqtcaccca taa ttcagcaggg gcgcqcctgg agtcggtgtc cgaccccgcc ccaattacag catttgtttg gccacattct aaaaattcaa aaaaacaata tgaaaatcca gtgggaaaca atgtatgtca tgagagaaac aagattacct caagagaagg ccttccatga caatgaaaca agctcaaagt tccctcccaa actgtgtqga agccgtgggc ccctgggctt gtccttgacc gcgccccgcg caaagcctgg tgactatgaa gactcacaca cacaaaatca tatatgcagt tcagtgccag ctttgcatca aaaaggatgt ccataaagag taagcaacac ctgtggaaga qgaaagccgc aagtctcact caaaaaatct aaggaaacaa agacaagatg cgggcgcgcc ggaggcqccg atcgccgacg cttcccagga aagcttgacg gggtgtggca ggagaaaagc aacttgaaga tttgaagact cataccaatg cccagcaagc tcctttgtgg gaaatactat atgaaaactc acctatacta ccttttgtgt aggcatgctg cgtgaaaaac gggcaaggct ctctcgacag ggttcccggc qcgccctgga cgttcattgc ggttcatctg cgcacctgtg aggcctt cat cgtttqtttg aacattttga gtaagaagac aacctctatt tqaaacgaca caaaaacatg gtgaagtatg atgccccaqa ctgtgtttaa gtgaacatgc ttgtacatga gqagtttggc tatctttgtg ttgcagtact 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1213 <210> <211> 34 <212> DNA <213> Human <400> cggggtacca aaaatgcqca qcaqcggcgc cgac <210> 6 <211> 21 <212> DNA <213> Human <400> 6 tccttccctg actgcagcgc c <210> 7 <211> <212> DNA <213> Human <400> 7 tgcacaggtg cgcgtcaagc <210> 8 <211> <212> DNA <213> Human <400> 8 cacaaacaaa tggtctctcc <210> 9 <211> <212> DNA <213> Human <400> 9 cggtctagat tagccaaggg taagtactgc <210> <211> <212> DNA <213> Human <400> cctcccgggg ccaagggtaa gtactgcaac

Claims (11)

1. An isolated DNA sequence of the htflIIA gene of the human transcription factor hTFIIIA coding for a protein having the biological function of human transcription factor hTFIIA characterised in that it codes for the amino acid sequence SEQ ID No. 2.
2. The isolated DNA sequence of the htflIIA gene according to claim 1 containing the nucleotide sequence SEQ ID No. 3.
3. The isolated DNA sequence of the htflIIA gene according to claim 1 containing the nucleotide sequence SEQ ID No. 4.
4. The isolated DNA sequence according to claim 1 having the sequence beginning at nucleotide 176 and finishing at the nucleotide 1270 of SEQ ID No. 3. The isolated DNA sequence according to claim 1 to 4 comprising besides a modification introduced by deletion, insertion, and/or substitution of one nucleotide and coding for a protein with the same biological activity as human transcription factor hTFIIIA.
6. A recombinant polypeptide having the function of human transcription factor hTFIIIA and with the amino acid sequence SEQ ID No. 2 coded by the DNA sequence according to one of claims 1 to
7. An analogue of a polypeptide according to claim 6 the amino acid sequence of which has been modified by substitution, deletion or addition of one amino acid but which retain the same biological function.
8. A process for the preparation of the recombinant protein having the amino acid sequence SEQ ID No. 2 comprising the expression of the DNA sequence according to one of claims 1 to 5 in a appropriate host, then isolation and purification of the said recombinant protein. S 25 9. An expression vector containing the DNA sequence according to one of claims 2 to A host cell transformed with a vector according to claim 9.
11. A plasmid deposited at the CNCM under the number 1-2071.
12. Use of the human transcription factor hTFIIIA gene or of the human 30 transcription factor coded by this gene according to any one of the claims 1 to 6 for the preparation of compositions which can be used for the diagnosis or treatment of diseases Slinked to a disorder in transcription control.
13. Use according to claim 12 for which the disease concerned is cancer. 23/09/03,sw 12020spa,26 27
14. The isolated DNA according to Claim 1 substantially as hereinbefore described in any one of the Examples. DATED this 3 rd day of June, 2003 AVENTIS PHARMA S.A. By Their Patent Attorneys: CALLIINAN LAWRIE 03/06/03,mc I 2020.claims.doc,27
AU11645/00A 1998-11-10 1999-11-09 htfIIIA human gene and coded hTFIIIA protein Ceased AU767647B2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
FR9814146 1998-11-10
FR9814146A FR2785618B1 (en) 1998-11-10 1998-11-10 HtfIIIA HUMAN GENE AND hTFIIIA CODED PROTEIN
PCT/FR1999/002738 WO2000028024A1 (en) 1998-11-10 1999-11-09 htfIIIA HUMAN GENE AND CODED hTFIIIA PROTEIN

Publications (2)

Publication Number Publication Date
AU1164500A AU1164500A (en) 2000-05-29
AU767647B2 true AU767647B2 (en) 2003-11-20

Family

ID=9532585

Family Applications (1)

Application Number Title Priority Date Filing Date
AU11645/00A Ceased AU767647B2 (en) 1998-11-10 1999-11-09 htfIIIA human gene and coded hTFIIIA protein

Country Status (6)

Country Link
EP (1) EP1137775A1 (en)
JP (1) JP2002529084A (en)
AU (1) AU767647B2 (en)
CA (1) CA2351019A1 (en)
FR (1) FR2785618B1 (en)
WO (1) WO2000028024A1 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0704526A1 (en) * 1994-09-05 1996-04-03 Otsuka Pharmaceutical Co., Ltd. hTFIIIA gene

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0704526A1 (en) * 1994-09-05 1996-04-03 Otsuka Pharmaceutical Co., Ltd. hTFIIIA gene

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ARAKAWA ET AL. (1995) CYTOGENET. CELL. GENET. 70:235-238 *
DREW ET AL. (1995) GENE 159:215-218 *

Also Published As

Publication number Publication date
CA2351019A1 (en) 2000-05-18
EP1137775A1 (en) 2001-10-04
FR2785618B1 (en) 2002-12-13
WO2000028024A1 (en) 2000-05-18
FR2785618A1 (en) 2000-05-12
JP2002529084A (en) 2002-09-10
AU1164500A (en) 2000-05-29

Similar Documents

Publication Publication Date Title
Hartings et al. The O2 gene which regulates zein deposition in maize endosperm encodes a protein with structural homologies to transcriptional activators.
Cantatore et al. Mapping and characterization of Paracentrotus lividus mitochondrial transcripts: multiple and overlapping transcription units
Vogel et al. hDIP—a potential transcriptional regulator related to murine TSC-22 and Drosophila shortsighted (shs)—is expressed in a large number of human tissues
AU767647B2 (en) htfIIIA human gene and coded hTFIIIA protein
Dell'Orto et al. The potato mitochondrial ATP synthase subunit 9: gene structure, RNA editing and partial protein sequence
JPH05199882A (en) Production of bilirubin oxidase
US5773214A (en) Multiple drug resistance gene of aspergillus flavus
AU775543B2 (en) Novel candida albicans genes and proteins coded by said genes
US6746837B2 (en) Multiple drug resistance gene atrD of aspergillus nidulans
US5945324A (en) Multiple drug resistance gene ATRC of aspergillus nidulans
EP0422175B1 (en) Proteins which regulate the expression of vertebrate mhc class ii genes, dna sequences encoding them and pharmaceutical compositions
AU769683B2 (en) CATFIIIA candida albicans TFIIIA gene (CATFIIIA) and the coded CATFIIIA protein
NZ330697A (en) Structure and expression of the d-amino acid oxidase gene from the yeast rhodosporidium toruloides
NZ246085A (en) Fungine endopolygalacturonase inhibitor protein and nucleotide sequence and its recombinant production
US5705352A (en) Multiple drug resistance gene of Aspergillus fumigatus
CA2147110A1 (en) Multiple drug resistance gene of aureobasidium pullulans
WO1992014820A1 (en) Human nerve cell adhesion factor l1, production thereof, and gene coding for the same
CN116003542A (en) Microorganism for producing citric acid, construction method and application thereof
Sosa et al. Structure of a ribosomal protein gene in Mucor racemosus

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)