WO1998024909A1 - Chaperone fragments - Google Patents

Chaperone fragments Download PDF

Info

Publication number
WO1998024909A1
WO1998024909A1 PCT/GB1996/002980 GB9602980W WO9824909A1 WO 1998024909 A1 WO1998024909 A1 WO 1998024909A1 GB 9602980 W GB9602980 W GB 9602980W WO 9824909 A1 WO9824909 A1 WO 9824909A1
Authority
WO
WIPO (PCT)
Prior art keywords
polypeptide
protein
groel
amino acid
activity
Prior art date
Application number
PCT/GB1996/002980
Other languages
French (fr)
Inventor
Alan Roy Fersht
Ralph Zahn
Myriam Marlenne Altamirano
Original Assignee
Medical Research Council
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Medical Research Council filed Critical Medical Research Council
Priority to PCT/GB1996/002980 priority Critical patent/WO1998024909A1/en
Priority to AU10363/97A priority patent/AU1036397A/en
Priority to EP97943058A priority patent/EP0928336A1/en
Priority to AU44674/97A priority patent/AU736670B2/en
Priority to PCT/GB1997/002652 priority patent/WO1998013496A1/en
Priority to JP10515422A priority patent/JP2001501093A/en
Publication of WO1998024909A1 publication Critical patent/WO1998024909A1/en
Priority to US09/276,455 priority patent/US20030078373A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K1/00General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
    • C07K1/107General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length by chemical modification of precursor peptides
    • C07K1/113General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length by chemical modification of precursor peptides without change of the primary structure
    • C07K1/1133General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length by chemical modification of precursor peptides without change of the primary structure by redox-reactions involving cystein/cystin side chains
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • C07K14/24Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
    • C07K14/245Escherichia (G)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides

Definitions

  • the present invention relates to chaperone polypeptides which are active in the folding and maintenance of structural integrity of other proteins.
  • the invention also relates to nucleic acids encoding chaperone polypeptides, vectors comprising these nucleic acids, host cells modified with the nucleic acids or vectors so as to express the chaperone polypeptides.
  • the invention further relates to methods of making chaperone polypeptides whether by synthetic or recombinant means, pharmaceutical compositions comprising the chaperone polypeptides or nucleic acids encoding the same and the use of chaperone polypeptides in the treatment of disease or in the reconditioning of biologically active materials.
  • the invention also relates to antibodies reactive against chaperone polypeptides and their use in medicine and diagnostics.
  • Chaperones are in general known to be large multisubunit protein assemblies essential in mediating polypeptide chain folding in a variety of cellular compartments. Families of chaperones have been identified, for example the chaperonin hsp ⁇ O family otherwise known as the cpn ⁇ O class of proteins are expressed constitutively and there are examples to be found in the bacterial cytoplasm (GroEL) , in endosymbiotically derived mitochondria (hsp ⁇ O) and in chloroplasts (Rubisco binding protein) . Another chaperone family is designated TF55/TCP1 and found in the thermophilic archaea and the evolutionarily connected eukaryotic cytosol.
  • GroEL which is a member of the hsp ⁇ O family of heat shock proteins.
  • GroEL is a tetradecamer wherein each monomeric subunit (cpn ⁇ Om) has a molecular weight of approximately 57kD. The tetradecamer facilitates the in vi tro folding of a number of proteins which would otherwise misfold or aggregate and precipitate.
  • the structure of GroEL from E. coli has been established through X-ray crystallographic studies as reported by Braig K et al (1994) Nature 371: 578-586.
  • the holo protein is cylindrical, consisting of two seven- membered rings that form a large central cavity which according to Ellis R J and Hartl F U (1996) FASEB Journal 10: 20-26 is generally considered to be essential for activity.
  • Some small proteins have been demonstrated to fold from their denatured states when bound to GroEL (Gray T E and Fersht A R (1993) J Mol Biol 232: 1197- 1207; Hunt J F et al (1996) Nature 379: 37-45; Weiss an J S et al (1996) Cell 8 ⁇ : 481-490; Mayhew M et al (1996) Nature 379: 420-426; Corrales F J and Fersht A R (1995) Proc Nat Acad Sci 92_: 5326-5330) and it has been argued that a cage-like structure is necessary to sequester partly folded or assembled proteins (Ellis R J and Hartl F U (1996) supra .
  • E. coli GroEL The entire amino acid sequence of E. coli GroEL is also known (see Braig K et al (1994) supra) and three domains have been ascribed to each cpn ⁇ Om of the holo chaperonin (tetradecamer) . These are the intermediate (amino acid residues 1-5, 134-190, 377-408 and 524-548), equatorial (residues 6-133 and 409-523) and apical (residues 191- 376) domains.
  • GroEL facilitates the folding of a number of proteins by two mechanisms; (1) it prevents aggregation by binding to partly folded proteins (Goloubinoff P et al (1989) Nature 342: 884-889; Zahn R and Pl ⁇ ckthun A (1992) Biochemistry 31: 3249-3255), which then refold on GroEL to a nativelike state (Zahn R and Pl ⁇ ckthun A (1992) Biochemistry 31: 3249-3255; Gray T E and Fersht A R (1993) J Mol Biol 232: 1197-1207); and (2) it continuously anneals misfolded proteins by unfolding them to a state from which refolding can start again (Zahn R et al (1996) Science 271 : 642-645) .
  • the equatorial domain has been shown from the 2.4 A crystal structure of ATP ⁇ S-ligated GroEL (Boisvert D C et al (1996) Nature Structure Biology 3_: 170-177) and mutagenesis studies (Fenton W A et al (1994) Nature 371: 614-619) to have the nucleotide binding sites. Binding and hydrolysis of ATP is cooperative (Bochkareva E S et al (1992) J Biol Chem 267: 6796-6800; Gray T E and Fersht A R (1991) FEBS Lett 292: 254-258), and lowers the affinity for polypeptides (Jackson G S et al (1993) Biochemistry 3_2 ⁇ 2554-2563) .
  • the crystal structure of GroEL shows unusually high B- factors for the apical domain compared with the equatorial or intermediate domain, and the B-f actors vary considerably within the domain (Braig K et al (1994) Nature 371: 578-586; Braig K et al (1995) Nature Struct Biol 2 : 1083-1094; Boisvert D C et al (1996) Nature Structure Biology 3_: 170-177) .
  • the high overall B-factor seems to result from a static disorder within the asymmetric unit and probably throughout the crystals of GroEL, and has been attributed to rigid-body movements generated by hinge-like J3-sheets in the intermediate domain.
  • Regions of high flexibility have also been observed in the 2.8A structure of the co-chaperonin GroES (Hunt J F et al (1996) Nature 79: 37-45) .
  • a mobile loop has been shown to be directly involved in ADP-dependent binding to the apical domain (Landry S J et al (1993) Nature 364: 255-258) .
  • the proteolytic fragment GroEL 150-456 elutes as a monomer during gel filtration, it still comprises the apical domain and significant portions of the intermediate and equatorial domains, the latter of which determine the intersubunit contacts of GroEL (Braig K et al (1994) supra) , thus allowing transient formation of the central cavity thereby accounting for the chaperonin activity which is observed.
  • EP-A-0 650 975 discloses chaperoin molecules and a method of refolding denatured proteins using GroEL chaperonin 60 monomers (cpn60m) obtained from Thermus thermophil us .
  • the holo-chaperonin was first extracted and then purified from the bacterial source according to the method of Taguchi et al (1991) J Biol Chem 266: 22411-22418.
  • the cpn60m was then produced by treatment of the holo-chaperonin with trifluoroacetic acid (TEA) followed by reverse phase (rp) HPLC of the resulting denatured protein.
  • TAA trifluoroacetic acid
  • rp reverse phase
  • the refolding activity of the cpn60m was assayed in solution by monitoring the regain in activity of inactivated rhodanese, which in specific activity terms amounted to about only 25% of the specific activity of the rhodanese prior to inactivation. When background spontaneous rhodanese refolding is subtracted then there is only an approximately 20% refolding activity.
  • EP-A-0 650 975 also discloses the use of an approximately 50kD N-terminal deletion fragment of cpn60m wherein the N-terminal amino acid residues up to (but not including) the Thr residue at position 79 are removed by proteolysis.
  • This 50kD fragment showed an approximately 35% (about 30% when background is subtracted) rhodanese refolding activity when in solution.
  • Taguchi H et al (1994) J Biol Chem 26_9: 8529-8534 is a scientific report on which the invention of EP-A-0 650 975 is based.
  • a transiently formed GroEL tetradecamer (the holo-chaperonin) was perceived to exist when the chaperonin monomers are present in solution. Consequently, the refolding activity of these preparations can be seen to be caused by the presence of holo chaperonin, not monomers.
  • Taguchi et al immobilised cpn60m to a chromatographic resin to exclude the possibility of holo chaperonin formation. When immobilised and therefore when in truly monomeric form, cpn ⁇ Om exhibited only about 10% rhodanese refolding activity.
  • TIBS 3 ⁇ : 81-82 suggested that an "internal fragment" of GroEL may possess a chaperone activity on the basis of amino acid sequence similarity between the altered mRNA stability (ams) gene product (Ams) of E. coli and the central part of GroEL.
  • the ams locus is a temperature-sensitive mutation that maps at 23 min on the E. coli chromosome and results in mRNA with an increased half-life.
  • the ams gene has been cloned, expressed and shown to complement the ams mutation.
  • the gene product is a 149-amino acid protein (Ams) with an apparent molecular weight of 17kD.
  • the present inventors have identified a need for a simple, truly monomeric chaperone molecule which is of defined sequence and structure and which can efficiently and reproducibly refold, renature, reactivate or recondition proteins from a range of sources in the absence of added cofactor or other agents and without the need to associate to form a holo chaperonin.
  • a problem which the invention seeks to solve is therefore how to provide an active portion or fragment of a chaperone in truly monomeric form so as to promote a useful and efficient reagent for the refolding, renaturing or reconditioning of biological molecules, particularly proteins.
  • a further object is the provision of such a monomeric form at minimum size.
  • the present invention provides a chaperone polypeptide having an amino acid sequence selected from at least amino acid residues 230-271 but no more than residues 150-455 or 151-456 of a GroEL sequence substantially as shown in Figure 7, or a corresponding sequence of a substantially homologous chaperone polypeptide, or a modified, mutated or variant thereof having chaperone activity.
  • the amino acid sequence is preferably selected from at least amino acid residues 193-337, more preferably 191- 345, even more preferably 191-376 but no more than residues 151-455.
  • the invention therefore includes polypeptides being GroEL amino acid residues 230-271, 230-272 ...et seq... 230-455 and in like manner residues 230-271, 229-271 . . . et seq. . . 151-271. Also, residues 230-271, 229-272 . . . et seq. . . 151-351, 151-352 ...et seq. . . 151-455. All amino acid sequences of 42 or more residues comprising at least contiguous residues 230-271 and not exceeding 151-455 are within the scope of this aspect of the invention eg 171-423 or 166-406.
  • Chaperone activity may be determined in practice by an ability to refold cyclophilin A but other suitable proteins such as glucosamine-6-phosphate deaminase or a mutant form of indoleglycerol phosphate synthase (IGPS) (amino acid residues 49-252) may be used.
  • IGPS indoleglycerol phosphate synthase
  • a rhodanese refolding assay may also be used. Details of suitable refolding assays are described in more detail in the specific examples provided hereinafter.
  • the invention provides monomeric polypeptide having chaperone activity and incapable of multimerisation in solution.
  • the invention provides a chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including the protein binding active site motif:
  • X's represent a peptide bond or bonds or at least one amino acid residue
  • the invention provides a chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including at least one protein binding active site motif moiety selected from:
  • 1 is selected from amino acid residues: I, M, L, V, S, F or A; wherein 2 is selected from: L, I, P, V or A; wherein 3 is selected from: L, E, V, H or I; wherein 4 is selected from: E, A, R, L, Q, or N; wherein 5 is selected from: A V, I, M, L, N, S, R, T, Q or K; wherein 6 is selected from: E, D or G; wherein 7 is selected from: A, P, S, T, G or L; wherein 8 is selected from: T, A, N, S or V; wherein 9 is selected from: V, L, I or A; wherein 10 is selected from: V, L, I, F or H; wherein 11 is selected from: N, S or L; wherein 12 is selected from: R, K, N, Q, L or S; wherein 13 is selected from: I, T, S, G, V, A, & ty K, ForP; wherein 14 is selected from:
  • the claim provides a chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including the protein binding active site motif:
  • X is at least one amino acid residue, or a functional variant thereof in which one or more of the specified amino acid residues has undergone a conservative substitution.
  • the claims provides a chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including at least one protein binding active site motif moiety selected from:
  • X is at least one amino acid residue, or a functional variant thereof in which one or more of the specified amino acid residues has undergone a conservative substitution.
  • a conservative substitution is the replacement of one amino acid residue for another chemically or functionally similar amino acid residue such that the function of the polypeptide overall remains substantially unchanged.
  • refold reactivate
  • recondition reconditional
  • an inactive protein perhaps denatured using urea may have an unfolded structure. This inactive protein may then be refolded with a polypeptide of the invention thereby reactivating it.
  • reconditional an increase in the specific activity of the refolded/reactivated protein compared to the protein prior to inactivation/denaturation: this is termed “reconditional” .
  • the active site motif or an active site motif moiety includes the conserved sequence:
  • the invention provides monomeric polypeptide having chaperone activity and incapable of multimerisation characterised in that in the absence of ATP the polypeptide has a protein refolding activity of more than 50%, preferably 60%, even more preferably 75%, said refolding activity being determined by contacting the polypeptide with an inactivated protein of known specific activity prior to inactivation, and then determining the specific activity of the said protein after contact with the polypeptide, the % refolding activity being:
  • the chaperone activity is determined by the refolding of cyclophilin A. More preferably, 8M urea denatured cyclophilin A (lOO ⁇ M) is diluted into lOOmM potassium phosphate buffer pH7.0, lOmM DTT to a final concentration of l ⁇ M and then contacted with at least l ⁇ M of said polypeptide at 25°C for at least 5 min, the resultant cyclophilin A activity being assayed by the method of Fischer G et al (1984) Bio ed Biochim Acta 43: 1101-1111.
  • the polypeptide is preferably an hsp60 polypeptide, preferably a GroEL polypeptide.
  • the invention provides a polypeptide as claimed in any preceding claim which comprises at least an amino acid sequence selected from GroEL residues:
  • a preferred polypeptide has the amino acid sequence 191- 345 or 191-376, more preferably 193-337 of GroEL, or the equivalent residues of substantially homologous chaperonins, or a modified, mutated or variant sequence thereof.
  • the polypeptide preferably has a molecular weight of less than 34kDa.
  • Modifications include chemically modified polypeptides for example.
  • “Variants” include, for example, naturally occurring variants of the kind to be found amongst a population of hsp60 chaperonin harbouring organisms/cells as well as naturally occurring polymorphisms or mutations.
  • “Mutations” may also be introduced artificially by processes of mutagenesis well known to a person skilled in the art.
  • substantially homologous peptides may have at least 50% amino acid sequence homology with the specified GroEL amino acid sequences, preferably at least 60% homology and more preferably 75% homology. Homology may of course also reside in the nucleotide sequences for the polypeptide which may be at least 50%, preferably at least 60% homologous and more preferably 75% homologous with the nucleotide sequence encoding the specified GroEL amino acid residues.
  • the hsp ⁇ O class of chaperonin proteins are generally homologous in structure and so there are therefore conserved or substantially homologous amino acid sequences between the members of the class.
  • GroEL is just an example of an hsp60 chaperonin protein; other suitable proteins having an homologous apical domain may be followed.
  • OWL is a non redundant database merging SWISS-PROT, PIR (1-3) , GenBank (translation) and NRL-3D.
  • HECHSPABl HECHSPAB NID g712829 - Helicobacter pylori (individual_isolate 85P) D 221-405 CH60_ARATH MITOCHONDRIAL CHAPERONIN HSP60 PRECURSOR. - ARABIDOPSIS THALIANA (MOUS 224-408 CH60_MAIZE MITOCHONDRIAL CHAPERONIN HSP60 PRECURSOR. - ZEA MAYS (MAIZE) .
  • 191-375 PAU17072 PAU17072 NID g576778 - Pseudomonas aeruginosa. " 191-375 CH60_RHILV 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) .
  • PSEGROESL NID gl51241 - Pseudomonas aeruginosa (library: ATCC 27853) 189-372 CH61_SYNY3 60 KD CHAPERONIN 1 (PROTEIN CPN60 1) (GROEL HOMOLOG 1).- SYNECHOCYSTI 189-373 CH60_CLOTM 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (HSP-60) .
  • S40172 S40172 NID g251679 - Chlamydia psittaci pigeon strain P-1041.
  • SYOGROEL2 SYOGROEL2 NID g562270 - Synechococcus vulcanus DNA.
  • 191-375 CH60_CHLPS 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (57 KD CHLAMYDIAL HYP 188- 372 CH62_STRAL 60 KD CHAPERONIN 2 (PROTEIN CPN60 2) (GROEL PROTEIN 2) (HSP56) .
  • TRBMTHSP TRBMTHSP NID g903883 - Mitochondrion Trypanosoma brucei (strain EATRO 8-69 ECOGROELA ECOGROELA NID: gl46268 - E.coli DNA, clone E. 142-325 ENHCPN60P ENHCPN60P NID: g675513 - Enta ⁇ ioeba histolytica (strain HM-1:IMSS) DNA. 257-433 CH60_PLAFG MITOCHONDRIAL CHAPERONIN CPN60 PRECURSOR. - PLASMODIUM FALCIPARUM (ISO 1-90 CRECPN1C CRECPN1C NID: g603914 - Chlamydomonas reinhardtii cDNA to mRNA.
  • the polypeptide may further comprise a polyamino acid sequence, preferably an N-terminal polyamino acid sequence although a C-terminal sequence may be present instead or in addition to an N-terminal sequence.
  • the polyamino acid sequence may be selected from the same or different amino acid residues. When the same amino acid residue is repeated a particularly preferred polyamino acid sequence is a polyhistidine sequence.
  • the further polyamino acid sequence may comprise any number of amino acid residues so long as chaperonin activity is provided.
  • the other amino acid residues which may be included in the further polyamino acid sequence could be selected from any of the twenty or so essential amino acids common to biological systems or of any amino acid variant or derivative.
  • the polyamino acid sequence is other than a homopolymer then it may comprise a repeated sequence of two or more amino acid residues.
  • the amino acid sequence may encode a portion of another known protein or polypeptide or it may even be random.
  • the further polyamino acid sequence preferably also includes a cleavage site cleavable by a cleavage agent.
  • a preferred cleavage agent is thrombin although any other suitable agent will suffice.
  • the sequence of amino acid residues is of course selected to permit cleavage by the desired cleavage agent.
  • the further polyamino acid sequence preferably comprises 17 to 39 amino acids although more than 39 and less than 17 amino acids may also be employed.
  • the polypeptide may be attached to a support, preferably in immobilised form, optionally immobilised to a chromatographic matrix, more preferably an agarose resin.
  • an agarose resin is preferably a nickel- nitrilo-tri-acetic acid (NTA) -ligated agarose resin. This has affinity for a polypeptide having a polyhistidine tail.
  • polypeptide of the invention may be obtained by recombinant means.
  • the polypeptide may be produced by a routine chemical synthesis using standard polypeptide synthesis procedures known in the art. If produced by recombinant means the polypeptide may be fused to a heterologous protein or polypeptide.
  • the inventors have found that the fragments with polyhistidine tails of 17 amino acid residues sht- GroEL191-345 and sht-GroEL191-376 as well as GroEL191-345 are poteint facilitators of the folding of inactive forms of cyclophilin A and rhodanese, and catalyse the unfolding of barnase. Maximal refolding yield was obtained at stoichiometric concentrations of cyclophilin A and apical domain, indicating the formation of a 1:1 complex between chaperone fragment and substrate protein during refolding.
  • the fragments sht-GroEL191-345 and sht-GroEL191-376 when attached to nickel-nitriol-tri- acetic acid (NTA)- ligated agarose resin are also active in enhancing the folding yield of destabilised mutants of indole glycerol-phosphate synthase and of the protein cyclophilin A. This proves the monomer has refolding activity and also demonstrates that the immobilised protein is functionally useful.
  • Fragments of GroEL may be produced either from recombinant DNA methods or from protein chemistry or any means of chemical synthesis, or the equivalent fragments of homologous hsp60 (cpn60) proteins or any natural variants or any variants produced by mutagenesis, or of larger fragments containing the sequences 191-376, 191- 345 of GroEL from E. coli .
  • the invention provides a nucleic acid molecule comprising a nucleotide sequence encoding a polypeptide of the invention, or a nucleotide sequence hybridisable thereto and optionally encoding a polypeptide having chaperone activity.
  • nucleotide sequence correspond to the preferred features of the polypeptides of the aspects of the invention as hereinbefore defined.
  • This aspect of the invention therefore includes a recombinant DNA molecule for use in cloning and/or expressing a DNA sequence, said recombinant DNA molecule comprising:
  • a nucleotide sequence encoding amino acid residues of GroEL selected from amino acid residues: (i) 191-329, 191-330, 191-331, 191-332, 191-333, 191- 334, 191-335, 191-336, 191-337, 191-338, 191-339, 191-340, 191-341, 191-342, 191-343, 191-344, 191- 345, 191-346, 191-347, 191-348, 191-349, 191-350, 191-351, 191-352, 191-353, 191-354, 191-355, 191- 356, 191-357, 191-358, 191-359, 191-360, 191-361, 191-362, 191-363, 191-364, 191-365, 191-366, 191- 367, 191-368, 191-369, 191-370, 191-371, 191-372, 191-373, 191-374, 191-375 or 191-376, or
  • nucleic acid sequence may be as described hereinbefore in relation to the polypeptides of the invention; the nucleic acid sequence having the appropriate nucleotide sequence which on expression provides a polypeptide which possesses these features.
  • the invention provides a vector comprising a nucleic acid as hereinbefore defined.
  • the invention provides a host cell transformed with a vector or a nucleic acid molecule as hereinbefore defined.
  • a method of making a polypeptide of the invention comprising transforming a host cell with a nucleic acid encoding said polypeptide, culturing the transformed cell for and expressing said polypeptide.
  • Expression may be direct or as a fusion product.
  • the method preferably includes a step wherein the expressed polypeptide product is subject to cleavage.
  • the polypeptide may be recovered from the transformed cells expressing it.
  • the invention provides a pharmaceutical formulation comprising a polypeptide of the invention, optionally together with a diluent, carrier or excipient.
  • the active ingredients of a pharmaceutical composition comprising the polypeptide are contemplated to exhibit excellent therapeutic activity, for example, in the alleviation of Alzheimer's disease when administered in amount which depends on the particular case. Dosage procedures may be adjusted to provide the optimum therapeutic response. For example, several divided doses may be administered daily or the dose may be proportionally reduced as indicated by the exigencies of the therapeutic situation.
  • the active compound may be administered in a convenient manner such as by the oral, intravenous (where water soluble) , intramuscular, subcutaneous, intranasal, intradermal or suppository routes or implanting (eg using slow release molecules) .
  • the active ingredient may be required to be coated in a material to protect said ingredients from the action of enzymes, acids and other natural conditions which may inactivate said ingredient.
  • the polypeptide In order to administer the polypeptide by other than parenteral administration, it will be coated by, or administered with, a material to prevent its inactivation.
  • the polypeptide may be administered in an adjuvant, co-administered with enzyme inhibitors or in liposomes.
  • Adjuvant is used in its broadest sense and includes any immune stimulating compound such as interferon.
  • Adjuvants contemplated herein include resorcinols, non-ionic surfactants such as polyoxyethylene oleyl ether and n-hexadecyl polyethylene ether.
  • Enzyme inhibitors include pancreatic trypsin.
  • Liposomes include water-in-oil-in-water CGF emulsions as well as conventional liposomes.
  • the active compound may also be administered parenterally or intraperitoneally.
  • Dispersions can also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of storage and use, these preparations contain a preservative to prevent the growth of microorganisms.
  • the pharmaceutical forms suitable for injectable use include sterile aqueous solutions (where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion.
  • the form must be sterile and must be fluid to the extent that easy syringability exists. It must be stable under the conditions of' manufacture and storage and must be preserved against the contaminating action of microorganisms such as bacteria and fungi.
  • the carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyetheylene gloycol, and the like) , suitable mixtures thereof, and vegetable oils.
  • the proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of superfactants .
  • the prevention of the action of microorganisms can be brought about by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thirmerosal, and the like.
  • isotonic agents for example, sugars or sodium chloride.
  • Prolonged absorption of the injectable compositions can be brought about by the use in the compositions of agents delaying absorption, for example, aluminum monostearate and gelatin.
  • Sterile injectable solutions are prepared by incorporating the active compound in the required amount in the appropriate solvent with various of the other ingredients enumerated above, as required, followed by filtered sterilisation.
  • dispersions are prepared by incorporating the sterilised active ingredient into a sterile vehicle which contains the basic dispersion medium and the required other ingredients from those enumerated above.
  • the preferred methods of preparation are vacuum drying and the freeze-drying technique which yield a powder of the active ingredient plus any additional desired ingredient from previously sterile-filtered solution thereof.
  • the polypeptide When the polypeptide is suitably protected as described above, it may be orally administered, for example, with an inert diluent or with an assimilable edible carrier, or it may be enclosed in hard or soft shell gelatin capsules, or it may be compressed into tablets, or it may be incorporated directly with the food of the diet.
  • the active compound may be incorporated with excipients and used in the form of ingestible tablets, buccal tablets, troches, capsules. elixirs, suspensions, syrups, wafers, and the like. The amount of active compound in such therapeutically useful compositions in such that a suitable dosage will be obtained.
  • the tablets, troches, pills, capsules and the like may also contain the following: a binder such as gum tragacanth, acacia, corn starch or gelatin; excipients such as dicalcium phosphate; a disintegrating agent such as corn starch, potato starch, alginic acid and the like; a lubricant such as magnesium stearate; and a sweetening agent such as sucrose, lactose or saccharin may be added or a flavouring agent such as peppermint, oil of wintergreen, or cherry flavouring.
  • a binder such as gum tragacanth, acacia, corn starch or gelatin
  • excipients such as dicalcium phosphate
  • a disintegrating agent such as corn starch, potato starch, alginic acid and the like
  • a lubricant such as magnesium stearate
  • a sweetening agent such as sucrose, lactose or saccharin may be added or a flavouring agent such as peppermin
  • any material may be present as coatings or to otherwise modify the physical form of the dosage unit.
  • tablets, pills, or capsules may be coated with shellac, sugar or both.
  • a syrup or elixir may contain the active compound, sucrose as -a sweetening agent, methyl and propylparabens as preservatives, a dye and flavouring such as cherry or orange flavour.
  • any material used in preparing any dosage unit form should be pharmaceutically pure and substantially non-toxic in the amounts employed.
  • the active compound may be incorporated into sustained- release preparations and formulations.
  • pharmaceutically acceptable carrier and/or diluent includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents and the like.
  • solvents dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents and the like.
  • the use of such media and agents for pharmaceutical active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active ingredient, use thereof in the therapeutic compositions is contemplated.
  • Supplementary active ingredients can also be incorporated into the compositions.
  • Dosage unit form refers to physically discrete units suited as unitary dosages for the mammalian subjects to be treated; each unit containing a predetermined quantity of active material calculated to produce the desired therapeutic effect in association with the required pharmaceutical carrier.
  • the specification for the novel dosage unit forms of the invention are dictated by and directly dependent on (a) the unique characteristics of the active material and the particular therapeutic effect to be achieved, and (b) the limitations inherent in the art of compounding such as active material for the treatment of disease in living subjects having a diseased condition in which bodily health is impaired.
  • the principal active ingredient is compounded for convenient and effective administration in effective amounts with a suitable pharmaceutically acceptable carrier in dosage unit form.
  • the dosages are determined by reference to the usual dose and manner of administration of the said ingredients.
  • polypeptide of the invention as hereinbefore defined for use in the treatment of disease. Consequently there is provided the use of a polypeptide of the invention for the manufacture of a medicament for the treatment of disease associated with aberrant protein/polypeptide structure.
  • the aberrant nature of the protein/polypeptide may be due to misfolding or unfolding which in turn may be due to an anomalous eg mutated amino acid sequence.
  • the protein/polypeptide may be destablished or deposited as plaques eg as in Alzheimer's disease.
  • the disease might be caused by a prion.
  • a polypeptide-based medicament of the invention would act to renature or resolubilise aberrant, defetive or deposited proteins.
  • nucleic acid molecule in accordance with other aspects of the invention for use in the treatment of disease. Consequently, there is provided the use of a nucleic acid molecule of the invention for the manufacture of a medicament for the treatment of disease associated with protein/polypeptide structure. Genetic therapy in vivo is therefore provided for by way of introduction and expression of DNA encoding the monomeric polypeptide in cells/tissues of an individual to provide chaperonin activity in those cells/tissues.
  • a polypeptide of the invention for altering the structure of a molecule, particularly a protein or polypeptide and the alteration in structure may be by folding, unfolding or resolubilsing.
  • the stoichiometry between the monomeric polypeptide of the invention and the molecule being altered is about 1:1, preferably 1:1.
  • a method of reconditioning a molecule preferably a protein comprising contacting said protein with a polypeptide of the invention.
  • the protein is subjected to inactivation or denaturation prior to contacting with said polypeptide.
  • the polypeptide may be immobilised to a solid phase, preferably a chromatographic matrix, and the contacting of protein and polypeptide is carried out by applying the protein to the top of a bed of the matrix packed in a column and then eluting the polypeptide through the column.
  • a solid phase preferably a chromatographic matrix
  • This aspect of the invention therefore provides for the use of a polypeptide as hereinbefore defined for altering the structure of a molecule, preferably a protein or polypeptide and the alteration in structure is by folding, unfolding or refolding.
  • the stoichiometry between the polypeptide and the molecule being altered may be about 1:1.
  • the invention provides for the use of a polypeptide of the invention the purification or increase in yield, specific activity or quality of biological molecules, preferably said polypeptide being attached to a support.
  • the invention provides a kit for reconditioning or refolding a molecule, preferably a protein, comprising a polypeptide as hereinbefore defined immobilised to a solid phase and a container for holding said solid phase polypeptide.
  • the invention provides for the use of a polypeptide as hereinbefore defined in the production of a protein or polypeptide by recombinant means, wherein the said polypeptide is co-expressed with the protein or polypeptide thereby to improve the yield or quality of the protein or polypeptide.
  • the invention provides an antibody reactive against a polypeptide of the invention.
  • the invention also provides for the use of the antibody in the treatment of disease.
  • the invention provides for the use of the antibody in the manufacture of a medicament for the treatment of disease associated with protein/polypeptide structure.
  • the invention provides various methods of treating disease, namely a method of treating disease in which an effective amount of a polypeptide of the invention is administered. There is also a method of treating disease which comprises administering an effective amount of an inhibitor of the chaperone activity of a polypeptide of the invention. Preferably, said inhibitor is an antibody. Also, there is a method of treating disease by gene therapy which utilises a construct encoding a polypeptide of the invention or an antagonist thereof.
  • GroEL has an intrinsic chaperone activity, which is not restricted to its oligomeric state or central cavity.
  • the C-terminal ⁇ -helices of the apical domain are not directly involved in polypeptide binding, and form a separate folding unit. At physiological temperature, about 50% of the C-terminus is physiological temperature, about 50% of the C-terminus is unfolded, allowing high mobility movements of the polypeptide binding "domain core".
  • Flexibility within the apical domain may be crucial for cooperative binding of a wide range of proteins with different native structure, and for the conformation change of GroEL on binding to the co-chaperonin GroES.
  • the crystal structure of the "domain core" of the apical domain has an overall B-factor 60% lower than that of the same region in intact GroEL.
  • the overall fold of the fragment is similar to the corresponding region of intact GroEL, but the amount of secondary structure is considerably larger.
  • the inventors have expressed the apical domain of GroEL as a stable monomeric protein and with high yield in E. coli, allowing its activity, folding, and structure to be studied separately from those of the equatorial and intermediate domains.
  • the isolated, apical domain is functional in polypeptide binding ( Figure 2), and it facilitates protein folding even when truncated to remove its C-terminal ⁇ -helices, Hll and H12 ( Figure 3) .
  • the apical domain slows down spontaneous refolding of rhodanese and cyclophilin A by >15-fold and >150-fold, respectively ( Figures 3c-e) .
  • apical domain and barnase would form a complex of large molecular size, i.e. containing seven or 14 molecules of apical domain, one would expect a considerable degree of line broadening in the NMR spectra of barnase in the presence of apical domain. However, this is not what was observed, even not at eight-fold higher monomer concentration of apical domain than of GroEL (Zahn et al (1996) Science 271: 642- 645) .
  • the intrinsic chaperone activity of monomeric apical domain facilitates refolding of rhodanese even though physiological conditions, GroES, and ATP are required in the presence of intact GroEL.
  • the role of GroES appears to be to weaken the affinity of GroEL for substrates, and to prevent premature dissociation of aggregation prone states: there has to be a balance between tight binding for the annealing activity and weaker binding to allow folding (Corrales F J and Fersht A R (1996) Proc Natl Acad Sci USA 93_ 4509- 4512) .
  • the weaker binding of rhodanese to the fragment must be adequate for chaperoning activity and weak enough for folding.
  • the mechanism in general, may involve several components: GroES binds to the cis-end of a GroEL-substrate complex and displaces the bound polypeptide into the cavity; the dissociated and encapsulated polypeptide chain then refolds in an enlarged folding cavity to a native-like state, until it is released upon the dissociation of GroES; and, the cycle of binding and dissociation of unfolded protein, GroES, ATP, and GroEL may be repeated several times, before the substrate protein can adopt a native conformation (Hunt et al (1996) Nature 379: 37-45; Weissman et al (1996) Cell 8_4: 481-490; Weissman J S et al (1995) Cell 33_: 577-587; Mayhew M et al (1996) Nature 379: 420-426; Corrales F J and Fersht A R (1996) Proc Natl Acad Sci USA 92_ 4509-4512) .
  • Helices Hll and H12 are much less stable than the "domain core" of the apical domain, and form a separate folding unit ( Figure 4) .
  • Figure 4 At physiological temperature, about 50% of the secondary structure of the low-melting helices is unfolded and thus becomes flexible. But, in vivo, the C- terminus is not degraded, implying that the primary structure is not accessible to degradation by E. coli proteases.
  • the low stability of the C-terminal helices allows movements and larger fluctuations of the "domain cores" within the GroEL-ring. This flexible arrangement of the apical domain may be implicated in the cooperative binding of unfolded polypeptide to the seven subunits of GroEL in each ring.
  • the third 3 ⁇ 0 -helix has an unusual length of 10 residues.
  • a 3 ⁇ 0 -helix with a similar length (9 residues) has been found in aconitase (Robbins et al (1989) Proteins: Structure, Function and Genetics 5_: 289-312), an enzyme of the Calvin cycle.
  • the apical domain and aconitase also share a similar fold: a central ⁇ -sheet linked by ⁇ -helices, suggesting that they may use a common mechanism for substrate binding.
  • Figure 1 shows the amino acid sequence of the apical region of GroEL from E. coli including further amino acid residues providing an N-terminal histidine-tail (ht) .
  • Figure 2a is a bar chart showing the catalytic effect of GroEL 191-345 on the amide proton exchange of unfolded barnase. The ratio of the protection factors of the 39 measurable amide protons of barnase in the absence (P) and presence (P+(G)) of the GroEL fragment is shown.
  • Figure 2b is a bar chart similar to that of Figure 2a but for the GroEL fragment 191-345 with a 17 residue N- terminal histidine tail (sht-GroEL 191-345) .
  • Figure 2c is a bar chart similar to that of Figure 2a but for the GroEL fragment 191-376 with a 17 residue N- terminal histidine tail (sht-GroEL 191-376) .
  • Figure 2d is a plot of observed exchange rate constants in the absence of chaperonin (k ⁇ ) against those in the presence of chaperonin (/c s (- ⁇ ) ) for GroEL fragment 191- 345.
  • Global, mixed and local amide protons are displayed by circles, triangles and squares respectively.
  • Figure 2e is a plot similar to that of Figure 2d but for sht-GroEL 191-345.
  • Figure 2f is a plot similar to that of Figure 2d but for sht-GroEL 191-376.
  • Figure 3a is a bar chart and table showing the relative enzymatic activity of rhodanese after refolding in the presence or absence of GroEL, GroES, ATP, sht-GroEL 191- 345, sht-GroEL 191-376 or bovine serum albumin (BSA) .
  • BSA bovine serum albumin
  • Figure 3b is a plot showing the refolding kinetics of rhodanese in the presence of GroEL, GroES and ATP.
  • Figure 3c is a plot showing the refolding kinetics of rhodanese in the presence of various concentrations of sht-GroEL 191-345.
  • Figure 3d is a plot showing the refolding kinetics of rhodanese in the presence of various concentrations of sht-GroEL 191-376.
  • Figure 3e is a plot comparing the refolding kinetics of specified concentrations of sht-GroEL 191-376, sht-GroEL 191-345 and GroEL.
  • Figure 4a is a trace of the thermal denaturation of sht- GroEL 191-376 (upper trace) and sht-GroEL 191-345 (lower trace) monitored by far ultra violet-circular dichroism at 222nm.
  • Figure 4b is a trace as in Figure 4a but thermal denaturation is monitored by differential scanning calorimetry.
  • Figure 5a shows a three dimensional representation of the structure of sht-GroEL 191-345.
  • Figure 5b shows a three dimensional representation of the backbone structure of sht-GroEL 191-345.
  • Figure 5c shows a three dimensional representation of election densities of sht-GroEL 191-345 viewed along the helices H8 and H9.
  • Figure 6 is a trace showing the elution of denatured mutant IGPS from a column of immobilised GroEL 191-345 developed with refolding buffer.
  • Figure 7 shows the amino acid sequence of GroEL from E. coli numbered from the N-terminal methionine.
  • Figure 8 shows the amino acid sequence of a portion of GroEL wherein the residues involved in binding peptides are highlighted.
  • Figures 9a-c comprise partial amino acid sequences of known cpn 60 family members aligned with one another for comparison.
  • Example 1 Cloning and expression of the apical domain of GroEL and various of its fragments
  • the apical domain of GroEL (GroEL 191-376) and various C- terminally truncated fragments of the apical domain were cloned by polymerase chain reaction (PCR) into the polylinker site of a pRSET A vector (Invitrogen), coding for an N-terminal histidine-tail, which contained an engineered thrombin cleavage site.
  • the histidine-tail was composed of 36 amino acids (Invitrogen) or 17 amino acids.
  • the plasmid pOF39 (Fayet 0 et al (1989) J Bacteriol 171: 1379-1385) was used as a template together with two primers flanking the respective GroEL fragment with BamHl and CeoRI restriction enzyme sites. This permitted the cloning of the PCR fragment into the polylinker of pRSET A.
  • PCR was performed with Pfu (Stratagene) to reduce the risk of undesired random mutations.
  • the reaction was performed in a volume of 25 ⁇ l for 25 cycles with 400 nM or primer and 200 ⁇ M of deoxynucleoside-5'-triphosphates .
  • the annealing temperature was 65°C.
  • the following primers were used for the PCT to generate DNA encoding the apical region of GroEL and a variety of fragments thereof: 5' flanking: 5' -CGG ATC CGA AGG TAT GCA GTT CGA CCG; 3' flanking (s) ht-GroEL191-376, 5 '-CGA ATT CTT AAA CGC CGC CTG CCA GTT TCG; 3' flanking (s) ht-GroEL191-345, 5'-CGA ATT CTT AAC GGC CCT GGA TTG CAG CTT C; 3' flanking ht- GroEL191-337, 5' -CGA ATT CTT AAC CCA CGC CAT CGA TGA TAG TGG TG; 3' flanking ht-GroEL191-328, 5' -CGA ATT CTT AGT CTT TGT TGA TCA CAA CAC GTT TAG CCT GAC; 3' flanking ht- GroEL191-322
  • the N- and C-terminal ends of the generated fusion proteins namely ht-GroEL 191-298, ht-GroEL 191-322, ht- GroEL 191-328, ht-GroEL 191-337, ht-GroEL 191-345, ht- GroEL 191-376, sht-GroEL 191-345, and sht-GroEL 191-376 are indicated by rectangular arrows.
  • M13/T7-phage at a multiplicity of infection of 10 phages per cell.
  • the cells were harvested 8 h after induction, centrifuged, and resuspended in 200 ml buffer A (50 mM Tris-HCl pH 8.2, 300 mM NaCl).
  • the soluble protein fraction was added to 20 ml nickel-NTA agarose resin and stirred for 10 min.
  • the resin was washed with 200 ml buffer A, and the histidine-tail containing fusion protein was eluted with 50 ml buffer A containing 200 mM imidazole.
  • the eluted protein was precipitated with 80% ammonium sulfate, re-dissolved in 4 ml of buffer B (50 mM Tris-HCl pH 8.2, 150 mM sodium chloride), and was loaded on a HiLoad 26/60 Superdex 75 column (Pharmacia) , which was equilibrated with buffer B.
  • the fragment GroEL 191- 345 was produced by thrombin-cleavage of ht-GroEL 191-376 before gel filtration.
  • the cleaving reaction was carried out for several days in buffer B after addition of 250 ⁇ l thrombin (Sigma, lU/ ⁇ l) to the protein solution eluted from the nickel-NTA column.
  • the GroEL fragments were analyzed by quantitative amino acid analysis, N-terminal sequencing, and mass spectrometry.
  • the apical domain of GroEL (GroEL 191-376) and various fragments of the apical domain (Fig 1) were expressed in E. coli as fusion proteins containing an N-terminal histidine-tail, and this allowed for a straightforward purification using a nickel- nitrilo-tri-acetic acid (NTA) -ligated agarose resin.
  • NTA nickel- nitrilo-tri-acetic acid
  • the histidine-tail of either 39 (ht) or 17 amino acids (sht) contained a sequence of six histidine residues and a thrombin cleavage site.
  • the apical domain was expressed at >20 mg purified protein per litre of culture as were the smaller fragments lacking the C-terminal ⁇ -helices, Hll and H12.
  • the apical region and monomeric fragments were found to have chaperone activity.
  • the circular dichroism (CD) of sht-GroEL 191-376, sht-GroEL 191-345, and GroEL 191-345 in the far ultra violet (UV) region and in the near UV region indicated native-like secondary and tertiary structure, respectively.
  • the apical domain and the fragment truncated at position 345 are monomeric at micromolar concentrations, when determined by ultracentrifugation.
  • the line widths in the nuclear magnetic resonance (NMR) spectra of GroEL 191-345 are larger than expected for a 17 kD protein but small than for a stable dimer (data not shown) , indicating a fast intermolecular interaction between monomers on the NMR time scale (W ⁇ thrich, K, NMR of Proteins and Nucleic Acids (Wiley, New York, 1986) . There seems to be a low affinity self-recognition of the apical domain.
  • Figures 2d, 2e and 2f serve to demonstrate a mechanism of amide proton exchange of barnase.
  • Fig 2a and 2d there is only an estimate of the final concentration of GroEL fragment.
  • GroEL 191-345 tended to crystallise during the exchange experiment.
  • Figures 2a , 2b and 2c show that the GroEL fragments catalyse, under folding conditions, exchange of amide protons of native barnase that are known to exchange only from its fully unfolded state (Perrett et al (1995) Biochemistry 3_4: 9288-9298) .
  • the apical domain binds with high affinity to unfolded barnase, and helices Hll and H12 are not essential for polypeptide binding ( Figures 2a and 2b) .
  • the presence of the N- terminal histidine-tail did not abolish binding activity ( Figures 2b and 2c) .
  • the mechanism of exchange at pD 6.6 is EX2 ( Figure 2d), but changes to EX1 at higher pD ( Figures 2e and 2f) .
  • EX2 mechanism Human A and Nielsen S 0 (1966) Advan Protein Chem 2_1: 287-386
  • the observed rate constant is limited by the intrinsic rate constant for exchange, which depends on pD.
  • the criterion for an EX2 mechanism is that the rate constant for reprotection (ie a refolding or dissociation reaction) is much greater than the intrinsic rate constant.
  • Rhodanese refolding assays were performed using GroEL, GroES, ATP, sht-GroEL 191-345 and sht-GroEL 191-376 and carried out as described by Horowitz (Horwitz P M in Protein Stability and Folding (eds Shirley B A) 361-368 (Humana Press, 1995) ) .
  • rhodanese (9 ⁇ M) was unfolded for 45 min at 25°C in the presence of 8 M urea and 1 mM ⁇ -mercaptoethanol. Refolding was initiated by diluting 3 ⁇ l of the unfolded rhodanese into a final volume of 250 ⁇ l of a standard buffer containing 50 mM Tris-HCl pH 7.8, 50 mM sodium thiosulphate, 10 mM MgCl 2 , 10 mM KCl.
  • GroEL, GroES, ATP, apical domain (or its C-truncated form) and bovine serum albumin (BSA) were included as indicated, at final concentrations of 2.5 ⁇ M monomer, 2.5 ⁇ M monomer. 2mM, 2.5 ⁇ M and 45 ⁇ g/ml (the same concentration by weights as the GroEL fragments) , respectively.
  • BSA bovine serum albumin
  • Figure 3a shows the relative enzymatic activity of rhodanese (0.1 ⁇ M) after refolding in the presence (+) or absence (-) of GroEL (2.5 ⁇ M monomer), GroES (2.5 ⁇ M monomer), ATP (2 mM) , sht-GroEL 191-345 (2.5 ⁇ M) , sht- GroEL 191-376 (2.5 ⁇ M) , or bovine serum albumin (BSA; 45 ⁇ g/ml) , from 8 M urea (U) . 100% activity was obtained with native rhodanese (N) alone.
  • Figure 3b shows the refolding kinetics of rhodanese in presence GroEL, GroES, and ATP (•) .
  • the final concentrations are the same as in Figure 2a. 100% activity was obtained with native rhodanese (0) .
  • Figures 3c and 3d show the refolding kinetics of rhodanese in the presence of 0.18 ⁇ M (D) , 2.5 ⁇ M (•) , or 5 ⁇ M (0) sht-GroEL 191-345 and sht-GroEL 191-376, respectively.
  • Figure 3a shows a rhodanese refolding activity of about 42.5% for shtGroEL 191-345 which is about 37.5% above background refolding as shown by the control of unfolded rhodanese (U) alone.
  • the results of Figure 3b are for comparitive purposes and show a time course of refolding activity for GroEL, GroES and ATP.
  • Figure 3c shows the rhodanese refolding kinetics at various concentrations of sht-GroEL 191-345. Refolding activities of about 50% are achieved at about 25 mins.
  • Figure 3d is similar to Figure 3c but for sht-GroEL 191- 376 and showing refolding activites of about 40% after 25 mins.
  • Enzymatic activity was obtained from the absorbance at 460 nm of the complex formed between thiocyanate and ferric ion. Results correspond to the average of three different independent assays. Standard error bars are shown. b-d, same as in a, but, to stop the refolding reaction rhodanese activity was assayed in the presence of 10 mM trans-1, 2-cyclohexanediaminetetraacetate (CDTA) to inhibit GroEL activity or 0.5 mg/ml of casein to saturate the apical domain.
  • CDTA 2-cyclohexanediaminetetraacetate
  • the chaperone activity of sht-GroEL 191-345 and sht-GroEL 191-376 was tested using cyclophilin A.
  • Refolding of cyclophilin A was initiated by diluting 8 M urea denatured protein (100 ⁇ M) into 100 mM potassium phosphate buffer pH 7.0, 10 mM DTT to a final concentration of 1 ⁇ M.
  • the final concentration of GroEL and apical domain in refolding buffer was 7 ⁇ M and 4 ⁇ M or 1 ⁇ M, respectively.
  • Refolding temperature was 25°C.
  • cyclophilin activity was measured as described (Fischer G et al (1984) Biomed Biochim Acta 4J3: 1101-1111) . Spontaneous refolding of cyclophilin A occurred to a yield of about 30%, and was finished in less than 1 min. Standard error was 5%.
  • Figure 3e shows the refolding of 1 ⁇ M cyclophilin A in the presence of 7 ⁇ M GroEL monomer (0, ) , 4 ⁇ M sht-GroEL 191-345 (•), 4 ⁇ M sht-GroEFL 191-376 (D) , or 1 ⁇ M sht- GroEL 191-376 (D) . 100% activity was obtained with native cyclophilin A.
  • Figure 3e demonstrates that 100% refolding of inactivated cyclophilin A can be achieved with sht-GroEL 191-345 or sht-GroE 191-376 and that this is equivalent to that seen with GroEL.
  • UV-CD far ultra violet- circular dichroism
  • a Jasco J720 spectropolarimeter interfaced with a Neslab RTE-100 water bath, using a ther ostated cuvette (Helma) with 1 mm path length.
  • the temperature was increased at a linear rate of 50 deg/h.
  • the protein concentration was 40 ⁇ M in 10 mM sodium phosphate buffer pH 7.0.
  • Data were fitted to a denaturation curve (Pace, C N (1990) Trends Biotechnol 8_: 93-98) to determine T mr the midpoint temperature of denaturation.
  • FIG 4a shows the far UV-CD results and Figure 4b shows the DSC results.
  • sht-GroEL 191-376 is the upper trace and sht-GroEL 191-345 is the lower trace.
  • the C-terminal helices were found to be flexible.
  • the apical domain and the fragment truncated at position 345 are reversibly denatured by temperature or urea, and the denaturation is not influenced by the N-terminal histidine-tail.
  • the CD spectrum of the apical domain is identical, to that of the truncated domain.
  • the C-terminal ⁇ -helices therefore, must melt at the lower temperature and separately from the "domain core".
  • the second cooperative transition associated with the extra 31 amino acids in sht-GroEL 191-376 was confirmed by DSC ( Figure 4b) .
  • the calorimetric data are also essentially consistent with the unfolding of the apical domain as a monomer. At physiological temperature, about 50% of helices Hll and H12 are in an unfolded conformation, and thus flexible.
  • Crystals were obtained from hanging drops initially containing sht-GroEL 191-345 at 23 mg ml "1 , 11% PEG 4000, 50 mM Tris-HCl pH 8.5 and 100 mM LiS0 4 , equilibrated against reservoirs consisting of 22% PEG 4000, 100 mM Tris-HCl pH 8.5 and 200 mM LiS0.
  • the structure was solved by conventional molecular replacement methods, using the program AMORE (Navaza J (1994) Acta Crystallogr A 5_0: 157-163), and a search model consisting of residues 191-345 of the refind structure of GroEL (Braig et al (1995) Nature Struct Biol 2 ⁇ 1083-1094) .
  • the asymmetric unit contains one protein molecule, corresponding to a solvent content of 51%.
  • Model rebuilding and refinement was carried out with 0 (Jones et al (1991) Acta Crystallogr A 1_: 110-119), and the structure was refined using X-PLOR (Briinger A T X-PLOR, Version 3.1, A System for Crystallography and NMR (Yale Univ Press, New Haven, CT, 1992) ) , using Engh and Huber parameters (Engh R A & Huber R (1991) Acta Crystallogr A Vl_ 392-400) .
  • the current model contains 8 water molecules and is complete with the following exceptions, which could not be modelled due to poor or non-existent electron density: residues 302-307 and residues 337-345 from the C-terminus. Electron density for the N-terminal His-tag is also not observed. No residues have disallowed backbone ⁇ angles. Table 1 provides a summary of the crystallographic data.
  • R- factor/ free R- factor (%), F 0* 21.4/29.1 r.m.s.d bond length (A) 0.006 r.m.s.d bond angle (deg) 1.42
  • Example 7 The three dimensional structure of sht-GroEL 191-345 Figure 5a shows the secondary structure representation drawn with MolScript (Kraulis P J (1991) J Appl Crystallogr 24_: 946-950) and Raster3D (Merrit E A and Murphy M E P (1994) Acta Crystallogr D 50: 869-873). Helices are labelled as in Braig et al (Braig et al
  • N and C refer to the N-terminus (residue 191) and C-terminus (residue 336) of the model.
  • Figure 5b is a backbone representation drawn with programm 0 (Jones et al (1991) Acta Crystallogr A 47: 110-119) , in same orientations as Figure 5a colour-coded according to B-factor of main-chain atoms: blue (20 A 2 ) to red (60 A 2 ) .
  • Figure 5c shows a representative region of electron density, calculated using refined co-ordinates, viewed along the helices H8 and H9.
  • sht-GroEL 191-345 has the same fold as the corresponding region of the intact GroEL protein ( Figure 5a) : two orthogonal ⁇ -sheets forming a ⁇ -sandwich, flanked by three ⁇ -helices. The structure is more ordered and better resolved than is the apical domain of the intact protein ( Figure 5b) : the average B-factor is 42 A 2 , compared with 97 A 2 for residues 191-336 of the GroEL structure.
  • Figure 5b the average B-factor is 42 A 2 , compared with 97 A 2 for residues 191-336 of the GroEL structure.
  • the structure can be described as a well ordered ⁇ -sandwich scaffold, flanked by relatively flexible helical and loop regions.
  • the B-factors of most of the ⁇ -strand, ⁇ - helix, and loop structure is about 20, 40, and 60 A 2 , respectively ( Figure 5b) .
  • the apical domain of GroEL (GroEL (191-376) ) and the "core" of the apical domain, GroEL (191-345) , were cloned and expressed in E. coli as fusion proteins containing a 17-residue N-terminal histidine-tail. . The fragments were immobilised by two methods.
  • Ni-NTA resin (from QIAGEN) is a chelating adsorbant composed of a high surface concentration of nitrilo-tri- acetic acid (NTA) ligand attached to Sepharose CL-6B.
  • NTA nitrilo-tri- acetic acid
  • the NTA occupies four of six ligand binding sites in the coordination sphere of the Ni 2+ ion, leaving two sites free to interact with the six histidines in the N- terminal tail.
  • the stability of the 6 x His/Ni-NTA interaction is unaffected by strong denaturants such as 6M guanidine hydrochloride or 8M urea, or the presence of low levels of ⁇ -mercaptoethanol (1-10 mM) .
  • 3.5 mL of Ni-NTA resin were equilibrated with 0.1M potassium phosphate at pH 7.8, containing 5 mM ⁇ - mercaptoethanol .
  • the GroEL domain was added to saturation of the affinity gel (21 mg of protein per 3.5 mL of gel) and incubated at room temperature for 30 min with gentle mixing.
  • the gel was packed in a column suitable for FPLC (5 x 100 mm, from Pharmacia) and thoroughly washed with the initial buffer.
  • the binding capacity of the gel was reduced by controlled hydrolysis of the activated gel before coupling.
  • 300 mg of freeze- dried powder were suspended in 50 mM of NaHC0 3 pH 8.3, washed with the same buffer and re-swollen on a sintered glass filter (G3), then suspended in the buffer and mixed in an end-over-end shaker for 4 h at room temperature.
  • the apical domain dissolved in the coupling buffer (0.1M NaHC03, pH 8.3 and 0.5 M NaCl), was added to the gel suspension (10 mg protein/mL gel) and mixed in an end-over-end shaker for 6 h at room temperature. It was then washed with the coupling buffer.
  • the remaining active groups were blocked by adding 2.5 M ethanolamine pH 8 and shaking for 4 h at room temperature. Uncoupled apical domain was removed by washing with five cycles of alternately high and low pH buffer solution (Tris-HCl 0.1M pH 7.8 containing 0.5M NaCl followed by acetate buffer (0.1M, pH 4 plus 0.5M NaCl) . The gel was finally washed with 0.1 M potassium phosphate at pH 7.8, containing 5 mM 2-mercaptoethanol (refolding buffer) . The immobilised fragments were used in a chromatography column ( Figure 6) . Denatured protein in urea was added to the column and developed with a refolding buffer.
  • IGPS indoleglycerol phosphate synthase
  • a column of ht-GroEL 191-345 immobilised on Ni-NTA agarose (3.0 mL) was loaded with 2 nmol of IGPS (49-252) dissolved in 20 ⁇ L of 8 M urea, and the column was developed with refolding buffer (0.1 M potassium phosphate at pH 7.8, containing 5 mM 2- mercaptoethanol) using a Waters 625 LC HPLC system.
  • refolding buffer 0.1 M potassium phosphate at pH 7.8, containing 5 mM 2- mercaptoethanol
  • This experiment uses batchwise mixing of materials.
  • a solution of denatured cyclophilin in 8 M urea was diluted 100 fold with Ni-NTA-immobilised GroEL (191-345) in a refolding buffer (see Table II below) . After gentle mixing for 30 min, the cyclophilin was recovered in 84% yield of protein.
  • Control experiments (Table II) in which the protein was denatured in 8 M urea and added to the refolding buffer alone, or mixed with agarose that had not been linked to GroEL fragments, gave cyclophilin that had only 20% of the activity of the starting material before denaturation.
  • the cyclophilin used for the denaturation experiments had 88% of the activity of the purest samples previously obtained by the inventors.
  • GroEL (191-345) -agarose resin (covalently linked) was 126% of that of the previous purest samples. Further, the total recovery of activity was 25% more than that initially present. Thus, the immobilised GroEL fragment had "reconditioned" the cyclophilin A by converting inactive material into active.
  • suspension of 200 ⁇ L of gel was mixed with refolding buffer (100 mM phosphate buffer, pH 7.8 plus 5 mM 2-mercaptoethanol) to give a volume of 990 ⁇ L.
  • refolding buffer 100 mM phosphate buffer, pH 7.8 plus 5 mM 2-mercaptoethanol
  • the gel suspension was centrifuged to separate the supernatant ( ⁇ 800 ⁇ L) .
  • the gel pellet was washed in miniprep columns and the eluate added to the supernatant to give about 900 ⁇ L.
  • the protein concentration was determined from the A 2 8O nm. Cyclophilin activity was measured in the supernatant as described .in (Makino Y et al (1993) FEBS Lett 3_36_: 363-367) .
  • the sample prior to denaturation had 88% of the specific activity of the highest activity of native cyclophilin previously obtained by the inventors.
  • the control is agarose alone, without Ni. The results are shown in table II below.
  • Glucosamine 6-phosphate deaminase (6 x 30 kDa) (Oliva G et al (1995) Structure 3_: 1323-1332) normally regains only 10 % or less activity after renaturation from urea denaturation. A 100 % yield was obtained on batchwise treatment with GroEL (191-345) - agarose. Further, a sample that had lost all activity on storage in solution in 50% glycerol/water at -20 °C for 5 years also regained 100 % activity with this treatment after dissolving in urea.
  • the X-ray crystal structure of GroEL 191-376 with the 17 residue N-terminal tail shows that seven residues of the tail of one molecule bind in the active site of the other. Residues 230-271 are in the binding site. All residues are shown in Figure 8 in which large, bold and underlined residues are those detected by X-ray crystal structure of ht GroEL 191-376 as being involved in protein binding.
  • the X-ray crystal structure shows that 193-336 should be reasonably stable.
  • the 193-337 fragment was cloned and expressed and found to be stable. Therefore, residues 191 and 192 may be omitted.
  • Polypeptide fragments of E. coli GroEL were synthesised on a protein synthesiser according to methods and using equipment well known to a person skilled in the art. Longer fragments were generated by synthesising smaller fragments and then linking them together. The fragments were then tested by their ability to refold rhodanese, cyclophilin A, glucosamine-6-phosphate deaminase and IGPS 49-252 as described above.
  • Figure 9 shows sequences in OWL database release 28.1 containing clear homology to apical domain of GroEL
  • OWL is a non redundant database merging SWISS-PROT, PIR (1-3), GenBank (translation) and NRL-3D.
  • Consensus sequence residues 230-271 inclusive) containing peptide-binding site (as identified by crystal structure analysis and polypeptide binding studies) .
  • X residue in peptide- binding site in the X-ray crystal structure of mini chaperone. The GroEL E. coli chaperone sequence is shown in italics.

Abstract

An E. coli GroEL polypeptide fragment (191-376) being the optical domain is coupled to a chromatographic resin (Ni-NTA Agarose). A denatured solution of cyclophilin A of known specific activity prior to denaturation is loaded onto a column of the GroEL polypeptide resin and eluted in a refolding buffer. The cyclophilin A is refolded (reconditioned) such that its specific activity is > 100 % of what it was prior to denaturation.

Description

Chaperone Fragments
The present invention relates to chaperone polypeptides which are active in the folding and maintenance of structural integrity of other proteins. The invention also relates to nucleic acids encoding chaperone polypeptides, vectors comprising these nucleic acids, host cells modified with the nucleic acids or vectors so as to express the chaperone polypeptides. The invention further relates to methods of making chaperone polypeptides whether by synthetic or recombinant means, pharmaceutical compositions comprising the chaperone polypeptides or nucleic acids encoding the same and the use of chaperone polypeptides in the treatment of disease or in the reconditioning of biologically active materials. The invention also relates to antibodies reactive against chaperone polypeptides and their use in medicine and diagnostics.
Chaperones are in general known to be large multisubunit protein assemblies essential in mediating polypeptide chain folding in a variety of cellular compartments. Families of chaperones have been identified, for example the chaperonin hspβO family otherwise known as the cpnβO class of proteins are expressed constitutively and there are examples to be found in the bacterial cytoplasm (GroEL) , in endosymbiotically derived mitochondria (hspβO) and in chloroplasts (Rubisco binding protein) . Another chaperone family is designated TF55/TCP1 and found in the thermophilic archaea and the evolutionarily connected eukaryotic cytosol. A comparison of amino acid sequence data has shown that there is at least 50% sequence identity between chaperones found in prokaryotes, mitochondria and chloroplasts (Ellis R J and Van der Vies S M (1991) Ann Rev Biochem >0: 321-347) .
A typical chaperonin is GroEL which is a member of the hspβO family of heat shock proteins. GroEL is a tetradecamer wherein each monomeric subunit (cpnβOm) has a molecular weight of approximately 57kD. The tetradecamer facilitates the in vi tro folding of a number of proteins which would otherwise misfold or aggregate and precipitate. The structure of GroEL from E. coli has been established through X-ray crystallographic studies as reported by Braig K et al (1994) Nature 371: 578-586. The holo protein is cylindrical, consisting of two seven- membered rings that form a large central cavity which according to Ellis R J and Hartl F U (1996) FASEB Journal 10: 20-26 is generally considered to be essential for activity. Some small proteins have been demonstrated to fold from their denatured states when bound to GroEL (Gray T E and Fersht A R (1993) J Mol Biol 232: 1197- 1207; Hunt J F et al (1996) Nature 379: 37-45; Weiss an J S et al (1996) Cell 8±: 481-490; Mayhew M et al (1996) Nature 379: 420-426; Corrales F J and Fersht A R (1995) Proc Nat Acad Sci 92_: 5326-5330) and it has been argued that a cage-like structure is necessary to sequester partly folded or assembled proteins (Ellis R J and Hartl F U (1996) supra .
The entire amino acid sequence of E. coli GroEL is also known (see Braig K et al (1994) supra) and three domains have been ascribed to each cpnδOm of the holo chaperonin (tetradecamer) . These are the intermediate (amino acid residues 1-5, 134-190, 377-408 and 524-548), equatorial (residues 6-133 and 409-523) and apical (residues 191- 376) domains. Monomers of GroEL have been induced by urea or pressure, but they are inactive and have to reassociate to form the central cavity in order to facilitate the refolding of rhodanese (Mendoza J A et al (1994) J Biol Chem 269:
2447-2451; Ybarra J and Horowitz P M (1995) J Biol Chem 270: 22962-22967) .
GroEL facilitates the folding of a number of proteins by two mechanisms; (1) it prevents aggregation by binding to partly folded proteins (Goloubinoff P et al (1989) Nature 342: 884-889; Zahn R and Plϋckthun A (1992) Biochemistry 31: 3249-3255), which then refold on GroEL to a nativelike state (Zahn R and Plϋckthun A (1992) Biochemistry 31: 3249-3255; Gray T E and Fersht A R (1993) J Mol Biol 232: 1197-1207); and (2) it continuously anneals misfolded proteins by unfolding them to a state from which refolding can start again (Zahn R et al (1996) Science 271 : 642-645) . Some mutations in the apical domain led to a decrease in polypeptide binding (Fenton A et al (1994) Nature 371: 614-619), suggesting that this domain is involved in the binding of polypeptides. Electron microscopy suggests that denatured protein binds to the inner side of the apical end of the GroEL-cylinder (Chen S et al (1994) Nature 371.: 261-264) . The equatorial domain has been shown from the 2.4 A crystal structure of ATPγS-ligated GroEL (Boisvert D C et al (1996) Nature Structure Biology 3_: 170-177) and mutagenesis studies (Fenton W A et al (1994) Nature 371: 614-619) to have the nucleotide binding sites. Binding and hydrolysis of ATP is cooperative (Bochkareva E S et al (1992) J Biol Chem 267: 6796-6800; Gray T E and Fersht A R (1991) FEBS Lett 292: 254-258), and lowers the affinity for polypeptides (Jackson G S et al (1993) Biochemistry 3_2^ 2554-2563) . Most of the intermolecular contacts between the subunits of GroEL are between the equatorial domain. The intermediate domain connects the other two domains, transmitting allosteric effects (Braig K et al (1994) Nature 3 1: 578-586; Braig K et al (1995) Nature Struct Biol 2 : 1083-1094) .
The crystal structure of GroEL shows unusually high B- factors for the apical domain compared with the equatorial or intermediate domain, and the B-f actors vary considerably within the domain (Braig K et al (1994) Nature 371: 578-586; Braig K et al (1995) Nature Struct Biol 2 : 1083-1094; Boisvert D C et al (1996) Nature Structure Biology 3_: 170-177) . The high overall B-factor seems to result from a static disorder within the asymmetric unit and probably throughout the crystals of GroEL, and has been attributed to rigid-body movements generated by hinge-like J3-sheets in the intermediate domain. Regions of high flexibility have also been observed in the 2.8A structure of the co-chaperonin GroES (Hunt J F et al (1996) Nature 79: 37-45) . A mobile loop has been shown to be directly involved in ADP-dependent binding to the apical domain (Landry S J et al (1993) Nature 364: 255-258) . Binding of GroES leads to a conformational change of GroEL and a concomitant enlargement of the GroEL-cavity (Chen S et al (1994) Nature 371 : 261-264), in which the encapsulated polypeptide substrate can refold to a native-like state without the danger of aggregation (Martin J et al (1993) Nature 3_66: 228-233; eissman J S et al (1995) Cell 3: 577-587) . Monomeric forms of GroEL have been induced by site- directed mutagenesis and expressed and although these bind to rhodanese they do not affect its refolding (White Z W et al (1995) J Biol Chem 210^: 20404-20409).
Yoshida et al (1993) FEBS 3 6: 363-367 report that a 34kD proteolytic fragment of E. coli GroEL which lacks 149 NH2-terminal residues and ~93 COOH-terminal residues (GroEL 150-456) facilitates refolding of denatured rhodanese in the absence of GroES and ATP. Although the proteolytic fragment GroEL 150-456 elutes as a monomer during gel filtration, it still comprises the apical domain and significant portions of the intermediate and equatorial domains, the latter of which determine the intersubunit contacts of GroEL (Braig K et al (1994) supra) , thus allowing transient formation of the central cavity thereby accounting for the chaperonin activity which is observed.
In any event, the mode of rhodanese refolding by GroEL
150-456 is very different from that brought about by the holo protein; the yield of productive refolding is low, folding is rapidly saturated with time, and it is not affected by GroES and ATP. Efficient release and folding requires the hydrolysis of ATP (Landry S J et al (1992) Nature 3_55: 455-457; Gray T E and Fersht A R (1992) FEBS Lett 282: 254-258; Jackson G S et al (1993) Biochemistry 32: 2554-2563; Todd M et al (1993) Biochemistry 3^: 8560- 8567.)
EP-A-0 650 975 (NIPPON OIL CO LTD) discloses chaperoin molecules and a method of refolding denatured proteins using GroEL chaperonin 60 monomers (cpn60m) obtained from Thermus thermophil us . The holo-chaperonin was first extracted and then purified from the bacterial source according to the method of Taguchi et al (1991) J Biol Chem 266: 22411-22418. The cpn60m was then produced by treatment of the holo-chaperonin with trifluoroacetic acid (TEA) followed by reverse phase (rp) HPLC of the resulting denatured protein. A peak fraction containing the approximately 57kD cpn60m was obtained. The refolding activity of the cpn60m was assayed in solution by monitoring the regain in activity of inactivated rhodanese, which in specific activity terms amounted to about only 25% of the specific activity of the rhodanese prior to inactivation. When background spontaneous rhodanese refolding is subtracted then there is only an approximately 20% refolding activity.
As well as cpn60m, EP-A-0 650 975 also discloses the use of an approximately 50kD N-terminal deletion fragment of cpn60m wherein the N-terminal amino acid residues up to (but not including) the Thr residue at position 79 are removed by proteolysis. This 50kD fragment showed an approximately 35% (about 30% when background is subtracted) rhodanese refolding activity when in solution.
Taguchi H et al (1994) J Biol Chem 26_9: 8529-8534 is a scientific report on which the invention of EP-A-0 650 975 is based. A transiently formed GroEL tetradecamer (the holo-chaperonin) was perceived to exist when the chaperonin monomers are present in solution. Consequently, the refolding activity of these preparations can be seen to be caused by the presence of holo chaperonin, not monomers. To test this, Taguchi et al immobilised cpn60m to a chromatographic resin to exclude the possibility of holo chaperonin formation. When immobilised and therefore when in truly monomeric form, cpnβOm exhibited only about 10% rhodanese refolding activity.
The refolding of rhodanese has been a common and convenient assay to determine chaperonin activity but it has been observed that there are significant problems with the assay which cast serious doubt on existing assertions of refolding activity based on this assay. The fact is that rhodanese refolds spontaneously in the absence of molecular chaperones with the yield of refolded rhodanese increasing progressively as the rhodanese concentration decreases (see Taguchi et al (1994) supra) . The 10% of rhodanese refolding activity reported in EP-A-0 650 975 for immobilised (truly monomeric) cpn60m is therefore too close to the spontaneous regain of activity by rhodanese to demonstrate that any monomeric chaperonin has a refolding activity towards proteins generally, let alone cpn60m and rhodanese.
Alconada A and Cuezva J M (1993) TIBS 3^: 81-82 suggested that an "internal fragment" of GroEL may possess a chaperone activity on the basis of amino acid sequence similarity between the altered mRNA stability (ams) gene product (Ams) of E. coli and the central part of GroEL. The ams locus is a temperature-sensitive mutation that maps at 23 min on the E. coli chromosome and results in mRNA with an increased half-life. The ams gene has been cloned, expressed and shown to complement the ams mutation. The gene product is a 149-amino acid protein (Ams) with an apparent molecular weight of 17kD. Chanda P K et al (1985) J Bacteriol 161: 446-449 found that a 17kD protein fragment corresponding to part of the L gene of the groE operon, when expressed in E. coli ams mutants restores the wild-type phenotype. This 17kD fragment was suggested as being an isolated, functional chaperonin protein module. The amino acid sequences of three chaperonins (E. coli GroEL, ribulose bisphosphate carboxylase (RUBPC) subunit-binding protein from Tri ticum aestivum and Saccharomyces cerevisiaet mitochondrial hsp60) were compared with the sequence of Ams. Residues 307-423 were found to correspond substantially between Ams and GroEL. These residues comprise nearly equivalent portions of both the intermediate and apical domains of GroEL.
The sequence alignments of Ams protein with the chaperonins noted above reveals a striking similarity (98%) between the amino-terminal four-fifths of Ams and the central part (approximately one-fifth) of E. coli GroEL chaperonin. The 50% sequence similarity between the Ams amino terminal region and the two other chaperonins is in line with the reported identity among the chaperonin family. The carboxy-terminal part of the Ams protein showed no similarity with chaperonins (<10% homology) .
The present inventors have identified a need for a simple, truly monomeric chaperone molecule which is of defined sequence and structure and which can efficiently and reproducibly refold, renature, reactivate or recondition proteins from a range of sources in the absence of added cofactor or other agents and without the need to associate to form a holo chaperonin. A problem which the invention seeks to solve is therefore how to provide an active portion or fragment of a chaperone in truly monomeric form so as to promote a useful and efficient reagent for the refolding, renaturing or reconditioning of biological molecules, particularly proteins. A further object is the provision of such a monomeric form at minimum size.
In first aspect the present invention provides a chaperone polypeptide having an amino acid sequence selected from at least amino acid residues 230-271 but no more than residues 150-455 or 151-456 of a GroEL sequence substantially as shown in Figure 7, or a corresponding sequence of a substantially homologous chaperone polypeptide, or a modified, mutated or variant thereof having chaperone activity.
The amino acid sequence is preferably selected from at least amino acid residues 193-337, more preferably 191- 345, even more preferably 191-376 but no more than residues 151-455. The invention therefore includes polypeptides being GroEL amino acid residues 230-271, 230-272 ...et seq... 230-455 and in like manner residues 230-271, 229-271 . . . et seq. . . 151-271. Also, residues 230-271, 229-272 . . . et seq. . . 151-351, 151-352 ...et seq. . . 151-455. All amino acid sequences of 42 or more residues comprising at least contiguous residues 230-271 and not exceeding 151-455 are within the scope of this aspect of the invention eg 171-423 or 166-406.
There are four key properties that may characterise a protein as a molecular chaperone (1) suppression of aggregation during protein folding; (2) suppression of aggregation during protein unfolding; (3) influence on the yield and kinetics of folding; and (4) effects exerted at near stoichiometric levels.
Chaperone activity may be determined in practice by an ability to refold cyclophilin A but other suitable proteins such as glucosamine-6-phosphate deaminase or a mutant form of indoleglycerol phosphate synthase (IGPS) (amino acid residues 49-252) may be used. A rhodanese refolding assay may also be used. Details of suitable refolding assays are described in more detail in the specific examples provided hereinafter.
In second aspect the invention provides monomeric polypeptide having chaperone activity and incapable of multimerisation in solution.
In third aspect the invention provides a chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including the protein binding active site motif:
1 X X X 2 X X 3 4 XX 5 X X X X X X X XX X X X X X X
6 X X 7 8 X 9 10 11 X X 12 X 13 14
wherein 1 is selected from amino acid residues:
I, M, L, V, S, F or A; wherein 2 is selected from: L, I, P, V or A; wherein 3 is selected from: L, E, V, H or I; wherein 4 is selected from: E, A, R, L, Q, or N; wherein 5 is selected from: A V, I, ty L, N, S, R, T, Qcr K; wherein 6 is selected from: E, D or G; wherein 7 is selected from: A, P, S, T, G or L; wherein 8 is selected from: T, A, N, S or V; wherein 9 is selected from: V, L, I or A; wherein 10 is selected from: V, L, I, F or H; wherein 11 is selected from: N, S or L; wherein 12 is selected from: R, K, N, Q, L or S; wherein 13 is selected from: I, T, S, G, V, A, O K, FαrP; wherein 14 is selected from: V, I, L, F, D or T; and
wherein the X's represent a peptide bond or bonds or at least one amino acid residue,
or a functional variant thereof in which one or more of the numbered amino acid residues 1 to 14 has undergone a conservative substitution.
In fourth aspect the invention provides a chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including at least one protein binding active site motif moiety selected from:
(a) 1 X X X 2 X X 3 4 X and
(b) X 5 X X X X X X X X X X X X X X X 6 X X 7 8 X 9 10 11 X X 12 X 13 14
wherein 1 is selected from amino acid residues: I, M, L, V, S, F or A; wherein 2 is selected from: L, I, P, V or A; wherein 3 is selected from: L, E, V, H or I; wherein 4 is selected from: E, A, R, L, Q, or N; wherein 5 is selected from: A V, I, M, L, N, S, R, T, Q or K; wherein 6 is selected from: E, D or G; wherein 7 is selected from: A, P, S, T, G or L; wherein 8 is selected from: T, A, N, S or V; wherein 9 is selected from: V, L, I or A; wherein 10 is selected from: V, L, I, F or H; wherein 11 is selected from: N, S or L; wherein 12 is selected from: R, K, N, Q, L or S; wherein 13 is selected from: I, T, S, G, V, A, & ty K, ForP; wherein 14 is selected from: V, I, L, F, D or T; and
wherein X is at least one amino acid residue,
or a functional variant thereof in which one or more of the numbered amino acid residues 1 to 14 has undergone a conservative substitution.
In fifth aspect the claim provides a chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including the protein binding active site motif:
I X X X L X X L E X X A X X X X X X X X X X X X X X X E X X A T X VVN X X RX I V
wherein X is at least one amino acid residue, or a functional variant thereof in which one or more of the specified amino acid residues has undergone a conservative substitution.
In sixth aspect the claims provides a chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including at least one protein binding active site motif moiety selected from:
(a) I X X X L X X L E X (b) XAX X X X X X X X X X X X X X X E X X A T X VVN X X RXI V
wherein X is at least one amino acid residue, or a functional variant thereof in which one or more of the specified amino acid residues has undergone a conservative substitution.
A conservative substitution is the replacement of one amino acid residue for another chemically or functionally similar amino acid residue such that the function of the polypeptide overall remains substantially unchanged.
The terms "refold", "reactivate" and "recondition" are not intended as being mutually exclusive. For example, an inactive protein, perhaps denatured using urea may have an unfolded structure. This inactive protein may then be refolded with a polypeptide of the invention thereby reactivating it. In some circumstances there may be an increase in the specific activity of the refolded/reactivated protein compared to the protein prior to inactivation/denaturation: this is termed "reconditional" .
Preferably, the active site motif or an active site motif moiety includes the conserved sequence:
PLL(V) I (V) IA(S)EDV(I)EGEAL
in which amino acid symbols in parenthesis are alternatives to the immediately preceding symbol reading left to right. In seventh aspect the invention provides monomeric polypeptide having chaperone activity and incapable of multimerisation characterised in that in the absence of ATP the polypeptide has a protein refolding activity of more than 50%, preferably 60%, even more preferably 75%, said refolding activity being determined by contacting the polypeptide with an inactivated protein of known specific activity prior to inactivation, and then determining the specific activity of the said protein after contact with the polypeptide, the % refolding activity being:
specific activity of protein after contact with polypeptide x 100 specific activity of protein prior to inactivation 1
Preferably, the chaperone activity is determined by the refolding of cyclophilin A. More preferably, 8M urea denatured cyclophilin A (lOOμM) is diluted into lOOmM potassium phosphate buffer pH7.0, lOmM DTT to a final concentration of lμM and then contacted with at least lμM of said polypeptide at 25°C for at least 5 min, the resultant cyclophilin A activity being assayed by the method of Fischer G et al (1984) Bio ed Biochim Acta 43: 1101-1111.
The polypeptide is preferably an hsp60 polypeptide, preferably a GroEL polypeptide.
In eighth aspect the invention provides a polypeptide as claimed in any preceding claim which comprises at least an amino acid sequence selected from GroEL residues:
(a) 191-329, 191-330, 191-331, 191-332, 191-333, 191- 334, 191-335, 191-336, 191-337, 191-338, 191-339, 191-340, 191-341, 191-342, 191-343, 191-344, 191- 345, 191-346, 191-347, 191-348, 191-349, 191-350, 191-351, 191-352, 191-353, 191-354, 191-355, 191- 356, 191-357, 191-358, 191-359, 191-360, 191-361, 191-362, 191-363, 191-364, 191-365, 191-366, 191- 367, 191-368, 191-369, 191-370, 191-371, 191-372, 191-373, 191-374, 191-375 or 191-376, or
(b) 192-329, 192-330, 192-331, 192-332, 192-333, 192- 334, 192-335, 192-336, 192-337, 192-338, 192-339,
192-340, 192-341, 192-342, 192-343, 192-344, 192- 345, 192-346, 192-347, 192-348, 192-349, 192-350, 192-351, 192-352, 192-353, 192-354, 192-355, 192- 356, 192-357, 192-358, 192-359, 192-360, 192-361, 192-362, 192-363, 192-364, 192-365, 192-366, 192- 367, 192-368, 192-369, 192-370, 192-371, 192-372, 192-373, 192-374, 192-375 or 192-376, or
(c) 193-329, 193-330, 193-331, 193-332, 193-333, 193- 334, 193-335, 193-336, 193-337, 193-338, 193-339,
193-340, 193-341, 193-342, 193-343, 193-344, 193- 345, 193-346, 193-347, 193-348, 193-349, 193-350, 193-351, 193-352, 193-353, 193-354, 193-355, 193- 356, 193-357, 193-358, 193-359, 193-360, 193-361, 193-362, 193-363, 193-364, 193-365, 193-366, 193- 367, 193-368, 193-369, 193-370, 193-371, 193-372, 193-373, 193-374, 193-375 or 193-376, or
(d) 230-271, 229-271, 229-272, 228-272, 228-273, ...et seq. . . 194-328, 194-329, or
the equivalent residues of substantially homologous chaperonins, or a modified, mutated or variant sequence thereof. A preferred polypeptide has the amino acid sequence 191- 345 or 191-376, more preferably 193-337 of GroEL, or the equivalent residues of substantially homologous chaperonins, or a modified, mutated or variant sequence thereof.
The polypeptide preferably has a molecular weight of less than 34kDa.
"Modifications" include chemically modified polypeptides for example. "Variants" include, for example, naturally occurring variants of the kind to be found amongst a population of hsp60 chaperonin harbouring organisms/cells as well as naturally occurring polymorphisms or mutations. "Mutations" may also be introduced artificially by processes of mutagenesis well known to a person skilled in the art.
In being "substantially homologous" peptides may have at least 50% amino acid sequence homology with the specified GroEL amino acid sequences, preferably at least 60% homology and more preferably 75% homology. Homology may of course also reside in the nucleotide sequences for the polypeptide which may be at least 50%, preferably at least 60% homologous and more preferably 75% homologous with the nucleotide sequence encoding the specified GroEL amino acid residues.
The hspβO class of chaperonin proteins are generally homologous in structure and so there are therefore conserved or substantially homologous amino acid sequences between the members of the class. GroEL is just an example of an hsp60 chaperonin protein; other suitable proteins having an homologous apical domain may be followed.
The list was compiled from the OWL database release 28.1. The sequences listed below show clear homology to apical domain (residues 191-375) in PDB structure pdblgrl.ent.
OWL is a non redundant database merging SWISS-PROT, PIR (1-3) , GenBank (translation) and NRL-3D.
190-374 CH60_ECOLI 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (AMS) . - ESCHERICHIA 190-374 CH60_SALTI 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - SALMONELLA TYPHI. 191-375 S56371 GroEL protein - Escherichia coli 190-374 CH60_LEPIN 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (HEAT SHOCK 58 KD PRO 191-375 S47530 groEL protein - Porphyromonas gingivalis 190-374 LPNHTPBG LPNHTPBG NID:gl49691 - Legionella pneumophila (strain SVir) (library: 189-373 CH60_ACTAC 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - ACTINOBACILLUS ACT 191- 375 JC4519 heat-shock protein GroEL - Pasteurella multocida
191-375 CH60_BRUAB 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN). - BRUCELLA ABORTUS. 191-375 CH60_HAEIN 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN).- HAEMOPHILUS INFLUE 190-373 CH60_CAUCR 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN).- CAULOBACTER CRESCE 190-374 CH60_AMOPS 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN).- AMOEBA PROTEUS SYM 191-375 CH60_HAEDU 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - HAEMOPHILUS DUCREY 191-375 CH61JRHIME 60 KD CHAPERONIN A (PROTEIN CPN60 A) (GROEL PROTEIN A) . - RHIZOBIUM ME 190-374 CH60JLEGMI 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (58 KD COMMON ANTIGEN 191-375 CH60_YEREN 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (HEAT SHOCK PROTEIN 6) 190-374 CH 63_BRAJA 60 KD CHAPERONIN 3 (PROTEIN CPN60 3) (GROEL PROTEIN 3). - BRADYRHIZOBI 191- 375 CH60_PORGI 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - PORPHYROMONAS GING 191-375 S52901 heat shock protein 60K - Yersinia enterocolitica 191-375 S26423 heat shock protein 60 - Yersinia enterocolitica
191-375 RSU373691 RSU37369 NID: gl208541 - Rhodobacter sphaeroides strain=HR. 190-374 CH62_BRAJA 60 KD CHAPERONIN 2 (PROTEIN CPN60 2) (GROEL PROTEIN 2). - BRADYRHIZOBI 191-375 CH60_ACYPS 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (SYMBIONIN) . - ACYRTH 191-375 CH63_RHIME 60 KD CHAPERONIN C( PROTEIN CPN60 C) (GROEL PROTEIN C) . - RHIZOBIUM ME 191-375 YEPHSPCRP1 YEPHSPCRP NID: g466575 - Yersinia enterocolitica DNA. 191-375 CH60_BORPE 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - BORDETELLA PERTUSS 189-373 BRUGROl BRUGRO NID: gl44106 - Brucella aabortus (library: lambda-2001) DNA.
191-375 CH60_PSEAE 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - PSEUDOMONAS AERUGI 190-374 CH60_BARBA 60 KD CHAPERONIN (PROTEIN CPN60) (IMMUNOREACTIVE PROTEIN BB65) (IMMUNO 191-375 BAOBB63A BAOBB63A NID: gl43845 - Bartonella bacilliformis (library: ATCC 35685) 189-373 CH60_BACST 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - BACILLUS STEAROTHE 188-372
190-373 CH60_BORBU 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - BORRELIA BURGDORFE 224-408 S26583 chaperonin hsp60 - maize 190-373 A49209 heat shock protein HSP60 - Lyme disease spirochete 224-408 MZECPN60B MZECPN60B NID: g309558 - Zea mays (strain B73 )( library : Dashll of P.S 189-373 CH60_THEP3 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (HEAT SHOCK 61 KD PRO 188-372 CH60_STAEP 60 KD CHAPERONIN ("PROTEIN CPN60) (GROEL PROTEIN) (HEAT SHOCK PROTEIN 6 189-373 CH60_LACLA 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - LACTOCOCCUS LACTIS 188-374 CH61_STRAL 60 KD CHAPERONIN 1 (PROTEIN CPN60 1) (GROEL PROTEIN 1) (HSP58) . - STRE 191-375 CH60_CHLPN 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - CHLAMYDIA PNEUMONI 224-408 MZECPN60A MZECPN60A NID: g309556 - Zea mays (strain B73) (library : Dach 11 of P. 190-373 HECHSPABl HECHSPAB NID: g712829 - Helicobacter pylori (individual_isolate 85P) D 221-405 CH60_ARATH MITOCHONDRIAL CHAPERONIN HSP60 PRECURSOR. - ARABIDOPSIS THALIANA (MOUS 224-408 CH60_MAIZE MITOCHONDRIAL CHAPERONIN HSP60 PRECURSOR. - ZEA MAYS (MAIZE) . 190-374 CH60_CHLTR 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (57 KD CHLAMYDIAL HYP 189-373 CH60_STAAU 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (HEAT SHOCK PROTEIN 6 189-373 CH60_CLOPE 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - CLOSTRIDIUM PERFRI 212-397 HS60_YEAST HEAT SHOCK PROTEIN 60 PRECURSOR (STIMULATOR FACTOR 1 66 KD COMPONENT) 217-403 CH60_PYRSA 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - PYRENOMONAS SALINA 191-377 CH60_EHRCH 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN).- EHRLICHIA CHAFFEEN 191-375 CHTGROE1 CHTGROE NID: gl44503 - C . trachomatis DNA. 188- 372 CH60_THETH 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - THERMUS AQUATICUS 189-373 TAU294831 TAU29483 NID: gll22940 - T ermus aquaticus. 190-378 CH60J ICTS 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (MAJOR ANTIGEN 58) (5 189-375 SYCCPNC SYCCPNC NID: gl001102 - Synechocystis sp. (strain PCC6803,) DNA.
190-373 CPU308211 CPU30821 NID: gl016083 - Cyanophora paradoxa. 189-373 CH61_MYCLE 60 KD CHAPERONIN 1 (PROTEIN CPN60 1) (GROEL PROTEIN 1) . - MYCOBACTERIU 239-423 PSU21139 PSU21139 NID: g806807 - pea. 191-377 CH60_COWRU 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - COWDRIA RUMINANTIU 245-429 RUBB_BRANA RUBISCO SUBUNIT BINDING-PROTEIN BETA SUBUNIT PRECURSOR (60 KD CHAPERON 144-328 SCCPN60 SCCPN60 NID: gll67857 - rye.
153-338 CH60_EHRRI 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (55 KD MAJOR ANTIGEN) 245-429 RUBB_ARATH RUBISCO SUBUNIT BINDING-PROTEIN BETA SUBUNIT PRECURSOR (60 KD CHAPERON 235-419 ATU49357 ATU49357 NID: gl223909 - thale cress strain=ecotype Wassilewskija . 195-379 RUB1_BRANA RUBISCO SUBUNIT BINDING-PROTEIN ALPHA SUBUNIT (60 KD CHAPERONIN ALPHA 189-374 CH62_SYNY3 60 KD CHAPERONIN 2 (PROTEIN CPN60 2) (GROEL HOMOLOG 2) . - SYNECHOCYSTI 178- 362 RUBA_RICCO RUBISCO SUBUNIT BINDING-PROTEIN ALPHA SUBUNIT (60 KD CHAPERONIN ALPHA 190-375 CH60_ODOSI 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - ODONTELLA SINENSIS 236-420 PSU21105 PSU21105 NID: gll85389 - pea. 224-409 CH60_BRANA MITOCHONDRIAL CHAPERONIN CH60_BACSU 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - BACILLUS SUBTILIS. 191-375 CH60_AGRTU 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - AGROBACTERIUM TUME 191-375 b36917 heat shock protein GroEL - Agrobacterium tumefaciens
191-375 PAU17072 PAU17072 NID: g576778 - Pseudomonas aeruginosa. "191-375 CH60_RHILV 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - RHIZOBIUM LEGUMINO 187-373 CH61_STRCO 60 KD CHAPERONIN 1 (PROTEIN CPN60 1) (GROEL PROTEIN 1)(HSP58).- STRE 191-375 CH60_COXBU 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (HEAT SHOCK PROTEIN B 191-375 CH62_RHIME 60 KD CHAPERONIN B (PROTEIN CPN60 B) (GROEL PROTEIN B) . - RHIZOBIUM ME 191-375 PSEGROESL1 PSEGROESL NID: gl51241 - Pseudomonas aeruginosa (library: ATCC 27853) 189-372 CH61_SYNY3 60 KD CHAPERONIN 1 (PROTEIN CPN60 1) (GROEL HOMOLOG 1).- SYNECHOCYSTI 189-373 CH60_CLOTM 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (HSP-60) . - CLOSTRIDI 191-373 CH60_PSEPU 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN).- PSEUDOMONAS PUTIDA 190-373 CH60_SYNP7 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN).- SYNECHOCOCCUS SP. 190-374 CH60_GALSU 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN).- GALDIERIA SULPHURA 190-374 CH60_ZYMMO 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN). - ZYMOMONAS MOBILIS. 191-375 JC2564 heat shock protein groEL - Zyitiomonas mobilis
191-375 CH60_CHRVI 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - CHROMATIUM VINOSUM 189-373 CH60_MYCTU 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (65 KD ANTIGEN) (HEAT 191-375 CH60_NEIME 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (63 KD STRESS PROTEIN 189- 373 CH60_TREPA 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (TPN60) (TP4 ANTIGEN) 190-374 CH60_HELPY 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (HEAT SHOCK PROTEIN 6 191-375 CH60_NEIGO 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (63 KD STRESS PROTEIN 222-406 CH61_CUCMA MITOCHONDRIAL CHAPERONIN HSP60-1 PRECURSOR. - CUCURBITA MAXIMA (PUMPKI 189-373 CH60_MYCPA 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (65 KD ANTIGEN) (HEAT 230-414 MPU15989 MPU15989 NID:g559802 - Mycobacterium paratuberculosis . 224-408 S26582 chaperonin hsp60 - maize 191-375 S40247 heat-shock protein - Neisseria gonorrhoeae 189-373 CH60_CLOAB 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - CLOSTRIDIUM ACETOB 191-375 CH60_NEIFL 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (63 KD STRESS PROTEIN 190-373 CH60_LEGPN 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (58 KD COMMON ANTIGEN 222-406 CH62_CUCMA MITOCHONDRIAL CHAPERONIN HSP60-2 PRECURSOR. - CUCURBITA MAXIMA (PUMPKI 191-375 CHTGROESL1 CHTGROESL NID: g402332 - Chlamydia trachomatis DNA. 64-248 S40172 S40172 NID: g251679 - Chlamydia psittaci pigeon strain P-1041. 189- 373 SYOGROEL2 SYOGROEL2 NID:g562270 - Synechococcus vulcanus DNA. 191-375 CH60_CHLPS 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (57 KD CHLAMYDIAL HYP 188- 372 CH62_STRAL 60 KD CHAPERONIN 2 (PROTEIN CPN60 2) (GROEL PROTEIN 2) (HSP56) . - STRE 189-373 CH62_MYCLE 60 KD CHAPERONIN 2 (PROTEIN CPN60 2) (GROEL PROTEIN 2) (65 KD ANTIGEN) 236-420 MSGANTM MSGANTM NID: gl49923 - M.leprae DNA, clone Y3178. CPN60 PRECURSOR. - BRASSICA NAPUS (RAPE) . 105-289 PMSARG2 PMSARG2 NID: g607157 - Prochlorococcus marinus. 234-417 RUB2_BRANA RUBISCO SUBUNIT BINDING-PROTEIN ALPHA SUBUNIT PRECURSOR (60 KD CHAPERO 75-259 CRECPNIA CRECPNIA NID: g603910 - Chlamydomonas reinhardtii cDNA to mRNA. 215-400 P60_CRIGR MITOCHONIDRIAL MATRIX PROTEIN PI PRECURSOR (P60 LYMPHOCYTE PROTEIN) (CH224-408 CRECPN1B CRECPN1B NID: g603912 - Chlamydomonas reinhardtii cDNA to mRNA. 191-375 RUBA_WHEAT RUBISCO SUBUNIT BINDING-PROTEIN ALPHA SUBUNIT PRECURSOR (60 KD CHAPERO 189-373 B47292 heat shock protein groEL - Mycobacterium tuberculosis 206-391 CELHSP60CP CELHSP60CP NID: g533166 - Caenorhabditis elegans (strain CB1392) cDNA 215-400 P60_HUMAN MITOCHONDRIAL MATRIX PROTEIN PI PRECURSOR (P60 LYMPHOCYTE PROTEIN) (CH 215-400 P60_MOUSE MITOCHONDRIAL MATRIX PROTEIN PI PRECURSOR (P60 LYMPHOCYTE PROTEIN) (CH 215-400 P60_RAT MITOCHONDRIAL MATRIX PROTEIN PI PRECURSOR (P60 LYMPHOCYTE PROTEIN) (CH 215-400 A41931 chaperonin hsp60 - mouse
197-382 MMHSP60A MMHSP60A NID:g51451 - house mouse. 218- 402 CH63_HELVI 63 KD CHAPERONIN PRECURSOR (P63). - HELIOTHIS VIRESCENS (NOCTUID MOTH) 205-390 EGHSP60GN EGHSP60GN NID: gl217625 - Euglena gracilis. 222-407 HS60_SCHPO PROBABLE HEAT SHOCK PROTEIN 60 PRECURSOR. - SCHIZOSACCHAROMYCES POMBE 198-385 S61295 heat shock protein 60 - Trypanosoma cruzi
198-385 TRBMTHSP TRBMTHSP NID: g903883 - Mitochondrion Trypanosoma brucei (strain EATRO 8-69 ECOGROELA ECOGROELA NID: gl46268 - E.coli DNA, clone E. 142-325 ENHCPN60P ENHCPN60P NID: g675513 - Entaπioeba histolytica (strain HM-1:IMSS) DNA. 257-433 CH60_PLAFG MITOCHONDRIAL CHAPERONIN CPN60 PRECURSOR. - PLASMODIUM FALCIPARUM (ISO 1-90 CRECPN1C CRECPN1C NID: g603914 - Chlamydomonas reinhardtii cDNA to mRNA.
5-65 ATTS0779 ATTS0779 NID: gl7503 - thale cress. 189-373 CH60_MYCGE 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) . - MYCOPLASMA GENITAL 228-411 HTOHSP60X HTOHSP60X NID: g553068 - Histoplasma capsulatum (strain G217B) DNA. 190-297 CH60_SYNP6 60 KD CHAPERONIN (PROTEIN CPN60) (GROEL PROTEIN) (FRAGMENT) . - SYNECHO 169-245 RUBA_ARATH RUBISCO SUBUNIT BINDING-PROTEIN ALPHA SUBUNIT (60 KD CHAPERONIN ALPHA
The polypeptide may further comprise a polyamino acid sequence, preferably an N-terminal polyamino acid sequence although a C-terminal sequence may be present instead or in addition to an N-terminal sequence. The polyamino acid sequence may be selected from the same or different amino acid residues. When the same amino acid residue is repeated a particularly preferred polyamino acid sequence is a polyhistidine sequence.
Whether composed of the same or different amino acid residues, the further polyamino acid sequence may comprise any number of amino acid residues so long as chaperonin activity is provided. The other amino acid residues which may be included in the further polyamino acid sequence could be selected from any of the twenty or so essential amino acids common to biological systems or of any amino acid variant or derivative. When the polyamino acid sequence is other than a homopolymer then it may comprise a repeated sequence of two or more amino acid residues. The amino acid sequence may encode a portion of another known protein or polypeptide or it may even be random.
The further polyamino acid sequence preferably also includes a cleavage site cleavable by a cleavage agent. A preferred cleavage agent is thrombin although any other suitable agent will suffice. The sequence of amino acid residues is of course selected to permit cleavage by the desired cleavage agent.
The further polyamino acid sequence preferably comprises 17 to 39 amino acids although more than 39 and less than 17 amino acids may also be employed. The polypeptide may be attached to a support, preferably in immobilised form, optionally immobilised to a chromatographic matrix, more preferably an agarose resin. When an agarose resin is used it is preferably a nickel- nitrilo-tri-acetic acid (NTA) -ligated agarose resin. This has affinity for a polypeptide having a polyhistidine tail.
The polypeptide of the invention may be obtained by recombinant means. Alternatively, the polypeptide may be produced by a routine chemical synthesis using standard polypeptide synthesis procedures known in the art. If produced by recombinant means the polypeptide may be fused to a heterologous protein or polypeptide.
The inventors have found that the fragments with polyhistidine tails of 17 amino acid residues sht- GroEL191-345 and sht-GroEL191-376 as well as GroEL191-345 are poteint facilitators of the folding of inactive forms of cyclophilin A and rhodanese, and catalyse the unfolding of barnase. Maximal refolding yield was obtained at stoichiometric concentrations of cyclophilin A and apical domain, indicating the formation of a 1:1 complex between chaperone fragment and substrate protein during refolding. The fragments sht-GroEL191-345 and sht-GroEL191-376 when attached to nickel-nitriol-tri- acetic acid (NTA)- ligated agarose resin are also active in enhancing the folding yield of destabilised mutants of indole glycerol-phosphate synthase and of the protein cyclophilin A. This proves the monomer has refolding activity and also demonstrates that the immobilised protein is functionally useful. Fragments of GroEL may be produced either from recombinant DNA methods or from protein chemistry or any means of chemical synthesis, or the equivalent fragments of homologous hsp60 (cpn60) proteins or any natural variants or any variants produced by mutagenesis, or of larger fragments containing the sequences 191-376, 191- 345 of GroEL from E. coli .
In tenth aspect the invention provides a nucleic acid molecule comprising a nucleotide sequence encoding a polypeptide of the invention, or a nucleotide sequence hybridisable thereto and optionally encoding a polypeptide having chaperone activity.
Preferred characteristics of the nucleotide sequence correspond to the preferred features of the polypeptides of the aspects of the invention as hereinbefore defined.
This aspect of the invention therefore includes a recombinant DNA molecule for use in cloning and/or expressing a DNA sequence, said recombinant DNA molecule comprising:
(a) a nucleotide sequence encoding amino acid residues 191-376 of GroEL,
(b) a nucleotide sequence encoding amino acid residues 191-345 of GroEL,
(c) a nucleotide sequence encoding amino acid residues 193- 337,
(d) a nucleotide sequence encoding amino acid residues of GroEL selected from amino acid residues: (i) 191-329, 191-330, 191-331, 191-332, 191-333, 191- 334, 191-335, 191-336, 191-337, 191-338, 191-339, 191-340, 191-341, 191-342, 191-343, 191-344, 191- 345, 191-346, 191-347, 191-348, 191-349, 191-350, 191-351, 191-352, 191-353, 191-354, 191-355, 191- 356, 191-357, 191-358, 191-359, 191-360, 191-361, 191-362, 191-363, 191-364, 191-365, 191-366, 191- 367, 191-368, 191-369, 191-370, 191-371, 191-372, 191-373, 191-374, 191-375 or 191-376, or
(ii) 192-329, 192-330, 192-331, 192-332, 192-333, 192- 334, 192-335, 192-336, 192-337, 192-338, 192-339, 192-340, 192-341, 192-342, 192-343, 192-344, 192- 345, 192-346, 192-347, 192-348, 192-349, 192-350, 192-351, 192-352, 192-353, 192-354, 192-355, 192- 356, 192-357, 192-358, 192-359, 192-360, 192-361, 192-362, 192-363, 192-364, 192-365, 192-366, 192- 367, 192-368, 192-369, 192-370, 192-371, 192-372, 192-373, 192-374, 192-375 or 192-376, or
(iii) 193-329, 193-330, 193-331, 193-332, 193-333, 193- 334, 193-335, 193-336, 193-337, 193-338, 193-339, 193-340, 193-341, 193-342, 193-343, 193-344, 193- 345, 193-346, 193-347, 193-348, 193-349, 193-350, 193-351, 193-352, 193-353, 193-354, 193-355, 193- 356, 193-357, 193-358, 193-359, 193-360, 193-361, 193-362, 193-363, 193-364, 193-365, 193-366, 193- 367, 193-368, 193-369, 193-370, 193-371, 193-372, 193-373, 193-374, 193-375 or 193-376.
(e) 230-271, 229-271, 229-272, 228-272, 228-273, . et seq. . . 194-328, 194-329, or (f) a nucleotide sequence hybridisable to any of (a),
(b) , (c) , (d) or (e) above and encoding a monomeric polypeptide having chaperone activity, or
(g) degenerate nucleotide sequences corresponding to (a), (b) , (c), (d) , (e) or (f) above.
Further features of the nucleic acid sequence may be as described hereinbefore in relation to the polypeptides of the invention; the nucleic acid sequence having the appropriate nucleotide sequence which on expression provides a polypeptide which possesses these features.
In eleventh aspect the invention provides a vector comprising a nucleic acid as hereinbefore defined.
In twelfth aspect the invention provides a host cell transformed with a vector or a nucleic acid molecule as hereinbefore defined.
In thirteenth aspect there is provided a method of making a polypeptide of the invention comprising transforming a host cell with a nucleic acid encoding said polypeptide, culturing the transformed cell for and expressing said polypeptide. Expression may be direct or as a fusion product. When the polypeptide product is expressed as a fustion then the method preferably includes a step wherein the expressed polypeptide product is subject to cleavage. The polypeptide may be recovered from the transformed cells expressing it.
In fourteenth aspect the invention provides a pharmaceutical formulation comprising a polypeptide of the invention, optionally together with a diluent, carrier or excipient.
The active ingredients of a pharmaceutical composition comprising the polypeptide are contemplated to exhibit excellent therapeutic activity, for example, in the alleviation of Alzheimer's disease when administered in amount which depends on the particular case. Dosage regima may be adjusted to provide the optimum therapeutic response. For example, several divided doses may be administered daily or the dose may be proportionally reduced as indicated by the exigencies of the therapeutic situation.
The active compound may be administered in a convenient manner such as by the oral, intravenous (where water soluble) , intramuscular, subcutaneous, intranasal, intradermal or suppository routes or implanting (eg using slow release molecules) . Depending on the route of administration, the active ingredient may be required to be coated in a material to protect said ingredients from the action of enzymes, acids and other natural conditions which may inactivate said ingredient.
In order to administer the polypeptide by other than parenteral administration, it will be coated by, or administered with, a material to prevent its inactivation. For example, the polypeptide may be administered in an adjuvant, co-administered with enzyme inhibitors or in liposomes. Adjuvant is used in its broadest sense and includes any immune stimulating compound such as interferon. Adjuvants contemplated herein include resorcinols, non-ionic surfactants such as polyoxyethylene oleyl ether and n-hexadecyl polyethylene ether. Enzyme inhibitors include pancreatic trypsin.
Liposomes include water-in-oil-in-water CGF emulsions as well as conventional liposomes.
The active compound may also be administered parenterally or intraperitoneally. Dispersions can also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of storage and use, these preparations contain a preservative to prevent the growth of microorganisms.
The pharmaceutical forms suitable for injectable use include sterile aqueous solutions (where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. In all cases the form must be sterile and must be fluid to the extent that easy syringability exists. It must be stable under the conditions of' manufacture and storage and must be preserved against the contaminating action of microorganisms such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyetheylene gloycol, and the like) , suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of superfactants .
The prevention of the action of microorganisms can be brought about by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thirmerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the injectable compositions can be brought about by the use in the compositions of agents delaying absorption, for example, aluminum monostearate and gelatin.
Sterile injectable solutions are prepared by incorporating the active compound in the required amount in the appropriate solvent with various of the other ingredients enumerated above, as required, followed by filtered sterilisation. Generally, dispersions are prepared by incorporating the sterilised active ingredient into a sterile vehicle which contains the basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of preparation are vacuum drying and the freeze-drying technique which yield a powder of the active ingredient plus any additional desired ingredient from previously sterile-filtered solution thereof.
When the polypeptide is suitably protected as described above, it may be orally administered, for example, with an inert diluent or with an assimilable edible carrier, or it may be enclosed in hard or soft shell gelatin capsules, or it may be compressed into tablets, or it may be incorporated directly with the food of the diet. For oral therapeutic administration, the active compound may be incorporated with excipients and used in the form of ingestible tablets, buccal tablets, troches, capsules. elixirs, suspensions, syrups, wafers, and the like. The amount of active compound in such therapeutically useful compositions in such that a suitable dosage will be obtained.
The tablets, troches, pills, capsules and the like may also contain the following: a binder such as gum tragacanth, acacia, corn starch or gelatin; excipients such as dicalcium phosphate; a disintegrating agent such as corn starch, potato starch, alginic acid and the like; a lubricant such as magnesium stearate; and a sweetening agent such as sucrose, lactose or saccharin may be added or a flavouring agent such as peppermint, oil of wintergreen, or cherry flavouring. When the dosage unit form is a capsule, it may contain, in addition to materials of the above type, a liquid carrier.
Various other materials may be present as coatings or to otherwise modify the physical form of the dosage unit. For instance, tablets, pills, or capsules may be coated with shellac, sugar or both. A syrup or elixir may contain the active compound, sucrose as -a sweetening agent, methyl and propylparabens as preservatives, a dye and flavouring such as cherry or orange flavour. Of course, any material used in preparing any dosage unit form should be pharmaceutically pure and substantially non-toxic in the amounts employed. In addition, the active compound may be incorporated into sustained- release preparations and formulations.
As used herein "pharmaceutically acceptable carrier and/or diluent" includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents and the like. The use of such media and agents for pharmaceutical active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active ingredient, use thereof in the therapeutic compositions is contemplated.
Supplementary active ingredients can also be incorporated into the compositions.
It is especially advantageous to formulate parenteral compositions in dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete units suited as unitary dosages for the mammalian subjects to be treated; each unit containing a predetermined quantity of active material calculated to produce the desired therapeutic effect in association with the required pharmaceutical carrier. The specification for the novel dosage unit forms of the invention are dictated by and directly dependent on (a) the unique characteristics of the active material and the particular therapeutic effect to be achieved, and (b) the limitations inherent in the art of compounding such as active material for the treatment of disease in living subjects having a diseased condition in which bodily health is impaired.
The principal active ingredient is compounded for convenient and effective administration in effective amounts with a suitable pharmaceutically acceptable carrier in dosage unit form. In the case of compositions containing supplementary active ingredients, the dosages are determined by reference to the usual dose and manner of administration of the said ingredients. In ninth aspect there is provided polypeptide of the invention as hereinbefore defined for use in the treatment of disease. Consequently there is provided the use of a polypeptide of the invention for the manufacture of a medicament for the treatment of disease associated with aberrant protein/polypeptide structure. The aberrant nature of the protein/polypeptide may be due to misfolding or unfolding which in turn may be due to an anomalous eg mutated amino acid sequence. The protein/polypeptide may be destablished or deposited as plaques eg as in Alzheimer's disease. The disease might be caused by a prion. A polypeptide-based medicament of the invention would act to renature or resolubilise aberrant, defetive or deposited proteins.
In fifteenth aspect there is provided a nucleic acid molecule in accordance with other aspects of the invention for use in the treatment of disease. Consequently, there is provided the use of a nucleic acid molecule of the invention for the manufacture of a medicament for the treatment of disease associated with protein/polypeptide structure. Genetic therapy in vivo is therefore provided for by way of introduction and expression of DNA encoding the monomeric polypeptide in cells/tissues of an individual to provide chaperonin activity in those cells/tissues.
In sixteenth aspect there is provided the use of a polypeptide of the invention for altering the structure of a molecule, particularly a protein or polypeptide and the alteration in structure may be by folding, unfolding or resolubilsing. Preferably, the stoichiometry between the monomeric polypeptide of the invention and the molecule being altered is about 1:1, preferably 1:1. In seventeenth aspect there is provided a method of reconditioning a molecule preferably a protein comprising contacting said protein with a polypeptide of the invention. Preferably the protein is subjected to inactivation or denaturation prior to contacting with said polypeptide. The polypeptide may be immobilised to a solid phase, preferably a chromatographic matrix, and the contacting of protein and polypeptide is carried out by applying the protein to the top of a bed of the matrix packed in a column and then eluting the polypeptide through the column.
This aspect of the invention therefore provides for the use of a polypeptide as hereinbefore defined for altering the structure of a molecule, preferably a protein or polypeptide and the alteration in structure is by folding, unfolding or refolding. The stoichiometry between the polypeptide and the molecule being altered may be about 1:1.
In nineteenth aspect the invention provides for the use of a polypeptide of the invention the purification or increase in yield, specific activity or quality of biological molecules, preferably said polypeptide being attached to a support.
In twentieth aspect the invention provides a kit for reconditioning or refolding a molecule, preferably a protein, comprising a polypeptide as hereinbefore defined immobilised to a solid phase and a container for holding said solid phase polypeptide.
In twenty-first aspect the invention provides for the use of a polypeptide as hereinbefore defined in the production of a protein or polypeptide by recombinant means, wherein the said polypeptide is co-expressed with the protein or polypeptide thereby to improve the yield or quality of the protein or polypeptide.
In twenty-second aspect the invention provides an antibody reactive against a polypeptide of the invention. The invention also provides for the use of the antibody in the treatment of disease. Also, the invention provides for the use of the antibody in the manufacture of a medicament for the treatment of disease associated with protein/polypeptide structure.
In twenty-third aspect the invention provides various methods of treating disease, namely a method of treating disease in which an effective amount of a polypeptide of the invention is administered. There is also a method of treating disease which comprises administering an effective amount of an inhibitor of the chaperone activity of a polypeptide of the invention. Preferably, said inhibitor is an antibody. Also, there is a method of treating disease by gene therapy which utilises a construct encoding a polypeptide of the invention or an antagonist thereof.
The momomeric apical domain of GroEL or its fragments catalyse amide proton exchange of barnase, and facilitate refolding of rhodanese and cyclophilin A in the absence of cofactors. Thus, GroEL has an intrinsic chaperone activity, which is not restricted to its oligomeric state or central cavity. The C-terminal α-helices of the apical domain are not directly involved in polypeptide binding, and form a separate folding unit. At physiological temperature, about 50% of the C-terminus is physiological temperature, about 50% of the C-terminus is unfolded, allowing high mobility movements of the polypeptide binding "domain core". Flexibility within the apical domain may be crucial for cooperative binding of a wide range of proteins with different native structure, and for the conformation change of GroEL on binding to the co-chaperonin GroES. The crystal structure of the "domain core" of the apical domain has an overall B-factor 60% lower than that of the same region in intact GroEL. The overall fold of the fragment is similar to the corresponding region of intact GroEL, but the amount of secondary structure is considerably larger. There are three 310-helices in the structure of the fragment, which are involved in binding of polypeptide and/or GroES.
The inventors have expressed the apical domain of GroEL as a stable monomeric protein and with high yield in E. coli, allowing its activity, folding, and structure to be studied separately from those of the equatorial and intermediate domains. The isolated, apical domain is functional in polypeptide binding (Figure 2), and it facilitates protein folding even when truncated to remove its C-terminal α-helices, Hll and H12 (Figure 3) . The apical domain slows down spontaneous refolding of rhodanese and cyclophilin A by >15-fold and >150-fold, respectively (Figures 3c-e) . The inventors have shown increased refolding yield of rhodanese and cyclophilin A in the presence of apical domain (Figure 3), which is too small to reassociate and to form a cavity, clearly demonstrates the presence of an intrinsic chaperone activity in GroEL, which is independent of a central cavity. This is in agreement with NMR experiments on barnase in the presence of apical domain (Figure 2) . The presence of micromolar concentrations of GroEL considerably broadens the resonance lines of barnase owing to a fast association and idssociation of native barnase from the slowly rotational tumbing GroEL (Zahn et al (1996) Science 271: 642-645) . If the apical domain and barnase would form a complex of large molecular size, i.e. containing seven or 14 molecules of apical domain, one would expect a considerable degree of line broadening in the NMR spectra of barnase in the presence of apical domain. However, this is not what was observed, even not at eight-fold higher monomer concentration of apical domain than of GroEL (Zahn et al (1996) Science 271: 642- 645) . The complex between apical domain and barnase, therefore, rather seems to be of low molecular weight and probably has a 1:1 stoichiometry, which would be consistent with the binding stoichiometry of the apical domain and cyclophilin A (Figure 3e) .
The intrinsic chaperone activity of monomeric apical domain facilitates refolding of rhodanese even though physiological conditions, GroES, and ATP are required in the presence of intact GroEL. Although the inventors do not wish to be bound by any particular hypothesis the role of GroES appears to be to weaken the affinity of GroEL for substrates, and to prevent premature dissociation of aggregation prone states: there has to be a balance between tight binding for the annealing activity and weaker binding to allow folding (Corrales F J and Fersht A R (1996) Proc Natl Acad Sci USA 93_ 4509- 4512) . The weaker binding of rhodanese to the fragment must be adequate for chaperoning activity and weak enough for folding. This is in agreement with the observed similar rate constant for refolding of rhodanese in the presence of apical domain and in the presence of GroEL, GroES, and ATP (Figures 3b-d) , indicating a similar binding affinity. The complex structure of GroEL, and the modulation of its substrate affinity by GorES and nucleotides, must be to allow the efficient folding of a wide range of proteins, with a wide variation of affinities for GroEL. The mechanism, in general, may involve several components: GroES binds to the cis-end of a GroEL-substrate complex and displaces the bound polypeptide into the cavity; the dissociated and encapsulated polypeptide chain then refolds in an enlarged folding cavity to a native-like state, until it is released upon the dissociation of GroES; and, the cycle of binding and dissociation of unfolded protein, GroES, ATP, and GroEL may be repeated several times, before the substrate protein can adopt a native conformation (Hunt et al (1996) Nature 379: 37-45; Weissman et al (1996) Cell 8_4: 481-490; Weissman J S et al (1995) Cell 33_: 577-587; Mayhew M et al (1996) Nature 379: 420-426; Corrales F J and Fersht A R (1996) Proc Natl Acad Sci USA 92_ 4509-4512) . Polypeptides with intermediate affinity for GroEL, such as cyclophilin A (Figure 3e) , bind transiently to GroEL and folding is facilitated without the need for co-factors.
Helices Hll and H12 are much less stable than the "domain core" of the apical domain, and form a separate folding unit (Figure 4) . At physiological temperature, about 50% of the secondary structure of the low-melting helices is unfolded and thus becomes flexible. But, in vivo, the C- terminus is not degraded, implying that the primary structure is not accessible to degradation by E. coli proteases. The low stability of the C-terminal helices allows movements and larger fluctuations of the "domain cores" within the GroEL-ring. This flexible arrangement of the apical domain may be implicated in the cooperative binding of unfolded polypeptide to the seven subunits of GroEL in each ring. It is understandable that flexibility is required for binding to a large variety of proteins with different amino acid sequence, and secondary and tertiary structure. The observed low stability of the C- terminus may also contribute to the large conformational chance in GroEL on binding of GroES, which has been suggested to occur through rigid body movement through the β-sheet hinge region of the intermediate domain (Braig K et al (1994) Nature 37JL: 578-586) .
The production and crystallisation of monomeric fragments of the apical domain by the inventors has allowed a determination of the three-dimensional structure of the polypeptide binding part of GroEL with JB-factors much lower than those of the equivalent regions in GroEL (Figure 5) (Braig K et al (1994) Nature 3TI : 578-586; Braig et al (1995) Nature Struct Biol 2 : 1083-1094; Boisvert et al (1996) Nature Structure Biology .3: 170-
177), perhaps because of favourable packing interactions within the crystal, or the absence of the flexible C- termini in the GroEL fragment, or a combination of both. Particularly interesting is the existence of 3ι0-helices within the apical domain. These tightly packed helices are much less common than α-helices, and the majority are very short, with 96% being less than four residues. They are usually found at the end of α-helices, where the final turn may have this conformation, or between two β- strands. All the 3ι0-helices of the apical domain are localised within regions, which have been shown to be involved in polypeptide and/or GroES binding (Fenton et al (1994) Nature 371: 614-619). The third 3ι0-helix has an unusual length of 10 residues. A 3ι0-helix with a similar length (9 residues) has been found in aconitase (Robbins et al (1989) Proteins: Structure, Function and Genetics 5_: 289-312), an enzyme of the Calvin cycle. Interestingly, the apical domain and aconitase also share a similar fold: a central β-sheet linked by α-helices, suggesting that they may use a common mechanism for substrate binding.
Preferred embodiments of the invention will now be described in detail by way of the following examples and with reference to the accompanying drawings in which:
Figure 1 shows the amino acid sequence of the apical region of GroEL from E. coli including further amino acid residues providing an N-terminal histidine-tail (ht) .
Figure 2a is a bar chart showing the catalytic effect of GroEL 191-345 on the amide proton exchange of unfolded barnase. The ratio of the protection factors of the 39 measurable amide protons of barnase in the absence (P) and presence (P+(G)) of the GroEL fragment is shown.
Figure 2b is a bar chart similar to that of Figure 2a but for the GroEL fragment 191-345 with a 17 residue N- terminal histidine tail (sht-GroEL 191-345) .
Figure 2c is a bar chart similar to that of Figure 2a but for the GroEL fragment 191-376 with a 17 residue N- terminal histidine tail (sht-GroEL 191-376) .
Figure 2d is a plot of observed exchange rate constants in the absence of chaperonin (k^ ) against those in the presence of chaperonin (/c s(-ϊ) ) for GroEL fragment 191- 345. Global, mixed and local amide protons are displayed by circles, triangles and squares respectively.
Figure 2e is a plot similar to that of Figure 2d but for sht-GroEL 191-345.
Figure 2f is a plot similar to that of Figure 2d but for sht-GroEL 191-376.
Figure 3a is a bar chart and table showing the relative enzymatic activity of rhodanese after refolding in the presence or absence of GroEL, GroES, ATP, sht-GroEL 191- 345, sht-GroEL 191-376 or bovine serum albumin (BSA) .
Figure 3b is a plot showing the refolding kinetics of rhodanese in the presence of GroEL, GroES and ATP.
Figure 3c is a plot showing the refolding kinetics of rhodanese in the presence of various concentrations of sht-GroEL 191-345.
Figure 3d is a plot showing the refolding kinetics of rhodanese in the presence of various concentrations of sht-GroEL 191-376.
Figure 3e is a plot comparing the refolding kinetics of specified concentrations of sht-GroEL 191-376, sht-GroEL 191-345 and GroEL.
Figure 4a is a trace of the thermal denaturation of sht- GroEL 191-376 (upper trace) and sht-GroEL 191-345 (lower trace) monitored by far ultra violet-circular dichroism at 222nm. Figure 4b is a trace as in Figure 4a but thermal denaturation is monitored by differential scanning calorimetry.
Figure 5a shows a three dimensional representation of the structure of sht-GroEL 191-345.
Figure 5b shows a three dimensional representation of the backbone structure of sht-GroEL 191-345.
Figure 5c shows a three dimensional representation of election densities of sht-GroEL 191-345 viewed along the helices H8 and H9.
Figure 6 is a trace showing the elution of denatured mutant IGPS from a column of immobilised GroEL 191-345 developed with refolding buffer.
Figure 7 shows the amino acid sequence of GroEL from E. coli numbered from the N-terminal methionine.
Figure 8 shows the amino acid sequence of a portion of GroEL wherein the residues involved in binding peptides are highlighted.
Figures 9a-c comprise partial amino acid sequences of known cpn 60 family members aligned with one another for comparison.
Example 1 - Cloning and expression of the apical domain of GroEL and various of its fragments The apical domain of GroEL (GroEL 191-376) and various C- terminally truncated fragments of the apical domain were cloned by polymerase chain reaction (PCR) into the polylinker site of a pRSET A vector (Invitrogen), coding for an N-terminal histidine-tail, which contained an engineered thrombin cleavage site. The histidine-tail was composed of 36 amino acids (Invitrogen) or 17 amino acids. For the PCR reaction, the plasmid pOF39 (Fayet 0 et al (1989) J Bacteriol 171: 1379-1385) was used as a template together with two primers flanking the respective GroEL fragment with BamHl and CeoRI restriction enzyme sites. This permitted the cloning of the PCR fragment into the polylinker of pRSET A. PCR was performed with Pfu (Stratagene) to reduce the risk of undesired random mutations. The reaction was performed in a volume of 25 μl for 25 cycles with 400 nM or primer and 200 μM of deoxynucleoside-5'-triphosphates . The annealing temperature was 65°C. The following primers were used for the PCT to generate DNA encoding the apical region of GroEL and a variety of fragments thereof: 5' flanking: 5' -CGG ATC CGA AGG TAT GCA GTT CGA CCG; 3' flanking (s) ht-GroEL191-376, 5 '-CGA ATT CTT AAA CGC CGC CTG CCA GTT TCG; 3' flanking (s) ht-GroEL191-345, 5'-CGA ATT CTT AAC GGC CCT GGA TTG CAG CTT C; 3' flanking ht- GroEL191-337, 5' -CGA ATT CTT AAC CCA CGC CAT CGA TGA TAG TGG TG; 3' flanking ht-GroEL191-328, 5' -CGA ATT CTT AGT CTT TGT TGA TCA CAA CAC GTT TAG CCT GAC; 3' flanking ht- GroEL191-322, 5' -CGA ATT CTT AAC GTT TAG CCT GAC CCA GGT CTT CCA; 3' flanking ht-GroEL191-298, 5* -CGA ATT CTT AAC CGC CAG TCA GGG TTG CGA TAT C.
The following protocol was used for expression and purification of the fragments of GroEL in E. coli TG2 cells. With reference to Figure 1, the expression vector used coded for an N-terminal histidine-tail (ht) composed of 36 amino acids and containing a thrombin cleavage site (vertical arrow) . Alternatively, a shorter version of this histidine-tail (sht) containing 17 amino acids was used. The N- and C-terminal ends of the generated fusion proteins, namely ht-GroEL 191-298, ht-GroEL 191-322, ht- GroEL 191-328, ht-GroEL 191-337, ht-GroEL 191-345, ht- GroEL 191-376, sht-GroEL 191-345, and sht-GroEL 191-376 are indicated by rectangular arrows.
Two litres of L-Broth medium plus ampicillin was inoculated 1:100 with an over-night culture of TG2 cells containing the respective plasmid. At an Aβoo of 0.3, expression was induced with isopropyl-1-thio-β-D- galactopyranoside to 0.2 mM final concentration and
M13/T7-phage at a multiplicity of infection of 10 phages per cell. The cells were harvested 8 h after induction, centrifuged, and resuspended in 200 ml buffer A (50 mM Tris-HCl pH 8.2, 300 mM NaCl).
After sonication and centrifugation, the soluble protein fraction was added to 20 ml nickel-NTA agarose resin and stirred for 10 min. The resin was washed with 200 ml buffer A, and the histidine-tail containing fusion protein was eluted with 50 ml buffer A containing 200 mM imidazole. The eluted protein was precipitated with 80% ammonium sulfate, re-dissolved in 4 ml of buffer B (50 mM Tris-HCl pH 8.2, 150 mM sodium chloride), and was loaded on a HiLoad 26/60 Superdex 75 column (Pharmacia) , which was equilibrated with buffer B. The fragment GroEL 191- 345 was produced by thrombin-cleavage of ht-GroEL 191-376 before gel filtration. The cleaving reaction was carried out for several days in buffer B after addition of 250 μl thrombin (Sigma, lU/μl) to the protein solution eluted from the nickel-NTA column. The GroEL fragments were analyzed by quantitative amino acid analysis, N-terminal sequencing, and mass spectrometry.
In Figure 1, secondary structure is indicated by boxes and arrows for α-helix (grey) or 310-helix (white) and β- sheet structure, respectively. Assignment of secondary structure of residues 191 to 336 was done on the basis of the crystal structure of sht-GroEL 191-345 (Table 1 and Fig 5) using PROCHECK (Laskowski R A et al (1993) J Appl Cryst 2_6: 283-291) and the algorithm of Kabsch & Cander (Kabsch W and Sander C (1983) Biopolymers 22: 2577-2637) .
Numbering of α-helices and secondary assignment of residues 337 to 376 according to Braig et al (Braig K et al (1995) Nature Struct Biol 2: 1083-1094) .
Functional apical domain of GroEL and various functional fragments of the apical domain in E. coli were expressed allowing the study of chaperone activity, folding, and crystal structure of the polypeptide binding part of
GroEL. In particular, the apical domain of GroEL (GroEL 191-376) and various fragments of the apical domain (Fig 1) were expressed in E. coli as fusion proteins containing an N-terminal histidine-tail, and this allowed for a straightforward purification using a nickel- nitrilo-tri-acetic acid (NTA) -ligated agarose resin. The histidine-tail of either 39 (ht) or 17 amino acids (sht) contained a sequence of six histidine residues and a thrombin cleavage site. The apical domain was expressed at >20 mg purified protein per litre of culture as were the smaller fragments lacking the C-terminal α-helices, Hll and H12. Further truncation before residue 329 led to considerable destabilisation of the apical domain. The fragment GroEL 191-345, containing no histidine-tail and without the two helices was generated by thrombin cleavage of purified ht-GroEL 191-376.
The apical region and monomeric fragments were found to have chaperone activity. The circular dichroism (CD) of sht-GroEL 191-376, sht-GroEL 191-345, and GroEL 191-345 in the far ultra violet (UV) region and in the near UV region indicated native-like secondary and tertiary structure, respectively. The apical domain and the fragment truncated at position 345 are monomeric at micromolar concentrations, when determined by ultracentrifugation. However, the line widths in the nuclear magnetic resonance (NMR) spectra of GroEL 191-345 are larger than expected for a 17 kD protein but small than for a stable dimer (data not shown) , indicating a fast intermolecular interaction between monomers on the NMR time scale (Wϋthrich, K, NMR of Proteins and Nucleic Acids (Wiley, New York, 1986) . There seems to be a low affinity self-recognition of the apical domain.
Example 2 - Binding of GroEL 191-345 , sht-GroEL 191-345, and sht-GroEL 191-376 to unfolded barnase
Exchange experiments were carried out at 33°C in 20 mM imidazole buffer as described in Zahn et al (1996) Science 271: 642-645. The concentration of barnase was 2.4 mM. The results are shown in Figures 2a, 2b and 2c; locally and globally exchanging amide protons, and amide protons exchanging by a mixture of both mechanisms
(Perrett et al (1995) Biochemistry 3_4: 9288-9298 and Clarke et al (1993) Proc Natl Acad Sci USA ^0: 9837-9841) are indicated by white, black, and grey bars. respectively. The indole NH proton of Trp71 is indicated by 71s.
The same experiments are shown in Figures 2d, 2e and 2f respectively. The data in Figure 2d is fitted to λ°s(-G) = Ck^ where C is a factor by which EX2 mechanism exchange is catalysed by GroEL fragment. The values of C for the globally, locally, and mixed amide protons of barnase are 10, 5, and 1. The data in Figures 2e and 2f is fitted to / CAs(v-rG)7 = Ck°CAs+ k°C}A>s (G.υ) , where /<C*AS(G.U) is the observed EX1 rate constant for global exchange of barnase in the presence of GroEL fragment. Figures 2d, 2e and 2f serve to demonstrate a mechanism of amide proton exchange of barnase. In Fig 2a and 2d there is only an estimate of the final concentration of GroEL fragment. At the high initial protein concentration used, GroEL 191-345 tended to crystallise during the exchange experiment.
Figures 2a , 2b and 2c show that the GroEL fragments catalyse, under folding conditions, exchange of amide protons of native barnase that are known to exchange only from its fully unfolded state (Perrett et al (1995) Biochemistry 3_4: 9288-9298) . Thus, like intact GroEL (Zahn et al (1996) Science 271: 642-645), the apical domain binds with high affinity to unfolded barnase, and helices Hll and H12 are not essential for polypeptide binding (Figures 2a and 2b) . The presence of the N- terminal histidine-tail did not abolish binding activity (Figures 2b and 2c) .
The mechanism of exchange at pD 6.6 is EX2 (Figure 2d), but changes to EX1 at higher pD (Figures 2e and 2f) . In an EX2 mechanism (Hvidt A and Nielsen S 0 (1966) Advan Protein Chem 2_1: 287-386) , the observed rate constant is limited by the intrinsic rate constant for exchange, which depends on pD. The criterion for an EX2 mechanism is that the rate constant for reprotection (ie a refolding or dissociation reaction) is much greater than the intrinsic rate constant. At the other extreme, when intrinsic exchange is much faster than reprotection, the mechanism becomes EX1 (Hvidt A and Nielsen S 0 (1966) supra) and the observed rate constant is simply equal to the rate constant for the formation of the unprotected state. Thus, from the exchange behaviour (and assuming that the presence of the histidine-tail does not effect binding) the rate constant for dissociation of barnase from the apical domain is about 2 s-1, which is less than five-fold larger than that from intact GroEL (Zahn et al (1996) supra) .
Example 3 - Refolding of rhodanese
Rhodanese refolding assays were performed using GroEL, GroES, ATP, sht-GroEL 191-345 and sht-GroEL 191-376 and carried out as described by Horowitz (Horwitz P M in Protein Stability and Folding (eds Shirley B A) 361-368 (Humana Press, 1995) ) .
In more detail, rhodanese (9 μM) was unfolded for 45 min at 25°C in the presence of 8 M urea and 1 mM β-mercaptoethanol. Refolding was initiated by diluting 3 μl of the unfolded rhodanese into a final volume of 250 μl of a standard buffer containing 50 mM Tris-HCl pH 7.8, 50 mM sodium thiosulphate, 10 mM MgCl2, 10 mM KCl. GroEL, GroES, ATP, apical domain (or its C-truncated form) and bovine serum albumin (BSA) were included as indicated, at final concentrations of 2.5 μM monomer, 2.5 μM monomer. 2mM, 2.5 μM and 45 μg/ml (the same concentration by weights as the GroEL fragments) , respectively. After incubation for 50 min at 25°C, rhodanese activity was measured by adding 25 μl from the refolding mixture to 1 ml of 50 mM Na2S203 50 mM KCN and 40 mM potassium phosphate buffer pH 8.6. The reaction was terminated after 15 min incubation by addition of 0.5 ml of 18% formaldehyde. Colour was developed by mixing with 1.5 ml of ferric nitrate reagent (400 g FeN03.9H2); 800 ml 65% HN03 in a final volume of 3 dm3) prepared as indicated (Perrett et al (1995) supra) .
Figure 3a shows the relative enzymatic activity of rhodanese (0.1 μM) after refolding in the presence (+) or absence (-) of GroEL (2.5 μM monomer), GroES (2.5 μM monomer), ATP (2 mM) , sht-GroEL 191-345 (2.5 μM) , sht- GroEL 191-376 (2.5 μM) , or bovine serum albumin (BSA; 45 μg/ml) , from 8 M urea (U) . 100% activity was obtained with native rhodanese (N) alone.
Figure 3b shows the refolding kinetics of rhodanese in presence GroEL, GroES, and ATP (•) . The final concentrations are the same as in Figure 2a. 100% activity was obtained with native rhodanese (0) .
Figures 3c and 3d show the refolding kinetics of rhodanese in the presence of 0.18 μM (D) , 2.5 μM (•) , or 5 μM (0) sht-GroEL 191-345 and sht-GroEL 191-376, respectively.
Figure 3a shows a rhodanese refolding activity of about 42.5% for shtGroEL 191-345 which is about 37.5% above background refolding as shown by the control of unfolded rhodanese (U) alone. The results of Figure 3b are for comparitive purposes and show a time course of refolding activity for GroEL, GroES and ATP.
Figure 3c shows the rhodanese refolding kinetics at various concentrations of sht-GroEL 191-345. Refolding activities of about 50% are achieved at about 25 mins.
Figure 3d is similar to Figure 3c but for sht-GroEL 191- 376 and showing refolding activites of about 40% after 25 mins.
Enzymatic activity was obtained from the absorbance at 460 nm of the complex formed between thiocyanate and ferric ion. Results correspond to the average of three different independent assays. Standard error bars are shown. b-d, same as in a, but, to stop the refolding reaction rhodanese activity was assayed in the presence of 10 mM trans-1, 2-cyclohexanediaminetetraacetate (CDTA) to inhibit GroEL activity or 0.5 mg/ml of casein to saturate the apical domain.
Maximal refolding yield is obtained at molar ratios of apical domain and rhodanese of larger than one, from which is estimated a dissociation equilibrium constant of >lxl0"7 M.
Example 4 - Refolding of cyclophilin A
The chaperone activity of sht-GroEL 191-345 and sht-GroEL 191-376 was tested using cyclophilin A. Refolding of cyclophilin A was initiated by diluting 8 M urea denatured protein (100 μM) into 100 mM potassium phosphate buffer pH 7.0, 10 mM DTT to a final concentration of 1 μM. The final concentration of GroEL and apical domain in refolding buffer was 7 μM and 4 μM or 1 μM, respectively. Refolding temperature was 25°C. After incubation for the times indicated, cyclophilin activity was measured as described (Fischer G et al (1984) Biomed Biochim Acta 4J3: 1101-1111) . Spontaneous refolding of cyclophilin A occurred to a yield of about 30%, and was finished in less than 1 min. Standard error was 5%.
Figure 3e shows the refolding of 1 μM cyclophilin A in the presence of 7 μM GroEL monomer (0, ) , 4 μM sht-GroEL 191-345 (•), 4 μM sht-GroEFL 191-376 (D) , or 1 μM sht- GroEL 191-376 (D) . 100% activity was obtained with native cyclophilin A. Figure 3e demonstrates that 100% refolding of inactivated cyclophilin A can be achieved with sht-GroEL 191-345 or sht-GroE 191-376 and that this is equivalent to that seen with GroEL.
Cyclophilin A refolds only at low yield in the absence of chaperone but refolding is facilitated in the presence of GroEL monomer owing to a transient complex formation (see Zahn et al (1996) FEBS Lett 3"J30: 152-156) . A similar rate constant was found for refolding of cyclophilin A in the presence of intact GroEL monomer and in the presence of GroEL fragments, which within a factor of four did not depend on chaperone concentration (Figure 3e) . Maximal refolding yield was obtained at stoichiometric concentrations of cyclophilin A and apical domain, indicating the formation of 1:1 complex between chaperone fragment and substrate protein during refolding. Example 5 - Thermal denaturation of sht-GroEL 191-376 and sht-GroEL 191-345
Thermal denaturation monitored by far ultra violet- circular dichroism (UV-CD) at 222 nm and was carried out on a Jasco J720 spectropolarimeter interfaced with a Neslab RTE-100 water bath, using a ther ostated cuvette (Helma) with 1 mm path length. The temperature was increased at a linear rate of 50 deg/h. The protein concentration was 40 μM in 10 mM sodium phosphate buffer pH 7.0. Data were fitted to a denaturation curve (Pace, C N (1990) Trends Biotechnol 8_: 93-98) to determine Tmr the midpoint temperature of denaturation.
Thermal denaturation was also monitored by differential scanning calorimetry (DSC) and the measurements were performed at a protein concentration of 88+^ 5 μM in 10 mM sodium phosphate buffer pH 7.0, using a Microcal MC-2D instrument at a notional scan rate of 60 deg/h. Sample preparation and data analysis were performed as described previously (Johnson C M and Fersht A R (1995) Biochemistry 3_4_: 6795-6804) . Both proteins exhibit at least 50% reversibility in their thermal unfolding as judged from the area of endotherms obtained on rescanning samples. Higher levels of reversibility were obtained at lower concentrations or by stopping scans at temperatures closer to the main unfolding transition. The low temperature transition observed in sht-GroEL 191-376 was completely reversible with scans limited in temperatures below 50°C.
Figure 4a shows the far UV-CD results and Figure 4b shows the DSC results. In each Figure, sht-GroEL 191-376 is the upper trace and sht-GroEL 191-345 is the lower trace. The C-terminal helices were found to be flexible. The apical domain and the fragment truncated at position 345 are reversibly denatured by temperature or urea, and the denaturation is not influenced by the N-terminal histidine-tail. There are two cooperative folding transitions, at 34°C and 67°C. At 45°C, the CD spectrum of the apical domain is identical, to that of the truncated domain. The C-terminal α-helices, therefore, must melt at the lower temperature and separately from the "domain core". The second cooperative transition associated with the extra 31 amino acids in sht-GroEL 191-376 was confirmed by DSC (Figure 4b) . The calorimetric data are also essentially consistent with the unfolding of the apical domain as a monomer. At physiological temperature, about 50% of helices Hll and H12 are in an unfolded conformation, and thus flexible.
Example 6 - Crystallisation of GroEL fragments and diffraction study
Crystals were obtained from hanging drops initially containing sht-GroEL 191-345 at 23 mg ml"1, 11% PEG 4000, 50 mM Tris-HCl pH 8.5 and 100 mM LiS04, equilibrated against reservoirs consisting of 22% PEG 4000, 100 mM Tris-HCl pH 8.5 and 200 mM LiS0. X-ray data were collected from a capillary-mounted crystal at 4°C using a 30 cm Mar Research image plate detector at station 9.6 of the Synchrotron Radiation Source (SRS) at Daresbury, UK (λ=0.87 A). Unless stated otherwise, all data processing, data reduction, electron density syntheses and structural analyses were carried out using CCP4 software (Daresbury Laboratory, Warrington, UK) . Indexing and intensity measurements of diffraction data were carried out with the MOSFLM program suite Leslie A G W in Joint CCP4 and ESF-EACMB Newsletter on Protein Crystallography No 26 (Daresbury Laboratory, Warrington, UK, 1992) .
The structure was solved by conventional molecular replacement methods, using the program AMORE (Navaza J (1994) Acta Crystallogr A 5_0: 157-163), and a search model consisting of residues 191-345 of the refind structure of GroEL (Braig et al (1995) Nature Struct Biol 2 \ 1083-1094) . The asymmetric unit contains one protein molecule, corresponding to a solvent content of 51%.
Model rebuilding and refinement was carried out with 0 (Jones et al (1991) Acta Crystallogr A 1_: 110-119), and the structure was refined using X-PLOR (Briinger A T X-PLOR, Version 3.1, A System for Crystallography and NMR (Yale Univ Press, New Haven, CT, 1992) ) , using Engh and Huber parameters (Engh R A & Huber R (1991) Acta Crystallogr A Vl_ 392-400) . The current model contains 8 water molecules and is complete with the following exceptions, which could not be modelled due to poor or non-existent electron density: residues 302-307 and residues 337-345 from the C-terminus. Electron density for the N-terminal His-tag is also not observed. No residues have disallowed backbone φψ angles. Table 1 provides a summary of the crystallographic data.
TABLE 1 Summary of crystallographic data
Data collection statistics Unit cell dimensions a =b= 91.67 A, c=
38.33A
Space group P3χ21
Resolution (A) 22.0-2.5
Measured reflections 21,762
Unique reflections 6,564
Completeness of data (%)* 99.4 (96.7)
Emerge (%)*+ 9.9 (45.1)
<F/σF>* 19.8 (4.6)
Multiplicity* 3.3 (3.0)
Refinement statistics
Resolution (A) 8.0-2.5
R- factor/ free R- factor (%), F 0* 21.4/29.1 r.m.s.d bond length (A) 0.006 r.m.s.d bond angle (deg) 1.42
* values given in parenthesis are for the highest resolution shell. t agreement between intensities of repeated measurements of the same reflections and can be defined as: Σ(Jhfi- <Jh>) /∑ih/i, where Jh(ι are individual values and <Jh> is the mean value of the intensity of reflection h Φ The free R-factor was calculated with the 10% data omitted from the refinement (test set, prepared using DATAMAN (Kleywegt, G.J. & Jones, t.A. Acta. crystallogr. D 50, 178-185 (1994)
Example 7 - The three dimensional structure of sht-GroEL 191-345 Figure 5a shows the secondary structure representation drawn with MolScript (Kraulis P J (1991) J Appl Crystallogr 24_: 946-950) and Raster3D (Merrit E A and Murphy M E P (1994) Acta Crystallogr D 50: 869-873). Helices are labelled as in Braig et al (Braig et al
(1995) Nature Struct Biol 2 : 1083-1094) . N and C refer to the N-terminus (residue 191) and C-terminus (residue 336) of the model.
Figure 5b is a backbone representation drawn with programm 0 (Jones et al (1991) Acta Crystallogr A 47: 110-119) , in same orientations as Figure 5a colour-coded according to B-factor of main-chain atoms: blue (20 A2) to red (60 A2) .
Figure 5c shows a representative region of electron density, calculated using refined co-ordinates, viewed along the helices H8 and H9.
Crystals of sht-GroEL 191-345 grow in the trigonal space group P3ι21 with one molecule per asymmetric unit, giving a solvent content of 51%. The three dimensional structure of the "domain core" was solved by molecular replacement and refined to an R-factor of 21.4% and a free R-factor of 29.1% for all data between 8.0 and 2.5 A (Table 1) . The quality of the electron density map is shown in Figure 5c.
Overall, sht-GroEL 191-345 has the same fold as the corresponding region of the intact GroEL protein (Figure 5a) : two orthogonal β-sheets forming a β-sandwich, flanked by three α-helices. The structure is more ordered and better resolved than is the apical domain of the intact protein (Figure 5b) : the average B-factor is 42 A2, compared with 97 A2 for residues 191-336 of the GroEL structure. Unfortunately, such unusually high disorder in the GroEL structure complicates the interpretation of a structural comparison, in the same way as it can be misleading to use an average of NMR structures for comparison with a crystal structure, and so we have not done so here. In essence, the structure can be described as a well ordered β-sandwich scaffold, flanked by relatively flexible helical and loop regions. In particular, the B-factors of most of the β-strand, α- helix, and loop structure is about 20, 40, and 60 A2, respectively (Figure 5b) . There are two regions of considerable disorder. First, electron density is not found for the C-terminal residues 337-345, which correspond to the first half of α-helix Hll. This is an agreement with the results from folding experiments, described above. Second, electron density for residues 302-307 is very poor and fragmented. This region is part of the most disordered segment in the structure of intact GroEL (Braig et al (1995) Nature Struct Biol 2 \ 1083- 1094) . The content of α-helix and β-sheet secondary structure of the GroEL fragment (Figure 1) is 48% and 74% higher, respectively, compared with the corresponding region of intact GroEL. There are four additional segments of secondary structure in the new structure (Figures 1 and 5a) : residues 299-301 form a short β- strand; residues 201-205, 229-232 and 308-317 form 310- helices.
Example 8 - Refolding chromatography of IGPS (49-252)
(indoleglycerol phosphate synthase lacking residues 1-48) The fragments GroEL (191-345) or GroEL (191-376) were expressed in E. coli with a 17-residue N-terminal tail containing 6 histidine residues so that they could be purified using Ni-NTA resin - the Ni2+ ion which is chelated by the agarose binds to the histidine tag.
Initially, the same resin was used to immobilise purified GroEL (191-345) or GroEL (191-376) by their histidine tags for preparative purposes. Subsequently, the purified fragments were also linked covalently to agarose via CNBr activation .(Axen R et al (1967) Nature 1302-1304) .. The immobilised GroEL fragments were found to facilitate the refolding of cyclophilin A with high efficiency.
The apical domain of GroEL (GroEL (191-376) ) and the "core" of the apical domain, GroEL (191-345) , were cloned and expressed in E. coli as fusion proteins containing a 17-residue N-terminal histidine-tail. .The fragments were immobilised by two methods.
A) . Immobilised apical domain attached to Ni -NTA resin. The Ni-NTA resin (from QIAGEN) is a chelating adsorbant composed of a high surface concentration of nitrilo-tri- acetic acid (NTA) ligand attached to Sepharose CL-6B. The NTA occupies four of six ligand binding sites in the coordination sphere of the Ni2+ ion, leaving two sites free to interact with the six histidines in the N- terminal tail. Proteins containing one or more 6 x His affinity tags, located at either the amino or carboxyl terminus of the protein, bind to the Ni-NTA resin with an affinity K = 10-13 M, pH 7.8). The stability of the 6 x His/Ni-NTA interaction is unaffected by strong denaturants such as 6M guanidine hydrochloride or 8M urea, or the presence of low levels of β-mercaptoethanol (1-10 mM) . 3.5 mL of Ni-NTA resin were equilibrated with 0.1M potassium phosphate at pH 7.8, containing 5 mM β- mercaptoethanol . The GroEL domain was added to saturation of the affinity gel (21 mg of protein per 3.5 mL of gel) and incubated at room temperature for 30 min with gentle mixing. The gel was packed in a column suitable for FPLC (5 x 100 mm, from Pharmacia) and thoroughly washed with the initial buffer.
B) . Immobilised apical domain attached to CNBr-activated Sepharose 4B
To minimise the steric effects and preserve the structure of the binding site in the apical domain, the binding capacity of the gel was reduced by controlled hydrolysis of the activated gel before coupling. 300 mg of freeze- dried powder were suspended in 50 mM of NaHC03 pH 8.3, washed with the same buffer and re-swollen on a sintered glass filter (G3), then suspended in the buffer and mixed in an end-over-end shaker for 4 h at room temperature. The apical domain, dissolved in the coupling buffer (0.1M NaHC03, pH 8.3 and 0.5 M NaCl), was added to the gel suspension (10 mg protein/mL gel) and mixed in an end-over-end shaker for 6 h at room temperature. It was then washed with the coupling buffer. The remaining active groups were blocked by adding 2.5 M ethanolamine pH 8 and shaking for 4 h at room temperature. Uncoupled apical domain was removed by washing with five cycles of alternately high and low pH buffer solution (Tris-HCl 0.1M pH 7.8 containing 0.5M NaCl followed by acetate buffer (0.1M, pH 4 plus 0.5M NaCl) . The gel was finally washed with 0.1 M potassium phosphate at pH 7.8, containing 5 mM 2-mercaptoethanol (refolding buffer) . The immobilised fragments were used in a chromatography column (Figure 6) . Denatured protein in urea was added to the column and developed with a refolding buffer. The protein eluted as from a conventional binding column, its passage was retarded and the peak could be characterised by a retention time. Some mutants of indoleglycerol phosphate synthase (IGPS) were expressed E. coli and isolated as inclusion bodies. These reprecipitate on attempts to renature them after dissolving at high concentrations in urea, and attempts at low protein concentrations yielded soluble material of non-native conformation. The mutant IGPS (49-252) (1 x 22 kDa), which lacks the first 48 amino acid residues, was obtained on chromatography on the GroEL (191-345) column in a 92% yield of material that had 100% binding activity (Figure 6) . The term "refolding chromatography" can be used to describe the phenomenon of refolding by passage through the column.
In more detail, a column of ht-GroEL 191-345 immobilised on Ni-NTA agarose (3.0 mL) was loaded with 2 nmol of IGPS (49-252) dissolved in 20 μL of 8 M urea, and the column was developed with refolding buffer (0.1 M potassium phosphate at pH 7.8, containing 5 mM 2- mercaptoethanol) using a Waters 625 LC HPLC system. In Figure 6, the two peaks are seen which are described by: Kav = ( e - V0)/(Vt - V0) , where Vt s the total volume of gel in the column, V0, the void volume, and Ve the volume at the maximum of the peak. A value of JCav indicates that the protein interacts with the support. Peak 1 had Kav = 1.0 (9.6 % of the total area) and peak 2 fCav = 1.7 (90.4%) . The protein in peak 2 was recovered in 1.2 mL, and contained 1.85 nmol of truncated IGPS (92.5 % yield). It displayed a circular dichroism spectrum characteristic of a native a/b protein and bound 3H-rCdRP (tritium- labelled reduced 1 [ (2-carboxyphenyl) amino] -1- deoxyribulose 5-phosphate) , a specific inhibitor of IGPS, with a stoichiometry of 1.0. Initially denatured cyclophilin chromatographed with two peaks, an inactive at Kav = 0.38 and an active at Kav 1.43
Example 9 - Batchwise renaturation of cyclophilin A on Ni-NTA agarose gels
This experiment uses batchwise mixing of materials. A solution of denatured cyclophilin in 8 M urea was diluted 100 fold with Ni-NTA-immobilised GroEL (191-345) in a refolding buffer (see Table II below) . After gentle mixing for 30 min, the cyclophilin was recovered in 84% yield of protein. Control experiments (Table II) in which the protein was denatured in 8 M urea and added to the refolding buffer alone, or mixed with agarose that had not been linked to GroEL fragments, gave cyclophilin that had only 20% of the activity of the starting material before denaturation. The cyclophilin used for the denaturation experiments had 88% of the activity of the purest samples previously obtained by the inventors. The specific activity of the material obtained from the
GroEL (191-345) -agarose resin (covalently linked) was 126% of that of the previous purest samples. Further, the total recovery of activity was 25% more than that initially present. Thus, the immobilised GroEL fragment had "reconditioned" the cyclophilin A by converting inactive material into active.
In more detail, suspension of 200 μL of gel (wet, sedimented volume) was mixed with refolding buffer (100 mM phosphate buffer, pH 7.8 plus 5 mM 2-mercaptoethanol) to give a volume of 990 μL. 10 μL of cyclophilin (from a 100 μM stock solution in refolding buffer + 8M urea, = 1 nmol of cyclophilin A) were added and the suspension was mixed in an up-down mixer for 30 min at room temperature.
The gel suspension was centrifuged to separate the supernatant (~800 μL) . The gel pellet was washed in miniprep columns and the eluate added to the supernatant to give about 900 μL. The protein concentration was determined from the A28O nm. Cyclophilin activity was measured in the supernatant as described .in (Makino Y et al (1993) FEBS Lett 3_36_: 363-367) . The sample prior to denaturation had 88% of the specific activity of the highest activity of native cyclophilin previously obtained by the inventors. The control is agarose alone, without Ni. The results are shown in table II below.
Table II
Gel type Protein yield Cyclophilin Cyclophilin specific activity yield activity relative to "pure" native
(%) ( % : ( % )
10
GroEL 191-345 agarose 84 98 103
GroEL 191-376 agarose 81 105 115
GroEL 191-345 agarose 87 125 126
Agarose (control) ^ 72 17 20
Figure imgf000064_0001
The procedure was applied to other proteins used in the laboratory. Glucosamine 6-phosphate deaminase (6 x 30 kDa) (Oliva G et al (1995) Structure 3_: 1323-1332) normally regains only 10 % or less activity after renaturation from urea denaturation. A 100 % yield was obtained on batchwise treatment with GroEL (191-345) - agarose. Further, a sample that had lost all activity on storage in solution in 50% glycerol/water at -20 °C for 5 years also regained 100 % activity with this treatment after dissolving in urea.
Example 10 - Residues of GroEL implicated in binding peptides
The X-ray crystal structure of GroEL 191-376 with the 17 residue N-terminal tail shows that seven residues of the tail of one molecule bind in the active site of the other. Residues 230-271 are in the binding site. All residues are shown in Figure 8 in which large, bold and underlined residues are those detected by X-ray crystal structure of ht GroEL 191-376 as being involved in protein binding.
The X-ray crystal structure shows that 193-336 should be reasonably stable. The 193-337 fragment was cloned and expressed and found to be stable. Therefore, residues 191 and 192 may be omitted.
Figure 8
190 VEGMQFDRGY LSPYFINKPE TGAVELESPF ILLADKKISN
230 IREMLPVLEA VAKAGKPLLI IAEDVEGEAL ATAWNTIRG 270 IVKVAAVKAP GFGDRRKAML QDIATLTGGT VISEEIGMEL
310 EKATLEDLGQ AKRWINKDT TTIIDGVGEE AAIQGRVAQI
Example 11 - Minimal size of functional unit - refolding activity of polypeptides
Polypeptide fragments of E. coli GroEL were synthesised on a protein synthesiser according to methods and using equipment well known to a person skilled in the art. Longer fragments were generated by synthesising smaller fragments and then linking them together. The fragments were then tested by their ability to refold rhodanese, cyclophilin A, glucosamine-6-phosphate deaminase and IGPS 49-252 as described above.
The following fragments were made and tested:
a) 245-256, 244-257 ...et seq. . . 192-309, 191-310 [55 separate fragments] .
b) 235-301, 234-302 ... et seq. . . 193-343, 192-344 [44 separate fragments] .
Refolding activity above background controls was found for all polypeptides comprising at least amino acid residues 230-271 of GroEL. Example 12 - Alignment of consensus binding sequences of cpn60 family members
Figure 9 shows sequences in OWL database release 28.1 containing clear homology to apical domain of GroEL
(residues 191-375) in PDB structure pdblgrl.ent. OWL is a non redundant database merging SWISS-PROT, PIR (1-3), GenBank (translation) and NRL-3D. Consensus sequence = residues 230-271 inclusive) containing peptide-binding site (as identified by crystal structure analysis and polypeptide binding studies) . X = residue in peptide- binding site in the X-ray crystal structure of mini chaperone. The GroEL E. coli chaperone sequence is shown in italics.
n : 349spec

Claims

Claims :
1. A chaperone polypeptide having an amino acid sequence selected from at least amino acid residues 230-271 but no more than residues 150-455 or 151-456 of a GroEL sequence substantially as shown in Figure 7, or a corresponding sequence of a substantially homologous chaperone polypeptide, or a modified, mutated or variant thereof having chaperone activity.
2. Monomeric polypeptide having chaperone activity and incapable of multimerisation in solution.
3. A chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including the protein binding active site motif:
1 X X X 2 X X 3 4 XX 5 X X X X X X X XX X X X X X X 6 X X 7 8 X 9 10 11 X X 12 X 13 14
wherein 1 is selected from amino acid residues:
I, M, L, V, S, F or A; wherein 2 is selected from: L, I, P, V or A; wherein 3 is selected from: L, E, V, H or I; wherein 4 is selected from: E, A, R, L, Q, or N; wherein 5 is selected from: A, V, I, M, L, N, S, R, T, Q or K; wherein 6 is selected from: E, D or G; wherein 7 is selected from: A, P, S, T, G or L; wherein 8 is selected from: T, A, N, S or V; wherein 9 is selected from: V, L, I or A; wherein 10 is selected from: V, L, I, F or H; wherein 11 is selected from: N, S or L; wherein 12 is selected from: R, K, N, Q, L or S; wherein 13 is selected from: I, T, S, G, V, A, Q, N, K, F or P; wherein 14 is selected from: V, I, L, F, D or T; and wherein the X's represent a peptide bond or bonds or at least one amino acid residue,
or a functional variant thereof in which one or more of the numbered amino acid residues 1 to 14 has undergone a conservative substitution.
4. A chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including at least one protein binding active site motif moiety selected from:
(a) 1 X X X 2 X X 3 4 X and
(b) X 5 X X X X X X X X X X X X X X X 6 X X 7 8 X 9 10 11 X X 12 X 13 14
wherein 1 is selected from amino acid residues: I, M, L, V, S, F or A; wherein 2 is selected from: L, I, P, V or A; wherein 3 is selected from: L, E, V, H or I; wherein 4 is selected from: E, A, R, L, Q, or N; wherein 5 is selected from: A, V, I, M, L, N, S, R, T, Q or K; wherein 6 is selected from: E, D or G; wherein 7 is selected from: A, P, S, T, G or L; wherein 8 is selected from: T, A, N, S or V; wherein 9 is selected from: V, L, I or A; wherein 10 is selected from: V, L, I, F or H; wherein 11 is selected from: N, S or L; wherein 12 is selected from: R, K, N, Q, L or S; wherein 13 is selected from: I, T, S, G, V, A, Q, N, K, F or P; wherein 14 is selected from: V, I, L, F, D or T; and
wherein X is at least one amino acid residue. or a functional variant thereof in which one or more of the numbered amino acid residues 1 to 14 has undergone a conservative substitution.
5. A chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including the protein binding active site motif:
I X X X X X L E X X AX X X X X X X X X X X X X X X E X X A T X VVN X X R X I V
wherein X is at least one amino acid residue, or a functional variant thereof in which one or more of the specified amino acid residues has undergone a conservative substitution.
6. A chaperone polypeptide which, when in solution, remains monomeric and has the ability to refold, reactivate or recondition proteins, said polypeptide including at least one protein binding active site motif moiety selected from:
(a) I X X X L X X L E X
(b) X A X X X X X X X X X X X X X X X E X X A T X VV N X X R XI V
wherein X is at least one amino acid residue, or a functional variant thereof in which one or more of the specified amino acid residues has undergone a conservative substitution.
7. A chaperone polypeptide as claimed in any one of claims 3 to 6 in which the active site motif or an active site motif moiety includes the conserved sequence:
PLL(V)I(V)IA(S)EDV(I)EGEAL in which amino acid symbols in parenthesis are alternatives to the immediately preceding symbol reading left to right.
8. Monomeric polypeptide having chaperone activity and incapable of multimerisation characterised in that in the absence of ATP the polypeptide has a protein refolding activity of more than 50%, preferably 60%, even more preferably 75%, said refolding activity being determined by contacting the polypeptide with an inactivated protein of known specific activity prior to inactivation, and then determining the specific activity of the said protein after contact with the polypeptide, the % refolding activity being:
specific activity of protein after contact with polypeptide x 100 specific activity of protein prior to inactivation 1
9. A polypeptide as claimed in any preceding claim, wherein the chaperone activity is determined by the refolding of cyclophilin A.
10. A polypeptide as claimed in claim 9 wherein 8M urea denatured cyclophilin A (lOOμM) is diluted into lOOmM potassium phosphate buffer pH7.0, lOmM DTT to a final concentration of lμM and then contacted with at least IμM of said polypeptide at 25°C for at least 5 min, the resultant cyclophilin A activity being assayed by the method of Fischer G et al (1984) Biomed Biochim Acta 43_: 1101-1111.
11. A polypeptide as claimed in any preceding claim being an hsp60 polypeptide, preferably a GroEL polypeptide.
12. A polypeptide as claimed in any preceding claim which comprises at least an amino acid sequence selected from GroEL residues: (a) 191-329, 191-330, 191-331, 191-332, 191-333, 191-334,
191-335, 191-336, 191-337, 191-338, 191-339, 191-340,
191-341, 191-342, 191-343, 191-344, 191-345, 191-346,
191-347, 191-348, 191-349, 191-350, 191-351, 191-352,
191-353, 191-354, 191-355, 191-356, 191-357, 191-358,
191-359, 191-360, 191-361, 191-362, 191-363, 191-364,
191-365, 191-366, 191-367, 191-368, 191-369, 191-370,
191-371, 191^372, 191-373, 191-374, 191-375 or 191-376, or
(b) 192-329, 192-330, 192-331, 192-332, 192-333, 192-334, 192-335, 192-336, 192-337, 192-338, 192-339, 192-340, 192-341, 192-342, 192-343, 192-344, 192-345, 192-346, 192-347, 192-348, 192-349, 192-350, 192-351, 192-352, 192-353, 192-354, 192-355, 192-356, 192-357, 192-358,
192-359, 192-360, 192-361, 192-362, 192-363, 192-364,
192-365, 192-366, 192-367, 192-368, 192-369, 192-370,
192-371, 192-372, 192-373, 192-374, 192-375 or 192-376, or
(c) 193-329, 193-330, 193-331, 193-332, 193-333, 193-334, 193-335, 193-336, 193-337, 193-338, 193-339, 193-340, 193-341, 193-342, 193-343, 193-344, 193-345, 193-346, 193-347, 193-348, 193-349, 193-350, 193-351, 193-352, 193-353, 193-354, 193-355, 193-356, 193-357, 193-358,
193-359, 193-360, 193-361, 193-362, 193-363, 193-364,
193-365, 193-366, 193-367, 193-368, 193-369, 193-370,
193-371, 193-372, 193-373, 193-374, 193-375 or 193-376, or
(d) 230-271, 229-271, 229-272, 228-272, 228-273, . . . et seq. . . 194-328, 194-329, or
the equivalent residues of substantially homologous chaperonins, or a modified, mutated or variant sequence thereof.
13. A polypeptide as claimed in claim 8, wherein the selected amino acid sequence is selected from 230-271, 191- 345, 191-376, or 193-337 of GroEL, or the equivalent residues of substantially homologous chaperonins, or a modified, mutated or variant sequence thereof.
14. A polypeptide as claimed in any preceding claim further comprising a polyamino acid sequence, preferably an N- terminal polyamino acid sequence.
15. A polypeptide as claim in claim 14, wherein the polyamino acid sequence is a polyhistidine sequence.
16. A polypeptide as claimed in claim 14 or claim 15, wherein the polyamino acid sequence includes a cleavage site cleavable by a cleavage agent, preferably said cleavage agent is thrombin.
17. A polypeptide as claimed in any one of claims 14 to 16 wherein the further polyamino acid sequence comprises a number of amino acid residues in the range 2 to 500, preferably 5 to 100, more preferably 17 to 39.
18. A polypeptide as claimed in any preceding claim and is immobilised form, optionally immobilised to a chromatographic matrix, preferably an agarose resin.
19. A polypeptide as claimed in claim 18, wherein the agarose resin is a nickel-nitrilo-tri-acetic acid (NTA) - ligated agarose resin.
20. A polypeptide as claimed in any preceding claim fused to a heterologous protein or polypeptide.
21. A recombinant polypeptide as claimed in any preceding claim.
22. An isolated nucleic acid molecule comprising a nucleotide sequence encoding a polypeptide as defined in any one of claims 1 to 21 or a nucleotide sequence hybridisable thereto and optionally encoding a polypeptide having chaperone activity.
23. A recombinant nucleic acid molecule for use in cloning and/or expressing a nucleic acid sequence, said recombinant nucleic acid molecule comprising:
(a) a nucleotide sequence encoding amino acid residues 191- 376 of GroEL, or
(b) a nucleotide sequence encoding amino acid residues 191- 345 of GroEL, or
(c) a nucleotide sequence encoding amino acid residues 193- 337, or
(d) a nucleotide sequence encoding amino acid residues of GroEL selected from amino acid residues:
(i) 191-329, 191-330, 191-331, 191-332, 191-333, 191-334,
191-335, 191-336, 191-337, 191-338, 191-339, 191-340,
191-341, 191-342, 191-343, 191-344, 191-345, 191-346,
191-347, 191-348, 191-349, 191-350, 191-351, 191-352, 191-353, 191-354, 191-355, 191-356, 191-357, 191-358,
191-359, 191-360, 191-361, 191-362, 191-363, 191-364,
191-365, 191-366, 191-367, 191-368, 191-369, 191-370,
191-371, 191-372, 191-373, 191-374, 191-375 or 191-376, or (ii) 192-329, 192-330, 192-331, 192-332, 192-333, 192-334,
192-335, 192-336, 192-337, 192-338, 192-339, 192-340,
192-341, 192-342, 192-343, 192-344, 192-345, 192-346,
192-347, 192-348, 192-349, 192-350, 192-351, 192-352,
192-353, 192-354, 192-355, 192-356, 192-357, 192-358,
192-359, 192-360, 192-361, 192-362, 192-363, 192-364,
192-365, 192-366, 192-367, 192-368, 192-369, 192-370,
192-371, 192-372, 192-373, 192-374, 192-375 or 192-376, or
(iii) 193-329, 193-330, 193-331, 193-332, 193-333, 193-334,
193-335, 193-336, 193-337, 193-338, 193-339, 193-340,
193-341, 193-342, 193-343, 193-344, 193-345, 193-346,
193-347, 193-348, 193-349, 193-350, 193-351, 193-352, 193-353, 193-354, 193-355, 193-356, 193-357, 193-358,
193-359, 193-360, 193-361, 193-362, 193-363, 193-364,
193-365, 193-366, 193-367, 193-368, 193-369, 193-370,
193-371, 193-372, 193-373, 193-374, 193-375 or 193-376, or
(e) 230-271, 229-271, 229-272, 228-272, 228-273, ...et seg... 194-328, 194-329, or
(f) a nucleotide sequence hybridisable to any of (a), (b) , (c) , (d) or (e) above and encoding a monomeric polypeptide having chaperone activity, or
(g) degenerate nucleotide sequences corresponding to (a) ,
(b), (c), (d), (e) or (f) above.
24. A vector comprising a nucleic acid as claimed in claim 22 or claim 23.
25. A host cell transformed with a nucleic acid as defined in claim 22 or claim 23 or vector as defined in claim 24.
26. A method of making a polypeptide as defined in any one of claims 1 to 21 comprising transforming a host cell with a nucleic acid encoding said polypeptide, culturing the transformed cell and expressing said polypeptide.
27. A method of making a polypeptide as claimed in claim 26 wherein the nucleic acid is as defined in claim 22 or claim 23.
28. A method as claimed in claim 26 or claim 27, wherein the expressed polypeptide product is subject to cleavage.
29. A pharmaceutical formulation comprising a polypeptide of any one of claims 1 to 21, optionally together with a diluent, carrier or excipient.
30. A polypeptide as defined in any one of claims 1 to 21 for use in the treatment of disease.
31. The use of a polypeptide as defined in any one claims 1 to 21 in the manufacture of a medicament for the treatment of disease associated with protein/polypeptide structure.
32. A nucleic acid molecule as defined in claims 22 or claim 23 for use in the treatment of disease.
33. The use of nucleic acid molecule as defined in claim 22 or claim 23 in the manufacture of a medicament for the treatment of disease associated with protein/polypeptide structure.
34. A method of reconditioning a molecule preferably a protein comprising contacting said protein with a polypeptide of any one of claims 1 to 21.
35. A method as claimed in claim 34, wherein the protein is subjected to inactivation or denaturation prior to contacting with said polypeptide.
36. A method as claimed in claim 34 or claim 35, wherein the polypeptide is immobilised to a solid phase.
37. A method as claimed in claim 33 or claim 34, wherein the polypeptide is immobilised to a solid phase, preferably a chromatographic matrix, and the contacting of protein and polypeptide is carried out by applying the protein to the top of a bed of the matrix packed in a column and then eluting the polypeptide through the column.
38. Use of a polypeptide as claimed in any one of claims 1 to 21 for altering the structure of a molecule.
39. The use of claim 38, wherein the molecule is a protein or polypeptide and the alteration in structure is by folding unfolding or refolding.
40. The use of claim 38 or claim 39, wherein the stoichiometry between the polypeptide and the molecule being altered is about 1:1.
41. Use of a polypeptide as defined in any one of claims 1 to 21 in the purification or increase in yield, specific activity or quality of biological molecules, preferably said polypeptide being attached to a support.
42. A kit for reconditioning or refolding a molecule, preferably a protein, comprising a polypeptide of any one of claims 1 to 21 immobilised to a solid phase and a container for holding said solid phase polypeptide.
43. Use of a polypeptide as defined in any one of claims 1 to 21 in the production of a protein or polypeptide by recombinant means, wherein the said polypeptide is co- expressed with the protein or polypeptide thereby to improve the yield or quality of the protein or polypeptide.
44. An antibody reactive against a polypeptide as defined in any one of claims 1 to 21.
45. An antibody as claimed in claim 44 for use in the treatment of disease.
46. The use of an antibody as claimed in claim 44 in the manufacture of a medicament for the treatment of disease associated with protein/polypeptide structure.
47. A method of treating disease in which an effective amount of a polypeptide of any one of claims 1 to 21 is administered.
48. A method of treating disease which comprises administering an effective amount of an inhibitor of the chaperone activity of a polypeptide of any one of claims 1 to 21.
49. A method as claimed in claim 48 wherein said inhibitor is an antibody.
50. A method of treating disease by gene therapy which utilises a construct encoding a polypeptide of any one of claims 1 to 20 or an antagonist thereof.
n:/349cl
PCT/GB1996/002980 1996-09-26 1996-12-03 Chaperone fragments WO1998024909A1 (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
PCT/GB1996/002980 WO1998024909A1 (en) 1996-12-03 1996-12-03 Chaperone fragments
AU10363/97A AU1036397A (en) 1996-12-03 1996-12-03 Chaperone fragments
EP97943058A EP0928336A1 (en) 1996-09-26 1997-09-26 Chaperone fragments
AU44674/97A AU736670B2 (en) 1996-09-26 1997-09-26 Chaperone fragments
PCT/GB1997/002652 WO1998013496A1 (en) 1996-09-26 1997-09-26 Chaperone fragments
JP10515422A JP2001501093A (en) 1996-09-26 1997-09-26 Chaperone fragment
US09/276,455 US20030078373A1 (en) 1996-09-26 1999-03-25 Chaperone fragments

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/GB1996/002980 WO1998024909A1 (en) 1996-12-03 1996-12-03 Chaperone fragments

Publications (1)

Publication Number Publication Date
WO1998024909A1 true WO1998024909A1 (en) 1998-06-11

Family

ID=10788667

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB1996/002980 WO1998024909A1 (en) 1996-09-26 1996-12-03 Chaperone fragments

Country Status (2)

Country Link
AU (1) AU1036397A (en)
WO (1) WO1998024909A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998053322A1 (en) * 1997-05-23 1998-11-26 The Institute Of Cancer Research: Royal Cancer Hospital Binding complexes
WO1999005163A1 (en) * 1997-07-24 1999-02-04 Medical Research Council Refolding method using a foldase and a chaperone
WO2000075346A1 (en) * 1999-06-09 2000-12-14 Medical Research Council Fusion proteins comprising a fragment of a chaperon polypeptide

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0650975A1 (en) * 1993-08-03 1995-05-03 Nippon Oil Co., Ltd. Refolding of denatured proteins using chaperonin 60 monomers from Thermus thermophilus

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0650975A1 (en) * 1993-08-03 1995-05-03 Nippon Oil Co., Ltd. Refolding of denatured proteins using chaperonin 60 monomers from Thermus thermophilus

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
H. TAGUCHI ET AL.: "Monomeric Chaperonin-60 and Its 50-kDa Fragment Possess the Ability to Interact with Non-native Proteins, to Suppress Aggregation, and to Promote Protein Folding", JOURNAL OF BIOLOGICAL CHEMISTRY, vol. 269, no. 11, 18 March 1994 (1994-03-18), BALTIMORE, MD US, pages 8529 - 8534, XP002037453 *
R. ZAHN ET AL.: "Chaperone activity and structure of monomeric polypeptide binding domains of GroEL", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF USA, vol. 93, no. 26, 24 December 1996 (1996-12-24), WASHINGTON US, pages 15024 - 15029, XP002037454 *
Y. MAKINO ET AL.: "Truncated GroEL monomer has the ability to promote folding of rhodanese without GroES and ATP", FEBS LETTERS, vol. 336, no. 2, 27 December 1993 (1993-12-27), AMSTERDAM NL, pages 363 - 367, XP002037451 *
Z.W. WHITE ET AL.: "A Monomeric Variant of GroEL Binds Nucleotides but Is Inactive as a Molecular Chaperone", JOURNAL OF BIOLOGICAL CHEMISTRY, vol. 270, no. 35, 1 September 1995 (1995-09-01), BALTIMORE, MD US, pages 20404 - 20409, XP002037452 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998053322A1 (en) * 1997-05-23 1998-11-26 The Institute Of Cancer Research: Royal Cancer Hospital Binding complexes
US7018791B1 (en) 1997-05-23 2006-03-28 Institute Of Cancer Research: Royal Cancer Hospital Binding complexes
WO1999005163A1 (en) * 1997-07-24 1999-02-04 Medical Research Council Refolding method using a foldase and a chaperone
WO2000075346A1 (en) * 1999-06-09 2000-12-14 Medical Research Council Fusion proteins comprising a fragment of a chaperon polypeptide

Also Published As

Publication number Publication date
AU1036397A (en) 1998-06-29

Similar Documents

Publication Publication Date Title
AU736670B2 (en) Chaperone fragments
Förster et al. The 1.9 Å structure of a proteasome-11S activator complex and implications for proteasome-PAN/PA700 interactions
Löwe et al. Crystal structure of the SMC head domain: an ABC ATPase with 900 residues antiparallel coiled-coil inserted
Aleshin et al. Crystal structures of mutant monomeric hexokinase I reveal multiple ADP binding sites and conformational changes relevant to allosteric regulation
McKnight et al. A thermostable 35-residue subdomain within villin headpiece
Lascu et al. Quaternary structure of nucleoside diphosphate kinases
Blickling et al. Structure of dihydrodipicolinate synthase of Nicotiana sylvestris reveals novel quaternary structure
Ko et al. Structure of human erythrocyte catalase
Morera et al. Refined X-ray structure of Dictyostelium discoideum nucleoside diphosphate kinase at 1.8 Å resolution
Chen et al. Regulatory features of the trp operon and the crystal structure of the trp RNA-binding attenuation protein from Bacillus stearothermophilus
Xu et al. The structures of the PII proteins from the cyanobacteria Synechococcus sp. PCC 7942 and Synechocystis sp. PCC 6803
Roosild et al. KTN (RCK) domains regulate K+ channels and transporters by controlling the dimer-hinge conformation
Rypniewski et al. The sequence and X-ray structure of the trypsin from Fusarium oxysporum
Biswas et al. A new scaffold of an old protein fold ensures binding to the bisintercalator thiocoraline
Tolbert et al. Pyruvoyl-dependent arginine decarboxylase from Methanococcus jannaschii: crystal structures of the self-cleaved and S53A proenzyme forms
AU782516B2 (en) Crystallization and structure determination of Staphylococcus aureus UDP-N-acetylenolpyruvylglucosamine reductase (S. aureus MurB)
WO1998024909A1 (en) Chaperone fragments
US20030077692A1 (en) Refolding method
Helmann Purification of Bacillus subtilis RNA polymerase and associated factors
Orriss et al. Crystal structure of the IIBSor domain of the sorbose permease from Klebsiella pneumoniae solved to 1.75 Å resolution
Chen et al. Crystal structure of a nucleoside diphosphate kinase from Bacillus halodenitrificans: coexpression of its activity with a Mn-superoxide dismutase
Pérez et al. Expression, purification, and spectroscopic studies of the guanine nucleotide exchange domain of human elongation factor, EF-1β
Cordeiro et al. Leishmania mexicana mexicana glucose-6-phosphate isomerase: crystallization, molecular-replacement solution and inhibition
Xin et al. Expression of bovine mitochondrial elongation factor Ts in Escherichia coli and characterization of the heterologous complex formed with prokaryotic elongation factor Tu
Knäblein et al. Isolation, Cloning, Sequence Analysis and X-Ray Structure of Dimethyl Sulfoxide/Trimethylamine N-Oxide Reductase from Rhodobactercapsulatus

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE HU IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK TJ TM TR TT UA UG US UZ VN AM AZ BY KG KZ MD RU TJ TM

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): KE LS MW SD SZ UG AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG

121 Ep: the epo has been informed by wipo that ep was designated in this application
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

NENP Non-entry into the national phase

Ref country code: JP

Ref document number: 1998525301

Format of ref document f/p: F

122 Ep: pct application non-entry in european phase