US20070128657A1 - Molecular libraries - Google Patents

Molecular libraries Download PDF

Info

Publication number
US20070128657A1
US20070128657A1 US10/493,582 US49358203A US2007128657A1 US 20070128657 A1 US20070128657 A1 US 20070128657A1 US 49358203 A US49358203 A US 49358203A US 2007128657 A1 US2007128657 A1 US 2007128657A1
Authority
US
United States
Prior art keywords
library
peptides
cells
protein
nucleic acid
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US10/493,582
Inventor
Mohamed El-Gewely
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from GB0201522A external-priority patent/GB0201522D0/en
Priority claimed from GB0201523A external-priority patent/GB0201523D0/en
Application filed by Individual filed Critical Individual
Publication of US20070128657A1 publication Critical patent/US20070128657A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1086Preparation or screening of expression libraries, e.g. reporter assays
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K7/00Peptides having 5 to 20 amino acids in a fully defined sequence; Derivatives thereof
    • C07K7/04Linear peptides containing only normal peptide links
    • C07K7/06Linear peptides containing only normal peptide links having 5 to 11 amino acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1055Protein x Protein interaction, e.g. two hybrid selection
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B30/00Methods of screening libraries
    • C40B30/04Methods of screening libraries by measuring the ability to specifically bind a target molecule, e.g. antibody-antigen binding, receptor-ligand binding
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides

Definitions

  • the present invention relates to molecular libraries, in particular to molecular libraries which can be screened to identify members of the library with certain characteristics.
  • proteins which have functionally active surfaces and pockets which are characterised by the three dimensional shape of the protein molecule and particular reactive amino acid side groups. These proteins are able to interact, associate or bind with a variety of other biomolecules, including other proteins, nucleic acids, organic molecules, ions etc.
  • target molecules e.g. proteins
  • target molecules e.g. proteins
  • such molecules may be able to act as agonists or antagonists of enzyme or receptor activity for example, and more recently it has been proposed that the three dimensional shape of target proteins may be altered through interactions with small peptides (Nature Biotechnology letter El-Gewely 1999, Vol 17, p 210). In this way the wild-type function of a mutant protein can be restored.
  • a variety of molecular libraries which can be screened to identify molecules with desired properties are known in the art. These may comprise organic chemicals synthesised by traditional methods or chemicals produced by combinational chemistry. It is expensive to generate very large libraries of such organic chemicals even using combinatorial techniques and so interest has turned to biological systems.
  • phage display where peptides are expressed on the surface of a filamentous phage as a fusion protein with a coat protein.
  • each phage will carry only one such peptide (although it will generally have several copies of that molecule) on its surface and these are then screened in vitro for their ability to bind to a target ligand.
  • the phage provide a useful link between genotype and phenotype, as the phage which has a useful ligand on its surface will also have the nucleic acid inside it which codes for that peptide ligand.
  • the phage can be cultured, the nucleic acid amplified and then sequenced to determine the identity of the ligand exhibiting desirable properties.
  • Aptamers/SELEX Single or double stranded nucleic acid molecules that bind to other molecules are investigated, e.g. in order to find a small molecule—in this case a nucleic acid molecule, that binds to a target protein.
  • the short nucleic acid molecules with strong and specific binding capabilities are the “aptamers”. Again, these systems only allow in vitro screening of molecules.
  • the present invention provides a library of nucleic acid constructs which can express free peptides in an intra-cellular environment, said peptides having the sequence M-G/M/V—(X) n , wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine and each X, which may be the same or different, is any genetically encoded amino acid.
  • the above defined free peptides are optionally expressed as a fusion peptide with a cleavable signal sequence responsible for localization or translocation of the peptide.
  • signal sequences are well known in the art and will typically be no more than 30, preferably no more than 20 amino acids in length.
  • the term ‘free’ will be understood with this in mind and therefore constructs whose initial translation product incorporates a peptide of formula M-G/M/V—(X) n together with a cleavable signal sequence are still considered ‘constructs which can express free peptides’ of sequence M-G/M/V—(X) n .
  • the free peptides encoded by the libraries of the present invention can be contrasted with libraries of proteins which comprise peptides of interest fused to a large carrier protein, e.g. designed for surface display of the peptides of interest.
  • n in the above formula is from 3 to 18, preferably from 3 to 10 and typically from 3 to 5.
  • the first residue is methionine
  • the second residue is glycine, methionine or valine
  • the third and subsequent amino acids may be any of the genetically coded amino acids.
  • the universally accepted single letter code system for amino acids is used throughout.
  • MMACD SEQ ID NO.: 1
  • MMACE SEQ ID NO.: 2
  • MMACF SEQ ID NO.: 3
  • MMACG SEQ ID NO.: 4
  • MMADE SEQ ID NO.: 5
  • MMADF SEQ ID NO.: 6
  • MMADG SEQ ID NO.: 7
  • MGACD SEQ ID NO.: 8
  • MGACF SEQ ID NO.: 10
  • the second amino acid is glycine.
  • a library of constructs according to the invention may encode peptides of different lengths but typically a library will encode only peptides of 1-3 different lengths, often of only one length. Thus a library will typically have constructs which encode just pentamers or just hexamers etc.
  • the nucleic acid encoding these peptides are referred to herein as microgenes or forming a microgenex library.
  • the present invention provides a library of constructs which in turn is capable of generating a library of peptides which can be screened for their activity in a given intra-cellular environment.
  • the library of molecules which is actually investigated/screened is made up of peptides, the library as a whole providing a repertoire of different shapes.
  • the “shapes” will also have regions of positive and negative charge and various reactive groups which will impact on their in vivo activity.
  • the peptides are putative ligands for one or more target proteins and are conveniently referred to herein as a ‘ligand repertoire’ when either in peptide or corresponding nucleic acid form.
  • the present invention provides a (synthetic) library of peptides, each member of said library having the amino acid sequence M-G/M/V—(X) n , wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine and each X, which may be the same or different, is any genetically encoded amino acid.
  • the library is synthetic in that it has been generated through the use of recombinant DNA technology and is not naturally occurring.
  • the libraries of the present invention have at least 50, typically at least 100, preferably at least 500 members more preferably at least 2000.
  • the members of the library are usually screened at the same or substantially the same time, e.g. the library members are in contact with the host cells at the same time or at least screened as part of a series of coordinated multiplex assays.
  • the library may be screened all together or in batches but in that case at least 8, preferably at least 64 or 96 different library members would still be contacted with the host cell population at the same time.
  • the different nucleic acid constructs are recognisably part of a library as they will generally differ only in the region which encodes the peptides of formula M-G/M/V—(X) n defined above; thus the constructs, which are typically plasmid vectors, will contain the same flanking sequences i.e. promoter etc. on either side of the sequence encoding the peptide of interest.
  • nucleic acid construct refers to a nucleic acid molecule which contains a coding region for the peptides as defined herein operably linked to a promoter region.
  • the constructs are typically in the form of expression vectors, e.g. plasmid vectors and other expression vectors such as phagemid or lambda expression vectors can also be used.
  • the constructs are designed to allow expression in prokaryotic or eukaryotic host cells and thus a mammalian, bacterial or yeast etc. expression vector is selected as appropriate. Suitable expression vectors for use with different cell types are well known in the art.
  • the library of peptide molecules encoded by the library of constructs can be analysed for their ability to interact with a given target system or protein. If the target protein is mammalian it follows that the library will ideally be expressed in mammalian cells and the expression vector will be selected accordingly.
  • the present invention also provides a method of generating a library of nucleic acid constructs which can express free peptides in an intra-cellular environment, which method comprises synthesising a library of DNA molecules which include the nucleotide sequence ATGGGA(NNK) n , wherein n is an integer from 3 to 18, N is a nucleotide selected from A, C, T or G and K is a nucleotide selected from G or T and wherein each NNK triplet may be the same or different, and inserting a library of synthesised DNA fragments which each includes a nucleotide sequence of formula ATGGGA(NNK) n into expression vectors.
  • the invention provides libraries for use in any host cell for which a suitable promoter to drive expression is known. Modifications to nucleic acid libraries disclosed herein which would be necessary for expression in a different host cell system are within the competence of the skilled man, so a range of bacterial, eukaryotic or yeast compatible libraries may readily be developed.
  • the Shine-Dalgarno sequence which is required for expression in bacteria does not hinder expression in eukaryotic cells.
  • the construct library is capable of generating a repertoire of different molecules with different three dimensional configurations and different functional groups.
  • this same library of peptides can be used in many different screening methods; whether it be for allosteric inhibition of an enzyme, the ability to competitively inhibit the activity of an intra-cellular messenger at a receptor binding site, to control nucleic acid transcription or translation, or to restore the function of a mutant protein through interaction therewith which alters the mutant protein's three dimensional shape.
  • the libraries of the present invention may be used in an in vitro screening method to investigate the activities of the peptide ligands in normal cell conditions. Thus toxic molecules may be excluded at an early stage in testing.
  • the cells in which the library of peptide molecules are to be expressed can internalise the expression vectors into which the ligand repertoire has been constructed and there is thus no need either to synthesize peptides or purify peptide libraries.
  • the expression vectors are capable of autonomous replication to prevent the dilution of transfected ligand-encoding plasmids during cell division or the integration of the (e.g. plasmid) vector into the host cell chromosome, which causes difficulties in sequence analysis of the region of the nucleic acid within a cell which encodes the peptide ligand of interest.
  • Recombinant technology to generate expression vectors is well known and a suitable method for generating the library of constructs incorporating the desired sequence variation is described in the Examples.
  • the variation in the nucleic acid sequence which translates into the different peptide ligands is typically generated through the synthesis of a master primer which has regions of known sequence flanking the region which encodes the peptide of formula M-G/M/V—(X) n defined above (see FIG. 2 ).
  • the first amino acid is always methionine
  • the first three nucleotides will be ATG
  • the second residue is glycine
  • the next three nucleotides will be GGA and the remaining triplets follow the formula NNK, wherein N is A, C, T or G and K is G or T. This can allow for any nucleotide in any of the Y positions.
  • the primer is generated adding particular nucleotides in turn until the NNK regions are reached when all 4 nucleotides are added in equal molar amounts for the addition of 2 nucleotides to the growing strand and then G and T are added in equal molar amounts for the third position in the triplet. This cycle of additions is repeated until nucleic acid encoding for the peptides of the required length is generated.
  • the NNK motif of nucleotide addition means that all of the standard amino acids can be incorporated into the peptides in any combination, so the peptides can include all the range of reactive side groups and overall variation in size, shape, polarity, hydrophobicity etc. which are provided by the amino acids individually and by combinations of amino acids in juxtaposition.
  • the present invention provides a method of identifying an active peptide ligand of formula M-G/M/V—(X) n as defined above which comprises:
  • the present invention further provides a method of identifying a peptide ligand of formula M-G/M/V—(X) n as defined above having a desired activity on a target protein, which method comprises:
  • a positive response in said reporter system indicating that one or more peptides in that cell has the aforementioned desired activity.
  • the desired activity will typically be an ability to restore wild-type activity in the target protein, the host cells having a non-wild-type mutant form of the target protein.
  • the present invention provides a method of screening a library of peptides of formula M-G/M/V—(X) n wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine and each X, which may be the same or different, is any genetically encoded amino acid, which method comprises:
  • the reporter system is fully or partially an intra-cellular system.
  • microgenes encoding the library of peptides may be expressed constitutively or expression can be induced from a promoter which is operably linked to said microgene.
  • the constructs may be autonomously replicated or integrated in the host chromosome.
  • Suitable transformation and culturing techniques are well known in the art and some are discussed in detail in the Examples hereto.
  • Suitable reporter systems are discussed in detail below and will depend on the desired activity of the ligands.
  • the reporter system will conveniently involve up or down regulation of gene expression, either at the level of transcription or translation.
  • the gene whose level of expression is monitored may be native to the host cell but is typically an exogenous reporter gene.
  • the reporter system will incorporate a target protein whose activity can be altered through interaction with one or more members of the library of peptides defined and described above.
  • Binding or association of the peptide ligand to or with the target protein will, in some cases, result in a conformational charge in the target protein and where this results in a change in activity of the target protein this may be observed through the reporter system.
  • Suitable target proteins are discussed in more detail below and include enzymes and transcription factors.
  • the reporter system may be directly or indirectly related to the desired ligand function. For example, if the ability of a ligand to activate transcription or translation of a gene conferring antibiotic resistance is being investigated, culturing the host cells in the presence of that antibiotic would readily identify successful ligands as cells which expressed non-successful ligands would not grow. Given the size of the peptides, activation is unlikely to be through direct nucleic acid binding but through interaction with the protein machinery which performs and/or controls transcription/translation.
  • analysis of the host cell population may simply be by observation, e.g. of growing plaques or through detecting a particular colour, for example if the ⁇ -gal reporter system is used.
  • step d) the active peptide is typically identified by analysis of the nucleic acid encoding it.
  • the Examples provide a discussion of how the nucleic acid of the constructs from selected host cells may be sequenced. For a given host cell, the link between phenotype of the peptide ligand in terms of its impact on the reporter system and genotype of the microgene encoding said ligand allows convenient identification of ligands exhibiting the desired activity.
  • a library of molecular shapes, i.e. ligands, for in vivo screening has a wide variety of uses.
  • members of a library to encode peptides which can take part in protein therapy through protein function restoration.
  • Small molecules e.g. peptides, may be used which interact with the mutant protein to modify its three dimensional structure and thus its activity.
  • Proteins are known to change their conformation on binding or associating with a range of different classes of molecules, e.g. other proteins, nucleic acids, prosthetic groups such as heme, ions such as Ca 2+ etc.
  • Peptide ligands identified according to the above described method can be used therapeutically to alter the three-dimensional structure of a mutant protein and thus restore or alter the function or activity of that mutant protein. Typically the ligand can restore, in whole or in part, wild type protein function.
  • Peptide ligands identified according to the above screening method may be chemically modified before being used in therapy, in particular to enhance in vivo stability or by introduction of targeting or signal moieties such as the HIV Tat tag or folate, the receptor for which is over-expressed in many cancer cells. There are standard techniques which can be used to achieve this e.g. C terminal modifications such as amidation or N terminal modifications or the use of D- instead of L-amino acids.
  • the free peptides expressed by the library of nucleic acid constructs will preferably have only 3-8 amino acids, particularly preferably 3-6 amino acids. If a signal sequence is used, these preferred lengths refer to the peptides generated after cleavage of the signal sequence.
  • the present invention therefore provides for the use of a library of nucleic acid constructs as defined herein in a method of screening a library of molecules for the ability of members of that library to restore or modify the function of a target protein in an intra-cellular environment, which method comprises introducing the library into cells which have a reporter system which allows the identification of those cells in which the function of the target protein has been restored or modified.
  • the reporter system is compatible with, or more particularly includes, the target protein.
  • the reporter system preferably comprises a reporter gene which is operably linked to a sequence of nucleotides which provides a binding site for the target protein or for a protein which associates with or is a substrate for said target protein.
  • the screening methods of the invention give information about individual members of the library of molecules.
  • the method is performed in an intra-cellular environment (i.e. in vivo), and therefore all toxic compounds will be weeded out very early on. This reduces the cost of screening and of pre-clinical and even phase I clinical trials.
  • the method is able to indicate in a qualitative possibly also quantitative manner the performance of individual molecules in the test system.
  • a target protein and suitable host cells are selected and the method designed so that the reporter system is able to give information about the ability of each molecule to restore or modify the function of the target protein.
  • the reporter system is typically designed to measure restoration of a particular function of the target protein, for example the ability to act as a DNA binding transcription factor.
  • restoration of function refers to a return to wild-type function of the protein in at least one physiologically relevant and measurable respect.
  • the wild-type functions of the target protein in all respects may be restored but this is not essential for the successful working of the invention. It will be appreciated that partial restoration of the function of a protein may still be useful and it is not a requirement for a positive identification according to the claimed screening method to provide 100% of wild-type activity.
  • the sensitivity of the reporter system may conveniently be modified to provide a more or less stringent assay and thus to identify as giving “positive” results only those molecules which have achieved a significant increase in protein function.
  • molecules giving positive results according to the claimed screening methods will have brought about at least a 30%, more preferably at least a 50%, particularly preferably at least a 70% restoration in wild-type protein function, with respect to the particular function monitored by the intra-cellular reporter system.
  • a ligand may correct the mutant to a level of measured activity which is actually greater than wild type. It will be clear to the skilled reader that the art provides a number of ways of comparing the relative activities of different molecules. Some reporter systems e.g. those based on fluorescence may be quantitative and thus facilitate such a comparison between wild-type and corrected mutant. However an assessment relative to the wild-type protein may more conveniently be performed as part of a separate test utilising serial dilutions, measurement of plaque size etc.
  • Suitable target proteins for restoration of function will include those whose presence in mutated form is associated with a disease state.
  • the mutated version of the protein will typically have an altered 3-dimensional structure which affects its ability to interact with other molecules (ions, intra-cellular organelles and other cell components etc.).
  • the members of the molecular library will be screened for their ability to interact with the mutant target and thus alter 3-dimensional structure.
  • the 3-dimensional structure in vivo may be closer in one or more respects to the wild-type 3-dimensional structure as a result of successful interaction with a member of the library but protein function may be restored through a further compensating change in 3-dimensional structure which has a return to wild-type function as a ‘net’ -result.
  • Preferred target proteins will be those wherein point mutation(s) are the cause of disease.
  • transcription factors DNA binding proteins
  • p53 but others would be enzymes, peptide hormones or receptor molecules.
  • All diseases caused by monogenic Mendelian mutations could be targets for treatment with molecules identified according to the screening methods of the present invention. These include genetic (i.e. hereditary) diseases, cancer and symptoms of aging caused by mutations.
  • the host cells should provide a suitable model for determining whether the members of the library are capable of causing the desired change in target protein function with a view, typically, to therapeutic administration of successful members of the library or a derivative thereof.
  • the host cells will typically be derived from cancerous or other diseased cells and naturally produce the dysfunctional target protein.
  • Suitable cell lines are available, for example in cell culture collections such as the ATCC and may be derived from osteosarcoma, adenocarcinoma etc., or indeed any established cell line that expresses the mutant protein.
  • Cell lines that lack any expression of the protein of interest can also be used after introducing and expressing the gene encoding the mutated protein.
  • Standard transfection techniques may be used to introduce the nucleic acid constructs into the host cells in which expression of the free peptide ligands is to take place.
  • transformation and ‘transfection’ are used interchangeably herein.
  • the appropriate reporter system for a given screening method will depend upon the nature of the target protein under investigation.
  • the reporter system is chosen to be responsive to the target protein and thus be capable of indicating the functional status of the target protein.
  • the mammalian p53 protein illustrates the principle behind a suitable reporter system and is a particularly preferred target protein.
  • P53 functions in mammalian cells mainly through the ability to act as a transcription factor and is a transcriptional activator of several genes associated with cell cycle regulation such as p21, Bax, CD95(Fas/Apo-1) and 4-3-3 ⁇ (HME1). It can also down regulate the transcription of cell-cycle-regulating genes such as Bcl2, Cdc2 and cyclin.
  • P53 acts as a checkpoint, monitoring DNA damage and regulating cell cycle progression.
  • Loss of p53 activity predisposes cells to the acquisition of oncogenic mutations and may favour genetic instability ⁇ 90% of mutations reported in the p53 mutation database are found in the DNA binding domain. Mutations in p53 can lead to cancer formation where p53-mediated apoptosis is deficient due to the mutation.
  • P21 is one of the p53-transactivated genes that are critical in cell cycle control and many p53 mutants found in cancer cells lose the ability to transactivate p21 transcription.
  • the transactivation level of p21 promoter in a cell reflects whether the p53 protein expressed in that cell has wild type function.
  • a p53 reporter system was constructed by cloning human p21 promoter upstream of the puromycin resistance gene. The reporter construct can provide human cells with puromycin resistance only in the presence of wild type or wild-type functioning p53 that transactivates the p21 promoter. Such a reporter system offers significant practical benefits as a life-death selection system is possible where only cells with wild-type-functioning p53 can survive when grown in the presence of puromycin.
  • the host cells may have a reporter system as part of their normal genome which can be utilised, typically the host cells will have been modified to include a suitable gene based reporter system.
  • the genetic constructs which comprise all or part of the reporter system will thus have been introduced into the host cells, e.g. by standard transformation/transfection techniques, before the screening method is performed.
  • the reporter constructs could be introduced at the same time as the molecular library or even, but not preferably, after introduction of the library.
  • host cells will preferably have been co-transfected (although not necessarily simlutaneously) with a reporter construct and a construct which encodes a peptide whose ability to restore or modify the function of a target protein is to be investigated.
  • members of a chemical compound library may be contacted with the host cells which have been transfected with reporter constructs.
  • Target proteins are preferably DNA binding proteins which up or down regulate the expression of other genes through binding to promoter or enhancer regions.
  • Their native gene targets may be utilised to report on whether a target protein function has been restored or modified, but preferably their target DNA binding regions will be operably linked to reporter genes such as genes conferring antibiotic resistance, or which encode fluorescent proteins such as GFP (green fluorescent protein), or the much used bacterial reporter enzymes ⁇ -galactosidase ( ⁇ -Gal) or ⁇ -glucuronidase ( ⁇ -Gus) etc.
  • reporter genes such as genes conferring antibiotic resistance, or which encode fluorescent proteins such as GFP (green fluorescent protein), or the much used bacterial reporter enzymes ⁇ -galactosidase ( ⁇ -Gal) or ⁇ -glucuronidase ( ⁇ -Gus) etc.
  • the protein products of these reporter genes e.g. ⁇ -Gal or ⁇ -Gus
  • the cells expressing the reporter protein can readily be separated physically from other cells that do not express the reporter protein, e.g. by the use of antibodies.
  • Suitable signal peptides are described in the literature and include the signals from a protein such as PDGFR (platelet-derived growth factor receptor).
  • PDGFR platelet-derived growth factor receptor
  • Secretion signal peptide-reporter protein eg. ⁇ -Gal/ ⁇ -Gus
  • secretion signal peptide-reporter protein eg. ⁇ -Gal/ ⁇ -Gus
  • secretion signal peptide-reporter protein eg. ⁇ -Gal/ ⁇ -Gus
  • METDTLLLWVLLLWVPGSTGD SEQ. ID. NO.: 12
  • - ⁇ -Gal/ ⁇ -Gus- AVGQDTQEVIVVPHSL PFKVVVISAILALVV SEQ. ID. NO.: 13
  • LTIISLIILIMLWQKKPR -stop LTIISLIILIMLWQKKPR -stop.
  • Non-DNA binding proteins can also be used as target proteins provided their wild type activities can be monitored and assayed for or the reporter system engineered such that cell survival depends on restoration of the protein function.
  • Identification of cells in which the function of the target protein has been restored or modified(e.g. wild-type function has been restored) will depend on the reporter system used. As described above, this is particularly conveniently performed when cells in which wild-type function has not been restored do not survive in the selected culturing environment. Fluorescence detection can also be used to identify cells in which the function of the target protein has been restored or modified.
  • the target protein may, for example, be an enzyme rather than a nucleic acid binding protein and the ability of the target enzyme to act on a substrate may be used as the intra-cellular reporter system allowing identification of cells in which the function of the target protein has been restored or modified.
  • Other suitable reporter system such as the ⁇ -gal. blue/white system are well known in the art and can be used or adapted by the skilled man in his chosen screening method.
  • a library of nucleic acid constructs according to the present invention is conveniently referred to herein as a “Microgenex” library and the stretches of nucleic acid which encode the peptide ligands of interest as “Microgenes”.
  • FIG. 1 shows in schematic form the general strategy for construction of a library of nucleic acid constructs according to the present invention.
  • FIG. 3 illustrates the construction of a peptide library for expression in mammalian cells.
  • FIG. 4 illustrates the construction of a peptide library for expression in bacteria ( E. coli ).
  • FIG. 5 gives a map of the constructed Microgenex library of the Examples that can be used for expression in mammalian cells.
  • FIG. 6 gives a map of the constructed Microgenex library of the Examples that can be used for expression in E. coli .
  • Cm-R refers to chloramphenicol resistance gene.
  • p15A is replication origin compatible with several other replication origins used in molecular biology and biotechnology.
  • P BAD promoter is suppressed by araC but can be induced by L-arabinose.
  • Peptide library coding sequence was inserted under the P BAD promoter between KpnI and HindIII sites. SD sequence is a ribosome-binding site to help in the translation of the peptides in E. coli .
  • the first two amino acids at the N-terminal (“M” for Met, and “G” for Gly) can stabilize the peptides inside cells.
  • the two stop codons assure the termination of translation while the terminator (“term”) sequence will terminate the transcription driven by the P BAD promoter.
  • the DNA fragment of the peptide library was generated by PCR, using Master, Right and
  • FIG. 7 provides a schematic representation of the generated peptide library plasmid.
  • Each peptide in the library is composed of three random amino acids after two N-terminal amino acids Methioline (Met, or M) and Glycin (Gly, or G).
  • the next three X stands for any amino acid that forms a library consisting of randomly distributed three-amino-acid peptides.
  • the peptide-coding DNA-fragment was inserted into the mammalian expression vector pCEP4 between its KpnI and HindIII site under the CMV promoter.
  • the plasmid has OriP that enables the replication of the plasmid inside mammalian cells. Selection marker expression cassette in the plasmid provides the plasmid-transfected cells with hygromycin resistance.
  • Electrocompetent DH10BTM (Life Technologies, GibcoBRL) was used to transform the ligated Microgenex expression libraries.
  • PCR product was first treated with phenol-chloroform, chloroform and then precipitated with 3 volumes of cold ethanol. The DNA pellet was washed with cold 70% ethanol, dried and dissolved in appropriate volume of H 2 O. DNA was then restricted by KpnI and HindIII. Restricted DNA was gel-purified and subsequently ligated to the corresponding expression vector that has been also restricted with by KpnI and HindIII. Ligation mix was precipitated and resuspended in H 2 O and subsequently used to transform, by electroporation, of the Escherichia coli strain DH10BTM (Life Technologies, GibcoBRL). Plating was done on LB ampicillin plates (200 ⁇ g/ml) for pCEP4, or chloramphenicol (40 ⁇ g/ml) for pBAD33.
  • Colonies from all the plates were pooled together, washed with LB with ampicillin (200 ⁇ g/ml) if the cloning vector contains beta-lactamase gene, such in the case of pCEP4. This step is important to get rid of cells that lack the plasmid. Pooled cell mixture was divided into small aliquots and stored at ⁇ 70 ⁇ C. Each aliquot contained millions of cells to ensure complexity of the Microgenex library.
  • One frozen portion can be used to inoculate a one-liter culture of LB containing the appropriate antibiotic.
  • Standard plasmid DNA isolation protocols/kits are used to isolate the DNA suitable to transform, for example mammalian cells for ligand screening.
  • Master primer was used as a template for all constructed peptide expression libraries (Microgenex) whether the expression in mammalian cells or Escherichia coli . It could also be used for yeast or insect cell expression. Left and Right oligoes were used as forward and reverse primers, in order to generate double-stranded DNA fragments encoding repertoire of peptide library by PCR. The first ATG in this fragment encodes Met as the translation initiation site of the peptides. Gly, the second amino acid encoding by the DNA fragment, assures the peptides expressed to be stable inside cells according to the N-end rule.
  • NNK code N for A, C, G, and T in equal molar ratio, K for G and T in equal molar ratio
  • K for G and T in equal molar ratio
  • the peptide-coding DNA-fragment was inserted into the mammalian expression vector pCEP4 between its KpnI and HindIII site under the CMV promoter.
  • the plasmid has the replication origin, OriP, which enables the replication of the plasmid inside mammalian cells.
  • Selection marker expression cassette in the plasmid provides the plasmid-transfected cells with hygromycin resistance.
  • Colonies in the range of 25000-75,000 on the plates were commonly obtained ensuring complexity/diversity of the cloned Microgenex repertoire. They were collected into 10-12 portions.
  • FIG. 2 Plasmids used for the construction of microgenex libraries Plamid Name Promoter Insertion Selection marker Replication origin Reference.
  • pCEP4 CMV No E. coli : Amp, E. coli : ColE, Vector from Mammalian: Mammalian: OriP Invitrogen Mammalian Microgenex library with Hygromycin B This work, FIG. 2 .3. microgenex amino acid sequence as: library M-G-X—X—X. pBAD33 pBAD No Chloramphenicol p15A Vector from Beckwith; Guzman, et al., 1995 E. coli Microgenex library with This work. microgenex amino acid sequence as: library M-G-X—X—X.
  • Plasmid prepared from one of such frozen aliquots was examined by PCR.
  • mammalian Microgenex library DNA was used as a template in PCR reactions using pCEP4 forward and reverse primers showed that there were insertions with expected size.
  • PCR results from the 6 randomly picked colonies from the original mammalian Microgenex library plates showed that 4 of the colonies had the right construction (showed correct size of insertion) while two did not.
  • mutant p53 in cancer cells can result from not only the inability in binding to its specific recognized consensus DNA sequence, but also the mal-localization inside cells or incompetent to interact correctly with other transcription co-factors.
  • the sub-cellular localization mechanism and transcription machinery in mammalian cells is different from that in prokaryotic organisms.
  • the loss of specific DNA binding ability may engage the post-translational modification processes that are totally different from that in prokaryotic organisms.
  • a p53 reporter for mammalian cells has more advantages than that for prokaryotic cells.
  • P53 protein functions in mammalian cells mainly through its transcriptional factor activity.
  • P21 is one of the p53-transactivated genes that are critical in cell cycle control. P53 can induce cell cycle arrest in G1 phase via transactivation of p21 gene once the cells are under stresses such as DNA damaging. Lots of p53 mutants found in cancer cells lose the ability to transactivate p21 transcription. This leads to not only the vast development of cancer, but also the resistance to cancer chemotherapy and radiotherapy that aim to damage DNA in order to kill cancer cells. Thus the transactivation level of p21 promoter in a cell reflects whether the p53 protein expressed in that cell has wild type function or not.
  • the reporter can provide human cells puromycin resistance only in the presence of wild type or wild-type-functioning p53 protein that transactivates p21 promoter, while leaving the cells sensitive to puromycin with dysfunctional mutated p53 protein.
  • the advantage to use puromycin resistant gene is to set up a life-death selection: only the cells with wild-type-functioning p53 protein can survive from puromycin selection.
  • Plasmid and vectors used in this Example are WWP-Luc (gift from Prof. Stanbridge E. J.), pPUR (Clontech), and pUC18 (Pharmacia). Enzymes used for cloning are EcoRI (Promega), PvuII (Promega), SalI (Promega), SphI (Promega), T4 DNA ligase (New England BioLab), Klenow (Promega), and SAP (Shrimp Alkaline Phosphotase).
  • U-2 OS cell ATCC Number: HTB-96, Homo sapiens (human), osteosarcoma, bone, wild type p53
  • Saos-2 cell ATCC Number: HTB-85, Homo sapiens (human), osteosarcoma, bone, p53 null
  • SW480.7 ATCC number: CCL 228, Homo sapiens (human), colorectal adenocarcinoma, colon, p53 mutant: R273H/P309S).
  • Medium and other chemicals include McCoy's medium, DME medium, MEM, FCS, calcium phosphate transfection reagents (0.1 ⁇ TE (pH8.0), 2 ⁇ HBS (pH7.4), 2M CaCl 2 , 15% glycerol in HBS), PBS, and puromycin (Sigma).
  • WWP-Luc plasmid was restricted by EcoRI and SalI.
  • the 2366 bp fragment was gel-purified (1% agarose in 1 ⁇ TBE) by freeze-thaw method.
  • Vector pUC18 was restricted by EcoRI and SalI, dephosphorylized by SAP, and purified by phenol-chloroform. Then the p21 promoter fragment was subcloned into pUC18 vector by T4 DNA ligase, generating a plasmid named as p21uc. All drug resistant markers in the constructed plasmids, are listed in Table 1. Genes involved in the resistance are also listed. TABLE 1 Drug resistant markers in constructed plasmids and their genes.
  • Neomycin Puromycin Puromycin-N-acetyl-transferase (pac)
  • Neomycin Neomycin phosphotransferase gene Hygromycin B Hygromycin B phosphotransferase Histidinol Histidinol dehydrogenase (hisD)
  • P21 promoter was cut out from p21 uc first by EcoRI, blunted with Klenow and dNTP, and then by SphI.
  • the 2450 bp-p21-promoter fragment was gel-purified using the same method as above.
  • Vector pPUR was restricted fully by PvuII and SphI, dephosphorylized by SAP, and gel-purified by the same method, which removed SV40 early promoter upstream of puromycin resistant gene.
  • the p21 promoter was cloned into pPUR vector upstream of puromycin resistant gene by T4 DNA ligase, generating the p53 reporter named as p21 ur.
  • Verification of the construction was carried out by restriction of the generated plasmid with HindIII.
  • the original vector pPUR has only one HindIII site thus gives one band of 4257 bp after gel electrophoresis. This HindIII site was removed while constructing the reporter p21ur. Two new HindIII sites were introduced into p21ur with the p21 promoter thus can give two bands, one at 2.4 kb and the other at 3.9 kb.
  • Validation of the responsiveness of p21 ur to p53 was carried out by transfection of p21ur to osteosarcoma cell line U-2 OS that expresses wild type p53 protein, Saos-2 that is p53 null, and colon cell line SW480.7 that expresses R273H and P309S double mutated p53 protein.
  • U-2 OS that expresses wild type p53 protein
  • Saos-2 that is p53 null
  • SW480.7 colon cell line SW480.7 that expresses R273H and P309S double mutated p53 protein.
  • 3 ⁇ 10 5 U-2 OS, 1.5 ⁇ 10 5 Saos-2, and 3 ⁇ 10 5 SW480.7 cells were seeded to each well of a 6-well plate.
  • Transfection was carried out by the calcium phosphate method, with 25 ⁇ g of p21ur for each well.
  • a 2-minute glycerol-shock was applied 3 hours after DNA was added to the cells and the transfected cells were incubated in MEM/10% FCS at 37 ⁇ C with 5% CO 2 overnight.
  • the transfected cells were transferred to new 6-well plates at the concentration of 5 ⁇ 10 4 cells per well on the next day, and incubated in growth medium (McCoy's medium/10% FCS for U-2 OS and DME medium/10% FCS for Saos-2 and SW480.7) with puromycin.
  • the same puromycin treatment was also given to the non-transfected cells as a control. Refreshed the medium with puromycin every three-day and observed growth status of the cells.
  • the cloned vector was restricted by HindIII, which proved that there is the p21 promoter, a ⁇ 2.4 kb fragment, inserted upstream of the puromycin resistant gene. DNA electrophoresis showed that the restriction gave two bonds with correct theoretical sizes at 2.4 and 3.9 Kb.
  • Transfection of the reporter p21 ur into U-2 OS cell line provided puromycin resistance to the transfected cells (Table 2).
  • Cells transfected with p21ur kept alive after 5 days of puromycin treatment at 0.5 ⁇ g/ml, while the untransfected cells died out. The transfected cells could not survive from higher concentration of puromycin because of the low expression level of p53 protein.
  • microgenex we have constructed a microgene library to provide a series of molecular shapes for testing. This ‘microgenex’ approach enabled us to screen among the expressed peptide repretoire and identify some that could correct mutated p53 and could restore all of its downstream activity.
  • Selection of the functional peptides can be carried out either in vivo or in vitro according to the peptide library.
  • the advantage of in vivo selection over in vitro selection is that it allows the peptides to carry out their function in an environment similar or even exactly the same as where they will later be applied in therapy. This makes the peptides more applicable and simplifies the modification step for the selected peptides. Besides, it can often detect weak and transient interactions.
  • peptide library allows in vivo screening for possible peptide ligands that can adjust mutant p53 protein back to its wild type function.
  • the peptides were constructed into an expression vector, they were easily internalized by cells. With the help of positive-selection reporters, the possible ligands were also readily identified by the recovery of ligand-encoding plasmids from surviving cells. Moreover, according to this technique, there is no need to either synthesize peptides or purify peptides, which can be expensive and labour intensive.
  • An un-integrated, autonomously replicated mammalian expression vector was chosen to prevent the dilution of transfected ligand-encoding plasmids during cell division, or the integration of the plasmids into the host chromosomes, which causes difficulties in rescuing ligand from surviving cells.
  • pCEP4 (Invitrogen): TTGACG CAAA T GGGCGGTAG GCGTGTACGG TGGGAGGTC T ATATAA GC AG AGCTCGTTTA CAAT TATA pCEP Forward GTGAACCG TC AGATCTCTAG AAGCTG GGTA CC AGCTGCTA GC AAGCTT GC TAGCGGCCGC primer KpnI HindIII TCGAGGCCGG CAAGGCCGGA TCCAGACATG (SEQ. ID.
  • HindIII pCEP Forward primer 5′-AGAGCTCGTTTAGTGAACCG-3′ (SEQ. ID. NO.: 25)
  • EBV Reverse primer 5′-GTGGTTTGTCCAAACTCATC-3′ (SEQ. ID. NO.: 26) KpnI, HindIII, T4 DNA ligase, 10x T4 DNA ligase buffer
  • Oligo REGP-10 was used as a template, while oligos REGP-11 and REGP-12 were used as forward and reverse primers, to generate double-stranded DNA fragments encoding a degenerated peptide library by PCR.
  • the first ATG in this fragment encodes Met as the translation initiation site of the peptides.
  • Gly the second amino acid encoding by the DNA fragment, assures the peptides expressed are stable inside cells according to the N-end rule.
  • the following three repeats of NNK (N for A, C, G, and T in equal molar ratio, K for G and T in equal molar ratio) code for all possible amino acids and thus form the peptide library.
  • Two stop codens, TGA and TAA, are added right after the last NNK to stop peptide translation.
  • the 5′-end KpnI site and 3′-end HindIII allows the fragments to be cloned into pCEP4 vector in the correct direction under CMV promoter.
  • SW480.7 is a human colon cancer cell line that carries R273H and P309S double mutations in its p53 gene.
  • R273H is one of the most frequently occurred p53 mutations that have been observed in various cancer tissues. It has been reported that the p53 mutant in SW480.7 loses its function in transactivating its downstream genes such as p21, Bax, 14-3-3 ⁇ , and CD95.
  • Our purpose of screening the peptide library constructed above is to find out one or more peptide ligands that can restore wild-type p53 function without alteration of the mutant p53 gene in the cell and disturbing cells with wild type p53.
  • the choice of a human cancer cell line provides the real human environment in which the selected peptides can be modified, degraded, and transported to subcellular organelles, and thus assures the reliability of the selected peptides to function in human body.
  • Human colon cancer cell line SW480.7 with p53 mutations R273H and P309S ATCC Number: CCL-228, Homo sapiens (human), adenocarcinoma, colorectal; colon), DMEM medium, FCS, PBS, trypsin, puromycin (sigma), hygromycin (sigma), Peptide library (constructed by our laboratory described as above), pCEP4 vector (Invitrogen), calcium phosphate transfection reagents (0.1 ⁇ TE (pH8.0), 2 ⁇ HBS (pH7.4), 2M CaCl 2 , 15% glycerol in HBS).
  • DNA extraction reagents TSS buffer (0.8% NaCl, 0.02% KCl, 0.3% Tris, pH7.4), TE buffer (pH8.0), DNA extraction buffer (10 mM TrisCl, 100 mM EDTA, 20 ⁇ g/ml RNase, 0.5% SDS, pH8.0), Proteinase K (18 mg/ml), chloroform, phenol (Tris saturated), ethanol, 7.5M ammonium acetate (pH 7.4)), XL1-blue electroporation competent cell, pCEP Forward primer, EBV Reverse primer, dye-terminator thermocyclic DNA sequencing kit, ALF express DNA automatic sequencing system.
  • the peptide library was transfected into SW480.7 using the calcium phosphate method described in the reporter constructing part. Meanwhile pCEP4 was also transfected into the SW480.7 cells as a negative control. On the second day after transfection, puromycin was added into the medium at the concentration of 0.5 ⁇ g/ml. Then the transfected cells were washed with PBS and changed with fresh medium with 0.5 ⁇ g/ml puromycin every three days. After all the negative control cells died, all the library-transfected cells that still attached to the wells were collected by scraping into ice-cold TBS buffer.
  • the DNA and amino acid sequences are: DNA: 5′- GGTACC AAGAAGGAGTTT ACATATG GGA TGG TGT ACT TGA TAA KpnI Amino acid: M G W C T (SEQ. ID. NO.: 28) stop stop DNA: GGATCC AAGCTT -3′ (SEQ. ID. NO.: 27) HindIII D—Validation of the Selected Peptide Ligand
  • the purpose of this experiment is to confirm that the selected peptide ligand indeed restores the mutated p53 protein to the wild-type p53 function in SW480.7 cells.
  • the selection of peptide ligands was based on puromycin resistance. There might be a chance that the cells got the resistance with, or even without the help of the selected ligand via mechanisms other than p53-transactivated p21 promoter. Thus we should make sure that the mutant p53 in SW480.7 cells functions as wild-type p53 only if the selected ligand is added. Confirmation/validation was achieved in two different ways:
  • P53 wild-type regulates different cell responses via transactivating different downstream genes.
  • p53 causes cell cycle arrest at G1 phase by transactivation of p21 gene.
  • p53 causes cell arrest at G2 phase by transactivation of CD95 gene.
  • Apoptosis is due to p53 transactivation of 14-3-3 ⁇ and/or Bax gene.
  • p21, CD95, and 14-3-3 ⁇ gene transcription levels by RT PCR, to reveal the capability of the selected peptide ligand in restoring the wild-type p53 transactivation of the downstream genes.
  • the expression level of ⁇ -actin was measured as a control.
  • ⁇ -Actin transcription level was used to normalize total mRNA amount between samples for comparison owing to its constitutive expression in all cells.
  • RT-PCR primer sequences Name Sequence RT-WAF1- 5′-CTACCTCAGGCAGCTCAAGC-3′ FOR (SEQ. ID. NO.: 29) RT-HME1- 5′-AGACAGCACCCTCATCATGC-3′ FOR (SEQ. ID. NO.: 30) RT-CD95- 5′-TGGTGCTCATCTTAATGGCC-3′ FOR (SEQ. ID. NO.: 31) RT-ACTIN- 5′-TGACAAAACCTAACTTGCGC-3′ FOR (SEQ. ID. NO.: 32) CDS 5′-AAGCAGTGGTAACAACGCAGAGTACT (30) N ⁇ 1 N-3′ Primer (SEQ. ID. NO.: 33) PCR 5′-AAGCAGTGGTAACAACGCAGAGT-3′ Primer (SEQ. ID. NO.: 34)
  • Target genes and their corresponding RT-PCR product using the above primers are listed in Table 4. TABLE 4 RT-PCR primer target genes and their corresponding product. Amplify Size of Target gene region in RT-PCR Primer name Name Function target mRNA product RTWAF1FOR p21 G1 arrest 1514-2121 607 bp RTHME1F0R 14-3-3 ⁇ G2 arrest 652-1245 593 bp RTCD95FOR CD95 Apoptosis 1968-2534 566 bp RTACTINFOR ⁇ -actin cytoskeleton 1236-1793 557 bp
  • Cells SW480.7, U-2OS, and Sa-os2 were transfected with peptide ligand and pCEP4 parallelly using the calcium phosphate method.
  • the transfection plan was shown in Table 5: TABLE 5 Transfection plan for the verification of peptide ligand function.
  • RNA samples were harvested as SW480.7/Ligand-0 day, SW480.7/Ligand-1 day, SW480.7/Ligand-2 day, SW480.7/pCEP4-0 day, SW480.7/pCEP4-1 day, SW480.7/pCEP4-2 day.
  • Protocol for total RNA extraction by TRIZOL® Reagent was according to that given by Life Technology:
  • RNA Transfer the aqueous phase to a fresh tube.
  • Precipitate the RNA by adding 500 ⁇ l of isopropyl alcohol per 1 ml of TRIZOL Reagent used for the initial homogenization. Incubate samples at room temperature for 10 minutes and centrifuge at no more than 12,000 ⁇ g for 10 minutes at 2 to 8 C.
  • RNA pellet once with 75% ethanol, adding at least 1 ml of 75% ethanol per 1 ml of TRIZOL Reagent used for the initial homogenization. Mix the sample by vortexing and centrifuge at no more than 7,500 ⁇ g for 5 minutes at 2 to 8 C.
  • RNA pellet Dissolve RNA in DEPC water by passing the solution several times through a pipette tip, and incubation for 10 minutes at 55 to 60 C. Determine the concentration by measure OD 260 . Store at ⁇ 70 C.
  • RNA sample For each RNA sample, combine the following reagents in a sterile 0.2-ml reaction tube: 1-3 ⁇ l RNA sample (using same amount of RNA among samples) 1 ⁇ l CDS primer (10 ⁇ M) 1 ⁇ l RT forward primer (10 ⁇ M) x ⁇ l Deionized H 2 O 5 ⁇ l Total volume
  • Samples can be stored at ⁇ 20 C for up to three months.
  • PCR was carried out using the following cycling parameters: 95 C. 15 min 2 cycles 94 C. 30 sec 70 C. 30 sec 72 C. 1 min 2 cycles 94 C. 30 sec 69 C. 30 sec 72 C. 1 min 2 cycles 94 C. 30 sec 67 C. 30 sec 72 C. 1 min 2 cycles 94 C. 30 sec 65 C. 30 sec 72 C. 1 min 2 cycles 94 C. 30 sec 63 C. 30 sec 72 C. 1 min 2 cycles 94 C. 30 sec 61 C. 30 sec 72 C. 1 min 2 cycles 94 C. 30 sec 59 C. 30 sec 72 C. 1 min 2 cycles 94 C. 30 sec 57 C. 30 sec 72 C. 1 min 40 cycles 94 C. 30 sec 55 C. 30 sec 72 C. 1 min 1 cycle 72 C. 7 min 4 C. store
  • RT-PCR analysis of the mRNA from ligand transfected SW480.7 cells showed that this ligand can reestablish wild type p53 function.
  • Higher mRNA levels of p21 (WAF1/Cip1), 14-3-3 and CD95 genes were found compared with that in the empty pCEP4 transfected SW480.7 cells. This proved that the apoptosis and cell growth arrest of ligand transfected cells were owing to the ligand that corrected R273H/P309S p53 mutant to wild type function.
  • the previous p53 reporter p21ur is based on the transactivation activity of wild type p53 to p21 promoter. It has been proved that up-regulation of p21 protein expression will result in cell growth arrest, but not apoptosis.
  • Another p53 target, Bax (a cell death gene), can lead to apoptosis once its transcription and expression is upregulated. Bax is one of the key mediators for p53 to induce cell apoptosis—to kill the cells that have failed to repair their damaged genomes.
  • Bax promoter is different from p21 promoter, though both have p53 consensus binding sequence. It has been reported that some p53 mutant can transactivate p21 promoter, but not Bax. This indicates a more stringent requirement of p53 conformation for the transactivation of Bax promoter.
  • Green fluorescence protein has also been used in the new reporter system.
  • Bax promoter drives a long transcript of both puromycin resistant gene and GFP gene.
  • IRES Interleukin-1 receptor 1
  • GFP Green fluorescence protein
  • the two genes that introduces a second ribosome-binding site they can be translated into protein simultaneously from one transcript (Clontech pIRESneo manual).
  • the green color from GFP will visualize the Bax promoter activity, hence the ligand's activity to correct p53 mutants, in addition to the positive selection of puromycin resistance.
  • the green fluorescence will eliminate the background caused by mechanisms which have cells developed to survive from puromycin selection other than transactivation of Bax promoter.
  • Selection marker is important for integration of the plasmid into genome.
  • a stable p53-null cell line with integrated reporter system will be convenient for ligand screening, especially for non-peptide ligands.
  • Plasmids used in this Example are: p21ur (above), pIRESneo (Clontech), pEGFP-C1 (Clontech), and pREP8 (Invitrogen).
  • Genomic DNA human, male, normal
  • Restriction endonucleases Provided by EcoRV, HindIII, NruI, PvuI, SacI, SalI, SpeI, XbaI, XhoI, XmaI.
  • Bax promoter primer Bax promoter Forward: 5′-atct aagctt gaggcttcagcccgggaat (SEQ. ID. NO.: 35) tccag-3′ HindIII Bax promoter Reverse: 5′-atct accggt gccagcagtggcgccgtcc (SEQ. ID. NO.: 36) aacag-3′
  • the final product is denoted as pBaxur.
  • the structure of p21 urEGFP was verified by restriction of HindIII, which should give five bands of 3489 bp, 2339 bp, 1041 bp, 360 bp, and 348 bp. Agarose gel electrophoresis of the restricted plasmid showed the expected bands.
  • the structure of pBaxur was verified by restriction of either HindIII together with SacI, which should give two bands of 3996 bp, and 388 bp, or SmaI alone, which should give three bands of 3184 bp, 620 bp, and 580 bp. Agarose gel electrophoresis of the restricted plasmid showed the expected bands.
  • pBaxurEGFP The structure of pBaxurEGFP was verified by restriction of either EcoRI together with XbaI, which should give two bands of 2907 bp, and 2766 bp, or SacI alone, which should give two bands of 3125 bp, and 2548 bp. Agarose gel electrophoresis of the restricted plasmid showed the expected bands.
  • pBaxurEGFP-hisD was verified by restriction of either EcoRI, which should give two bands of 5955 bp, and 3211 bp, or SacI, which should give two bands of 6618 bp, and 2548 bp. Agarose gel electrophoresis of the restricted plasmid showed the expected bands.
  • Screening for new peptide ligands is done by first transfecting the outlined new reporter p21 urEGFP-hisD along with the cotransfection of the peptide library constructed above into cancer cell lines such as SW480.7.
  • Caspase inhibitor VAD-fmk (ApoAlert®, Clontech) is added to the cell culture at the final concentration of 40 mM in order to prevent apoptisis in the case of the overexpression of BAX gene.
  • 0.5 g/ml puromycin is added in the media one-day after transfection. After several days, surviving green cells (excited at 488 nm wavelength) will be the cells containing the candidate ligands.
  • Rescued plasmids from these cells are subjected to DNA sequencing to deduce the amino acid sequences of the encoded peptide ligands.
  • a stable cell line of the reporter p21urEGFP-hisD is constructed.
  • Members of the chemical compound library are added to the cell culture.
  • the cells should also be cultured in the medium containing 40 mM VAD-fmk and 0.5 g/ml puromycin. Surviving green cells after excitation at 488 nm is the indicator for successful candidate ligands.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Biochemistry (AREA)
  • Molecular Biology (AREA)
  • Zoology (AREA)
  • General Health & Medical Sciences (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Medicinal Chemistry (AREA)
  • Plant Pathology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Microbiology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Immunology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Peptides Or Proteins (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Photoreceptors In Electrophotography (AREA)
  • Polysaccharides And Polysaccharide Derivatives (AREA)
  • Bipolar Transistors (AREA)

Abstract

The present invention provides a method of screening a library of peptides of formula M-G/M/V—(X)n wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine and each X, which may be the same or different, is any genetically encoded amino acid, which method comprises:
  • a) transforming a host cell population with a library of nucleic acid constructs which can express free peptides of formula M-G/M/V—(X)n;
  • b) culturing the transformed host cells under conditions suitable for intra-cellular expression of the peptides of formula M-G/M/V—(X)n; and
  • c) analysing the host cell population to determine the effect of the peptides of formula M-G/M/V—(X)n on a reporter system. Also describe are a library of nucleic acid constructs which can express free peptides in an intra-cellular environment, said peptides having the sequence M-G/M/V—(X)n, wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine and each X, which may be the same or different, is any genetically encoded amino acid; and a method of generating a library of nucleic acid constructs which can express free peptides in an intra-cellular environment, which method comprises synthesising a library of DNA molecules which include the nucleotide sequence ATGGGA(NNK)n, wherein n is an integer from 3 to 18, N is A, C, T or G and K is G or T and wherein each NNK triplet may be the same or different, and inserting a library of synthesised DNA fragments which each includes a nucleotide sequence of formula ATGGGA(NNK)n into expression vectors.

Description

  • The present invention relates to molecular libraries, in particular to molecular libraries which can be screened to identify members of the library with certain characteristics.
  • The complex interactions and pathways between biomolecules in living organisms are the focus for much research, particularly as scientists strive to develop new candidate drugs for the diagnosis, prevention or treatment of disease. Of particular interest are proteins which have functionally active surfaces and pockets which are characterised by the three dimensional shape of the protein molecule and particular reactive amino acid side groups. These proteins are able to interact, associate or bind with a variety of other biomolecules, including other proteins, nucleic acids, organic molecules, ions etc.
  • There is a great desire to identify molecules which can undergo specific interactions with target molecules, e.g. proteins, such molecules may be able to act as agonists or antagonists of enzyme or receptor activity for example, and more recently it has been proposed that the three dimensional shape of target proteins may be altered through interactions with small peptides (Nature Biotechnology letter El-Gewely 1999, Vol 17, p 210). In this way the wild-type function of a mutant protein can be restored.
  • A variety of molecular libraries which can be screened to identify molecules with desired properties are known in the art. These may comprise organic chemicals synthesised by traditional methods or chemicals produced by combinational chemistry. It is expensive to generate very large libraries of such organic chemicals even using combinatorial techniques and so interest has turned to biological systems.
  • The best known of these is phage display where peptides are expressed on the surface of a filamentous phage as a fusion protein with a coat protein. Ideally each phage will carry only one such peptide (although it will generally have several copies of that molecule) on its surface and these are then screened in vitro for their ability to bind to a target ligand. The phage provide a useful link between genotype and phenotype, as the phage which has a useful ligand on its surface will also have the nucleic acid inside it which codes for that peptide ligand. The phage can be cultured, the nucleic acid amplified and then sequenced to determine the identity of the ligand exhibiting desirable properties. However, only ligands in the form of fusion proteins with coat proteins can be investigated according to this method and it is only possible to perform a rather limited in vitro binding assay. Much more useful information could be generated if it was possible to analyse the performance of the members of the molecular library in vivo, in an environment similar to that which would be found inside the cells where the molecule is intended to act.
  • Another biologically based system which provides a molecular library is known as Aptamers/SELEX (Ellington and Conrad 1995, Gold et al. 1997 etc. SELEX stands for Systematic Evolution of Ligands by Exponential Enrichment. According to this technique, single or double stranded nucleic acid molecules that bind to other molecules are investigated, e.g. in order to find a small molecule—in this case a nucleic acid molecule, that binds to a target protein. The short nucleic acid molecules with strong and specific binding capabilities are the “aptamers”. Again, these systems only allow in vitro screening of molecules.
  • Thus, there remains a need for a relatively cheap molecular library whose members can be analysed in vivo without the need to generate fusion proteins which as a whole may not reflect the potential binding activity of the individual library molecules themselves. These problems have been addressed by the present invention, which according to one aspect provides a library of nucleic acid constructs which can express free peptides in an intra-cellular environment, said peptides having the sequence M-G/M/V—(X)n, wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine and each X, which may be the same or different, is any genetically encoded amino acid.
  • The above defined free peptides are optionally expressed as a fusion peptide with a cleavable signal sequence responsible for localization or translocation of the peptide. Such signal sequences are well known in the art and will typically be no more than 30, preferably no more than 20 amino acids in length. The term ‘free’ will be understood with this in mind and therefore constructs whose initial translation product incorporates a peptide of formula M-G/M/V—(X)n together with a cleavable signal sequence are still considered ‘constructs which can express free peptides’ of sequence M-G/M/V—(X)n. The free peptides encoded by the libraries of the present invention can be contrasted with libraries of proteins which comprise peptides of interest fused to a large carrier protein, e.g. designed for surface display of the peptides of interest.
  • The integer represented by the letter n in the above formula is from 3 to 18, preferably from 3 to 10 and typically from 3 to 5.
  • Thus in each of the expressed peptides the first residue is methionine, the second residue is glycine, methionine or valine and the third and subsequent amino acids may be any of the genetically coded amino acids. The universally accepted single letter code system for amino acids is used throughout.
  • It is well known that there are 20 genetically coded amino acids and any of these may be present in the encoded peptides. There will be between 3 and 18 X residues which may be the same or different and it is envisaged that for a given library, once the desired length of the peptides has been set, representatives of most if not all available combinations of residues will be present. Thus for example, using the single letter code for amino acids, where the desired peptides are pentamers, the library would encode most or all of the following peptides, which are themselves only listed by way of example:
    MMACD (SEQ ID NO.: 1)
    MMACE (SEQ ID NO.: 2)
    MMACF (SEQ ID NO.: 3)
    MMACG (SEQ ID NO.: 4)
    MMADE (SEQ ID NO.: 5)
    MMADF (SEQ ID NO.: 6)
    MMADG (SEQ ID NO.: 7)
    MGACD (SEQ ID NO.: 8)
    MGACE (SEQ ID NO.: 9)
    MGACF (SEQ ID NO.: 10)
    MGACG (SEQ ID NO.: 11)
  • etc. Preferably the second amino acid is glycine.
  • A library of constructs according to the invention may encode peptides of different lengths but typically a library will encode only peptides of 1-3 different lengths, often of only one length. Thus a library will typically have constructs which encode just pentamers or just hexamers etc. The nucleic acid encoding these peptides are referred to herein as microgenes or forming a microgenex library.
  • The present invention provides a library of constructs which in turn is capable of generating a library of peptides which can be screened for their activity in a given intra-cellular environment. Thus, according to the present invention, the library of molecules which is actually investigated/screened is made up of peptides, the library as a whole providing a repertoire of different shapes. The “shapes” will also have regions of positive and negative charge and various reactive groups which will impact on their in vivo activity. The peptides are putative ligands for one or more target proteins and are conveniently referred to herein as a ‘ligand repertoire’ when either in peptide or corresponding nucleic acid form.
  • In a further aspect the present invention provides a (synthetic) library of peptides, each member of said library having the amino acid sequence M-G/M/V—(X)n, wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine and each X, which may be the same or different, is any genetically encoded amino acid. The library is synthetic in that it has been generated through the use of recombinant DNA technology and is not naturally occurring.
  • The libraries of the present invention have at least 50, typically at least 100, preferably at least 500 members more preferably at least 2000. The members of the library are usually screened at the same or substantially the same time, e.g. the library members are in contact with the host cells at the same time or at least screened as part of a series of coordinated multiplex assays. The library may be screened all together or in batches but in that case at least 8, preferably at least 64 or 96 different library members would still be contacted with the host cell population at the same time. The different nucleic acid constructs are recognisably part of a library as they will generally differ only in the region which encodes the peptides of formula M-G/M/V—(X)n defined above; thus the constructs, which are typically plasmid vectors, will contain the same flanking sequences i.e. promoter etc. on either side of the sequence encoding the peptide of interest.
  • The term “nucleic acid construct” refers to a nucleic acid molecule which contains a coding region for the peptides as defined herein operably linked to a promoter region. The constructs are typically in the form of expression vectors, e.g. plasmid vectors and other expression vectors such as phagemid or lambda expression vectors can also be used. The constructs are designed to allow expression in prokaryotic or eukaryotic host cells and thus a mammalian, bacterial or yeast etc. expression vector is selected as appropriate. Suitable expression vectors for use with different cell types are well known in the art. The library of peptide molecules encoded by the library of constructs can be analysed for their ability to interact with a given target system or protein. If the target protein is mammalian it follows that the library will ideally be expressed in mammalian cells and the expression vector will be selected accordingly.
  • According to a further aspect, the present invention also provides a method of generating a library of nucleic acid constructs which can express free peptides in an intra-cellular environment, which method comprises synthesising a library of DNA molecules which include the nucleotide sequence ATGGGA(NNK)n, wherein n is an integer from 3 to 18, N is a nucleotide selected from A, C, T or G and K is a nucleotide selected from G or T and wherein each NNK triplet may be the same or different, and inserting a library of synthesised DNA fragments which each includes a nucleotide sequence of formula ATGGGA(NNK)n into expression vectors.
  • Thus the invention provides libraries for use in any host cell for which a suitable promoter to drive expression is known. Modifications to nucleic acid libraries disclosed herein which would be necessary for expression in a different host cell system are within the competence of the skilled man, so a range of bacterial, eukaryotic or yeast compatible libraries may readily be developed. The Shine-Dalgarno sequence which is required for expression in bacteria does not hinder expression in eukaryotic cells.
  • It should be understood that the construct library is capable of generating a repertoire of different molecules with different three dimensional configurations and different functional groups. As with a chemical library, this same library of peptides can be used in many different screening methods; whether it be for allosteric inhibition of an enzyme, the ability to competitively inhibit the activity of an intra-cellular messenger at a receptor binding site, to control nucleic acid transcription or translation, or to restore the function of a mutant protein through interaction therewith which alters the mutant protein's three dimensional shape.
  • Whereas a phage display library is used in an in vitro binding assay, the libraries of the present invention, because the peptides are expressed as free peptides inside the cell, may be used in an in vivo screening method to investigate the activities of the peptide ligands in normal cell conditions. Thus toxic molecules may be excluded at an early stage in testing.
  • The cells in which the library of peptide molecules are to be expressed can internalise the expression vectors into which the ligand repertoire has been constructed and there is thus no need either to synthesize peptides or purify peptide libraries. Preferably, the expression vectors are capable of autonomous replication to prevent the dilution of transfected ligand-encoding plasmids during cell division or the integration of the (e.g. plasmid) vector into the host cell chromosome, which causes difficulties in sequence analysis of the region of the nucleic acid within a cell which encodes the peptide ligand of interest.
  • Recombinant technology to generate expression vectors is well known and a suitable method for generating the library of constructs incorporating the desired sequence variation is described in the Examples. The variation in the nucleic acid sequence which translates into the different peptide ligands is typically generated through the synthesis of a master primer which has regions of known sequence flanking the region which encodes the peptide of formula M-G/M/V—(X)n defined above (see FIG. 2). As the first amino acid is always methionine, the first three nucleotides will be ATG, in the preferred case where the second residue is glycine, the next three nucleotides will be GGA and the remaining triplets follow the formula NNK, wherein N is A, C, T or G and K is G or T. This can allow for any nucleotide in any of the Y positions.
  • The primer is generated adding particular nucleotides in turn until the NNK regions are reached when all 4 nucleotides are added in equal molar amounts for the addition of 2 nucleotides to the growing strand and then G and T are added in equal molar amounts for the third position in the triplet. This cycle of additions is repeated until nucleic acid encoding for the peptides of the required length is generated. There are many commercially available machines or services for synthesis of oligonucleotides, e.g. as provided by Sigma and Life Technologies.
  • This technique provides a simple method for generating a vast range of potentially active biomolecules. The NNK motif of nucleotide addition means that all of the standard amino acids can be incorporated into the peptides in any combination, so the peptides can include all the range of reactive side groups and overall variation in size, shape, polarity, hydrophobicity etc. which are provided by the amino acids individually and by combinations of amino acids in juxtaposition.
  • In a further aspect, the present invention provides a method of identifying an active peptide ligand of formula M-G/M/V—(X)n as defined above which comprises:
  • a) transforming a host cell population with a library of nucleic acid constructs as defined herein;
  • b) culturing the transformed host cells under conditions suitable for intra-cellular expression of the free peptides of formula M-G/M/V—(X)n;
  • c) analysing the host cell population to determine the effect of the expressed free peptides on a reporter system; and optionally
  • d) identifying the peptide of formula M-G/M/V—(X)n in those cells in which the reporter system indicates a positive response.
  • The present invention further provides a method of identifying a peptide ligand of formula M-G/M/V—(X)n as defined above having a desired activity on a target protein, which method comprises:
  • a) transforming a host cell population with a library of nucleic acid constructs which can express peptides of formula M-G/M/V—(X)n;
  • b) culturing the transformed host cells under conditions suitable for intra-cellular expression of the peptides of formula M-G/M/V—(X)n;
  • c) analysing the host cell population to determine the effect of the expressed peptides on said target protein, which target protein forms part of a reporter system; and optionally
  • d) identifying the peptide of formula M-G/M/V—(X)n in those cells in which the reporter system indicates a positive response. A positive response in said reporter system indicating that one or more peptides in that cell has the aforementioned desired activity. The desired activity will typically be an ability to restore wild-type activity in the target protein, the host cells having a non-wild-type mutant form of the target protein.
  • Alternatively viewed, the present invention provides a method of screening a library of peptides of formula M-G/M/V—(X)n wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine and each X, which may be the same or different, is any genetically encoded amino acid, which method comprises:
  • a) transforming a host cell population with a library of nucleic acid constructs which can express free peptides of formula M-G/M/V—(X)n;
  • b) culturing the transformed host cells under conditions suitable for intra-cellular expression of the peptides of formula M-G/M/V—(X)n; and
  • c) analysing the host cell population to determine the effect of the peptides of formula M-G/M/V—(X)n on a reporter system. The reporter system is fully or partially an intra-cellular system.
  • These same method steps could also be considered to define a method of screening a library of nucleic acid constructs. The ‘microgenes’ encoding the library of peptides may be expressed constitutively or expression can be induced from a promoter which is operably linked to said microgene. The constructs may be autonomously replicated or integrated in the host chromosome.
  • We describe methods of “screening a library”. While the library may have been selected or designed to have certain structural motifs and the nature of individual members of that library can be assumed, this phrase implies that it is not known which member is introduced into which cell (or cell sample) and the molecules are not introduced in a controlled series.
  • Suitable transformation and culturing techniques are well known in the art and some are discussed in detail in the Examples hereto. Suitable reporter systems are discussed in detail below and will depend on the desired activity of the ligands. The reporter system will conveniently involve up or down regulation of gene expression, either at the level of transcription or translation. The gene whose level of expression is monitored may be native to the host cell but is typically an exogenous reporter gene. The reporter system will incorporate a target protein whose activity can be altered through interaction with one or more members of the library of peptides defined and described above. Binding or association of the peptide ligand to or with the target protein will, in some cases, result in a conformational charge in the target protein and where this results in a change in activity of the target protein this may be observed through the reporter system. Suitable target proteins are discussed in more detail below and include enzymes and transcription factors. The reporter system may be directly or indirectly related to the desired ligand function. For example, if the ability of a ligand to activate transcription or translation of a gene conferring antibiotic resistance is being investigated, culturing the host cells in the presence of that antibiotic would readily identify successful ligands as cells which expressed non-successful ligands would not grow. Given the size of the peptides, activation is unlikely to be through direct nucleic acid binding but through interaction with the protein machinery which performs and/or controls transcription/translation.
  • Thus analysis of the host cell population may simply be by observation, e.g. of growing plaques or through detecting a particular colour, for example if the β-gal reporter system is used.
  • In step d) the active peptide is typically identified by analysis of the nucleic acid encoding it. The Examples provide a discussion of how the nucleic acid of the constructs from selected host cells may be sequenced. For a given host cell, the link between phenotype of the peptide ligand in terms of its impact on the reporter system and genotype of the microgene encoding said ligand allows convenient identification of ligands exhibiting the desired activity.
  • A library of molecular shapes, i.e. ligands, for in vivo screening has a wide variety of uses. Of particular interest is the potential for members of a library to encode peptides which can take part in protein therapy through protein function restoration. It has recently been shown that while a mutant protein may have an altered three-dimensional structure due to a non-wild type primary structure, the three-dimensional structure can be corrected without changing the primary structure. Small molecules, e.g. peptides, may be used which interact with the mutant protein to modify its three dimensional structure and thus its activity. Proteins are known to change their conformation on binding or associating with a range of different classes of molecules, e.g. other proteins, nucleic acids, prosthetic groups such as heme, ions such as Ca2+ etc.
  • Peptide ligands identified according to the above described method can be used therapeutically to alter the three-dimensional structure of a mutant protein and thus restore or alter the function or activity of that mutant protein. Typically the ligand can restore, in whole or in part, wild type protein function. Peptide ligands identified according to the above screening method may be chemically modified before being used in therapy, in particular to enhance in vivo stability or by introduction of targeting or signal moieties such as the HIV Tat tag or folate, the receptor for which is over-expressed in many cancer cells. There are standard techniques which can be used to achieve this e.g. C terminal modifications such as amidation or N terminal modifications or the use of D- instead of L-amino acids.
  • It has been found that very small peptides can restore or modify the function of target proteins through interacting with them in vivo and causing changes in their conformation. The free peptides expressed by the library of nucleic acid constructs will preferably have only 3-8 amino acids, particularly preferably 3-6 amino acids. If a signal sequence is used, these preferred lengths refer to the peptides generated after cleavage of the signal sequence.
  • In a further aspect, the present invention therefore provides for the use of a library of nucleic acid constructs as defined herein in a method of screening a library of molecules for the ability of members of that library to restore or modify the function of a target protein in an intra-cellular environment, which method comprises introducing the library into cells which have a reporter system which allows the identification of those cells in which the function of the target protein has been restored or modified.
  • Thus the reporter system is compatible with, or more particularly includes, the target protein. The reporter system preferably comprises a reporter gene which is operably linked to a sequence of nucleotides which provides a binding site for the target protein or for a protein which associates with or is a substrate for said target protein.
  • The screening methods of the invention give information about individual members of the library of molecules. The method is performed in an intra-cellular environment (i.e. in vivo), and therefore all toxic compounds will be weeded out very early on. This reduces the cost of screening and of pre-clinical and even phase I clinical trials. The method is able to indicate in a qualitative possibly also quantitative manner the performance of individual molecules in the test system. A target protein and suitable host cells are selected and the method designed so that the reporter system is able to give information about the ability of each molecule to restore or modify the function of the target protein.
  • The reporter system is typically designed to measure restoration of a particular function of the target protein, for example the ability to act as a DNA binding transcription factor. Thus restoration of function refers to a return to wild-type function of the protein in at least one physiologically relevant and measurable respect.
  • In many cases, the wild-type functions of the target protein in all respects may be restored but this is not essential for the successful working of the invention. It will be appreciated that partial restoration of the function of a protein may still be useful and it is not a requirement for a positive identification according to the claimed screening method to provide 100% of wild-type activity.
  • The sensitivity of the reporter system may conveniently be modified to provide a more or less stringent assay and thus to identify as giving “positive” results only those molecules which have achieved a significant increase in protein function. Preferably, molecules giving positive results according to the claimed screening methods will have brought about at least a 30%, more preferably at least a 50%, particularly preferably at least a 70% restoration in wild-type protein function, with respect to the particular function monitored by the intra-cellular reporter system. In some instances, a ligand may correct the mutant to a level of measured activity which is actually greater than wild type. It will be clear to the skilled reader that the art provides a number of ways of comparing the relative activities of different molecules. Some reporter systems e.g. those based on fluorescence may be quantitative and thus facilitate such a comparison between wild-type and corrected mutant. However an assessment relative to the wild-type protein may more conveniently be performed as part of a separate test utilising serial dilutions, measurement of plaque size etc.
  • Suitable target proteins for restoration of function will include those whose presence in mutated form is associated with a disease state. The mutated version of the protein will typically have an altered 3-dimensional structure which affects its ability to interact with other molecules (ions, intra-cellular organelles and other cell components etc.). The members of the molecular library will be screened for their ability to interact with the mutant target and thus alter 3-dimensional structure. The 3-dimensional structure in vivo may be closer in one or more respects to the wild-type 3-dimensional structure as a result of successful interaction with a member of the library but protein function may be restored through a further compensating change in 3-dimensional structure which has a return to wild-type function as a ‘net’ -result. Preferred target proteins will be those wherein point mutation(s) are the cause of disease. Particularly preferred are transcription factors (DNA binding proteins) such as p53, but others would be enzymes, peptide hormones or receptor molecules.
  • All diseases caused by monogenic Mendelian mutations could be targets for treatment with molecules identified according to the screening methods of the present invention. These include genetic (i.e. hereditary) diseases, cancer and symptoms of aging caused by mutations. The host cells should provide a suitable model for determining whether the members of the library are capable of causing the desired change in target protein function with a view, typically, to therapeutic administration of successful members of the library or a derivative thereof.
  • Thus, where protein function restoration is required, the host cells will typically be derived from cancerous or other diseased cells and naturally produce the dysfunctional target protein. Suitable cell lines are available, for example in cell culture collections such as the ATCC and may be derived from osteosarcoma, adenocarcinoma etc., or indeed any established cell line that expresses the mutant protein. Cell lines that lack any expression of the protein of interest can also be used after introducing and expressing the gene encoding the mutated protein.
  • Standard transfection techniques may be used to introduce the nucleic acid constructs into the host cells in which expression of the free peptide ligands is to take place. The terms ‘transformation’ and ‘transfection’ are used interchangeably herein.
  • The appropriate reporter system for a given screening method will depend upon the nature of the target protein under investigation. The reporter system is chosen to be responsive to the target protein and thus be capable of indicating the functional status of the target protein. The mammalian p53 protein illustrates the principle behind a suitable reporter system and is a particularly preferred target protein. P53 functions in mammalian cells mainly through the ability to act as a transcription factor and is a transcriptional activator of several genes associated with cell cycle regulation such as p21, Bax, CD95(Fas/Apo-1) and 4-3-3σ (HME1). It can also down regulate the transcription of cell-cycle-regulating genes such as Bcl2, Cdc2 and cyclin. P53 acts as a checkpoint, monitoring DNA damage and regulating cell cycle progression. Loss of p53 activity predisposes cells to the acquisition of oncogenic mutations and may favour genetic instability −90% of mutations reported in the p53 mutation database are found in the DNA binding domain. Mutations in p53 can lead to cancer formation where p53-mediated apoptosis is deficient due to the mutation.
  • P21 is one of the p53-transactivated genes that are critical in cell cycle control and many p53 mutants found in cancer cells lose the ability to transactivate p21 transcription. Thus the transactivation level of p21 promoter in a cell reflects whether the p53 protein expressed in that cell has wild type function. As discussed in more detail in the Examples, a p53 reporter system was constructed by cloning human p21 promoter upstream of the puromycin resistance gene. The reporter construct can provide human cells with puromycin resistance only in the presence of wild type or wild-type functioning p53 that transactivates the p21 promoter. Such a reporter system offers significant practical benefits as a life-death selection system is possible where only cells with wild-type-functioning p53 can survive when grown in the presence of puromycin.
  • Thus, while the host cells may have a reporter system as part of their normal genome which can be utilised, typically the host cells will have been modified to include a suitable gene based reporter system. The genetic constructs which comprise all or part of the reporter system will thus have been introduced into the host cells, e.g. by standard transformation/transfection techniques, before the screening method is performed. In certain circumstances, the reporter constructs could be introduced at the same time as the molecular library or even, but not preferably, after introduction of the library. Thus for performance of the screening methods of the invention, host cells will preferably have been co-transfected (although not necessarily simlutaneously) with a reporter construct and a construct which encodes a peptide whose ability to restore or modify the function of a target protein is to be investigated. Alternatively, members of a chemical compound library may be contacted with the host cells which have been transfected with reporter constructs.
  • Target proteins are preferably DNA binding proteins which up or down regulate the expression of other genes through binding to promoter or enhancer regions. Their native gene targets may be utilised to report on whether a target protein function has been restored or modified, but preferably their target DNA binding regions will be operably linked to reporter genes such as genes conferring antibiotic resistance, or which encode fluorescent proteins such as GFP (green fluorescent protein), or the much used bacterial reporter enzymes β-galactosidase (β-Gal) or β-glucuronidase (β-Gus) etc. According to a particularly convenient and preferred method the protein products of these reporter genes (e.g. β-Gal or β-Gus) could be fused to a signal peptide in order to ‘display’ the proteins on the cell surface. In this way, the cells expressing the reporter protein can readily be separated physically from other cells that do not express the reporter protein, e.g. by the use of antibodies. Suitable signal peptides are described in the literature and include the signals from a protein such as PDGFR (platelet-derived growth factor receptor). A typical construct would thus be as follows:
  • Secretion signal peptide-reporter protein (eg. β-Gal/β-Gus)-transmembrane domain; e.g.
    METDTLLLWVLLLWVPGSTGD (SEQ. ID. NO.: 12)
    -β-Gal/β-Gus-
    AVGQDTQEVIVVPHSL PFKVVVISAILALVV (SEQ. ID. NO.: 13)
    LTIISLIILIMLWQKKPR
    -stop.
  • Non-DNA binding proteins can also be used as target proteins provided their wild type activities can be monitored and assayed for or the reporter system engineered such that cell survival depends on restoration of the protein function.
  • Identification of cells in which the function of the target protein has been restored or modified(e.g. wild-type function has been restored) will depend on the reporter system used. As described above, this is particularly conveniently performed when cells in which wild-type function has not been restored do not survive in the selected culturing environment. Fluorescence detection can also be used to identify cells in which the function of the target protein has been restored or modified. The target protein may, for example, be an enzyme rather than a nucleic acid binding protein and the ability of the target enzyme to act on a substrate may be used as the intra-cellular reporter system allowing identification of cells in which the function of the target protein has been restored or modified. Other suitable reporter system such as the β-gal. blue/white system are well known in the art and can be used or adapted by the skilled man in his chosen screening method.
  • A library of nucleic acid constructs according to the present invention is conveniently referred to herein as a “Microgenex” library and the stretches of nucleic acid which encode the peptide ligands of interest as “Microgenes”.
  • A further discussion relevant to the context and appreciation of the present invention is found in our co-pending International application filed 23 Jan. 2003 and claiming first priority from GB 0201522.0.
  • The invention will now be described in more detail in the following non-limiting Examples and with reference to the figures in which:
  • FIG. 1 shows in schematic form the general strategy for construction of a library of nucleic acid constructs according to the present invention.
  • FIG. 2 illustrates the strategy of primer design for construction of a Microgenex library. Restriction enzyme sites within primers as well as the encoded amino acids are shown below the primer sequences. M=methionine, G=glycine and X=any of the 20 genetically coded amino acids.
  • FIG. 3 illustrates the construction of a peptide library for expression in mammalian cells.
  • FIG. 4 illustrates the construction of a peptide library for expression in bacteria (E. coli).
  • FIG. 5 gives a map of the constructed Microgenex library of the Examples that can be used for expression in mammalian cells.
  • FIG. 6 gives a map of the constructed Microgenex library of the Examples that can be used for expression in E. coli. “Cm-R” refers to chloramphenicol resistance gene. “p15A” is replication origin compatible with several other replication origins used in molecular biology and biotechnology. PBAD promoter is suppressed by araC but can be induced by L-arabinose. Peptide library coding sequence was inserted under the PBAD promoter between KpnI and HindIII sites. SD sequence is a ribosome-binding site to help in the translation of the peptides in E. coli. The first two amino acids at the N-terminal (“M” for Met, and “G” for Gly) can stabilize the peptides inside cells. The two stop codons assure the termination of translation while the terminator (“term”) sequence will terminate the transcription driven by the PBAD promoter. The DNA fragment of the peptide library was generated by PCR, using Master, Right and Left primers as described in the Examples.
  • FIG. 7 provides a schematic representation of the generated peptide library plasmid. Each peptide in the library is composed of three random amino acids after two N-terminal amino acids Methioline (Met, or M) and Glycin (Gly, or G). The next three X stands for any amino acid that forms a library consisting of randomly distributed three-amino-acid peptides. The peptide-coding DNA-fragment was inserted into the mammalian expression vector pCEP4 between its KpnI and HindIII site under the CMV promoter. The plasmid has OriP that enables the replication of the plasmid inside mammalian cells. Selection marker expression cassette in the plasmid provides the plasmid-transfected cells with hygromycin resistance.
  • EXAMPLES Example 1 Strategy for Microgenex (Nucleic Acid Library) Construction
  • Materials and Methods:
  • 1—Primers.
  • All primers used to construct Microgenex libraries are listed in Table 1.
    TABLE 1
    Primers: Primers used to construct
    Microgenex libraries.
    No Name Sequence bases
    1 Master AAGAGCTCGGTACCAAGAAGGAGTTTACATATGG 71
    GANNKNNKNNKTGATAAGGATCCAAGCTTGAATT
    CAG
    (SEQ. ID. NO.: 14)
    2 Left AAGAGCTCGGTACCAAGAAGGAG 23
    (SEQ. ID. NO.: 15)
    3 Right CTGAATTCAAGCTTGGATCCTTATC 25
    (SEQ. ID. NO.: 16)
  • Primers used for DNA sequence are listed in Table 2.
    TABLE 2
    The Sequencing primers used for the
    Microgenex verifications.
    No Name Sequence bases
    1 pCEP-f AGAGCTCGTTTAGTGAACCG 20
    (SEQ. ID. NO.: 17)
    2 pCEP-r GTGGTTTGTCCAAACTCATC 20
    (SEQ. ID. NO.: 18)
    3 Left* AAGAGCTCGGTACCAAGAAGGAG 23
    (SEQ. ID. NO.: 19)
    4 Right* CTGAATTCAAGCTTGGATCCTTATC 25
    (SEQ. ID. NO.: 20)

    *These primers can also be used to verify sequence of any Microgenex library. Other primers within the used cloning vector can also be used.

    2—Cloning Vectors/Plasmids
      • pCEP4 (Invitrogen): for mammalian expression.
      • pBAD33 (Dr. Beckwith, Harvard; Guzman et al., 1995 J. Bacteriol. 177(14) 4121-4130) for Escherichia coli expression.
        3—Escherichia coli Strain Used for Transforming Constructed Microgenex Libraries:
  • The Electrocompetent DH10B™ (Life Technologies, GibcoBRL) was used to transform the ligated Microgenex expression libraries.
  • 4—PCR Conditions for Microgenex Amplifications
  • 4.1. Concentrations of Primers:
    Master primer  25 PM
    Right primer 100 PM
    Left primer 100 PM
  • 4.2. PCR Reactions:
  • At least two different Polymerases gave good results: Taq polymerase (Life Technologies, GibcoBRL) and rTth DNA polymerase (Perkin Elmer).
      • 4.2.1. Standard Taq polymerase as for example in SuperMix (Life Technologies, GibcoBRL).
      • Prepare 1 to 10 PCR tubes with the following:
        • 90 □l SuperMix
        • 2 □l Master Primer (25 PM)
        • 4 □l Right Primer (100 PM)
        • 1 Left Primer (100 PM)
      • 4.2.2 The XL PCR, extra long (rTth DNA polymerase) (Perkin Elmer) Also this PCR polymerase worked very well after adding buffers according to the manufacturer, but using primers as above and also the PCR cycles as indicated below.
      • 4.2.3. PCR Cycles
  • Best Conditions were found by making a hot start at 95 for 3-5 minutes. PCR cycles were preformed only using 25 cycles. Example for PCR Cycle:
    Hot start 95° C. 3 min.
    25 cycles
    Denaturation 94° C. 1 min.
    Aneaing 38° C. 1 min.
    Elongation 60° C. 1 min.
    Hold at  4° C.

    5. Ligation of PCR Products to the Corresponding Expression Vector:
  • PCR product was first treated with phenol-chloroform, chloroform and then precipitated with 3 volumes of cold ethanol. The DNA pellet was washed with cold 70% ethanol, dried and dissolved in appropriate volume of H2O. DNA was then restricted by KpnI and HindIII. Restricted DNA was gel-purified and subsequently ligated to the corresponding expression vector that has been also restricted with by KpnI and HindIII. Ligation mix was precipitated and resuspended in H2O and subsequently used to transform, by electroporation, of the Escherichia coli strain DH10B™ (Life Technologies, GibcoBRL). Plating was done on LB ampicillin plates (200 □g/ml) for pCEP4, or chloramphenicol (40 □g/ml) for pBAD33.
  • 6. Pooling of Colonies:
  • Colonies from all the plates were pooled together, washed with LB with ampicillin (200 □g/ml) if the cloning vector contains beta-lactamase gene, such in the case of pCEP4. This step is important to get rid of cells that lack the plasmid. Pooled cell mixture was divided into small aliquots and stored at −70□C. Each aliquot contained millions of cells to ensure complexity of the Microgenex library.
  • 7. DNA Isolation of Microgenex Library:
  • One frozen portion can be used to inoculate a one-liter culture of LB containing the appropriate antibiotic. Standard plasmid DNA isolation protocols/kits are used to isolate the DNA suitable to transform, for example mammalian cells for ligand screening.
  • Master primer was used as a template for all constructed peptide expression libraries (Microgenex) whether the expression in mammalian cells or Escherichia coli. It could also be used for yeast or insect cell expression. Left and Right oligoes were used as forward and reverse primers, in order to generate double-stranded DNA fragments encoding repertoire of peptide library by PCR. The first ATG in this fragment encodes Met as the translation initiation site of the peptides. Gly, the second amino acid encoding by the DNA fragment, assures the peptides expressed to be stable inside cells according to the N-end rule. The following three repeats of NNK code (N for A, C, G, and T in equal molar ratio, K for G and T in equal molar ratio) for all possible amino acids thus form the peptide library. Two stop codons, TGA and TAA, are added right after the last NNK to stop peptide translation. The 5′-end KpnI site and 3′-end HindIII allows the fragments to be cloned into pCEP4 vector in the correct direction under CMV promoter.
  • The peptide-coding DNA-fragment was inserted into the mammalian expression vector pCEP4 between its KpnI and HindIII site under the CMV promoter. The plasmid has the replication origin, OriP, which enables the replication of the plasmid inside mammalian cells. Selection marker expression cassette in the plasmid provides the plasmid-transfected cells with hygromycin resistance.
  • After ligation of the KpnI/HindIII restricted PCR product, and the KpnI/HindIII restricted pCEP4 or pBAD 33, the DNA were electroporated into DH10B™ (Life Technologies, GibcoBRL) cells. Transformants were plated on LB plates with ampicillin (200 □g/ml) or Chloramphenicol (40 □g/ml) depending on the coloning vector used in the construction of the Microgenex library. All colonies from all the plates were pooled together, washed with LB with ampicillin (200 □g/ml), only if the cloning vector contains beta-lactamase gene, such in the case of pCEP4. Pooled cell mixture was divided into small aliquots and stored at −70□C. Each frozen portion of the Microgenex library can be used to inoculate and start a 1-liter-culture (with the appropriate antibiotic, in order to prepare the plasmid DNA. Insertions were verified by PCR of the generated Microgenex with Right and Left oligoes. Also, the forward and reverse primers of pCEP4 were used for verifications for the constructed mammalian Microgenex. Similar PCR reactions were preformed using plasmid DNA prepared from a small number of randomly-picked colonies in order to estimate the cloning efficiency and the ratio of peptide-encoding plasmids to empty vectors in any given Microgenex. Note that all generated peptides will start with M and G amino acids followed by three random amino acids (X, X, X) It should be noted that the number of the random amino acids (X) can be changed by changing the number of the codes (NNK).
  • Results
  • Colonies in the range of 25000-75,000 on the plates were commonly obtained ensuring complexity/diversity of the cloned Microgenex repertoire. They were collected into 10-12 portions.
  • Summary of Microgenex libraries are listed in Table 3.
    TABLE 3
    Plasmids used for the construction of microgenex libraries
    Plamid Name Promoter Insertion Selection marker Replication origin Reference.
    pCEP4 CMV No E. coli: Amp, E. coli: ColE, Vector from
    Mammalian: Mammalian: OriP Invitrogen
    Mammalian Microgenex library with Hygromycin B This work, FIG. 2.3.
    microgenex amino acid sequence as:
    library M-G-X—X—X.
    pBAD33 pBAD No Chloramphenicol p15A Vector from
    Beckwith; Guzman,
    et al., 1995
    E. coli Microgenex library with This work.
    microgenex amino acid sequence as:
    library M-G-X—X—X.
  • Plasmid prepared from one of such frozen aliquots was examined by PCR. When mammalian Microgenex library DNA was used as a template in PCR reactions using pCEP4 forward and reverse primers showed that there were insertions with expected size. PCR results from the 6 randomly picked colonies from the original mammalian Microgenex library plates showed that 4 of the colonies had the right construction (showed correct size of insertion) while two did not.
  • A test for the presence of Microgene insertions in a constructed mammalian library by PCR from 6 randomly picked colonies indicated that 4 out of the 6 colonies have the right size of insertion. Also, PCR results from 6 randomly picked colonies from the original E. coli Microgenex library plates showed that 4 of the colonies had the right construction, in that they showed the correct size of insertion.
  • Example 2 Demonstrating the Generation and Use of a Microgene Library in Identifying a Protein Binding Ligand of Interest
  • A—Construction of p53 Reporter Working in Mammalian Cells
  • The dysfunction of mutant p53 in cancer cells can result from not only the inability in binding to its specific recognized consensus DNA sequence, but also the mal-localization inside cells or incompetent to interact correctly with other transcription co-factors. The sub-cellular localization mechanism and transcription machinery in mammalian cells is different from that in prokaryotic organisms. The loss of specific DNA binding ability may engage the post-translational modification processes that are totally different from that in prokaryotic organisms. Thus a p53 reporter for mammalian cells has more advantages than that for prokaryotic cells.
  • P53 protein functions in mammalian cells mainly through its transcriptional factor activity. P21 is one of the p53-transactivated genes that are critical in cell cycle control. P53 can induce cell cycle arrest in G1 phase via transactivation of p21 gene once the cells are under stresses such as DNA damaging. Lots of p53 mutants found in cancer cells lose the ability to transactivate p21 transcription. This leads to not only the vast development of cancer, but also the resistance to cancer chemotherapy and radiotherapy that aim to damage DNA in order to kill cancer cells. Thus the transactivation level of p21 promoter in a cell reflects whether the p53 protein expressed in that cell has wild type function or not.
  • We constructed our p53 reporter by cloning human p21 promoter upstream of the puromycin resistant gene. It has been reported that basal transcription level from p21 promoter is very low. The reporter can provide human cells puromycin resistance only in the presence of wild type or wild-type-functioning p53 protein that transactivates p21 promoter, while leaving the cells sensitive to puromycin with dysfunctional mutated p53 protein. The advantage to use puromycin resistant gene is to set up a life-death selection: only the cells with wild-type-functioning p53 protein can survive from puromycin selection.
  • Materials and Methods
  • Plasmid and vectors used in this Example are WWP-Luc (gift from Prof. Stanbridge E. J.), pPUR (Clontech), and pUC18 (Pharmacia). Enzymes used for cloning are EcoRI (Promega), PvuII (Promega), SalI (Promega), SphI (Promega), T4 DNA ligase (New England BioLab), Klenow (Promega), and SAP (Shrimp Alkaline Phosphotase). Cell lines for the validation of our reporter are U-2 OS cell (ATCC Number: HTB-96, Homo sapiens (human), osteosarcoma, bone, wild type p53), Saos-2 cell (ATCC Number: HTB-85, Homo sapiens (human), osteosarcoma, bone, p53 null), and SW480.7 (ATCC number: CCL 228, Homo sapiens (human), colorectal adenocarcinoma, colon, p53 mutant: R273H/P309S). Medium and other chemicals include McCoy's medium, DME medium, MEM, FCS, calcium phosphate transfection reagents (0.1×TE (pH8.0), 2×HBS (pH7.4), 2M CaCl2, 15% glycerol in HBS), PBS, and puromycin (Sigma).
  • WWP-Luc plasmid was restricted by EcoRI and SalI. The 2366 bp fragment was gel-purified (1% agarose in 1×TBE) by freeze-thaw method. Vector pUC18 was restricted by EcoRI and SalI, dephosphorylized by SAP, and purified by phenol-chloroform. Then the p21 promoter fragment was subcloned into pUC18 vector by T4 DNA ligase, generating a plasmid named as p21uc. All drug resistant markers in the constructed plasmids, are listed in Table 1. Genes involved in the resistance are also listed.
    TABLE 1
    Drug resistant markers in constructed plasmids and their genes.
    Drug Resistant gene
    Puromycin Puromycin-N-acetyl-transferase (pac)
    Neomycin Neomycin phosphotransferase gene
    Hygromycin B Hygromycin B phosphotransferase
    Histidinol Histidinol dehydrogenase (hisD)
  • P21 promoter was cut out from p21 uc first by EcoRI, blunted with Klenow and dNTP, and then by SphI. The 2450 bp-p21-promoter fragment was gel-purified using the same method as above. Vector pPUR was restricted fully by PvuII and SphI, dephosphorylized by SAP, and gel-purified by the same method, which removed SV40 early promoter upstream of puromycin resistant gene. Finally the p21 promoter was cloned into pPUR vector upstream of puromycin resistant gene by T4 DNA ligase, generating the p53 reporter named as p21 ur.
  • Verification of the construction was carried out by restriction of the generated plasmid with HindIII. The original vector pPUR has only one HindIII site thus gives one band of 4257 bp after gel electrophoresis. This HindIII site was removed while constructing the reporter p21ur. Two new HindIII sites were introduced into p21ur with the p21 promoter thus can give two bands, one at 2.4 kb and the other at 3.9 kb.
  • Validation of the responsiveness of p21 ur to p53 was carried out by transfection of p21ur to osteosarcoma cell line U-2 OS that expresses wild type p53 protein, Saos-2 that is p53 null, and colon cell line SW480.7 that expresses R273H and P309S double mutated p53 protein. One day before transfection, 3×105 U-2 OS, 1.5×105 Saos-2, and 3×105 SW480.7 cells were seeded to each well of a 6-well plate. Transfection was carried out by the calcium phosphate method, with 25 μg of p21ur for each well. A 2-minute glycerol-shock was applied 3 hours after DNA was added to the cells and the transfected cells were incubated in MEM/10% FCS at 37□C with 5% CO2 overnight. The transfected cells were transferred to new 6-well plates at the concentration of 5×104 cells per well on the next day, and incubated in growth medium (McCoy's medium/10% FCS for U-2 OS and DME medium/10% FCS for Saos-2 and SW480.7) with puromycin. The same puromycin treatment was also given to the non-transfected cells as a control. Refreshed the medium with puromycin every three-day and observed growth status of the cells.
  • The cloned vector was restricted by HindIII, which proved that there is the p21 promoter, a ˜2.4 kb fragment, inserted upstream of the puromycin resistant gene. DNA electrophoresis showed that the restriction gave two bonds with correct theoretical sizes at 2.4 and 3.9 Kb.
  • Transfection of the reporter p21 ur into U-2 OS cell line provided puromycin resistance to the transfected cells (Table 2). Cells transfected with p21ur kept alive after 5 days of puromycin treatment at 0.5 μg/ml, while the untransfected cells died out. The transfected cells could not survive from higher concentration of puromycin because of the low expression level of p53 protein. Neither transfected Saos-2 cells (null p53) or SW480.7 cells (mutant p53) survived after 5 days of puromycin treatment at the concentration of 0.5 μg/ml or higher, which proved that the resistance of p21 ur-transfected U-2 OS cell to puromycin requires the existence of wild type p53 that transactivates p21 promoter. Thus it validated the proper responsiveness of the reporter p21 ur to p53 in human cells.
    TABLE 2
    Validation of the reporter p21ur - its response to wild type p53 protein.
    Number of cells attached to the bottom of
    Puromycin the well after 5 days of puromycin treatment
    (μg/ml) Untransfected U-2 OS Transfected U-2 OS
    0 Growing well 12600
    0.5 0 11500
    1 0 <5000
    2 0 <5000
    3 0 <5000

    B—Construction of Peptide Library Expression in Mammalian Cells
  • We have constructed a microgene library to provide a series of molecular shapes for testing. This ‘microgenex’ approach enabled us to screen among the expressed peptide repretoire and identify some that could correct mutated p53 and could restore all of its downstream activity.
  • Selection of the functional peptides can be carried out either in vivo or in vitro according to the peptide library. The advantage of in vivo selection over in vitro selection is that it allows the peptides to carry out their function in an environment similar or even exactly the same as where they will later be applied in therapy. This makes the peptides more applicable and simplifies the modification step for the selected peptides. Besides, it can often detect weak and transient interactions.
  • Expression of the peptide library allows in vivo screening for possible peptide ligands that can adjust mutant p53 protein back to its wild type function. Once the peptides were constructed into an expression vector, they were easily internalized by cells. With the help of positive-selection reporters, the possible ligands were also readily identified by the recovery of ligand-encoding plasmids from surviving cells. Moreover, according to this technique, there is no need to either synthesize peptides or purify peptides, which can be expensive and labour intensive. An un-integrated, autonomously replicated mammalian expression vector was chosen to prevent the dilution of transfected ligand-encoding plasmids during cell division, or the integration of the plasmids into the host chromosomes, which causes difficulties in rescuing ligand from surviving cells.
  • Materials and Methods
    pCEP4 (Invitrogen):
    TTGACGCAAA TGGGCGGTAG GCGTGTACGG
    TGGGAGGTCT ATATAAGCAG AGCTCGTTTA
     CAAT         TATA  pCEP Forward
    GTGAACCGTC AGATCTCTAG AAGCTGGGTA
    CCAGCTGCTA GCAAGCTTGC TAGCGGCCGC
    primer      KpnI    HindIII
    TCGAGGCCGG CAAGGCCGGA TCCAGACATG (SEQ. ID. NO.: 21)
    TTGATGAGTT TGGACAAACC
                 EBV Reverse primer
    ACAACTAGAA
    Oligo REGP-10:
    5′-AAGAGCTCGG TACCAAGAAG
    GAGTTTACAT ATG GGA NNK NNK NNK
    TGA TAA
       KpnI     M G X X X stop stop
    GGATCCAAG CTTGAATTCA G-3′ (SEQ. ID. NO.: 22)
       HindIII
    Oligo REGP-11:
    5′-AAGAGCTCGGTACCAAGAAGGAG-3′ (SEQ. ID. NO.: 23)
         KpnI
    Oligo REGP-12:
    5′-CTGAATTCAAGCTTGGATCCTTATC-3′ (SEQ. ID. NO.: 24)
        HindIII
    pCEP Forward primer:
    5′-AGAGCTCGTTTAGTGAACCG-3′ (SEQ. ID. NO.: 25)
    EBV Reverse primer:
    5′-GTGGTTTGTCCAAACTCATC-3′ (SEQ. ID. NO.: 26)
    KpnI, HindIII, T4 DNA ligase,
    10x T4 DNA ligase buffer
  • Oligo REGP-10 was used as a template, while oligos REGP-11 and REGP-12 were used as forward and reverse primers, to generate double-stranded DNA fragments encoding a degenerated peptide library by PCR. The first ATG in this fragment encodes Met as the translation initiation site of the peptides. Gly, the second amino acid encoding by the DNA fragment, assures the peptides expressed are stable inside cells according to the N-end rule. The following three repeats of NNK (N for A, C, G, and T in equal molar ratio, K for G and T in equal molar ratio) code for all possible amino acids and thus form the peptide library. Two stop codens, TGA and TAA, are added right after the last NNK to stop peptide translation. The 5′-end KpnI site and 3′-end HindIII allows the fragments to be cloned into pCEP4 vector in the correct direction under CMV promoter.
  • After ligation of KpnI/HindIII restricted PCR product and KpnI/HindIII restricted pCEP4, the DNA molecules were electroporated into XL1-blue cells and plated on LB plates with ampicillin. All colonies from the plates were poured together, delivered into small portions and stored at −70□C. Each portion of the transformed cells can be inoculated and start a 1-liter-culture in order to prepared the plasmids by Qiagen Maxi-Prep kit. Insertions were verified by PCR of the generated plasmids with oligos REGPD-11 and REGPD-12. Same PCR reactions were also applied to plasmids prepared from 6 randomly picked colonies so as to estimate the cloning efficiency and the ratio of peptide-encoding plasmids to empty vectors in the population. Note that all generated peptides will start with M and G amino acids followed by three random amino acids.
  • Results
  • There were ˜25000 colonies on the plates. They were collected into 12 portions. Plasmid prepared from one of such portions was examined by PCR.
  • Gel electrophoresis showed that there were insertions with correct size. PCR results from the 6 randomly picked colonies from the original library plates showed that 4 of the colonies had the right construction (showed correct size of insertion) while two did not. The structure of the generated peptide library plasmid is shown in FIG. 7.
  • C—Screening for Ligands that can Correct p53 Mutants in SW480.7 Cells
  • SW480.7 is a human colon cancer cell line that carries R273H and P309S double mutations in its p53 gene. R273H is one of the most frequently occurred p53 mutations that have been observed in various cancer tissues. It has been reported that the p53 mutant in SW480.7 loses its function in transactivating its downstream genes such as p21, Bax, 14-3-3σ, and CD95. Our purpose of screening the peptide library constructed above is to find out one or more peptide ligands that can restore wild-type p53 function without alteration of the mutant p53 gene in the cell and disturbing cells with wild type p53. The choice of a human cancer cell line provides the real human environment in which the selected peptides can be modified, degraded, and transported to subcellular organelles, and thus assures the reliability of the selected peptides to function in human body.
  • Materials and Methods
  • Human colon cancer cell line SW480.7 with p53 mutations R273H and P309S (ATCC Number: CCL-228, Homo sapiens (human), adenocarcinoma, colorectal; colon), DMEM medium, FCS, PBS, trypsin, puromycin (sigma), hygromycin (sigma), Peptide library (constructed by our laboratory described as above), pCEP4 vector (Invitrogen), calcium phosphate transfection reagents (0.1×TE (pH8.0), 2×HBS (pH7.4), 2M CaCl2, 15% glycerol in HBS).
  • DNA extraction reagents (TBS buffer (0.8% NaCl, 0.02% KCl, 0.3% Tris, pH7.4), TE buffer (pH8.0), DNA extraction buffer (10 mM TrisCl, 100 mM EDTA, 20 μg/ml RNase, 0.5% SDS, pH8.0), Proteinase K (18 mg/ml), chloroform, phenol (Tris saturated), ethanol, 7.5M ammonium acetate (pH 7.4)), XL1-blue electroporation competent cell, pCEP Forward primer, EBV Reverse primer, dye-terminator thermocyclic DNA sequencing kit, ALF express DNA automatic sequencing system.
  • The peptide library was transfected into SW480.7 using the calcium phosphate method described in the reporter constructing part. Meanwhile pCEP4 was also transfected into the SW480.7 cells as a negative control. On the second day after transfection, puromycin was added into the medium at the concentration of 0.5 μg/ml. Then the transfected cells were washed with PBS and changed with fresh medium with 0.5 μg/ml puromycin every three days. After all the negative control cells died, all the library-transfected cells that still attached to the wells were collected by scraping into ice-cold TBS buffer.
  • Rescue of library plasmids was carried out by the extraction of total DNA from the collected puromycin resistant cells according to Blin and Stafford's method. The extracted DNA was then directly electroporated into the competent cell XL1-blue and plated on LB plates with ampicillin. All the colonies were picked up for plasmid preparation. The plasmids were first screened by size using agarose gel electrophoresis to get rid of the reporters. The rest of the plasmids were sequenced by ALF express DNA automatic sequencing system (Pharmacia) using EBV Reverse sequencing primer (Invitrogen) to reveal the DNA sequence, hence amino acid sequence of the selected peptide ligands.
  • Results
  • No pCEP4-transfected cells survived in the well after 7 days of selection with puromycin, while there were still some living library-transfected cells sticking to the bottom of the well.
  • Electroporation of the DNA extracted from the library-transfected cells after 7 days of selection gave ˜150 colonies. Among them, 50 of the colonies have the right size of library plasmid and were sequenced. One ligand peptide sequence was found. The DNA and amino acid sequences are:
    DNA: 5′-GGTACCAAGAAGGAGTTT
    ACATATG GGA TGG TGT
    ACT TGA TAA
       KpnI
    Amino acid:      M  G  W  C  T (SEQ. ID. NO.: 28)
    stop stop
    DNA: GGATCCAAGCTT-3′ (SEQ. ID. NO.: 27)
    HindIII

    D—Validation of the Selected Peptide Ligand
  • The purpose of this experiment is to confirm that the selected peptide ligand indeed restores the mutated p53 protein to the wild-type p53 function in SW480.7 cells. The selection of peptide ligands was based on puromycin resistance. There might be a chance that the cells got the resistance with, or even without the help of the selected ligand via mechanisms other than p53-transactivated p21 promoter. Thus we should make sure that the mutant p53 in SW480.7 cells functions as wild-type p53 only if the selected ligand is added. Confirmation/validation was achieved in two different ways:
      • 1—We observed the function of the peptide ligand on cell cycle control of SW480.7 cells, as well as that of U-2 OS (WTp53) and Saos2 cells (no p53 copy). The selected peptide ligand can be further verified only if it can induce cell cycle arrest or apoptosis in SW480.7 cells but neither in U-2 OS nor in Saos2 cells, which means the specificity of the selected ligand to p53 mutation in SW480.7.
      • 2—We also tested the transcription levels of genes that are only transactivated by wild-type p53. If the transcription levels of these genes are highly induced in ligand treated cells compared with the negative control (cells transfected only with the empty vector (pCEP4). The selected ligand/peptide was proved to be capable of altering the mutant p53 function in SW480.7 into wild-type function.
  • P53 (wild-type) regulates different cell responses via transactivating different downstream genes. For example p53 causes cell cycle arrest at G1 phase by transactivation of p21 gene. While p53 causes cell arrest at G2 phase by transactivation of CD95 gene. Apoptosis is due to p53 transactivation of 14-3-3σ and/or Bax gene. Thus we measured p21, CD95, and 14-3-3σ gene transcription levels by RT PCR, to reveal the capability of the selected peptide ligand in restoring the wild-type p53 transactivation of the downstream genes. The expression level of β-actin was measured as a control. β-Actin transcription level was used to normalize total mRNA amount between samples for comparison owing to its constitutive expression in all cells.
  • Materials and Methods
  • SW480.7, U-2 OS, Saos 2, DMEM, McCoy's medium, FCS, PBS, trypsin, trypan blue (Sigma), calcium phosphate transfection reagents, peptide ligand (selected above), pCEP4 (Invitrogen), TRIZOL Reagent, chloroform, isopropyl alcohol, 75% ethanol (in DEPC-treated water), 0.01% (v/v) DEPC water, SuperScripII reverse transcriptase (Life Technology), TE buffer (10 mM Tris (pH7.6), 1 mM EDTA), ethanol, 4M ammonium acetate (pH7.0), 1-kb DNA ladder, 10×TBE electrophoresis buffer, 1% agarose in 1×TBE buffer with ethidium bromide.
  • Primers for RT-PCR are listed in Table 3.
    TABLE 3
    RT-PCR primer sequences.
    Name Sequence
    RT-WAF1- 5′-CTACCTCAGGCAGCTCAAGC-3′
    FOR (SEQ. ID. NO.: 29)
    RT-HME1- 5′-AGACAGCACCCTCATCATGC-3′
    FOR (SEQ. ID. NO.: 30)
    RT-CD95- 5′-TGGTGCTCATCTTAATGGCC-3′
    FOR (SEQ. ID. NO.: 31)
    RT-ACTIN- 5′-TGACAAAACCTAACTTGCGC-3′
    FOR (SEQ. ID. NO.: 32)
    CDS 5′-AAGCAGTGGTAACAACGCAGAGTACT(30)N−1N-3′
    Primer (SEQ. ID. NO.: 33)
    PCR 5′-AAGCAGTGGTAACAACGCAGAGT-3′
    Primer (SEQ. ID. NO.: 34)
  • Target genes and their corresponding RT-PCR product using the above primers are listed in Table 4.
    TABLE 4
    RT-PCR primer target genes and their corresponding product.
    Amplify Size of
    Target gene region in RT-PCR
    Primer name Name Function target mRNA product
    RTWAF1FOR p21 G1 arrest 1514-2121 607 bp
    RTHME1F0R 14-3-3σ G2 arrest  652-1245 593 bp
    RTCD95FOR CD95 Apoptosis 1968-2534 566 bp
    RTACTINFOR β-actin cytoskeleton 1236-1793 557 bp
  • Cells SW480.7, U-2OS, and Sa-os2 were transfected with peptide ligand and pCEP4 parallelly using the calcium phosphate method. The transfection plan was shown in Table 5:
    TABLE 5
    Transfection plan for the verification of peptide ligand function.
    Cell type SW480.7 U-2OS Sa-os2
    DNA Ligand pCEP4 Ligand pCEP4 Ligand pCEP4
    Number of 3 3 1 1 1 1
    6-well
    plates
  • After transfection, cells were cultured without drug selection and diluted at the ratio of 1:100 every five days. Cell proliferation status was closely observed until ligand transfected SW480.7 cells died out. Life staining with 0.4% trypan blue was applied to determine the death of the cells. Dead cells can not exclude the dye thus were stained blue. Living cells can pump the dye outside thus were not stained. Total RNA was collected from two wells of one ligand transfected SW480.7 plate and one pCEP4 transfected SW480.7 plate each day on day 0, day 1, and day 2 after transfection. Thus RNA samples were harvested as SW480.7/Ligand-0 day, SW480.7/Ligand-1 day, SW480.7/Ligand-2 day, SW480.7/pCEP4-0 day, SW480.7/pCEP4-1 day, SW480.7/pCEP4-2 day. Protocol for total RNA extraction by TRIZOL® Reagent was according to that given by Life Technology:
  • 1. Homogenization
  • Wash the cell with DEPC water once, then add 1 ml of TRIZOL Reagent per well. Pass the cell lysate several times through a pipette.
  • 2. Phase Separation
  • Incubate the homogenized samples for 5 minutes at room temperature to permit the complete dissociation of nucleoprotein complexes. Add 200 μl of chloroform per 1 ml of TRIZOL Reagent. Cap the sample tubes securely. Shake tubes vigorously by hand for 15 seconds and incubate them at room temperature for 2 to 3 minutes. Centrifuge the samples at no more than 12,000×g for 15 minutes at 2 to 8 C.
  • 3. RNA Precipitation
  • Transfer the aqueous phase to a fresh tube. Precipitate the RNA by adding 500 μl of isopropyl alcohol per 1 ml of TRIZOL Reagent used for the initial homogenization. Incubate samples at room temperature for 10 minutes and centrifuge at no more than 12,000×g for 10 minutes at 2 to 8 C.
  • 4. RNA Wash
  • Remove the supernatant. Wash the RNA pellet once with 75% ethanol, adding at least 1 ml of 75% ethanol per 1 ml of TRIZOL Reagent used for the initial homogenization. Mix the sample by vortexing and centrifuge at no more than 7,500×g for 5 minutes at 2 to 8 C.
  • 5. Re-Dissolving the RNA
  • Briefly dry the RNA pellet. Dissolve RNA in DEPC water by passing the solution several times through a pipette tip, and incubation for 10 minutes at 55 to 60 C. Determine the concentration by measure OD260. Store at −70 C.
  • RT-PCR:
  • First-Strand cDNA Synthesis
  • 1. For each RNA sample, combine the following reagents in a sterile 0.2-ml reaction tube:
    1-3 μl RNA sample
    (using same amount of RNA among samples)
    1 μl CDS primer (10 μM)
    1 μl RT forward primer (10 μM)
    x μl Deionized H2O
    5 μl Total volume
  • 2. Mix and spin the tube briefly.
  • 3. Incubate the tube at 70 C in a thermal cycler for 2 min.
  • 4. Spin the tube briefly to collect contents at the bottom. Keep tube at room temperature.
  • 5. Add the following to each reaction tube:
    2 μl 5x First-Strand Buffer
    1 μl DTT (20 mM)
    1 μl 50x dNTP (10 mM)
    1 μl SuperScripII reverse transcriptase (200 units/μl)
  • 6. Gently vortex and spin the tubes briefly.
  • 7. Incubate the tubes at 42 C for 1 hr in an air incubator or cycler.
  • 8. Add 40 μl TE buffer (10 mM Tris (pH7.6), 1 mM EDTA) to dilute the first-strand reaction product.
  • 9. Heat tubes at 72 C for 7 min to inactivate reverse transcriptase.
  • 10. Samples can be stored at −20 C for up to three months.
  • cDNA Amplification by PCR
  • 1. Preheat the PCR thermal cycler to 95 C.
  • 2. For each reaction, 5 μl of diluted first-strand cDNA and 5 μl of deionized water are added to a labeled 0.2-ml reaction tube.
  • 3. Prepare a master mix for all reaction tubes, plus one additional tube. For each reaction:
    20.5 μl Deionized water
    5 μl 10x PCR buffer
    1 μl 50x dNTP (10 mM)
    1.5 μl PCR primer (10 μM)
    1.5 μl RT forward primer (10 μM)
    10 μl 5x Q solution
    0.5 μl HotStart Taq DNA polymerase
    40 μl Total volume
  • 4. Mix by vortexing and spin the tube briefly.
  • 5. Aliquot 40 μl of the PCR Master Mix into each tube from Step 2.
  • 6. Cap the tube, and place it in the preheated thermal cycler.
  • 7. PCR was carried out using the following cycling parameters:
  • 7. PCR was carried out using the following cycling parameters:
    95 C. 15 min
    2 cycles 94 C. 30 sec
    70 C. 30 sec
    72 C. 1 min
    2 cycles 94 C. 30 sec
    69 C. 30 sec
    72 C. 1 min
    2 cycles 94 C. 30 sec
    67 C. 30 sec
    72 C. 1 min
    2 cycles 94 C. 30 sec
    65 C. 30 sec
    72 C. 1 min
    2 cycles 94 C. 30 sec
    63 C. 30 sec
    72 C. 1 min
    2 cycles 94 C. 30 sec
    61 C. 30 sec
    72 C. 1 min
    2 cycles 94 C. 30 sec
    59 C. 30 sec
    72 C. 1 min
    2 cycles 94 C. 30 sec
    57 C. 30 sec
    72 C. 1 min
    40 cycles 94 C. 30 sec
    55 C. 30 sec
    72 C. 1 min
    1 cycle  72 C. 7 min
     4 C. store
  • 8. Electrophoreses 5 μl of each PCR reaction alongside 0.1 μg of 1-kb DNA ladder on a 1.2% agarose/EtBr gel in 1×TBE buffer.
  • Results
  • After transfecting the ligand into SW480.7 cells for 15 days without puromycin selection, the cells could no longer attach to the wall and died. On the contrary, the ligand-transfected U2-OS and Saos-2 cells were growing well.
  • RT-PCR analysis of the mRNA from ligand transfected SW480.7 cells showed that this ligand can reestablish wild type p53 function. Higher mRNA levels of p21 (WAF1/Cip1), 14-3-3 and CD95 genes were found compared with that in the empty pCEP4 transfected SW480.7 cells. This proved that the apoptosis and cell growth arrest of ligand transfected cells were owing to the ligand that corrected R273H/P309S p53 mutant to wild type function.
  • Example 3 New Reporter System Utilising the Bax Promoter
  • The previous p53 reporter p21ur is based on the transactivation activity of wild type p53 to p21 promoter. It has been proved that up-regulation of p21 protein expression will result in cell growth arrest, but not apoptosis. Another p53 target, Bax (a cell death gene), can lead to apoptosis once its transcription and expression is upregulated. Bax is one of the key mediators for p53 to induce cell apoptosis—to kill the cells that have failed to repair their damaged genomes.
  • Bax promoter is different from p21 promoter, though both have p53 consensus binding sequence. It has been reported that some p53 mutant can transactivate p21 promoter, but not Bax. This indicates a more stringent requirement of p53 conformation for the transactivation of Bax promoter.
  • Green fluorescence protein (GFP) has also been used in the new reporter system. Bax promoter drives a long transcript of both puromycin resistant gene and GFP gene. By inserting an IRES fragment between the two genes that introduces a second ribosome-binding site, they can be translated into protein simultaneously from one transcript (Clontech pIRESneo manual). Thus the green color from GFP will visualize the Bax promoter activity, hence the ligand's activity to correct p53 mutants, in addition to the positive selection of puromycin resistance. The green fluorescence will eliminate the background caused by mechanisms which have cells developed to survive from puromycin selection other than transactivation of Bax promoter.
  • Selection marker is important for integration of the plasmid into genome. A stable p53-null cell line with integrated reporter system will be convenient for ligand screening, especially for non-peptide ligands.
  • Materials and Methods
  • Plasmids used in this Example are: p21ur (above), pIRESneo (Clontech), pEGFP-C1 (Clontech), and pREP8 (Invitrogen).
  • Genomic DNA (human, male, normal)
  • Restriction endonucleases (Promega): AgeI, EcoRV, HindIII, NruI, PvuI, SacI, SalI, SpeI, XbaI, XhoI, XmaI.
  • Modification enzymes: Vent DNA polymerase (NEB), T4 DNA ligase (Promega), and Shrimp alkaline phosphotase (SAP).
    Bax promoter primer:
    Bax promoter Forward:
    5′-atctaagcttgaggcttcagcccgggaat (SEQ. ID. NO.: 35)
    tccag-3′
         HindIII
    Bax promoter Reverse:
    5′-atctaccggtgccagcagtggcgccgtcc (SEQ. ID. NO.: 36)
    aacag-3′
         AgeI
    EGFP primer:
    EGFP Forward:
    5′-aataacccgGGTCGCCACCATGGTGAGCA (SEQ. ID. NO.: 37)
    AG-3′
              XmaI
    EGFP Reverse:
    5′-aataatctagaACTTGTACAGCTCGTCCA (SEQ. ID. NO.: 38)
    TGCCG-3′
              XbaI
            stop
  • Construction of pBaxur-Bax promoter controlled puromycin resistant gene:
      • Making PCR from normal human genomic DNA using Bax promoter primers. The PCR product is 500 bp. It has a SmaI site near its 5′-end and a SacI site near its 3′-end.
  • Cut PCR product Bax promoter with HindIII and AgeI.
      • Cut p21 ur with HindIII and AgeI, and dephosphorylate afterwards. Purify the 3.9 kb fragment from gel.
  • Ligate Bax promoter into the 3.9 kb fragment so that Bax promoter can control the puromycin resistant gene expression in mammalian cells. The final product is denoted as pBaxur.
  • Construction of pBaxurEGFP and p21 urEGFP-Green fluorescent protein downstream of puromycin resistant gene that visualizes the transcription level of Bax or p21 promoter:
      • Amplify EGFP gene fragment from pEGFP-C1 vector by PCR with the EGFP primers.
      • The PCR product is around 750 bp. Cut the product with XmaI and XbaI, then purify it from gel.
  • Cut pIRESneo with XmaI and XbaI, then purify the 4.5 kb fragment from gel.
  • Ligate PCR product with pIRESneo 4.5 kb fragment to generate vector pIRES-EGFP (5.2 kb).
  • Cut p21ur or pBaxur with EcoRV and XbaI, and purify the 3.2 kb or 1.3 kb fragment from gel.
  • Cut pIRES-EGFP with NruI and SpeI, dephosphorylate, and purify the 4.4 kb fragment from gel.
  • Ligate the two fragments to generate transient p53-double-reporter p21 urEGFP (7.6 kb) or pBaxurEGFP (5.7 kb).
  • Construction of pBaxurEGFP-hisD-Integrative plasmid with selection mark as histidinol D resistant:
      • Cut pREP8 with PvuI, SalI, and SacI, purify the 5194 bp fragment from gel.
      • Cut pBaxEGFP with PvuI and XhoI, dephosphorylated, and purify the 3972 bp fragment from gel.
  • Ligate the hisD fragment into pBaxurEGFP fragment to generate integrative p53-double reporter pBaxurEGFP_hisD.
  • The structure of p21 urEGFP was verified by restriction of HindIII, which should give five bands of 3489 bp, 2339 bp, 1041 bp, 360 bp, and 348 bp. Agarose gel electrophoresis of the restricted plasmid showed the expected bands. The structure of pBaxur was verified by restriction of either HindIII together with SacI, which should give two bands of 3996 bp, and 388 bp, or SmaI alone, which should give three bands of 3184 bp, 620 bp, and 580 bp. Agarose gel electrophoresis of the restricted plasmid showed the expected bands.
  • The structure of pBaxurEGFP was verified by restriction of either EcoRI together with XbaI, which should give two bands of 2907 bp, and 2766 bp, or SacI alone, which should give two bands of 3125 bp, and 2548 bp. Agarose gel electrophoresis of the restricted plasmid showed the expected bands.
  • The structure of pBaxurEGFP-hisD was verified by restriction of either EcoRI, which should give two bands of 5955 bp, and 3211 bp, or SacI, which should give two bands of 6618 bp, and 2548 bp. Agarose gel electrophoresis of the restricted plasmid showed the expected bands.
  • Screening for new peptide ligands is done by first transfecting the outlined new reporter p21 urEGFP-hisD along with the cotransfection of the peptide library constructed above into cancer cell lines such as SW480.7. Caspase inhibitor VAD-fmk (ApoAlert®, Clontech) is added to the cell culture at the final concentration of 40 mM in order to prevent apoptisis in the case of the overexpression of BAX gene. Also 0.5 g/ml puromycin is added in the media one-day after transfection. After several days, surviving green cells (excited at 488 nm wavelength) will be the cells containing the candidate ligands. Rescued plasmids from these cells are subjected to DNA sequencing to deduce the amino acid sequences of the encoded peptide ligands.
  • When screening non-peptide ligands, a stable cell line of the reporter p21urEGFP-hisD is constructed. Members of the chemical compound library are added to the cell culture. The cells should also be cultured in the medium containing 40 mM VAD-fmk and 0.5 g/ml puromycin. Surviving green cells after excitation at 488 nm is the indicator for successful candidate ligands.

Claims (42)

1. A method of screening a library of peptides of formula M-G/M/V—(X)n wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine and each X, which may be the same or different, is any genetically encoded amino acid, which method comprises:
a) transforming a host cell population with a library of nucleic acid constructs that expresses free peptides of formula M-G/M/V—(X)n;
b) culturing the transformed host cells under conditions suitable for intracellular expression of the peptides of formula M-G/M/V—(X)n; and
c) analyzing the host cell population to determine the effect of the peptides of formula M-G/M/V—(X)n on a reporter system.
2-27. (canceled)
28. The method of claim 1 wherein n is an integer from 3 to 5.
29. The method of claim 1 wherein the reporter system includes a target protein.
30. The method of claim 29 wherein the target protein is a nucleic acid binding protein.
31. The method of claim 30 wherein the nucleic acid binding protein is p53.
32. The method of claim 1 wherein the reporter system comprises a reporter gene.
33. The method of claim 32 wherein the reporter gene is operably linked to a sequence of nucleotides that provides a binding site for a target protein or for a protein that associates with or is a substrate for a target protein.
34. The method of claim 33 wherein the reporter gene is operably linked to a p21 or Bax promoter.
35. The method of claim 32 wherein the protein product of the reporter gene includes a secretion signal peptide.
36. The method of claim 32 wherein the protein product of the reporter gene includes a transmembrane domain.
37. The method of claim 32 wherein the host cells have been transfected with the reporter gene.
38. The method of claim 1 wherein the peptide library has at least 500 different members.
39. The method of claim 1 wherein the host cells are eukaryotic cells.
40. A method of identifying a peptide ligand of formula M-G/M/V—(X)n wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine, and each X, which may be the same or different, is any genetically encoded amino acid, having a desired activity on a target protein, which method comprises:
a) transforming a host cell population with a library of nucleic acid constructs which expresses peptides of formula M-G/M/V—(X)n;
b) culturing the transformed host cells under conditions suitable for intra-cellular expression of the peptides of formula M-G/M/V—(X)n;
c) analyzing the host cell population to determine the effect of the expressed peptides on the target protein, wherein the target protein forms part of a reporter system; and
d) identifying the peptide of formula M-G/M/V—(X)n in those cells in which the reporter system indicates a positive response.
41. The method of claim 40 wherein n is an integer from 3 to 5.
42. The method of claim 40 wherein the reporter system comprises a reporter gene.
43. The method of claim 42 wherein the reporter gene is operably linked to a sequence of nucleotides which provides a binding site for a target protein or for a protein which associates with or is a substrate for a target protein.
44. The method of claim 43 wherein the reporter gene is operably linked to a p21 or Bax promoter.
45. The method of claim 42 wherein the protein product of the reporter gene includes a secretion signal peptide.
46. The method of claim 42 wherein the protein product of the reporter gene includes a transmembrane domain.
47. The method of claim 42 wherein the host cells have been transfected with reporter gene.
48. The method of claim 40 wherein the peptide library has at least 500 different members.
49. The method of claim 40 wherein the host cells are eukaryotic cells.
50. A library of nucleic acid constructs which expresses free peptides in an intra-cellular environment, the peptides having the sequence M-G/M/V—(X)n, wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine and each X, which may be the same or different, is any genetically encoded amino acid.
51. The library of claim 50 wherein each nucleic acid construct contains a promoter region which is operably linked to the sequence which encodes the peptide of formula M-G/M/V—(X)n.
52. The library of claim 51 wherein the nucleic acid constructs are in the form of expression vectors.
53. The library of claim 52 wherein the expression vectors are capable of autonomous replication.
54. The library of claim 50 wherein n is an integer from 3 to 10.
55. The library of claim 50 which has at least 500 different members.
56. The library of claim 55 which has at least 2000 different members.
57. The library of claim 50 wherein the value of n is the same for each member of the library.
58. A library of peptides, each member of the library having the amino acid sequence M-G/M/V—(X)n, wherein n is an integer from 3 to 18, M is methionine, G is glycine, V is valine and each X, which may be the same or different, is any genetically encoded amino acid.
59. The library of claim 58 wherein n is an integer from 3 to 10.
60. The library of claim 59 wherein n is an integer from 3 to 5.
61. The library of claim 58 which has at least 500 different members.
62. The library of claim 61 which has at least 2000 different members.
63. The library of claim 58 wherein the value of n is the same for each member of the library.
64. A method of generating a library of nucleic acid constructs which expresses free peptides in an intracellular environment, which method comprises synthesizing a library of DNA molecules which include the nucleotide sequence ATGGGA (NNK)n, wherein n is an integer from 3 to 18, N is A, C, T or G and K is G or T and wherein each NNK triplet may be the same or different, and inserting a library of synthesized DNA fragments which each includes a nucleotide sequence of formula ATGGGA (NNK)n into expression vectors.
65. The method of claim 64 wherein during synthesis of the library of DNA molecules equimolar amounts of nucleotides A, C, T and G are added for incorporation at each N position and equimolar amounts of nucleotides G and T are added for incorporation at each K position.
66. The library of nucleic acid constructs prepared according to a method of claim 64.
67. The library of nucleic acid constructs prepared according to a method of claim 65.
US10/493,582 2002-01-23 2003-01-23 Molecular libraries Abandoned US20070128657A1 (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
GB0201522.0 2002-01-23
GB0201522A GB0201522D0 (en) 2002-01-23 2002-01-23 A method
GB0201523.8 2002-01-23
GB0201523A GB0201523D0 (en) 2002-01-23 2002-01-23 Molecular Libraries
PCT/GB2003/000296 WO2003062420A1 (en) 2002-01-23 2003-01-23 Molecular libraries

Publications (1)

Publication Number Publication Date
US20070128657A1 true US20070128657A1 (en) 2007-06-07

Family

ID=27614796

Family Applications (3)

Application Number Title Priority Date Filing Date
US10/502,510 Abandoned US20070105778A1 (en) 2002-01-23 2003-01-23 Methods of screening molecular libraries and active molecules identified thereby
US10/493,582 Abandoned US20070128657A1 (en) 2002-01-23 2003-01-23 Molecular libraries
US13/043,284 Expired - Fee Related US8304211B2 (en) 2002-01-23 2011-03-08 Methods of screening molecular libraries and active molecules identified thereby

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US10/502,510 Abandoned US20070105778A1 (en) 2002-01-23 2003-01-23 Methods of screening molecular libraries and active molecules identified thereby

Family Applications After (1)

Application Number Title Priority Date Filing Date
US13/043,284 Expired - Fee Related US8304211B2 (en) 2002-01-23 2011-03-08 Methods of screening molecular libraries and active molecules identified thereby

Country Status (6)

Country Link
US (3) US20070105778A1 (en)
EP (2) EP1515985B1 (en)
AT (1) ATE495251T1 (en)
AU (2) AU2003202689A1 (en)
DE (1) DE60335702D1 (en)
WO (2) WO2003072600A2 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070105778A1 (en) * 2002-01-23 2007-05-10 El-Gewely Mohamed R Methods of screening molecular libraries and active molecules identified thereby
US20080020377A1 (en) * 2004-03-26 2008-01-24 Wolfgang Hillen Peptide-Based Method for Monitorting Gene Expression in a Host Cell
US20100047783A1 (en) * 2006-06-20 2010-02-25 El-Diery Wafik S Small molecule modulators of p53 family signaling
EP3842807A1 (en) * 2015-12-22 2021-06-30 Phoremost Limited Library of nucleic acid molecules encoding short peptides for use in screening methods

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB0410983D0 (en) * 2004-05-17 2004-06-16 El Gewely Mohamed R Molecules
WO2014182961A1 (en) * 2013-05-08 2014-11-13 The Regents Of The University Of California Structure-based peptide inhibitors of p53 aggregation as a new approach to cancer therapeutics
ES2795982T3 (en) 2013-08-07 2020-11-25 Yeda Res & Dev Peptides capable of reactivating p53 mutants
JP7072508B2 (en) 2016-02-04 2022-05-20 イェダ リサーチ アンド ディベロップメント カンパニー リミテッド Peptides and their use in the treatment of diseases, disorders or conditions associated with mutant p53
US10415094B2 (en) 2016-10-06 2019-09-17 HelicalHelp LLC Risk stratification method for a patient having a polymorphism

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6169073B1 (en) * 1995-02-16 2001-01-02 Bayer Corporation Peptides and peptidomimetics with structural similarity to human p53 that activate p53 function
US6180343B1 (en) * 1998-10-08 2001-01-30 Rigel Pharmaceuticals, Inc. Green fluorescent protein fusions with random peptides
US6821728B1 (en) * 1998-06-22 2004-11-23 Esbatech Ag Screening system
US6875741B2 (en) * 1998-09-02 2005-04-05 Renuka Pillutla Insulin and IGF-1 receptor agonists and antagonists
US7297482B2 (en) * 1998-10-08 2007-11-20 Rigel Pharmaceuticals, Inc. Structurally biased random peptide libraries based on different scaffolds

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9500423D0 (en) * 1995-01-10 1995-03-01 Univ Edinburgh Novel vectors and use thereof for capturing target genes
GB9521544D0 (en) * 1995-10-20 1995-12-20 Univ Dundee Activation of P53 protein and therapeutic applications thereof
AU7457996A (en) 1995-10-20 1997-05-07 Dana-Farber Cancer Institute Cystatin m, a novel cysteine proteinase inhibitor
JP2001509661A (en) * 1996-01-23 2001-07-24 ザ ボード オブ トラスティーズ オブ ザ リーランド スタンフォード ジュニア ユニバーシティ Method for screening trans-dominant intracellular factor peptide and RNA molecule
US6110889A (en) * 1996-06-14 2000-08-29 Board Of Regents, The University Of Texas System Peptide tumor cell growth inhibitors
JP2001501097A (en) * 1996-09-24 2001-01-30 ケイダス ファーマスーティカル コーポレイション Methods and compositions for identifying receptor effectors
AU7238098A (en) * 1997-05-15 1998-12-08 Kyowa Hakko Kogyo Co. Ltd. Peptides having cyclic structures and exerting p53 protein activty-restoring effect on p53 protein mutants
US6472151B1 (en) * 1998-08-19 2002-10-29 Bristol-Myers Squibb Company Method of screening for compounds that modulate the activity of a molecular target
EP1515985B1 (en) * 2002-01-23 2013-01-02 Mohammed Raafat El-Gewely Methods of screening molecular libraries for active molecules that restore the function of mutant p53

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6169073B1 (en) * 1995-02-16 2001-01-02 Bayer Corporation Peptides and peptidomimetics with structural similarity to human p53 that activate p53 function
US6821728B1 (en) * 1998-06-22 2004-11-23 Esbatech Ag Screening system
US6875741B2 (en) * 1998-09-02 2005-04-05 Renuka Pillutla Insulin and IGF-1 receptor agonists and antagonists
US6180343B1 (en) * 1998-10-08 2001-01-30 Rigel Pharmaceuticals, Inc. Green fluorescent protein fusions with random peptides
US7297482B2 (en) * 1998-10-08 2007-11-20 Rigel Pharmaceuticals, Inc. Structurally biased random peptide libraries based on different scaffolds

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070105778A1 (en) * 2002-01-23 2007-05-10 El-Gewely Mohamed R Methods of screening molecular libraries and active molecules identified thereby
US8304211B2 (en) 2002-01-23 2012-11-06 Mohamed Raafat El-Gewely Methods of screening molecular libraries and active molecules identified thereby
US20080020377A1 (en) * 2004-03-26 2008-01-24 Wolfgang Hillen Peptide-Based Method for Monitorting Gene Expression in a Host Cell
US20100047783A1 (en) * 2006-06-20 2010-02-25 El-Diery Wafik S Small molecule modulators of p53 family signaling
EP3842807A1 (en) * 2015-12-22 2021-06-30 Phoremost Limited Library of nucleic acid molecules encoding short peptides for use in screening methods
US11085926B2 (en) 2015-12-22 2021-08-10 Phoremost Limited Methods of screening
US11821904B2 (en) 2015-12-22 2023-11-21 Phoremost Limited Methods of screening

Also Published As

Publication number Publication date
AU2003205830A1 (en) 2003-09-02
US20110294212A1 (en) 2011-12-01
ATE495251T1 (en) 2011-01-15
EP1515985B1 (en) 2013-01-02
EP1515985A2 (en) 2005-03-23
EP1470224B1 (en) 2011-01-12
WO2003072600A3 (en) 2005-01-27
AU2003202689A8 (en) 2003-09-09
WO2003062420A8 (en) 2004-10-07
AU2003202689A1 (en) 2003-09-09
EP1470224A1 (en) 2004-10-27
US20070105778A1 (en) 2007-05-10
WO2003072600A8 (en) 2004-10-28
WO2003062420A1 (en) 2003-07-31
WO2003072600A2 (en) 2003-09-04
US8304211B2 (en) 2012-11-06
DE60335702D1 (en) 2011-02-24

Similar Documents

Publication Publication Date Title
US8304211B2 (en) Methods of screening molecular libraries and active molecules identified thereby
JP4262778B2 (en) Protein fragment complementation assay to detect biomolecular interactions
AU698174B2 (en) A gene expression system utilizing an RNA polymerase prebound to DNA
EP1696038B1 (en) Isolating biological modulators from biodiverse gene fragment libraries
US20060166864A1 (en) Methods for modulating telomerase activity
JPH05507700A (en) Compositions and methods for identifying biologically active molecules
JP2004515219A (en) Tunable catalytically active nucleic acids
EP1053347B1 (en) Peptide detection method
DE102005060920A1 (en) Peptides for interaction with alpha-helical coiled-coil structures and / or coiled-coil sequences, means derived therefrom and their use
JPH11503011A (en) Conditional expression system
US6326150B1 (en) Yeast interaction trap assay
WO1992015702A1 (en) The biological selection of useful molecules
EP0995797A1 (en) Methods for detecting and isolating nuclear transport proteins
AU752792B2 (en) Methods and compositions for the determination of protein function and identification of modulators thereof
US20020177125A1 (en) Human rhinovirus assays, and compositions therefrom
JPH10500311A (en) Factors that interact with nuclear proteins
US7932030B2 (en) System for pulling out regulatory elements using yeast
Ng Elucidating The Mechanism Of Nonsense Suppressor-Induced Premature Stop-Codon Readthrough In Eukaryotes
Ahn Studies on the 2 micron plasmid encoded components for plasmid stability and copy number control in Saccharomyces cerevisiae
JP2009504144A (en) A method for evolution adapted to the continuous purpose of proteins in vivo
WO2005075508A2 (en) Hyperactive stat molecules and their use in assays employing gene activation
CZ295047B6 (en) General-purposes cell system for testing ingression of substances into cell nuclei

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION