AU2007200582A1 - Methods of identifying the activity of gene products - Google Patents

Methods of identifying the activity of gene products Download PDF

Info

Publication number
AU2007200582A1
AU2007200582A1 AU2007200582A AU2007200582A AU2007200582A1 AU 2007200582 A1 AU2007200582 A1 AU 2007200582A1 AU 2007200582 A AU2007200582 A AU 2007200582A AU 2007200582 A AU2007200582 A AU 2007200582A AU 2007200582 A1 AU2007200582 A1 AU 2007200582A1
Authority
AU
Australia
Prior art keywords
target
amino acid
library
acid sequence
gene product
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU2007200582A
Inventor
Arthur J Blume
Neil Goldstein
Ku-Chuan Hsiao
Renuka Pillutla
John Prendergast
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
DGI BIO TECHNOLOGIES Inc
Original Assignee
Dgi Bio Tech Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dgi Bio Tech Inc filed Critical Dgi Bio Tech Inc
Priority to AU2007200582A priority Critical patent/AU2007200582A1/en
Publication of AU2007200582A1 publication Critical patent/AU2007200582A1/en
Abandoned legal-status Critical Current

Links

Landscapes

  • Peptides Or Proteins (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Description

P/00/011 Regulation 3.2
AUSTRALIA
Patents Act 1990 COMPLETE SPECIFICATION STANDARD PATENT Invention Title: Methods of identifying the activity of gene products The following statement is a full description of this invention, including the best method of performing it known to us: -1 A- 0METHODS OF IDENTIFYING THE ACTIVITY OF GENE PRODUCTS This application claims priority to United States provisional patent application Serial No. 60/202,912 filed May 9, 2000 which is incorporated herein by reference in its entirety.
c- 00 I'n FIELD OF THE INVENTION This invention relates to a general method for identifying the activity or function.of gene products by identifying peptide binding partners which cause a cellular response in cells expressing the gene products. Accordingly, this invention is useful for determining the influence which specific genotypes have on phenotypes. In particular, the invention is concemrned with a method of obtaining peptides which bind to a target such as a novel gene product. Such peptides provide 1) sequences that may be used to identify the natural protein partner of the target, and 2) enable synthesis of peptides which alter the phenotype of cells expressing the target. This invention also relates to providing material useful for conducting competitive binding assays capable of identifying small molecules reactive with and modulatory of the target protein.
BACKGROUND OF THE INVENTION Present estimates of the number of different genes range over 30,000 and may read over 150,000 if one considers splicing varients. Despite rapid progress at identifying genes based on analysis of the human genome, progress identifying the activity and function of the gene products lags significantly behind.
A number of methods have been reported for connecting specific genes with specific diseases or conditions of pharmaceutical interest. One general method, referred to here as a genomic'knock-out', eliminates, physically or by mutation, single base deletion or insertion, the gene in question. The earliest of these knock-outs in animals have been done in such a fashion that function can be lost in all cells derived from a single fertilized egg. Genomic knock-outs, done in cells as well as animals have revealed the functions of many genes.
4 0Genomic knock-outs have severe limitations regarding providing useful Spharmacological targets. These limitations result from the occurrence of the knock-out in many places at one time and an all or none event occurring very early in development whereas many diseases result from timed and or graded alteration in gene activity. Furthermore, the gene causing the phenotypic change 00 5 often is not the best target for drug therapy, and little information may be gained Sby this procedure on the best drug target. Lastly, knowing the gene target does not necessarily provide the investigator with a simple tool for obtaining small organic molecules which act on the target of interest and are useful for animal phenotyping and as drug leads.
A second knock-out approach to elucidating gene function is the use of anti-sense nucleic acids to prevent translation of mRNA into functional proteins.
In this approach, antisense molecules can be applied from without or within the cell. Although this method has the clear advantage of being controllable with respect to timing, and graded response, it suffers from the fact that mRNA and protein are not uniformly linked, therefore allowing a large degree of variation in expected protein level manipulation. In addition, antisense approaches are prong to non-specific artifacts which will confound the phenotypic effect.
Sequence analysis mutation identification) or quantitation of mRNA expression, are other genomic approaches to determining gene function and phenotype relation. Both methods however are associated with severe limitatior for discovering the phenotypic relationship. As noted above, RNA quantitation is too removed from protein activity for one to rely on such information. In addition, although mutations can provide an association between a gene and a specific phenotype or condition, most mutations result in all or nothing events much unlike many disease conditions of interest.
Recent information has made it clear that there are large networks of genes coding for products which appear to be interrelated. Knocking out one of such genes influences the level of expression of the other. These gene networks are being elucidated via DNA chip technology which allows for the simultaneous 0 quantitation of mRNA from a very large number of genes. See United States Patent 5,800,992 and WO 95/35505 which are incorporated herein by reference.
Although this information and the data bases derived from them are powerful road maps of networking, they suffer in their inability to distinguish initial c interactions from secondary interactions from nth interactions. -Although 00 Srepeated quantitative analysis over many time points may provide some of this 0information of primary, secondary and later levels of protein interactions, such increases in experimental number are costly and time consuming. Network information on proteins of known function is useful, but is much less useful on genes of unknown function.
Many gene products produce their effect by binding to one or more other peptides or proteins. Presently, there are few approaches for identifying a protein's partner, the protein with which the target gene product directly interacts. This information is critical as most direct protein:ligand interactions, whether non-covalent or covalent, have major consequences for protein activity, including signaling, information transfer within and between parallel signaling cascades, and molecular processing. Examples of such protein:ligand interactions may include those between an enzyme and its substrate, a ligand (peptide or not) and its receptor, or a transport protein and ligand. Regulatory proteins also often function through binding to a molecular partner. Partner information therefore includes knowing, for example, the ligand for its receptor, the substrate for a 'kinase' or a protease, the regulatory protein controlling mRNA translation or DNA transcription.
The classical approach to partner identification is to obtain the target and its partner in some sort of isolated complex. Newer approaches place target and partner in two fusion proteins such that when they are complexed a signal is generated and the fusion protein or its gene sequence is used to identify the partner. For example, the yeast two-hybrid system has been used for partner identification. While the yeast two-hybrid approach is popular, it has a number of inherent problems including a high potential for false positives, the inability to use S non-protein targets-sch as mRNA or membrane bound/extracellular proteins Sand the inability to address postranslational modifications on a target. Moreover, systems based on fusion proteins in general, while powerful, are not easily applied to a very large number of genes of unknown function, as this becomes a random association problem and would necessitate a combinatorial approach N, covering all genes.
00oo For the subclass of proteins which interact with nucleic acids, most are Scurrently identified based on information on the nucleic acid:protein complex which exists either in soluble form, or in gels or via some type of genetic recombinant expression system. Use of such an approach for the very large number of nucleic acid interacting proteins requires extensive efforts.
Partner information is critical to developing a target binding assay capable of identify drug leads. Among the large number of assays that exist, there are in vitro and cellular ones, and many types of binding assay formats in each case.
The vast majority of in vitro ones, contain a target and a ligand but a few require only target. Those without ligand, suffer from not being directed to any particular surface on the target and therefore will generate a high frequency of false positives, i.e. compounds which bind but do not cause a change in target activity.
20 In the case of unkown gene targets, only the non-directed assay could be used which would mean a much larger effort at screening than desired.
Panning of unknown gene products with phage displayed libraries to find natural partners would not seem worthwhile as the published results of panning of known receptor genes with known partners have not shown surrogate peptides to have natural partner amino acid motifs, sequences etc. Panning of the EPO (erythropoietin) and IPO (thrombopoietin) receptors identified potent peptides, which after dimerization are active. However, these identified peptides have no natural TPO or EPO motifs and therefore fail to identify these proteins in database searches based on amino acid sequences.
Housey, United States Patent 5,877,007, relates to methods and compositions for screening for compounds which inhibit or activate a protein of interest expressed byaell relative to a control cell.
SPicksley et al., U.S. Patent 5,770,377, relates to methods of identifying compounds which interfere with the binding of an oncogene protein, such as MDM2, to p53.
Blume, U.S. Patent 6,010,861 relates to methods and compositions for 0 identifying drug candidates based on the ability of the drug candidate to compete 00 ln with a reporter molecule identified from a recombinant library.
cl All of the publications discussed above are incorporated by reference Sherein in their entirety.
SUMMARY OF INVENTION The method presented in detail in this application greatly simplifies identifying a target protein's partners and the establishment of site directed assays for-test compounds which can regulate the activity of the target protein and enable phenotyping in in vitro, cellular and animal model systems and provide drug leads as well. A target protein's partner includes all naturally occurring binding partners and any precusor polypeptides which may be modified post translationally. A target is any naturally occurring target which may be a peptide, a protein, a nucleic acid, a polysaccharide or a combination thereof. A target may be, for example, a receptor, a transport protein, a regulatory site.
In one embodiment, the method of the invention involves the isolation of peptides, preferably from a recombinant phage display peptide library, which bind to the protein and nucleic acid (NA) products of genes of unknown function and contain sufficient information to allow identification of the natural partner protein of the target and high affinity binding peptides. This method can be automated to increase the number of known and unknown genes and gene products that can be used as targets. The term gene products encompass any post translational modifications.
rt In one embodiment of the invention, a method of identifying the function of a) gene products is provided by detecting the phenotypic change in a cell or animal following contact of the gene product with a binding peptide.
In another embodiment of the invention, the function of a binding peptide cand its corresponding gene product is obtained through analysis of sequence oo data bases of naturally occurring protein or nucleic acid sequences. Homology of binding peptides identified from a library which bind a novel gene product, with a known peptide of known function, provides relevant information for determining Sthe function of the novel gene product.
I
In another embodiment, the invention relates to a method for determining the activity of a gene product comprising the steps of 1) expressing the gene product in at least one cell type in which the gene product is active; 2) contacting the cells with a ligand known to bind the gene product; and 3) detecting a change in phenotype in the cells in which the gene product is active.
Thus, this invention provides means for identifying peptide ligands capable of activating or inhibiting gene products through their ability to bind to such gene products as well as the activity and function of the gene products themselves. The identification of active peptide ligands also provides means for identifying other molecules, preferably small organic molecules, which also are active at the sites at which the peptide ligands bind and which therefore are useful as drug candidates. This invention provides methods for identifying the activity of both binding partners, ligand and receptor, of gene products which together result in a phenotypic change.
Peptide binding ligands identified through this invention directly enable phenotyping studies in various systems, including cell, tissue, and simple organism, of surface and intracellular targets. Attachment of cell-penetrating peptide sequences to the peptide binding ligands provides a means for detecting intracellular action of the peptide binding ligand. For example, the reagent BioPORTER® from Gene Therapy Systems may be used to deliver peptide.
The present invention also provides a method to simplify and quicken the a establishment of high through-put screening system (HTS) formats of competition binding assays that can identify small organic molecules and other test substances which are reactive with the surfaces on unknown targets and Sone capable of modifying their activity. This can be used to facilitate 0o 5 phenotyping in more complex models such as organisms and animals and eventually provide leads for drug development.
SDETAILED DESCRIPTION OF THE INVENTION In one embodiment of the invention, the method involves panning of unknown gene protein products or other targets such as regulatory mRNA domains with phage displayed libraries of random peptides and obtaining a set of peptides which bind to such targets. Libraries included fully randomized libraries as well as libraries which contain fixed amino acids at particular positions among the other randomized amino acids. The number of peptide binders obtained may range from about 10 sequences to the order of 100s of sequences. More complex identification motifs may require obtaining a larger number of sequences. The peptide binders are sequenced and used individually or as consensus motifs to search for genes with expressed proteins of matching amino acid sequence. Soluble binding peptide ligands with and without penetrating peptide additions, are obtained from those which contain natural gene motifs or recurring novel sequences via synthetic or recombinant methods. To assess their activity and to identify their function either as gene products themselves, or of gene products to which they bind, they are applied to cells identified to express the target gene. Phenotypic changes including morphological, biochemical, genetic or immunological changes, other than changes in the target protein itself are then observed. The peptide binding ligands may then be labeled and used in competitive site directed assays for small molecules which interact at a regulatory domain of the target protein and as described in U.S.
Patent 6,010,861, incorporated herein by reference.
Another embodiment of the invention is a method of identifying a naturally occurring binding partner or precursor for a target by identifying an amino acid sequence motif which confers detectable binding properties of a peptide by screening a library of expressed amino acid sequences for binding of members cI to the target, identifying amino acid sequence motifs and comparing the 00 Sidentified amino acid sequence motifs to known amino acid sequences of a Sgenome to identify a naturally occurring binding partner or precursor for said Starget. Motifs are patterns of amino acids common to the amino acids of the Ssurrogates and the naturally occurring partner which may contain contact sites for the target. In addition, the nucleic acid sequence for identified naturally occurring binding partner or precursor may be determined A further embodiment of the invention is a method of identifying an amino acid sequence motif which confers binding properties to a natural target by screening a library of expressed amino acid sequences for binding to the target, determining the amino acid sequence of the members of the library which bind to the target, and identifying as motifs common amino acid sequences.
Another embodiment of the invention is a method for determining the activity of a gene product by expressing said gene product in a cell, contacting the cells with a ligand which binds said gene product, and detecting a change in phenotype of the cells. In addition, the invention embodies a method of determining the phenotypic outcome of the expression of a gene product by expressing the gene product in cells, contacting said cells with an amino acid sequence that has a binding motif identified by screening members of a peptide library which bind to the target, and detecting a change in phenotype of the cells.
An additional embodiment of the invention is a method of identifying a naturally occurring binding partner or precursor for a target by identifying an amino acid sequences that bind the target by screening a library of expressed amino acid sequences and comparing the identified amino acid sequence to known amino acid sequences of a genome and identifying a gene product that possesses an amino acid sequence substantially similar to the identified amino acid sequence. The nucleic acid of the identified naturally occurring binding partner may also be determined.
0 The method of the invention has been tested using different types of targets wherein the partners and function are known. For example, one target Swas an extracellular protein growth and differentiation factor. Another target was 00 5 a 5' untranslated RNA domain. These tests are valid as neither target type has 0, been panned before with peptide libraries to yield binding peptide-ligands or 0surrogates which have amino acid sequences sufficient to identify the target's Snatural and known partner. In the former case that partner is the factor's transmembrane receptor and in the later case a ribosomal binding protein, EIF2.
A match of a surrogate amino acid sequence, or at least a part thereof, with a natural sequence enables partner identification. Surrogates containing natural sequences likely interact with regulatory surfaces. Accordingly, these surrogates should be useful as antagonists and some may also be agonists. In either case, agonism and antagonist are readily assayable in a phenotyping study. Antagonism is directly assayable in the presence of the natural partner or after addition of the natural partner to target containing systems. For those surrogates which do not contain natural sequence motifs, one does not know, a priori, whether these entities will be regulatory as the nature of their target's binding surface is unknown. However, analysis of surrogate libraries indicates a very high percentage of binders found by panning methodologies are to regulatory surfaces. The peptide binders are identified by competition with natural ligands, partners or neutralizing antibodies. In order to take into account the possibility of nonregulatory surrogates, phenotyping would be done with a small number, about six surrogates with unrelated sequence motifs and those which modified test systems phenotypes would be used initially for site directed assay development. It is possible that some surrogates would only function as antagonists of agonistic surrogates.
Present databases and computers allow rapid searches for partners based on surrogate sequences. Examples of available computer based I programs to analyse sequences include BLAST, Pattemfind, ExPASy, MEME (Multiple EM for Motif Elicitation), 01 (http://meme.sdsc.edu/meme/websitelintro.html, MAST (Motif Alignment and Search Tool, http://meme.sdsc.edu/mem/website/mast-intro.html).
c (www.expasy.ch/) and ISREC (www.isrec.isb-sib.ch/software/software.html).
oo n identification of surrogates provides tools for partner identification, phenotyping O and small molecule discovery. Given that a site directed assay is available at this early stage for the unknown target, high throughout screening allows the Srapid identification of reactive small molecules of low target affinity.
Combinatorial chemistry, allows for improvements in potency which would then provide small molecules for phenotyping and testing in animal models.
The method of determining the activity and functions of an unknown gene product is determined according to a preferred embodiment of the invention as follows: 1. An unknown full length gene is expressed and the gene product protein is isolated.
2. The gene product is then panned with a >20mer surrogate library as described, for example in U.S. Patent 6,010,861, and members of the library which bind the gene product are isolated and sequenced.
3. The sequences of a representative number of peptide binders are analyzed using a database such as BLASTp and then tBLASTn. These searches on protein and EST databases are directed at uncovering matches to known or unknown proteins and genes.
4. Overlapping ESTs are knitted together to obtain full length partners.
5. Upon positive partner identification, EST databases and general literature may be searched for information on gene expression mRNA, protein and activity levels) a. in various tissues, cells, organisms; -11b. in normal and pathologic states; c. at various developmental times; and O d. other related or known proteins.
SBased on partner identification the function of the expressed target gene 005 may be postulated. Confirmation of its activity and function is then confirmed by Sdetecting its activity in cells in which it is expressed.
COMPUTATIONAL APPROACH TO IDENTIFY NATURAL PARTNER After identification of a surrogate peptide binder, it is subjected to partner analysis using several different database search programs. In addition, the set of multiple surrogate peptide binders are aligned into groups based on motifs or consensus regions. Motifs and consensus regions can be identified by sequence alignment programs like MEME (Multiple EM for Motif Elicitation), (http://meme.sdsc.edu/meme/website/intro.html). The motifs and consensus regions can be used as query pattems to search the available databases using MAST (Motif Alignment and Search Tool, http'J/meme.sdsc.edu/mem/website/mast-intro.html) or Pattemfind. The identified sequences can be further examined for significant differences in the expected frequency of amino acids and the number of time a specific peptide sequence has been repeated.
An example of a strategy for the computational approach to identifying a natural partner is shown below: In the initial step, the entire peptide sequence and consensus motifs (if found) are entered into an Advanced BLAST search (http://www.ncbi.nlm.nih.gov/blast/blast.cgi?Jform=l): using the following parameters: Programs: blastp, tblastn 0-12-
(N
o Databases: protein and nucleotide databases including dbest ~(ESTs), dsts (STSs) and htgs (unfinished high throughput genomic sequences) -pe,---1a00 or 10000 20000 00 5 Matrix: PAM30 or Query: Consensus motif alone and varying combinations of sequence at N- and C-terminal ends c In subsequent steps, motifs and consensus regions identified by sequence alignment programs like MEME are used as query patterns to search the available databases using Pattemfind.
For Pattemfind, the following parameters are used: Databases: Nonredundant, Swissprot, TREST and TRGEN Limit Between 10 and 5000 Query; Consensus motif alone and varying combinations of sequence at N-and C-terminal ends Data obtained from the various searches are analyzed under the following conditions: Analyze results of different searches independently and then together to look for similar classes of proteins (eg. nucleic acid binding proteins, kinases) that may emerge.
Identify some of the best matches that show up in more than one kind of search (eg. same protein/ORF picked up by BLAST searches using different parameters, or by both BLAST and Pattemfind) and compare sequence of protein in this region with other peptide surrogates containing this motif.
Examine potential significance of protein interaction in the context of the cellular function of target.
-13- The output from each search are analyzed for partner hits based on the following criteria: 1. Search gives an exact match of at least 5-7 amino acids or appearance of the partner in at least 50% of the top cohort of any one search, and/or the appearance of the same or related hits occurring in multiple searches.
o00
V)
0 2. Search matches an expected class of protein partners based on c function, cellular location or tissue/disease distribution.
3. Candidate produces a phenotype change when added into the appropriate model system.
Preferably, the partner hit has at least two of the criteria described above.
More preferably, the partner hit appears in at least 50% of the top cohort of any one search or appears (or a related sequence appears) in multiple search results. Even more preferably, the partner hit has an exact match of at least 5 7 amino acids. Criterion 2 addresses the biological relevance of a hit distribution, disease indication, etc.), and criterion 3 relates to the biological activity of the surrogate and its ability to cause a phenotypic change in the appropriate test system.
The homology between the partner and surrogate can range from being scattered over a long stretch (for example 15-25 amino acids) to a perfect match within a short sequence (at least 5-8 amino acids).
The generation of surrogates using large random and diverse libraries is target independent and their utility for partner identification resides in the computational analysis of the identified peptide's sequence. For successful partner identification to be feasible, surrogates must exhibit either the natural linear or conformational surface properties complementary to the target under investigation. The complementary peptide surface is selected via a biological enrichment process panning) which is based on preferential binding potency to the target protein. Since the preferred libraries for use with the invention contain totally random peptides ranging from 10 and up to about amino acids in lengMThnd more preferably 20 to 40 amino acids in length), there -14- S are no known restrictions on the amino acids that can be selected to create the surrogate's 'complementary' surface. Thus, the examples described herein relate to the utility of the surrogate approach for finding the cognate receptor for both protein and non-protein targets. In the case of the surrogates for both HCVmRNA and TNFP, it is clear that the large diversity and size of the original library was, in fact, critical to their successful isolation since libraries of <20 amino acids peptides would not have contained either the KcB7 peptide or the HCV-specific surrogates.
In addition to the data presented in the Examples, we have screened other targets using this approach. While the expected natural partners were found for most of the proteins, there were some instances where surrogates were generated but lacked partner information IGF-1R, growth hormone receptor, etc.). There are several possible explanations for these results.
Examples of targets panned and partners revealed by surrogate peptides.
Target Panned Coagulation FIX
TNF-P
GHR
IgAR IGF-1R
IR
TNFR2 TNFR1 TRAIL receptor fasR PAB 1620 (antip53 antibody) MDM-2 mRNA targets mRNA HCV Natural Site of Action ExtraCell ExtraCell PI.Membr.
PI.Membr.
PI.Membr.
PI.Membr.
PI.Membr.
PI.Membr P.Membr P.Membr IntraCell IntraCell IntraCell IntraCell Partner Revealed by Surrogate none TNFR1 none IgA none none TNF ligands none none fasL p53 p53 RNA binding motif elF3 Phenotype(s) of Surrogate Peptides antagonist antagonist antagonist Agonist and antagonist Agonist and antagonist Agonist and antagonist Antagonist antagonist
ND
antagonist
NT
antagonist
NT
NT
This Table gives a list of the targets panned using the 20mer and random libraries. Column 2 lists the putative site of biological action for each o target. Column 3 describes whether a natural partner was found using a surrogate peptide found from the panning.. Column 4 describes the biological activity of each surrogate in the appropriate biological assay.
Legend: Extracell: Target expressed as an extracellular protein; Pl.Membr 05 Target expressed as a plasma membrane protein; IntraCell: Target expressed 00 5 a intracellularly; TNFO, Tumor Necrosis Factor p; IgAR, IgA receptor, GHR, Growth Hormone receptor, IGF-1R, Insulin-like Growth Factor-1 receptor; IR, SInsulin receptor, TNFR2, Tumor Necrosis Factor Receptor-2 (p75); TNFR1, ci Tumor Necrosis Factor receptor-1 (p55); NT Not Tested.
While the libraries used are large and diverse, it is probable that identification of a surrogate peptide with partner information is a rare event. With that in mind, it may require the isolation and sequencing of large numbers of clones (perhaps >500/target) in order to find the appropriate surrogate for partner identification. In addition, some targets may have complex or unusual protein:protein contact sites that preclude generation of a surrogate with partner information.
Surrogates have also been found to have the minimal structural content necessary to induce a pharmacological effect on any target in addition to their use in partner identification. Most surrogates have been shown to have either agonist or antagonist activity in the appropriate biochemical and/or biological models (see Table above). Surrogates have also been shown to subdivide large 2 contact surfaces into smaller contact domains through which target activity can be modified. These attributes provide for surrogate use in phenotyping and validating novel genes whose functions are unknown and for which there exist no known partners. Surrogates can also be used to develop competitive Site Directed Assays (SDAs) for each essential sub-domain, thereby allowing their use in high throughput screening of large combinatorial libraries of small molecules. See U.S. Patent 6,010,861. Most peptide surrogates isolated from these complex libraries by routine panning procedures bind to regulatory hot spots on varied targets. This non-random association between a surrogate and a -16- I 1target's "hotspor pharmacological active site) assures a high degree of Cprobability that, once found, surrogates will have utility for the rapid development of SDAs capable of identifying small molecules of pharmacological importance.
c, Selecting Expression Systems Of Original 00 5 Target Gene For Phenotyping With Surrogate Two expression systems may be used to assess phenotypic changes resulting from binding of the gene product with the surrogate. In one method, Ccells which express the gene product are identified and used as a natural expression system.
Information from EST data bases (cDNA libraries used to isolate ESTs; and others) is searched for the distribution of expressed cellular and tissue mRNA data collected by Northern blot analysis or other methods including but not limited to expression of protein or activity, if available) encoding the gene product. To identify high expression systems, surrogates may be labelled (via biotin, FITC tags), and used to probe tissue sections, tissue culture cells and organisms by immunological or fluorescent detection such as Elisas and FACS.
Altematively, if natural expression systems are unavailable, an expression system may be created by expressing the gene in cells using'standard techniques. Because the activity of the gene product may be cell type dependent, it is desirable to express the gene in a plurality of cell types.
Expression And Purification Of Novel Protein Open Reading Frames No single heterologous expression system is adequate to produce all protein sequences in high yield and as fully folded active entities. In order to maximize the chances of recovering an active protein, any unknown new sequence should be expressed in multiple expression systems. One method for accomplishing this would be to sequentially clone the desired protein into several expression vectors optimized for individual cell culture expression systems.
Alternatively, commercially available systems have been developed to allow protein sequences te-be-cloned and expressed in several cell culture systems -17simultaneously. One such system, the pTriEx™-1 Multisystem Vector, is r available from Novagen. In this system, the protein sequence to be expressed is cloned into a multisystem vector incorporating consecutive CAG, T71ac and promotors. These three promotors allow high level expression from the single c vector in mamalian, E. coli and insect cells, respectively. The vector also oo vI incorporates HSV-Tag® and His-Tag@ tags on the c-terminus of expressed 0proteins to facilitate immunochemical detection and affinity purification.
Expression levels can be checked using anti-HSV antibodies, and the crude Sproteins can be purified to near homogeneity using metal affinity chromatography. The purified protein would be suitable for use in biopanning and surrogate characterization.
Detecting Phenotypic Changes To detect the activity of the gene, phenotype changes (morphorlogical, biochemical, immunological) are observed following contact of the surrogate to the cell, tissue or organism. The surrogate may be free or attached to a penetrating peptide sequence, as anti-target probe, in fashion similar to known methods used with anti-sense technology.
Phenotyping can be done in natural systems if the target/target interaction is related to an observable phenotype. Under these conditions there is no need to over-express the target in a model cell.
The overall strategy for determining the functions of a gene by detecting changes in phenotype may be summarized as follows: I. OBTAIN SURROGATE: a. Make gene product of unknown functions for panning i. Obtain oligoribonucleotides of 5' and 3'untranslated mRNA domains' ii. Obtain full length DNA and express open reading frame ORF) protein and purify ORF protein product -18- 0 b. Pan peptide libraries (phage, bacterial, yeast, mammalian cell or in C vitro/ribosomal display) against gene product such as, for example: 0 i. Untranslated 3' and 5'mRNA domains, or N ii. ORF encoded protein oo Sc. Sequentially make nth generation mutated libraries based on Spanned surrogate's sequences until a limited number of consensus sequences is obtained.
11. USE SURROGATE SEQUENCES TO a. Search data bases of translated consensus sequences and identify potential partner protein and genes.
b. Synthesize (or recombinantly express a fusion) surrogate consensus peptides obtained in the 1st to nh generation pan of peptide displayed libraries either i. linked to cellular uptake peptide leader such as antanopedia) or ii. free with terminal amino acids for solubility if needed) III. USE SOLUBLE SURROGATES TO DETECT CHANGES IN PHENOTYPE MEDIATED THROUGH ACTIVATION OF THE GENE PRODUCT BY THE SURROGATE a. Add surrogates to intact model cells and quantitate effect b. Add surrogates to in vitro model system and quantitate effect c. Add surrogates at various doses and produce graded phenotypic knockouts.
The following non-limiting examples illustrate various aspects and embodiments of the invention and should not be contrived as limiting the scope of the invention. All references cited herein are incorporated herein by reference in their entirety.
-19- Q EXAMPLES SExample 1: Design of 40-mer and 20-mer Random Peptide Libraries
O
DNA fragments coding for peptides containing 40 random amino acids c were generated by a PCR approach using synthetic oligonucleotides. A 145 00 Sbase oligonucleotide was synthesized containing the sequence (NNK) 40 where N O A, C, T, or G and K G or T. See U. S. Patents 6,143,531, 5,681,726 and 388, which are hereby incorporated by reference. This oligonucleotide was used 0 as the template in PCR reactions along with two shorter oligonucleotide primers, both of which are biotinylated at their 5' ends. The resulting 190 bp product was purified and concentrated (followed by digestion with Sfil and Notl). The resulting 150 bp fragment was purified and the phagemid pCANTAB5E (Pharmacia) was digested with Sfil and Notl. The digested DNA was resolved using a 1% agarose gel, excised and purified by QIAEX II treatment (Qiagen). The vector and insert were ligated overnight at 15 0 C. The ligation product was purified.
Electrocompetent cells were prepared by harvesting cells from a culture broth with an OD of 0.5-0.7 UOD. by centrifugation in a fixed rotor for 10 minutes at 950g. The cells were washed three times with ice cold pure water.
Electroporations were performed at 1500 V in an electroporation cuvette (0.1 mm gap; 0.5 ml volume) containing 12.5 ug DNA and 500 uL of E. coli strain TG1 electrocompetent cells. Immediately after the pulse, 12.5 ml of pre-warmed (42 0 C) 2x YT medium containing 2% glucose (YT-G) was added and the transformants grown at 37 0 C for one hour. Cell transformants were pooled, the volume measured and an aliquot plated onto 2x YT-G containing 100 jLg/ml ampicillin (YT-AG) to determine the number of transformants. The diversity of the random 40-mer peptide cell library was found to be 1.6 X 101 0 The phage library was produced by rescue of the cell library according to standard phage preparation protocols. See Carcamo, et al. Proc. Natl Acad Sci USA (1998) 95:11146-11151. Phage titers were usually 4 X 101 3 CFU/ml.
Sequencing of randomly selected clones from the cell library indicated that about 54% of all-eenes were in-frame. The short FLAG sequence, DYKD, 9 was included at the N-terminus as an immunoaffinity tag. In addition, the E-tag epitope (GAPVPYPDPLEPR) was engineered into the carboxy terminus of the peptide.
A second random phage library of 20-mer peptides was constructed using the same approach. The diversity of this cell library was found to be 1.1 X 1011 Sclones and sequencing revealed 77% of the clones were in frame.
0Example 2: Panning TNF-P c A standard method was used to coat and block all microtiter plates. The target was diluted to 1 mg/ml in 50 mM sodium carbonate buffer, pH 9.5. One hundred microliters of this solution was added to an appropriate number of wells in a 96-well microtiter plate (MaxiSorp plates, Nunc) and incubated overnight at 4° C. Wells were then blocked with MPBS (PBS containing 2% non fat milk) at room temperature for one hour.
Eight wells being used for each round of panning. The phage for the phage library were incubated with MPBS for 30 minutes at room temperature, then 100 pl was added to each well. For the first round, the input phage titer was 4 x 1013 cfu/ml. For rounds 2 and 3, the input phage titer was approximately 1011 cfu/ml. Phage were allowed to bind for two to three hours at room temperature. The wells were then quickly washed 13 times with 300 il/well of MPBS. Bound phage were eluted by incubation with 100 pl/well of 20 mM glycine-HCI, pH 2.2 for 30 seconds. The resulting solution was then neutralized with Tris-HCI, pH 8.0. Log phase TG1 cells were infected with the eluted phage by incubation at 37 OC for 1 hr. Helper phage (M13K07) was then added (multiplicity of infection(MOI)=15) and cells incubated in the presence of 50 pg/ml ampicillin and 2% glucose for 1 hr at 37 °C with shaking at 250 rpm. Following infection, cells were pelleted, resuspended in the initial culture volume of 2xYT containing 50 pg/ml ampicillin and 50 pg/ml kanamycin and grown overnight at 37 OC with shaking at 225 rpm. Cells from the ovemight culture were pelleted and supematant containing phage was recovered. Phage was precipitated with -21o 6% PEG 8000, 300mM NaCI and chilled on ice for 1 hr. Precipitated phage was pelleted by centrifugation at 10,000 x g for 30 min, then resuspended in PBS 1mM MgCl2(1/100 of the initial volume). The phage was used for the next round of panning.
0 For Elisa analysis of individual clones, colonies were picked and phage 00 5 prepared as described above using helper phage, M13K07. Microtiter wells were coated and blocked as described above. Wells were coated with either 0IGF-1R or a control IgG MAb. Phage were added at 100 pl/well and incubated at ci room temperature for 2 hr. The phage solution was then removed, and the wells were washed three times with PBS at room temperature. Anti-M13 antibody conjugated to horseradish peroxidase (Pharmacia Biotech) was diluted 1:3000 in MPBS and added to each well (100 pl/well). Incubation was for another hour at room temperature, followed by PBS washes as described. Color was developed by addition of ABTS solution (100 pil/well; Boehringer). Plates were analyzed at 405 nm using a SpectraMax 340 plate reader (Molecular Devices) and SoftMax Pro software. Data points were averaged after subtraction of appropriate blanks.
A clone was considered "positive" if the A 405 of the well was a 2-fold over background.
An additional series of panning experiments were performed using the eluted phage from the first panning of TNF-p. This additional panning, a subtractive panning, was included to remove any peptides that cross-reacted with other members of the TNF family. In particular, the phage was subsequently panned against TNFR1, TNFR2 and TNF-a.
The panning experiments identified a surrogate peptide, KcB7, with the amino acid sequence RKEMGGGGGPGWSENLFQ. A Blastp search, using several different queries revealed TNFR1 which is the natural biological partner of TNFP.
BLASTp search results for the TNFI Surrogate peptide KcB7 Query: WSENLFQ 4Database: Scoe Sequences producing significant alignments: (bits) Value prf12102238A tumor necrosis factor alpha inhibitor [Homo s 20 2419 gbAAA36756.11 (M60275) TNF receptor [Homo sapiens 20 2419 00 5 pdbIlTNRrR Chain R, Tumor Necrosis Factor Receptor P55 20 2419 pdbI1lNCFIA Chain A, Binding Protein, Cytokine Mol id: 1; 20 2419 refiN?001056. 11 tumor necrosis factor receptor 1 (55kD) 20 2419 >pr112102238A tumor necrosis factor alpha inhibitor [Homo sapiens] Length =160 Score 20.4 bits Expect 2419 Identities 717 Positives 7/7 (100%) Query: 1 WSENLFQ 7
WSENLFQ
Sbjct: 96 WSENLFQ 102 >gblAAA36756.11 (M60275) TNF receptor [Homo sapiens] Length 453 Score 20.4 bits Expect 2419 Identities 7/7 Positives 7/7 (100%) Query: 1 WSENLFQ 7
WSENLFQ
Sbjct: 136 WSENLFQ 142 >pdbjlITNRIR Chain R, Tumor Necrosis Factor Receptor P55 (Extracellular Domain) Complexed With Tumor Necrosis Factor-Beta Length 139 Score 20.4 bits Expect =2419 Identities 7n7 Positives 7/7 (100%) Patternfind search results for the TNF-3 Surroaaep~ieKB Query sequence: WSENLFQ IV. DATABASE: NONREDUNDANT Limit -23gpIM602751339760IAC886035F969E231 TNF receptor [Homo sapiens] Occurences: 1 Position 136 WSENLFQ spIP19438ITNR1_HUMAN4CEFBA96DO3B8225 (TNFRSF1A..)TUMOR NECROSIS FACTOR RECEPTOR 1 PRECURSOR (TUMOR NECROSIS FACTOR BINDING 00 5 PROTEIN 1) (TBPI) (P60) (TNF-R1) (TNF-RI) (P55) (CD120A).[Homo sapiens] Occurences: 1 Position 136 WSENLFQ 2 matches found Closer examination of the complementary sequences revealed that the short N-terminal sequence RKEMG and the C-terminal sequence WSENLFQ were identical to regions on TNFR1 (amino acids 77-81 and 107-113 respectively). These segments corresponded to amino acids within two critical ligand:receptor contact domains. In the case of the N-terminal grouping, the surrogate contained 5 of the 15 amino acids of the 77-81 contact domain whereas in the C-terminal grouping, the surrogate contained 6 of the 9 amino acids identified within the 107-113 contact domain.
Comparison with human TNFR1 extracellular domain
IYPSGVIGLVPHLGDREKRDSVCPQGKYIPQNNSICCTKCHKGTYLYNDCPGPG
QDTDCRECesgsFTASENHLRhcLscSkCRkeMgQVEISSCTVDRDTVCGCRKNQYR HYWSENLFqcFNCSLCLNGTVHLSCQEKQNTVCTCHAGFFLRENECVSCS Contact residues are based on Banner et al., (1993) Cell 73: 431-445.
Bold= contacted by TNFP subunit A lower case contacted by TNFP subunit C italics contacted by TNFP both subunits A and C Underline homology to the clone
TNFP
LPGVGLTPSAAQTARQHPKMHLAHSTLKPAAHLIGDPSKQNSLLWRANTDRAF
LQDGFSLSNNSLLVPTSGIYFVYSQVVFSGKAYSPKAPSspLyLAHEVQLFSsqypfH -24- 0 vPLLSSqKmVYPGLQIEPWLHSMYHGAAFQLTQGDQLSThTdGIPHL VLSPSTVFF
GAFAL
Bold =TNFP subunit A lower case TNFP subunit C 00 Comparison with human TNFR2 extracellular domain rKBMGGGGGGpgwSENIQ
LPAQVAFTPYAPEPGSTCRLREYYDQTAQMCCSKCSPGQHAKVFGTKTSDTVCD
SCEDSTYTQLWNWVPECLSCGSRCSSDQVETQACTrEQNRICTCRpgwYCAISKQ EGRLCAPLRKCPGFGVARPGTETSDVVCKPCAPGTFSNTrSSTDICRPHQICN
WAIPGNASRDAVCTSTSPT
Example 3: Panning RNA target Surrogate peptides were obtained by panning a portion of the 5'UTR of HCV mRNA using both the 20mer and 40mer random libraries. All solutions and surfaces were pretreated with DEPC or RNaseZap (Ambion, Inc.), respectively, to eliminate RNase contamination that may compromise the integrity of the RNA.
Biotinylated RNA target was diluted to 1 mg/mi in binding buffer (PBS containing I MM MgCI 2 denatured at 65 'C for 5 min and reannealed by slow cooling to room temperature to allow for appropriate refolding. The synthetic biotinylated-RNA target had the following sequence, 5'-biotin'AA UUG CCA GGA CGA CCG GGU CCU UUIC UUG GAU CAA CCC GCU CMA UGC CUG GAG AUU-3'. Reannealed RNAs were stored in small aliquots (1 0-25jd/tube) at 0 C. Microtiter wells were treated with RNaseZap (Ambion, Inc.) before use. One hundred microliters of RNA solution diluted to 2.5 ng/ L was added to an appropriate number of wells in a 96-well microtiter plate precoated with Streptavidin (Pierce) and incubated for 1 hr at room temperature. Unbound streptavidin was then blocked with 50 j d of 2 mM biotin at room temperature for 1 h r. Four wells were used for each round of panning and 100 .I phage was added to each well. Pahge were precipitated with RNase-free 6% PEG 8000 00.3 M NaCI, washed with the same solution once and resuspended in RNase-
O
free PBS 1 mM MgCI 2 Superasin (RNase inhibitor from Ambion, Inc.). For c the first round, the input phage titer was 1 x 1013 cfu/ml. For rounds 2 and 3, the oo io input phage titer was approximately 101 cfu/ml. Phage were allowed to bind for Stwo to three hours at room temperature. The wells were then quickly washed 13 times with 400 il/well of PBS. Bound phage were eluted by incubation with 150 0 il/well of 50 mM glycine-HCI, pH 2.2 0.1% BSA for 5 min. The resulting solution was then neutralized with Tris-HCI, pH 8.0. Log phase TG1 cells were infected with the eluted phage by incubation at 37 OC for 1 hr. Helper phage (M13K07) was then added (multiplicity of infection(MOI)=15) and cells incubated in the presence of 50 .Lg/ml ampicillin and 2% glucose for 1 hr at 37 OC with shaking at 250 rpm. Following infection, cells were pelleted, resuspended in the initial culture volume of 2xYT containing 50 tg/ml ampicillin and 50 jig/ml Kanamycin and grown ovemight at 37 OCwith shaking at 225 rpm. Cells from the ovemight culture were pelleted and supematant containing phage was recovered. Phage was precipitated with 6% PEG 8000, 300mM NaCI and chilled on ice for 1 hr. Precipitated phage was pelleted by centrifugation at 10,000 x g for 30 min, washed once with the same solution and then resuspended in PBS 1mM MgC 2 (1/100 of the initial volume).
For Elisa analysis of individual clones, colonies were picked and phage prepared as described above using helper phage, M13K07. Streptavidin-coated microtiter plates were blocked with PBS containing 2% non fat milk for 1 hr at room temperature, treated with RNaseZap, then coated with biotinylated RNA target (100ng/well) by incubation for 1 hr at room temperature. Superasin (RNase inhibitor from Ambion, Inc.) was added to the wells prior to addition of 100 .l/well of phage from isolated clones and incubated at room temperature for 2 hr. The phage solution was then removed, and the wells were washed three times with PBS at room temperature. Anti-M13 antibody conjugated to horseradish peroxiaa-se (Pharmacia Biotech) was diluted 1:3000 in PBS (also -26o containing Superasin) and added to each well (100 Il/well). Incubation was for another hour at room temperature, followed by PBS washes as described. Color was developed by addition of ABTS solution (100 pl/well; Boehringer). Plates were analyzed at 405 nm using a SpectraMax 340 plate reader (Molecular cN Devices) and SoftMax Pro software. Data points were averaged after subtraction 00 rI of appropriate blanks. A clone was considered "positive" if the A40s of the well Owas 2-fold over background.
0 Peptides HCV-3-F5, HCV-3-H8 and HCV-NG-D9 were obtained from the cN 40-mer library. Peptide HCV-3-C3 was obtained from the 20mer library.
Sequence analysis of these surrogate peptide binders to HCV using MEME (Motif Elicitation Program) and other peptide sequence alignment programs identified a consensus sequence TxRLL. Database searches using BLAST and Pattemfind identified a human gene product, subunit p170 of elF3. The consensus sequences are shown below in bold and underlined. Sequences outside the motif that are conserved between the surrogates and elF3 are in Italics and underlined.
HCV ALIGNMENTS eIF3': EDLDNIQTPE- SVLLSAVSGEDTQDRTDRLLLTPWVKFLWESY CONSENSUS: TxRLL HCV-NG-D9 TSGESSGDRTRSRLTSSSARTLPN HCV- 3- F5 LLVTGFF£- QLLLGGAVCGP- STPRLRTGLCRLSGT HCV-3-H8 RRTCGDPAAMLERLSCRAGDYRGASHGERLLNLRGMHQYP HCV-3-C3 OUTPUT FROM ADVANCED BLAST SEARCH FOR HCV mRNA SURROGATE QUERY SEARCH 1: Query sequence: TSGESSGDRTRRVLT 3 Program: blastp Database: swissprot Expect value: 10000
OUTPUT:
Sequences producing significant alignments: Score E Value -27- 0 spIP31258IHXAB_CHICK HOMEOBOX PROTEIN HOX-All (GHOX-1I) spIP23116IIF3AHOUSE EUKARYOTIC TRANSLATION INITIATION FACT...
spIP396901KHS1_YEAST KILER TOXIN KHS PRECURSOR (KILLER OF spIOB3264INTJSGTREPA TRANSCRIPTION ANTITERMINATION PROTEIN..
spIP13 0791CARB -STRTH RRNA METHYLTRANSFERASE
(CAROMCIN-RS...
SI1Q1415211MF3AHUMAX EUKARYOTIC TRANSLATION INITIATION FACTsp P399251 AFG3_YEAST MITOCHONDRIAL RESPIRATORY CHAIN COMPLE...
spIP165611 HEMALVACCT HEMAGGLUTInIN PRECURSOR spIP52023IDP3BSYNP7 DNA POLYMERASE III, BETA CHAIN spIP2 09781IHEMAVACCC HEMAGGLUTININ PRECURSOR sp IP159891 CA36_CHICK COLLAGEN ALPHA 3 (VI) CHAIN PRECURSOR List truncated here...
(bits) 20 19 19 19 19 477 1072 10720 1072 1072 19 1072 19 1072 19 1404 19 1404 19 1404 19 1404 >spIP31258IHXABCHICK H-OMEOBOX PROTEIN HOX-All (GHOX-1I) (CHOX-1. .9) Length 297 Score 20.4 bits Expect =477 Identities 8/11 Positives 9/11 (81%) Query: 2 SGESSGDRTRR 12 SG SSG RTR+ Sbjct: 217 SGSSSGQRTRK 227 >sp IP231161 IF3A_-MOUSE EUKARYOTIC TRANSLATION INITIATION FACTOR 3 SUBUNIT 10 (EIF-3 THETA) (EIF3 P167) (EIF3 P180) (EIF3 P185) (P162
PROTEIN)
20
(CENTROSOMIN)
Length 1344 Score 19.2 bits Expect 1072 Identities 8/13 Positives 10/13 (76%) Query: 2 SGESSGDRTRRVL 14 SGE DRT R+L Sbjct: 133 SGEDTQDRTDRLL 145 >SPIP39690IKS_-YEAST KILLER TOXIN KHS PRECURSOR (KILLER OF HEAT SENSITIVE) Length 708 Score 19.2 bits Expect =1072 Identities 8/13 Positives 10/13 (76%) Query: 3 GESSGDRTRRVLT G+SSG T+R LT Sbjct: 98 GKSSGSATKRGLT 110 -28rl_ 0 sp10832641NUSG-TREPA TRANSCRIPTION ANTITERM~INATION PROTEIN
NIJSG
Length 185 Score 19.2 bits Expect =1072 Identities 7/12 Positives 9/12 (74%) OC 5 Query: 2 SGESSGDRTRRV 13 In +GE GDRT R+ Sbjct: 117 AGEIKGDRTPRI 128 I P13 0791 CARB STRTH RRNA METHYLTRANSFERASE (CARBOMYCIN- RESISTANCE PROTEIN) (1 Length 299 Score 19.2 bits Expect =1072 Identities 8/12 Positives =8/12 (66%) Query: 2 SGESSGDRTRRV 13 SG S DR RRV Sbjct: 40 SGRSEADRRRRV 51 >SPIQ14152IIF3AJ- UMAN EUKARYOTIC TRANSLATION INITIATION FACTOR 3 SUBUNIT 10 (EIF-3 THETA) (EIF3 P167) (EIF3 P180) (EIF3 P185) (KIAAO139) Length 1382 Score 19.2 bits Expect =1072 Identities 8 /13 Positives 10/13 (76%) Query: 2 SGESSGDRTRRVL 14 SGE DRT R+L Sbjct: 133 SGEDTQDRTDRLL 145 >SPIP39925IAFG3 YAST MITOCHONDRIAL RESPIRATORY CHAIN COMPLEXES ASSEMBLY PROTEIN AFG3 (TAT-BINDING HOMOLOG Length 761 Score 19.2 bits Expect =1072 Identities 8/14 Positives 10/14 (71%0) Query: 2 SGESSGDRTRRVLT S +SGD RVLT Sbjct: 136 SSNNSGDDSNRVLT 149 -29- OUTPUT FROM ADVANCED BLAST SEARCH FOR HCV mRNA SURROGATE QUERY SEARCH 2: Query sequence: TSGESSGDRTRRVLTSSS Program: blastp Database: swissprot 00 5 Expect value: -e 10000 Sequences producing significant alignments: Score E Value (bits) spIQ01 7281NACIRAT SODIUM/CALCIUM EXCHANGER 1 PRECURSOR (NA 21 spIP7044INACIMOUSE SODIUM/CALCIUM EXCHANGER 1 PRECURSOR 21 spIP48765INACl-BOVIN SODIUM/CALCIUM EXCHANGER I PRECURSOR 20 190 spIP48766INACICAVPO SODIUM/CALCIUM EXCHANGER 1 PRECURSOR 20 190 spIP324I8INAC1 HUMAN SODIUM/CALCIUM EXCHANGER I PRECURSOR 20 190 spIP23685INCICANFA SODIUM/CALCIUM EXCHANGER 1 PRECURSOR 20 190 spIP48767INACIFELCA SODIUM/CALCIUM EXCHANGER I PRECURSOR 190 spIP08I73ACM4_HUMAN MUSCARINIC ACETYLCHOLINE RECEPTOR M4 19 *4 spIP23II6IF3AMOUSE EUKARYOTIC TRANSLATION INMA71ON FACT... 19 249 spjQ14152IF3A-HUMAN EUKARYOTIC TRANSLATION INITIATION FACT... 19 249 spIPI5656IFGF5_MOUSE FIBROBLAST GROWTH FACTOR-5 PRECURSOR 19 327 spIP30042IESHUMAN ESI PROTEIN HOMOLOG PRECURSOR (PROTEIN 19 327 sp1035491 ICLK2 MOUSE PROTEIN KINASE CLK2 18 428 spIP4976OICLK2_HUMAN PROTEIN KINASE CU<2 18 428 spIPI572MYODHUMAN MYOBLAST DETERMINATION PROTEIN 1 (MYOG 18 428 sp1075069[Y481_HUMAN HYPOTHETICAL PROTEIN KIAA0481 (HH1480) 18 428 spIP025331IKCN.HUMAN KERATIN, TYPE I CYTOSKELETAL 14 (CYTOK 18 561 spIP3O989INTR1_HUMAN NEUROTENSIN RECEPTOR TYPE 1 (NT-R-1) 18 561 spIP3O551 ICCKkRAT CHOLECYSTOKININ TYPE A RECEPTOR (CCK-A R 18 561 spIQ083691GAT4-MOUSE TRANSCRIPTION FACTOR GATA-4 (GATA BIND 18 561 List truncated here...
>spjQO1728INAC1_-RAT SODIUM/CALCIUM EXCHANGER 1 PRECURSOR (NA+/CA2+-EXCHANGE PROTEIN 1) Length 971 Score 21.2 bits Expect Identities 9/15 Positives 11/15 (73%) Query: 3 GESSGDRTRRVLTSS 17 GE G RT ++LTSS Sbjct: 933 GELGGPRTAKLLTSS 947 >spIP70414INAC1_-MOUSE SODIUM/CALCIUM EXCHANGER 1 PRECURSOR (NA+/CA2+-EXCHANGE PROTEIN 1) Length 970 Score 21.2 bits Expect Identities 9/15 Positives 11/15 (73%) Query: 3 GESSGRTRRVLTSS 17 11114GE G RT ++LTSS Sbj ct: 932 GELGGPRTAKaJLTSS 946 >BpIP48765INAC1 -BOVIN SODIUM/CALCIUM EXCHANGER 1 PRECURSOR (NA+/CA2+-EXCHANGE PROTEIN 1) Length 970 Score 19.6 bits Expect 190 tt~ Identities 8/14 Positives 10/14 (71%) c-iQuery: 3 GESSGDRTRRVLTS 16 GE G RT ++LTS Sbjct: 932 GELGGPRTAKLLTS 945 >sPIP48766INAC1SCAVPO SODIUM/CALCIUM EXCHANGER 1 PRECURSOR (NA+/CA2+-EXCHANGE PROTEIN 1) Length 970 Score 19.6 bits Expect =190 Identities 8/14 Positives 10/14 (71%) Query: 3 GESSGDRTRRVLTS 16 GE G RT ++LTS Sbjct: 932 GELGGPR.TAKLLTS 945 >SPIP3241BINAC1JIIHUMAN SODIUM/CALCIUM EXCHANGER 1 PRECURSOR (NA+/CA2+-EXCHANGE PROTEIN 1) Length 970 Score 19.6 bits Expect =190 Identities 8/14 Positives 10/14 (71%) Query: 3 GESSGDR.TRRVLTS 16 GE G RT ++LTS Sbjct: 932 GELGGPRTAKJLTS 945 >sPIP23685INAC1 -CANFA SODIUM/CALCIUM EXCHANGER 1 PRECURSOR (NA+/CA2+-EXCHANGE PROTEIN 1) Length 970 Score 19.6 bits Expect =190 Identities =8/14 Positives 10/14 (71%) Query: 3 GESSGDRTRRVLTS 16 GE G RT ++LTS Sbjct: 932 GELGGPRTAKLLTS 945 -31- (1114 0 spIP48767INAC1_FELCA SODIUM/CALCIUM EXCHANGER 1 PRECURSOR (NA+/CA2+-EXCHANGE PROTEIN 1) Length 970 Score 19.6 bits Expect 190 Identities 8/14 Positives 10/14 (71%) OC 5 Query: 3 GESSGDRTRRVLTS 16 tf~ GE G RT ++LTS Sbjct: 932 GELGGPRTAKJLTS 945 >sPIPO8173IACM4_HUMAN MIJSCARINIC ACETYLCHOLINE RECEPTOR M4 *Length 479 Score =19.2 bits Expect 249 Identities 8/14 Positives 13/14 (92%) Query: 5 SSGDRTRRVLTSSS 18 SSG+++ R++TSSS Sbjct: 10 SSGNQSVRLVTSSS 23 >spIP23116IIF3A_-MOUSE EUKARYOTIC TRANSLATION INITIATION FACTOR 3 SUBUNIT 10 (EIF-3 THETA) (EIF3 P167) (EIF3 P180) (EIF3 P185) (P162
PROTEIN)
(CENTROSOMIN)
Length 1344 Score 1.9.2 bits Expect 249 Identities 8/13 Positives 10/13 (76%) Query: 2 SGESSGDRTRRVL 14 SGE DRT R+L Sbjct: 133 SGEDTQDRTDRLL 145 >5pIQ14152IIF3A_-HUMAN EUKARYOTIC TRANSLATION INITIATION FACTOR 3 SUBUNIT 10 (EIF-3 THETA) (EIF3 P167) (EIF3 P180) (EIF3 P185) (KIAAQ139) Length 1382 Score 19.2 bits Expect 249 Identities 8/13 Positives 10/13 (76%) Query: 2 SGESSGDRTRRVL 14 SGE DRT R+L Sbjct: 133 SGEDTQDRTDRLL 145 -32- >spIP1565GIFGF5 MOUSE FIBROBLAST GROWTH FACTOR-5 PRECURSOR
(HBGF-S)
Length 264 Score 18.8 bits Expect =327 Identities 9/16 Positives =10/16 (62%) 00 5 Query: 3 GESSGDRTRRVLTSSS 18 G+SSG RR T SS Sbjct: 39 GDSSGSRGRSSATFSS 54 >spIP30042IES1 HUMAN ESi PROTEIN HOMOLOG PRECURSOR (PROTEIN KNP-I) (GT335) CI Length 268 Score 18.8 bits Expect =327 Identities 8/18 Positives 11/18 Query: 1 TSGESSGDRTRRVLTSSS 18 T G+ S +RVLT S+ Sbjct: 93 TKGQPSEGESRNVLTES.A 110 >sp10354911CLK2 -MOUSE PROTEIN KINASE CLK2 Length 499 Score 18.4 bits Expect =428 Identities 8/11 Positives 8/11 (72%) Query: 2 SGESSGDRTRR 12 S SS DRTRR Sbjct: 34 SWSSSSDRTRR 44 >BpIP49760ICLK2 UMAN PROTEIN KINASE CLK2 Length 499 Score 18.4 bits Expect =428 Identities 8/11 Positives 8/11 (72%) Query: 2' SGESSGDRTRR 12 S SS DRTRR Sbjct: 34 SWSSSSDRTRR 44 Database searches using Pattemnfind at the ISREC server were performed using parameters appropriate for short protein queries and were successful in identifying a human4;ene product, subunit p 170 of elF3. Searches using the -33- "1 consensus region as the query likewise identified sequence homology to the r large subunit p170 of elF3.
O
Output from Patternfind for HCV mRNA surrogate query Cq Query sequence: DRTxRLL 0 0 5 Database: Nonredundant Limit: splQ14152|IF3A_HUMAN|485C01B28D67EBBA (EIF3S10)EUKARYOTIC STRANSLATION INITIATION FACTOR 3 SUBUNIT 10 (EIF-3 THETA) (EIF3 P167) O (EIF3 P180) (EIF3 P185) (KIAA0139).[Homo sapiens] SOccurrences: 1 Position 139 DRTDRLL spIP4637 3[FAS1_RHOFAA66B6F3DF1286566 (FASI..)CYTOCHROME P450 FAS1 (EC ).[Rhodococcus fascians] Occurrences: 1 Position 170 DRTARLL spIP231161IF3A_MOUSEF4CAE2169F577712 (EIF3S10..)EUKARYOTIC TRANSLATION INITIATION FACTOR 3 SUBUNIT 10 (EIF-3 THETA) (EIF3 P167) (EIF3 P180) (EIF3 P185) (P162 PROTEIN) (CENTROSOMIN).[Mus musculus] Occurrences: 1 Position 139 DRTDRLL Example 4: Panning mRNA Short linear amino acid domains found in naturally occurring RNA-binding proteins were identified in peptides isolated from the random peptide libraries.
These domains are generic, i.e. general RNA binding protein motifs rather than specific RNA binding motifs. Surrogate peptides were obtained by panning a portion of the 5'UTR of four different mRNA targets using both the 20mer and random libraries as described in Example 3. Isolated phage binders from rounds three and four of each pan were sequenced. For each mRNA target, the predicted amino acid sequences of the peptide binders were analyzed for both overall amino acid content and the occurrence of known RNA-binding motifs and consensus domains. All of the peptide binders showed enrichment of arginine residues, as would be expected for RNA binding proteins. Also, tryptophan, -34serine, and glycine residues were enriched. The following table gives a comparison of the specific amino acid composition of peptide binders with regard to their average frequency of occurrence seen within the original unpanned library. These data were compared to the actual frequency of occurrence in the library before and after panning on the various mRNA targets denoted as M1, M2, and M3. All numbers are expressed as a percentage of the expected frequency.
Exp. Freq.
Library Ml M2 M3 Arg 9.4 9.4 13.9 13.0 12.4 Gly 6.3 11.6 12.6 13.0 12.3 Trp 3.1 3.1 5.2 4.3 5.1 Ser 9.4 7.7 10.0 8.3 9.8 0 In addition, several peptides from each pan showed the presence of the RGG box, a well-defined RNA-binding motif, as indicated below. RGG sequences in each surrogate is in bold and underlined.
M1-3-B7 M1-3-E8 M1-3-H6 M1-4-Hi M2-3-Ci M2-3-C9 M2-3-E2 M2-3-E3 M2-3-E9 M2-3-H12 M2 -NG-C7 M3-3-B9 M3-3-C2 M3-3-C7 M4-NG-A4
RGLFTEWFRGGSWSNYRVTS
TDGGRSVISDNVBgSRLWLWRHiGSWS-AWGPQDAWS SK
RVSSAQPGCTSRVRFRCPULLFNGVTSTNPKTGLSNAQ
VVYVGVLSYWPHL SGGGRLQVRCLI GRGGFGCRGG
WPPGRTLSDLIEAGARGM
SSGGLHRWSAIJRGGHGHGLA
AMRLKPIAFKGPRAGAGWVEVPCFAAFRAACTESHHH
LHAGWDVTAPRRACKGAQGPGLHGRFYCHE9LCSGLGRC
DE.SSLKGKLRGALVRLGMGHAMPHRGGVWPSTGRPSKQG
WTPRHGPMRCWRHSVFPVGAGPHWALWPIKGPRGGRTAC
RKTGSNIWLPLYHKVCPASTRAGNGGGSRFLWGSMQTNC
RLQRRGGAVAVVWVGFGVGLLWGRLLLI ILGWVLMWFLS QHSEHGGTEWRKRgGMAFAASFLCMRDSYRTTRLRSLLG GTRHVINRVRDSSGVPCKRFGGLQFSQMGKCTI PREA VLBMVGKGLMWCQEVDWRTGGPRSNLWGLWNGRaPPK Furthermore, one sequence was found from panning the 20-mer random peptide library on traget M1 that contained the KH motif, which is also a known RNA-binding motifs. The surrogate motif corresponding to the KH domain is in Sbold and underlined.
KH Motif VIGxxGxxF M1-3-C6 GVIGGRGLLEPLSGFLHQHR 00 Example 5: Panning of Tie- (pro-anaiogenic tvrosine kinase) C Surrogates acting as for Tie-1 were identified by panning against Tie-1.
O Six wells of a 96-well microtiter plates were coated with Tie-1 extracellular c domain (R&D Systems) at concentrations ranging from 50-500 ng/well). Plates are incubated overnight at 4°C. At the same time, an aliquot of E.coli, strain TG1 was inoculated into 2x YT media and grown overnight at 37 0 C. The next day, unbound antigen was removed and the coated wells were blocked with 300 ul of 2% non-fat milk in PBS (NFM-PBS) for one hour at room temperature. The plates were then washed plates 3 times with PBS. The phage libraries were thawed and mixed with 0.1 vol of PBS-2% non-fat milk (NFM), 100 pl of each library was added to the antigen-coated wells and the plates are incubated for 3 hours at room temperature. Each well was washed 13 times with PBS-2% NFM and the phage eluted with 100ul of 50 mM glycine-HCL containing 0.1% BSA (pH2.2) following a five minute incubation. The eluted phage from each library was pooled, neutralized with 100 ul of 1M Tris-HCI (pH and added to 10 ml of log phase E coli TG1 (OD 0 oo and amplified in 2x YT- glucose medium 2 for one hour at 37°C. Helper phage (M13K07) and ampicillin were then added and the cells were incubated for an additional hour at 37°C. The cells were pelleted at 3500 RPM for 20 minutes, resuspended in 2x YT-AK medium (YT medium containing ampicillin and kanamycin) and incubated overnight at 37"C.
The next day, the infected bacterial cells were centrifuged at 3500 RPM at 4°C for 15 minutes and the pellet discarded. The supematant contained the phage and was precipitated with volume of 30% PEG-8000 in 1.6 M NaCI by incubating on ice for 1 hour. The precipitant was centrifuged at 10,000 RPM at 4°C for 30 minutes and the phage pellet resupended in about 1 ml of NFM-PBS.
-36r" The phage was then used for the next round of panning. Three-four rounds of Spanning were done for both the 20-mer and 40-mer libraries. Two to three hundred random clones were picked from rounds 3 and 4 and grown in 96 well cluster plates as a master stock.
cN For screening, 40 ul of master stock was transferred from each master to oo Sanother set of cluster tubes containing 400 pjl of 2x YT-AG and helper phage S(final concentration of 5XlO 10 The tubes were incubated at 37 0 C with constant shaking for two hours. The cultures were centrifuged at 2500 x g at 4°C ci for 20 minutes, the supematant was discarded, and the bacterial pellet was resuspended in 400ul of 2x YT-AK medium and was incubated ovemight at 37 0 C. At that time, the cells were removed by centrifugation at 2500 x g and the supematants were transferred to a new set of cluster tubes and used in ELISA or stored at 4 0
C.
Each well of a MaxiSorp plate (Nunc) was coated with 100 4l of target (1 pg/ml) ovemight at 4 0 C. The wells were blocked with NFM-PBS for 1.5 hours at room temperature. Phage was added at 100 ul/well and the plates incubated for 3 hours at room temperature. After washing 3x with PBS-Tween, plates were probed with an anti-M13 antibody conjugated to horseradish peroxidase (1:3000 in PBS-NFM) for 1 hour at room temperature followed by addition of 100 ul of ABTS for 15-30 minutes at room temperature. The OD was measured using a SpectraMax Microplate Spectrophotometer (Molecular Devices) at 405 nM after a 30 minute incubation at room temperature.
A total of 104 binders were sequenced yielding 32 unique sequences.
Several different peptide motifs were identified that selectively bind to the Tie-1 receptor but not to other tyrosine kinases (insulin receptor, IGFR-1R). The 3 criteria for a positive clone is a >2 fold difference vs. an unrelated target. The results of the following database searches identified mannose-binding protein associated serine protease 2 (MASP-2) as a nature partner.
C
1o Sequences of Peptide Binders to Tie-1 SConsensus: GxAVVFLDRWGNP >RPT13 SLWGCSGRAVLFLDSVGNPTGTVRC >RPT9 RRVDAGGAVVYLDRWGNVSV >RPT34 VVFLDRWGNPQYLGVKASGG TI1-G11-R40 GPFSWLFETEWGNPKTVPFGADRWNRHGRWDPGPVSDYGT 00 Results of Advanced Blast Search 0 Ci Reference: Altschul, Stephen Thomas L. Madden, Alejandro A.
SSchAffer, C Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
0 Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.
RID: 988980952-24595-10839 Query= RPT9: RRVDAGGAVVYLDRWGNVSV letters) Database: Non-redundant SwissProt sequences 96,103 sequences; 35,068,824 total letters Score E Sequences producing significant alignments: 2 (bits) Value gi 7387859IsplO00187|MASP2_HUMAN MANNAN-BINDING LECTIN SERIN... 29 0.17 Alignments >gi 7387859 1 sp000187 IMAS2_HUMAN MANNAN-BINDING LECTIN SERINE PROTEASE 2 PRECURSOR (MANNOSE-BINDING PROTEIN ASSOCIATED SERINE PROTEASE 2) (MASP-2) Length 686 -38- C 0 Score 29.1 bits Expect 0.17 Identities 13/25 Positives 16/25 Gaps= S7/25 (28%) SQuery: 2 RVDAGGAVVYLD--- -RW---GNVS 19 R D+GGA+V+LD RW G VS Sbjct: 630 RGDSGGALVFLDSETERWFVGGIVS 654 00 o O Example 6: Generation of Agonist/Antagonist Assays for Tie-1 Recoptor for Determining Phenotypic Effects of Surrogates: Surrogates are tested for agonist and/or antagonist activity in cell lines expressing both full lengthTie-1 and a chimeric receptor containing the extracellular domain of Tie-1 and the cytoplasmic region of the epidermal growth factor receptor (EGFR). The EGFR was chosen because: a) both the EGFR and Tie-1 are receptor tyrosine kinases; b) both appear to signal following dimerization and c) there is an extensive body of information regarding EGFR signal transduction pathways and the downstream events involved in transcription and cell growth.
Several models are used including cell proliferation and gene reporter assays. In the proliferation models, full length and chimeric Tie-1 are transfected into the IL-3 dependent cell line, FDC. After selection, these cells proliferate in the presence of a putative Tie-1 agonist. In gene reporter assays, various gene- 2 reporter systems are used, including STAT (signal transducer and activator of transcription) -luciferase, STAT-GFP (green fluorescence protein), SRE (serum response element)-luciferase and SRE-GFP. Co-transfection experiments establish cell lines expressing either full length or chimeric Tie-1. These STAT and SRE lines allow the high throughput screening of phage clones to determine the putative bioactivity of the peptide surrogates. See Carcamo, et al. Proc. Natl Acad Sci USA 95:11146-11151.
The complete ORF of the Tie-1 gene is cloned from fetal human brain (Clontech Quick-Clone cDNA) or fetal human heart using the following primers: -39a 5' Tie-1 forward: GGT CGG CCT CTG GAG TAT GGT CTG 3' Tie-1 reverse: TCC TTG AGG CAG CTT AAG TCA GAG 0 The complete ORF of the EGFR gene is cloned from the above libraries or from a placental cDNA library (Clontech Placenta Marathon ready cDNA) o 5 using the following primers: EGFR forward: GGA GCA GCG ATG CGA CCC TC 3' EGFR reverse: GGT CCT GGG TAT CGA AAG AGT CTG G In the chimeric receptor, the extracellular and transmembrane regions of Tie-1 are joined to the cytoplasmic kinase domain of the EGFR with an NHE I site which will add the amino acids alanine and serine at the junction. The primers for generating the chimeric receptor are the following primers (with the NHE site underlined): EGFR forward: GCG CTG CTA GCC GAA GGC GCC ACA TCG TTC Tie-1 reverse: GCT GCT GCT AGC GAT GCA CAC CAG GGT TAA AAG G Both the full length Tie-1 and the chimeric receptor are cloned into pCDNA 3.1 for transfection experiments.
The various target cell lines are used to screen surrogate peptides with agonist and antagonist activity. The surrogates are used as peptidomimetics or for the generation of Site Directed Assays and small molecule discovery via high throughput screening.

Claims (43)

  1. 2. The method according to claim I wherein the target is an untranslated region of mRNA.
  2. 3. The method according to claim I wherein the target is a cellular receptor.
  3. 4. The method according to claim I wherein said library comprises a 25 peptide library of random amino acid sequences. The method according to claim 4 wherein the peptides of said library comprises a random sequence of about 10 to about 50 amino acids.
  4. 6. The method according to claim 5 wherein the random sequence comprises about 20 to 40 amino acids.
  5. 7. The method according to claim 6 wherein the random sequence consists essentially_.about 20 amino acids. -41-
  6. 8. The method according to claim 6 wherein the random sequence consists essentially of about 40 amino acids.
  7. 9. mammalian. The method according to claim 1 wherein the genome is The method according to claim 9 wherein the genome is human.
  8. 11. The method according to claim 1 wherein the target is selected from the group consisting of receptors, transport proteins, transcription regulatory sites and translation regulatory sites.
  9. 12. The method according to claim 1 wherein the target comprises a protein.
  10. 13. nucleic acid.
  11. 14. polysacchari The method according to claim I wherein the target comprises a The method according to claim 1 wherein the target is a de. The method according to claim 1 wherein the motif comprises 5 to S. 8 amino acid
  12. 16. The method according to claim 15 wherein the common amino acids of said motif are contiguous.
  13. 17. A method of identifying a motif comprising an amino acid sequence of a post translational gene product wherein said motif confers detectable binding properties at a natural target of said post translational gene product, said method comprising screening a library comprising a plurality of different expressed amino acid sequences for binding of members of said library to the target; separating members of the library which bind to the target; determining the amino acid sequence of the members of the library which bind to the target; -42- (0 and identifying as motifs common amino acid sequences among the determined Samino acid sequences of said target binding members of the library.
  14. 18. The method according to claim 17 wherein said library comprises a peptide library of random amino acid sequences. 00oo
  15. 19. The method according to claim 18 wherein the peptides of said library comprises a random sequence of about 10 to about 50 amino acids. The method according to claim 19 wherein the random sequence 1 comprises about 20 to 40 amino acids.
  16. 21. The method according to claim 20 wherein the random sequence consists essentially of about 20 amino acids.
  17. 22. The method according to claim 20 wherein the random sequence consists essentially of about 40 amino acids.
  18. 23. The method according to claim 17 wherein said library is a library derived from a primary library by fixing the identity of certain amino acids in 20 known positions of said members of said library.
  19. 24. The method according to claim 17 wherein the common amino acids of said motif are contiguous. A method for determining the activity of a gene product, said 25 method comprising: a) expressing said gene product in a cell; b) contacting said cells with a ligand which binds said gene product; and c) detecting a change in phenotype of cells in which said gene product is expressed.
  20. 26. The method according to claim 25 wherein said gene product is expressed in a pluralypf different cell types. -43- t 27. The method according to claim 25 wherein said ligand possess a T consensus amino acid sequence determined from a plurality of members of a peptide library which bind said gene product.
  21. 28. The method according to claim 27 wherein said ligand possesses 00 5 an amino acid sequence enabling the ligand to enter said cell.
  22. 29. The method according to claim 25 wherein the change in phenotype is detected based on a change in cell growth.
  23. 30. The method according to claim 25 wherein the change in phenotype is detected based on a change in cell morphology.
  24. 31. The method according to claim 25 wherein said ligand is homologous to a natural peptide.
  25. 32. A method of determining the phenotypic outcome of the expression of a gene product comprising; a) expressing the gene product in cells; 20 b) contacting said cells with an amino acid sequence comprising a motif which binds said gene product and wherein said motif is identified from members of a peptide library which bind to the target; and c) detecting a change in phenotype of cells in which said gene product is expressed.
  26. 33. The method according to claim 32 wherein said gene product is expressed in a plurality of different cell types.
  27. 34. The method according to claim 32 wherein said amino acid sequence possesses an amino acid sequence enabling it to enter said cell. The method according to claim 32 wherein the change in phenotype is detected based on a change in cell growth. -44-
  28. 36. The method according to claim 32 wherein the change in phenotype is detected based on a change in cell morphology. O 37. The method according to claim 32 wherein said motif is present in a naturally occunrring gene product. 00oo
  29. 38. A method of identifying a naturally occurring binding partner, or 0 binding partner precursor, for a target, said method comprising: a) identifying an amino acid sequence which binds to said cI target by screening a library comprising a plurality of different expressed amino acid sequences for binding of members of said library to the target; separating at least one member of the library which bind to the target; determining the amino acid sequence of said member of the library which bind to said target; and; b) comparing the identified amino acid sequence of said member to known amino acid sequences of a genome and identifying a gene product of said genome possessing an amino acid sequence substantially similar to said identified amino acid sequence as the naturally occurring binding partner, or partner precursor, for said target.
  30. 39. The method according to claim 38 wherein the substantially similar amino acids are identical and contiguous. The method according to claim 39 wherein at least 5 amino acids are identical and contiguous.
  31. 41. The method according to claim 38 wherein the target is an untranslated region of mRNA.
  32. 42. The method according to claim 38 wherein the target is a cellular receptor.
  33. 43. The method according to claim 38 wherein said library comprises a peptide library of random amino acid sequences. a) C4
  34. 44. The method according to claim 43 wherein the peptides of said library comprises a random sequence of about 10 to about 50 amino acids. The method according to claim 44 wherein the random sequence comprises about 20 to 40 amino acids.
  35. 46. The method according to claim 45 wherein the random sequence consists essentially of about 20 amino acids.
  36. 47. The method according to claim 45 wherein the random sequence consists essentially of about 40 amino acids.
  37. 48. mammalian. The method according to claim 38 wherein the genome is
  38. 49. The method according to claim 48 wherein the genome is human. The method according to claim 38 wherein the target is selected from the group consisting of receptors, transport proteins, transcription regulatory sites and translation regulatory sites.
  39. 51. The method according to claim 38 wherein the target comprises a protein.
  40. 52. The method according to claim 38 wherein the target comprises a nucleic acid.
  41. 53. The method according to claim 38 wherein the target is a polysaccharide.
  42. 54. The method according to claim 38 wherein the motif comprises 5 to 8 amino acids. A method for identifying a nucleic acid sequence encoding a naturally occurring binding partner, or binding partner precursor, for a target, said method comprising..__ a) identifying an amino acid sequence motif which confers detectable binding properties of a peptide comprising said motif to a target by screening a library comprising a plurality of different expressed amino acid sequences for binding of members of said library to the target; separating c members of the library which bind to the target; determining thd amino acid 00 OC) sequence of the members of the library which bind to said target; and identifying as motifs common amino acid sequences among said determined amino acid sequences; b) comparing the identified amino acid sequence motifs to known amino acid sequences of a genome and identifying a gene product of said genome possessing said motif as the naturally occurring binding partner, or partner precursor, for said target; and c) identifying said nucleic acid sequence encoding said naturally occurring binding partner, or partner precursor.
  43. 56. A method of identifying a nucleic acid sequence encoding a naturally occurring binding partner, or binding partner precursor, for a target, said method comprising: a) identifying an amino acid sequence which binds to said target by screening a library comprising a plurality of different expressed amino acid sequences for binding of members of said library to the target; separating at least one member of the library which bind to the target; determining the amino acid sequence of said member of the library which bind to said target; b) comparing the identified amino acid sequence of said member to known amino acid sequences of a genome and identifying a gene product of said genome possessing an amino acid sequence substantially similar to said identified amino acid sequence as the naturally occurring binding partner, or partner precursor, for said target; and c) identifying said nucleic acid sequence encoding said naturally occurring binding partner, or partner precursor.
AU2007200582A 2000-05-09 2007-02-09 Methods of identifying the activity of gene products Abandoned AU2007200582A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2007200582A AU2007200582A1 (en) 2000-05-09 2007-02-09 Methods of identifying the activity of gene products

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US60/202,912 2000-05-09
AU2007200582A AU2007200582A1 (en) 2000-05-09 2007-02-09 Methods of identifying the activity of gene products

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
AU2001261369 Division 2001-05-09

Publications (1)

Publication Number Publication Date
AU2007200582A1 true AU2007200582A1 (en) 2007-03-01

Family

ID=37847273

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2007200582A Abandoned AU2007200582A1 (en) 2000-05-09 2007-02-09 Methods of identifying the activity of gene products

Country Status (1)

Country Link
AU (1) AU2007200582A1 (en)

Similar Documents

Publication Publication Date Title
USRE42150E1 (en) Binding proteins for recognition of DNA
JP4060886B2 (en) Isolation and utilization of SH3-binding peptides
US6977154B1 (en) Nucleic acid binding proteins
EP1179059B1 (en) Isolating biological modulators from biodiverse gene fragment libraries
US7223547B2 (en) Polypeptides having a functional domain of interest and methods of identifying and using same
JP2003515745A (en) Direct screening method
WO2009086116A2 (en) Alternative scaffold protein fusions phage display via fusion to plx of m13 phage
JP2004506898A (en) Functional protein array
CZ20013399A3 (en) Protein isolation method and protein analysis, particularly mass spectrometry analysis
JP2004533840A (en) PDZ domain ligand by phage display
JP2002538517A (en) Methods for identifying putative peptides
EP1456240A2 (en) Target specific screening and its use for identifying target binders
WO1998023781A1 (en) Ligand detection system and methods of use thereof
EP1003853B1 (en) Methods for protein screening
Li ORF phage display to identify cellular proteins with different functions
US20030054348A1 (en) Methods of identifying the activity of gene products
AU2007200582A1 (en) Methods of identifying the activity of gene products
JP2002506828A (en) Peptide ligand for erythropoietin receptor
Kurakin et al. Target-assisted iterative screening reveals novel interactors for PSD95, Nedd4, Src, Abl and Crk proteins
WO2003045990A2 (en) Protein-protein interactions involving transforming growth factor beta signalling
Pillutla et al. A surrogate-based approach for post-genomic partner identification
AU7539800A (en) Dna library
AU726759B2 (en) Improvements in or relating to binding proteins for recognition of DNA
JP2003532431A (en) Methods for designing and screening random libraries of compounds
AU2002349903A1 (en) Target specific screening andits use for identifying target binders

Legal Events

Date Code Title Description
TH Corrigenda

Free format text: IN VOL 21, NO 9, PAGE(S) 940 UNDER THE HEADING COMPLETE APPLICATIONS FILED - NAME INDEX UNDER THE NAME DGI BIO TECHNOLOGIES INC., APPLICATION NO. 2007200582, UNDER INID (54) CORRECT THE TITLE TO READ METHODS OF IDENTIFYING THE ACTIVITY OF GENE PRODUCTS.

MK1 Application lapsed section 142(2)(a) - no request for examination in relevant period