WO1993003157A1 - Plasmids for the rapid preparation of modified proteins - Google Patents

Plasmids for the rapid preparation of modified proteins Download PDF

Info

Publication number
WO1993003157A1
WO1993003157A1 PCT/US1992/006187 US9206187W WO9303157A1 WO 1993003157 A1 WO1993003157 A1 WO 1993003157A1 US 9206187 W US9206187 W US 9206187W WO 9303157 A1 WO9303157 A1 WO 9303157A1
Authority
WO
WIPO (PCT)
Prior art keywords
fusion protein
affinity ligand
labeled
protein
glutathione
Prior art date
Application number
PCT/US1992/006187
Other languages
French (fr)
Inventor
Michael A. Blanar
William G. Kaelin
Original Assignee
Dana Farber Cancer Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dana Farber Cancer Institute filed Critical Dana Farber Cancer Institute
Publication of WO1993003157A1 publication Critical patent/WO1993003157A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/62DNA sequences coding for fusion proteins
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • C07K14/4701Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
    • C07K14/4736Retinoblastoma protein
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/20Fusion polypeptide containing a tag with affinity for a non-protein ligand
    • C07K2319/23Fusion polypeptide containing a tag with affinity for a non-protein ligand containing a GST-tag
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/40Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation
    • C07K2319/43Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation containing a FLAG-tag
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/50Fusion polypeptide containing protease site
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/90Fusion polypeptide containing a motif for post-translational modification

Definitions

  • iodination protocols relying on chlora ine T or iodogen result in modifications at tyrosine residues and some histidine residues.
  • SUBSTITUTESHEET reagent result in the modification of free amino groups on lysine residues. In some proteins this particular modification may also have a deleterious effect on structure or activity.
  • the 5 lactoperoxidase method for iodination employs gentler conditions, this method also leads to modification of tyrosine and histidine residues, with the potential for structural distortion and loss of activity.
  • biotinylation protocols are 0 frequently performed using a succinimide ester of biotin. The biotin is coupled to the protein through free amino groups, typically on lysine residues. Again, modification at one or more positions may alter structure and/or function of the protein. In ddition, extensive dialysis is needed to remove uncoupled biotin, which may be deleterious to the protein.
  • the present invention relates to expression vectors comprising nucleic acid sequences which encode an affinity ligand (e.g., an enzyme, epitope) and a modification recognition sequence.
  • the vectors further comprise at least one restriction site for the insertion of a nucleic acid sequence capable of encoding a selected polypeptide in frame with the affinity ligand and modification sequence.
  • the resulting construct codes for a fusion protein comprising an affinity ligand, the selected polypeptide and a modification recognition sequence.
  • the fusion protein may be isolated by virtue of the affinity ligand and then modified.
  • the expression vectors may further comprise a nucleotide sequence encoding a cleavable linker, such
  • the sequence encoding the cleavable linker is located between the affinity ligand and the restriction site for insertion of the sequences encoding the selected polypeptide. In this location, the linker may be
  • the invention further relates to expression vectors containing a nucleotide sequence encoding a gene for a selected polypeptide and capable of directing the expression of the selected
  • polypeptide as a fusion protein.
  • the pGEX-2TK expression vector is one embodiment of the present invention.
  • This vector encodes a protein comprising, from amino to carboxyl terminus, glutathione-S-transferase (GST) as an affinity 2 ligand, the thrombin cleavage site as a cleavable linker, and a phosphorylation recognition site for the cAMP-dependent protein kinase as a modification recognition sequence.
  • GST glutathione-S-transferase
  • a multiple cloning site comprising three restriction sites is located or. ⁇ J downstream of the sequence encoding the phosphorylation site.
  • a nucleic acid sequence encoding a selected polypeptide may be inserted into the vector using one or more of these sites.
  • the selected polypeptide is expressed as a GTK-fusion
  • a modified fusion protein Upon expression in a suitable host cell, the protein of interest is produced as a fusion protein.
  • the fusion protein can be captured on a suitable affinity matrix by virtue of an affinity ligand, which interacts reversibly with the matrix. Modification of the fusion protein can be carried out while the protein is attached to the matrix. Subsequently, the modified fusion protein may be isolated for use by releasing the fusion protein from the affinity matrix with a suitable agent.
  • the fusion protein may be cleaved in vitro to free the modified polypeptide portion, and the affinity ligand portion or any uncleaved product can be removed by adsorption on the appropriate affinity matrix.
  • the modified polypeptide portion can be released from the affinity ligand portion by cleaving the modified fusion protein at the cleavable linker while still bound to the column.
  • Modified proteins of the present invention are useful in a variety of applications.
  • Labeled (e.g., radiolabeled) proteins may be used as molecular probes.
  • antibodies can be labeled for therapeutic, diagnostic (e.g., imaging) or research purposes.
  • Proteins may be labeled and used as reagents to quantitate or identify an interacting protein, such as a receptor.
  • Figure 1 shows a portion of the nucleotide sequence around the kinase recognition site of expression vector pGEX-2TK.
  • the portion of the pGEX-2TK sequence introduced by the synthetic duplex is indicated by italics.
  • the sequences which encode the linker cleavable by thrombin (Leu-Val-Pro-Arg- Gly-Ser) , the downstream kinase recognition site (Arg-Arg-Ala-Ser-Val) and multiple cloning site with Ba HI, Smal and EcoRI sites, are shown.
  • the arrow indicates the point of thrombin cleavage.
  • Figure 2 is an illustration of the structure of expression vector pAR( ⁇ RI)59/60.
  • the general structure of the plasmid is shown at top. bla, indicates the / 9-lactamase gene which confers ampicillin resistance; ori, indicates the origin of replication.
  • bla indicates the / 9-lactamase gene which confers ampicillin resistance
  • ori indicates the origin of replication.
  • the center panel the general structure of FEK-fusion proteins is illustrated.
  • the FLAG peptide portion comprising the FLAG epitope and enterokinase cleavable linker, is indicated.
  • HMK indicates the location of.the phophorylation site.
  • Protein indicates the location of the selected polypeptide.
  • the lower panel shows a more detailed illustration of the N-terminal region of the vector.
  • the peptide sequence shown (Met-Asp-Tyr-Lys-Asp-Asp- -Asp-Asp-Lys-Ala-Arg-Arg-Ala-Ser-Val-Glu-Phe-) in the detail is a contiguous sequence.
  • the extent of the FLAG peptide, the enterokinase cleavage site, and HMK recognition (phosphorylation site) are indicated.
  • Figure 3 is a bar graph showing the effect on
  • S U BS TITUTESHEET different fusion proteins, comprising portions of the retinoblasto a susceptibitliy gene product, RB. Proteins were phosphorylated in vitro using cAMP-dependent protein kinase. The hatched bars indicate that the fusion protein (GST-RB379-792 or GST-RB379-792;pm706) was expressed from pGEX-2TK and contained an HMK sequence, while the solid bars indicate that the fusion protein (GST-RB379-792 or GST-RB379-792;pm706) was expressed from pGEX-2T and did not contain an HMK sequence.
  • orientation of particular amino acid sequences is such that the amino end is on the left and the carboxyl end is on the right.
  • the present invention relates to expression vectors comprising nucleic acid sequences which encode an affinity ligand and a modification recognition sequence.
  • the vectors further comprise at least one restriction site for the insertion of a nucleic acid sequence capable of encoding a selected polypeptide.
  • the expression vectors may further comprise a nucleotide sequence encoding a cleavable linker sequence. These vectors are referred to as parent vectors.
  • the present invention further relates to expression vectors which are derived from the parent vectors described above, by the insertion of a nucleic acid sequence capable of encoding a selected polypeptide (e.g., a natural or synthetic cDNA or
  • SHEET geno ic DNA which encodes the selected polypeptide and is capable of being expressed in the appropriate host cell) into the parent vector.
  • the parent vector can be cleaved at one or more restriction sites in order to insert sequences encoding a selected polypeptide using known techniques. Linkers or other sequences can be used to facilitate insertion. Insertion at an appropriate restriction site or site in the parent vector results in the in frame fusion of the sequences of the selected polypeptide with the amino acid sequences for an affinity ligand, modification recognition sequence and, if present, the optional cleavable linker encoded by the parent vector.
  • the resulting fusion gene encoded by the vector codes for a fusion protein comprising an affinity ligand, a modification recognition sequence, selected polypeptide, and optionally, a cleavable linker.
  • the location of the restriction site or sites selected for insertion in the vector will determine the location of the selected polypeptide relative to the affinity ligand, modification recognition sequence and optional cleavable linker in the encoded fusion protein.
  • the selected polypeptide element of the fusion protein comprises at least one peptide, polypeptide or protein of interest.
  • the fusion protein comprising an affinity ligand, a modification recognition sequence, an optional cleavable linker and a selected polypeptide forms a conti.guous polypepti.de chai.n.
  • the order of these components in the fusion gene and protein can vary. The location of these components or additional
  • B S TITUTESHEET components in the fusion gene and protein is determined by the order in which the nucleic acid sequences encoding the components are linked in the vector or the parent vector.
  • fusion proteins of the following structure can be made, where A indicates the affinity ligand, M indicates the modification recognition sequence, C indicates the optional cleavable linker, and X indicates the polypeptide of interest, and the N-terminal portion of the fusion is on the left: A-(C)-M-X A-M-(C)-X X-(C)-M-A X-M-(C)-A M-A-(C)-X
  • the cleavable linker is preferably located between the affinity ligand and selected polypeptide portions of the fusion protein.
  • intro ⁇ duction of a selected polypeptide in frame is simplified by a terminal location for the polypeptide (X) . This location can also minimize the effect of fusion on the biological activity (e.g., binding activity, an igenicity, enzymatic activity) of the selected polypeptide.
  • One or more components can be duplicated (e.g., X-M-C-A-X) .
  • multiple modification sites can be incorporated to increase the intensity of
  • HEET labeling These sites may be contiguous or noncontiguous in the encoded fusion protein.
  • additional sequences can be present in the fusion protein.
  • a sequence encoding a signal peptide or leader peptide may be incorporated into the vector, and the signal peptide will be produced as part of the fusion protein.
  • the additional sequence is preferably incorporated at the N-terminus (e.g., S-X-M-C-A) .
  • an internal or C-terminal location in the fusion protein may be desired.
  • the parent vector may also produce the affinity ligand, modification sequence, and optional cleavable linker sequences as a fusion protein.
  • This product may contain additional sequences encoded by the vector, depending on the location of a promoter or termination signals relative to the coding sequences for these elements.
  • restriction sites for insertion of sequences encoding a selected polypeptide may disrupt the reading frame of components encoded by the parent vector. Insertion of sequences encoding a selected polypeptide will restore the reading frame.
  • the expression vectors of the present invention can be designed for use in a variety of host cells, including bacterial host cells such as E. coli and eukaryotic host cells (e.g., yeast cells and mammalian cells) .
  • bacterial host cells such as E. coli
  • eukaryotic host cells e.g., yeast cells and mammalian cells
  • fusion proteins can be glycosylated.
  • yeast cells and mammalian cells e.g., yeast cells and mammalian cells
  • a promoter is provided for expression of the fusion protein in a suitable host cell.
  • Suitable promoters can be constitutive or inducible.
  • the promoter is operably linked to nucleic acid sequences encoding the fusion protein and is capable of directing the expression of the corresponding polypeptide.
  • suitable promoters for procaryotic e.g., lac promoter, tac promoter
  • eukaryotic hosts e.g. , yeast alcohol dehydrogenase (ADH1) , SV40
  • the expression vectors typically comprise a selectable marker for selection of host cells carrying the plasmid and an origin of replication, in the case of a replicable expression vector.
  • Genes encoding products which confer antibiotic resistance are common selectable markers and may be used in prokaryotic (e.g. , ⁇ -lactamase for ampicillin resistance, tetracycline resistance) and eukaryotic cells (e.g. , G418) .
  • Genes encoding the gene product of auxotrophic markers e.g., LEU2 and URA3 are commonly used as selectable markers in yeast.
  • Use of viral or phage vectors, and vectors which are capable of integrating into the genome of the host cell, such as retroviral vectors, are also contemplated.
  • the present invention also relates to cells carrying these types of expression vectors (e.g., transformed cells).
  • HEET e.g., radioactive, fluorescent
  • the invention further relates to the fusion proteins and modified (e.g., labeled) fusion proteins produced from expression vectors of the present invention. It is also possible to produce fusion proteins of the present invention using methods of peptide synthesis, or in vitro transcription/translation procedures. Modification of synthetic fusion proteins is also possible.
  • the affinity ligand present in the fusion proteins of the present invention is a polypeptide encoding an affinity ligand.
  • the affinity ligand i.e., an affinity ligand or portion thereof
  • affinity ligands include, but are not necessarily limited to, antibodies or portions thereof, antigens (e.g., influenza hemagglutinin) or epitopes (e.g., FLAG epitope), enzymes (e.g., glutathione-S-transferase (GST), ⁇ -galactosidase (lacZ) , the trpE product) , hormones, growth factors or other proteins capable binding to a specific binding partner (e.g., maltose binding protein, histidine hexamer (a heavy metal binding element) , and protein A) .
  • a specific affinity ligand is an affinity ligand comprising that protein or peptide or a portion thereof.
  • a glutathione-S-transferase affinity ligand is an affinity ligand comprising that protein or peptide or a portion thereof.
  • a glutathione-S-transferase affinity ligand is an affinity ligand comprising that protein
  • S UBSTITUTESHEET affinity ligand comprising glutathione-S-transferase or a portion thereof, which is capable of binding the specific binding partner (e.g., a substrate).
  • the affinity ligand portion of the fusion protein is 05 useful for a variety of purposes, such as providing a moiety for attachment to a support or to facilitate purification or identification.
  • the fusion proteins can be captured on an appropriate affinity matrix.
  • An 1° affinity matrix is a solid support to which is attached (preferably covalently) a specific binding partner.
  • Fusion proteins of the present invention comprising an affinity ligand, can bind to a specific binding partner via the affinity ligand part of the 15 fusion protein.
  • an affinity ligand which is an antigen an antibody or portion thereof can be used as a specific binding partner in an affinity matrix.
  • an antigen or hapten may be incorporated into an affinity matrix, and used 20 to capture fusion proteins comprising an antibody as an affinity ligand.
  • affinity ligand/affinity matrix pairs are available.
  • Glutathione-S-transferase fusion proteins may be captured on immobilized glutathione as an affinity 25 matrix, with glutathione as the specific binding partner.
  • glutathione sepharose Pharmacia
  • glutathione agarose beads Sigma Chemical Corp.
  • Fusion proteins comprising a protein A, maltose binding protein, FLAG
  • J ⁇ n ⁇ pepti.de, or a hi.sti.di.ne hexa er affi.ni.ty li.gand can be captured on IgG Sepharose 6FF (Pharmacia) , anylose resin (New England Biolabs) , anti-FLAG Ml antibody or anti-FLAG M2 antibody affinity resins (International Biotechnologies, Inc.), or a Ni 2+ affinity resin (NTA resin, Qiagen) , respectively.
  • IgG, amylose, anti-FLAG antibodies, or a heavy metal are, respectively, the specific binding partners.
  • a fusion protein or modified (e.g. labeled) fusion protein bound to an affinity matrix can be released by contacting the affinity matrix with bound fusion protein thereto with a suitable elution buffer comprising one or more release components.
  • the release component or components can be molecules which compete with the fusion protein for binding to the affinity matrix (e.g., hapten, free peptide epitopes, substrate or substrate analogs) , or which can disrupt binding of the affinity ligand to the specific binding partner.
  • an elution buffer (a buffered solution) suitable for releasing fusion proteins comprising a glutathione-S-transferase affinity ligand can be formulated comprising reduced glutathione as a release component.
  • an elution buffer comprising reduced glutathione, Tris and NaCl is used.
  • the elution buffer may comprise a buffered solution which lacks a specific component required for binding.
  • a fusion protein with an ompA signal sequence followed by a FLAG affinity ligand at the amino terminus can be expressed in E. coli. Specific removal of the ompA signal sequence upon secretion into the periplasmic space results in a fusion protein with an
  • STITUTESHEET capable of binding to the anti-FLAG Ml antibody in the presence of calcium.
  • This type of fusion protein can be eluted from an anti-FLAG Ml antibody affinity matrix using an elution buffer which lacks calcium, a specific component required for binding.
  • the expression vectors of the present invention can further comprise a sequence encoding a cleavable linker.
  • a nucleotide sequence capable of encoding such a cleavage site can be incorporated into an expression vector using known techniques (e.g. , recombinant DNA and/or mutagenesis) .
  • a cleavable linker is a peptide or polypeptide capable of being cleaved by a site specific protease. For example, a thrombin cleavage (e.g., Leu-Val-Pro-Arg-Gly-Ser) , factor Xa cleavage site (e.g.
  • enterokinase cleavage site e.g., Asp-Asp- Asp-Asp-Lys
  • enterokinase cleavage site e.g., Asp-Asp- Asp-Asp-Lys
  • the appropriate protease thrombin, factor Xa and enter ⁇ okinase, respectively
  • Cleavage by a protease can occur within a cleavable linker or at the border of the linker and another component of a fusion protein.
  • the product of the cleavage reaction of a selected fusion protein comprising the majority of the affinity ligand is referred to as the affinity ligand portion
  • the product of the cleavage reaction comprising the selected polypeptide is referred to as the selected polypeptide portion of the fusion protein.
  • These portions may comprise other parts of the encoded fusion protein (i.e., the
  • T fusion protein encoded by the expression vector and such are also fusion proteins.
  • the selected polypeptide portion can include the modification recognition sequence. If so, and if the fusion protein has been modified (e.g., labeled), then the selected polypeptide portion is referred to as the labeled selected polypeptide portion.
  • the term fusion protein refers to a protein, polypeptide, or glycoprotein comprising at least two of the components selected from the group consisting of an affinity ligand, modification sequence, cleavable linker sequence, selected polypeptide or additional sequence.
  • a cleavable linker is selected which does not cleave elsewhere within the fusion protein (e.g., in the selected polypeptide, affinity ligand or modification site) .
  • the sequences encoding the cleavable linker are preferably located between those sequences encoding the affinity ligand and selected polypeptide. In this location, the encoded fusion protein can be cleaved to separate the affinity ligand portion of the fusion protein from the selected polypeptide portion where desired.
  • sequences encoding the cleavable linker will be on located in the vector either upstream or downstream of the sequences encoding both the modification recognition sequence and the selected polypeptide (e.g., as in A-C-M-X or X-M-C-A fusions) .
  • Overlap of the sequences of portions of the fusion protein may occur provided the respective portions retain function. For example, in one
  • SUBS TITUTESHEET embodiment of the present invention pAR( ⁇ Rl)59/60, discussed in more detail below, the affinity ligand (FLAG epitope) sequence and cleavable linker sequence (enterokinase site) overlap by one amino acid.
  • the sequence of the FLAG epitope from N- to C-terminus is (Asp-Tyr-Lys-Asp)
  • the enterokinase cleavage site in this embodiment is (Asp-Asp-Asp-Asp-Lys) .
  • these sequences overlap to give the FLAG peptide, an octapeptide of the sequence Asp-Tyr-Lys-Asp-Asp-Asp-Lys, which retains the functions of the affinity ligand (e.g. , binding to a specific binding partner) and cleavable linker (e.g., cleavage by enterokinase) .
  • affinity ligand e.g. , binding to a specific binding partner
  • cleavable linker e.g., cleavage by enterokinase
  • a modification recognition sequence is also pro ⁇ vided. This sequence, incorporated into the fusion protein, directs modification of the fusion protein. Modification recognition sequences can be incorporated into a fusion protein comprising a selected polypeptide which either is naturally modified or is not naturally modified. A modification recognition sequence for phosphorylation (i.e., a phosphorylation site) can be introduced into a vector and expressed as part of a fusion protein, for example. In the presence of a suitable protein kinase, the fusion protein comprising the phosphorylation (kinase) site will be phosphorylated. Other types of modifications directed by the presence of a peptide or polypeptide sequence are envisioned in the present invention.
  • glycosylation reactions can be directed by a short peptide sequence.
  • fatty acylation of some proteins is directed by a short peptide sequence (Cys-Ala-Ala-X) .
  • the identification of additional recognition sites and the identification and purification of the corresponding modification enzyme or enzyme complex will provide alternative protocols for modification of the fusion proteins.
  • Modification of the fusion proteins is carried out using a modification enzyme or enzyme complex or other suitable means.
  • a suitable phosphorylation method is used.
  • the phosphorylation of proteins is carried out by a protein kinase.
  • Many such kinases have been described and have utility in the present invention (see e.g., Kemp, B.E., et al. , J. Biol. Che . , 252: 4888-4894 (1977); Edelman, A.M. et al. , Ann. Rev. Biochem. 56: 567-613 (1987); Glass, D.B. and E.G. Krebs, Ann. Rev. Pharmacol. Toxicol.
  • CMOS complementary metal-oxide-semiconductor
  • CMOS complementary metal-oxide-semiconductor
  • cAMP cyclic AMP
  • cGMP cyclic GMP
  • S UBSTITUTESHEET kinases A variety of serine/ threonine kinases have been identified including glycogen synthase kinase, phosphorylase kinase, casein kinase I casein kinase II, pyruvate dehydrogenase kinase, protein kinase C and myosin light chain kinase.
  • amino acid sequences present in natural substrates and artificial peptide substrates which are sufficient for activity as a protein kinase substrate have been identified (see e.g. , Edelman, A.M. et al., Ann. Rev. Biochem. 56: 567-613 (1987); Glass, D.B. and E.G. Krebs, Ann. Rev. Pharmacol. Toxicol. 20: 363-388 (1980) ; Hunter, T. and J.A. Cooper, Ann. Rev. Biochm. 54: 897-930 (1985)).
  • These amino acid sequences, in addition to those described below, and related amino acid sequences which can be specifically phosphorylated, can be used as modification recognition sequences in vectors of the present invention.
  • a corresponding protein kinase capable of phosphorylating a polypeptide comprising the selected kinase recognition sequence, can be used to phosphorylate fusion proteins of the present invention.
  • the cAMP-dependent protein kinase e.g. , the catalytic subunit of the cAMP-dependent protein kinase from bovine heart muscle
  • Xaa is an amino acid, in a variety of substrates, resulting in phosphorylation of serine.
  • expression vectors of the present invention e.g., PGEX-2TK derivatives, or pAR( ⁇ RI)59/60 derivatives
  • a sequence which encodes a version of the cAMP-dependent protein kinase recognition sequence can direct the production of fusion proteins capable of being phosphorylated by cAMP-dependent protein kinase from bovine heart muscle.
  • the Arg-Arg-Ala-Ser-Val sequence is also referred to as the HMK site (HMK, heart muscle kinase) .
  • Other specific amino acid sequences such as Arg-Arg-Ala-Ser-Leu (Li et al. , Proc. Natl.
  • cGMP protein kinases have a substrate specificity that is similar, but not identical, to that of the cAMP-dependent protein kinases. Comparative analysis of substrates has been made (Edelman, A.M. et al. , Ann. Rev. Biochem. 56: 567-613, (1987); Glass, D.B. and E.G. Krebs, Ann. Rev. Pharmacol. Toxicol.
  • casein kinase II was shown to phosphorylate the synthetic peptide Ser-Glu-Glu-Glu-Glu. Additional peptide substrates were phosphorylated by casein kinase II (e.g., in decreasing order of activity,
  • Candidate phosphorylation sites can be incorporated into synthetic peptides or into fusion proteins and assayed for activity as protein kinase substrates, using techniques similar to those described previously. Because corresponding phosphatases exist, enzymatic removal of phosphate groups is also possible.
  • the ability to modify the fusion proteins provides a convenient method of specifically labeling the fusion proteins with detectable radioactive or non-radioactive labels to produce a labeled fusion protein (e.g., a [ 32P]-labeled fusion protein).
  • a labeled fusion protein e.g., a [ 32P]-labeled fusion protein
  • Cleavage of fusion proteins can produce a selected polypeptide portion comprising a modification sequence or an affinity ligand portion comprising a modification sequence, depending on the location of the cleavable linker in relation to these components.
  • cleavage following labeling can produce a labeled selected polypeptide portion or a labeled affinity ligand portion, each of which is itself a fusion protein. Because the modification recognition sequence directs modification to a specific site in the protein, there is a high degree of control over the location and extent of modification. As shown in
  • the detectable label (e.g., radioactive, fluorescent or chemiluminescent) selected will determine by the nature of modification, the modification enzyme, and the intended use.
  • the label will be incorporated into a moiety which is transferred to the fusion protein substrate.
  • [ -labeled]ATP is used as phosphate donor (i.e., modification donor) .
  • protein kinases transfer the 7 -phosphate onto the substrate during the phosphorylation reaction, the label will be incorporated into the moiety transferred by the protein kinase.
  • a radioactive phosphate label such as
  • P, P or P, phosphate donors such as [ 7 P]ATP
  • [7 32P]ATP, or [ 7 33PJATP can be used.
  • isotopes of sulfur such as 35S or 38S isotopes, can be incorporated as the label in the [ 7 -labeled]ATP, as for example in 35S-labeled adenosme
  • S UBSTITUTESHEET In other types of modifications (e.g., glyco- sylation, fatty acylation) , a radioactive isotope, chemiluminescent or fluorescent dye, or other nonradioactive label, could be incorporated into the modification donor so that it is transferred to the fusion protein during the modification reaction.
  • a biotin adduct of a modification donor could be linked to a fusion protein.
  • the site of addition of the biotin label is controlled by the location of the modification recognition sequence.
  • GST genes from other sources, such as other bacteria or mammalian organisms, can be used.
  • a portion of a GST gene encoding a portion of GST capable of binding a specific binding partner, such as the substrate glutathione can be used.
  • expression vectors capable of expressing S. japonicum GST-fusion proteins i.e., a fusion protein comprising a GST affinity ligand
  • the parent vector is named pGEX-2TK, and vectors derived from pGEX-2TK by the insertion of a nucleotide sequence encoding a selected polypeptide, are referred to with the prefix pGTK-.
  • pGEX-2TK and pGTK- plasmids are described in Examples 1 and 2 and in Figure 1.
  • PGEX-2TK is a derivative of pGEX-2T, the latter which is described by D.B. Smith in EP 0,293,249, published November 30, 1988, PCT/AU88/00164, published December 1, 1988, and New Zealand Patent No. 224,663, issued November 27, 1990, and by D.B. Smith and K.S. Johnson in Gene 67: 31-40 (1988) .
  • pGEX-2TK encodes a fusion protein having GST as an affinity ligand at the amino terminus, followed by a thrombin cleavage site (cleavable linker) .
  • a modification recognition sequence for a protein kinase was incorporated downstream of the cleavable linker.
  • a sequence of the structure Arg-Arg-Xaa-Ser-Xaa where Xaa is an amino acid, was selected (Arg-Arg-Ala-Ser-Val) .
  • This sequence is a phosphorylation site recognized by cAMP-dependent protein kinases such as the catalytic domain of the cAMP-dependent protein kinase from bovine heart muscle.
  • cAMP-dependent protein kinases such as the catalytic domain of the cAMP-dependent protein kinase from bovine heart muscle.
  • pGEX-2TK the sequence which encodes the thrombin cleavage site is followed by several restriction sites for the insertion of sequences encoding a selected polypeptide.
  • pGEX-2TK retains the inducible tac promoter for expression, three in frame stop codons, a selectable marker (ampicillin resistance) , origin of replication, and laclq gene present in pGEX-2T.
  • a GTK-fusion protein comprising the selected polypeptide can be produced.
  • the GTK-fusion protein (where, from N- to C-terminus, G is a GST affinity ligand, T is a thrombin cleavage site, and K is a phosphorylation site for a protein kinase) is encoded by a pGTK-vector, can be produced upon expression in a suitable E. coli host.
  • These fusion proteins can be captured on immobilized glutathione, and labeled by phosphorylation with cAMP-dependent protein kinase, using [ 7 - P]ATP as a modification donor.
  • Examples 1-4 several different cDNAs were expressed from pGEX-2TK, and labeled as described.
  • the studies described in Examples 1-4 indicate that a selected polypeptide present in a GTK-fusion protein, encoded by a pGTK-type vector, such as pGTK-RB
  • the FLAG epitope is selected as the affinity ligand.
  • Vector pAR( ⁇ RI)59/60 shown in Figure 2, is an example of this type of expression vector.
  • Vector pAR( ⁇ RI)59/60 is a derivative of bacterial expression plasmid pET3a (pAR3040) described by Studier et al. , Meth. Enzymo1. 185:60-89 (1990), the teachings of which are herein incorporated by reference.
  • the construction of pAR( ⁇ RI)59/60 is described in detail in Example 5.
  • the FLAG peptide is located adjacent to the N-terminal initiator methionine ( Figure 2) .
  • the FLAG octapeptide (Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys) comprises the FLAG epitope (Asp-Tyr-Lys-Asp) affinity ligand and an overlapping enterokinase cleavable linker (Asp-Asp-Asp-Asp-Lys) .
  • Cleavage with enterokinase typically occurs precisely after the terminal lysine (underlined above) of the FLAG octapeptide.
  • the FLAG octapeptide is followed by a kinase recognition site.
  • the site is Arg-Arg-Ala-Ser-Val (HMK) , recognized by cAMP- dependent protein kinase.
  • a unique EcoRI restriction site located downstream of the sequence encoding the phosphorylation site can be used for the insertion of a sequence encoding a selected polypeptide.
  • the encoded fusion protein In plasmids derived from pAR( ⁇ RI)59/60, referred to with the prefix pFEK- F for FLAG, E for enterokinase, and K for the phosphorylation recognition site) , in which a sequence encoding a selected polypeptide has been inserted, the encoded fusion protein, referred to herein as a FEK-fusion protein, has the following general structure for N- to C-terminus: Met-[FLAG/ enterokinase cleavable linker]-phosphorylation site-selected polypeptide. An alanine residue is located between the FLAG peptide and HMK site (see Figure 2) . In addition, the EcoRI site encodes a Glu-Phe dipeptide.
  • additional residues may be inserted between the selected polypeptide and the modification recognition (HMK) site.
  • HMK modification recognition
  • SUBSTITUTESHEET pFEK-plasmids can direct the expression of FLAG- fusion proteins (i.e., fusion proteins having a FLAG epitope affinity ligand) from the T7 polymerase promoter in an appropriate host (e.g., a bacterial cell capable of constitutive or inducible expression of the T7 polymerase; for examples of suitable hosts and induction protocols, see Studier et al. , Meth. Enzymol. 185: 60-89 (1990) ) .
  • an appropriate host e.g., a bacterial cell capable of constitutive or inducible expression of the T7 polymerase; for examples of suitable hosts and induction protocols, see Studier et al. , Meth. Enzymol. 185: 60-89 (1990)
  • the construction of several pFEK-plasmids is described in Example 5.
  • the encoded fusion proteins were expressed in a bacterial host, and a lysate was prepared. Partially purified fractions
  • FEK-fusion proteins were labeled to high specific activity to give [ 32P]-labeled FEK-fusion proteins (which are also [ 32P]-labeled FLAG-fusion proteins) .
  • vectors of the present invention related to pGTK- and pFEK- vectors can be constructed lacking the cleavable linker sequence. These vectors would have a pGK- or pFK- prefix.
  • a fusion protein comprising a selected polypeptide expressed from a pGK- or pFK-vector is referred to as pGK-fusion protein or pFK-fusion protein, respectively.
  • a variety of commercially available vectors comprising an affinity ligand and cleavable linker are available which can be modified by the introduction of a modification site (e.g. , a
  • the pGEX-3X vector comprising a GST affinity ligand and Factor Xa cleavable linker
  • the pMAL-C vector comprising a malB affinity ligand and Factor Xa cleavable linker
  • the protein A gene fusion expression vectors pRIT2T and pRIT5 (Pharmacia) , comprising a protein A affinity ligand, (5, 6) as well as pDS and pQE-vectors (Qiagen) , comprising a histidine hexamer affinity ligand, could be modified in this way.
  • vector pRIT2T vector pRIT5
  • vector pRIT5 vector pRIT5
  • pDS vector pQE-vectors
  • pQE-vectors could be further modified by the insertion of a sequence encoding a cleavable linker.
  • Convenient affinity matrices for fusion proteins encoded by the foregoing were discussed above.
  • the present invention further relates to methods for producing a modified fusion protein.
  • a rapid and convenient method for labeling a fusion protein using a radiolabel or non-radioactive label donated by [7-labeled]ATP is provided.
  • the methods used are mild; a feature which is important for preservation of the structure and biological activity (e.g., binding, antigenicity, activity) of the components of the fusion protein, and of the selected polypeptide in particular.
  • fusion proteins of the present invention comprising an affinity ligand portion and a selected polypeptide portion are expressed in a host cell carrying a vector of the present invention.
  • the host cells are propagated under conditions which permit expression of the vector.
  • expression from the particular pGTK- and pFEK- vectors described in the Examples requires induction by IPTG.
  • the host cells are lysed using known techniques to obtain a lysate containing the fusion protein.
  • the lysate can be crudely fractionated as in Example 5 and directly labeled in the presence of [ 7 ⁇ labeled]ATP (e.g. ,[ 7 - 32P]ATP) , and a cAMP-dependent protein kinase, such as the catalytic subunit of the cAMP-dependent protein kinase from bovine heart muscle.
  • [ 7 ⁇ labeled]ATP e.g. ,[ 7 - 32P]ATP
  • a cAMP-dependent protein kinase such as the catalytic subunit of the cAMP-dependent protein kinase from bovine heart muscle.
  • a suitable formulation for a kinase reaction buffer and reaction stop buffer is given below.
  • the fusion protein is modified (e.g., phosphorylated) while bound to an affinity matrix. The fusion protein present in the lysate is captured on an affinity matrix.
  • This step is carried out by contacting the lysate with an appropriate affinity matrix (i.e., an affinity matrix comprising a specific binding partner of the affinity ligand present in the fusion protein) , under conditions which permit binding of the affinity ligand portion of the fusion protein to the affinity matrix.
  • an appropriate affinity matrix i.e., an affinity matrix comprising a specific binding partner of the affinity ligand present in the fusion protein
  • Suitable conditions for a variety of affinity ligands and matrices are known in the art.
  • GST-fusion proteins e.g., a GTK-fusion
  • the GST-fusion protein can be captured on the affinity matrix in the presence of a wash buffer, (which was also used as the lysis buffer) .
  • the wash buffer permits binding of the GST portion of the fusion proteins to the matrix via the specific binding partner (glutathione) .
  • the wash buffer comprises a buffer such as Tris, salt (e.g., NaCl), a chelator (e.g., EDTA), and a non-ionic detergent such as nonidet P-40.
  • FEK-fusion proteins can be captured on an anti-FLAG antibody affinity matrix.
  • anti-FLAG M2 antibody can be used as the specific binding partner.
  • Suitable conditions and formulations (e.g., for wash buffer and elution buffer) for anti-FLAG M2 affinity chromatography can be obtained from International Biotechnologies, Inc.
  • the fusion proteins can then be washed in order to remove unbound material (e.g., contaminating proteins other than the fusion protein) .
  • the wash buffers described above are suitable for this purpose.
  • the affinity matrix with bound fusion protein attached is then equilibrated (contacted) with a reaction buffer suitable for the modification reaction.
  • a reaction buffer suitable for the modification reaction for example, a kinase buffer such as HMK buffer is
  • HMK reaction buffer suitable for use with the cAMP-dependent protein kinase from bovine heart muscle (HMK) , comprises (1) a buffering agent such as Tris, (2) a reducing agent such as dithiothreitol (DTT) , (3) a salt such as NaCl, and (4) a source of magnesium ions or other appropriate ion, such as MgCl .
  • DTT dithiothreitol
  • MgCl magnesium magnesium ions or other appropriate ion
  • a protein kinase prepara- tion comprising the HMK enzyme is added, and [ 7 - labeled]ATP.
  • the protein kinase will be diluted in a suitable solution prior to addition.
  • a variety of fusion proteins were efficiently radiolabeled by phosphorylation using HMK reaction buffer, the catalytic subunit of the HMK and
  • the conditions appropriate for phosphorylation of the bound fusion protein to produce a bound labeled (e.g., radiolabeled) fusion protein will vary with the protein kinase selected. Typically, the steps subsequent to culturing the host cell are carried out at 4 °C. However, phosphorylation reactions with the catalytic subunit of the cAMP-dependent protein kinase from bovine heart muscle, have previously been carried out at 37 °C.
  • phosphorylation can be carried
  • efficient phosphorylation is carried out at 4 °C. This temperature can preserve the activity of fusion proteins, and of the selected polypeptide portion in particular.
  • the reaction can be quenched by addition of a stop buffer comprising a component capable of inhibiting the modification reaction.
  • the component capable of inhibiting the reaction will vary with the modification enzyme, but can include inhibitors (e.g., reaction products, competitors), chelators, or other agents which stop the reaction. It can be desirable to stop a reaction when, for example, in subsequent steps, undesired modifications can occur. However, wash steps can serve to remove modification enzymes.
  • an HMK stop buffer in a phosphorylation reaction with cAMP-dependent protein kinase, can be added, comprising a buffer such as sodium phos- phate, a reaction inhibitor such as sodium pyrophos- phate, and a chelator such as EDTA.
  • HMK stop buffer can optionally contain a carrier such as bovine serum albumin or glycerol.
  • sodium pyrophosphate inhibits the kinase reaction.
  • chelating Mg 2+ i.ons i.nhi.bi.ts the reaction chelating Mg 2+ i.ons i.nhi.bi.ts the reaction.
  • An advantage of the present method is that unincorporated label is easily removed by washing the affinity matrix with bound labeled fusion protein.
  • the wash buffers described above, which permit binding of the affinity ligand to the affinity matrix are described above, which permit binding of the affinity ligand to the affinity matrix
  • SU BSTITUTESHEET via the specific binding partner can be used in this step.
  • Other methods for labeling proteins with the HMK enzyme have relied upon extensive dialysis to remove unincorporated label (e.g., Zhao, X.-X. et al., Analyt. Biochem. 178: 342-347 (1989)).
  • the removal of unincorporated label by washing is advantageous because extensive dialysis can compromise the function of some proteins and results in the production of large quantities of contaminated dialysis buffer.
  • the modified fusion protein may be isolated for use by releasing the fusion protein from the affinity matrix by washing with a suitable elution buffer.
  • Elution buffers were discussed above.
  • a labeled GST-fusion protein such as a GTK-fusion protein can be eluted from immobilized glutathione with an elution buffer comprising glutathione (e.g., reduced glutathione).
  • an elution buffer comprising reduced glutathione, a buffer such as Tris, and a salt such as NaCl is used to release labeled GST-fusion proteins from the affinity matrix.
  • the fusion protein can be cleaved in vitro to free the modified (e.g., labeled) polypeptide portion.
  • Cleavage is accomplished by contacting the fusion protein with an appropriate specific protease under conditions which permit the cleavage reaction. Such conditions for cleavage by enterokinase, thrombin and factor Xa, for example, are known.
  • the affinity ligand portion or any uncleaved product can be removed by adsorption on the appropriate affinity matrix in the presence of wash buffer for example.
  • the modified or labeled selected polypeptide portion can be recovered in the supernatant.
  • the modified polypeptide portion can be released from the affinity ligand portion by cleaving the modified fusion protein at the cleavable linker.
  • An example protocol for cleavage by a specific protease on a column is provided in Abath, F. and A. Simpson, Biotechniques 10: 178 (1991), the teachings of which are herein incorporated by reference.
  • a kit for preparing a labeled fusion protein comprising (1) an affinity matrix to which a portion of the fusion protein can bind, (2) a wash buffer which permits binding of the fusion protein to the affinity matrix, (3) a modification reaction buffer such as a protein kinase reaction buffer, and (4) a modification enzyme preparation such as a protein kinase preparation.
  • the kit can contain a reaction stop buffer or an elution buffer comprising a release component. The nature of these elements has been explained above in more detail.
  • a kit can comprise a vector of the present invention (e.g. , pGEX-2TK or
  • the vector can be present in a suitable host cell or free from the host cell.
  • the kit comprises each of the elements (1) through (4) of the kit described above, and if desired, a reaction stop buffer.
  • a modification donor such as 7-labeled ATP can also be included in the kit.
  • a protein kinase preparation comprising a protein kinase.
  • the kinase may be in the form of a precipitate or solution.
  • the kinase can be dissolved or diluted as necessary prior to use.
  • the affinity matrix can be present in a suitable containing means (e.g., a vial or cartridge).
  • the affinity matrix is provided in a cartridge unit, the combination of which acts as a column.
  • Various components e.g., the lysate, wash buffers, label
  • immobilized glutathione can be incorporated into a cartridge unit for capture of GST-fusion proteins.
  • Vectors of the present invention can direct the expression of large amounts of a fusion protein in a host cell. These fusion proteins can be conveniently labeled using the methods described, and have many uses in research, diagnostic and therapeutic applications.
  • radiolabeled proteins are useful in imaging.
  • radiolabeled fusion protein or radiolabeled selected polypeptide portion can be used to locate cells carrying a ligand recognized by the selected polypeptide (e.g., a receptor, cell surface component, or antigen) .
  • a growth factor can be used as the selected polypeptide, and a labeled fusion protein of the present invention comprising the growth factor can be used to detect cells carrying a receptor for the growth factor.
  • incorporation of a suitable radiolabel permits the use of fusion proteins of the present invention in targeted radiotherapy.
  • the selected polypeptide e.g., a hormone, growth factor, antibody
  • the selected polypeptide can seek out specific cells and damage or destroy them due to the attached radiolabel.
  • An isotope with the desired energy and half-life can be selected for this purpose.
  • Other labels incorporated by methods of the present invention and capable of killing cells can also be used.
  • a labeled antibody or portion thereof e.g., F-, , Fab, etc.
  • fusion proteins would be useful for many other applications as well.
  • antibody as used herein refers to antibodies such as chimeric antibodies, single chain antibodies, bifunctional antibodies and other antibody variants, including individual chains (e.g. , a heavy chain) .
  • antibody chains are expressed as fusion proteins of the present invention in E. coli.
  • a signal sequence e.g., ompA, phoA
  • the signal sequence is cleaved from the fusion protein, freeing the amino terminus of the antibody chain, and preserving the binding function of the variable region- Fusion genes encoding both chains can be expressed in this manner from two vectors or a single vector encoding two fusion genes.
  • either the heavy or light chain could be expressed as a fusion protein of the present invention and the complementary chain could be expressed in the same cell using typical antibody expression vectors.
  • labeled fusion proteins of the present invention can be used as probes.
  • labeled fusion proteins such as 32P-GTK-fusion proteins, can be used as probes for screening expression libraries for proteins capable of binding the selected polypeptide.
  • a fusion protein of the present invention comprising a selected polypeptide is expressed, purified by virtue of the affinity ligand, and specifically labeled at the modification recognition sequence.
  • the labeled, homogeneous fusion protein is then incubated with filters on which proteins expressed by individual plaques of a target library have been immobilized. Following this hybridization step, the filters are washed, and processed for identification (i.e., by detection of the label) of plaques which produce recombinant proteins capable of specifically interacting with the labeled fusion protein probe.
  • a cDNA encoding a portion (the E7 binding domain) of the retinoblastoma susceptibility gene product (pRB) was prepared as a GTK-fusion protein and used to screen a library for proteins which interact with pRB.
  • a number of positive clones were isolated.
  • One of these clones, designated RBAPl displays properties expected of a cellular protein capable of interacting with pRB.
  • Labeled fusion proteins of the present invention comprising a selected polypeptide can also be used as probes in Western blot formats (Example 4) .
  • labeled fusion proteins of the present invention can be used to identify proteins present in complex mixtures (e.g., whole cell extracts) , which are capable of interacting with the selected polypeptide.
  • the fusion proteins can be used in protocols to isolate the interacting proteins.
  • the interacting proteins can be captured by the selected polypeptide portion of a labeled fusion protein which is bound to an affinity matrix. Subsequent cleavage at a cleavable linker could release the complex of the interacting protein and a labeled selected
  • SUBSTITUTESHEET polypeptide portion for further analysis.
  • a purified labeled fusion protein probe can also be used to determine the amount of or to detect an interacting protein or other substance in a complex mixture using immunoassay techniques, for example.
  • the binding of a specifically interacting protein (e.g., an antigen, hormone) or other substance to an immobilized (e.g. , bound to an affinity matrix) fusion protein of the present invention can induce a change in the attributes of the bound fusion protein.
  • Other substances include, but are not limited to agents such as a drug, toxin, substrate or other ligands capable of interacting with the selected polypeptide portion of the fusion protein.
  • the difference between the unbound versus the bound state could be taken advantage of, as for example, in an assay format.
  • the presence of the interacting protein or other substance in a complex mixture e.g., blood
  • binding of an interacting protein could induce a conformational change that shields the labeling (e.g. phosphorylation) site.
  • the complex mixture containing an interacting protein is contacted with the fusion protein, which is bound to the affinity matrix.
  • the combination is washed, equilibrated with a suitable reaction buffer and subjected to the labeling procedure.
  • the extent of labeling is compared to a control which lacks the interacting polypeptide and the presence of the interacting protein in the complex mixture is indicated by inhibition of labeling.
  • such a procedure can also provide information regarding the quantity of interacting protein which is present.
  • the vectors and methods of the present invention can facilitate the characterization of cDNA clones.
  • an uncharacterized cDNA can be inserted into a vector of the present invention allowing rapid purification of the protein, without specific knowledge of its properties.
  • the protein can be labeled with the maintenance of biological activity, and the labeled fusion protein can be used to detect potential interactions with other cellular proteins.
  • adenovirus E1A protein SV40 and polyomaviral T antigens
  • human papillo a virus E7 protein can bind to the retinoblastoma susceptibility gene product, pRB.
  • pRB retinoblastoma susceptibility gene product
  • oligonucleotides were synthesized using standard techniques and were annealed in vitro (Gait, M.C.J., (1984) Oligonucleotide Synthesis: A Practical Approach, (I.R.L. Press; Oxford)).
  • the resulting duplex has 5'-overhangs compatible with Ba HI- and EcoRI-cut DNA and is shown below:
  • Plasmid pGEX-2T (Pharmacia) was linearized with BamHI and EcoRI, and the vector fragment was ligated to the synthetic duplex to make plasmid pGEX-2TK. Incorporation of the duplex into pGEX-2T was confirmed by restriction and DNA sequence analysis. DNA sequencing was performed using a Sequenase 2.0 kit (United States Biochemical Corp.) with the protocol provided by the manufacturer. The structure of pGEX-2TK in the region surrounding the kinase site is shown in Figure l.
  • RB cDNAs encoding residues 379-792 or residues 379-928 of the retinoblastoma susceptibility gene product, both of which span the T/ElA-binding region, were subcloned into pGEX-2TK.
  • an RB cDNA encoding residues 379-928, having a naturally occurring, loss of function, RB point mutation (379-928;706F) which is known to abrogate T/E1A binding, was subcloned into pGEX-2TK.
  • pGTK-RB(379-792) pGTK-RB(379-928) and pGTK-RB(379-928;706F) , respectively.
  • the encoded fusion proteins are named GTK-RB(379-792) , GTK-RB(379-928) and GTK-RB(379-928;706F) , respectively.
  • These RB fusion proteins are also referred to as GST-RB fusion proteins or RB fusion proteins.
  • the RB cDNA inserted into pGEX-2TK to make pGTK-RB(379-928) corresponds to RB deletion RB dl 1-378 (Kaelin, W.G. et al. , Mol. Cell. Biol. 10(7) : 3761-3769 (1990)) .
  • the RB cDNA inserted into pGEX-2TK to make pGTK-RB(379-792) corresponds to RB deletion RB dl 1-378;793-928.
  • Residue 379 is a methionine residue.
  • Each amplimer used to generate the cDNAs contained a BamHI site, such that the resulting PCR product, upon digestion with BamHI, could be ligated in frame into the unique BamHI site in pGEX-2T (Pharmacia).
  • the 3' amplimer contained a TGA stop codon as well.
  • the PCR fragments encoding these two cDNAs were each cleaved with BamHI and cloned into pGEX-2T, which had been linearized with BamHI, and treated with calf intestinal phosphatase.
  • the resulting constructs were named pGT-RB(379-928) and pGT-RB(379-792) .
  • the cDNAs were cleaved from the latter constructs using BamHI, and were each subcloned into pGEX-2TK, which had been cleaved with BamHI.
  • the resulting plasmids are named pGTK-RB(379-928) and pGTK-RB(379-792) .
  • Two additional amino acids were incorporated into the sequence at the BamHI site due to the structure of
  • the mutant RB cDNA and referred to herein as either RB(379-928;706F) or RB(379-928;pm706) corresponds to the cDNA designated (379-928;pm 706) constructed by Kaelin et al. (Kaelin, W.G. et al. , Cell 64: 521-532 (1991)), also incorporated herein by reference.
  • the mutant cDNA fragment was generated by PCR of reverse-transcribed RNA from cells containing the mutation.
  • the PCR product was cleaved at the unique Ncol and Bsml sites in the RB gene (which span the 706F mutation), and ligated into pGT-RB(379-928) , from which the wild type RB cDNA Ncol-Bsml segment had been excised.
  • the resulting plasmid, pGT-RB(379-928;706F) encoded the 379-928;706F cDNA.
  • the plasmid was cleaved with BamHI to release the cDNA, and the cDNA fragment was subcloned into pGEX-2TK which had been cleaved with BamHI to make pGTK-RB(379-928;706F) .
  • the amino acid sequence at the junction of the phosphorylation (kinase) site and the first RB residue (Met _ ) is also
  • the products encoded by all three cDNAs have been tested previously for their ability to bind to SV40 T antigen, the adenovirus E1A gene product, and putative cellular RB-binding proteins, when expressed as pGEX-2T encoded GST fusion proteins.
  • the fusion proteins encoded by pGT-RB(379-928) and pGT-RB(379-792) were able to bind to T antigen, E1A, and the putative cellular RB-binding proteins, while the mutant protein encoded by pGT-RB(379-928;706F) did not (Kaelin, W.G. et al. , Cell 64: 521-532 (1991)).
  • coli DH5 transformed with either PGEX-2TK or pGEX-2TK recombinants, were diluted 1:10 in Luria-Bertani (LB) medium containing ampicillin (100 ⁇ g/ml) and incubated for a total of 5 hours at 37 °C, with shaking. After 1.0 hour of growth at 37 °C, isopropyl- / .-D-thiogalactopyranoside (IPTG; Bethesda Research Laboratories) was added to a final concentration of 0.1 mM.
  • LB Luria-Bertani
  • IPTG isopropyl- / .-D-thiogalactopyranoside
  • aliquots of each bacterial culture were pelleted in a microcentrifuge, were boiled in urea-SDS cracking buffer (0.01 M sodium phosphate [pH 7.2], 1% .-mercaptoethanol, 1% SDS, and 8M urea) , and were loaded onto an SDS-polyacrylamide gel. Proteins were visualized by Coomassie blue staining.
  • the clarified bacterial sonicates containing the pGEX-2TK-encoded protein or relevant GST fusion protein, were rocked for 15-30 minutes at 4 °C with glutathione sepharose (20-30 ⁇ l/ml bacterial sonicate) .
  • the glutathione sepharose beads (Glutathione Sepharose 4B, Pharmacia) ) had been washed three times and resuspended 1:1 v/v in NETN (see above) supplemented with 0.5% non-fat powdered milk prior to use. This step and subsequent steps were also carried out at 4°C.
  • IX HMK buffer (20 mM Tris pH [7.5], 100 mM NaCl, 12 mM MgCl ) .
  • the supernatant was aspirated with a 23G needle and the sepharose was resuspended in 2-3 bead volumes of IX HMK buffer containing 1 unit/ ⁇ l of the catalytic subunit of cAMP-dependent protein kinase (Sigma Chemical Co., St. Louis, MO), 1 ⁇ Ci/ ⁇ l 32 P- ⁇ ATP (6000 Ci/mMole, 10 mCi/ml, New England Nuclear) , and 1 mM DTT.
  • HMK stop buffer (10 mM Na Phosphate [pH 8.0], 10 mM Na Pyrophosphate , 10 mM EDTA, 1 mg/ml bovine serum albumin) .
  • HMK stop buffer 10 mM Na Phosphate [pH 8.0], 10 mM Na Pyrophosphate , 10 mM EDTA, 1 mg/ml bovine serum albumin.
  • the supernatant was again removed using a 23 G needle and the sepharose was washed 5 times with NETN to remove any unincorporated label. Incorporation of label can be determined while the protein is attached to the bead.
  • the fusion protein can be eluted from the beads. After the final wash the residual supernatant was aspirated using a 23 G needle and the labeled or unlabeled GST fusion protein was eluted by rocking the sepharose for 10-15 minutes in 10-50 bead volumes of 20 mM reduced glutathione, 100 mM Tris [pH 8.0], 120 mM NaCl.
  • FIG. 2 shows the effect of inserting an HMK sequence on 32P incorporation (cpm) in vitro data on four GST-RB fusion proteins.
  • GST-RB(379-792;pm706) fusion proteins which were encoded by pGEX-2TK and which contained an HMK sequence showed significant incorporation of 32P.
  • GST-RB(379-792) and GST-RB(379-792;pm706) fusion proteins which were encoded by pGEX-2T and lacked an HMK sequence were not significantly radiolabeled.
  • fusion proteins from 10 ml of bacterial culture were recovered on 25 ⁇ l of glutathione sepharose beads, and were subjected to phosphorylation using the same protocol.
  • a GST-RB(379-928) fusion protein expressed from pGEX-2TK and having a kinase site was labeled with
  • a GST-RB(379-928) ;pm706 fusion protein expressed from pGEX-2TK and having a kinase site was labeled with 32 P (3 X 10 6 cpm) , while the
  • the GST-RB fusion proteins expressed from pGEX-2T-derived plasmids pGT-RB(379-928) and pGT-RB(379-792) were able to bind to T antigen, E1A, and the putative cellular RB-binding proteins, while the mutant fusion protein encoded by pGT-RB(379-928;706F) did not (Kaelin, W.G. ___________ : •' Cell 64: 521-532 (1991)). This observation suggests that the binding function and structure of the RB proteins is not grossly disturbed when they are expressed as part of a fusion protein with GST.
  • the intensity of the bands for GTK-fusion proteins expressed in E. coli transformed with pGTK-RB(379-928) , pGTK-RB(379-792) , or pGTK-RB(379-928?pm706) was comparable to that observed with for the corresponding constructs lacking the kinase site (i.e., pGT-RB(379-928) , pGT-RB(379-792) , or pGT-RB(379-928;pm706) , respectively)
  • retinoblastoma cell line WERI-Rb27 (a gift of Dr. Wen-Hwa Lee) Huang, H.-J. et al. , Science 242: 1563-1566 (1988)) and-293 cells, a human embryonic cell line transformed by a fragment of the Adenovirus 5 genome (Graham, F.L. et al. , J. Gen. Virol. 36: 59-72 (1977)), were grown in Dulbecco's modified Eagle's medium (DMEM) with 10% fetal calf serum (Gibco) . Akata cells were grown in RPMI or DMEM, with 10% fetal calf serum (Gibco) .
  • DMEM Dulbecco's modified Eagle's medium
  • Gibco fetal calf serum
  • C57B1/6 primary mouse embryo fibroblasts (MEF) lines expressing either wild-type (MEF ⁇ Tex) or mutant (MEF ⁇ K1) forms of T antigen were grown in Dulbecco's modified Eagle's medium (DMEM) with 10% fetal calf serum (Gibco) and G418 (150 ug/ml) (Ewen, M.E. et al. , Cell 5 : 257-267 (1989)). All cells were grown at 37°C in a humidified, 10% CO -containing atmosphere. Radioisotopic labelling of cells and preparation of cell lysates was as described previously (Kaelin, W.G. et al. , Cell 64: 521-532 (1991)).
  • Tissue culture supernatants were the source of monoclonal antibodies PAb 419 and M73 (Harlow, E. et al., J. Virol. 3£ : 861-869 (1981); Harlow E. et al. , J. Virol. 55: 533-546 (1985)).
  • the use of these antibodies for immunoprecipitation and western blotting was described previously (Kaelin, W.G. et al., Cell 64: 521-532 (1991)), except that electro- phoretic transfer of proteins to nitrocellulose was performed without the addition of methanol to the transfer buffer.
  • Integrity of Phosphorylated Fusion Proteins TO determine whether the kinase reaction led to a significant alteration in the RB T/ElA-binding region of the GTK-RB fusion proteins, the following experiment was performed.
  • Whole cell lysates containing wild-type T antigen (MEF ⁇ Tex cells) , the RB-binding and transformation defective T antigen mutant Kl (MEF ⁇ K1 cells) , or EIA (293 cells) were re ⁇ solved by SDS-polyacrylamide gel electrophoresis and transferred to nitrocellulose filters (See protocols in Example 4) .
  • GST-RB fusion proteins were kinased in vitro and eluted from the glutathione sepharose in the presence of reduced glutathione as described above.
  • the eluted labeled protein was incubated overnight with nitrocellulose strips cut from the filters. The filters were then washed and subjected to autoradiography. Hybridization conditions and washes were as described below in Example 4, Hybridization of Filters. Similar to the unphosphorylated version, 32 P-GTK-RB(379-792) bound to EIA and wild-type T from MEF ⁇ Tex cells, but not to the T mutant Kl from MEF ⁇ Kl cells. The presence of equivalent amounts of T and ⁇ i in this assay was confirmed by immunoblotting with a monoclonal antibody directed against T antigen.
  • libraries were plated at approximately 40,000 pfu/150 mm plate x 30 plates.
  • Electrophoretic transfer of proteins from SDS-polyacrylamide gels to nitrocellulose for western blot analysis was carried out in 192 mM glycine, 25 mM Tris(Base), and 0.01% SDS. Plaque lifts were performed as described above.
  • the nitrocellulose filters were placed, directly into IX HBB (25 mM Hepes-KOH[pH7.7], 25 mM NaCl, 5mM MgCl 2 ,l mM DTT) supplemented with 5% non-fat powdered milk and 0.05% NP-40 without drying and incubated overnight. This and subsequent manipulations were performed at 4°C with gentle rocking.
  • the filters were then denatured and renatured as described by vinson et al. (Genes and Dev. 2_: 801-806 (1988)). Processing of multiple filters was done in batch.
  • the filters were placed in fresh IX HBB supplemented with 5% non-fat powdered milk and 0.05% NP-40, and incubated for 1 hour. The filters were then incubated in IX HBB supplemented with 1% non-fat powdered milk and 0.05% NP-40 for at least 30 minutes prior to hybridization.
  • the relevant 32P-labeled GST fusion protein was added at 100,000-250,000 cpm/ml. Hybridization was carried out at 4°C with gentle rocking overnight.
  • RB(379-792) fusion protein was loaded onto the glutathione sepharose beads, and the precleared
  • Bound proteins (directly and indirectly bound to the RB-bound beads) were loaded in wide wells, resolved by SDS-polyacrylamide gel electrophoresis, and transferred to nitrocellulose.
  • the filter was
  • Strips 3 and 4 were then washed, dried, placed under
  • BSTITUTESHEE the presence of wild-type E7 peptide prior to electrophoresis, indicating specificity of binding.
  • WERI-rb27 cells was also probed for cellular proteins capable of binding directly to the RB.
  • the unlabeled extract was subjected to SDS-polyacrylamide gel electrophoresis and transferred to nitrocellulose for Western blotting.
  • the filter was cut into strips which were probed with selected 32P-labeled fusion proteins. Again, there appeared to be at least 2 cellular proteins capable of interacting with
  • binding of these cellular proteins to wild-type 32 P-GTK-RB(379-792) fusion protein could be selectively inhibited by three different RB-binding peptides.
  • Peptide replicas of the RB-binding sequences found in T antigen (amino acids 102-115) , EIA (tyrosine followed by EIA residues 115-132) , or E7 (amino acids 16-32) each inhibited binding of the labeled probe to the two cellular proteins.
  • point mutant derivatives of these peptides failed to inhibit the interaction of the probe with the filter-bound cellular proteins.
  • the particular mutant peptides tested included the T antigen peptide with a Glu 1Q7 to Lys change, the EIA peptide with a Cys _ 5 to Gly change, and the E7 peptide with a Glu 6 to Gin change. Similar results were obtained when this assay was performed with a human Burkitt's lymphoma cell line (Akata) as well as with normal peripheral blood lymphocytes.
  • clone 5 sequence analysis of these clones suggested that the short open reading frame in clone 5 was generated by the juxtaposition of a normally untranslated cDNA segment with the ⁇ gtll £-galactosidase coding sequence.
  • the clone may not represent a cellular protein, upon expression in ⁇ gtll, it encodes a protein with properties (e.g. , a Leu-X-Cys-X-Glu motif) common with viral RB-binding motifs.
  • Clone 9 unlike clone 5, contained a long open reading frame, consistent with the possibility that it encodes an RB-interacting protein. Additional Characterization of RBAPl
  • the retinoblastoma gene product is believed to serve as a cell cycle regulatory element.
  • pRB is thought to contribute to the regulation of cell cycle progression as cells traverse the Gl/S boundary.
  • RBAPl message levels appear to respond to cell cycle events.
  • a Northern blot was prepared of total RNA obtained from resting peripheral blood lymphocytes (PBL) and from PBL at various time points following stimulation with a cocktail containing PMA, PHA, and a calcium ionophore. The Northern blot was probed with an RBAPl probe from clone 4. A single message of about 3.5 kb became detectable 24-36 hours after stimulation.
  • the RBAPl cDNA was cloned into pGEX-2T and expressed in E. coli as a GST-fusion protein. Glutathione sepharose beads were loaded with the
  • S-labeled RB proteins were prepared by in vitro transcription and translation from cDNA clones as described (Kaelin, W.G. et al. , Mol. Cell. Biol. 10(7) : 3761-3769 (1990)). The 5 labeled RB proteins, RB(379-928) and
  • RB(379-928;pm706) were incubated with the bound GST-RBAP1 fusion protein.
  • the beads were washed and analyzed for the amount of fusion protein retained. Quantitative recovery of the in vitro translated ° 32 P-GTK-RB(379-928) protein, but not of the
  • RBAPl product is capable of interacting directly with the RB gene product.
  • the RB-related protein pl07 was also expressed as a GTK-fusion protein.
  • a cDNA encoding a region of pl07 which is homologous to the T/E1A/E7 binding domain of the RB gene product was inserted into 0 pGEX-2TK.
  • a GTK-pl07 fusion protein was expressed in ⁇ __- _____L___L' a lY sa te was prepared, the fusion protein present in the lysate was captured on a glutathione-sepharose affinity column, and labeled _ with [32P] in situ m a phosphorylation reaction using the catalytic subunit of cAMP-dependent protein kinase and [ T - 32 P]ATP.
  • the [ 32 P]-labeled GTK-pl07 fusion protein was eluted from the column, and was used as probe in Western blot analysis of whole cell extracts.
  • Whole cell extracts (unlabeled) containing E1A were prepared from 293 cells, subjected to SDS-polyacrylamide gel electrophoresis, and electro- phoretically transferred to filters. Filters were cut into strips and probed with (1) anti-ElA antibody, (2) [ 32 P]-labeled GTK-pl07 fusion protein, (3) [ P]-labeled GTK-pl07 fusion protein in the presence of competing wild-type E7 peptide (residues 16-32), and (4) [ 32 P]-labeled GTK-pl07 fusion protein in the presence of mutant E7 peptide (residues 16-32, with a Glu to Gin mutation at residue 26) .
  • GTK-pl07 fusion protein did not detect the EIA product in the presence of competing wild-type E7 peptide.
  • Plasmid pAR3040 (also referred to as pET3a) was the starting material (Studier, Meth. Enzymol .185: 60-89 (1990)). This plasmid and expression plasmids derived from this vector can be maintained and induced for expression as described by Studier (Studier, Meth. Enzymol .185: 60-89 (1990)). Plasmid
  • STITUTESHEET PAR3040 was cut with EcoRI.
  • the overhangs were made blunt by "filling in” using Klenow enzyme and dNTPs, and religa ed to destroy the EcoRI site.
  • the EcoRI site was regenerated on propagation of the plasmid. Therefore, another version of of this intermediate was constructed by treating with mung bean nuclease to remove the EcoRI overhangs prior to religation. Regeneration of the EcoRI site in the latter version was not observed.
  • Both versions of the intermediate construction were modified as described below, to make two slightly different pAR( ⁇ RI)59-60 vectors (i.e., differing in the manner in which the EcoRI site was destroyed) . These vectors behave identically with respect to expression of encoded fusion proteins.
  • the intermediate vectors were then cleaved with Ndel and treated with calf intestinal alkaline phosphatase (CIP) .
  • CIP calf intestinal alkaline phosphatase
  • Two complementary oligonucleotide adaptors MAB 59 and MAB 60, were synthesized, kinased, and annealed. The adaptors were then ligated into the Ndel cleaved, CIP-treated vectors. The adaptors introduce the 17 amino acid sequence comprising the FLAG peptide and HMK site shown in Figure 2. The sequence of the adaptors is shown below:
  • Plasmids having one insert of the double-stranded adaptor were identified by restriction analysis of miniprep DNA.
  • An Xbal-Ba HI double digest releases a fragment comprising the inserted sequence.
  • SUB STITUTESHEET The first derivative encoding a selected polypeptide was prepared by the insertion of a 1.5 kb EcoRI fragment of a shPan-1 (N3) cDNA into pAR( ⁇ RI)59/60, which had been cleaved with EcoRI and treated with CIP.
  • the shPan-1 (N3) cDNA encodes a DNA binding protein, as described by German et al. (German et al., Molec. Endocrinol. 5_: 292-299 (1991)).
  • Plasmids containing the insert were detected by restriction analysis of miniprep DNA using a BamHI-StuI double digest. The correct orientation yielded a diagnostic 500 bp fragment. The sequence at the junctions between the vector and insert was verified by sequencing, using oligonucleotide primers MAB 58 and MAB 69 (see above) . The plasmid is referred to herein as pFEK-shPan-1.
  • the second derivative of pAR( ⁇ RI)59/60 was made by the insertion of N3-SH, a fragment of shPan-1 generated by PCR (German et al. , Molec. Endocrinol. 5_: 292-299 (1991)).
  • the fragment was generated by PCR, using oligonucleotide primers MAB 72 (5'-GGCCGAATTCTCCTGGTCCCACGGAGACCC-3') and MAB 73 (5'-GGCCGAATTCGCCGAGGAGGACAAGAAGGACC-3') .
  • the PCR products were purified on an agarose gel, electroeluted, treated with T4 DNA polymerase, and digested with EcoRI.
  • the resulting fragment was ligated to EcoRI-cut, CIP-treated pAR( ⁇ RI)59/60. Constructs with single inserts were identified by Xbal-BamHI double digests of miniprep DNA. The junctions and sequence of the insert were verified by dideoxy sequencing using oligonucleotides MAB 58 and MAB 69 (see above) . The resulting construct is referred to herein as pFEK-N3-SH.
  • the third derivative of pAR( ⁇ RI)59/60 was constructed by the insertion of a DNA sequence encoding amino acids 120 through 206 of the rat c-fos protein (Kouzarides, T. and E. Ziff , Nature 336: 646-651 (1988); Curran, T.
  • PCR polymerase chain reaction
  • MAB 70 5'-GGCCGAATTCGCGCAGAGCATCGGCAGAAG-3'
  • MAB 71 5'-GGCCGAATTCCTACTAGATCTTGCAGGCAGGTCGGT-3'
  • the resulting fragment was ligated into EcoRI-cut, CIP-treated pAR( ⁇ RI)59/60. Isolates with single inserts were identified by Xbal and BamHI double digests. The junctions and insert sequence were verified by dideoxy sequencing using the oligonucleotide primers MAB 58 and MAB69 (see above) .
  • the resulting plasmid is referred to herein as pFEK-c-fos(120-206) .
  • the fourth derivative of pAR( ⁇ RI)59/60 was derived by the insertion of amino acids encoding residues 206 through 340 of the human c-jun protein (Kouzarides, T. and E. Ziff, Nature 336: 646-651 (1988) ; Bohmann, D. et al. , Science 238: 1386-1392 (1987)).
  • the fragment for cloning was generated by PCR using oligonucleotide primers MAB 74 (5'-GGCCGAATTCTTTCCCGCGCAACCCCAGCA-3') and MAB 75 (5'-GGCCGAATTCCCGACGGTCTCTCTTCAAA-3') .
  • the PCR products were purified on an agarose gel,
  • each of the selected polypeptides expressed from pAR( ⁇ RI)59/60 is a DNA binding protein
  • the activity of the selected polypeptide portions in the context of a fusion protein comprising an affinity ligand and modification site, were assayed by electrophoretic mobility shift assays (EMSA) .
  • Assay conditions for the FEK-shPAN-1 fusion protein encoded by pFEK-shPan-1 and by pFEK-N3-SH, were as described by German et al. (German et al. , Molee. Endocrino1. 5_: 292-299 (1991)).
  • EMSA conditions for the fusion protein encoded by pFEK-c-fos(120-206) were as described (Kouzarides, T. and E. Ziff, Nature 336: 646-651 (1988) ; Curran, T. et al. , Oncogene 2_: 79-84 (1987) ; Nakabeppu, Y. and D. Nathans, EMBO J. Q 3833-3841 (1989)) .
  • EMSA conditions for the fusion protein encoded by pFEK-c-jun(206-340) were as described (Kouzarides, T. and E. Ziff, Nature 336: 646-651 (1988); Bohmann, D. et al., Science 238: 1386-1392 (1987)).
  • the DNA binding activity of the FEK-c-jun(206-340) fusion protein present in bacterial extracts was assayed in combination with bacterial extracts containing fos "core" (to assay binding of jun as a jun-fos heterodi er) .
  • the four unlabeled fusion proteins were made by in vitro transcription/translation.
  • the particular DNA probes used in each shift assay were radiolabeled for detection of DNA-fusion protein complexes.
  • each of the native (i.e., not fused) DNA binding proteins was prepared by in vitro transcription/translation and subjected to the shift assay.
  • the short polypeptide produced by pAR( ⁇ RI)59/60, and the transcription/translation products from a vector control for each pFEK-construct in which the selected polypeptide insert was in a reverse orientation, provided negative controls for DNA binding.
  • all four pFEK-fusion proteins bound DNA to an extent comparable to the corresponding native protein.
  • S UBSTITUTESHEET distinguished because the negative charge of the additional phosphate group on the modified fusion protein led to an alteration in migration of the DNA-fusion protein complex as compared with the unmodified fusion protein-DNA complex. Modification of all four of the fusion proteins did not alter the extent of complex formation, indicating that the biological activity of the selected polypeptide portion is not altered by modification.
  • the fusion proteins encoded by vectors pFEK-shPan-1 and pFEK-N3-SH were obtained using DEAE-Sephacel chromatography at -50% purity.
  • a fusion protein comprising the fos 'core' (encoded by pFEK-c-fos(120-206) )
  • activity of fractions was assayed by electrophoretic mobility shift assay together with reticulocyte-produced c-jun protein.
  • Peak activity for the fusion protein comprising the fos 'core' fusion protein was eluted at approximately 500-600 mM salt yielding a preparation with -50% purity.
  • All four fusion proteins produced by the pFEK- vectors (pFEK-shPan-1, pFEK-N3-SH, pFEK-c-fos(120-206) , and pFEK-c-jun(206-340) ) were capable of being modified by phosphorylation when present in crude bacterial extracts. As discussed above, modification by phosphorylation with unlabeled ATP was observed. In addition, modification using radioactively-labeled ATP was observed. Proteins which were partially purified by the methods (A and B) described above were also efficiently radiolabeled by phosphorylation using HMK enzyme and [ 7 - 32P]ATP. Labeli.ng to a speci.fi.c activity of approximately 10 7 to 108 cpm per ⁇ g of protein was observed for partially purified
  • HMK Protein Kinase
  • Catalytic subunit from bovine heart was obtained as lyophilized powder in 250 unit vials.
  • 250 units of HMK enzyme was resuspended in 25 ⁇ l of 40 mM DTT (i.e., at 10 u/ ⁇ l) .
  • the solution was allowed to stand at room te perative for 10 minutes and was stored at +4 °C. Activity was stable for 2-3 days, but HMK was usually freshly reconstituted for each use.
  • HMK Buffer 200 mM Tris-Cl (pH 7.5), 10 mM DTT, 1 M NaCl, and 120 mM MgCl_) .
  • the phosphorylation reaction mixture was as follows:
  • FEK-fusion proteins were used for Western blotting or for plaque screening.
  • the fusion protein was run a G50 column as described below in order to remove unincorporated label following the HMK phosphorylation reaction.
  • Z + 0.1 M KC1 buffer (25 mM Hepes-KOH ([pH 7.7], 12.5 mM MgCl 2 , 20% glycerol, 100 mM KC1) was prepared.
  • solutions comprising Z + KC1, and BSA the solution was filtered through a 0.2 ⁇ m filter prior to use. DTT was added to a final concentration of l mM just prior to use.
  • fractions e.g., 1.0 ⁇ l
  • SDS-polyacrylamide gels can be fractionated on SDS-polyacrylamide gels to monitor the progress of the fusion protein.
  • the above procedure can also be carried out at 4° C where desired.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Medicinal Chemistry (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Toxicology (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Microbiology (AREA)
  • Peptides Or Proteins (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

The present invention relates to expression vectors comprising nucleic acid sequences which encode an affinity ligand (e.g., an enzyme, epitope) and a modification recognition sequence. The vectors further comprise at least one restriction site for the insertion of a nucleic acid sequence capable of encoding a selected polypeptide. On expression, the resulting construct codes for a fusion protein comprising an affinity ligand, the selected polypeptide and a modification recognition sequence. The fusion protein may be isolated by virtue of the affinity ligand and then modified. The expression vectors may further comprise a nucleotide sequence encoding a cleavable linker, such as a thrombin or factor Xa cleavage sequence. The invention further relates to expression vectors containing a nucleotide sequence encoding a gene for a selected polypeptide and capable of directing the expression of the selected polypeptide as a fusion protein. In addition, methods of producing a modified fusion protein are disclosed. Modified or labeled fusion proteins of the present invention are useful in a variety of therapeutic, diagnostic and research applications.

Description

PLASMIDS FOR THE RAPID PREPARATION OF MODIFIED PROTEINS
Description
Background
Current methods for the production of labeled proteins include iodination, biotinylation, or the in vivo labeling of protein. In vivo labeling methods require large amounts of radioactivity are required, and the protein of interest must be separated from other labeled cellular proteins. The biotinylation and iodination procedures also require a purification step for each individual protein and, in some cases, reaction conditions which can cause inactivation of the protein. In addition, using these methods, modification of a protein may occur at a variety of sites, leading to distortions in structure or biological activity.
For example, iodination protocols relying on chlora ine T or iodogen result in modifications at tyrosine residues and some histidine residues.
Over-substitution and oxidation damage may result. Labeling procedures which use the Bolton-Hunter
SUBSTITUTESHEET reagent result in the modification of free amino groups on lysine residues. In some proteins this particular modification may also have a deleterious effect on structure or activity. Although the 5lactoperoxidase method for iodination employs gentler conditions, this method also leads to modification of tyrosine and histidine residues, with the potential for structural distortion and loss of activity. Similarly, biotinylation protocols are 0frequently performed using a succinimide ester of biotin. The biotin is coupled to the protein through free amino groups, typically on lysine residues. Again, modification at one or more positions may alter structure and/or function of the protein. In ddition, extensive dialysis is needed to remove uncoupled biotin, which may be deleterious to the protein.
Summary of the Invention
The present invention relates to expression vectors comprising nucleic acid sequences which encode an affinity ligand (e.g., an enzyme, epitope) and a modification recognition sequence. The vectors further comprise at least one restriction site for the insertion of a nucleic acid sequence capable of encoding a selected polypeptide in frame with the affinity ligand and modification sequence. On expression, the resulting construct codes for a fusion protein comprising an affinity ligand, the selected polypeptide and a modification recognition sequence.
HEET The fusion protein may be isolated by virtue of the affinity ligand and then modified.
The expression vectors may further comprise a nucleotide sequence encoding a cleavable linker, such
05 as a thrombin or factor Xa cleavage sequence. The sequence encoding the cleavable linker is located between the affinity ligand and the restriction site for insertion of the sequences encoding the selected polypeptide. In this location, the linker may be
1 cleaved to release the protein of interest following modification. The invention further relates to expression vectors containing a nucleotide sequence encoding a gene for a selected polypeptide and capable of directing the expression of the selected
15 polypeptide as a fusion protein.
The pGEX-2TK expression vector is one embodiment of the present invention. This vector encodes a protein comprising, from amino to carboxyl terminus, glutathione-S-transferase (GST) as an affinity 2 ligand, the thrombin cleavage site as a cleavable linker, and a phosphorylation recognition site for the cAMP-dependent protein kinase as a modification recognition sequence. A multiple cloning site comprising three restriction sites is located or. ώJ downstream of the sequence encoding the phosphorylation site. Thus, a nucleic acid sequence encoding a selected polypeptide may be inserted into the vector using one or more of these sites. The selected polypeptide is expressed as a GTK-fusion
*.(. protein (G, GST; T, thrombi.n; K, ki.nase) .
SUBSTITUTESHEET In addition, methods of producing a modified fusion protein are disclosed. Upon expression in a suitable host cell, the protein of interest is produced as a fusion protein. The fusion protein can be captured on a suitable affinity matrix by virtue of an affinity ligand, which interacts reversibly with the matrix. Modification of the fusion protein can be carried out while the protein is attached to the matrix. Subsequently, the modified fusion protein may be isolated for use by releasing the fusion protein from the affinity matrix with a suitable agent. In the case where a cleavable linker is present, the fusion protein may be cleaved in vitro to free the modified polypeptide portion, and the affinity ligand portion or any uncleaved product can be removed by adsorption on the appropriate affinity matrix. Alternatively, the modified polypeptide portion can be released from the affinity ligand portion by cleaving the modified fusion protein at the cleavable linker while still bound to the column.
Modified proteins of the present invention are useful in a variety of applications. Labeled (e.g., radiolabeled) proteins may be used as molecular probes. For example, antibodies can be labeled for therapeutic, diagnostic (e.g., imaging) or research purposes. Proteins may be labeled and used as reagents to quantitate or identify an interacting protein, such as a receptor.
SHEET Brief Description of the Drawings
Figure 1 shows a portion of the nucleotide sequence around the kinase recognition site of expression vector pGEX-2TK. The portion of the pGEX-2TK sequence introduced by the synthetic duplex is indicated by italics. The sequences which encode the linker cleavable by thrombin (Leu-Val-Pro-Arg- Gly-Ser) , the downstream kinase recognition site (Arg-Arg-Ala-Ser-Val) and multiple cloning site with Ba HI, Smal and EcoRI sites, are shown. The arrow indicates the point of thrombin cleavage.
Figure 2 is an illustration of the structure of expression vector pAR(ΔRI)59/60. The general structure of the plasmid is shown at top. bla, indicates the /9-lactamase gene which confers ampicillin resistance; ori, indicates the origin of replication. In the center panel, the general structure of FEK-fusion proteins is illustrated. The FLAG peptide portion, comprising the FLAG epitope and enterokinase cleavable linker, is indicated. "HMK" indicates the location of.the phophorylation site. "Protein" indicates the location of the selected polypeptide. The lower panel shows a more detailed illustration of the N-terminal region of the vector. The peptide sequence shown (Met-Asp-Tyr-Lys-Asp-Asp- -Asp-Asp-Lys-Ala-Arg-Arg-Ala-Ser-Val-Glu-Phe-) in the detail is a contiguous sequence. The extent of the FLAG peptide, the enterokinase cleavage site, and HMK recognition (phosphorylation site) are indicated. Figure 3 is a bar graph showing the effect on
32 P incorporati.on i.n vi.tro of an HMK sequence on
SUBSTITUTESHEET different fusion proteins, comprising portions of the retinoblasto a susceptibitliy gene product, RB. Proteins were phosphorylated in vitro using cAMP-dependent protein kinase. The hatched bars indicate that the fusion protein (GST-RB379-792 or GST-RB379-792;pm706) was expressed from pGEX-2TK and contained an HMK sequence, while the solid bars indicate that the fusion protein (GST-RB379-792 or GST-RB379-792;pm706) was expressed from pGEX-2T and did not contain an HMK sequence.
Unless indicated otherwise, the orientation of particular amino acid sequences is such that the amino end is on the left and the carboxyl end is on the right.
Detailed Description of the Invention
The present invention relates to expression vectors comprising nucleic acid sequences which encode an affinity ligand and a modification recognition sequence. The vectors further comprise at least one restriction site for the insertion of a nucleic acid sequence capable of encoding a selected polypeptide. The expression vectors may further comprise a nucleotide sequence encoding a cleavable linker sequence. These vectors are referred to as parent vectors.
The present invention further relates to expression vectors which are derived from the parent vectors described above, by the insertion of a nucleic acid sequence capable of encoding a selected polypeptide (e.g., a natural or synthetic cDNA or
SHEET geno ic DNA which encodes the selected polypeptide and is capable of being expressed in the appropriate host cell) into the parent vector. The parent vector can be cleaved at one or more restriction sites in order to insert sequences encoding a selected polypeptide using known techniques. Linkers or other sequences can be used to facilitate insertion. Insertion at an appropriate restriction site or site in the parent vector results in the in frame fusion of the sequences of the selected polypeptide with the amino acid sequences for an affinity ligand, modification recognition sequence and, if present, the optional cleavable linker encoded by the parent vector. On expression, the resulting fusion gene encoded by the vector codes for a fusion protein comprising an affinity ligand, a modification recognition sequence, selected polypeptide, and optionally, a cleavable linker. The location of the restriction site or sites selected for insertion in the vector will determine the location of the selected polypeptide relative to the affinity ligand, modification recognition sequence and optional cleavable linker in the encoded fusion protein. The selected polypeptide element of the fusion protein comprises at least one peptide, polypeptide or protein of interest.
The fusion protein comprising an affinity ligand, a modification recognition sequence, an optional cleavable linker and a selected polypeptide forms a conti.guous polypepti.de chai.n. The order of these components in the fusion gene and protein can vary. The location of these components or additional
BSTITUTESHEET components in the fusion gene and protein is determined by the order in which the nucleic acid sequences encoding the components are linked in the vector or the parent vector. For example, fusion proteins of the following structure can be made, where A indicates the affinity ligand, M indicates the modification recognition sequence, C indicates the optional cleavable linker, and X indicates the polypeptide of interest, and the N-terminal portion of the fusion is on the left: A-(C)-M-X A-M-(C)-X X-(C)-M-A X-M-(C)-A M-A-(C)-X
M-(C)-A-X Other permutations of the components are possible. However, the cleavable linker is preferably located between the affinity ligand and selected polypeptide portions of the fusion protein. In addition, intro¬ duction of a selected polypeptide in frame is simplified by a terminal location for the polypeptide (X) . This location can also minimize the effect of fusion on the biological activity (e.g., binding activity, an igenicity, enzymatic activity) of the selected polypeptide.
One or more components can be duplicated (e.g., X-M-C-A-X) . For example, multiple modification sites can be incorporated to increase the intensity of
HEET labeling. These sites may be contiguous or noncontiguous in the encoded fusion protein. Furthermore, additional sequences (S) can be present in the fusion protein. For instance, a sequence encoding a signal peptide or leader peptide may be incorporated into the vector, and the signal peptide will be produced as part of the fusion protein. In the case of a signal peptide, the additional sequence is preferably incorporated at the N-terminus (e.g., S-X-M-C-A) .- However, for other sequences, an internal or C-terminal location in the fusion protein may be desired.
Note that on expression in a suitable host cell, the parent vector may also produce the affinity ligand, modification sequence, and optional cleavable linker sequences as a fusion protein. This product may contain additional sequences encoded by the vector, depending on the location of a promoter or termination signals relative to the coding sequences for these elements. In some cases, restriction sites for insertion of sequences encoding a selected polypeptide may disrupt the reading frame of components encoded by the parent vector. Insertion of sequences encoding a selected polypeptide will restore the reading frame.
The expression vectors of the present invention can be designed for use in a variety of host cells, including bacterial host cells such as E. coli and eukaryotic host cells (e.g., yeast cells and mammalian cells) . Thus, fusion proteins can be glycosylated. Like other expression vectors, in the
SUBSTITUTESHEET expression vectors of the present invention, a promoter is provided for expression of the fusion protein in a suitable host cell. Suitable promoters can be constitutive or inducible. In the vectors, the promoter is operably linked to nucleic acid sequences encoding the fusion protein and is capable of directing the expression of the corresponding polypeptide. A variety of suitable promoters for procaryotic (e.g., lac promoter, tac promoter) and eukaryotic hosts (e.g. , yeast alcohol dehydrogenase (ADH1) , SV40) are available.
In addition, the expression vectors typically comprise a selectable marker for selection of host cells carrying the plasmid and an origin of replication, in the case of a replicable expression vector. Genes encoding products which confer antibiotic resistance are common selectable markers and may be used in prokaryotic (e.g. , ^-lactamase for ampicillin resistance, tetracycline resistance) and eukaryotic cells (e.g. , G418) . Genes encoding the gene product of auxotrophic markers (e.g., LEU2 and URA3) are commonly used as selectable markers in yeast. Use of viral or phage vectors, and vectors which are capable of integrating into the genome of the host cell, such as retroviral vectors, are also contemplated. The present invention also relates to cells carrying these types of expression vectors (e.g., transformed cells).
The incorporation of one or more modification recognition sequences into the fusion protein allows the production of modified fusion proteins. As shown in the Examples, incorporation of a detectable label
HEET (e.g., radioactive, fluorescent) at the modification site provides a convenient way to label the fusion protein. Thus, the invention further relates to the fusion proteins and modified (e.g., labeled) fusion proteins produced from expression vectors of the present invention. It is also possible to produce fusion proteins of the present invention using methods of peptide synthesis, or in vitro transcription/translation procedures. Modification of synthetic fusion proteins is also possible. The affinity ligand present in the fusion proteins of the present invention is a polypeptide encoding an affinity ligand. The affinity ligand (i.e., an affinity ligand or portion thereof) is a member of a specific binding pair and is capable of binding to a specific binding partner. Preferably, the binding of the affinity ligand to its specific binding partner is reversible, allowing recovery of fusion proteins comprising the affinity ligand where desired. Examples of affinity ligands include, but are not necessarily limited to, antibodies or portions thereof, antigens (e.g., influenza hemagglutinin) or epitopes (e.g., FLAG epitope), enzymes (e.g., glutathione-S-transferase (GST), β-galactosidase (lacZ) , the trpE product) , hormones, growth factors or other proteins capable binding to a specific binding partner (e.g., maltose binding protein, histidine hexamer (a heavy metal binding element) , and protein A) . A specific affinity ligand is an affinity ligand comprising that protein or peptide or a portion thereof. Thus, a glutathione-S-transferase affinity ligand is an
SUBSTITUTESHEET affinity ligand comprising glutathione-S-transferase or a portion thereof, which is capable of binding the specific binding partner (e.g., a substrate). The affinity ligand portion of the fusion protein is 05 useful for a variety of purposes, such as providing a moiety for attachment to a support or to facilitate purification or identification.
For example, the fusion proteins can be captured on an appropriate affinity matrix. An 1° affinity matrix is a solid support to which is attached (preferably covalently) a specific binding partner. Fusion proteins of the present invention, comprising an affinity ligand, can bind to a specific binding partner via the affinity ligand part of the 15 fusion protein. In the case of an affinity ligand which is an antigen, an antibody or portion thereof can be used as a specific binding partner in an affinity matrix. Alternatively, an antigen or hapten may be incorporated into an affinity matrix, and used 20 to capture fusion proteins comprising an antibody as an affinity ligand. A number of affinity ligand/affinity matrix pairs are available. Glutathione-S-transferase fusion proteins may be captured on immobilized glutathione as an affinity 25 matrix, with glutathione as the specific binding partner. For example, glutathione sepharose (Pharmacia) or glutathione agarose beads (Sigma Chemical Corp.) can be used. Fusion proteins comprising a protein A, maltose binding protein, FLAG
J ^nυ pepti.de, or a hi.sti.di.ne hexa er affi.ni.ty li.gand can be captured on IgG Sepharose 6FF (Pharmacia) , anylose resin (New England Biolabs) , anti-FLAG Ml antibody or anti-FLAG M2 antibody affinity resins (International Biotechnologies, Inc.), or a Ni 2+ affinity resin (NTA resin, Qiagen) , respectively. In the foregoing, IgG, amylose, anti-FLAG antibodies, or a heavy metal are, respectively, the specific binding partners.
When desired, a fusion protein or modified (e.g. labeled) fusion protein bound to an affinity matrix can be released by contacting the affinity matrix with bound fusion protein thereto with a suitable elution buffer comprising one or more release components. The release component or components can be molecules which compete with the fusion protein for binding to the affinity matrix (e.g., hapten, free peptide epitopes, substrate or substrate analogs) , or which can disrupt binding of the affinity ligand to the specific binding partner. For example, an elution buffer (a buffered solution) suitable for releasing fusion proteins comprising a glutathione-S-transferase affinity ligand can be formulated comprising reduced glutathione as a release component. In one embodiment, an elution buffer comprising reduced glutathione, Tris and NaCl is used.
Alternatively, the elution buffer may comprise a buffered solution which lacks a specific component required for binding. For example, a fusion protein with an ompA signal sequence followed by a FLAG affinity ligand at the amino terminus can be expressed in E. coli. Specific removal of the ompA signal sequence upon secretion into the periplasmic space results in a fusion protein with an
N-terminally located FLAG affinity ligand, which is
STITUTESHEET capable of binding to the anti-FLAG Ml antibody in the presence of calcium. This type of fusion protein can be eluted from an anti-FLAG Ml antibody affinity matrix using an elution buffer which lacks calcium, a specific component required for binding.
The expression vectors of the present invention can further comprise a sequence encoding a cleavable linker. A nucleotide sequence capable of encoding such a cleavage site can be incorporated into an expression vector using known techniques (e.g. , recombinant DNA and/or mutagenesis) . A cleavable linker is a peptide or polypeptide capable of being cleaved by a site specific protease. For example, a thrombin cleavage (e.g., Leu-Val-Pro-Arg-Gly-Ser) , factor Xa cleavage site (e.g. , Ile-Glu-Gly-Arg) or enterokinase cleavage site (e.g., Asp-Asp- Asp-Asp-Lys) may be used as a cleavable linker. The appropriate protease (thrombin, factor Xa and enter¬ okinase, respectively) can be used to cleave a fusion protein containing a corresponding cleavable linker. Cleavage by a protease (proteolysis) can occur within a cleavable linker or at the border of the linker and another component of a fusion protein.
The product of the cleavage reaction of a selected fusion protein comprising the majority of the affinity ligand is referred to as the affinity ligand portion, and the product of the cleavage reaction comprising the selected polypeptide is referred to as the selected polypeptide portion of the fusion protein. These portions may comprise other parts of the encoded fusion protein (i.e., the
T fusion protein encoded by the expression vector) , and such are also fusion proteins. For example, the selected polypeptide portion can include the modification recognition sequence. If so, and if the fusion protein has been modified (e.g., labeled), then the selected polypeptide portion is referred to as the labeled selected polypeptide portion. Thus, the term fusion protein refers to a protein, polypeptide, or glycoprotein comprising at least two of the components selected from the group consisting of an affinity ligand, modification sequence, cleavable linker sequence, selected polypeptide or additional sequence.
Preferably, a cleavable linker is selected which does not cleave elsewhere within the fusion protein (e.g., in the selected polypeptide, affinity ligand or modification site) . In addition, the sequences encoding the cleavable linker are preferably located between those sequences encoding the affinity ligand and selected polypeptide. In this location, the encoded fusion protein can be cleaved to separate the affinity ligand portion of the fusion protein from the selected polypeptide portion where desired. Similarly, where recovery of a labeled selected polypeptide portion is desired, the sequences encoding the cleavable linker will be on located in the vector either upstream or downstream of the sequences encoding both the modification recognition sequence and the selected polypeptide (e.g., as in A-C-M-X or X-M-C-A fusions) .
Overlap of the sequences of portions of the fusion protein may occur provided the respective portions retain function. For example, in one
SUBSTITUTESHEET embodiment of the present invention, pAR(ΔRl)59/60, discussed in more detail below, the affinity ligand (FLAG epitope) sequence and cleavable linker sequence (enterokinase site) overlap by one amino acid. The sequence of the FLAG epitope from N- to C-terminus is (Asp-Tyr-Lys-Asp) , while the enterokinase cleavage site in this embodiment is (Asp-Asp-Asp-Asp-Lys) . Within pAR(ΔRl)59/60, these sequences overlap to give the FLAG peptide, an octapeptide of the sequence Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys, which retains the functions of the affinity ligand (e.g. , binding to a specific binding partner) and cleavable linker (e.g., cleavage by enterokinase) . Similarly, other components of the fusion protein can overlap, providing their function is preserved.
A modification recognition sequence is also pro¬ vided. This sequence, incorporated into the fusion protein, directs modification of the fusion protein. Modification recognition sequences can be incorporated into a fusion protein comprising a selected polypeptide which either is naturally modified or is not naturally modified. A modification recognition sequence for phosphorylation (i.e., a phosphorylation site) can be introduced into a vector and expressed as part of a fusion protein, for example. In the presence of a suitable protein kinase, the fusion protein comprising the phosphorylation (kinase) site will be phosphorylated. Other types of modifications directed by the presence of a peptide or polypeptide sequence are envisioned in the present invention. For example, glycosylation reactions can be directed by a short peptide sequence. Similarly, fatty acylation of some proteins is directed by a short peptide sequence (Cys-Ala-Ala-X) . The identification of additional recognition sites and the identification and purification of the corresponding modification enzyme or enzyme complex will provide alternative protocols for modification of the fusion proteins.
Modification of the fusion proteins is carried out using a modification enzyme or enzyme complex or other suitable means. In the case of a phosphorylation reaction, a suitable phosphorylation method is used. Typically, the phosphorylation of proteins is carried out by a protein kinase. Many such kinases have been described and have utility in the present invention (see e.g., Kemp, B.E., et al. , J. Biol. Che . , 252: 4888-4894 (1977); Edelman, A.M. et al. , Ann. Rev. Biochem. 56: 567-613 (1987); Glass, D.B. and E.G. Krebs, Ann. Rev. Pharmacol. Toxicol. 20: 363-388 (1980) ; Hunter, T. and J.A. Cooper, Ann. Rev. Biochem. 54: 897-930 (1985) ; Hunter. T. , Cell 50: 823-829 (1987)). Known protein kinases catalyze the transfer of the -phosphate group of ATP to the hydroxyl groups of serine and/or threonine residues on specific substrates or alternatively, catalyze the phosphorylation of tyrosine residues. Use of protein kinases of both specifities is contemplated in the present invention. Serine/threonine kinases include cyclic AMP (cAMP) dependent and cyclic GMP (cGMP) dependent protein kinases, and cyclic nucleotide- independent protein
SUBSTITUTESHEET kinases. A variety of serine/ threonine kinases have been identified including glycogen synthase kinase, phosphorylase kinase, casein kinase I casein kinase II, pyruvate dehydrogenase kinase, protein kinase C and myosin light chain kinase.
Amino acid sequences present in natural substrates and artificial peptide substrates which are sufficient for activity as a protein kinase substrate have been identified (see e.g. , Edelman, A.M. et al., Ann. Rev. Biochem. 56: 567-613 (1987); Glass, D.B. and E.G. Krebs, Ann. Rev. Pharmacol. Toxicol. 20: 363-388 (1980) ; Hunter, T. and J.A. Cooper, Ann. Rev. Biochm. 54: 897-930 (1985)). These amino acid sequences, in addition to those described below, and related amino acid sequences which can be specifically phosphorylated, can be used as modification recognition sequences in vectors of the present invention. A corresponding protein kinase, capable of phosphorylating a polypeptide comprising the selected kinase recognition sequence, can be used to phosphorylate fusion proteins of the present invention. For example, the cAMP-dependent protein kinase (e.g. , the catalytic subunit of the cAMP-dependent protein kinase from bovine heart muscle) can recognize the consensus amino acid sequence Arg-Arg-Xaa-Ser-Xaa, where Xaa is an amino acid, in a variety of substrates, resulting in phosphorylation of serine. As shown in the Examples, expression vectors of the present invention (e.g., PGEX-2TK derivatives, or pAR(ΔRI)59/60 derivatives), incorporating a sequence which encodes a version of the cAMP-dependent protein kinase recognition sequence (Arg-Arg-Ala-Ser-Val) , can direct the production of fusion proteins capable of being phosphorylated by cAMP-dependent protein kinase from bovine heart muscle. The Arg-Arg-Ala-Ser-Val sequence is also referred to as the HMK site (HMK, heart muscle kinase) . Other specific amino acid sequences, such as Arg-Arg-Ala-Ser-Leu (Li et al. , Proc. Natl. Sci. USA 86: 558-562 (1989) and the peptide Arg-Thr-Lys-Arg-Ser-Gly-Ser-Val, can be recognized and phosphorylated by this kinase. The phosphorylation sites can function at a terminal or internal location within a fusion protein. cGMP protein kinases have a substrate specificity that is similar, but not identical, to that of the cAMP-dependent protein kinases. Comparative analysis of substrates has been made (Edelman, A.M. et al. , Ann. Rev. Biochem. 56: 567-613, (1987); Glass, D.B. and E.G. Krebs, Ann. Rev. Pharmacol. Toxicol. 20: 363-388 (1980); Glass, D.B. and E.G. Krebs, J. Biol. Chem. 254 : 9728-9738 (1979)). The substrate specificities of the two types of casein kinases has also been studied, and the activity of peptide substrates has been compared (Marin, O. et al. , Eur. J. Biochem. 160: 239-244 (1986) ; Sommercorn, J. and E.G. Krebs, J. Biol. Chem. 262: 3839-3843 (1987); Kuenzel, E.A. et al. , J. Biol. Chem. 262: 9136-9140 (1987)). For example, casein kinase II was shown to phosphorylate the synthetic peptide Ser-Glu-Glu-Glu-Glu-Glu. Additional peptide substrates were phosphorylated by casein kinase II (e.g., in decreasing order of activity,
SUBSTITUTESHEET Arg-Arg-Arg-Asp-Asp-Asp-Ser-Asp-Asp-Asp; Arg-Arg- Arg-Glu-Glu-Glu-Ser-Glu-Glu-Glu; Arg-Arg-Arg-Glu-Glu- Glu-Thr-Glu-Glu-Glu) . The substrate specificity of tyrosine kinases has also been studied (Hunter, T. and J.A. Cooper, Ann. Rev. Biochm. 54: 897-930 (1985)). Candidate phosphorylation sites (recognition sequences) can be incorporated into synthetic peptides or into fusion proteins and assayed for activity as protein kinase substrates, using techniques similar to those described previously. Because corresponding phosphatases exist, enzymatic removal of phosphate groups is also possible.
The ability to modify the fusion proteins provides a convenient method of specifically labeling the fusion proteins with detectable radioactive or non-radioactive labels to produce a labeled fusion protein (e.g., a [ 32P]-labeled fusion protein).
Cleavage of fusion proteins can produce a selected polypeptide portion comprising a modification sequence or an affinity ligand portion comprising a modification sequence, depending on the location of the cleavable linker in relation to these components.
Thus, cleavage following labeling can produce a labeled selected polypeptide portion or a labeled affinity ligand portion, each of which is itself a fusion protein. Because the modification recognition sequence directs modification to a specific site in the protein, there is a high degree of control over the location and extent of modification. As shown in
T the Examples, fusion proteins expressed and modified by the introduction of a detectable label retain both structure and function.
The detectable label (e.g., radioactive, fluorescent or chemiluminescent) selected will determine by the nature of modification, the modification enzyme, and the intended use. The label will be incorporated into a moiety which is transferred to the fusion protein substrate. For phosphorylation reactions, [ -labeled]ATP is used as phosphate donor (i.e., modification donor) . As protein kinases transfer the 7-phosphate onto the substrate during the phosphorylation reaction, the label will be incorporated into the moiety transferred by the protein kinase. For example, to incorporate a radioactive phosphate label such as
31 32 33 31
P, P or P, phosphate donors such as [7 P]ATP,
[7 32P]ATP, or [733PJATP can be used. Alternatively, isotopes of sulfur, such as 35S or 38S isotopes, can be incorporated as the label in the [7-labeled]ATP, as for example in 35S-labeled adenosme
5' [7-thio]triphosphate. The term "phosphorylation" also refers to modification with such thiophosphate analogs or other 7-labeled analogs of ATP. Other considerations such as specific activity, half-life, type of particle emitted and energy of radiation will influence selection of an appropriate radioactive label. In addition, a fluorescent moiety or chemiluminescent moiety incorporated into the 7-phosphate. Detection methods for such labels are well known in the art.
SUBSTITUTESHEET In other types of modifications (e.g., glyco- sylation, fatty acylation) , a radioactive isotope, chemiluminescent or fluorescent dye, or other nonradioactive label, could be incorporated into the modification donor so that it is transferred to the fusion protein during the modification reaction. For example, a biotin adduct of a modification donor could be linked to a fusion protein. In the present method, the site of addition of the biotin label is controlled by the location of the modification recognition sequence.
Specific Embodiments
In one embodiment of the present invention, glutathione-S-transferase (GST) from the parasitic helminth Schistosoma japonica (Mr=26,000) is selected as the affinity ligand. However, GST genes from other sources, such as other bacteria or mammalian organisms, can be used. In addition, a portion of a GST gene encoding a portion of GST capable of binding a specific binding partner, such as the substrate glutathione, can be used. In particular, expression vectors capable of expressing S. japonicum GST-fusion proteins (i.e., a fusion protein comprising a GST affinity ligand) were constructed. The parent vector is named pGEX-2TK, and vectors derived from pGEX-2TK by the insertion of a nucleotide sequence encoding a selected polypeptide, are referred to with the prefix pGTK-. The construction of pGEX-2TK and pGTK- plasmids is described in Examples 1 and 2 and in Figure 1. PGEX-2TK is a derivative of pGEX-2T, the latter which is described by D.B. Smith in EP 0,293,249, published November 30, 1988, PCT/AU88/00164, published December 1, 1988, and New Zealand Patent No. 224,663, issued November 27, 1990, and by D.B. Smith and K.S. Johnson in Gene 67: 31-40 (1988) . The teachings of EP 0,293,249, PCT/AU88/00164, New Zealand Patent No. 224,663, and Smith and K.S. Johnson (Gene 67: 31-40 (1988)) are herein incorporated by reference. pGEX-2TK encodes a fusion protein having GST as an affinity ligand at the amino terminus, followed by a thrombin cleavage site (cleavable linker) . In addition, a modification recognition sequence for a protein kinase was incorporated downstream of the cleavable linker. In particular, a sequence of the structure Arg-Arg-Xaa-Ser-Xaa , where Xaa is an amino acid, was selected (Arg-Arg-Ala-Ser-Val) . This sequence is a phosphorylation site recognized by cAMP-dependent protein kinases such as the catalytic domain of the cAMP-dependent protein kinase from bovine heart muscle. In pGEX-2TK, the sequence which encodes the thrombin cleavage site is followed by several restriction sites for the insertion of sequences encoding a selected polypeptide. pGEX-2TK retains the inducible tac promoter for expression, three in frame stop codons, a selectable marker (ampicillin resistance) , origin of replication, and laclq gene present in pGEX-2T.
When a sequence which encodes a selected polypeptide is inserted in frame into the pGEX-2TK expression vector downstream of the phosphorylation
SUBSTITUTESHEET site, a GTK-fusion protein comprising the selected polypeptide can be produced. The GTK-fusion protein (where, from N- to C-terminus, G is a GST affinity ligand, T is a thrombin cleavage site, and K is a phosphorylation site for a protein kinase) is encoded by a pGTK-vector, can be produced upon expression in a suitable E. coli host. These fusion proteins can be captured on immobilized glutathione, and labeled by phosphorylation with cAMP-dependent protein kinase, using [7- P]ATP as a modification donor. As shown in the Examples, several different cDNAs were expressed from pGEX-2TK, and labeled as described. The studies described in Examples 1-4 indicate that a selected polypeptide present in a GTK-fusion protein, encoded by a pGTK-type vector, such as pGTK-RB
(379-792) , can retain both structure and function. Furthermore, the labeling (phosphorylation) procedure was efficient and did not alter the structure or function of the selected polypeptides.
In another embodiment of the present invention, the FLAG epitope is selected as the affinity ligand. Vector pAR(ΔRI)59/60, shown in Figure 2, is an example of this type of expression vector. Vector pAR(ΔRI)59/60 is a derivative of bacterial expression plasmid pET3a (pAR3040) described by Studier et al. , Meth. Enzymo1. 185:60-89 (1990), the teachings of which are herein incorporated by reference. The construction of pAR(ΔRI)59/60 is described in detail in Example 5. In this construct, the FLAG peptide is located adjacent to the N-terminal initiator methionine (Figure 2) . The FLAG octapeptide (Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys) comprises the FLAG epitope (Asp-Tyr-Lys-Asp) affinity ligand and an overlapping enterokinase cleavable linker (Asp-Asp-Asp-Asp-Lys) . Cleavage with enterokinase typically occurs precisely after the terminal lysine (underlined above) of the FLAG octapeptide. The FLAG octapeptide is followed by a kinase recognition site. In particular, in pAR(ΔRI)59/60, the site is Arg-Arg-Ala-Ser-Val (HMK) , recognized by cAMP- dependent protein kinase. A unique EcoRI restriction site located downstream of the sequence encoding the phosphorylation site can be used for the insertion of a sequence encoding a selected polypeptide. In plasmids derived from pAR(ΔRI)59/60, referred to with the prefix pFEK- F for FLAG, E for enterokinase, and K for the phosphorylation recognition site) , in which a sequence encoding a selected polypeptide has been inserted, the encoded fusion protein, referred to herein as a FEK-fusion protein, has the following general structure for N- to C-terminus: Met-[FLAG/ enterokinase cleavable linker]-phosphorylation site-selected polypeptide. An alanine residue is located between the FLAG peptide and HMK site (see Figure 2) . In addition, the EcoRI site encodes a Glu-Phe dipeptide. Depending on the strategy for insertion of a sequence encoding a selected poly¬ peptide, additional residues (e.g., encoded by a linker or PCR primer) may be inserted between the selected polypeptide and the modification recognition (HMK) site. In many cases, however, only 17 amino acids will be fused to the selected polypeptide.
SUBSTITUTESHEET pFEK-plasmids can direct the expression of FLAG- fusion proteins (i.e., fusion proteins having a FLAG epitope affinity ligand) from the T7 polymerase promoter in an appropriate host (e.g., a bacterial cell capable of constitutive or inducible expression of the T7 polymerase; for examples of suitable hosts and induction protocols, see Studier et al. , Meth. Enzymol. 185: 60-89 (1990) ) . The construction of several pFEK-plasmids is described in Example 5. The encoded fusion proteins were expressed in a bacterial host, and a lysate was prepared. Partially purified fractions containing each fusion protein were subjected to phosphorylation with the catalytic subunit of the cAMP-dependent protein kinase using [τ_ P]ATP as a (labeled) modification donor. The
FEK-fusion proteins were labeled to high specific activity to give [ 32P]-labeled FEK-fusion proteins (which are also [ 32P]-labeled FLAG-fusion proteins) .
Note that vectors of the present invention related to pGTK- and pFEK- vectors can be constructed lacking the cleavable linker sequence. These vectors would have a pGK- or pFK- prefix. A fusion protein comprising a selected polypeptide expressed from a pGK- or pFK-vector is referred to as pGK-fusion protein or pFK-fusion protein, respectively.
A variety of commercially available vectors comprising an affinity ligand and cleavable linker are available which can be modified by the introduction of a modification site (e.g. , a
T phosphorylation site) , and where required, one or more restriction sites for the in-frame insertion of a selected polypeptide using recombinant DNA techniques. For example, (1) the pGEX-3X vector (Pharmacia) comprising a GST affinity ligand and Factor Xa cleavable linker, (2) the pMAL-C vector (Biolabs) comprising a malB affinity ligand and Factor Xa cleavable linker, (3, 4) the protein A gene fusion expression vectors, pRIT2T and pRIT5 (Pharmacia) , comprising a protein A affinity ligand, (5, 6) as well as pDS and pQE-vectors (Qiagen) , comprising a histidine hexamer affinity ligand, could be modified in this way. The vector pRIT2T, vector pRIT5, pDS and pQE-vectors, could be further modified by the insertion of a sequence encoding a cleavable linker. Convenient affinity matrices for fusion proteins encoded by the foregoing were discussed above.
Methods of Producing a Modified Fusion Protein
The present invention further relates to methods for producing a modified fusion protein. In particular, a rapid and convenient method for labeling a fusion protein using a radiolabel or non-radioactive label donated by [7-labeled]ATP is provided. In addition, the methods used are mild; a feature which is important for preservation of the structure and biological activity (e.g., binding, antigenicity, activity) of the components of the fusion protein, and of the selected polypeptide in particular.
SUBSTITUTESHEET For example, fusion proteins of the present invention comprising an affinity ligand portion and a selected polypeptide portion are expressed in a host cell carrying a vector of the present invention. The host cells are propagated under conditions which permit expression of the vector. For example, expression from the particular pGTK- and pFEK- vectors described in the Examples requires induction by IPTG. The host cells are lysed using known techniques to obtain a lysate containing the fusion protein.
In one embodiment, the lysate can be crudely fractionated as in Example 5 and directly labeled in the presence of [7~labeled]ATP (e.g. ,[7 - 32P]ATP) , and a cAMP-dependent protein kinase, such as the catalytic subunit of the cAMP-dependent protein kinase from bovine heart muscle. A suitable formulation for a kinase reaction buffer and reaction stop buffer is given below. In another embodiment, the fusion protein is modified (e.g., phosphorylated) while bound to an affinity matrix. The fusion protein present in the lysate is captured on an affinity matrix. This step is carried out by contacting the lysate with an appropriate affinity matrix (i.e., an affinity matrix comprising a specific binding partner of the affinity ligand present in the fusion protein) , under conditions which permit binding of the affinity ligand portion of the fusion protein to the affinity matrix. Suitable conditions for a variety of affinity ligands and matrices are known in the art. For example, GST-fusion proteins (e.g., a GTK-fusion) can be captured on immobilized glutathione as an affinity matrix. As shown in Example 2, the GST-fusion protein can be captured on the affinity matrix in the presence of a wash buffer, (which was also used as the lysis buffer) . The components of the wash buffer permit binding of the GST portion of the fusion proteins to the matrix via the specific binding partner (glutathione) . In one embodiment, the wash buffer comprises a buffer such as Tris, salt (e.g., NaCl), a chelator (e.g., EDTA), and a non-ionic detergent such as nonidet P-40.
As discussed above, FEK-fusion proteins can be captured on an anti-FLAG antibody affinity matrix. When the FLAG peptide is internal (e.g., not immediately at the N-terminus, as in pAR(ΔRI)59/60) , anti-FLAG M2 antibody can be used as the specific binding partner. Suitable conditions and formulations (e.g., for wash buffer and elution buffer) for anti-FLAG M2 affinity chromatography can be obtained from International Biotechnologies, Inc.
Once bound, the fusion proteins can then be washed in order to remove unbound material (e.g., contaminating proteins other than the fusion protein) . The wash buffers described above are suitable for this purpose.
The affinity matrix with bound fusion protein attached is then equilibrated (contacted) with a reaction buffer suitable for the modification reaction. In a phosphorylation reaction, for example, a kinase buffer such as HMK buffer is
SUBSTITUTESHEET selected. The particular kinase reaction buffer will be adapted to the kinase selected. Suitable reaction buffers for a variety of characterized kinases are available. HMK reaction buffer, suitable for use with the cAMP-dependent protein kinase from bovine heart muscle (HMK) , comprises (1) a buffering agent such as Tris, (2) a reducing agent such as dithiothreitol (DTT) , (3) a salt such as NaCl, and (4) a source of magnesium ions or other appropriate ion, such as MgCl . Although the DTT can be added separately, it is present in the kinase reaction buffer. In addition, a modification enzyme and modification donor are added. In one particular phosphorylation reaction, a protein kinase prepara- tion comprising the HMK enzyme is added, and [7- labeled]ATP. Typically, the protein kinase will be diluted in a suitable solution prior to addition. In the examples, a variety of fusion proteins were efficiently radiolabeled by phosphorylation using HMK reaction buffer, the catalytic subunit of the HMK and
[7- 32P]ATP as a modification donor.
The conditions appropriate for phosphorylation of the bound fusion protein to produce a bound labeled (e.g., radiolabeled) fusion protein will vary with the protein kinase selected. Typically, the steps subsequent to culturing the host cell are carried out at 4 °C. However, phosphorylation reactions with the catalytic subunit of the cAMP-dependent protein kinase from bovine heart muscle, have previously been carried out at 37 °C.
In the present method, phosphorylation can be carried
EET out at 37 °C. However, in another embodiment of the present invention, efficient phosphorylation is carried out at 4 °C. This temperature can preserve the activity of fusion proteins, and of the selected polypeptide portion in particular.
Optionally, the reaction can be quenched by addition of a stop buffer comprising a component capable of inhibiting the modification reaction. The component capable of inhibiting the reaction will vary with the modification enzyme, but can include inhibitors (e.g., reaction products, competitors), chelators, or other agents which stop the reaction. It can be desirable to stop a reaction when, for example, in subsequent steps, undesired modifications can occur. However, wash steps can serve to remove modification enzymes.
For example, in a phosphorylation reaction with cAMP-dependent protein kinase, an HMK stop buffer can be added, comprising a buffer such as sodium phos- phate, a reaction inhibitor such as sodium pyrophos- phate, and a chelator such as EDTA. HMK stop buffer can optionally contain a carrier such as bovine serum albumin or glycerol. As a reaction product of a phosphorylation reaction, sodium pyrophosphate inhibits the kinase reaction. In addition, chelating Mg 2+ i.ons i.nhi.bi.ts the reaction.
An advantage of the present method is that unincorporated label is easily removed by washing the affinity matrix with bound labeled fusion protein. The wash buffers described above, which permit binding of the affinity ligand to the affinity matrix
SUBSTITUTESHEET via the specific binding partner can be used in this step. Other methods for labeling proteins with the HMK enzyme have relied upon extensive dialysis to remove unincorporated label (e.g., Zhao, X.-X. et al., Analyt. Biochem. 178: 342-347 (1989)). The removal of unincorporated label by washing is advantageous because extensive dialysis can compromise the function of some proteins and results in the production of large quantities of contaminated dialysis buffer.
If desired, the modified fusion protein may be isolated for use by releasing the fusion protein from the affinity matrix by washing with a suitable elution buffer. Elution buffers were discussed above. For example, a labeled GST-fusion protein such as a GTK-fusion protein can be eluted from immobilized glutathione with an elution buffer comprising glutathione (e.g., reduced glutathione). In one embodiment, an elution buffer comprising reduced glutathione, a buffer such as Tris, and a salt such as NaCl is used to release labeled GST-fusion proteins from the affinity matrix.
In the case where a cleavable linker is present, after elution, the fusion protein can be cleaved in vitro to free the modified (e.g., labeled) polypeptide portion. Cleavage is accomplished by contacting the fusion protein with an appropriate specific protease under conditions which permit the cleavage reaction. Such conditions for cleavage by enterokinase, thrombin and factor Xa, for example, are known. The affinity ligand portion or any uncleaved product can be removed by adsorption on the appropriate affinity matrix in the presence of wash buffer for example. The modified or labeled selected polypeptide portion can be recovered in the supernatant.
Optionally, prior to elution from the affinity matrix, the modified polypeptide portion can be released from the affinity ligand portion by cleaving the modified fusion protein at the cleavable linker. An example protocol for cleavage by a specific protease on a column is provided in Abath, F. and A. Simpson, Biotechniques 10: 178 (1991), the teachings of which are herein incorporated by reference.
Smith (EP 0,293,249; PCT/AU88/00164; NZ 224,663) and Smith and Johnson (Gene 67: 31-40 1988)) also describe conditions for capture of GST-fusion proteins on immobilized glutathione, washes, elution and cleavage protocols.
In one embodiment of the present invention, a kit for preparing a labeled fusion protein is provided, comprising (1) an affinity matrix to which a portion of the fusion protein can bind, (2) a wash buffer which permits binding of the fusion protein to the affinity matrix, (3) a modification reaction buffer such as a protein kinase reaction buffer, and (4) a modification enzyme preparation such as a protein kinase preparation. Optionally, the kit can contain a reaction stop buffer or an elution buffer comprising a release component. The nature of these elements has been explained above in more detail. In another embodiment, a kit can comprise a vector of the present invention (e.g. , pGEX-2TK or
SUBSTITUTESHEET pAR[ΔRI]59/60) . The vector can be present in a suitable host cell or free from the host cell. Furthermore, the kit comprises each of the elements (1) through (4) of the kit described above, and if desired, a reaction stop buffer. In other embodiments, a modification donor such as 7-labeled ATP can also be included in the kit.
In a phosphorylation reaction, a protein kinase preparation is provided comprising a protein kinase. In the protein kinase preparation, the kinase may be in the form of a precipitate or solution. The kinase can be dissolved or diluted as necessary prior to use.
The affinity matrix can be present in a suitable containing means (e.g., a vial or cartridge). In one embodiment, the affinity matrix is provided in a cartridge unit, the combination of which acts as a column. Various components (e.g., the lysate, wash buffers, label) can be inserted into the cartridge and passed through the affinity column to facilitate the labeling procedure. In one embodiment, immobilized glutathione can be incorporated into a cartridge unit for capture of GST-fusion proteins.
Uses of the Present Invention
Vectors of the present invention can direct the expression of large amounts of a fusion protein in a host cell. These fusion proteins can be conveniently labeled using the methods described, and have many uses in research, diagnostic and therapeutic applications.
ET For example, radiolabeled proteins are useful in imaging. In one embodiment, radiolabeled fusion protein or radiolabeled selected polypeptide portion can be used to locate cells carrying a ligand recognized by the selected polypeptide (e.g., a receptor, cell surface component, or antigen) . For example, a growth factor can be used as the selected polypeptide, and a labeled fusion protein of the present invention comprising the growth factor can be used to detect cells carrying a receptor for the growth factor.
Similarly, incorporation of a suitable radiolabel permits the use of fusion proteins of the present invention in targeted radiotherapy. In other words, the selected polypeptide (e.g., a hormone, growth factor, antibody) present in the fusion protein can seek out specific cells and damage or destroy them due to the attached radiolabel. An isotope with the desired energy and half-life can be selected for this purpose. Other labels incorporated by methods of the present invention and capable of killing cells can also be used.
For example, a labeled antibody or portion thereof (e.g., F-, , Fab, etc.) produced by the method of the present invention could be used in this manner. Such fusion proteins would be useful for many other applications as well. The term antibody as used herein refers to antibodies such as chimeric antibodies, single chain antibodies, bifunctional antibodies and other antibody variants, including individual chains (e.g. , a heavy chain) . A variety
SUBSTITUTESHEET of methods for expressing immunoglobulins are available (see e.g., Skerra, A. and A. Pluckthun, Science 240: 1038-1041 (1988)).
In one embodiment, antibody chains are expressed as fusion proteins of the present invention in E. coli. A signal sequence (e.g., ompA, phoA) is fused to the N-terminus of an antibody chain, which is followed by at least one modification recognition sequence, an optional cleavable linker, and an affinity ligand. On expression in E. coli, the signal sequence is cleaved from the fusion protein, freeing the amino terminus of the antibody chain, and preserving the binding function of the variable region- Fusion genes encoding both chains can be expressed in this manner from two vectors or a single vector encoding two fusion genes. Alternatively, either the heavy or light chain could be expressed as a fusion protein of the present invention and the complementary chain could be expressed in the same cell using typical antibody expression vectors.
In addition, as shown herein, labeled fusion proteins of the present invention can be used as probes. As shown in Example 4, labeled fusion proteins such as 32P-GTK-fusion proteins, can be used as probes for screening expression libraries for proteins capable of binding the selected polypeptide.
For example, a fusion protein of the present invention comprising a selected polypeptide is expressed, purified by virtue of the affinity ligand, and specifically labeled at the modification recognition sequence. The labeled, homogeneous fusion protein is then incubated with filters on which proteins expressed by individual plaques of a target library have been immobilized. Following this hybridization step, the filters are washed, and processed for identification (i.e., by detection of the label) of plaques which produce recombinant proteins capable of specifically interacting with the labeled fusion protein probe.
In particular, a cDNA encoding a portion (the E7 binding domain) of the retinoblastoma susceptibility gene product (pRB) was prepared as a GTK-fusion protein and used to screen a library for proteins which interact with pRB. A number of positive clones were isolated. One of these clones, designated RBAPl, displays properties expected of a cellular protein capable of interacting with pRB. Labeled fusion proteins of the present invention comprising a selected polypeptide can also be used as probes in Western blot formats (Example 4) .
In addition, as shown in Example 4, labeled fusion proteins of the present invention can be used to identify proteins present in complex mixtures (e.g., whole cell extracts) , which are capable of interacting with the selected polypeptide. The fusion proteins can be used in protocols to isolate the interacting proteins. For example, the interacting proteins can be captured by the selected polypeptide portion of a labeled fusion protein which is bound to an affinity matrix. Subsequent cleavage at a cleavable linker could release the complex of the interacting protein and a labeled selected
SUBSTITUTESHEET polypeptide portion for further analysis. Alternatively, a purified labeled fusion protein probe can also be used to determine the amount of or to detect an interacting protein or other substance in a complex mixture using immunoassay techniques, for example.
In one embodiment, the binding of a specifically interacting protein (e.g., an antigen, hormone) or other substance to an immobilized (e.g. , bound to an affinity matrix) fusion protein of the present invention can induce a change in the attributes of the bound fusion protein. Other substances include, but are not limited to agents such as a drug, toxin, substrate or other ligands capable of interacting with the selected polypeptide portion of the fusion protein. The difference between the unbound versus the bound state could be taken advantage of, as for example, in an assay format. For example, the presence of the interacting protein or other substance in a complex mixture (e.g., blood) could be monitored. For example, binding of an interacting protein could induce a conformational change that shields the labeling (e.g. phosphorylation) site. The complex mixture containing an interacting protein is contacted with the fusion protein, which is bound to the affinity matrix. The combination is washed, equilibrated with a suitable reaction buffer and subjected to the labeling procedure. The extent of labeling is compared to a control which lacks the interacting polypeptide and the presence of the interacting protein in the complex mixture is indicated by inhibition of labeling. By comparison with the degree of inhibition achieved by a standard, such a procedure can also provide information regarding the quantity of interacting protein which is present.
The vectors and methods of the present invention can facilitate the characterization of cDNA clones. For example, an uncharacterized cDNA can be inserted into a vector of the present invention allowing rapid purification of the protein, without specific knowledge of its properties. The protein can be labeled with the maintenance of biological activity, and the labeled fusion protein can be used to detect potential interactions with other cellular proteins.
The present invention will now be illustrated by the following Examples, which are not intended to be limiting in any way.
Introduction to Examples 1-4
Recently it has been demonstrated that the adenovirus E1A protein, SV40 and polyomaviral T antigens, and the human papillo a virus E7 protein can bind to the retinoblastoma susceptibility gene product, pRB. Neither these viruses nor these proteins are thought to be closely related to one another, yet each of these proteins shares a short, homologous, colinear, transforming sequence, typified by E1A conserved region 2 (CR2) , which appears to be responsible for high affinity binding to pRB. The region of pRB which interacts with this motif has
SUBSTITUTESHEET been mapped. To date, all spontaneously occurring, loss of function, pRB mutations which do not grossly compromise pRB stability, map to this region. This led to the hypothesis that pRB, in the course of regulating cell growth, must interact with one or more cellular proteins bearing a sequence structurally resembling the viral pRB-binding motif. If such a model were correct, one would predict that pRB inactivation might be achieved either as a result of competing complex formation with one of the above viral oncoproteins, or as a result of RB mutations affecting the T/ElA/E7-binding domain.
Example 1
Construction of pGEX-2TK
Two synthetic oligonucleotides were designed, which, on annealing, encode a cAMP-dependent protein kinase recognition site and three restriction sites. Codon utilization for the synthetic duplex was based on the prokaryotic codon utilization data of (Grantham, R. et al. , Nucleic Acids. Res, j?: r43-r74 (1981) ) .
The oligonucleotides were synthesized using standard techniques and were annealed in vitro (Gait, M.C.J., (1984) Oligonucleotide Synthesis: A Practical Approach, (I.R.L. Press; Oxford)). The resulting duplex has 5'-overhangs compatible with Ba HI- and EcoRI-cut DNA and is shown below:
ArgArgA1aSerVa1 5'-GATCTCGTCGTGCATCTGTTGGATCCCCGGG
AGCAGCACGTAGACAACCTAGGGGCCCTTAA-5'
Plasmid pGEX-2T (Pharmacia) was linearized with BamHI and EcoRI, and the vector fragment was ligated to the synthetic duplex to make plasmid pGEX-2TK. Incorporation of the duplex into pGEX-2T was confirmed by restriction and DNA sequence analysis. DNA sequencing was performed using a Sequenase 2.0 kit (United States Biochemical Corp.) with the protocol provided by the manufacturer. The structure of pGEX-2TK in the region surrounding the kinase site is shown in Figure l.
SUBSTITUTESHEET As can be seen in Figure 1, the incorporation of the synthetic duplex DNA into pGEX-2T resulted in the insertion of codons for a cAMP-dependent protein kinase recognition sequence, Arg-Arg-Ala-Ser-Val, immediately downstream of the thrombin recognition site present in parent plamid pGEX-2T. In addition, the synthetic duplex encoded a multiple cloning site (MCS) downstream of the kinase recognition sequence, such that the insertion of the duplex led to the regeneration of the MCS of the parent plasmid, with restriction sites for BamHI, S al, and EcoRI. As in the parent plasmid, the MCS is followed by a sequence with stop codons in all three reading frames.
Example 2
3 Generation of P-GST Fusion Proteins
Construction of pGEX-2TK Vectors Encoding RB Proteins
RB cDNAs encoding residues 379-792 or residues 379-928 of the retinoblastoma susceptibility gene product, both of which span the T/ElA-binding region, were subcloned into pGEX-2TK. In addition, an RB cDNA encoding residues 379-928, having a naturally occurring, loss of function, RB point mutation (379-928;706F) , which is known to abrogate T/E1A binding, was subcloned into pGEX-2TK. These pGEX-2TK reco binants are named pGTK-RB(379-792) , pGTK-RB(379-928) and pGTK-RB(379-928;706F) , respectively. The encoded fusion proteins are named GTK-RB(379-792) , GTK-RB(379-928) and GTK-RB(379-928;706F) , respectively. These RB fusion proteins are also referred to as GST-RB fusion proteins or RB fusion proteins.
Two of the RB cDNAs had been generated in an earlier study by Kaelin et al. (Kaelin, W.G. et al. , Mol. Cell. Biol. 10(7) : 3761-3769 (1990)), incorporated herein by reference, using the polymerase chain reaction. The RB cDNA inserted into pGEX-2TK to make pGTK-RB(379-928) corresponds to RB deletion RB dl 1-378 (Kaelin, W.G. et al. , Mol. Cell. Biol. 10(7) : 3761-3769 (1990)) . The RB cDNA inserted into pGEX-2TK to make pGTK-RB(379-792) corresponds to RB deletion RB dl 1-378;793-928. Residue 379 is a methionine residue. Each amplimer used to generate the cDNAs contained a BamHI site, such that the resulting PCR product, upon digestion with BamHI, could be ligated in frame into the unique BamHI site in pGEX-2T (Pharmacia). The 3' amplimer contained a TGA stop codon as well. The PCR fragments encoding these two cDNAs were each cleaved with BamHI and cloned into pGEX-2T, which had been linearized with BamHI, and treated with calf intestinal phosphatase. The resulting constructs were named pGT-RB(379-928) and pGT-RB(379-792) . The cDNAs were cleaved from the latter constructs using BamHI, and were each subcloned into pGEX-2TK, which had been cleaved with BamHI. The resulting plasmids are named pGTK-RB(379-928) and pGTK-RB(379-792) . Two additional amino acids were incorporated into the sequence at the BamHI site due to the structure of
SUBSTITUTESHEET the PCR primers used to clone both cDNAs, such that the amino acid sequence at the junction of the phosphorylation (kinase) site and the first RB residue (Met ) is:
(-Arg-Arg-Ala-Ser-Val-Gly-Ser-Ala-Thr-Met -) .
The mutant RB cDNA and referred to herein as either RB(379-928;706F) or RB(379-928;pm706) corresponds to the cDNA designated (379-928;pm 706) constructed by Kaelin et al. (Kaelin, W.G. et al. , Cell 64: 521-532 (1991)), also incorporated herein by reference. The mutant cDNA fragment was generated by PCR of reverse-transcribed RNA from cells containing the mutation. The PCR product was cleaved at the unique Ncol and Bsml sites in the RB gene (which span the 706F mutation), and ligated into pGT-RB(379-928) , from which the wild type RB cDNA Ncol-Bsml segment had been excised. The resulting plasmid, pGT-RB(379-928;706F) , encoded the 379-928;706F cDNA. The plasmid was cleaved with BamHI to release the cDNA, and the cDNA fragment was subcloned into pGEX-2TK which had been cleaved with BamHI to make pGTK-RB(379-928;706F) . The amino acid sequence at the junction of the phosphorylation (kinase) site and the first RB residue (Met _ ) is also
(-Arg-Arg-Ala-Ser-Val-Gly-Ser-Ala-Thr-Met37g-) .
The products encoded by all three cDNAs have been tested previously for their ability to bind to SV40 T antigen, the adenovirus E1A gene product, and putative cellular RB-binding proteins, when expressed as pGEX-2T encoded GST fusion proteins. The fusion proteins encoded by pGT-RB(379-928) and pGT-RB(379-792) were able to bind to T antigen, E1A, and the putative cellular RB-binding proteins, while the mutant protein encoded by pGT-RB(379-928;706F) did not (Kaelin, W.G. et al. , Cell 64: 521-532 (1991)).
Expression and Purification of GST-Fusion Proteins
The expression of the pGEX-2TK-encoded protein and recombinant pGEX-2TK-encoded GST fusion proteins in E. coli (DH5Q; Bethesda Research Laboratories), and the subsequent recovery of the proteins on glutathione sepharose was carried out essentially as described by Smith and Johnson (Smith, D.B. and Johnson, K.S., Gene 67: 31-40 (1988); Kaelin, W.G. et al. , Cell 64: 521-532 (1991)). Fresh overnight cultures of E. coli DH5 , transformed with either PGEX-2TK or pGEX-2TK recombinants, were diluted 1:10 in Luria-Bertani (LB) medium containing ampicillin (100 μg/ml) and incubated for a total of 5 hours at 37 °C, with shaking. After 1.0 hour of growth at 37 °C, isopropyl-/.-D-thiogalactopyranoside (IPTG; Bethesda Research Laboratories) was added to a final concentration of 0.1 mM.
For analysis of total bacterial protein content, aliquots of each bacterial culture were pelleted in a microcentrifuge, were boiled in urea-SDS cracking buffer (0.01 M sodium phosphate [pH 7.2], 1% .-mercaptoethanol, 1% SDS, and 8M urea) , and were loaded onto an SDS-polyacrylamide gel. Proteins were visualized by Coomassie blue staining.
For phosphorylation of protein and/or protein
SUBSTITUTESHEET recovery using glutathione sepharose (Pharmacia) , bacterial cultures were pelleted at 5000 x g for 5 min at 4 °C, and resuspended 1/10 the original culture volume of NETN (20 mM Tris [pH 8.0], 100 mM NaCl, 1 mM EDTA, 0.5% Nonidet P-40) . The bacteria were then lysed on ice by mild sonication and subjected to centrifugation at 10,000 x g for 5 min at 4 °C. The clarified bacterial sonicates, containing the pGEX-2TK-encoded protein or relevant GST fusion protein, were rocked for 15-30 minutes at 4 °C with glutathione sepharose (20-30 μl/ml bacterial sonicate) . The glutathione sepharose beads (Glutathione Sepharose 4B, Pharmacia) ) had been washed three times and resuspended 1:1 v/v in NETN (see above) supplemented with 0.5% non-fat powdered milk prior to use. This step and subsequent steps were also carried out at 4°C.
Phosphorylation of Proteins
The sepharose beads, with bound protein, were then washed three times with NETN followed by one wash with IX HMK buffer (20 mM Tris pH [7.5], 100 mM NaCl, 12 mM MgCl ) . The supernatant was aspirated with a 23G needle and the sepharose was resuspended in 2-3 bead volumes of IX HMK buffer containing 1 unit/μl of the catalytic subunit of cAMP-dependent protein kinase (Sigma Chemical Co., St. Louis, MO), 1 μCi/μl 32P-γATP (6000 Ci/mMole, 10 mCi/ml, New England Nuclear) , and 1 mM DTT. The kinase reaction was allowed to proceed for 30 minutes at 4 °C, with periodic agitation of the sepharose to maintain a suspension. The reaction was terminated by the addition of 1.0 ml of HMK stop buffer (10 mM Na Phosphate [pH 8.0], 10 mM Na Pyrophosphate , 10 mM EDTA, 1 mg/ml bovine serum albumin) . Following a brief spin in a microcentrifuge, the supernatant was again removed using a 23 G needle and the sepharose was washed 5 times with NETN to remove any unincorporated label. Incorporation of label can be determined while the protein is attached to the bead.
Elution With Reduced Glutathione
For further studies, the fusion protein can be eluted from the beads. After the final wash the residual supernatant was aspirated using a 23 G needle and the labeled or unlabeled GST fusion protein was eluted by rocking the sepharose for 10-15 minutes in 10-50 bead volumes of 20 mM reduced glutathione, 100 mM Tris [pH 8.0], 120 mM NaCl.
Effect of HMK Sequence on 32P-Incorporation In Vitro
The ability of the Arg-Arg-Ala-Ser-Val sequence to convert the GST fusion proteins into substrates for the catalytic subunit of cAMP dependent protein kinase was tested. GST fusion proteins encoded by the pGEX-2TK recombinants, as well as the corresponding pGEX-2T constructs, were overexpressed in bacteria and recovered on glutathione sepharose as described above. As described in more detail above, the purified fusion proteins, while still non-covalently bound to glutathione sepharose, were incubated with 32P-7-ATP and the puri.fi.ed catalytic
SUBSTITUTESHEET εubunit of cAMP-dependent protein kinase for 30 min at 4°C, after which a kinase stop buffer was added. The sepharose, with bound protein, was extensively washed to remove unincorporated label, transferred to clean eppendorf tubes, and subjected to Cerenkov counting.
The GST-RB fusion proteins were readily phosphorylated in vitro, provided that the kinase recognition sequence was present. Figure 2 shows the effect of inserting an HMK sequence on 32P incorporation (cpm) in vitro data on four GST-RB fusion proteins. In particular, GST-RB(379-792) and
GST-RB(379-792;pm706) fusion proteins which were encoded by pGEX-2TK and which contained an HMK sequence (hatched bars) , showed significant incorporation of 32P. In contrast, GST-RB(379-792) and GST-RB(379-792;pm706) fusion proteins which were encoded by pGEX-2T and lacked an HMK sequence (solid bars) , were not significantly radiolabeled. In another experiment, fusion proteins from 10 ml of bacterial culture were recovered on 25 μl of glutathione sepharose beads, and were subjected to phosphorylation using the same protocol. A GST-RB(379-928) fusion protein expressed from pGEX-2TK and having a kinase site was labeled with
32P (3 X 106 cp ), while the GST-RB(379-928) fusion protein expressed from pGEX-2T and lacking a kinase
4 site was not comparably phosphorylated (1 X 10 cpm) .
Similarly, a GST-RB(379-928) ;pm706 fusion protein expressed from pGEX-2TK and having a kinase site was labeled with 32P (3 X 106 cpm) , while the
GST-RB(379-928) ;pm706 fusion protein expressed from pGEX-2T and lacking a kinase site was not comparably
4 phosphorylated (1 X 10 cpm) . Example 3
Structural Integrity of Fusion Proteins
Expression of Fusion Proteins in E. coli
As discussed above, the GST-RB fusion proteins expressed from pGEX-2T-derived plasmids pGT-RB(379-928) and pGT-RB(379-792) were able to bind to T antigen, E1A, and the putative cellular RB-binding proteins, while the mutant fusion protein encoded by pGT-RB(379-928;706F) did not (Kaelin, W.G. ___________:•' Cell 64: 521-532 (1991)). This observation suggests that the binding function and structure of the RB proteins is not grossly disturbed when they are expressed as part of a fusion protein with GST. It was further determined that the insertion of the Arg-Arg-Ala-Ser-Val (RRASV) sequence did not grossly alter bacterial expression of GST-RB fusion proteins expressed from pGEX-2TK. For this determination, whole cell lysates of IPTG-induced cultures were prepared and total bacterial protein content was analyzed directly on SDS-polyacrylamide gels as described above. Protein from E. coli transformed with pGTK-RB(379-928) , pGTK-RB(379-792) , or pGTK-RB(379-928;pm706) were analyzed. The band intensity for each fusion protein was comparable to that for the corresponding construct lacking the phosphorylation site. Each fusion protein had an apparent molecular weight consistent with the size of the pRB fragment encoded by the RB cDNA insert and the 26 kD GST polypeptide.
SUBSTITUTESHEET Purification of Fusion Proteins
The subsequent purification of GST-RB fusion proteins expressed from pGEX-2TK was not altered by the insertion of the RRASV sequence. Bacterial sonicates of each of the transfor ants were also prepared, and proteins were subjected to glutathione sepharose chromatography as described above. The sepharose beads, with bound protein, were then washed three times with NETN, and the bound proteins were eluted by boiling in SDS sample buffer (2% SDS, 10 % glycerol, 62 mM Tris [pH 6.8]). The supernatant was subjected to electrophoresis on a 10% polyacrylamide gel and visualized by Coomassie blue staining. After purification, the intensity of the bands for GTK-fusion proteins expressed in E. coli transformed with pGTK-RB(379-928) , pGTK-RB(379-792) , or pGTK-RB(379-928?pm706) was comparable to that observed with for the corresponding constructs lacking the kinase site (i.e., pGT-RB(379-928) , pGT-RB(379-792) , or pGT-RB(379-928;pm706) , respectively)
The following materials and methods were used in subsequent experiments.
Cells and Culture Conditions The retinoblastoma cell line WERI-Rb27 (a gift of Dr. Wen-Hwa Lee) Huang, H.-J. et al. , Science 242: 1563-1566 (1988)) and-293 cells, a human embryonic cell line transformed by a fragment of the Adenovirus 5 genome (Graham, F.L. et al. , J. Gen. Virol. 36: 59-72 (1977)), were grown in Dulbecco's modified Eagle's medium (DMEM) with 10% fetal calf serum (Gibco) . Akata cells were grown in RPMI or DMEM, with 10% fetal calf serum (Gibco) . C57B1/6 primary mouse embryo fibroblasts (MEF) lines expressing either wild-type (MEFΦTex) or mutant (MEFΦK1) forms of T antigen were grown in Dulbecco's modified Eagle's medium (DMEM) with 10% fetal calf serum (Gibco) and G418 (150 ug/ml) (Ewen, M.E. et al. , Cell 5 : 257-267 (1989)). All cells were grown at 37°C in a humidified, 10% CO -containing atmosphere. Radioisotopic labelling of cells and preparation of cell lysates was as described previously (Kaelin, W.G. et al. , Cell 64: 521-532 (1991)).
Antibodies
Tissue culture supernatants were the source of monoclonal antibodies PAb 419 and M73 (Harlow, E. et al., J. Virol. 3£: 861-869 (1981); Harlow E. et al. , J. Virol. 55: 533-546 (1985)). The use of these antibodies for immunoprecipitation and western blotting was described previously (Kaelin, W.G. et al., Cell 64: 521-532 (1991)), except that electro- phoretic transfer of proteins to nitrocellulose was performed without the addition of methanol to the transfer buffer.
Binding of Fusion Proteins to T Antigen and E1A
It was also determined that the insertion of the RRASV sequence did not demonstrably alter the binding behavior of GST-RB fusion proteins, GTK-RB(379-928) ,
SUBSTITUTESHEET GTK-RB(379-792) , and GTK-RB(379-928;706F) , expressed from pGEX-2TK. The pRB binding assays for T antigen and EIA binding were performed essentially as described by Kaelin et al. (Kaelin et al. , Cell 64: 521-532 (1991)).
Fusion proteins GTK-RB(379-928) and GTK-RB(379-792) (bound to glutathione-sepharose) retained the ability to bind to SV40 T antigen or adenovirus EIA in solution. In contrast, no significant binding of T antigen or EIA to the mutant RB fusion, GTK-RB(379-928;706F) , or GST expressed from pGEX-2TK was observed under the same conditions.
Integrity of Phosphorylated Fusion Proteins TO determine whether the kinase reaction led to a significant alteration in the RB T/ElA-binding region of the GTK-RB fusion proteins, the following experiment was performed. Whole cell lysates containing wild-type T antigen (MEFΦTex cells) , the RB-binding and transformation defective T antigen mutant Kl (MEFΦK1 cells) , or EIA (293 cells) were re¬ solved by SDS-polyacrylamide gel electrophoresis and transferred to nitrocellulose filters (See protocols in Example 4) . GST-RB fusion proteins were kinased in vitro and eluted from the glutathione sepharose in the presence of reduced glutathione as described above. The eluted labeled protein was incubated overnight with nitrocellulose strips cut from the filters. The filters were then washed and subjected to autoradiography. Hybridization conditions and washes were as described below in Example 4, Hybridization of Filters. Similar to the unphosphorylated version, 32P-GTK-RB(379-792) bound to EIA and wild-type T from MEFΦTex cells, but not to the T mutant Kl from MEFΦKl cells. The presence of equivalent amounts of T and κi in this assay was confirmed by immunoblotting with a monoclonal antibody directed against T antigen.
The binding of 32P-GTK-RB(379-792) protein to EIA was inhibited by the presence of a synthetic peptide replica of the human papillo avirus E7 RB-binding motif (wild-type E7 residues 16-32) . In contrast, a point mutant derivative of this peptide with a glu to gin change at residue 26, which is known to abrogate in vitro RB binding, was inert as an inhibitor. In addition, 32P-GTK-RB(379-928) , but not 32P-GTK-RB(379-928) ;706F) , exhibited EIA binding in this assay. Thus, it appeared that the structural integrity of the T/ElA-binding regions in the GST-RB chimeras was preserved during the kinase procedure and subsequent elution. The p ri ents described in this example indicate that the expression, behavior during purification and structural integrity of different polypeptides inserted into pGEX-2TK is not grossly affected by insertion of a kinase sequence or by subsequent phosphorylation.
SUBSTITUTESHEET Example 4
Screening for Proteins Capable of Specific
Interaction with the Protein of Interest and
Isolation of Genes Encoding Such Proteins
Library Manipulations
For primary screening, libraries were plated at approximately 40,000 pfu/150 mm plate x 30 plates.
Induction of ø-galactosidase fusion protein expression with IPTG impregnated nitrocellulose filters was performed as described by Singh et al.
(Biotechnigues 7: 252-261 (1989)). Additional library manipulations including screening with
32 P-labelled cDNA probes, purification of specific clones, preparation of recombinant phage DNA, and subcloning of cDNAs into the sequencing vector pBKS
(Stratagene) were performed using standard techniques.
Preparation of Nitrocellulose Filters for Hybridization
Electrophoretic transfer of proteins from SDS-polyacrylamide gels to nitrocellulose for western blot analysis was carried out in 192 mM glycine, 25 mM Tris(Base), and 0.01% SDS. Plaque lifts were performed as described above. The nitrocellulose filters were placed, directly into IX HBB (25 mM Hepes-KOH[pH7.7], 25 mM NaCl, 5mM MgCl2,l mM DTT) supplemented with 5% non-fat powdered milk and 0.05% NP-40 without drying and incubated overnight. This and subsequent manipulations were performed at 4°C with gentle rocking. The filters were then denatured and renatured as described by vinson et al. (Genes and Dev. 2_: 801-806 (1988)). Processing of multiple filters was done in batch.
Following renaturation, the filters were placed in fresh IX HBB supplemented with 5% non-fat powdered milk and 0.05% NP-40, and incubated for 1 hour. The filters were then incubated in IX HBB supplemented with 1% non-fat powdered milk and 0.05% NP-40 for at least 30 minutes prior to hybridization.
Hybridization of Filters
To prepare the hybridization solution, protein expression in E. coli (DH5α) transformed with pGEX-2TK and pGEX-2TK-derived plasmids were induced with IPTG as described above in Example 2. The bacteria were then pelleted and resuspended in 1/10 the original culture volume in Hyb 75 (20 mM Hepes [pH 7.7], 75 mM KCl, 0.1 mM EDTA, 2.5 mM gCl2, 1 mM DTT, 0.05% NP-40). The bacterial suspension was sonicated with a probe type sonicator (Branson) and centrifuged at 10,000 x G for 5 min at 4°C. For western blots the clarified supernatant was used undiluted or diluted 1:2 with Hyb 75. For screening plaque lifts the clarified supernatant was diluted 1:2-1:8 with Hyb 75 supplemented with 1% nonfat powdered milk.
The relevant 32P-labeled GST fusion protein was added at 100,000-250,000 cpm/ml. Hybridization was carried out at 4°C with gentle rocking overnight.
BSTITUTESHEET after which the filters were washed three times (10-15 in/wash) with Hyb 75 with 1% nonfat powdered milk. The filters were then dried, covered with Saran wrap and exposed to film at -70°C with an intensifying screen.
Screening Precleared Lysates for Specifically Interacting Proteins
Previously, a series of GST-RB fusion proteins, expressed from pGEX-2T vectors, and non-covalently bound to glutathione sepharose, were used to search for cellular proteins capable of interacting, directly or indirectly, with the pRB T/ElA/E7-binding domain (Kaelin, W.G. et al. , Cell 64:521-532 (1991)). At least 7 cellular proteins were identified which bound to GST-RB fusion proteins having an intact T/E1A/E7-binding domain. These cellular proteins failed to bind to GST-RB fusion proteins (with no HMK sequence) derived from spontaneously occurring, loss of function RB mutations. In addition, the binding of these cellular proteins to GST-RB fusion proteins was blocked by a synthetic peptide corresponding to the pRB-binding/transforming sequence found in SV40 T antigen. In contrast, a point mutant derivative of this peptide, corresponding to the sequence found in the transformation defective T mutant Kl, d d not inhibit binding.
It appeared that one or more of these proteins may fulfill the criteria for a meaningful pRB cellular ligand. Unfortunately, which of the cellular proteins bound directly to the pRB T/E1A/E7- binding domain could not be determined from these experiments. To demonstrate which of these cellular proteins is capable of interacting directly with RB, an
35 S-labeled WERI-Rb27 retinoblastoma cell lysate was prepared. The lysate was precleared by passage over
05 glutathione sepharose which had been loaded with pGEX-2T-encoded GST to remove proteins which may bind to GST rather than the protein of interest. GT-
RB(379-792) fusion protein was loaded onto the glutathione sepharose beads, and the precleared
10 lysate was incubated with the bound GT-RB(379-792) fusion protein in the presence (for strip 1) or absence (for strips 2-4) of wild-type E7 peptide
(residues 16-32) . Unbound protein was removed by washing and the proteins were eluted by boiling in
15 sample buffer.
Bound proteins (directly and indirectly bound to the RB-bound beads) were loaded in wide wells, resolved by SDS-polyacrylamide gel electrophoresis, and transferred to nitrocellulose. The filter was
20 then cut into adjacent strips and either subjected to autoradiography (strips l and 2) or probed with
2P-GTK-RB(379-792) in the presence (strip 3) or absence (strip 4) of the above-mentioned E7 peptide.
Strips 3 and 4 were then washed, dried, placed under
9". saran wrap (which greatly reduces the 35S signal) , and placed under film.
At least two bands appeared to be capable of interacting directly with the 32P-GTK-RB(379-792) probe in a peptide inhibitable manner. These bands
30 were not detected by 32P-GTK-RB(379-792) on strips in which the lysate was incubated with GT-RB(379-792) in
BSTITUTESHEE the presence of wild-type E7 peptide prior to electrophoresis, indicating specificity of binding.
Furthermore, the ability of these two cellular proteins to interact with 32P-GTK-RB(379-792) in filter binding assay was not inhibited by a point mmuuttaanntt ((GGlluu„ _„ ___oo ttoo GGiinn,_,_.t.ibo,.,.)) version of the E7 peptide defective in RB binding
Screening Whole-cell Lysates for Specifically Interacting Proteins An unlabeled whole cell lysate prepared from
WERI-rb27 cells was also probed for cellular proteins capable of binding directly to the RB. The unlabeled extract was subjected to SDS-polyacrylamide gel electrophoresis and transferred to nitrocellulose for Western blotting. The filter was cut into strips which were probed with selected 32P-labeled fusion proteins. Again, there appeared to be at least 2 cellular proteins capable of interacting with
32 P-GTK-RB fusion proteins in which the T/ElA-bmdmg region was intact (32P-GTK-RB(379-792) and 32P-GTK-RB(379-928)pm706) fusion proteins).
Furthermore, the binding of these cellular proteins to wild-type 32P-GTK-RB(379-792) fusion protein could be selectively inhibited by three different RB-binding peptides. Peptide replicas of the RB-binding sequences found in T antigen (amino acids 102-115) , EIA (tyrosine followed by EIA residues 115-132) , or E7 (amino acids 16-32) each inhibited binding of the labeled probe to the two cellular proteins. In contrast, point mutant derivatives of these peptides failed to inhibit the interaction of the probe with the filter-bound cellular proteins. The particular mutant peptides tested included the T antigen peptide with a Glu1Q7 to Lys change, the EIA peptide with a Cys _5 to Gly change, and the E7 peptide with a Glu 6 to Gin change. Similar results were obtained when this assay was performed with a human Burkitt's lymphoma cell line (Akata) as well as with normal peripheral blood lymphocytes.
Use of Labeled GST-fusion Proteins as Probes for the Isolation of Genes
In order to isolate cDNAs encoding cellular proteins capable of interacting specifically, and directly with RB, an Akata λgtll expression library was screened with the 32P-GTK-RB(379-792) probe as described in the protocols above. Six λgtll clones
(clones 1, 3, 4, 5, 6 and 9) encoding /3-galactosidase
32 fusion proteins which bound to P-GTK-RB(379-792) with high affinity were plaque purified and subjected to further analysis.
The binding of the fusion proteins encoded by 5 of these clones (clones 1, 4, 5, 6 and 9) to the
32 P-GTK-RB(379-792) probe was markedly reduced or undetectable in the presence of the wild-type E7 peptide, while the mutant E7 peptide had no effect on binding. Furthermore, the fusion proteins encoded by all of the clones bound readily to
32P-GTK-RB(379-928) , whereas binding to 32P_GTK-RB(379-928;706F) , a protein with a mutation
SUBSTITUTESHEET in the T/E7/E1A binding region of RB, was undetectable. Thus, binding to these clones displayed RB binding behavior similar to cellular proteins which interact with RB.
Cross-hybridization experiments, and subsequent sequence analysis, demonstrated that 4 of the clones (Clones l, 3, 4 and 6) contained overlapping cDNA fragments derived from a common mRNA. The gene encoding this mRNA will be referred to as RBAP1.
Sequence analysis of clone 5 predicted that it encoded a /9-galactosidase leader polypeptide fused to the sequence His-Ser-Phe-Leu-Leu-Cys-Asp-Glu-Asn-Val- Leu-Asp-Stop. Thus, this fusion protein contains the Leu-X-Cys-X-Glu motif common to the viral RB-binding motifs. Additional cDNA clones related to clone 5 were obtained by screening a 293 cell library with the clone 5 insert cDNA. Sequence analysis of these clones suggested that the short open reading frame in clone 5 was generated by the juxtaposition of a normally untranslated cDNA segment with the λgtll £-galactosidase coding sequence. Thus, while the clone may not represent a cellular protein, upon expression in λgtll, it encodes a protein with properties (e.g. , a Leu-X-Cys-X-Glu motif) common with viral RB-binding motifs.
Clone 9, unlike clone 5, contained a long open reading frame, consistent with the possibility that it encodes an RB-interacting protein. Additional Characterization of RBAPl
The retinoblastoma gene product is believed to serve as a cell cycle regulatory element. In particular, pRB is thought to contribute to the regulation of cell cycle progression as cells traverse the Gl/S boundary. Consistent with a possible role as an RB-interacting protein, RBAPl message levels appear to respond to cell cycle events. A Northern blot was prepared of total RNA obtained from resting peripheral blood lymphocytes (PBL) and from PBL at various time points following stimulation with a cocktail containing PMA, PHA, and a calcium ionophore. The Northern blot was probed with an RBAPl probe from clone 4. A single message of about 3.5 kb became detectable 24-36 hours after stimulation.
In a similar experiment, PBL were stimulated to enter the cell division cycle in the presence of hydroxyurea (HU) . Again, a single 3.5 kb mRNA was detected with the RBAPl probe. The abundance of the message in RNA from HU-treated PBLs increased to a maximum at 36 hours, and then appeared to plateau. Furthermore, the level of the mRNA in cells detected by the RBAPl probe fell dramatically within 8 hours after removal of HU.
In vitro Binding of RBAPl to pRB
The RBAPl cDNA was cloned into pGEX-2T and expressed in E. coli as a GST-fusion protein. Glutathione sepharose beads were loaded with the
SUBSTITUTESHEET GST-RBAP1 fusion. S-labeled RB proteins were prepared by in vitro transcription and translation from cDNA clones as described (Kaelin, W.G. et al. , Mol. Cell. Biol. 10(7) : 3761-3769 (1990)). The 5 labeled RB proteins, RB(379-928) and
RB(379-928;pm706) were incubated with the bound GST-RBAP1 fusion protein. The beads were washed and analyzed for the amount of fusion protein retained. Quantitative recovery of the in vitro translated ° 32P-GTK-RB(379-928) protein, but not of the
32 P-GTK-RB(379-928;pm706) fusion protein was observed. This experiment further suggests that the
RBAPl product is capable of interacting directly with the RB gene product.
5 Expression of pl07 as a GTK-fusion Protein
The RB-related protein pl07 was also expressed as a GTK-fusion protein. A cDNA encoding a region of pl07 which is homologous to the T/E1A/E7 binding domain of the RB gene product was inserted into 0 pGEX-2TK. Using the protocols described in the above examples, a GTK-pl07 fusion protein was expressed in ■__- ____L___L' a lYsate was prepared, the fusion protein present in the lysate was captured on a glutathione-sepharose affinity column, and labeled _ with [32P] in situ m a phosphorylation reaction using the catalytic subunit of cAMP-dependent protein kinase and [T-32P]ATP. The [32P]-labeled GTK-pl07 fusion protein was eluted from the column, and was used as probe in Western blot analysis of whole cell extracts. Whole cell extracts (unlabeled) containing E1A were prepared from 293 cells, subjected to SDS-polyacrylamide gel electrophoresis, and electro- phoretically transferred to filters. Filters were cut into strips and probed with (1) anti-ElA antibody, (2) [32P]-labeled GTK-pl07 fusion protein, (3) [ P]-labeled GTK-pl07 fusion protein in the presence of competing wild-type E7 peptide (residues 16-32), and (4) [32P]-labeled GTK-pl07 fusion protein in the presence of mutant E7 peptide (residues 16-32, with a Glu to Gin mutation at residue 26) .
Autor 'adiography indicated that the [32P]-labeled
GTK-pl07 fusion protein, alone or in the presence of mutant E7 peptide, was able to detect the EIA product on the blots. The specificity of interaction was indicated by the observation that the [ 32P]-labeled
GTK-pl07 fusion protein did not detect the EIA product in the presence of competing wild-type E7 peptide.
Example 5
Construction of pAR(ΔRI)59/60 and Properties of pFEK-fusion Proteins
Construction of pAR(ΔRI)59/60
Plasmid pAR3040 (also referred to as pET3a) was the starting material (Studier, Meth. Enzymol .185: 60-89 (1990)). This plasmid and expression plasmids derived from this vector can be maintained and induced for expression as described by Studier (Studier, Meth. Enzymol .185: 60-89 (1990)). Plasmid
STITUTESHEET PAR3040 was cut with EcoRI. The overhangs were made blunt by "filling in" using Klenow enzyme and dNTPs, and religa ed to destroy the EcoRI site. In an infrequent event, the EcoRI site was regenerated on propagation of the plasmid. Therefore, another version of of this intermediate was constructed by treating with mung bean nuclease to remove the EcoRI overhangs prior to religation. Regeneration of the EcoRI site in the latter version was not observed. Both versions of the intermediate construction were modified as described below, to make two slightly different pAR(ΔRI)59-60 vectors (i.e., differing in the manner in which the EcoRI site was destroyed) . These vectors behave identically with respect to expression of encoded fusion proteins. The intermediate vectors were then cleaved with Ndel and treated with calf intestinal alkaline phosphatase (CIP) .
Two complementary oligonucleotide adaptors, MAB 59 and MAB 60, were synthesized, kinased, and annealed. The adaptors were then ligated into the Ndel cleaved, CIP-treated vectors. The adaptors introduce the 17 amino acid sequence comprising the FLAG peptide and HMK site shown in Figure 2. The sequence of the adaptors is shown below:
Met Asp Tyr Lys Asp Asp Asp Asp Lys Ala Arg Arg-
5'-T ATG GAC TAC AAA GAC GAT GAC GAT AAA GCA AGA AGA- AC CTG ATG TTT CTG CTA CTG CTA TTT CGT TCT TCT-
Ala Ser Val Gin Phe-
GCA TCT GTG GAA TTC CA
CGT AGA CAC CTT AAG GT AT-5'
Plasmids having one insert of the double-stranded adaptor were identified by restriction analysis of miniprep DNA. An Xbal-Ba HI double digest releases a fragment comprising the inserted sequence. The structure of resulting plasmids, named pAR(ΔRI)59/60, was confirmed by dideoxy sequencing using oligonucleotide primers MAB 58 (5'-GCAGCCAACTCAGCTTC-3') and MAB 69 (5'-TTAATACGACTCACTAT-3') .
Construction of Derivatives of pAR(ΔRI)59/60
Four derivatives of pAR(ΔRI) 59/60 were prepared.
SUBSTITUTESHEET The first derivative encoding a selected polypeptide was prepared by the insertion of a 1.5 kb EcoRI fragment of a shPan-1 (N3) cDNA into pAR(ΔRI)59/60, which had been cleaved with EcoRI and treated with CIP. The shPan-1 (N3) cDNA encodes a DNA binding protein, as described by German et al. (German et al., Molec. Endocrinol. 5_: 292-299 (1991)).
Plasmids containing the insert were detected by restriction analysis of miniprep DNA using a BamHI-StuI double digest. The correct orientation yielded a diagnostic 500 bp fragment. The sequence at the junctions between the vector and insert was verified by sequencing, using oligonucleotide primers MAB 58 and MAB 69 (see above) . The plasmid is referred to herein as pFEK-shPan-1.
The second derivative of pAR(ΔRI)59/60 was made by the insertion of N3-SH, a fragment of shPan-1 generated by PCR (German et al. , Molec. Endocrinol. 5_: 292-299 (1991)). For cloning, the fragment was generated by PCR, using oligonucleotide primers MAB 72 (5'-GGCCGAATTCTCCTGGTCCCACGGAGACCC-3') and MAB 73 (5'-GGCCGAATTCGCCGAGGAGGACAAGAAGGACC-3') . The PCR products were purified on an agarose gel, electroeluted, treated with T4 DNA polymerase, and digested with EcoRI. The resulting fragment was ligated to EcoRI-cut, CIP-treated pAR(ΔRI)59/60. Constructs with single inserts were identified by Xbal-BamHI double digests of miniprep DNA. The junctions and sequence of the insert were verified by dideoxy sequencing using oligonucleotides MAB 58 and MAB 69 (see above) . The resulting construct is referred to herein as pFEK-N3-SH. The third derivative of pAR(ΔRI)59/60 was constructed by the insertion of a DNA sequence encoding amino acids 120 through 206 of the rat c-fos protein (Kouzarides, T. and E. Ziff , Nature 336: 646-651 (1988); Curran, T. et al. , Oncogene 2: 79-84 (1987); Nakabeppu, Y. and D. Nathans, EMBO J. : 3833-3841 (1989)). The specific fragment used for cloning was generated by the polymerase chain reaction (PCR) , using oligonucletide primers MAB 70 (5'-GGCCGAATTCGCGCAGAGCATCGGCAGAAG-3') and MAB 71 (5'-GGCCGAATTCCTACTAGATCTTGCAGGCAGGTCGGT-3') . The resulting PCR products were purified on an agarose gel, electroeluted, treated with T4 DNA polymerase, and digested with EcoRI. The resulting fragment was ligated into EcoRI-cut, CIP-treated pAR(ΔRI)59/60. Isolates with single inserts were identified by Xbal and BamHI double digests. The junctions and insert sequence were verified by dideoxy sequencing using the oligonucleotide primers MAB 58 and MAB69 (see above) . The resulting plasmid is referred to herein as pFEK-c-fos(120-206) .
The fourth derivative of pAR(ΔRI)59/60 was derived by the insertion of amino acids encoding residues 206 through 340 of the human c-jun protein (Kouzarides, T. and E. Ziff, Nature 336: 646-651 (1988) ; Bohmann, D. et al. , Science 238: 1386-1392 (1987)). The fragment for cloning was generated by PCR using oligonucleotide primers MAB 74 (5'-GGCCGAATTCTTTCCCGCGCAACCCCAGCA-3') and MAB 75 (5'-GGCCGAATTCCCGACGGTCTCTCTTCAAA-3') . The PCR products were purified on an agarose gel,
SUBSTITUTESHEET electroeluted, treated with T4 DNA polymerase, and digested with EcoRI. The resulting fragment was ligated to EcoRI-cut, CIP-treated pAR(ΔRI)59/60.
Constructs having a single insert were again identified by Xbal-BamHI digestion of mini-prep DNA. The junctions and insert sequences were confirmed by dideoxy seuqencing using primers MAB 58 and MAB 69 (see above) . The resulting plasmid is referred to herein as pFEK-c-jun(206-340) .
pFEK-fusion Proteins Retain Biological Activity and Are Efficiently Phosphorylated
As each of the selected polypeptides expressed from pAR(ΔRI)59/60 is a DNA binding protein, the activity of the selected polypeptide portions, in the context of a fusion protein comprising an affinity ligand and modification site, were assayed by electrophoretic mobility shift assays (EMSA) . Assay conditions for the FEK-shPAN-1 fusion protein encoded by pFEK-shPan-1 and by pFEK-N3-SH, were as described by German et al. (German et al. , Molee. Endocrino1. 5_: 292-299 (1991)). EMSA conditions for the fusion protein encoded by pFEK-c-fos(120-206) were as described (Kouzarides, T. and E. Ziff, Nature 336: 646-651 (1988) ; Curran, T. et al. , Oncogene 2_: 79-84 (1987) ; Nakabeppu, Y. and D. Nathans, EMBO J. Q 3833-3841 (1989)) .
EMSA conditions for the fusion protein encoded by pFEK-c-jun(206-340) were as described (Kouzarides, T. and E. Ziff, Nature 336: 646-651 (1988); Bohmann, D. et al., Science 238: 1386-1392 (1987)). The DNA binding activity of the FEK-c-jun(206-340) fusion protein present in bacterial extracts, was assayed in combination with bacterial extracts containing fos "core" (to assay binding of jun as a jun-fos heterodi er) .
The four unlabeled fusion proteins were made by in vitro transcription/translation. The particular DNA probes used in each shift assay were radiolabeled for detection of DNA-fusion protein complexes. As a positive control, each of the native (i.e., not fused) DNA binding proteins was prepared by in vitro transcription/translation and subjected to the shift assay. The short polypeptide produced by pAR(ΔRI)59/60, and the transcription/translation products from a vector control for each pFEK-construct in which the selected polypeptide insert was in a reverse orientation, provided negative controls for DNA binding. As indicated by the shift assay, all four pFEK-fusion proteins bound DNA to an extent comparable to the corresponding native protein. These results indicate that the incorporation of the selected polypeptides (fos, jun, shPan-1 (N3) , and N3-SH) into fusion proteins of the present invention comprising a modification recognition site, did not alter the biological activity (DNA binding) of these proteins.
In parallel to the experiment just described, in vitro transcribed and translated fusion proteins were modified by phosphorylation with HMK kinase and non-radioactive ATP (see below for conditions) . Modification by phosphorylation could be
SUBSTITUTESHEET distinguished because the negative charge of the additional phosphate group on the modified fusion protein led to an alteration in migration of the DNA-fusion protein complex as compared with the unmodified fusion protein-DNA complex. Modification of all four of the fusion proteins did not alter the extent of complex formation, indicating that the biological activity of the selected polypeptide portion is not altered by modification.
Induction of Protein Expression in BL21 Bacteria Induction of expression from pAR(ΔRI)59/60 and pFEK- plasmids was done essentially as described by Studier et al. (Studier, Meth. Enzymol. 185: 60-89 (1990)). Briefly, plasmids were transformed into competent BL21 or BL21 pLysS bacteria. A single colony was inoculated into Luria broth with ampicillin at 50 ug/ml and with chloramphenicol at 25 ug/ l. Growth at 37°C was allowed to proceed until an optical density (A O,nUnU) of approximately 1.0 was attained. IPTG was added as described by Studier
(Meth. Enzymol. 185: 60-89 (1990)). Cells were grown for another 3 hours or so at 37°C. A bacterial lysate was prepared using standard techniques. Following expression in E. coli, the unfractionated bacterial extract from transformants carrying expression vector pFEK-shPan-1, pFEK-N3-SH, pFEK-c-fos(120-206) , or pFEK-c-jun(206-3 0) was assayed by electrophoretic mobility shift assay as in the previous section. The results indicated that the unmodified or modified (by phosphorylation with unlabeled ATP) FEK-fusion proteins expressed in bacterial extracts retained the biological activity of the selected polypeptide portion.
Partial Purification of Proteins Produced: Following expression in E. coli, bacterial extracts from cells transformed with a pFEK- vector were fractionated to obtain partially purified FEK-fusion proteins by one of two methods.
A) DEAE-Sephacel chromatography:
-used as per directions of supplier
(Pharmacia-PL) -buffer contained 100 mM NaCl and 10% glycerol
(in addition to the buffer constituents presents in the bacterial lysates) -treated in "batch" for 60 minutes at +4°C -lysate was incubated with resin with continuous gentle agitation -supernatant solution passed over a 2 ml column of resin -flow-through collected
The fusion proteins encoded by vectors pFEK-shPan-1 and pFEK-N3-SH were obtained using DEAE-Sephacel chromatography at -50% purity.
B) Heparin-Sepharose chromatography: -used as per directions of supplier (Pharmacia-PL)
SUBSTITUTESHEET -DEAE flow-through fraction from protocol (A) was loaded onto a 1 ml bed volume column of resin -resin washed with 10 column volumes of load buffer (i.e., 100 mM salt)
-proteins were eluted with l column volume each
Of 200, 300, 400, 500, 600, 700, 800, 900 and
1000 mM salt -fractions were tested for the presence of active protein by the appropriate assay
For example, in the case of a fusion protein comprising the fos 'core' (encoded by pFEK-c-fos(120-206) ) , activity of fractions was assayed by electrophoretic mobility shift assay together with reticulocyte-produced c-jun protein. Peak activity for the fusion protein comprising the fos 'core' fusion protein was eluted at approximately 500-600 mM salt yielding a preparation with -50% purity.
Phosphorylation of Proteins with HMK:
All four fusion proteins produced by the pFEK- vectors (pFEK-shPan-1, pFEK-N3-SH, pFEK-c-fos(120-206) , and pFEK-c-jun(206-340) ) were capable of being modified by phosphorylation when present in crude bacterial extracts. As discussed above, modification by phosphorylation with unlabeled ATP was observed. In addition, modification using radioactively-labeled ATP was observed. Proteins which were partially purified by the methods (A and B) described above were also efficiently radiolabeled by phosphorylation using HMK enzyme and [7- 32P]ATP. Labeli.ng to a speci.fi.c activity of approximately 10 7 to 108 cpm per μg of protein was observed for partially purified
FEK-fusion proteins encoded by pFEK-shPan-1,
PFEK-N3-SH, and pFEK-c-fos(120-206) . The protocols used for protein labeling are described below.
Phosphorylation with Heart Muscle Kinase:
Sigma # P-2645 Protein Kinase, Catalytic subunit from bovine heart was obtained as lyophilized powder in 250 unit vials. Typically, 250 units of HMK enzyme was resuspended in 25 μl of 40 mM DTT (i.e., at 10 u/μl) . The solution was allowed to stand at room te perative for 10 minutes and was stored at +4 °C. Activity was stable for 2-3 days, but HMK was usually freshly reconstituted for each use.
A 10X preparation of HMK Buffer was prepared (200 mM Tris-Cl (pH 7.5), 10 mM DTT, 1 M NaCl, and 120 mM MgCl_) . The phosphorylation reaction mixture was as follows:
3 μl (lOx) HMK buffer
2-5 μl of [7 32P] (e.g., NEN-Dupont #NEG-035C [7 32P]-ATP >7000Ci/mmol)
1-10 ul of protein extract (amount will vary with the particular protein)
Figure imgf000075_0001
water to a total of 30 μl
STITUTESHEET The reaction mixture was incubated at 37 °C for 30-60 minutes, and was stored on ice until use. Some proteins were labeled almost as well when the incubation was carried out on ice. The low temperature incubation can serve to preserve the biological activity of the modified fusion protein. These procedures can be scaled up as required.
G50 column chromatography to Remove Unincorporated Label
FEK-fusion proteins were used for Western blotting or for plaque screening. For these applications, the fusion protein was run a G50 column as described below in order to remove unincorporated label following the HMK phosphorylation reaction.
"Z + 0.1 M KC1" buffer (25 mM Hepes-KOH ([pH 7.7], 12.5 mM MgCl2, 20% glycerol, 100 mM KC1) was prepared. For solutions comprising Z + KC1, and BSA, the solution was filtered through a 0.2 μm filter prior to use. DTT was added to a final concentration of l mM just prior to use.
—G50 fine (pre-swollen in water) was washed (IX) with 5 ml of (Z + 0.1 M KC1 + 3 mg/ml BSA + l mM DTT) and resuspended at [1:1] in the same buffer
-the beads were rotated at RT for -1 hour, and washed three times with (Z + 0.1 M KC1 + l mg/ml BSA + 1 mM DTT)
-a 1 ml sterile plastic pipette was packed with the washed G5'0 (bed vol -1.2 ml) and equilibrated at room temperature with -50-10 ml of (Z + 0.1 M
KC1 + 1 mg/ml BSA + 1 mM DTT) -immediately prior to loading the column, the total volume of the HMK reaction was brought to
100 μl with ice-cold (Z + o.l M KC1 + 1 mM DTT) -the column was loaded and run at room temperature, taking 1 drop fractions (-45 μl/drop) -fractions were stored immediately on ice -2-5 μl aliquots were removed from each fraction and counted by Cerenkov counting -the excluded peak fractions (usually at 1/3 to 1/2 the column volume) were pooled
Optionally aliquots of the fractions (e.g., 1.0 μl) can be fractionated on SDS-polyacrylamide gels to monitor the progress of the fusion protein. The above procedure can also be carried out at 4° C where desired.
Equivalents
Those skilled in the art will be able to recognize, or be able to ascertain, using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be encompassed by the following claims.
SUBSTITUTESHEET

Claims

1. An expression vector comprising a nucleotide sequence which encodes at least one affinity ligand, at least one modification recognition sequence and at least one restriction site for the in frame insertion of a nucleotide sequence encoding a selected polypeptide.
2. The expression vector of Claim 1 wherein the affinity ligand is a glutathione-S-transferase affinity ligand or a FLAG epitope affinity ligand, and the modification recognition sequence is a phosphorylation site.
3. The expression vector of Claim l further comprising a nucleotide sequence which encodes a cleavable linker located between the affinity ligand and the restriction site for insertion of a selected polypeptide.
4. The expression vector of Claim 1, wherein the affinity ligand is a glutathione-S-transferase affinity ligand or a FLAG epitope affinity ligand, the modification recognition sequence is a phosphorylation site, and the cleavable linker is selected from the group consisting of a thrombin cleavage site. Factor Xa cleavage site, and enterokinase cleavage site.
5. The expression vector of Claim 4, wherein the vector is pGEX-2TK.
6. The expression vector of Claim 4, wherein the vector is pAR(ΔRI)59/60.
7. An expression vector comprising a nucleotide sequence which encodes a fusion protein comprising an affinity ligand, a modification recognition sequence and a selected polypeptide.
8. The expression vector of Claim 7, wherein the affinity ligand is a glutathione-S-transferase affinity ligand or a FLAG epitope affinity ligand, and the modification recognition sequence is a phosphorylation site.
9. The expression vector of Claim 7 wherein the nucleotide sequence further encodes a cleavable linker located between the affinity ligand and the selected polypeptide.
10. The expression vector of Claim 9, wherein the affinity ligand is glutathione-S-transferase affinity ligand or a FLAG epitope affinity ligand, the modification recognition sequence is a phosphorylation site, and the cleavable linker is selected from the group consisting of a thrombin cleavage site, factor Xa cleavage site and enterokinase cleavage site.
SUBSTITUTESHEET
11. A method of producing a labeled fusion protein comprising: a) expressing a fusion protein comprising an affinity ligand, a phosphorylation recognition sequence and a polypeptide in a host cell using an expression vector; b) lysing the host cells to obtain a lysate containing the fusion protein; c) contacting the lysate with an affinity matrix under conditions sufficient to bind the affinity ligand portion of the fusion protein to the affinity matrix; d) washing the product of step (c) to remove unbound material; and e) contacting the bound fusion protein with
[7-labeled]ATP and a kinase reaction buffer comprising a protein kinase under conditions appropriate for phosphorylation of the bound fusion protein to thereby produce a labeled fusion protein which is bound to the affinity matrix.
12. The method of Claim 11 further comprising the steps of: f) optionally stopping the reaction of step (e) by addition of a stop buffer; g) washing the affinity matrix with labeled fusion protein bound thereto to remove unincorporated label; and h) eluting the labeled fusion protein with a suitable elution buffer comprising a release component thereby obtaining a labeled fusion protein.
13. The method of Claim 11 further comprising the steps of: f) optionally stopping the reaction of step (e) by addition of a stop buffer; g) washing the bound labeled fusion protein to remove unincorporated label; and h) cleaving the labeled fusion protein with a site specific protease to produce a labeled selected polypeptide portion.
14. A method of producing a labeled fusion protein comprising: a) expressing a fusion protein comprising a GST affinity ligand, a phosphorylation recognition sequence and a polypeptide in a host cell using an expression vector; b) lysing the host cells to obtain a lysate containing the fusion protein; c) contacting the lysate with immobilized glutathione under conditions sufficient to bind the GST portion of the fusion protein to bind to glutathione, thereby obtaining immobilized glutathione with the bound fusion protein;
SUBSTITUTESHEET d) washing the immobilized glutathione with bound fusion protein to remove unbound material; and e) contacting the bound fusion protein with [γ-labeled]ATP and a kinase reaction buffer comprising a protein kinase under conditions appropriate for phosphorylation of the bound fusion protein to thereby produce immobilized glutathione with labeled fusion protein which is bound to immobilized glutathione.
15. The method of Claim 14 wherein the protein kinase is the catalytic subunit of a cAMP-dependent protein kinase.
16. The method of Claim 14 wherein the [7-labeled]ATP is [γ-32P]ATP.
17. The method of Claim 14 further comprising the steps of: f) optionally stopping the reaction of step (e) by addition of a stop buffer; g) washing the immobilized glutathione with bound labeled fusion protein to remove unincorporated label; and h) eluting the bound labeled fusion protein with a solution comprising reduced glutathione thereby obtaining a labeled fusion protein.
18. The method of Claim 14 further comprising the steps of: f) optionally stopping the reaction of step (e) by addition of a stop buffer; g) washing the immobilized glutathione with bound labeled fusion protein to remove unincorporated label; and h) cleaving the labeled fusion protein with thrombin to produce a labeled selected polypeptide portion.
19. A kit for preparing a radiolabeled glutathione-S-transferase (GST) fusion protein comprising: a) an immobilized glutathione affinity matrix; b) a wash buffer which permits binding of a
GST-fusion protein to the affinity matrix; c) a kinase reaction buffer; d) a protein kinase preparation; e) an elution buffer comprising reduced glutathione for releasing the fusion protein from the affinity matrix; and optionally f) a reaction stop buffer.
20. A labeled fusion protein comprising an affinity ligand, a modification recognition sequence, and a polypeptide.
SUBSTITUTESHEET
21. The labeled fusion protein of Claim 20, wherein the affinity ligand is a glutathione-S-transferase affinity ligand and the modification recognition sequence is a phosphorylation recognition sequence.
22. The labeled fusion protein of Claim 20, wherein the affinity ligand is the FLAG epitope, and the modification recognition sequence is a phosphorylation recognition sequence.
23. A host cell containing the expression vector of Claim 1.
PCT/US1992/006187 1991-07-29 1992-07-24 Plasmids for the rapid preparation of modified proteins WO1993003157A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US73684791A 1991-07-29 1991-07-29
US736,847 1991-07-29

Publications (1)

Publication Number Publication Date
WO1993003157A1 true WO1993003157A1 (en) 1993-02-18

Family

ID=24961551

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1992/006187 WO1993003157A1 (en) 1991-07-29 1992-07-24 Plasmids for the rapid preparation of modified proteins

Country Status (1)

Country Link
WO (1) WO1993003157A1 (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1996008570A1 (en) * 1994-09-14 1996-03-21 Fuji Immunopharmaceuticals Corporation Expression and export technology of proteins as immunofusins
EP0723402A1 (en) * 1993-08-30 1996-07-31 Baylor College Of Medicine Senescent cell-derived inhibitors of dna synthesis
WO1999001554A1 (en) * 1997-07-03 1999-01-14 Zymogenetics, Inc. Mammalian secretory peptide-9
WO1999016870A1 (en) * 1997-09-29 1999-04-08 Zymogenetics, Inc. Secreted protein zsig-11
WO1999057312A1 (en) * 1998-04-30 1999-11-11 MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. New method for the selection of clones of an expression library involving rearraying
WO2000031115A1 (en) * 1998-11-20 2000-06-02 Commonwealth Scientific And Industrial Research Organisation Methods of identifying antigen gene sequences
WO2001098366A2 (en) * 2000-06-19 2001-12-27 Dyax Corp. Enterokinase cleavage sequences and their use
WO2003002611A2 (en) * 2001-06-29 2003-01-09 Biaffin Gmbh & Co Kg Fusion proteins, method for the production and use thereof
EP1323823A2 (en) * 1997-07-03 2003-07-02 Zymogenetics Inc Mammalian secretory peptide 9, antibodes against it and their use
US6617135B1 (en) 1999-08-09 2003-09-09 Emd Lexigen Research Center Corp. Multiple cytokine protein complexes
US7262255B2 (en) 2001-03-30 2007-08-28 Ashland-Sudchemie-Kernfest Gmbh Polyester resin emulsion and two-component aqueous paint
US7432357B2 (en) 2004-01-22 2008-10-07 Merck Patent Gmbh Anti-cancer antibodies with reduced complement fixation
US8691952B2 (en) 2005-12-30 2014-04-08 Merck Patent Gmbh Anti-CD19 antibodies with reduced immunogenicity
US8907066B2 (en) 2009-04-22 2014-12-09 Merck Patent Gmbh Antibody fusion proteins with a modified FcRn binding site
US8926973B2 (en) 2001-03-30 2015-01-06 Merck Patent Gmbh Reducing the immunogenicity of fusion proteins
US9029330B2 (en) 2005-12-30 2015-05-12 Merck Patent Gmbh Methods of treating cancer using interleukin-12p40 variants having improved stability

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0293249A1 (en) * 1987-05-28 1988-11-30 Amrad Corporation Limited Novel fusion protein

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0293249A1 (en) * 1987-05-28 1988-11-30 Amrad Corporation Limited Novel fusion protein

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
BIO/TECHNOLOGY vol. 6, no. 10, October 1988, NATURE AMERICA, INC., NEW YORK, US pages 1204 - 1210 T.P.HOPP ET AL. 'A short polypeptide marker sequence useful for recombinant protein identification and purification' *
BIOCHEMISTRY vol. 29, no. 39, 2 October 1990, AM. CHEM. SOC., EASTON, PA, US; pages 9274 - 9281 N.P. GERARD AND C. GERARD 'Construction and expression of a novel recombinant anaphylatoxin, c5a-N19, as a probe for the human C5a receptor' *
PROC. NATL. ACAD. SCI. vol. 86, no. 2, January 1989, NATL ACAD. SCI., WASHINGTON, DC, US; pages 558 - 562 B.-L. LI ET AL. 'Creation of phosphorylation sites in proteins: Construction of a phosphorylatable human interferon alpha' cited in the application *

Cited By (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0723402A1 (en) * 1993-08-30 1996-07-31 Baylor College Of Medicine Senescent cell-derived inhibitors of dna synthesis
EP0723402A4 (en) * 1993-08-30 1997-12-17 Baylor College Medicine Senescent cell-derived inhibitors of dna synthesis
WO1996008570A1 (en) * 1994-09-14 1996-03-21 Fuji Immunopharmaceuticals Corporation Expression and export technology of proteins as immunofusins
WO1999001554A1 (en) * 1997-07-03 1999-01-14 Zymogenetics, Inc. Mammalian secretory peptide-9
EP1323823A3 (en) * 1997-07-03 2003-11-19 Zymogenetics Inc Mammalian secretory peptide 9, antibodes against it and their use
EP1323823A2 (en) * 1997-07-03 2003-07-02 Zymogenetics Inc Mammalian secretory peptide 9, antibodes against it and their use
WO1999016870A1 (en) * 1997-09-29 1999-04-08 Zymogenetics, Inc. Secreted protein zsig-11
WO1999057312A1 (en) * 1998-04-30 1999-11-11 MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. New method for the selection of clones of an expression library involving rearraying
WO2000031115A1 (en) * 1998-11-20 2000-06-02 Commonwealth Scientific And Industrial Research Organisation Methods of identifying antigen gene sequences
US6759516B2 (en) 1998-11-20 2004-07-06 Commonwealth Scientific And Industrial Research Organisation Methods of identifying antigen gene sequences
US6617135B1 (en) 1999-08-09 2003-09-09 Emd Lexigen Research Center Corp. Multiple cytokine protein complexes
JP2004503219A (en) * 2000-06-19 2004-02-05 ダイアックス コーポレーション Novel enterokinase cleavage sequence
WO2001098366A3 (en) * 2000-06-19 2003-04-17 Dyax Corp Enterokinase cleavage sequences and their use
WO2001098366A2 (en) * 2000-06-19 2001-12-27 Dyax Corp. Enterokinase cleavage sequences and their use
US6906176B2 (en) 2000-06-19 2005-06-14 Dyax Corp. Enterokinase cleavage sequences
US7262255B2 (en) 2001-03-30 2007-08-28 Ashland-Sudchemie-Kernfest Gmbh Polyester resin emulsion and two-component aqueous paint
US8926973B2 (en) 2001-03-30 2015-01-06 Merck Patent Gmbh Reducing the immunogenicity of fusion proteins
WO2003002611A3 (en) * 2001-06-29 2003-11-06 Biaffin Gmbh & Co Kg Fusion proteins, method for the production and use thereof
WO2003002611A2 (en) * 2001-06-29 2003-01-09 Biaffin Gmbh & Co Kg Fusion proteins, method for the production and use thereof
US8835606B2 (en) 2004-01-22 2014-09-16 Merck Patent Gmbh Anti-cancer antibodies with reduced complement fixation
US7432357B2 (en) 2004-01-22 2008-10-07 Merck Patent Gmbh Anti-cancer antibodies with reduced complement fixation
US9617349B2 (en) 2004-01-22 2017-04-11 Merck Patent Gmbh Anti-cancer antibodies with reduced complement fixation
US10017579B2 (en) 2004-01-22 2018-07-10 Meck Patent Gmbh Anti-cancer antibodies with reduced complement fixation
US10633452B2 (en) 2004-01-22 2020-04-28 Merck Patent Gmbh Anti-cancer antibodies with reduced complement fixation
US8691952B2 (en) 2005-12-30 2014-04-08 Merck Patent Gmbh Anti-CD19 antibodies with reduced immunogenicity
US8957195B2 (en) 2005-12-30 2015-02-17 Merck Patent Gmbh Anti-CD19 antibodies with reduced immunogenicity
US9029330B2 (en) 2005-12-30 2015-05-12 Merck Patent Gmbh Methods of treating cancer using interleukin-12p40 variants having improved stability
US10072092B2 (en) 2005-12-30 2018-09-11 Merck Patent Gmbh Methods of use of anti-CD19 antibodies with reduced immunogenicity
US11208496B2 (en) 2005-12-30 2021-12-28 Cancer Research Technology Ltd. Anti-CD19 antibodies with reduced immunogenicity
US8907066B2 (en) 2009-04-22 2014-12-09 Merck Patent Gmbh Antibody fusion proteins with a modified FcRn binding site

Similar Documents

Publication Publication Date Title
Dunn Assembly of functional bacteriophage lambda virions incorporating C-terminal peptide or protein fusions with the major tail protein
Lee et al. Hepatitis B virus X protein interacts with a probable cellular DNA repair protein
Fisher et al. A novel cyclin associates with M015/CDK7 to form the CDK-activating kinase
Hughes et al. E1 protein of human papillomavirus is a DNA helicase/ATPase
Webster et al. The adenovirus protease is activated by a virus-coded disulphide-linked peptide
Lutz et al. The product of the adenovirus intermediate gene IX is a transcriptional activator
WO1993003157A1 (en) Plasmids for the rapid preparation of modified proteins
Roberts et al. The effect of protein context on nuclear location signal function
Turk et al. Selective inhibition of amino-terminal methionine processing by TNP-470 and ovalicin in endothelial cells
Singh et al. Identification of calreticulin as a rubella virus RNA binding protein.
US4745055A (en) Fused protein for enzyme immunoassay system
Erikson et al. Protein phosphorylation mediated by partially purified avian sarcoma virus transforming-gene product
US5534410A (en) TATA-binding protein associated factors drug screens
Tong et al. Interaction between duck hepatitis B virus and a 170-kilodalton cellular protein is mediated through a neutralizing epitope of the pre-S region and occurs during viral infection
Templeton Nuclear binding of purified retinoblastoma gene product is determined by cell cycle-regulated phosphorylation
Tillmann et al. Nuclear localization and cell cycle regulation of a murine protein tyrosine phosphatase
White et al. UDP-N-acetyl-α-d-galactosamine: polypeptide N-acetylgalactosaminyltransferase: identification and separation of two distinct transferase activities
JPH05507700A (en) Compositions and methods for identifying biologically active molecules
WO2001087958A2 (en) CRYSTAL STRUCTURE OF WORM NitFhit REVEALS THAT A Nit TETRAMER BINDS TWO Fhit DIMERS
EP0222876A1 (en) Fused proteine for enzyme immunoassay system
EP0918871A2 (en) Afc1 and rce1:isoprenylated caax processing enzymes
Ho et al. An essential surface motif (WAQKW) of yeast RNA triphosphatase mediates formation of the mRNA capping enzyme complex with RNA guanylyltransferase
Weisshart et al. An N-terminal deletion mutant of simian virus 40 (SV40) large T antigen oligomerizes incorrectly on SV40 DNA but retains the ability to bind to DNA polymerase alpha and replicate SV40 DNA in vitro
GARRIGA et al. Phosphorylation site specificity of the CDC2-related kinase PITALRE
WO1995032307A1 (en) Epstein-barr virus transcription factor binding assay

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): CA JP

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB GR IT LU MC NL SE

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: CA