WO2003064656A1 - Protein tag comprising a biotinylation domain and method for increasing solubility and determining folding state - Google Patents

Publication number
WO2003064656A1
Authority
WO
WIPO (PCT)
Prior art keywords
protein
interest
bccp
nucleic acid
tag moiety
Prior art date
Application number
PCT/GB2003/000362
Other languages
French (fr)
Inventor
Mitali Samaddar
Jonathan Michael Blackburn
Darren James Hart
Michael Richard Dyson
Original Assignee
Sense Proteomic Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sense Proteomic Limited
Priority to US10/502,581 (patent US8999897B2)
Priority to EP03734757A (patent EP1470229B1)
Priority to AU2003238441A (patent AU2003238441B2)
Priority to CA2474457A (patent CA2474457C)
Priority to JP2003564248A (patent JP4377242B2)
Priority to DE60305643T (patent DE60305643T2)
Publication of WO2003064656A1

Classifications

    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00 Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/93 Ligases (6)
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/11 DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/62 DNA sequences coding for fusion proteins
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/63 Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/67 General methods for enhancing the expression
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12P FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P21/00 Preparation of peptides or proteins
    • C12P21/02 Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
    • C CHEMISTRY; METALLURGY
    • C07 ORGANIC CHEMISTRY
    • C07K PEPTIDES
    • C07K2319/00 Fusion polypeptide
    • C07K2319/20 Fusion polypeptide containing a tag with affinity for a non-protein ligand
    • C07K2319/23 Fusion polypeptide containing a tag with affinity for a non-protein ligand containing a GST-tag
    • C CHEMISTRY; METALLURGY
    • C07 ORGANIC CHEMISTRY
    • C07K PEPTIDES
    • C07K2319/00 Fusion polypeptide
    • C07K2319/60 Fusion polypeptide containing spectroscopic/fluorescent detection, e.g. green fluorescent protein [GFP]
    • C CHEMISTRY; METALLURGY
    • C07 ORGANIC CHEMISTRY
    • C07K PEPTIDES
    • C07K2319/00 Fusion polypeptide
    • C07K2319/90 Fusion polypeptide containing a motif for post-translational modification

Definitions

  • This invention relates to the use of biotin carboxyl carrier protein (BCCP) as a protein folding marker and protein solubility enhancer in the orientated surface capture of products of heterologously expressed genes.
  • BCCP biotin carboxyl carrier protein
  • Affinity tags are a convenient method of purification and immobilisation of recombinant proteins.
  • Hexahistidine tags (6 amino acids (aa); Qiagen/Roche), Escherichia coli maltose binding protein ("MBP", 300 aa; New England Biolabs) and Schistosoma japonicum glutathione-S-transferase (GST, 220 aa; Amersham Pharmacia Biotech/Novagen) are effective, but have the disadvantage that heterologous host proteins interact with the affinity matrices used for purification of fusion proteins. This results in impure protein preparations and an additional clean up step is often required.
  • Biotin can be attached chemically to proteins (e.g. using NHS-activated biotin), or via genetically fused protein domains which are biotinylated in vivo.
  • the "PinPoint™" vectors from Promega are designed to facilitate the creation of fusions to the biotinylation domain (which is a fragment of the biotin carboxyl carrier protein (BCCP) of methylmalonyl-CoA carboxyl transferase from Propionibacterium freudenreichii subsp. shermanii [US Patent 5,252,466]). This protein has 40% homology with the E. coli BCCP.
  • This system allows the production of BCCP-protein fusions capable of being biotinylated either in vivo or in vitro by biotin ligase, allowing one to use the highly specific biotin-streptavidin interaction for surface capture.
  • phage display selected short peptides capable of being biotinylated on a lysine residue have been commercialised by Avidity Inc. [US Patent 5,932,433].
  • the Inventors herein describe a novel approach whereby BCCP from E. coli is fused either N- or C-terminally to a protein partner. In addition to the function of permitting orientated immobilisation of the fusion protein to microarray-compatible surfaces derivatised with avidin, streptavidin or neutravidin, the Inventors describe new, previously unreported functions of BCCP which greatly facilitate the creation of libraries of solubly expressed folded human, mammalian, fungal, plant or microbial proteins in heterologous systems.
  • N-terminally or C-terminally fused BCCP improves levels of folding of fusion partner
  • the factors determining the solubility of recombinant proteins are poorly understood and so rational design of solubility and increased expression into recombinant proteins is only possible to a limited extent.
  • both properties can be greatly improved compared with expression of ORFs alone. Examples include MBP, GST and thioredoxin (Trx, 109 aa; Novagen).
  • a possible mechanism of action is thought to be the recruitment of chaperones to the nascent polypeptide and co-over-expression of chaperones can result in increased yield of soluble protein.
  • Some fusion proteins can then be purified via their fusion protein domain (e.g. amylose resin for MBP or glutathione resin for GST).
  • Although the Trx tag has not been used for protein purification, it can improve the solubility of many target proteins and appears to catalyse the formation of disulphide bonds in the cytoplasm of E. coli trxB mutants.
  • the Inventors have determined that addition of BCCP to the N-terminus or C-terminus of a protein increases the solubility of the fusion protein and, in the case of addition to the N-terminus at least, increases the proportion of clones in a library that express encoded proteins (relative to a library that is not modified to also encode a BCCP tag).
  • the BCCP domain is biotinylated in vivo. This is particularly useful when attempting to multiplex protein purification for fabrication of protein arrays, since the proteins can be simultaneously purified from cellular lysates and immobilised in a single step via the high affinity and specificity exhibited by a streptavidin surface. The Inventors term this simultaneous purification and immobilisation "surface capture".
  • N-terminally or C-terminally fused BCCP permits monitoring of folding of fusion partner
  • reporter proteins with an assayable activity
  • reporter systems known in the art utilise green fluorescent protein (GFP), chloramphenicol acetyl transferase (CAT), β-galactosidase and the α-complementation of β-galactosidase.
  • the Inventors have determined that addition of BCCP to the N-terminus or C-terminus of a protein permits the monitoring of fusion protein folding by measuring the extent of in vivo biotinylation. This can be measured by standard blotting procedures, using SDS-PAGE or in situ colony lysis and transfer of samples to a membrane, followed by detection of biotinylated proteins using a streptavidin conjugate such as streptavidin-horseradish peroxidase.
  • the addition of biotin to the BCCP domain permits purification by surface capture as described above.
  • the invention provides the use of a tag moiety comprising a biotinylation domain for increasing the solubility of a protein of interest by attachment of said tag moiety to the N-terminus or C-terminus of said protein of interest.
  • a tag moiety comprising a biotinylation domain as defined herein is an amino acid sequence comprising a protein or protein domain which is capable of being biotinylated, or to which a biotin group can be attached.
  • the tag is highly soluble in the cytoplasm of the host cell in which it is expressed as a tag attached to a protein of interest.
  • the biotinylation domain of the invention is a protein or protein domain having secondary and tertiary structure and which is biotinylated in vivo post translationally.
  • the secondary and tertiary structure of the protein or domain is essential for recognition and hence biotinylation by the biotin ligase of the host cell in which expression of the tag is taking place.
  • biotinylation domain of the tag comprises the sequence of E. coli BCCP (Biotin Carboxyl Carrier Protein of Acetyl-CoA Carboxylase (ACCB) - Swiss-Prot Database Accession no. P02905), the nucleotide and amino acid sequence of which is:
  • BCCP proteins from the Swiss-Prot database:
  • Acetyl-/propionyl-coenzyme A carboxylase alpha chain [Includes: Biotin carboxylase (EC 6.3.4.14); Biotin carboxyl carrier protein (BCCP)].
  • {GENE BCCA OR ML0726 OR B1308_C1_129} - Mycobacterium leprae
  • BCCA_MYCTU (P46401) Acetyl-/propionyl-coenzyme A carboxylase alpha chain [Includes: Biotin carboxylase (EC 6.3.4.14); Biotin carboxyl carrier protein (BCCP)].
  • {GENE ACCA1 OR BCCA OR RV2501C OR MT2576 OR MTCY07A7.07C} - Mycobacterium tuberculosis
  • BCCP_ANASP (Q06881) Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP).
  • Biotin carboxyl carrier protein of acetyl-CoA carboxylase, chloroplast precursor (BCCP).
  • Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP).
  • BCCP_CHLMU (Q9PKR5) Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP).
  • Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP).
  • Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP).
  • {GENE ACCB OR CT123} - Chlamydia trachomatis
  • Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP).
  • Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP).
  • BCCP_LYCES (P05115) Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP).
  • Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP).
  • {GENE ACCB} - Porphyra purpurea [Chloroplast]
  • Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP).
  • {GENE ACCB-1} - Glycine max (Soybean)
  • biotinylation domains encoded by or comprising artificial sequences, for example where one or more amino acids have been altered by conservative substitution.
  • sequences can be rationally designed or derived from the sequences of BCCP given above, by methods known in the art. It is essential that these sequences have a secondary and tertiary structure that permits the artificial sequence to be recognised and biotinylated by a biotin ligase enzyme.
  • the invention provides the use of a tag moiety comprising a biotinylation domain for determining the folded state of a protein of interest by attachment of said tag moiety to the N-terminus or C-terminus of said protein of interest.
  • the tag moiety comprising a biotinylation domain as defined herein is a protein or protein domain which is conditionally biotinylated by a biotinylating enzyme, for example biotin ligase expressed in the host cell in which expression takes place or exogenously applied biotin ligase, for example, used to biotinylate proteins in a cell-free extract.
  • the domain can only be biotinylated through recognition of the folded structure of the domain by the enzyme such that the domain in linear, mis-folded or aggregated, form for example in inclusion bodies, is not biotinylated.
  • the folding of the tag and its subsequent biotinylation is dependent on the correct folding of the protein N-terminal to the C-terminal tag and vice versa.
  • the invention provides a method of increasing the solubility of a protein of interest when expressed in a host cell comprising the steps of: a) attaching a first nucleic acid molecule encoding a tag moiety comprising a biotinylation domain to a second nucleic acid molecule encoding said protein of interest to form a construct such that the expressed product of the combined first and second nucleic acid molecules comprises said tag moiety located at the N-terminus or C-terminus of said protein of interest; and b) expressing said construct in a host cell.
  • the invention provides a method of determining the folded state of a protein of interest comprising the steps of: a) attaching a first nucleic acid molecule encoding a tag moiety comprising a biotinylation domain to a second nucleic acid molecule encoding said protein of interest to form a construct such that the tag moiety in the expressed product of the combined first and second nucleic acid molecules is located at the N-terminus or C-terminus of said protein of interest; b) expressing said construct in a host cell under conditions such that only a correctly folded biotinylation domain present in said tag moiety is ligated with biotin; and c) determining the folded state of the protein of interest comprising said tag moiety by the presence or absence of a biotin group in the protein expressed from said construct.
  • the uses of the first and second aspect of the invention and the methods of the third and fourth aspects of the invention are preferably carried out in a multiplexed manner on more than one protein of interest. For example, wherein the protein of interest is
  • the invention provides a library of nucleic acid molecules encoding proteins of interest wherein each coding sequence is modified to incorporate at the N-terminus or C-terminus of the encoded protein a tag moiety comprising a biotinylation domain.
  • libraries may be generated using known techniques in the art.
  • the library can be generated using the COVET methodology described in WO 01/57198.
  • the invention provides a library of proteins produced from the methods of the third and fourth aspects of the invention or expressed from the library of the fifth aspect of the invention.
  • libraries may be arrayed on a solid substrate, for example through immobilisation to that substrate via, for example, a streptavidin-biotin link via the BCCP tag present on the proteins of the library.
  • the Inventors have also determined that the addition of DNA encoding a BCCP tag 5' to and in-frame with genes of interest in a library has the effect of significantly increasing the number of encoded proteins of interest which are expressed from that library compared to a library encoding the same proteins, but lacking the BCCP tag encoding sequence.
  • Such relative expression differences between "tagged" and "untagged" libraries can be detected or measured qualitatively, for example using western blotting techniques as known in the art.
  • the invention provides the use of a nucleic acid molecule encoding a tag moiety comprising a biotinylation domain for increasing the proportion of clones in a library that express the protein of interest encoded by each of said clones at detectable levels, for example as measured by conventional western blotting, by attachment of said nucleic acid molecule encoding said tag 5' to and in-frame with the gene encoding said protein of interest in each of said clones.
  • the invention provides a method of increasing the proportion of clones in a library that express the protein of interest encoded by each of said clones in a host cell at detectable levels, comprising the steps of: a) attaching a first nucleic acid molecule encoding a tag moiety comprising a biotinylation domain 5' to and in-frame with a second nucleic acid molecule encoding said protein of interest in a clonal member of said library to form a construct such that the expressed product of the combined first and second nucleic acid molecules comprises said tag moiety located at the N-terminus of said protein of interest; and b) expressing said construct in a host cell.
  • Although the tags, methods and libraries of the invention are particularly suited to facilitating parallel expression, purification and immobilisation of proteins encoded by a library of sequences (by a common method of solubilisation and purification of the proteins of interest), the invention can also be applied to other methodologies known in the art.
  • an N-terminal or C-terminal tag according to the invention, for example BCCP, can be used to increase both protein expression and solubility in:
  • Antigen production used for the generation of monoclonal or polyclonal antibodies, monoclonal antibody or single chain antibody production
  • Drug target validation by generation of protein drug targets including, but not exclusively, kinases, phosphatases, cell receptors or proteases for screening, enzyme and / or toxicology studies and any other biochemical analysis.
  • Figure 1 shows the colony western data using Streptavidin-HRP conjugate as the probe.
  • the clones expressing in-frame GFP-BCCP that fluoresced green are also biotinylated.
  • the bottom row are clones that harbour pMSC301 (no bccp gene sequence in the plasmid), and the signal obtained is the background signal of endogenous biotinylated AccB.
  • the second row from the bottom are the clones harbouring pMSC302 (overexpressing accB).
  • the other negative clones (out-of-frame fusions or re-ligated vector) did not fluoresce green and were not biotinylated.
  • Figure 2 shows colony western data using Streptavidin-HRP conjugate as the probe.
  • the clones expressing in-frame GST-GFP-BCCP that fluoresced green are also biotinylated. Also shown as a biotinylation-positive signal is the protein GST-BCCP.
  • the negative control is clones that harbour pMSC301 (no bccp gene sequence in the plasmid), and the signal obtained is the background signal of endogenous biotinylated AccB.
  • the positive control is the clone harbouring pMSC302 (overexpressing accB).
  • the other negative clones (out-of-frame fusions or re-ligated vector) did not fluoresce green and were not biotinylated.
  • Figure 3 shows western blot analysis of the protein extract from cells expressing GFP-BCCP.
  • the signal obtained at approximately 37 kDa corresponds to the expected Mr of GFP-BCCP.
  • Another signal seen at 18 kDa is that of endogenous biotinylated AccB protein, also seen in the GFP-BCCP negative lanes. As expected, the 18 kDa signal is stronger when no recombinant biotinylated protein is expressed.
  • Lanes 1, 2 and 3 Protein extract from clones harbouring pGFP-BCCP, expressing intact GFP-BCCP protein.
  • Lanes 4, 5 and 6 Protein extract from clones harbouring pMSC301A, B, and C respectively, used as negative control in the experiment.
  • Figure 4 shows western blot analysis of protein extracts from cells expressing GST-GFP-BCCP, and GST-BCCP. Biotinylated proteins of expected Mr. are observed (63 kDa for GST-GFP-BCCP and 37 kDa for GST-BCCP). In all the lanes 18 kDa signal for endogenous AccB is present.
  • Lanes 1, 2 and 4 are protein extract from cells expressing GST-GFP-BCCP.
  • Lane 3 is the protein extract from cells expressing GFP-BCCP as a positive control in this experiment.
  • Lanes 5 and 6 Protein extract from clones harbouring pMSC301A, and B as negative controls in the blot.
  • Lanes 7 and 8 Protein extracts from cells expressing GST-BCCP.
  • Figure 5 shows a colony western blot using streptavidin-HRP as the probe for biotinylation of BCCP in the fusion protein. All clones that were marked as fluorescing green when excited at 365 nm wavelength were also biotinylated.
  • Figure 6 shows protein expression results of the human gene set cloned into the Avi-Tag vector pQE82L-GFP-biotin.
  • LB-Amp 100 µg/ml ampicillin
  • the molecular weight markers are: aprotinin (7.6 kDa), lysozyme (18.4 kDa), soybean trypsin inhibitor (32.5 kDa), carbonic anhydrase (45.7 kDa), BSA (78 kDa), β-galactosidase (132 kDa) and myosin (216 kDa).
  • Figure 7 shows protein expression results of the human gene set cloned into the BCCP expressing vector pMD004.
  • the molecular weight markers are: aprotinin (7.6 kDa), lysozyme (18.4 kDa), soybean trypsin inhibitor (32.5 kDa), carbonic anhydrase (45.7 kDa), BSA (78 kDa), β-galactosidase (132 kDa) and myosin (216 kDa).
  • Figure 8 shows plasmid maps of pMD002 and pMD004.
  • Figure 9 shows a plasmid map of pIFM101A/B/C
  • Figure 10 shows the cloning site of plasmid pIFM101A
  • Figure 11 shows the cloning site of plasmid pIFM101B
  • Figure 12 shows the cloning site of plasmid pIFM101C
  • the DNA sequence encoding the entire coding region of acetyl-CoA carboxylase was amplified by PCR from genomic DNA of XL1-Blue (Stratagene) cells, using the following gene specific primers. accbforl: 5' GATGGATCCGATATTCGTAAGATTAAAAAACTGATCG 3' with BamHI site at the 5' end. bccprevl: 5'
  • the PCR amplification was carried out using Pwo polymerase (Roche) using standard cycling conditions (94°C 5 min; 94°C 30 sec; 64°C 1 min; 72°C 1 min; 30 cycles; 72°C 5 min).
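  • As a rough check on run length, the cycling conditions quoted above can be totalled. The sketch below is illustrative only: the function name `program_minutes` is hypothetical (it does not come from the source), and the result counts programmed hold times only, ignoring the time the thermocycler spends ramping between temperatures.

```python
def program_minutes(initial_s, cycle_steps_s, cycles, final_s):
    """Total programmed hold time in minutes (ramp times are ignored)."""
    return (initial_s + cycles * sum(cycle_steps_s) + final_s) / 60


# Conditions quoted in the text: 94°C 5 min; 30 cycles of
# 94°C 30 sec, 64°C 1 min, 72°C 1 min; then 72°C 5 min.
pwo_total = program_minutes(5 * 60, [30, 60, 60], 30, 5 * 60)
print(f"{pwo_total:.0f} min")  # 85 min
```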
  • the PCR amplified gene sequence was cloned into the BamHI and SacI sites of the E. coli expression vector pQE-80 (Qiagen) in-frame with the N-terminal hexahistidine tag to form the plasmid pMSC302.
  • the identity of the gene sequence was confirmed by restriction mapping and DNA sequencing.
  • the DNA sequence corresponding to the C-terminal domain of AccB known as biotin carboxyl carrier protein (BCCP) was amplified by PCR using the same reverse primer as above and a new forward primer.
  • the vector pQE-80 was redesigned to delete the DNA sequence for the hexahistidine tag, add additional cloning sites (NotI and SfiI), and have three different reading frames from the start ATG (pMSC301 A/B/C). This was carried out by inverse PCR using the primer sets; pQErevl: 5'P
  • the bccp gene sequence was cloned into the PstI-HindIII sites of pMSC301 A, B, and C vectors to generate pMSC301A,B,C/BCCP.
  • the DNA sequence encoding GFPuv (Clontech) was amplified by PCR using the primer set pQEGFPforl: 5' GGGCCGGTGGCAGCGCGAGTAAAGGAGAAGAACTTTTCACTGG 3' (with SmaI half site and a linker region) and pQEGFPrevl: 5' GATCTGCAGGGTACCGGATCCTTTGTAGAGCTCATCCATGCC 3' (with PstI, KpnI and BamHI sites).
  • the PCR amplified product was cloned into the SmaI-PstI sites of pMSC301A, B and C/BCCP in-frame to the DNA sequence encoding the N-terminus of BCCP (GFP-BCCP) to generate the vectors pMSC303A, B, and C.
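  • The restriction sites stated for the GFPuv primers can be checked by a simple motif search. The following is a minimal sketch, not part of the original method: the helper `find_sites` and the variable names are hypothetical, the recognition sequences are the standard ones for the named enzymes, and the primer strings are those quoted above with internal spaces removed.

```python
# Standard recognition sequences of the enzymes named in the text.
SITES = {"BamHI": "GGATCC", "KpnI": "GGTACC", "PstI": "CTGCAG", "SmaI": "CCCGGG"}


def find_sites(primer, sites=SITES):
    """Return {enzyme: 0-based index of first match} for each site present."""
    seq = primer.upper().replace(" ", "")
    return {name: seq.find(motif) for name, motif in sites.items() if motif in seq}


# Primer sequences as quoted in the text (5' to 3').
rev_primer = "GATCTGCAGGGTACCGGATCCTTTGTAGAGCTCATCCATGCC"
fwd_primer = "GGGCCGGTGGCAGCGCGAGTAAAGGAGAAGAACTTTTCACTGG"

print(find_sites(rev_primer))        # PstI, KpnI and BamHI sites are found
print(fwd_primer.startswith("GGG"))  # the SmaI half site (CCC|GGG) at the 5' end
```

  • In the reverse primer the three sites sit immediately next to one another after a three-base 5' overhang, which is consistent with the description "with PstI, KpnI and BamHI sites".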
  • the plasmid construct pMSC303B was restricted with NotI, the staggered ends were made blunt using the filling-in reaction of T4 Polymerase (NEB), restricted with SmaI and religated (plasmid designated as pGFP-BCCP).
  • the vectors pMSC301A/BCCP and pMSC303A were restricted with NotI, the overhangs blunted using T4 DNA polymerase, restricted with SmaI and were used to clone the DNA fragment encoding GST, forming the plasmid constructs pGST-BCCP and pGST-GFP-BCCP respectively.
  • the DNA sequence encoding GST was amplified by PCR using the primers; GSTfwdOl: 5' TCCCCTATACTAGGTTATTGG 3' and GSTrevexoN: 5' GGGCGTCACGATGAATTCCCGGG 3' and pGEX-2T (Pharmacia) as template.
  • the NotI and SfiI cloning sites of the vectors pMSC303 A, B and C were replaced by the SfiI overhang compatible restriction site, DraIII, to generate the vectors pIFM101A, B, and C.
  • the reverse primer used was pQErevl as described earlier.
  • the PCR conditions used were same as before.
  • ~100 ng template plasmid library (human heart cDNA library in pDNR-LIB from Clontech)
  • SP5forward 5' ATGCTCATGAGGCCGGCCGGGAATTCGGCCATTACGGCCGG 3' with FseI and SfiI sites
  • SP3reverse 5' GTCTAGAAAGCTTCTCGAGGGCCG 3' to optimally incorporate alpha-phosphothioate dTTPs (α-S-dTTP; Amersham).
  • the PCR reaction was carried out using 50 pmol each primer, 2.5 units thermostable polymerase (lacking a 3' to 5' exonuclease activity, e.g. Taq polymerase), a standard buffer and the deoxynucleotide triphosphate mix: 200 µM dATP, 200 µM dGTP, 200 µM dCTP, 100 µM dTTP, 100 µM α-S-dTTP.
  • the PCR amplified products were purified using QIAquick PCR cleanup kits (Qiagen) and subjected to Fsel digestion to produce a 3' nucleotide overhang which protects the 5' end of the dsDNA from subsequent hydrolysis by exonuclease III (NEB). Exonuclease III digestion was performed using standard conditions and the presence of phosphothioate internucleotide linkages blocked any further hydrolysis.
  • the E. coli strains XL1-Blue or XL10-Gold (Stratagene) were used as host cells and were transformed (electroporation or chemical method) using various plasmid constructs.
  • the transformation mixture was plated at an appropriate dilution on a nitrocellulose membrane placed on LB-Agar containing 100 µg/ml carbenicillin. After overnight incubation at 30°C the membranes were transferred onto LB-Agar containing 400 µM IPTG and carbenicillin and incubated for another 4-5 hrs at 30°C.
  • the GFP activity of the clones was assessed by visualizing the clones at 365 nm wavelength on the UV-transilluminator.
  • the membranes were processed for detecting biotinylated BCCP or GFP.
  • the cultures were induced at mid log phase (optical density at 600 nm of 0.5 to 0.6) by adding 400 µM of IPTG to the culture and growth of cells continued for another 3-4 hours at 30°C.
  • cells were harvested, proteins resolved on 10-20 % gradient SDS-gel (Invitrogen), blotted onto nitrocellulose membrane and probed with various antibodies or streptavidin.
  • the biotinylation of BCCP was detected by probing with a streptavidin-horseradish peroxidase (HRP) conjugate (Amersham) on colony blots (as described) or on western blots as known in the art.
  • the clones were either gridded robotically, or the transformation mix was plated, onto nitrocellulose membrane (Amersham) placed on a LB agar plate containing carbenicillin. After overnight incubation at 30°C, the membrane was placed onto a fresh LB agar plate containing carbenicillin and IPTG (400 µM). The plate was incubated for another 4-5 hours at 30°C.
  • the colonies on the membrane were subjected to alkaline lysis and the membrane blocked prior to addition of the probe.
  • the membrane is first placed on two sheets of Whatman 3 paper pre-soaked with 0.5 M NaOH, 1.5 M NaCl for 10 min.
  • the membrane is neutralised by placing on Whatman 3 sheets soaked with 1 M Tris-HCl pH 7.5, 1.5 M NaCl for 5 min, two times.
  • the membrane is then transferred onto Whatman 3 sheets wetted in PBS-T (0.1%) containing 1% SDS for 10 min.
  • the membrane is then washed thoroughly in PBS-T ensuring that all the cell debris has been dislodged.
  • the blot is then ready to be processed in the same manner as a western blot.
  • the Streptavidin-HRP conjugate was used at a dilution of 1:4000 and the signal was detected by chemiluminescence using the ECL system from Amersham.
  • the green fluorescence of GFP was visualized by exciting the colonies at 365 nm wavelength using a transilluminator.
  • Figures 1 and 2 show the colony western data using streptavidin-horseradish peroxidase as the probe.
  • Only the correct in-frame fusions of GST-GFP-BCCP, GST-BCCP and GFP-BCCP gave a strong positive signal significantly above the general background from endogenous biotinylated AccB.
  • All and only biotinylated fusion proteins (GST-GFP-BCCP and GFP-BCCP) fluoresced green when excited at 365 nm.
  • Figure 5 shows a colony western blot probed with streptavidin-horseradish peroxidase conjugate.
  • the positive hits are the ones that were marked as green when visualized at 365 nm. Only 4 out of 36 were biotinylated but not visibly green. This could be because the detection method used for biotinylation of BCCP is much more sensitive than visual detection of green fluorescence.
  • the pQE82L-GFP-biotin and pMD004 plasmids were constructed by standard techniques (T. Maniatis et al (1989) Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Press) and both consist of a pQE82L vector (Qiagen) backbone, with an RGS-His tag followed by either the "Avi-Tag" sequence or BCCP protein domain respectively, followed by a multi-cloning site.
  • the 5'-phosphorylated forward primers consist of the first 24 bp at the beginning of the relevant sequence, starting with a full codon. Some of the forward primers are longer to incorporate a G or C at the 3' end.
  • the reverse primers consist of the last 24 bp of the relevant sequence (longer if necessary to incorporate a G or C at the 5' end) which is then appended to the beginning of the reverse primer template (TGATAGAAGAGCGGCCGC). The final reverse primer would be the reverse complement of this.
  • This primer results in the stop codon of all the fusions being defined and followed by a NotI site for cloning into the N-terminal tagging vector described above.
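  • The primer rules described above can be sketched in a few lines. This is one possible reading of the text, not the Inventors' own tooling: the function names, the `min_len` parameter and the example sequence are hypothetical, and the sketch assumes the input is the sense-strand coding sequence without its native stop codon, so that the TGATAG stop codons come from the appended tail.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
REVERSE_TAIL = "TGATAGAAGAGCGGCCGC"  # stop codons followed by a NotI site (GCGGCCGC)


def revcomp(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def forward_primer(cds, min_len=24):
    """First min_len bases of the CDS, extended until the 3' base is G or C."""
    n = min_len
    while n < len(cds) and cds[n - 1] not in "GC":
        n += 1
    return cds[:n]


def reverse_primer(cds, min_len=24, tail=REVERSE_TAIL):
    """Last min_len bases (extended 5' until they start with G or C) plus the
    tail, returned as the reverse complement."""
    n = min_len
    while n < len(cds) and cds[-n] not in "GC":
        n += 1
    return revcomp(cds[-n:] + tail)


# Hypothetical coding sequence used only to exercise the functions.
example_cds = "ATGAGTAAAGGAGAAGAACTTTTTACTGGAGTTGTCCCAATTCTTGTTGAATTAGATGGTGATGTTAATGGG"
print(forward_primer(example_cds))
print(reverse_primer(example_cds))
```

  • Because the tail is appended before taking the reverse complement, every reverse primer begins with the complement of the NotI site and stop codons, and ends with the gene-specific portion.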
  • Two cDNA templates were combined at a final concentration of 10 ng/µl. These were a) human heart cDNA plasmid library (Life Technologies) & b) HeLa cell cDNA plasmid library (Invitrogen). All primers were reconstituted in distilled water to 100 pmol/µl.
  • a master mix was prepared (without primers) from: Template (10 ng), PWO polymerase buffer with magnesium sulphate (1x final concentration), dNTPs (5 mM final conc.), PWO polymerase (2.5 units), dimethyl sulfoxide (10% final conc.) and distilled water to a final volume of 48 µl per reaction.
  • the master mix was aliquoted into 96-well PCR plates (Eppendorf) and 1 µl of each primer added on ice. Conditions were as follows: 94°C for 3 min, then 94°C for 30 secs, 59°C for 30 secs, 72°C for 2 min (32 cycles) and finally 72°C for 7 min. Products were checked on 2% agarose gels/TBE and purified using Qiaquick PCR purification columns (Qiagen). Clean dsDNA was digested with NotI in a standard digestion mixture and cleaned again.
  • Hoescht 33258 assay To quantify the dsD ⁇ A in preparation for cloning a low range standard curve of an unrelated, clean PCR product in 1:1000 Hoescht dye (stock lmg/ml)/lxT ⁇ E (Tris lOmM, EDTA ImM, ⁇ aCl 0.2 M pH 7.4) was set up at 80, 40, 20, 10, 5, 2.5, 1.25, 0 ng/100 ⁇ l. 1 ⁇ l of each experimental PCR product was added to 99 ⁇ l of 1:1000 Hoescht/T ⁇ E, mixed in clear bottomed, black sided 96 well microtiter plates (Corning) and fluorescence read at 365/465nm. The standard curve was plotted and dsD ⁇ A content of each 'insert preparation 1 calculated as ng/ ⁇ l
  • Inserts were ligated to the vector prep with an approximate molar ratio of 3:1 (insert: vector). Ligations were carried out in a 96-well PCR plate with the rapid D ⁇ A ligation kit (Roche). The ligations (2 ⁇ l of each) were used to transform 30 ⁇ l of XL 1 -Gold Supercompetent cells (Stratagene), according to the protocol, in a thin wall 96-well PCR plate. After heat shock, the transformations were added to 300 ⁇ l of pre- warmed SOC medium in a 96-well deep well block and shaken at 37°C for 45 minutes.
  • the BCCP domain can increase the overall number of clones expressing soluble protein when expressed as an N-terminal fusion to the target protein.
  • the results indicate that the BCCP domain can increase the solubility of a protein of interest.
  • the tight correlation observed between biotinylation and solubility of expressed fusions demonstrates that biotinylation of BCCP acts as a folding marker when fused to the N-terminus of a protein of interest.
  • the ability of the BCCP protein to be biotinylated provides a highly specific means to capture the protein on a streptavidin surface.
  • Table 1. Protein Expression Summary. Proteins were chosen and corresponding gene inserts were cloned into the pQE-GFP-biotin (vector 1) or the BCCP pMD004 (vector 2), resulting in fusions to the C-terminus of either a hexa-histidine-Avi-Tag peptide or a hexa-histidine-BCCP protein. Only inserts cloned into both vectors are compared in terms of protein expression. Key to table: 1. Internal coding number. 2. Protein database accession number (www.oca.ebi.ac.uk). 3. DNA gene length in base pairs. 4. Protein size when expressed as a fusion with BCCP, in amino acids (aa). 5.
  • Bar-to-Autoint. 5 2EZZ 285 200 26.0 1-89/89 orf C.H.B.S. C.H.B.S.
  • Carb. Anhyd. II 9 1A42 798 371 48.2 371 / 371 orf C. C.H.B.S.
  • Hck Kinase 19 3HCK 336 217 28.2 C.H.B.S. C.H.B.S. 245/526orf
  • Rhoa 51 1CXZ 561 292 38.0 1-181/193orf C. C.H.B.S.
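The primer design rules given above (the first or last 24 bp of the relevant sequence, extended where necessary so that the primer ends on a G or C, with the reverse primer template TGATAGAAGAGCGGCCGC appended before taking the reverse complement) can be sketched in Python. This is only an illustrative sketch, not the software used by the Inventors: the function names and the `min_len` parameter are hypothetical, and the reading of "G or C at the 5' end" as applying to the sense-strand segment before reverse complementing is an assumption.

```python
# Illustrative sketch of the primer design rules; all names here are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Reverse primer template quoted in the text: stop codons followed by a NotI site.
REVERSE_TEMPLATE = "TGATAGAAGAGCGGCCGC"


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def forward_primer(orf, min_len=24):
    """First min_len bases of the ORF, extended until the 3' base is G or C."""
    orf = orf.upper()
    end = min_len
    while end < len(orf) and orf[end - 1] not in "GC":
        end += 1
    return orf[:end]


def reverse_primer(orf, min_len=24):
    """Last min_len bases (extended upstream until the segment starts with G or C),
    followed by the reverse primer template, then reverse complemented."""
    orf = orf.upper()
    start = max(0, len(orf) - min_len)
    while start > 0 and orf[start] not in "GC":
        start -= 1
    return reverse_complement(orf[start:] + REVERSE_TEMPLATE)
```

Because the template is appended before reverse complementing, every reverse primer produced this way begins with the reverse complement of the template, so each amplified insert ends with the defined stop codons followed by the NotI site.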

Abstract

The use of a tag moiety comprising a biotinylation domain, such as biotin carboxyl carrier protein (BCCP), as a protein folding marker and protein solubility enhancer in the orientated surface capture of products of heterologously expressed genes is described. Methods for increasing the solubility of proteins and determining the folded state of a protein are also disclosed. The uses and methods of the invention can be carried out in a multiplexed manner on more than one protein in the formation of libraries. In addition the nucleic acid molecule encoding the biotinylation domain of the tag moiety can be used to increase the proportion of clones in a library that express the protein of interest.

Description

PROTEIN TAG COMPRISING A BIOTINYLATION DOMAIN AND METHOD FOR INCREASING SOLUBILITY AND DETERMINING FOLDING STATE
This invention relates to the use of biotin carboxyl carrier protein (BCCP) as a protein folding marker and protein solubility enhancer in the orientated surface capture of products of heterologously expressed genes.
Expression of human proteins in heterologous systems such as bacteria, yeast, insect cells or mammalian cells can result in the production of incorrectly folded proteins, resulting in the formation of insoluble aggregates or a low yield of expressed proteins because of the targeting of the unfolded proteins to the proteasome. For all functional protein procedures the production of correctly folded or native proteins is essential and a great deal of work is often performed to optimise the expression of individual proteins. However, many areas of protein biochemistry involve working with libraries or groups of proteins of such a size that optimisation of individual expression and purification conditions for each protein is impractical. Hence, there exists an unmet need in the art for reagents, protocols and methodology that facilitate the multiplexing of these processes.
Affinity tags are a convenient method of purification and immobilisation of recombinant proteins. Hexahistidine tags (6 amino acids (aa); Qiagen/Roche), Escherichia coli maltose binding protein ("MBP", 300 aa; New England Biolabs) and Schistosoma japonicum glutathione-S-transferase (GST, 220 aa; Amersham Pharmacia Biotech/Novagen) are effective, but have the disadvantage that heterologous host proteins interact with the affinity matrices used for purification of fusion proteins. This results in impure protein preparations and an additional clean-up step is often required. Additionally, the relatively weak affinity of these proteins for their ligands results in dissociation, or "leaching", of the fusion proteins from surfaces to which they are immobilised. Such reversible interactions are exploited during resin-based purifications in column or batch formats where, because of the high local concentrations of ligand, dissociated proteins rapidly rebind, yet are rapidly eluted by free ligand. In contrast, immobilisation of proteins to planar surfaces such as microtiter plates or microarrays (for example, biochips) requires that they remain bound and do not leach from the substrate during storage and use. As such, lower affinity tags as used for purification (e.g. MBP, GST and hexahistidine tags) are suboptimal. Frequently, covalent immobilisation strategies are employed, such as coupling of purified proteins via surface lysine residues to amine-reactive chemical groups. This is generally accepted to result in reduced activity of the protein.
In contrast to the lower affinity, non-covalent interactions described above, the interaction of biotin with streptavidin, avidin or neutravidin exhibits some of the highest affinities known in biology, with equilibrium dissociation constants of 10⁻¹⁵ M (several orders of magnitude higher affinity than the MBP-amylose or GST-glutathione interactions). Whilst still a weaker interaction than covalent coupling, biotinylated proteins bound to a streptavidin-derivatised surface show negligible dissociation. This interaction therefore provides an improved means for tethering proteins to a planar surface for applications such as protein arrays and enzyme-linked immunoassays (ELISAs).
Biotin can be attached chemically to proteins (e.g. using NHS-activated biotin), or via genetically fused protein domains which are biotinylated in vivo. The "PinPoint™" vectors from Promega are designed to facilitate the creation of fusions to the biotinylation domain (which is a fragment of the biotin carboxyl carrier protein (BCCP) of methylmalonyl-CoA carboxyl transferase from Propionibacterium freudenreichii shermanii [US Patent 5,252,466]). This protein has 40% homology with the E. coli BCCP. This system allows the production of BCCP-protein fusions capable of being biotinylated either in vivo or in vitro by biotin ligase, allowing one to use the highly specific biotin-streptavidin interaction for surface capture. In addition to the BCCP domain, phage display selected short peptides capable of being biotinylated on a lysine residue have been commercialised by Avidity Inc. [US Patent 5,932,433].
The Inventors herein describe a novel approach whereby BCCP from E. coli is fused either N- or C-terminally to a protein partner. In addition to the function of permitting orientated immobilisation of the fusion protein to microarray-compatible surfaces derivatised with avidin, streptavidin or neutravidin, the Inventors describe new, previously unreported functions of BCCP which greatly facilitate the creation of libraries of solubly expressed, folded human, mammalian, fungal, plant or microbial proteins in heterologous systems.
i) N-terminally or C-terminally fused BCCP improves levels of folding of fusion partner
The factors determining the solubility of recombinant proteins are poorly understood and so rational design of solubility and increased expression into recombinant proteins is only possible to a limited extent. However, by fusing well expressed soluble proteins to the N-terminus of a protein, both properties can be greatly improved compared with expression of ORFs alone. Examples include MBP, GST and thioredoxin (Trx, 109 aa; Novagen). A possible mechanism of action is thought to be the recruitment of chaperones to the nascent polypeptide, and co-over-expression of chaperones can result in increased yield of soluble protein. Some fusion proteins can then be purified via their fusion protein domain (e.g. amylose resin for MBP or glutathione resin for GST). Although the Trx tag has not been used for protein purification, it can improve the solubility of many target proteins and it appears to catalyse the formation of disulphide bonds in the cytoplasm of E. coli trxB mutants. The Inventors have determined that addition of BCCP to the N-terminus or C-terminus of a protein increases the solubility of the fusion protein and, in the case of addition to the N-terminus at least, increases the proportion of clones in a library that express encoded proteins (relative to a library that is not modified to also encode a BCCP tag). Additionally, the BCCP domain is biotinylated in vivo. This is particularly useful when attempting to multiplex protein purification for fabrication of protein arrays, since the proteins can be simultaneously purified from cellular lysates and immobilised in a single step via the high affinity and specificity exhibited by a streptavidin surface. The Inventors term this simultaneous purification and immobilisation "surface capture".
ii) N-terminally or C-terminally fused BCCP permits monitoring of folding of fusion partner
Fusion of reporter proteins (with an assayable activity) onto the C-terminus of partner proteins has been previously shown to allow monitoring of the folding of the partner. Notable examples of reporter systems known in the art utilise green fluorescent protein (GFP), chloramphenicol acetyl transferase (CAT), β-galactosidase and the α-complementation of β-galactosidase.
The Inventors have determined that addition of BCCP to the N-terminus or C-terminus of a protein permits the monitoring of fusion protein folding by measuring the extent of in vivo biotinylation. This can be measured by standard blotting procedures, using SDS-PAGE or in situ colony lysis and transfer of samples to a membrane, followed by detection of biotinylated proteins using a streptavidin conjugate such as streptavidin-horseradish peroxidase. Importantly, the addition of biotin to the BCCP domain permits purification by surface capture as described above.
Thus in a first aspect the invention provides the use of a tag moiety comprising a biotinylation domain for increasing the solubility of a protein of interest by attachment of said tag moiety to the N-terminus or C-terminus of said protein of interest.
A tag moiety comprising a biotinylation domain as defined herein is an amino acid sequence comprising a protein or protein domain which is capable of being biotinylated, or to which a biotin group can be attached. In accordance with the first aspect of the invention the tag is highly soluble in the cytoplasm of the host cell in which it is expressed as a tag attached to a protein of interest.
Essentially, the biotinylation domain of the invention is a protein or protein domain having secondary and tertiary structure and which is biotinylated in vivo post-translationally. Generally the secondary and tertiary structure of the protein or domain is essential for recognition and hence biotinylation by the biotin ligase of the host cell in which expression of the tag is taking place.
Preferably the biotinylation domain of the tag comprises the sequence of E. coli BCCP (Biotin Carboxyl Carrier Protein of Acetyl-CoA Carboxylase (ACCB) - Swiss-Prot Database Accession no. P02905), the nucleotide and amino acid sequence of which is:
BCCP domain: Nucleotide
gcagcagcggaaatcagtggtcacatcgtacgttccccgatggttggtactttcta ccgcaccccaagcccggacgcaaaagcgttcatcgaagtgggtcagaaagtcaacg tgggcgataccctgtgcatcgttgaagccatgaaaatgatgaaccagatcgaagcg gacaaatccggtaccgtgaaagcaattctggtcgaaagtggacaaccggtagaatt tgacgagccgctggtcgtcatcgagtaa
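As a consistency check, the nucleotide sequence above can be translated with the standard genetic code and compared with the amino acid sequence listed for the BCCP domain. The following Python sketch is only an illustration: the names `CODON_TABLE`, `BCCP_DNA` and `translate` are hypothetical, and reading the sequence in frame from its first base is an assumption.

```python
# Standard genetic code, built in TCAG order (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# BCCP domain nucleotide sequence as listed in the description.
BCCP_DNA = (
    "gcagcagcggaaatcagtggtcacatcgtacgttccccgatggttggtactttcta"
    "ccgcaccccaagcccggacgcaaaagcgttcatcgaagtgggtcagaaagtcaacg"
    "tgggcgataccctgtgcatcgttgaagccatgaaaatgatgaaccagatcgaagcg"
    "gacaaatccggtaccgtgaaagcaattctggtcgaaagtggacaaccggtagaatt"
    "tgacgagccgctggtcgtcatcgagtaa"
)


def translate(dna):
    """Translate DNA from its first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


bccp_protein = translate(BCCP_DNA)
print(len(BCCP_DNA), "nt ->", len(bccp_protein), "aa")
print(bccp_protein)
```

With the sequence as listed (252 nt), the translation gives 83 residues before the TAA stop codon, beginning AAAEISGHIV and ending EPLVVIE.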
Amino acid:
AAAEISGHIVRSPMVGTFYRTPSPDAKAFIEVGQKVNVGDTLCIVEAMKMMNQIEADKSGTVKAILVESGQPVEFDEPLVVIE
Alternatively, other sequences encoding BCCP known in the art can be used as the biotinylation domain of the invention, for example other BCCP proteins from the Swiss-Prot database:
BCCA_MYCLE (P46392)
Acetyl-/propionyl-coenzyme A carboxylase alpha chain [Includes: Biotin carboxylase (EC 6.3.4.14); Biotin carboxyl carrier protein (BCCP)]. {GENE: BCCA OR ML0726 OR B1308_C1_129} - Mycobacterium leprae
BCCA_MYCTU (P46401)
Acetyl-/propionyl-coenzyme A carboxylase alpha chain [Includes: Biotin carboxylase (EC 6.3.4.14); Biotin carboxyl carrier protein (BCCP)]. {GENE: ACCA1 OR BCCA OR RV2501C OR MT2576 OR MTCY07A7.07C} - Mycobacterium tuberculosis
BCCP_ANASP (Q06881)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP). {GENE: ACCB} - Anabaena sp. (strain PCC 7120)
BCCP_ARATH (Q42533)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase, chloroplast precursor (BCCP). {GENE: CAC1 OR BCCP1 OR AT5G16390 OR MQK4.12} - Arabidopsis thaliana (Mouse-ear cress)
BCCP_BACSU (P49786)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP). {GENE: ACCB OR FABE} - Bacillus subtilis
BCCP_CHLMU (Q9PKR5)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP). {GENE: ACCB OR TC0399} - Chlamydia muridarum
BCCP_CHLPN (Q9Z901)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP). {GENE: ACCB OR CPN0183 OR CP0585} - Chlamydia pneumoniae (Chlamydophila pneumoniae)
BCCP_CHLTR (O84125)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP). {GENE: ACCB OR CT123} - Chlamydia trachomatis
BCCP_CYACA (O19918)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP). {GENE: ACCB} - Cyanidium caldarium [Chloroplast]
BCCP_ECOLI (P02905)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP). {GENE: ACCB OR FABE OR B3255 OR Z4615 OR ECS4127} - Escherichia coli, Escherichia coli O157:H7
BCCP_HAEIN (P43874)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP). {GENE: ACCB OR FABE OR HI0971} - Haemophilus influenzae
BCCP_LYCES (P05115)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP) (Fragment). - Lycopersicon esculentum (Tomato)
BCCP_PORPU (P51283)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP). {GENE: ACCB} - Porphyra purpurea [Chloroplast]
BCCP_PROFR (P02904)
Biotin carboxyl carrier protein of methylmalonyl-CoA carboxyl-transferase (Transcarboxylase, 1.3S subunit). - Propionibacterium freudenreichii shermanii
BCCP_PSEAE (P37799)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase (BCCP). {GENE: ACCB OR FABE OR PA4847} - Pseudomonas aeruginosa
BCCP_SOYBN (Q42783)
Biotin carboxyl carrier protein of acetyl-CoA carboxylase, chloroplast precursor (BCCP). {GENE: ACCB-1} - Glycine max (Soybean)
BCCP_STRMU (P29337)
Biotin carboxyl carrier protein (BCCP). - Streptococcus mutans
Also included within the scope of the invention are biotinylation domains encoded by or comprising artificial sequences, for example where one or more amino acids have been altered by conservative substitution. Such sequences can be rationally designed or derived from the sequences of BCCP given above, by methods known in the art. It is essential that these sequences have a secondary and tertiary structure that permits the artificial sequence to be recognised and biotinylated by a biotin ligase enzyme.
In a second aspect, the invention provides the use of a tag moiety comprising a biotinylation domain for determining the folded state of a protein of interest by attachment of said tag moiety to the N-terminus or C-terminus of said protein of interest.
In this second aspect, the tag moiety comprising a biotinylation domain as defined herein is a protein or protein domain which is conditionally biotinylated by a biotinylating enzyme, for example biotin ligase expressed in the host cell in which expression takes place, or exogenously applied biotin ligase, for example as used to biotinylate proteins in a cell-free extract. Essentially, the domain can only be biotinylated through recognition of the folded structure of the domain by the enzyme, such that the domain in linear, mis-folded or aggregated form (for example in inclusion bodies) is not biotinylated. The folding of the tag and its subsequent biotinylation is dependent on the correct folding of the protein N-terminal to the C-terminal tag and vice versa.
In a third aspect the invention provides a method of increasing the solubility of a protein of interest when expressed in a host cell comprising the steps of:
a) attaching a first nucleic acid molecule encoding a tag moiety comprising a biotinylation domain to a second nucleic acid molecule encoding said protein of interest to form a construct such that the expressed product of the combined first and second nucleic acid molecules comprises said tag moiety located at the N-terminus or C-terminus of said protein of interest;
b) expressing said construct in a host cell.
In a fourth aspect the invention provides a method of determining the folded state of a protein of interest comprising the steps of:
a) attaching a first nucleic acid molecule encoding a tag moiety comprising a biotinylation domain to a second nucleic acid molecule encoding said protein of interest to form a construct such that the tag moiety in the expressed product of the combined first and second nucleic acid molecules is located at the N-terminus or C-terminus of said protein of interest;
b) expressing said construct in a host cell under conditions such that only a correctly folded biotinylation domain present in said tag moiety is ligated with biotin;
c) determining the folded state of the protein of interest comprising said tag moiety by the presence or absence of a biotin group in the protein expressed from said construct.
The uses of the first and second aspects of the invention and the methods of the third and fourth aspects of the invention are preferably carried out in a multiplexed manner on more than one protein of interest, for example wherein the protein of interest is encoded by a nucleic acid molecule which forms part of a library comprising two or more different coding sequences and, optionally, wherein the different coding sequences are modified to contain the tag moiety and expressed in parallel.
Thus in a fifth aspect the invention provides a library of nucleic acid molecules encoding proteins of interest wherein each coding sequence is modified to incorporate at the N-terminus or C-terminus of the encoded protein a tag moiety comprising a biotinylation domain. Such libraries may be generated using techniques known in the art. Usefully, the library can be generated using the COVET methodology described in WO 01/57198.
Accordingly, in a sixth aspect, the invention provides a library of proteins produced from the methods of the third and fourth aspects of the invention or expressed from the library of the fifth aspect of the invention. Such libraries may be arrayed on a solid substrate, for example through immobilisation to that substrate via, for example, a streptavidin-biotin link via the BCCP tag present on the proteins of the library.
The Inventors have also determined that the addition of DNA encoding a BCCP tag 5' to and in-frame with genes of interest in a library has the effect of significantly increasing the number of encoded proteins of interest which are expressed from that library compared to a library encoding the same proteins, but lacking the BCCP tag encoding sequence. Such relative expression differences between "tagged" and "untagged" libraries can be detected or measured qualitatively, for example using western blotting techniques as known in the art. Thus, in a seventh aspect, the invention provides the use of a nucleic acid molecule encoding a tag moiety comprising a biotinylation domain for increasing the proportion of clones in a library that express the protein of interest encoded by each of said clones at detectable levels, for example as measured by conventional western blotting, by attachment of said nucleic acid molecule encoding said tag 5' to and in-frame with the gene encoding said protein of interest in each of said clones.
Accordingly in an eighth aspect, the invention provides a method of increasing the proportion of clones in a library that express the protein of interest encoded by each of said clones in a host cell at detectable levels, comprising the steps of:
a) attaching a first nucleic acid molecule encoding a tag moiety comprising a biotinylation domain 5' to and in-frame with a second nucleic acid molecule encoding said protein of interest in a clonal member of said library to form a construct such that the expressed product of the combined first and second nucleic acid molecules comprises said tag moiety located at the N-terminus of said protein of interest;
b) expressing said construct in a host cell.
Preferred features of each aspect of the invention are as defined for each other aspect, mutatis mutandis.
Whilst the tags, methods and libraries of the invention are particularly suited to facilitating parallel expression, purification and immobilisation of proteins encoded by a library of sequences (by a common method of solubilisation and purification of the proteins of interest), the invention can also be applied to other methodologies known in the art. For example, an N-terminal or C-terminal tag according to the invention (for example BCCP) can be used to increase both protein expression and solubility in:
• Vaccine production
• Therapeutic protein production
• Antigen production used for the generation of monoclonal or polyclonal antibodies, monoclonal antibody or single chain antibody production
• Enzyme production
• Drug target discovery by mapping cellular protein-protein interactions ("the interactome")
• Drug target validation by generation of protein drug targets including, but not exclusively, kinases, phosphatases, cell receptors or proteases for screening, enzyme and/or toxicology studies and any other biochemical analysis.
The invention will now be further described by the following non-limiting examples which refer to the accompanying figures in which:
Figure 1 shows the colony western data using streptavidin-HRP conjugate as the probe. The clones expressing in-frame GFP-BCCP that fluoresced green are also biotinylated. The bottom row contains clones that harbour pMSC301 (no bccp gene sequence in the plasmid), and the signal obtained is the background signal of endogenous biotinylated AccB. The second row from the bottom contains the clones harbouring pMSC302 (overexpressing accB). The other negative clones (out-of-frame fusions or re-ligated vector) did not fluoresce green and were not biotinylated.
Figure 2 shows colony western data using streptavidin-HRP conjugate as the probe. The clones expressing in-frame GST-GFP-BCCP that fluoresced green are also biotinylated. Also shown as a biotinylation-positive signal is the protein GST-BCCP. The negative control is clones that harbour pMSC301 (no bccp gene sequence in the plasmid), and the signal obtained is the background signal of endogenous biotinylated AccB. The positive control is the clone harbouring pMSC302 (overexpressing accB). The other negative clones (out-of-frame fusions or re-ligated vector) did not fluoresce green and were not biotinylated.
Figure 3 shows western blot analysis of the protein extract from cells expressing GFP-BCCP. The signal obtained at approximately 37 kDa is the expected Mr of GFP-BCCP. Another signal seen at 18 kDa is that of endogenous biotinylated AccB protein, also seen in the GFP-BCCP negative lanes. As expected, the 18 kDa signal is stronger when no recombinant biotinylated protein is expressed.
Lanes 1, 2 and 3: Protein extract from clones harbouring pGFP-BCCP, expressing intact GFP-BCCP protein.
Lanes 4, 5 and 6: Protein extract from clones harbouring pMSC301A, B, and C respectively, used as negative control in the experiment.
Figure 4 shows western blot analysis of protein extracts from cells expressing GST-GFP-BCCP, and GST-BCCP. Biotinylated proteins of expected Mr. are observed (63 kDa for GST-GFP-BCCP and 37 kDa for GST-BCCP). In all the lanes 18 kDa signal for endogenous AccB is present.
Lanes 1, 2 and 4 are protein extract from cells expressing GST-GFP-BCCP.
Lane 3 is the protein extract from cells expressing GFP-BCCP as a positive control in this experiment.
Lanes 5 and 6: Protein extract from clones harbouring pMSC301A, and B as negative controls in the blot.
Lanes 7 and 8: Protein extracts from cells expressing GST-BCCP.
Figure 5 shows a colony western blot using streptavidin-HRP as the probe for biotinylation of BCCP in the fusion protein. All clones that were marked as fluorescing green when excited at 365 nm wavelength were also biotinylated (positive signal above the background). The intensities of the positive signals vary, as does the green phenotype. The increased sensitivity of detection using streptavidin-HRP conjugate picked up a few additional clones.
Figure 6 shows protein expression results of the human gene set cloned into the Avi-Tag vector pQE82L-GFP-biotin. Single ampicillin-resistant colonies were used to inoculate 1 ml of LB media containing 100 μg/ml ampicillin (LB-Amp) and grown overnight at 37°C with shaking. The next day a 1:100 dilution was made into fresh LB-Amp and cells were grown at 37°C until OD600 = 0.6 to 1.0. IPTG was then added to a final concentration of 1 mM and growth continued at 30°C for 4 hours. 10 μl of cell culture was then taken and analysed by 4-20% SDS-PAGE Western blot and probed with HRP-conjugated streptavidin. Numbers labelled for each lane refer to the B# in Table 1. The molecular weight markers are: aprotinin (7.6 kDa), lysozyme (18.4 kDa), soybean trypsin inhibitor (32.5 kDa), carbonic anhydrase (45.7 kDa), BSA (78 kDa), β-galactosidase (132 kDa) and myosin (216 kDa).
Figure 7 shows protein expression results of the human gene set cloned into the BCCP-expressing vector pMD004. Single ampicillin-resistant colonies were used to inoculate 1 ml of LB media containing 100 μg/ml ampicillin (LB-Amp) and grown overnight at 37°C with shaking. The next day a 1:100 dilution was made into fresh LB-Amp and cells were grown at 37°C until OD600 = 0.6 to 1.0. IPTG was then added to a final concentration of 1 mM and growth continued at 30°C for 4 hours. 10 μl of cell culture was then taken and analysed by 4-20% SDS-PAGE Western blot and probed with HRP-conjugated streptavidin. Numbers labelled for each lane refer to the B# in Tables 1 and 2. The molecular weight markers are: aprotinin (7.6 kDa), lysozyme (18.4 kDa), soybean trypsin inhibitor (32.5 kDa), carbonic anhydrase (45.7 kDa), BSA (78 kDa), β-galactosidase (132 kDa) and myosin (216 kDa).
Figure 8 shows plasmid maps of pMD002 and pMD004.
Figure 9 shows a plasmid map of pIFM101A/B/C.
Figure 10 shows the cloning site of plasmid pIFM101A.
Figure 11 shows the cloning site of plasmid pIFM101B.
Figure 12 shows the cloning site of plasmid pIFM101C.
EXAMPLES
Example 1: Use of BCCP as a Protein Folding Marker
Methods:
1. Isolation of Biotin carboxyl carrier protein (C-terminal domain of acetyl-CoA carboxylase) from E. coli K 12 strain
The DNA sequence encoding the entire coding region of acetyl-CoA carboxylase was amplified by PCR from genomic DNA of XL1-Blue (Stratagene) cells, using the following gene-specific primers.
accbfor1: 5' GATGGATCCGATATTCGTAAGATTAAAAAACTGATCG 3', with a BamHI site at the 5' end.
bccprev1: 5' GATGAGCTCAAGCTTTTACTCGATGACGACCAGCGGCTCGTC 3', containing SacI and HindIII sites.
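A quick way to sanity-check primers of this kind is to confirm that each carries the stated restriction sites and that the reverse primer anneals to the 3' end of the BCCP coding sequence. The Python sketch below is illustrative only; the helper names are hypothetical, and the 30-nucleotide target is simply the end of the BCCP nucleotide sequence listed earlier in the description.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Recognition sequences of the enzymes named in the text.
SITES = {"BamHI": "GGATCC", "SacI": "GAGCTC", "HindIII": "AAGCTT"}

ACCBFOR1 = "GATGGATCCGATATTCGTAAGATTAAAAAACTGATCG"
BCCPREV1 = "GATGAGCTCAAGCTTTTACTCGATGACGACCAGCGGCTCGTC"

# Last 30 nt of the BCCP domain sequence listed in the description.
BCCP_3PRIME = "TTTGACGAGCCGCTGGTCGTCATCGAGTAA"


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def sites_in(primer):
    """Names of the restriction sites (from SITES) present in a primer."""
    return sorted(name for name, site in SITES.items() if site in primer)


def annealing_length(reverse_primer, target_end):
    """Longest 3' portion of the reverse primer that pairs with the end of the target."""
    sense = reverse_complement(reverse_primer)
    best = 0
    for n in range(1, len(sense) + 1):
        if target_end.endswith(sense[:n]):
            best = n
    return best


print(sites_in(ACCBFOR1), sites_in(BCCPREV1))
print(annealing_length(BCCPREV1, BCCP_3PRIME))
```

For the sequences as printed, the forward primer carries the BamHI site, the reverse primer carries the SacI and HindIII sites, and the 27 nucleotides at the 3' end of bccprev1 are complementary to the end of the coding sequence, including the TAA stop codon.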
The PCR amplification was carried out using Pwo polymerase (Roche) using standard cycling conditions (94°C 5 min; 94°C 30 sec; 64°C 1 min; 72°C 1 min; 30 cycles; 72°C 5 min).
The PCR-amplified gene sequence was cloned into the BamHI and SacI sites of the E. coli expression vector pQE-80 (Qiagen) in-frame with the N-terminal hexahistidine tag to form the plasmid pMSC302. The identity of the gene sequence was confirmed by restriction mapping and DNA sequencing. The DNA sequence corresponding to the C-terminal domain of AccB, known as biotin carboxyl carrier protein (BCCP), was amplified by PCR using the same reverse primer as above and a new forward primer.
bccpfor1: 5' GATCTGCAGGGCTCCGCAGCAGCGGAAATCAGTGGTCACATCG 3', containing a PstI site for cloning and two extra codons for glycine and serine.
2. Construction of vectors
The vector pQE-80 was redesigned to delete the DNA sequence for the hexahistidine tag, add additional cloning sites (NotI and SfiI), and provide three different reading frames from the start ATG (pMSC301A/B/C). This was carried out by inverse PCR using the primer sets:
pQErev1: 5'P CATAGTTAATTTCTCCTCTTTAATGAATTCTG 3'
pQEfwd1: 5' GCGGCCGCGGCCATTACGGCCGGATCCGCATGCGAGCTCGGTACCCCC 3'
pQEfwd2: 5' G + pQEfwd1
pQEfwd3: 5' GC + pQEfwd1
for the A, B and C reading frames respectively. The PCR was carried out using Pwo polymerase (94°C 2 min; 94°C 30 sec; 63.5°C 1 min; 72°C 6 min; 25 cycles; 72°C 10 min).
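The way the three forward primers set the reading frame can be illustrated with a short Python sketch. It assumes (this is not stated explicitly in the text) that the inverse-PCR product is circularised by blunt-end ligation, so that the forward primer sequence directly follows the reverse complement of pQErev1, which ends in the start ATG. The function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

PQE_REV1 = "CATAGTTAATTTCTCCTCTTTAATGAATTCTG"
PQE_FWD1 = "GCGGCCGCGGCCATTACGGCCGGATCCGCATGCGAGCTCGGTACCCCC"

# Forward primers for the A, B and C vectors, as defined in the text.
FORWARD_PRIMERS = {"A": PQE_FWD1, "B": "G" + PQE_FWD1, "C": "GC" + PQE_FWD1}

NOTI = "GCGGCCGC"  # NotI recognition sequence


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def junction(vector):
    """Sense-strand sequence across the ligation junction (assumed blunt ligation)."""
    return reverse_complement(PQE_REV1) + FORWARD_PRIMERS[vector]


def noti_frame(vector):
    """Offset (0, 1 or 2) of the NotI site relative to the codon frame set by the ATG."""
    upstream = reverse_complement(PQE_REV1)
    assert upstream.endswith("ATG")  # the start codon closes the upstream arm
    first_base_after_atg = len(upstream)
    site_position = junction(vector).index(NOTI, first_base_after_atg)
    return (site_position - first_base_after_atg) % 3


for vector in "ABC":
    print(vector, noti_frame(vector))
```

Under this assumption the NotI site sits at frame offsets 0, 1 and 2 in the A, B and C vectors respectively, so an insert cloned through the same sites can be read in each of the three frames.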
The beep gene sequence was cloned into the Pstl-HindHL sites of pMSC301 A, B, and C vectors to generate pMSC301A,B,C/BCCP.
The DNA sequence encoding GFPuv (Clontech) was amplified by PCR using the primer set pQEGFPforl: 5' GGGCCGGTGGCAGCGCGAGTAAAGGAG AAGA ACTTTTCACTGG 3' (with Smal half site and a linker region) and pQEGFPrevl: 5' GATCTGCAGGGTACCGGATCCTTTGTAGAGCTCATCCATGCC 3' (with Pstl, Kpn I and Bam HI sites). The PCR amplified product was cloned into the Smal-Pstl sites of pMSC301A, B and C/BCCP in-frame to DNA sequence encoding the N-terminus of BCCP (GFP-BCCP) to generate the vectors pMSC303A, B, and C.
The plasmid construct pMSC303B was restricted with NotI, the staggered ends were made blunt using the filling-in reaction of T4 polymerase (NEB), and the product was restricted with SmaI and religated (plasmid designated pGFP-BCCP).
The vectors pMSC301A/BCCP and pMSC303A were restricted with NotI, the overhangs blunted using T4 DNA polymerase, restricted with SmaI and used to clone the DNA fragment encoding GST, forming the plasmid constructs pGST-BCCP and pGST-GFP-BCCP respectively. The DNA sequence encoding GST was amplified by PCR using the primers GSTfwd01: 5' TCCCCTATACTAGGTTATTGG 3' and GSTrevexoN: 5' GGGCGTCACGATGAATTCCCGGG 3', with pGEX-2T (Pharmacia) as template. The NotI and SfiI cloning sites of the vectors pMSC303 A, B and C were replaced by DraIII, a restriction site compatible with the SfiI overhang, to generate the vectors pIFM101A, B, and C. This was carried out by inverse PCR using the primers DrafwdA: 5' CACTTAGTGGGATCCGCATGCGAGCTCGGTACCCC 3'; DrafwdB: 5' G + DrafwdA; DrafwdC: 5' GA + DrafwdA. The reverse primer used was pQErev1 as described earlier. The PCR conditions were the same as before.
A set of nested deletions recessed at the 3' ends of human heart cDNAs (Clontech) was cloned into the DraIII-SmaI sites of the vectors pIFM101A, B, and C to form the plasmids pX-GFP-BCCP.
The correct DNA sequence of all the constructs used in the study was confirmed by sequencing.
3. Generation of nested deletions (recessed at 3' ends) of human heart cDNAs
The deletion set was generated using the COVET methodology, which is the subject of patent application Nos. GB0020357.0, USSN 60/247995 and WO 01/57198.
In brief, ~100 ng template plasmid library (human heart cDNA library in pDNR-LIB from Clontech) was amplified by PCR using the vector-specific primers SP5forward: 5' ATGCTCATGAGGCCGGCCGGGAATTCGGCCATTACGGCCGG 3' (with FseI and SfiI sites) and SP3reverse: 5' GTCTAGAAAGCTTCTCGAGGGCCG 3' to optimally incorporate alpha-phosphothioate dTTPs (α-S-dTTP; Amersham). The PCR reaction was carried out using 50 pmol of each primer, 2.5 units of thermostable polymerase (lacking a 3' to 5' exonuclease activity, e.g. Taq polymerase), a standard buffer and the deoxynucleotide triphosphate mix: 200 μM dATP, 200 μM dGTP, 200 μM dCTP, 100 μM dTTP, 100 μM α-S-dTTP. The PCR-amplified products were purified using QIAquick PCR cleanup kits (Qiagen) and subjected to FseI digestion to produce a 3' nucleotide overhang which protects the 5' end of the dsDNA from subsequent hydrolysis by exonuclease III (NEB). Exonuclease III digestion was performed using standard conditions, and the presence of phosphothioate internucleotide linkages blocked any further hydrolysis. This generated a nested set of sense strand 3' deletions. Mung bean nuclease (New England Biolabs) was used to remove ssDNA from the antisense strand and therefore blunt the dsDNAs in preparation for directional cloning after further digestion with SfiI. After size fractionation by agarose gel electrophoresis, these inserts were cloned into the DraIII and SmaI sites of the vectors pIFM101A, B and C. The ligated products were then used to transform XL1-Blue cells (Stratagene).
4. Expression of the fusion proteins
The E. coli strains XL1-Blue or XL10-Gold (Stratagene) were used as host cells and were transformed (by electroporation or a chemical method) using the various plasmid constructs. The transformation mixture was plated at an appropriate dilution on a nitrocellulose membrane placed on LB-agar containing 100 μg/ml carbenicillin. After overnight incubation at 30°C the membranes were transferred onto LB-agar containing 400 μM IPTG and carbenicillin and incubated for another 4-5 hours at 30°C. The GFP activity of the clones was assessed by visualizing the clones at 365 nm on a UV transilluminator. The membranes were processed for detecting biotinylated BCCP or GFP. For analysis of the proteins by western blot, the cultures were induced at mid log phase (optical density at 600 nm of 0.5 to 0.6) by adding 400 μM IPTG to the culture, and growth of the cells was continued for another 3-4 hours at 30°C. At the end of the induction period, cells were harvested, and proteins were resolved on a 10-20 % gradient SDS gel (Invitrogen), blotted onto nitrocellulose membrane and probed with various antibodies or streptavidin.
5. Detection of biotinylated BCCP
The biotinylation of BCCP was detected by probing with a streptavidin-horseradish peroxidase (HRP) conjugate (Amersham) on colony blots (as described) or on western blots as known in the art. The clones were either gridded robotically, or the transformation mix was plated, onto a nitrocellulose membrane (Amersham) placed on an LB agar plate containing carbenicillin. After overnight incubation at 30°C, the membrane was placed onto a fresh LB agar plate containing carbenicillin and IPTG (400 μM). The plate was incubated for another 4-5 hours at 30°C. The colonies on the membrane were subjected to alkaline lysis and the membrane was blocked prior to addition of the probe. The membrane is first placed on two sheets of Whatman 3 paper pre-soaked with 0.5 M NaOH, 1.5 M NaCl for 10 min. The membrane is neutralised by placing it on Whatman 3 sheets soaked with 1 M Tris-HCl pH 7.5, 1.5 M NaCl for 5 min, two times. The membrane is then transferred onto Whatman 3 sheets wetted in PBS-T (0.1%) containing 1% SDS for 10 min. The membrane is then washed thoroughly in PBS-T, ensuring that all the cell debris has been dislodged. The blot is then ready to be processed in the same manner as a western blot.
The streptavidin-HRP conjugate was used at a dilution of 1:4000 and the signal was detected by chemiluminescence using the ECL system from Amersham.
6. Detection of GFP activity
The green fluorescence of GFP was visualized by exciting the colonies at 365 nm wavelength using a transilluminator.
7. Detection of GST
An anti-GST monoclonal antibody (Sigma) was used as an immunoprobe to detect expression of GST. The antibody was used at a dilution of 1:3000 and the immunoreactive signal was detected using the ECL system from Amersham.
Results
Absolute correlation of GFP activity and biotinylation of BCCP
Figures 1 and 2 show the colony western data using streptavidin-horseradish peroxidase as the probe. Only the correct in-frame fusions GST-GFP-BCCP, GST-BCCP and GFP-BCCP gave a strong positive signal significantly above the general background from endogenous biotinylated AccB. Out-of-frame fusions resulting from the cloning strategy used did not give rise to positive signals. All and only the biotinylated fusion proteins containing GFP (GST-GFP-BCCP and GFP-BCCP) fluoresced green when excited at 365 nm. The fluorescence is indicative of correct folding of the fusion protein, and this result demonstrated that correctly folded proteins with BCCP as the C-terminal fusion partner are active substrates for biotin protein ligase (BPL). Figures 3 and 4 show that the biotinylated proteins are of the expected molecular weight, confirming that the proteins are intact and unproteolysed.
A more comprehensive study of a group of proteins
Human heart cDNAs were recessed at their 3' ends so as to remove the stop codon of the ORFs using controlled Exonuclease III (NEB) digestion. This 3' nested deletion set was then cloned into the vectors pIFM101A, B and C (see Figures 9 to 12). The resulting fusions to GFP-BCCP in the library will be either in or out of frame. The in-frame fusion proteins, when expressed as correctly folded soluble proteins, fluoresced green under ultraviolet light at 365 nm (GFP is a visual folding marker) and were also biotinylated. Figure 5 shows a colony western blot probed with streptavidin-horseradish peroxidase conjugate. The positive hits (significantly above the background) are the ones that were marked as green when visualized at 365 nm. Only 4 out of 36 were biotinylated but not visibly green. This could be because the detection method used for biotinylation of BCCP is much more sensitive than visual detection of green fluorescence.
In this experiment many of the fusion proteins would be in-frame with GFP-BCCP but would not fluoresce green, as they do not fold properly and are insoluble. The streptavidin-HRP western blot data with a set of complex fusion proteins (Figure 5) show that the BCCP domain of the fusion protein is biotinylated only when the fusion protein is correctly folded and soluble, as assessed by the green fluorescence of GFP. These observations demonstrate that biotinylation of BCCP in the fusion protein is a folding marker, as is the green fluorescence of GFP. Since it is known in the art that GFP is a reliable indicator of correct folding, the results here demonstrate that biotinylation of BCCP is also a reliable indicator of correct folding.
Example 2: Use of BCCP as a Protein Solubility Enhancer
Materials and Methods
Vectors. The pQE82L-GFP-biotin and pMD004 plasmids (Figure 8) were constructed by standard techniques (T. Maniatis et al (1989) Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Press) and both consist of a pQE82L vector (Qiagen) backbone, with an RGS-His tag followed by either the "Avi-Tag" sequence or the BCCP protein domain respectively, followed by a multi-cloning site. They encode the lacI repressor for tight regulation of the T5 promoter, and when cut with SmaI and NotI release either the GFP or the p53 stuffer fragment to give vectors ready for cloning gene inserts with a 5'-phosphorylated blunt end and a 3' NotI sticky end.
Gene Insert Production. Human protein domains were chosen and the corresponding genes were PCR amplified from cDNA libraries. The 5'-phosphorylated forward primers consist of the first 24 bp at the beginning of the relevant sequence, starting with a full codon. Some of the forward primers are longer to incorporate a G or C at the 3' end. The reverse primers consist of the last 24 bp of the relevant sequence (longer if necessary to incorporate a G or C at the 5' end), which is then appended to the beginning of the reverse primer template (TGATAGAAGAGCGGCCGC). The final reverse primer is the reverse complement of this. This primer results in the stop codon of all the fusions being defined and followed by a NotI site for cloning into the N-terminal tagging vector described above. Two cDNA templates were combined at a final concentration of 10 ng/μl. These were a) a human heart cDNA plasmid library (Life Technologies) and b) a HeLa cell cDNA plasmid library (Invitrogen). All primers were reconstituted in distilled water to 100 pmol/μl. A master mix was prepared (without primers) from: template (10 ng), Pwo polymerase buffer with magnesium sulphate (1x final concentration), dNTPs (5 mM final conc.), Pwo polymerase (2.5 units), dimethyl sulfoxide (10% final conc.) and distilled water to a final volume of 48 μl per reaction. The master mix was aliquoted into 96-well PCR plates (Eppendorf) and 1 μl of each primer was added on ice. Conditions were as follows: 94°C for 3 min, then 94°C for 30 sec, 59°C for 30 sec, 72°C for 2 min (32 cycles) and finally 72°C for 7 min. Products were checked on 2% agarose gels/TBE and purified using QIAquick PCR purification columns (Qiagen). Clean dsDNA was digested with NotI in a standard digestion mixture and cleaned again.
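The primer design rules described above can be sketched as follows. This is an illustration under stated assumptions rather than the procedure actually used: the function names are hypothetical, the 24 bp minimum and the G/C end rule come from the text, and the extension is done one base at a time.

```python
# Reverse primer template from the text: TGA and TAG stop codons followed by a NotI site.
REVERSE_TAIL = "TGATAGAAGAGCGGCCGC"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]

def forward_primer(cds, min_len=24):
    """First min_len bases of the coding sequence, extended until the 3' end is G or C."""
    cds = cds.upper()
    n = min_len
    while n < len(cds) and cds[n - 1] not in "GC":
        n += 1
    return cds[:n]

def reverse_primer(cds, min_len=24):
    """Last min_len bases (extended 5' until they start with G or C), joined to the
    reverse primer template, then reverse-complemented."""
    cds = cds.upper()
    n = min_len
    while n < len(cds) and cds[-n] not in "GC":
        n += 1
    return reverse_complement(cds[-n:] + REVERSE_TAIL)
```

After reverse complementation the G or C required at the 5' end of the gene-specific segment ends up at the 3' end of the reverse primer, so both primers finish on a G or C.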
Hoechst 33258 assay. To quantify the dsDNA in preparation for cloning, a low-range standard curve of an unrelated, clean PCR product in 1:1000 Hoechst dye (stock 1 mg/ml)/1x TNE (Tris 10 mM, EDTA 1 mM, NaCl 0.2 M, pH 7.4) was set up at 80, 40, 20, 10, 5, 2.5, 1.25 and 0 ng/100 μl. 1 μl of each experimental PCR product was added to 99 μl of 1:1000 Hoechst/TNE, mixed in clear-bottomed, black-sided 96-well microtiter plates (Corning) and the fluorescence read at 365/465 nm. The standard curve was plotted and the dsDNA content of each 'insert preparation' calculated as ng/μl.
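The standard-curve calculation can be sketched as a straight-line fit followed by interpolation. The text does not state how the curve was fitted, so the least-squares line below is an assumption, as are the function names; the standard amounts and the 1 μl sample volume are taken from the text.

```python
# dsDNA standards from the text, in ng per 100 ul well.
STANDARDS_NG = [80, 40, 20, 10, 5, 2.5, 1.25, 0]

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def dsdna_ng_per_ul(reading, slope, intercept, sample_ul=1.0):
    """Convert a fluorescence reading to ng/ul of the original sample.

    The well holds sample_ul of PCR product, so ng in the well divided by
    sample_ul gives the concentration of the undiluted preparation.
    """
    ng_in_well = (reading - intercept) / slope
    return ng_in_well / sample_ul
```

Readings above the top standard (80 ng per well) fall outside the fitted range and would normally be repeated at a higher dilution.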
Cloning the inserts into pQE82L-GFP-biotin or pMD004. Inserts were ligated to the vector prep at an approximate molar ratio of 3:1 (insert:vector). Approximately 0.02 pmol of vector was used for each ligation. Ligations were carried out in a 96-well PCR plate with the rapid DNA ligation kit (Roche). The ligations (2 μl of each) were used to transform 30 μl of XL1-Gold Supercompetent cells (Stratagene), according to the protocol, in a thin-wall 96-well PCR plate. After heat shock, the transformations were added to 300 μl of pre-warmed SOC medium in a 96-well deep-well block and shaken at 37°C for 45 minutes. 200 μl of each was plated and incubated at 37°C overnight. Ampicillin-resistant clones were analysed by colony PCR to check for correct insert size, and positive clones were taken forward for expression screening.
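As a worked example of the 3:1 molar ratio, the mass of insert needed per ligation can be estimated from its length. The sketch assumes an average mass of 660 g/mol per base pair of dsDNA, a common approximation that is not stated in the text; the function names are illustrative.

```python
AVG_BP_MASS = 660.0  # g/mol per base pair of dsDNA (assumed average)

def pmol_to_ng(pmol, length_bp):
    """Mass in ng of a given number of pmol of dsDNA of length_bp base pairs."""
    return pmol * length_bp * AVG_BP_MASS / 1000.0

def insert_ng_for_ligation(insert_bp, vector_pmol=0.02, ratio=3.0):
    """ng of insert giving the stated insert:vector molar ratio."""
    return pmol_to_ng(vector_pmol * ratio, insert_bp)

if __name__ == "__main__":
    # A 408 bp insert at 3:1 over 0.02 pmol vector is 0.06 pmol, roughly 16 ng.
    print(round(insert_ng_for_ligation(408), 1))
```

Dividing this mass by the ng/μl value from the Hoechst assay gives the volume of insert preparation to add.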
Protein Expression. Single ampicillin-resistant colonies were used to inoculate 1 ml of LB medium containing 100 μg/ml ampicillin (LB-Amp) and grown overnight at 37°C with shaking. The next day a 1:100 dilution was made into fresh LB-Amp and the cells were grown at 37°C until OD600 = 0.6 to 1.0. IPTG was then added to a final concentration of 1 mM and growth continued at 30°C for 4 hours. 10 μl of cell culture was then taken, analysed by 4-20% SDS-PAGE and western blot as described, and probed with HRP-conjugated streptavidin.
Results and Discussion
To prove that the BCCP domain can aid protein folding, a defined set of 49 human proteins was cloned into the SmaI/NotI sites of two different vectors: pQE82L-GFP-biotin or pMD004 (Figure 8). Expression from these constructs resulted in proteins being expressed either with a short (19 aa) N-terminal peptide tag (consisting of a hexa-histidine sequence followed by the "Avi-Tag" sequence (www.avidity.com; US Patent 5,932,433)) for pQE82L-GFP-biotin, or as fusions to the C-terminus of the E. coli BCCP protein (pMD004). A significantly higher success rate for the production of soluble protein was observed when the proteins were expressed as fusions with the BCCP protein (see Figures 6 and 7), as summarized in Table 1. For example, when fused to the BCCP domain, 98 % of proteins were expressed solubly, whereas in the absence of the BCCP domain only 48 % of clones gave observable expression, of which 81 % were soluble. The observation that a greater overall number of clones expressed from the pMD004 vector than from pQE82L-GFP-biotin is unlikely to be explained by the "N-end rule", where the amino acids at the N-terminus can be crucial in determining targeting to the proteasome for degradation (Rao H, Uhlmann F, Nasmyth K, Varshavsky A. (2001) Nature, 410, 955-9), since in both constructs the N-terminal 12 amino acids are identical. A more likely explanation is that the N-terminal BCCP domain aids folding of the downstream proteins, preventing the targeting of mis-folded proteins to the proteasome. This is also supported by the observation that more proteins were expressed in a soluble manner when expressed downstream of BCCP than when expressed from the pQE82L-GFP-biotin vector. The mechanism by which BCCP aids the folding of downstream protein domains could be either recruitment of chaperones or an increase in the overall solubility of the fusion protein.
The results presented here strongly indicate that the BCCP domain can increase the overall number of clones expressing soluble protein when expressed as an N-terminal fusion to the target protein. In addition, the results indicate that the BCCP domain can increase the solubility of a protein of interest. The tight correlation observed between biotinylation and solubility of expressed fusions demonstrates that biotinylation of BCCP acts as a folding marker when fused to the N-terminus of a protein of interest. In addition, the ability of the BCCP protein to be biotinylated provides a highly specific means to capture the protein on a streptavidin surface.
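The comparison between the two vectors amounts to tallying the expression codes used in Table 1 (C, H, B, S). A minimal sketch of such a tally is given below; the function names and the example list are hypothetical, and treating a clone as "expressed" when it carries an H or B code is an assumption about how the percentages were derived.

```python
def parse_codes(code):
    """Turn an entry such as 'C.H.B.S.' into the set {'C', 'H', 'B', 'S'}."""
    return {ch for ch in code.upper() if ch in "CHBS"}

def summarize(codes):
    """Count cloned, expressed (H or B positive) and soluble (S) entries."""
    parsed = [parse_codes(c) for c in codes]
    return {
        "cloned": sum(1 for p in parsed if "C" in p),
        "expressed": sum(1 for p in parsed if p & {"H", "B"}),
        "soluble": sum(1 for p in parsed if "S" in p),
    }

def overall_soluble_fraction(expressed_fraction, soluble_given_expressed):
    """Fraction of all clones that give soluble protein."""
    return expressed_fraction * soluble_given_expressed

if __name__ == "__main__":
    example = ["C.", "C.H.B.S.", "C.H.", "C.H.B."]  # hypothetical entries
    print(summarize(example))
    # 48 % expressing, of which 81 % soluble, is about 39 % of all clones.
    print(round(overall_soluble_fraction(0.48, 0.81), 2))
```

On this reading, roughly 39 % of clones gave soluble protein without BCCP, against the 98 % quoted for the BCCP fusions.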
Table 1. Protein Expression Summary. Proteins were chosen and the corresponding gene inserts were cloned into pQE-GFP-biotin (vector 1) or the BCCP vector pMD004 (vector 2), resulting in fusions to the C-terminus of either a hexa-histidine-Avi-Tag peptide or a hexa-histidine-BCCP protein. Only inserts cloned into both vectors are compared in terms of protein expression. Key to table: 1. Internal coding number. 2. Protein database accession number (www.oca.ebi.ac.uk). 3. DNA gene length in base pairs. 4. Protein size when expressed as a fusion with BCCP, in amino acids (aa). 5. Protein size when expressed as a fusion with BCCP, in kilodaltons (kDa). 6. Region of ORF cloned (aa). C - cloned but no expression; H - expressing hexahistidine-positive protein in an SDS-PAGE western blot; B - expressing biotin-positive protein in an SDS-PAGE western blot; S - expressing soluble protein.
Gene | No. (1) | PDB (2) | Insert length, bp (3) | Fusion, aa (4) | Fusion, kDa (5) | Part cloned (6) | Expression, vector 1 | Expression, vector 2
Ac. Fib. Gr. Factor | 1 | 2AXM | 408 | 241 | 31.3 | 1-136/136 orf | C.H.B.S. | C.H.B.S.
Alc. Dehyd. | 2 | 1DEH | 1143 | 486 | 63.2 | 1-370/374 orf | C. | C.H.B.S.
Ad. Kinase | 3 | 1BX4 | 1044 | 453 | 58.9 | 22-362/362 orf | C.H.B.S. | C.H.B.S.
Ald. Red | 4 | 1AZ1 | 960 | 425 | 55.3 | 2-315/315 orf | C.H.B.S. | C.H.B.S.
Bar-to-Autoint. | 5 | 2EZZ | 285 | 200 | 26.0 | 1-89/89 orf | C.H.B.S. | C.H.B.S.
Bleo. Hyd. | 6 | 1CB5 | 1380 | 565 | 73.5 | 1-454/455 orf | C.H.B.S. | C.H.B.S.
Bone Morph. P2 | 7 | 3BMP | 198 | 171 | 22.2 | 291- (remainder in source image imgf000025_0001) | C.H.B.S. | C.H.B.S.
Carb. Anhyd. II | 9 | 1A42 | 798 | 371 | 48.2 | 371/371 orf | C. | C.H.B.S.
Cyclin-dep Kin 2 | 11 | 1F5Q | 912 | 409 | 53.2 | 1-298/298 orf | C.H.B.S. | C.H.B.S.
C-Raf1 | 12 | 1GUA | 246 | 187 | 24.3 | 56-131/648 orf | C.H. | C.H.B.S.
3-Met. DNA Glyc. | 14 | 1BNK | 663 | 326 | 42.4 | 80-294/298 orf | C. | C.H.B.S.
DNA Pase β | 15 | 1BPX | 1010 | 442 | 57.4 | 4-334/334 orf | C.H. | C.H.B.S.
Gr. F. Rec-bid. P2 | 17 | 1CJ1 | 306 | 207 | 26.9 | 57-152/217 orf | C.H.B.S. | C.H.B.S.
Hck Kinase | 19 | 3HCK | 336 | 217 | 28.2 | 140-245/526 orf | C.H.B.S. | C.H.B.S.
C-Jun Proto-Onc | 20 | 1F0SJ | 189 | 168 | 21.8 | 255-322/340 orf | C.H.B.S. | C.H.B.S.
Urac-DNA Glyc. | 21 | 4SKN | 678 | 331 | 43.0 | 85-304/304 orf | C. | C.
Quin. Red. | 22 | 2QR2 | 711 | 342 | 44.5 | 1-230/230 orf | C. | C.H.B.S.
GSTP1 | 23 | 9GSS | 652 | 322 | 41.9 | 1-209/209 orf | C. | C.H.B.
Orn. Aminotr. | 25 | 2CAN | 1224 | 513 | 66.7 | 238-439/439 orf | C.H.B.S. | C.H.B.S.
Angiogenin | 26 | 1AWZ | 369 | 228 | 29.6 | 25-147/147 orf | C. | C.H.B.S.
Prot. Disulf. Isom. | 28 | 1MEK | 378 | 231 | 30.0 | 18-137/508 orf | C. | C.H.B.S.
Glyc-Inh. Factor | 29 | 1GIF | 363 | 226 | 29.4 | 1-114/114 orf | C.H.B.S. | C.H.B.S.
Fk506-Bind. Prot | 30 | 1NSG | 325 | 213 | 27.7 | 1-107/107 orf | C. | C.H.B.S.
Annexin I | 34 | 1B09 | 237 | 184 | 23.9 | 40-112/345 orf | C.H. | C.H.B.S.
Cyclophillin A | 36 | 1BCK | 495 | 270 | 35.1 | 1-164/164 orf | C.H.B.S. | C.H.B.S.
Ser.-Thr. Phos. B-B | 41 | 1AU1B | 507 | 274 | 35.6 | 2-170/170 orf | C. | C.H.B.S.
Transcr. Factor iib | 42 | 1TFB | 633 | 316 | 41.1 | 112-316/316 orf | C. | C.H.B.S.
S-Admeth. Decarb. | 47 | 1JEN | 800 | 372 | 48.3 | 69-329/334 orf | C. | C.H.B.S.
Procathepsin B | 49 | 3PBH | 948 | 421 | 54.7 | 19-333/339 orf | C. | C.H.B.S.
Rhoa | 51 | 1CXZ | 561 | 292 | 38.0 | 1-181/193 orf | C. | C.H.B.S.
Acid Phosphatase 1A | 51A | P24666 | 471 | 257 | 28.0 | 1-157/157 orf | C. | C.H.B.S.
Pax-6 | 53 | 6PAX | 417 | 244 | 31.7 | 4-136/422 orf | C. | C.H.B.S.
Phostyr. Phoslip | 55 | 5PNT | 492 | 269 | 35.0 | 1-157/157 orf | C.H.B.S. | C.H.B.S.
Thyroid Hormone BP | 57A | Q14894 | 942 | 314 | 45 | 1-314/314 orf | C. | C.H.B.S.
Hsp86 | 58A | - | 684 | 333 | 43.3 | 8-235/731 orf | C.H.B.S. | C.H.B.S.
Hsp40 | 59 | 1HDJ | 231 | 182 | 23.7 | 1-76/340 orf | C. | C.H.B.S.
NFκB52 | 61 | 1A3Q | 891 | 402 | 52.3 | 37- (remainder in source image imgf000026_0001) | C.H.B.S. | C.H.B.S.
Fruc-Bisph. Ald.* | 64 | 1D0S | 1095 | 470 | 61.1 | 1-358/358 orf | C.H.B.S. | C.H.B.S.
Fadd | 65 | 1E3Y | 312 | 209 | 27.2 | 93-192/208 orf | C. | C.H.B.S.
Transcr. Factor Max | 66 | 1HLO | 285 | 200 | 26.0 | 4-92/160 orf | C. | C.H.B.S.
IL-6 | 67 | 2IL6 | 515 | 276.7 | 36.0 | 47- (remainder in source image imgf000026_0002) | C.H. | C.H.B.S.
Hyp.-Guan. Phribtr. | 71 | 1 NST | 660 | 325 | 42.3 | 4-217/217 orf | C. | C.H.B.S.
Glyoxalase II | 78 | 1QH5 | 198 | 371 | 48.2 | 1-260/260 orf | C. | C.H.B.
Srebp-1a | 80 | 1AM9 | 258 | 191 | 24.8 | 319-398/1147 orf | C. | C.H.B.S.
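For most rows of Table 1 the fusion sizes appear consistent with a simple rule: the insert length divided by three, plus about 105 residues for the tag portion, at roughly 0.13 kDa per residue. Both constants are inferred from the tabulated values and are not stated in the text, and a few rows (for example Thyroid Hormone BP and Glyoxalase II) do not follow the rule. A sketch under those assumptions:

```python
TAG_RESIDUES = 105       # inferred from Table 1, not stated in the text
KDA_PER_RESIDUE = 0.13   # inferred from Table 1, not stated in the text

def fusion_size(insert_bp):
    """Estimated fusion size as (amino acids, kDa) for an insert of insert_bp base pairs."""
    residues = insert_bp / 3 + TAG_RESIDUES
    return round(residues), round(residues * KDA_PER_RESIDUE, 1)

if __name__ == "__main__":
    for name, bp in [("Ac. Fib. Gr. Factor", 408), ("Alc. Dehyd.", 1143)]:
        print(name, fusion_size(bp))
```

A check of this kind can help flag rows where the tabulated numbers were damaged in transcription.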

Claims

1. Use of a tag moiety comprising a biotinylation domain for increasing the solubility of a protein of interest by attachment of said tag moiety to the N-terminus or C-terminus of said protein of interest.
2. Use of a tag moiety comprising a biotinylation domain for determining the folded state of a protein of interest by attachment of said tag moiety to the N-terminus or C-terminus of said protein of interest.
3. Use of a nucleic acid molecule encoding a tag moiety comprising a biotinylation domain for increasing the proportion of clones in a library that express the protein of interest encoded by each of said clones at detectable levels by attachment of said nucleic acid molecule encoding said tag 5' to and in-frame with the gene encoding said protein of interest in each of said clones.
4. A method of increasing the solubility of a protein of interest when expressed in a host cell comprising the steps of: a) attaching a first nucleic acid molecule encoding a tag moiety comprising a biotinylation domain to a second nucleic acid molecule encoding said protein of interest to form a construct such that the tag moiety in the expressed product of the combined first and second nucleic acid molecules is located at the N-terminus or C-terminus of said protein of interest; b) expressing said construct in a host cell.
5. A method of determining the folded state of a protein of interest comprising the steps of: a) attaching a first nucleic acid molecule encoding a tag moiety comprising a biotinylation domain to a second nucleic acid molecule encoding said protein of interest to form a construct such that the tag moiety in the expressed product of the combined first and second nucleic acid molecules is located at the N-terminus or C-terminus of said protein of interest; b) expressing said construct in a host cell under conditions such that only a correctly folded biotinylation domain present in said tag moiety is ligated with biotin; c) determining the folded state of the protein of interest comprising said tag moiety by the presence or absence of a biotin group in the protein expressed from said construct.
6. A method of increasing the proportion of clones in a library that express the protein of interest encoded by each of said clones in a host cell at detectable levels, comprising the steps of: a) attaching a first nucleic acid molecule encoding a tag moiety comprising a biotinylation domain 5' to and in-frame with a second nucleic acid molecule encoding said protein of interest in a clonal member of said library to form a construct such that the tag moiety in the expressed product of the combined first and second nucleic acid molecules comprises said tag moiety located at the N-terminus of said protein of interest b) expressing said construct in a host cell
7. The use as claimed in claim 1 or claim 2, or the method as claimed in claim 4 or claim 5, wherein said protein of interest is encoded by a nucleic acid molecule which forms part of a library comprising two or more different coding sequences.
8. The use or method as claimed in claim 7 wherein said different coding sequences are modified to contain said tag moiety and expressed in parallel.
9. A library of nucleic acid molecules encoding proteins of interest wherein each coding sequence is modified to incorporate a tag moiety comprising a biotinylation domain at the N-terminus or C-terminus of the encoded protein.
10. A library of soluble folded proteins expressed from the library of claim 9.
11. The use, method or libraries of any of claims 1 to 10, wherein said biotinylation domain is E. coli BCCP.
12. A protein produced by the method of any one of claims 3 to 8 or 11.
PCT/GB2003/000362 2002-01-29 2003-01-29 Protein tag comprising a biotinylation domain and method for increasing solubility and determining folding state WO2003064656A1 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
US10/502,581 US8999897B2 (en) 2002-01-29 2003-01-29 Protein tag comprising a biotinylation domain and method for increasing solubility and determining folding state
EP03734757A EP1470229B1 (en) 2002-01-29 2003-01-29 Protein tag comprising a biotinylation domain and method for increasing solubility and determining folding state
AU2003238441A AU2003238441B2 (en) 2002-01-29 2003-01-29 Protein tag comprising a biotinylation domain and method for increasing solubility and determining folding state
CA2474457A CA2474457C (en) 2002-01-29 2003-01-29 Protein tag comprising a biotinylation domain and method for increasing solubility and determining folding state
JP2003564248A JP4377242B2 (en) 2002-01-29 2003-01-29 Protein tag comprising a biotinylated domain, method for increasing solubility and method for determining folding state
DE60305643T DE60305643T2 (en) 2002-01-29 2003-01-29 PROTEIN MARKER OF A BIOTINYLATION DOMAIN CONTAINS AND USES TO INCREASE THE SOLUBILITY AND DETERMINE THE FOLDING STATUS

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB0202018.8 2002-01-29
GBGB0202018.8A GB0202018D0 (en) 2002-01-29 2002-01-29 Tag and method

Publications (1)

Publication Number Publication Date
WO2003064656A1 true WO2003064656A1 (en) 2003-08-07

Family

ID=9929956

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2003/000362 WO2003064656A1 (en) 2002-01-29 2003-01-29 Protein tag comprising a biotinylation domain and method for increasing solubility and determining folding state

Country Status (11)

Country Link
US (1) US8999897B2 (en)
EP (1) EP1470229B1 (en)
JP (1) JP4377242B2 (en)
AT (1) ATE328092T1 (en)
AU (1) AU2003238441B2 (en)
CA (1) CA2474457C (en)
DE (1) DE60305643T2 (en)
DK (1) DK1470229T3 (en)
ES (1) ES2268376T3 (en)
GB (1) GB0202018D0 (en)
WO (1) WO2003064656A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006024875A2 (en) * 2004-09-03 2006-03-09 European Molecular Biology Laboratory Method for determinig protein solubility
GB2442048A (en) * 2006-07-25 2008-03-26 Proimmune Ltd Biotinylated MHC complexes
WO2011114139A1 (en) 2010-03-15 2011-09-22 Sense Proteomic Limited Auto-antigen biomarkers for prostate cancer
EP2375252A1 (en) 2008-06-11 2011-10-12 Sense Proteomic Limited Biomarkers for lupus
WO2012049664A2 (en) 2010-10-15 2012-04-19 Sense Proteomic Limited Auto-antigen biomarkers for lupus
DE102010056289A1 (en) 2010-12-24 2012-06-28 Geneart Ag Process for the preparation of reading frame correct fragment libraries
US8999897B2 (en) 2002-01-29 2015-04-07 Sense Proteomic Limited Protein tag comprising a biotinylation domain and method for increasing solubility and determining folding state
WO2022108522A1 (en) * 2020-11-18 2022-05-27 Sengenics Corporation Pte Ltd Biomarkers for predicting immunogenicity and therapeutic responses to adalimumab in rheumatoid arthritis patients

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7094568B2 (en) * 2000-08-17 2006-08-22 Sense Proteomic Ltd. Method for producing proteins tagged at the N- or C-terminus
ATE361471T1 (en) 2001-12-05 2007-05-15 Sense Proteomic Ltd PROTEIN ARRAYS FOR ALLEL VARIANTS AND THEIR USE
GB0205910D0 (en) * 2002-03-13 2002-04-24 Sense Proteomic Ltd Arrays and methods
US20030228709A1 (en) * 2002-03-25 2003-12-11 Kozlowski Roland Zbignieiw Arrays
JP5754135B2 (en) 2007-03-26 2015-07-29 アジェナス インコーポレイテッド Cell surface display, screening, and production of proteins of interest
WO2014014206A1 (en) 2012-07-20 2014-01-23 University-Industry Cooperation Group Of Kyung Hee University Novel peptide tag and uses thereof
US9758571B2 (en) 2012-07-20 2017-09-12 University—Industry Cooperation Group Of Kyung Hee University Antibody for epitope tagging, hybridoma cell line and uses thereof
US11446398B2 (en) 2016-04-11 2022-09-20 Obsidian Therapeutics, Inc. Regulated biocircuit systems
WO2019241315A1 (en) 2018-06-12 2019-12-19 Obsidian Therapeutics, Inc. Pde5 derived regulatory constructs and methods of use in immunotherapy
US20210386788A1 (en) 2018-10-24 2021-12-16 Obsidian Therapeutics, Inc. Er tunable protein regulation
WO2020185632A1 (en) 2019-03-08 2020-09-17 Obsidian Therapeutics, Inc. Human carbonic anhydrase 2 compositions and methods for tunable regulation
WO2020252404A1 (en) 2019-06-12 2020-12-17 Obsidian Therapeutics, Inc. Ca2 compositions and methods for tunable regulation
EP3983538A1 (en) 2019-06-12 2022-04-20 Obsidian Therapeutics, Inc. Ca2 compositions and methods for tunable regulation
US20220348937A1 (en) 2019-09-06 2022-11-03 Obsidian Therapeutics, Inc. Compositions and methods for dhfr tunable protein regulation

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1990014431A1 (en) * 1989-05-19 1990-11-29 Biotechnology Research And Development Corporation Fusion proteins having an in vivo post-translational modification site and methods of manufacture and purification
EP0511747A1 (en) * 1991-04-19 1992-11-04 Rohm And Haas Company Hybrid polypeptide containing an avidin binding polypeptide
WO1995025172A1 (en) * 1994-03-17 1995-09-21 Universite Louis Pasteur Recombinant antibody fragments which are synthesized and biotinylated in e. coli, their use in immunoassays and immunopurification techniques

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5252466A (en) * 1989-05-19 1993-10-12 Biotechnology Research And Development Corporation Fusion proteins having a site for in vivo post-translation modification and methods of making and purifying them
US5801233A (en) * 1992-10-02 1998-09-01 Arch Development Corporation Nucleic acid compositions encoding acetyl-coa carboxylase and uses therefor
AU7516694A (en) * 1993-07-30 1995-02-28 Affymax Technologies N.V. Biotinylation of proteins
JP3466765B2 (en) * 1994-07-27 2003-11-17 キッコーマン株式会社 Biotinylated firefly luciferase, biotinylated firefly luciferase gene, novel recombinant DNA, method for producing biotinylated firefly luciferase and bioluminescence analysis method
JP2003512057A (en) 1999-10-19 2003-04-02 ルードヴィッヒ インスティテュート フォー キャンサー リサーチ MAGE-A12 antigen peptide and use thereof
US7816098B2 (en) 2000-01-31 2010-10-19 Sense Proteomic Limited Methods of making and using a protein array
AU3037501A (en) 2000-01-31 2001-08-14 Sense Proteomic Limited Methods
US7148058B2 (en) * 2000-06-05 2006-12-12 Chiron Corporation Protein microarrays on mirrored surfaces for performing proteomic analyses
US7094568B2 (en) 2000-08-17 2006-08-22 Sense Proteomic Ltd. Method for producing proteins tagged at the N- or C-terminus
IL154486A0 (en) 2000-08-17 2003-09-17 Sense Proteomic Ltd Method
EP1414974A2 (en) 2000-12-26 2004-05-06 Applied Molecular Evolution, Inc. Butyrylcholinesterase polypeptide variants with increased catalytic efficiency and methods of use
US7871767B2 (en) 2001-06-01 2011-01-18 Pgxhealth, Llc Polymorphisms in the human gene for cytochrome P450 polypeptide 2C8 and their use in diagnostic applications
ATE361471T1 (en) 2001-12-05 2007-05-15 Sense Proteomic Ltd PROTEIN ARRAYS FOR ALLEL VARIANTS AND THEIR USE
GB0202018D0 (en) 2002-01-29 2002-03-13 Sense Proteomic Ltd Tag and method
GB0205910D0 (en) 2002-03-13 2002-04-24 Sense Proteomic Ltd Arrays and methods
US20030228709A1 (en) 2002-03-25 2003-12-11 Kozlowski Roland Zbignieiw Arrays

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
GERMINO F J ET AL: "SCREENING FOR IN VIVO PROTEIN-PROTEIN INTERACTIONS", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF USA, NATIONAL ACADEMY OF SCIENCE. WASHINGTON, US, vol. 90, February 1993 (1993-02-01), pages 933 - 937, XP002095757, ISSN: 0027-8424 *
JAEGER W ET AL: "CORYNEBACTERIUM GLUTAMICUM GENE ENCODING A TWO-DOMAIN PROTEIN SIMILAR TO BIOTIN CARBOXYLASES AND BIOTIN-CARBOXYL-CARRIER PROTEINS", ARCHIVES OF MICROBIOLOGY, BERLIN, DE, vol. 166, no. 2, August 1996 (1996-08-01), pages 76 - 82, XP000946309, ISSN: 0302-8933 *
MURTIF V L ET AL: "MUTAGENESIS AFFECTING THE CARBOXYL TERMINUS OF THE BIOTINYL SUBUNIT OF TRANSCARBOXYLASE EFFECTS ON BIOTINATION", JOURNAL OF BIOLOGICAL CHEMISTRY, vol. 262, no. 24, 1987, pages 11813 - 11816, XP002244696, ISSN: 0021-9258 *
ZHEN X W ET AL: "Vectors for a 'double-tagging' assay for protein-protein interactions: localization of the CDK2-binding domain of human p21", GENE, ELSEVIER BIOMEDICAL PRESS. AMSTERDAM, NL, vol. 173, no. 2, 16 September 1996 (1996-09-16), pages 147 - 154, XP000953324, ISSN: 0378-1119 *

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8999897B2 (en) 2002-01-29 2015-04-07 Sense Proteomic Limited Protein tag comprising a biotinylation domain and method for increasing solubility and determining folding state
EP2402760A1 (en) 2004-09-03 2012-01-04 European Molecular Biology Laboratory Method for determining protein solubility
WO2006024875A3 (en) * 2004-09-03 2006-06-08 European Molecular Biology Lab Embl Method for determining protein solubility
JP2008511310A (en) * 2004-09-03 2008-04-17 ユーロピアン モレキュラー バイオロジー ラボラトリー Methods for determining protein solubility
WO2006024875A2 (en) * 2004-09-03 2006-03-09 European Molecular Biology Laboratory Method for determining protein solubility
AU2005279000B2 (en) * 2004-09-03 2011-08-04 European Molecular Biology Laboratory Method for determining protein solubility
US8754012B2 (en) * 2004-09-03 2014-06-17 European Molecular Biology Laboratory Method for determining protein solubility
CN101040188B (en) * 2004-09-03 2012-06-20 European Molecular Biology Laboratory Method for determining protein solubility
GB2442048B (en) * 2006-07-25 2009-09-30 Proimmune Ltd Biotinylated MHC complexes and their uses
GB2442048A (en) * 2006-07-25 2008-03-26 Proimmune Ltd Biotinylated MHC complexes
EP2375252A1 (en) 2008-06-11 2011-10-12 Sense Proteomic Limited Biomarkers for lupus
WO2011114139A1 (en) 2010-03-15 2011-09-22 Sense Proteomic Limited Auto-antigen biomarkers for prostate cancer
WO2012049664A2 (en) 2010-10-15 2012-04-19 Sense Proteomic Limited Auto-antigen biomarkers for lupus
DE102010056289A1 (en) 2010-12-24 2012-06-28 Geneart Ag Process for the preparation of reading frame correct fragment libraries
WO2012084923A1 (en) 2010-12-24 2012-06-28 Geneart Ag Method for producing reading-frame-corrected fragment libraries
WO2022108522A1 (en) * 2020-11-18 2022-05-27 Sengenics Corporation Pte Ltd Biomarkers for predicting immunogenicity and therapeutic responses to adalimumab in rheumatoid arthritis patients

Also Published As

Publication number Publication date
GB0202018D0 (en) 2002-03-13
US20050221308A1 (en) 2005-10-06
DE60305643D1 (en) 2006-07-06
CA2474457A1 (en) 2003-08-07
AU2003238441B2 (en) 2008-10-30
EP1470229A1 (en) 2004-10-27
ATE328092T1 (en) 2006-06-15
JP4377242B2 (en) 2009-12-02
JP2005516074A (en) 2005-06-02
DK1470229T3 (en) 2006-10-02
EP1470229B1 (en) 2006-05-31
US8999897B2 (en) 2015-04-07
CA2474457C (en) 2014-04-15
ES2268376T3 (en) 2007-03-16
DE60305643T2 (en) 2007-05-03

Similar Documents

Publication Publication Date Title
CA2474457C (en) Protein tag comprising a biotinylation domain and method for increasing solubility and determining folding state
AU2003238441A1 (en) Protein tag comprising a biotinylation domain and method for increasing solubility and determining folding state
US7820384B2 (en) Methods and compositions for protein expression and purification
EP0711303B1 (en) Biotinylation of proteins
JP2005516074A6 (en) Protein tag comprising a biotinylated domain, method for increasing solubility and method for determining folding state
EP1392717B1 (en) Rapidly cleavable sumo fusion protein expression system for difficult to express proteins
US7655413B2 (en) Methods and compositions for enhanced protein expression and purification
CN104053779A (en) Split inteins and uses thereof
US7220576B2 (en) Methods and compositions for protein expression and purification
US7790420B2 (en) Method for determining protein solubility
US20100035300A1 (en) Producing a Target Protein Using Intramolecular Cleavage by TEV Protease
RU2807615C2 (en) Method of obtaining biologically active recombinant proteins
JP4487036B2 (en) New vectors and their use
EP2684952A1 (en) Azoline compound and azole compound library and method for producing same
US6632638B1 (en) Enhanced solubility of recombinant proteins using Uracil DNA glycosylase inhibitor

Legal Events

Date Code Title Description
AK Designated states Kind code of ref document: A1; Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW
AL Designated countries for regional patents Kind code of ref document: A1; Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG
121 EP: the EPO has been informed by WIPO that EP was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101)
WWE WIPO information: entry into national phase Ref document number: 2474457; Country of ref document: CA
WWE WIPO information: entry into national phase Ref document number: 2003564248; Country of ref document: JP; Ref document number: 2003238441; Country of ref document: AU
WWE WIPO information: entry into national phase Ref document number: 2003734757; Country of ref document: EP
WWP WIPO information: published in national office Ref document number: 2003734757; Country of ref document: EP
WWE WIPO information: entry into national phase Ref document number: 10502581; Country of ref document: US
WWG WIPO information: grant in national office Ref document number: 2003734757; Country of ref document: EP