AU2003287673A1 - Method for analysing amino acids, peptides and proteins using mass spectroscopy of fixed charge-modified derivatives - Google Patents

Method for analysing amino acids, peptides and proteins using mass spectroscopy of fixed charge-modified derivatives Download PDF

Info

Publication number
AU2003287673A1
AU2003287673A1 AU2003287673A AU2003287673A AU2003287673A1 AU 2003287673 A1 AU2003287673 A1 AU 2003287673A1 AU 2003287673 A AU2003287673 A AU 2003287673A AU 2003287673 A AU2003287673 A AU 2003287673A AU 2003287673 A1 AU2003287673 A1 AU 2003287673A1
Authority
AU
Australia
Prior art keywords
ion
amino acid
charge
peptide
mass
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU2003287673A
Inventor
Richard O'hair
Gavin Reid
Richard Simpson
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ludwig Institute for Cancer Research Ltd
Original Assignee
Ludwig Institute for Cancer Research Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AU2002952747A external-priority patent/AU2002952747A0/en
Application filed by Ludwig Institute for Cancer Research Ltd filed Critical Ludwig Institute for Cancer Research Ltd
Priority to AU2003287673A priority Critical patent/AU2003287673A1/en
Publication of AU2003287673A1 publication Critical patent/AU2003287673A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • G01N33/6842Proteomic analysis of subsets of protein mixtures with reduced complexity, e.g. membrane proteins, phosphoproteins, organelle proteins

Description

WO 2004/046731 PCT/US2003/036739 METHOD FOR ANALYSING AMINO ACIDS, PEPTIDES AND PROTEINS Field of the Invention The present invention is concerned with a method for the analysis of amino acids, 5 peptides or proteins. Background of the Invention With completion of the first draft of the human genome sequence, the challenge now facing researchers is to understand gene function. However, the biological function of a gene 10 cannot be determined from a simple examination of its DNA sequence. Comprehensive analysis of the proteins expressed by the genome, therefore, promises to bridge the gap between the gene and its biological function. The term proteomics has become synonymous with (i) the identification and characterization of all proteins synthesized by a particular cell type or tissue at any given time, and quantitation of the global changes in protein expression 15 levels observed between two different cell states (collectively known as expression proteomics) and, (ii) with the identification of components of functionally active protein complexes and characterization of the intricate protein-protein interactions involved in intracellular protein trafficking and signaling pathways (collectively known as cell-mapping proteomics). Taken together, these approaches allow comprehensive examination, at the 20 protein level, of the complex cellular changes that occur following transformation of cells from one state to another [Blackstock, W.P. and Weir, M.P. Trends Biotechnol. 1999, 17, 121-127.; Pandey, A. and Mann, M. Nature 2000, 405, 837-846.]. Recent developments in mass spectrometry (MS), coupled with the development of sophisticated bioinfonnatic tools for database interrogation of MS derived data, have been the 25 major factors enabling proteomics [Mann, M., Hendrickson, R.C. and Pandey, A. Annu. Rev. Biochem. 2001, 70, 437-473.]. In particular, the speed, specificity, and sensitivity of mass spectrometry make it especially attractive for use in strategies requiring rapid protein identification and characterization. Conventional MS approaches to proteomics generally involve one or two-dimensional electrophoretic (2DE) separation of protein mixtures, after 30 which the protein spot or gel slice is cut out and subjected to in situ proteolysis using trypsin. Peptides are then extracted and subjected to mass spectrometric analysis. The masses of these peptides are characteristic of the protein, and provide a peptide "mass fingerprint" which can be used in database searches to identify the protein [Henzel, W.J. Billeci T.M., WO 2004/046731 PCT/US2003/036739 -2- Stults J.T., Wong S.C., Grimley C. and Watanabe C. Proc. NatL. Acad. Sci. USA. 1993, 90, 5011-5015.]. A more comprehensive approach, particularly for the identification and quantitation of individual components present in complex protein mixtures, is to subject each of the proteolytically derived peptides to tandem mass spectrometry. Subsequent 5 identification of each peptide may be performed by either database analysis of the uninterpreted product ion spectrum [Eng, J.K. McCormack, A.L. and Yates, J.R. J. Am. Soc. Mass Spectrom. 1994, 5, 976-989.], through database searching of a partially derived amino acid "sequence tag" [Mann, M. and Wilm, M. Anal. Cheni. 1994, 66, 4390-4399.], or by "de novo" sequence analysis [Hunt, D. F., Yates, J.R., Shabanowitz J., Winston S. and Hauer, 10 C.R. Proc. NatL. Acad. Sci. USA 1986, 83, 6233-6237.]. This general analysis strategy is complicated however, by several limitations. Foremost is that 2D-gels are only capable of resolving approximately 1500-2000 proteins, yet there are typically greater than 8000 proteins expressed per cell. Therefore, only the most abundant proteins are observed and a significant portion of 2-DE separated gel spots 15 contain more than one protein due to co-electrophoresis and/or differentially modified (or processed) forms of the same protein. Additionally, several classes of proteins, notably hydrophobic proteins, low abundance proteins, and those with extremes of pI and molecular weight are poorly represented in 2D-gel based separations [Gygi, S.P., Corthals, G.L., Zhang, Y., Rochon, Y., Aebersold, R. Proc. Nati. Acad. Sci. U.S.A. 2000, 97, 9390-9395.]. Also, 2 20 DE is labor-intensive, slow, technically demanding, often suffers from poor reproducibility, and hence is not readily amenable to high-throughput or automation. Secondly, while the identification of only one peptide from a protein digest by MS/MS is required to unambiguously identify that protein, provided that the peptide is unique to a single protein, in practice a significant amount of time during the analysis of 25 complex peptide mixtures is spent in analyzing either the same peptide ion (e.g., in sequential scans during chromatographic separation of a complex peptide mixture) or different peptides of the same protein. Routinely therefore, peptides present at high relative abundance are preferentially sampled, and information regarding the identity of proteins represented in the complex mixture as low abundance peptides is commonly not obtained. Also, up to a quarter 30 of peptide MS/MS spectra obtained during MS based peptide sequencing are unassignable (i.e., product ion spectra are too complex, too low in abundance, or lack sufficient ions to enable their interpretation) [Simpson, R.J., Connelly, L.M., Eddes, J.S., Pereira, J.J., Moritz, R.L. and Reid, G.E. Electrophoresis. 2000, 21, 1707-1732].
WO 2004/046731 PCT/US2003/036739 -3 In order to partially overcome these limitations, dynamic exclusion during MS/MS acquisition has been employed to allow greater numbers of distinct peptide ions to be selected throughout the course of an LC/MS/MS experiment [Davis, M.T.; Spahr, C.S.; McGinley, M.D.; Robinson, J.H.; Bures, E.J.; Beierle, J.; Mort, J.; Yu, W.; Luethy, R.; Patterson, S.D. 5 Proteonics 2001, 1, 108-117.]. Several strategies for simplification of the peptide mixture prior to mass spectrometric analysis, either by multidimensional chromatographic methods [Washburn, M.P., Wolters, D. and Yates, J.R. Nat. Biotechnol. 2001, 19, 242-247.], or by selective enrichment in solution of only those peptides containing certain amino acids, by affinity selection [Spahr, C.S. Susin, S.A., Bures, E.J., Robinson, J.H., Davis, M.T., 10 McGinley, M.D., Kroemer, G. and Scott D. Patterson, S.D. Electrophoresis. 2000, 21, 1635 1650] or by differential chromatography [Gevaert, K., Van Damme, J., Goethals, M., Thomas, G.R., Hoorelbeke, B., Demol, H., Martens, L., Puype, M., Staes, A. and Vandekerckhove, J. Mol. Cell. Proteomics. 2002, 1, 896-903.], have also been described. In combination with the dynamic exclusion and multidimensional chromatographic approached 15 outlined above, these methods allow further increases in the number of proteins that can be identified by mass spectrometric analysis. However, the identification of low abundance proteins in these complex mixtures, where the expression of individual proteins may vary from 10 to 106 copies per cell, including those potentially containing important information such as the structure and location of post-translational modifications, still remains a 20 significant challenge. A fundamental aspect of proteomics research is the determination of protein expression levels between two different states of a biological system (i.e., relative quantification of protein levels), such as that encountered between a normal and diseased cell or tissue. As there is a marked disparity between changes in mRNA expression levels 25 (transcriptomics) and their corresponding proteins (proteomics), it is clear that array-based gene expression monitoring or other gene expression methods for measuring mRNA abundances, alone, are insufficient for analyzing the cell's protein complement [Gygi, S.P., Rochon, Y., Franza, B.R. and Aebersold, R. Mol. Cell. Biol. 1999, 19, 1720-1730.]. Qualitative and quantitative analysis of changes in protein expression profiles as a 30 function of, for example, cell-cycle regulation, disease state or drug exposure has traditionally been performed by 2DE via image analysis of individual stained protein spots, due to the high dimensionality afforded by the orthogonal isoelectric point and molecular weight separation modes [Patton & Beechem Curr Opinion Chem Biol 2002, 6, 63-69.; Zhou, WO 2004/046731 PCT/US2003/036739 -4- G., Li, H., DeCamp, D., Chen, S., Shu, H., Gong, Y., Flaig, M., Gillespie, J.W., Hu, N., Taylor, P.R., Emmert-Buck, M.R., Liotta, L.A., Petricoin, E.F. and Zhao, Y. Mol Cell Proteomics 2002 1, 117-123]. However, these 2DE gel based quantitation approaches suffer from the same limitations as those discussed above for 2DE gel based protein identification, 5 thereby limiting quantitative analysis to those proteins that are present as relatively abundant, pure spots on the gel and that can be adequately visualized by the staining method employed [Gygi, S.P., Corthals, G.L., Zhang, Y., Rochon, Y., Aebersold, R. Proc. Natl. Acad. Sci. U.S.A. 2000, 97, 9390-9395.]. Some of these shortcomings have been overcome by the use of more sensitive detection methods, such as the SYPRO ruby fluorescent protein stain 10 [Berggren, K., Chernokalskaya, E., Steinberg, T.H., Kemper, C., Lopez, M.F., Diwu, Z., Haugland, R.P. and Patton, W.F. Electrophoresis 2000, 21, 2509-2521], and by the use of 2 D difference gel electrophoresis (DIGE) [Unlu, M., Morgamn, M.E. and Minden, J.S. Electrophoresis, 1997, 18, 2071-2077.]. However, these methods are fundamentally reliant on the use of adequate software packages for spot detection, gel matching and spot 15 quantitation [Raman, B., Cheung, A. and Martin, M.R. Electrophoresis, 2002, 23, 2194 2202.] and cannot account for situations where overlapping protein spots are present. Two general approaches for MS based quantitation of differential protein expression levels between two different cell/tissue states, using isotopic labeling, have been developed. First, the groups of Smith [Pasa-Tolic, L; Jensen, P. K.; Anderson, G. A.; Lipton, M. S.; 20 Peden, K. K.; Martinovic, S; Tolic, N; Bruce, J. E.; Smith, R. D. J. Am. Chem. Soc. 1999, 121, 7949-7950.; Veenstra, T.D., Martinovic, S., Anderson, G.A., Pasa-Tolic, L. and Smith, R.D. J Am. Soc. Mass Spectrom. 2000, 11, 78-82.; Conrads, T.P. et. al. Anal. Chem. 2001, 73, 2132-2139; Smith, R.D.; Anderson, G.A.; Lipton, M.S.; Pasa-Tolic, L.; Shen, Y.; Conrads, T.P.; Veenstra, T.D.; Udseth, H.R. Proteomics 2002, 2, 513-523.], Chait [Oda, Y.; 25 Huang, K.; Cross, F. R.; Cowburn, D.; Chait, B. T. Proc. Natl. Acad. Sci. U.S.A. 1999, 96, 6591-6596.], and others [Chen, X., Smith, L.M., Bradbury, E.M. Anal. Chem. 2001, 72, 1134-1143; Ong, Shao-En., Blagoev, B., Kratchmarova, I., Bach-Kristensen, D., Steen, H., Pandey, A. and Mann, M. Mol. Cell. Proteonics. 2002, 1, 376-3 86; Jiang, H. and English, A.M. J. Proteome. Res. 2002, 1, 345-350; Zhu, H., Hunter, T.C., Pan, S., Yau, P.M., 30 Bradbury, E.M. and Chen, X. Anal. Chem. 2002, 74, 1687-1694.] have all separately presented results involving the metabolic incorporation of isotopically depleted ( 1 3C-, 1 5 N-, and 2 H-depleted) or enriched amino acids (either by uniform labeling with 1 5 N, or by incorporation of selected amino acids containing heavy isotopes (eg., 1 3 C, 1 5 N, 2H)) into a WO 2004/046731 PCT/US2003/036739 -5 cellular protein population. After isolation of proteins from the cellular matrix, the sample is combined with one incorporating natural isotopes, then individual proteins are resolved by electrophoretic or chromatographic methods, digested and the masses of the peptides determined by MS. The site specific labels allow efficient identification of those peptides 5 containing the enriched amino acid mass "tags", via comparison with their unlabelled forms. Additionally, by comparing the relative abundances of the peptides from the isotopically enriched sample with those from a sample prepared using naturally abundant isotopes, quantitation of changes in the level of protein expression between the two samples may be obtained. This in vivo labelling approach is limited, however, to those systems where cells 10 can be cultured under conditions suitable for incorporation of the isotopic label. The second approach involves in vitro chemical derivatization with isotopically enriched labels following isolation of the proteins from the cellular matrix. One approach that has received much attention to date is the isotope coded affinity tag (ICAT) technique developed by Aebersold and co-workers [Gygi, S. P.; Rist, B; Gerber, S. A.; Turecek, F; 15 Gelb, M. H.; Aebersold, R. Nat. Biotechnol. 1999, 17, 994-999.; Smolka, M; Zhou, H; Aebersold, R. Mol. Cell. Proteomics. 2002, 1, 19-29.; Shiio, Y; Donohoe, S; Yi, E.C.; Goodlett, D.R.; Aebersold, R; Eisenman, R.N. EMBO 2002, 21, 5088-5096.; Han, D.K., Eng, J., Zhou, H. and Aebersold, R. Nature Biotechnol. 2001, 19, 946-95 1.; Griffin, T.J., Han, D.K.M., Gygi, S.P., Rist, B., Lee, H., Aebersold, R. and Parker, K.C. J. Am. Soc. Mass. 20 Spectrom. 2001, 12, 1238-1246.; Gygi, S.P., Rist, B. and Aebersold, R. Current Opinions in Biotechnol. 2000, 11, 396-401.]. Proteins from two different cell/tissue states are reduced and S-alkylated with either naturally abundant (light) or isotopically enriched (heavy) ICAT reagents, respectively, each containing a biotin moiety for subsequent affinity selection of cysteine-containing peptides by streptavidin affinity purification, leading to simplification of 25 the mixture prior to MS analysis. In the same manner as described above for metabolic labelled samples, the abundance ratios of the proteolytically derived peptides containing the heavy isotope "diseased" sample compared to those originating from the light isotope "normal" sample (or between those containing structural labels) for each of these methods are indicative of changes in the level of protein expression between the two samples, thereby 30 allowing their differential quantitation. A number of similar strategies for selective peptide identification and differential quantitation have since been described [Spahr, C.S. Susin, S.A., Bures, E.J., Robinson, J.H., Davis, M.T., McGinley, M.D., Kroemer, G. and Patterson, S.D. Electrophoresis. 2000, 21, WO 2004/046731 PCT/US2003/036739 -6 1635-1650.; Adamczyk, M. and Gebler, J.C.; Wu, J. Rapid. Commun. Mass Spectrom. 1999, 13, 1813-1817.; Sechi, S. Anal. Chem. 1998, 70, 5750-5158.; Sechi, Salvatore. Rapid Commun. Mass Spectrom. 2002, 16, 1416-1424.; Gehanne, S., Cecconi, D., Carboni, L., Giorgio Righetti, P., Domenici, E. and Handam, M. Rapid Commun. Mass Spectrom. 2002, 5 16, 1692-1698.; Wang, S.; Regnier, F.E. J. Chrom. A 2001, 924(1-2), 345-357.; Amini, A; Chakraborty, A; Regnier, Fred E.. J. Chrom. B. 2002, 772, 35-44.; Wang, S., Zhang X. and Regnier, F.E. J Chron. A. 2002, 949, 153-162.; Mirgorodskaya, O.A. Kozmin, Y.P., Titov, M.I., Komer, R., Sonksen, C.P. and Roepstorff, P. Rapid Commun. Mass Spectrom. 2000, 14, 1226-1232.; Stewart, I.I.; Thomson, T; Figeys, D. Rapid Commun. Mass Spectrom. 2001, 15, 10 2456-2465.; Yao, X.; Freas, A; Ramirez, J.; Demirev, P.A.; Fenselau, C. Anal. Chem. 2001, 73, 2836-2842.; Peters, E.C., Horn, E.C., Tully, D.C. and Brock, A. Rapid Commun. Mass Spectrom. 2001, 15, 2387-2392.; Muenchbach, M., Quadroni, M., Miotto, G. and James, P. Anal. Chem. 2000, 72, 4047-4057; Chakraborty, A; Regnier, F. E. J Chrom. A 2002, 949, 173-184.; Geng, M.; Ji, J.; Regnier, F. E.. J Chrom. A 2000, 870, 295-313.; Ji, J.; 15 Chakraborty, A.; Geng, M.; Zhang, X.; Amini, A.; Bina, M.; Regnier, F. J. Chrom. B 2000, 745, 197-210.; Goodlett, D.R., Keller, A., Watts, J.D., Newitt, R., Yi, E.C., Purvine, S., Eng, J., von Haller, P., Aebersold, R. and Kolker, E. Rapid Commun. Mass Spectrom. 2001, 15, 1214-1221.; Liu, P; Regnier, F.E. J. Proteome Res. 2002, 1, 443-450.]. More recently, two groups have described covalent solid phase cysteine capture methods for mixture 20 simplification. [Zhou, H.; Ranish, J. A.; Watts, J. D.; Aebersold, R. Nature Biotechnol. 2002, 20, 512-515.; Qiu, Y; Sousa, E. A.; Hewick, R. M.; Wang, J. H. Anal. Chem. 2002, 74, 4969 4979.]. Several non-isotopic labelling strategies employing structural labelling [Cagney, G., Emili, A. Nature Biotechnol. 2002, 20, 163-170.; Beardsley, R.L. and Reilly, J.P. J. Proteome Res. 2003, 2, 15-21.], as well as a method for performing differential quantitation by 25 comparing the chromatographically resolved MS ion abundances between a sample of interest and a control have also been described [Bondarenko, P.V., Chelius, D. and Shaler, T.A. Anal. Chem. 2002, 74, 4741-4749.; Chelius, D. Bondarenko. P.V. J. Proteome Res. 2002, 1, 317 - 323.]. A number of reviews have compared the relative merits and limitations of several of these approaches [Moseley, M.A. Trends in Biotechnol. 2001, 19, SO-S16.; 30 Patton, W.F., Schulenberg, B. and Steinberg, T.H. Curr. Opin. Biotechnol. 2002, 13, 321 328.; Turecek, F. J Mass Spectrom. 2002, 37, 1-14.; Regnier, F. E.; Riggs, L.; Zhang, R.; Xiong, L.; Liu, P.; Chakraborty, A.; Seeley, E.; Sioma, C.; Thompson, R. A. J Mass Spectrom. 2002, 37, 133-145.].
WO 2004/046731 PCT/US2003/036739 -7 The two major post translational modifications (PTM's) of proteins are phosphorylation and glycosylation. Of these, the reversible phosphorylation of proteins ranks among the most important PTM that occurs in the cell. Commonly, two approaches for enrichment of phosphoprotein/phosphopeptides prior to their detection and subsequent 5 microcharacterisation have been employed. There are (i) Immunoprecipitation; [Gronborg, M., Kristiansen, T.Z., Stensballe, A., Andersen, J.S., Ohara, 0., Mann, M., Jensen, O.N. and Pandey, A. Mol. Cell. Proteomics. 2002, 1, 517-527.], and (ii) hnmobilized metal ion affinity chromatography (IMAC); [Posewitz, M.C. and Tempst, P. Anal. Chem. 1999, 71, 2883 2892.; Ficarro, Scott B.; McCleland, Mark L.; Stukenberg, P. Todd; Burke, Daniel J.; Ross, 10 Mark M.; Shabanowitz, Jeffrey; Hunt, Donald F.; White, Forest M. Nat Biotechnol. 2002, 20, 301-305.]. Proteins resolved by 2D gels or 1D SDS-PAGE can be detected by autoradiography or storage phosphorimaging using in vivo or in vitro 3 7P labelling [Ji H, Baldwin GS, Burgess AW, Moritz RL, Ward LD, and Simpson RJ. JBiol Chem 1993, 268, 13396-13405.; Boyle 15 W.J., Geer Van der P. and Hunter T. Methods Enzymol. 1991, 201, 110-149.; Yan J.X., Packer N.H., Gooley A.A. and Williams K.L. J. Chromatogr. 1998, 808, 23-4 1.], or by western blotting using antibodies to detect phosphorylated proteins. Historically, sequencing of 32 P radiolabelled peptides was performed by Edman degradation [Wettenhall, R. E., Aebersold, R. H., and Hood, L. E. Methods Enzymol 1991, 201, 186-99.]. However, this 20 approach typically requires prior knowledge of the protein sequence in order to correlate a loss of radioactivity, indicative of the phosphorylation site, with the amino acid sequence of the peptide and can be hampered by a low stoichiometry of phosphorylation at a given site [Katze, M. G., Kwieciszewski, B., Goodlett, D. R., Blakely, C. M., Nedderman, P., Tan, S-L, Aebersold, R. Virology. 2000, 278, 501-513], or by the presence of multiple differentially 25 phosphorylated forms of the same protein being present [Storm, S. M., and Khawaja, X. Z. Brain. Res. Mol. Brain. Res. 1999, 71, 50-60.]. More recently, mass spectrometry has proven to be particularly useful for the analysis of protein phosphorylation [Neubauer, G. and Mann, M. Anal Chem 1999, 71, 235-242.]. However, the mass spectrometric analysis of phosphopeptides is more complicated compared 30 to that for unmodified peptides, due to their lower ionization efficiencies in positive ion MS analysis mode [Liao P.C., Leykam J., Andrews P.C., Cage D.A. and Allison J. Anal. .Biochem. 1994, 219, 9-20.], and the difficulties encountered in rapidly switching between ionisation polarities, to allow both identification (in negative mode) and characterization (in WO 2004/046731 PCT/US2003/036739 positive mode), during the course of a single experiment [Janek K., Wenschuh H., Bienert M., and Krause E. Rapid Commun. Mass Spectrom. 2001, 15, 1593-1599.; Ma Y., Lu Y., Mo W., and Neubert T.A.. Rapid Cominun. Mass Spectrom. 2001, 15, 1693-1700.]. Also, phosphoserine- and phosphothreonine-containing peptides can readily undergo the facile loss 5 of phosphoric acid (H 3
PO
4 ) upon ESI and MALDI ionization, and upon low energy CID. There are several ways to circumvent the difficulties associated with ionization suppression of phosphopeptides in positive ion mode. The sample may be enriched for phosphopeptides, as discussed above, in order to reduce the excess of unmodified peptides that suppress ionisation. However, regardless of attempts made to overcome signal 10 suppression in the analysis of phosphopeptides, limitations still exist when attempting to identify the site of phosphorylation by tandem mass spectrometry, due to the lability of the phosphate side chain. A simple solution is to perform analysis of the phosphopeptide samples by the parallel analysis of samples treated with and without alkaline phosphatase [Larsen, M.R., Sorensen, G.L., Fey, S.J., Larsen, P.M. and Roepstorff, P. Proteomics, 2001, 15 1, 223-238.]. Phosphopeptide identification may also be performed by precursor ion scan mode monitoring of the characteristic phosphate specific product ions at rn/z 79 (P03), and m/z 89 (H 2 P0 4 ), following collision induced dissociation (CID) of the phosphopeptide ions in negative ion mode, or by monitoring for the loss of H 3
PO
4 (98 Da) or HPO 3 (80 Da) in positive ion neutral loss scan mode [Carr S.A., Huddleston M.J., and Annan R.S. Anal. 20 Biochen. 1996, 239, 180-192.; Schlosser A., Pipkom R., Bossemeyer D., and Lehmann W.D. Anal. Chem. 2001, 73, 170-176.]. Product ion scan mode MS/MS of the intact phosphopeptide can sometimes be used to characterize the precise site of phosphate attachment within the peptide, as the b- and y type fragment ions formed from peptide backbone cleavage may contain information on the 25 specific location of the phosphorylated amino acid residue. However, as indicated above, this technique is often hampered by extensive gas-phase dephosphorylation and elimination of phosphoric acid (H 3
PO
4 , 98 Da) from phosphoserine- and phosphothreonine, making it difficult to unambiguously locate the site of modification. The phosphotyrosine side chain, however, is relatively stable under MS and MS/MS conditions due to the relatively high 30 stability of the arylphosphate modification, therefore the location of phosphotyrosine residues can be readily determined by the mass difference between two successive fragment ions of 243 Da. Indeed, the characteristic 'reporter' immonium ion of phosphotyrosine at 216.043 Da has been used for precursor ion experiments in positive ion mode MS/MS for selective WO 2004/046731 PCT/US2003/036739 -9 identification of phosphotyrosine containing peptides [Steen H., Kuster B., Fernandez M., Pandey A., and Mann M. 2001. Anal. Chem. 73: 1440-1448]. A common way to overcome the lability of the phosphoserine and phosphothreonine side chains during MS/MS is to replace the phosphate group with a more stable, less acidic 5 side chain functionality. This can be accomplished by fl-elimination of phosphoserine and phosphothreonine residues under strongly alkaline conditions to yield dehydroalanine or dehydroaminobutyric acid residues, respectively. Subsequent Michael addition of a nucleophile allows a simple means for derivatizing the formerly phosphorylated serine or threonine residues prior to mass spectrometric analysis [Weckwerth, W., Wilhnitzer, L. and 10 Fiehn, 0. Rapid Commun. Mass Spectrom. 2000, 14, 1677-1681.; Molloy, M.P. and Andrews, P.C. Anal. Chem. 2001, 73, 5387-5394.; Li, W., Boykins, R.A., Backlund, P.S., Wang, G. and Chen, H-C. Anal. Chem. 2002 74, 5701-5710.; Steen, H. and Mann, M. J. Am. Soc. Mass Spectrom. 2002, 13, 996-1003.]. The method can also be used to incorporate an affinity "tag", thereby allowing enrichment of the derivatized peptides prior to their analysis 15 [Adamczyk, M., Gebler, J.C. and Wu, J. Rapid Commun Mass Spectrom. 2001, 15, 1481 1488.; Oda, Y., Nagasu, T. and Chait, B.T. Nature Biotechnol. 2001, 19, 379-382.], and, when coupled with the incorporation of an isotopic label, for quantitation of the degree of phosphorylation observed between two different sample sets [Goshe M.B., Conrads, T.P., Panisko, E.A., Angell, N.H., Veenstra, T.D. and Smith, R.D. Anal Chem. 2001, 73, 2578 20 2586.; Goshe, M.B.,Veenstra, T.D., Panisko, E.A., Conrads, T.P., Angell, N.H. and Smith, R.D. Anal. Chem, 2002, 74, 607-616.; Adamczyk, M; Gebler, J.C. and Wu, J. Rapid Commun. Mass Spectrom. 2002, 16, 999-1001.]. Note that p-elimination affects both 0-phospho and O-glycosidic linkages to the same extent. Therefore the P-elimination method cannot distinguish between O-linked post 25 translational modifications of seine and threonine residues. Another chemical method for analyzing the phosphoproteome, which is applicable to phosphotyrosine-containing peptides as well as those containing phosphoserine and phosphothreonine residues, has recently been described [Zhou, H., Watts, J.D. and Aebersold, R. Nature Biotechnol 2001, 19, 375-378]. For a recent review of quantitative 30 phosphoproteome analysis see [Mann, M., Ong, S-E., Gronborg, M., Steen, H., Jensen, O.N. and Pandey, A. Trends Biotechnol. 2002, 20, 261-268.]. Determination of protein-protein interactions are a crucial aspect of integrated biological studies aimed at understanding the complex pathways involved in cellular WO 2004/046731 PCT/US2003/036739 - 10 signaling and protein trafficking. Already, a large proportion of known protein-protein interactions in yeast [Bader, G.D. and Hogue, C.W.V. Nature Biotechnol. 2002, 20, 991-997] have been identified by genome-scale yeast two-hybrid assays [Uetz, P. et al. Nature 2000, 403, 623-627.; Ito, T., Chiba, T., Ozawa, R., Yoshida, M., Hattori, M. and Sakaki, Y. Proc. 5 Natl. Acad. Sci. USA, 2001, 98, 4569-4574.; Legrain, P. Nature. Biotechnol. 2002, 20, 128 129.], and direct affinity capture methods such as (i) co-precipitation using affinity-tagged recombinant proteins [Gavin, A-C. et al. Nature 2002, 415, 141-147.; Ho, Y. et al. Nature. 2002, 415, 180-183.], (ii) 'pull down' techniques using antibodies directed against one of the component proteins, (iii) protein-affinity-interaction chromatography (e.g., using 10 recombinant glutathione S-transferase (GST)-fusion proteins and glutathione-affinity chromatography), or (iv) isolation of intact multiprotein complexes (e.g., nuclear pore complexes, ribosome complexes, and spliceosomes). These later methods, coupled with mass spectrometry for protein identification, constitute the major methods for cell mapping proteomics, which aims to describe all protein-protein interactions (both spatially and 15 temporally) within a given cell [Blackstock, W.P. and Weir, M.P. Trends Biotechnol. 1999, 17, 121-127.]. However, the interactions detected by these physical methods may include large numbers of non-specific interactions with no biological significance [Bader, G.D. and Hogue, C.W.V. Nature Biotechnol. 2002, 20, 991-997.]. Mass spectrometry combined with cross-linking [Rappsilber, J., Siniossoglou, S., 20 Hurt, E. C., and Mann, M. Anal. Chem. 2000, 72, 267-275.], or hydrogen/deuterium exchange [Yamada, N., Suzuki, E., and Hirayama, K. Rapid Commun. Mass Spectrom. 2002, 16, 293-299] can be used for the rapid low-resolution evaluation of the three dimensional structures of proteins and protein complexes. Cross-linking generally involves chemical [Uy, R., and Wold, F., 1977, In: "Protein Cross-linking" (Friedman, M., ed.), Plenum, New York.; 25 Fancy, D.A. Current Opin. Chem. Biol. 2000, 4, 28-33.] or photochemical [Chowdhry, V., and Westheimer, F.H., Annu. Rev. Biochem. 1979, 48, 293-325.; Fancy, D.A. and Kodadek, T. Proc. Natl. Acad. Sci. USA 1999, 96, 6020-6024.] cross-linking of an isolated protein complex, followed by proteolytic digestion and MS and/or MS/MS analysis of the resulting peptide mixture, to subsequently locate proximally adjacent regions of the proteins being 30 examined. However, the large number of peptide species that are generated following digestion of cross-linked proteins makes it often difficult to rapidly and unambiguously identify those peptides that are cross-linked. A number of different groups have developed methods to partially overcome this limitation, either by prior reduction of thiol cleavable WO 2004/046731 PCT/US2003/036739 - 11 cross-linkers [Bennett, K.L., Kussmann, M., Bjork, P., Godzwon, M., Mikkelsen, M., Sorensen, P. and Roepstorff, P. Protein Sci. 2000, 9, 1503-1518.], by using isotopic labelling methodologies [Chen, X., Chen, Y. H., and Anderson, V. E. Anal. Biochem 1999, 273, 192 203.; Pearson, K.M., Panell, L.K. and Fales, H.M. Rapid Commun. Mass Spectrom. 2002, 16, 5 149-159.; Muller, D. R., Schindler, P., Towbin, H., Wirth, U., Voshol, H., Hoving, S., and Steinmetz, M.O. Anal. Chem. 2001, 73, 1927-1934.; Back, J.W., Notenboom, V., de Koning, L.J., Muijsers, A.O., Sixina, T.K., de Koster, C.G., and de Jong, L. Anal. Chem. 2002, 74, 4417-4422.; Taverner, T., Hall, N.E., O'Hair, R.A.J. and Simpson, R.J. J Biol. Chem. 2002, 277, 46487-46492], or via the incorporation of a labile MS/MS "tag" on the cross linker itself 10 [Back, J.W., Hartog, A.F., Dekker, H.L., Muijsers, A.O., de Koning, L.J. and de Jong, L. J. Am. Soc. Mass Spectrom. 2001, 12, 222-227.]. It is important to recognise that all of the techniques described above for selective identification and differential quantitation based on incorporation of a differential isotopic "signature" rely on a common approach for peptide identification. That is, the isotopic 15 signatures of the peptide ions of interest are detected by mass analysis of their intact precursor ions. Thus, limitations of the approach are encountered when; (i) ions of interest are present at low levels (i.e., approaching or below the level of chemical noise present in the spectrum) [Krutchinsky, A.N. and Chait, B.T. J. Am. Soc. Mass Spectrom. 2002, 13, 129 134.], (ii) the masses of low abundance differentially labeled peptides overlap with other 20 higher abundance components present in the mixture, (iii) separation of the differential labelled "heavy" and "light" peptides occurs during chromatographic fractionation of the peptide mixture [Zhang, R.; Sioma, C.S.; Wang, S.; Regnier, F.E. Anal. Chem. 2001, 73, 5142-5149. Zhang, R., Sioma, C.S.; Thompson, R.A.; Xiong, L.; Regnier, F.E. Anal. Chem. 2002, 74, 3662-3669.; Zhang, R. and Regnier, F.E. J. Proteome Res. 2002, 1, 139-147], or 25 (iv) the mass spectrometer lacks sufficient resolution to adequately resolve the two labelled components, thereby precluding their detection. Tandem mass spectrometry (MS/MS) dissociation methods [McLuckey, S.A. and Goeringer, D.E. J Mass Spectrom. 1997, 32, 461-474.] methods, whereby a precursor ion of interest is mass selected, subjected to fragmentation via collision-induced dissociation (CID) 30 and then the resultant product ions mass analyzed to derive structural information relating to the amino acid sequence of the peptide or to indicate the presence and location of post translational modifications, may be used to address at least some of the limitations indicated above. Due to the reduction in chemical noise associated with the formation of product ions WO 2004/046731 PCT/US2003/036739 -12 at different m/z values to that of the mass selected precursor ion, low abundance ions that may not otherwise be apparent in the mass spectrum (i.e., below the level of chemical noise) may be successfully detected and analysed by MS/MS methods. Indeed, operation of a tandem mass spectrometer in precursor ion [Schwartz, J. C.; Wade, A. P.; Enke, C. G.; 5 Cooks, R. G. AnaL Chem. 1990, 62, 1809-1818.; Schlosser A., Pipkorn R., Bossemeyer D., and Lehmann W.D. 2001. Anal. Chem. 73: 170-176.; McClellan, J. E.; Quarmby, S. T.; Yost, R. A.; Anal. Chem. 2002, 74, 5799-5806.] or neutral loss [Schwartz, J. C.; Wade, A. P.; Enke, C. G.; Cooks, R. G. Anal.Chem. 1990, 62, 1809-1818.; Wilm M; Neubauer G; Mann M. Anal. Chem. 1996, 68, 527-33.; McClellan, J. E.; Quannby, S. T.; Yost, R. A.; Anal. 10 Chem. 2002, 74, 5799-5806.] MS/MS scan modes, whereby the mass spectrometer is set to detect diagnostic low mass product ions, or product ions offset by a given mass from the precursor ion, respectively, has been demonstrated to increase sensitivity by 1-2 orders of magnitude over conventional MS based detection methods [Wilm M; Neubauer G; Mann M. Anal. Chem. 1996, 68, 527-533.]. These precursor ion or neutral loss scan modes can be used 15 to indicate the presence of peptides containing certain structural features via formation of diagnostic "signature ions" formed upon CID MS/MS, giving greater specificity compared to that of MS based methods, and overcoming the problem of low abundance peptides overlapping with other higher abundance components present in the mixture. One of the limitations of MS/MS based approaches however, is that fragmentation giving rise to the 20 product ion or neutral loss of interest is usually only one of many dissociation channels, thereby "diluting" the spectrum, limiting sensitivity and potentially complicating subsequent interpretation of the spectra. Also, MS/MS methods for quantitation of differential protein expression have not been described. 25 Summary of the Invention We have developed a novel strategy for protein identification, differential quantitation and for the analysis of post translational modification status and cross-linking, involving the fixed-charge derivatization of selected amino acids, peptides or proteins containing certain structural features, (e.g., the side chains of selected amino acids, or those containing post 30 translational modifications or cross linking agents), that is, in its preferred forms, capable of addressing all of the MS and MS/MS limitations discussed above, with an expected increase in selectively and sensitivity of several orders of magnitude over the existing MS based approaches.
WO 2004/046731 PCT/US2003/036739 -13 Derivatization strategies for mass spectrometric analysis are commonplace and have been reviewed previously [Knapp, D.R. Methods Enzymology 1990, 193, 314-329.; Anderegg, R.J. Mass Spectrom. Rev. 1988, 7, 395-424.; Roth, K.D.W., Huang, Z-H., Sadagopan N, and Watson J.T. Mass Spectromn. Rev. 1998, 17, 255-274.; Sadagopan, N. and 5 Watson J.T. J Am. Soc. Mass. Spectrom. 2001, 12, 399-409.; Jones, M.B., Jeffrey, W.A., Hansen, H.F., Pappin, D.J.C. Rapid Commun. Mass Spectrom. 1994, 8, 737-42.; Spengler, B., Luetzenkirchen, F., Metzger, S., Chaurand, P., Kaufmann, R., Jeffery, W., Bartlet-Jones, M. and Pappin, D.J.C. Int. J Mass Spectrom. Ion Proc. 1997, 169/170, 127-140.; Keogh, T., Lacey, M.P., and Youngquist, R.S. Rapid. Commum. Mass Spectromn. 2000, 14, 2348.]. 10 Although fixed charge derivatives of peptides have been used in conjunction with tandem mass spectrometry for sequencing applications, previous work has entirely focused on directing fragmentation toward the formation of a particular series of backbone cleavage derived sequence ions (i.e., maximizing sequence coverage), and has been limited to the derivatization of the N- and C-termini, as well as lysine and arginine side chains. 15 In contrast, the present invention is based on a fixed-charge derivatization approach, which is designed to direct the dissociation of the peptide toward a single predictable fragmentation channel, resulting in the formation of a single product, thereby allowing its selective identification from a complex mixture by precursor ion or neutral loss scan mode MS/MS, then subjecting it to further structural interrogation, using MS/MS or multistage 20 MS/MS (MS"), or by determination of an "accurate mass tag" of the precursor or product ion [Conrads, T.P., Anderson, G.A., Veenstra, T.D., Pasa-Tolic, L. and Smith, R.D. Anal. Chem. 2000, 72, 3349-3354.; Goodlet, D.R., Bruce, J.E., Anderson, G.A., Rist, B.., Pasa-Tolic, L., Fiehn, 0., Smith, R.D. and Aebersold, R. Anal. Chem. 2000, 72, 1112-1118.; Strittmatter, E.F., Ferguson, P.L., Tang, K. and Smith, R.D. J. Am. Soc. Mass Spectronz. 2003, 14, 980 25 991.], to enable its characterization. The general principles behind the use of the fixed-charge peptide ion derivatization approach for directed CID MS/MS fragmentation are demonstrated here for the selective identification and differential quantitation of side chain fixed-charge sulfonium ion derivatives of peptides containing methionine (fixed-charge sulfonium ion derivatives of the 30 amino acids methionine and cysteine, which were observed to fragment exclusively via neutral loss of the side chain CH 3 SR, where R was a substituted alkyl group [Reid, G.E., Simpson, R.J. and O'Hair, R.A.J. J. Am. Soc. Mass Spectrom. 2000, 11, 1047-1060.; O'Hair, R.A.J. and Reid, G.E. Eur. Mass Spectroin. 1999, 5, 325-334.], have previously been WO 2004/046731 PCT/US2003/036739 - 14 employed for the independent gas-phase synthesis of potential product ions involved in the side chain fragmentation reactions of amino acids.). Comparable strategies for the selective analysis of peptides containing other amino acids, those that contain post translationally modified amino acids, as well as for the characterization of cross-linked peptides are also 5 described. The present invention provides methods of analysis of amino acids, peptides or proteins, the methods comprising: (1) derivatizing a mixture of amino acids, peptides or proteins, to forn at least one amino acid, peptide or protein derivatized to contain a fixed-charge ion, other than at the C-terminal 10 or N-terminal end thereof; (2) introducing the mixture of amino acids, peptides or proteins containing at least one amino acid, peptide or protein derivatized to contain a fixed-charge ion other than at the C terminal or N-terminal end thereof, to a mass spectrometer; (3) passing the mixture of amino acids, peptides or proteins containing at least one amino 15 acid, peptide or protein derivatized to contain a fixed-charge ion, other than at the C-terminal or N-terminal end thereof, through a first mass resolving spectrometer to select precursor protein or peptide ions having a first desired mass-to-charge ratio; (4) subjecting the precursor ions of the first mass to charge ratio to dissociation to form a product ion having a characteristic second mass-to-charge ratio that is characteristic of a 20 fragmentation occurring at a site adjacent to the fixed charge; and (5) detecting the product ions having the second mass-to-charge ratio. The product ion having the second characteristic mass-to-charge ratio may be either a charged amino acid, peptide or protein containing product ion formed by neutral loss of the fixed charge from the precursor ion, or a product ion formed by charged loss of the fixed 25 charge from the precursor ion. Preferably, the method of the invention comprises the further step of: (6) determining the identity of the peptide or protein. Step (6) may be perfonned by first repeating steps (1), (2), (3) and (4) and then subjecting the product ion having the second characteristic mass-to-charge ratio formed by (i) 30 neutral loss from the precursor, which will have a charge state the same as that of the precursor, or (ii) the complementary product ion to the charged product ion formed by charged loss from the precursor ion, which corresponds to a protein or peptide containing product ion having a charge state one lower than the precursor, to a further stage of WO 2004/046731 PCT/US2003/036739 - 15 dissociation to form a series of product ions having a range of mass to charge ratios, for the purpose of determining the amino acid sequence of the peptide and subsequently, the identity of its protein of origin. Alternatively, step (6) may be carried out by use of high resolution mass analyzers to 5 obtain mass accuracies of approximately 1-5 ppm on the product ion detected in step (5), or its complementary product ion (i.e., to derive an "accurate product ion mass tag"). This, coupled with database searching, may be employed for subsequent identification of those peptides found to contain a fixed charge derivative. Previously, in cases where an "accurate mass tag" has been obtained for a precursor ion, and the presence of a particular amino acid is 10 known (for example the presence of a cysteine residue), the specificity of database searching algorithms can be improved such that unambiguous identification of the protein from which the peptide is derived has been achieved from this information alone. The amino acid, peptide or protein ion may be dissociated by any suitable dissociation method including, but not limited to, collisions with an inert gas (known as collision-induced 15 dissociation (CID or collisionally-activated dissociation (CAD); (ii) collisions with a surface (known as surface-induced dissociation or SID); (iii) interaction with photons (e.g. via a laser) resulting in photodissociation; (iv) thermal/black body infrared radiative dissociation (BIRD), and (v) interaction with an electron beam, resulting in electron-induced dissociation for singly charged cations (EID), electron-capture dissociation (ECD) for multiply charged 20 cations, or combinations thereof. The methods of analysis of the present invention may be used for amino acid, peptide or protein identification, differential quantitation, analysis of post translational modification status, analysis of cross-linking status or interaction of proteins. The present invention provides methods of analysis of amino acids, peptides or 25 proteins containing a fixed charge derivative, at a site other than at the C-terminal or N terminal end thereof. The fixed-charge derivative may be contained on the side-chain of a selected amino acid, or a side-chain of a selected amino acid residue contained within a protein or peptide. Preferably the selected amino acid residue is that of a "rare" amino acid, as described in more 30 detail below. The fixed-charge derivative may be contained on a side-chain of a post translational modified amino acid residue, as described in more detail below. The fixed charge derivative may be on a cross-link contained between two proteins or peptides, as described in more detail below.
WO 2004/046731 PCT/US2003/036739 - 16 The selected amino acid residue may be one containing a S atom in the side chain thereof. Preferred amino acid residues are methionine, cysteine, homocysteine or selenocysteine. The selected amino acid residue may also be tryptophan or tyrosine. The side chain may also contain an S-alkyl group. Preferred amino acid residues are methionine, S 5 alkyl cysteine, S-alkyl homocysteine, S-alkyl tryptophan or S-alkyl tyrosine. Derivatization of the side chain of a selected amino acid residue to introduce a fixed-charge may be achieved by strategies known in the art. The fixed-charge may also be contained on an 0-linked post-translationally modified amino acid residue (for example, a dehydroalanine residue formed by #-elimination from an 10 0-linked post-translationally modified serine amino acid residue, or a dehydroamino-2 butyric acid residue formed by f-elimination from an O-linked post-translationally modified threonine amino acid residue) contained within a protein or peptide. Derivatization of a formerly post-translationally modified amino acid residue to introduce a fixed-charge may be achieved by strategies known in the art. 15 The fixed-charge may also be contained within a cross-linking reagent, or a cross-link contained between two amino acids, peptides or proteins. Derivatization of a cross-linking reagent or a cross-link contained between two amino acids, peptides or proteins to introduce a fixed-charge may be achieved by strategies known in the art. Non-limiting examples of the fixed-charge include a sulfonium ion, a quaternary 20 alkylammonium or a quaternary alkylphosphonium ion. Analysis of the amino acid, peptide or protein ion may be performed by tandem mass spectrometry. The tandem mass spectrometer may be equipped with electrospray ionization (ESI) or matrix assisted laser desorption ionization (MALDI) interfaces to transfer the protein or peptide ion from solution into the gas-phase. Mass analysers that are applicable to tandem 25 mass spectrometry fall into two basic categories: tandem-in-space and tandem-in-time. Combinations of these types also can be used. Tandem-in-space mass spectrometers have discreet mass analysers for each stage of mass spectrometry; examples include sector (commonly double focusing sector and "hybrid" combinations of sector and quadrupole analyser instruments), time of flight and triple quadrupole instruments, as well as "hybrid" 30 combinations of time of flight and quadrupole instruments. Tandem-in-time mass instruments have only one mass analyser, and each stage of mass spectrometry takes place in the same region, but is separated in time via a sequence of events. Examples of tandem in WO 2004/046731 PCT/US2003/036739 -17 time mass analysers include both two- and three-dimensional quadrupole ion trap and Fourier-transform ion cyclotron resonance (FT-ICR) mass spectrometers. The methods of the invention in certain embodiments also include one or more steps of protein extraction, protein separation, reduction and alkylation of cysteine disulfides 5 and/or digestion. The fixed-charge derivatization approach in the method of the present invention may be applied to the quantitation of differential protein expression based on the incorporation of suitable isotopic (e.g., 13 C, 1 5 N, 2fH) or structural labels to the fixed charge. The fixed-charge derivatization approach in the method of the present invention may be also applied to the 10 identification and quantitation of post translational modification status in proteins by incorporation of the fixed-charge derivative, via the #-elimination/Michael addition method for forming mass spectrometry stable derivatives of O-phosphorylated and O-glycosylated serine, or 0-phosphorylated and O-glycosylated threonine, for example, that is described in Meyer, H., Hoffnan-Posorske, E., Korte, H. and Heilmeyer, L.J. FEBS. Lett. 1986, 204, 61 15 66., the disclosure of which is incorporated herein by reference. The fixed-charge derivatization approach in the method of the present invention may also be applied to the identification and characterization of protein-protein interactions via incorporation of the fixed-charge derivative into a suitable cross-linking reagent. As already mentioned above, tandem mass spectrometry of the fixed-charge 20 derivatized peptides results in exclusive formation of a product ion upon dissociation that is characteristic of fragmentation at the fixed-charge site upon dissociation, thereby allowing specific identification of only those peptides containing the derivatization, without need for prior resolution or otherwise enrichment of the complex mixture prior to analysis. The method of the present invention, in one particular embodiment, will be described 25 below with reference to the selective identification and differential quantitation of peptides present at a range of different ratios and by the incorporation of isotopically labelled versions of the fixed-charge derivative, to yield a 'light' form (containing only natural isotopes), and a 'heavy' form (containing isotopic or structural labels incorporated into the substituent), with selective identification and differential quantitation of peptide levels performed by neutral 30 loss mode tandem mass spectrometry. Application of the general approach to the analysis of post translational modification status as well as selective identification of protein-protein interactions by cross-linking analysis are also described.
WO 2004/046731 PCT/US2003/036739 - 18 In preferred embodiments of the invention, the foregoing methods are performed with derivatization of amino acids, peptides or proteins using a substituted acetophenone, or a salt thereof, or a solvate thereof, having the following formula: O R1 XRCH C;Re R1 ' R2 R 2 1' R 'R 5
R
3 R, R4' R3 5 R4 X preferably is any halogen, sulfonic ester, perchlorate ester or chlorosulfonate. In certain embodiments, R 1
-R
5 are H, and R 1 ' - R6' are 12C. In other embodiments, the substituted acetophenone is an isotopically encoded substituted acetophenone, or a salt thereof, or a 10 solvate thereof, preferably in which at least one of, more preferably two or more, and more preferably still at least three of R 1
-R
5 are 2 H, and R 1
'-R
6 ' are "C, or in which R,-R 5 are H, and at least one of, preferably two or more, and more preferably at least three of R 1
'-R
6 ' are 13 C. In certain embodiments, at least one of RI-R 5 is a functional group containing an atom 15 other than hydrogen or carbon. Preferably the substituted acetophenone is water soluble. Certain water soluble molecules include the foregoing molecules in which R 1 is SO 2 H, and R 2
-R
5 are H, and R 1
'
R
6 ' are 1 2 C; R 1 is H, R 2 is SO 2 H, and R 3 -Rs are H, and R 1
'-R
6 ' are 1 2 C; R 1
.
2 are H, R 3 is
SO
2 H, and R4-R 5 are H, and R 1
'-R
6 ' are 12 C; R, is SO 3 H, and R2-R5 are H, and R 1
'-R
6 ' are 20 2 C; R1 is H, R2 is SO 3 H, and R 3
-R
5 are H, and Rl'-R6' are 12C; or R12 are H, R3 is SO 3 H, and R4-R5 are H, and R1'-R6' are 1C. Isotopically encoded water soluble substituted acetophenone molecules include the foregoing wherein R1 is SO 2 H, and at least one of, and preferably at least three of R2-R5 are 2H, and R1-'- R6' are 12 C; R 1 is SO 2 H, and R 2
-R
5 are H, and at least one of, and preferably at 25 least three of R '-- R6' are 13 C; R2 is SO 2 H, and at least one of, and preferably at least three of R. and R3-R5 are 2 H, and Ri'-R6' are 1 2 C; R 2 is SO 2 H, and R 1 and R 3 -Rs are H, and at least one of, and preferably at least three of R1'- R6' are 13 C; R3 is SO 2 H, and at least one of, and preferably at least three of RI-R 2 and R4-R5 are 2 H, and R 1 '- R6' are 12 C; R3 is SO 2 H, and R- WO 2004/046731 PCT/US2003/036739 -19
R
2 and R 4
-R
5 are H, and at least one of, and preferably at least three of R 1 '- R 6 ' are "C; R 1 is
SO
3 H, and at least one of, and preferably at least three of R 2
-R
5 are 2H, and R 1 '- R 6 ' are 1C;
R
1 is SO 3 H, and R 2
-R
5 are H, and at least one of, and preferably at least three of Ri'- R 6 ' are 1 3 C; R 2 is SO 3 H, and at least one of, and preferably at least three of R 1 and R 3 - R 5 are 2H, and 5 R 1 '- R 6 ' are 12 C; R 2 is SO 3 H, and R 1 and R 3
-R
5 are H, and at least one of, and preferably at least three of R 1 '- R 6 ' are 13 C; R 3 is SO 3 H, and at least one of, and preferably at least three of
R
1
-R
2 and R 4 - R 5 are 2H, and R 1 '- R 6 ' are 12C; or R 3 is SO 3 H, and R 1
-R
2 and 14-R 5 are H, and at least one of, and preferably at least three of R 1 '-are 1C. In these molecules, it will be understood that other combinations and amounts of the 10 listed substituents are possible and the invention embraces such combinations. According to another aspect of the invention, substituted acetophenones (or salts thereof, or solvates thereof) are provided that have the following formula: 0 R1 x CH C- R, ;. R1' R2 R2 X | 0 R ,' R 6 '
~R
5 14 R 3 R4 15 wherein X is a sulfonic ester, perchlorate ester or chlorosulfonate. Preferred substituted acetophenones are those in which R 1
-R
5 are H, and R 1
'-R
6 ' are 1 2 C. The invention also includes substituted acetophenones that are isotopically encoded substituted acetophenones, salts thereof, and solvates thereof Certain preferred substituted 20 isotopically encoded acetophenones are those in which at least one of, and preferably at least three of R 1
-R
5 are 2H, and R 1
'-R
6 ' are 12 C, or those in which R 1
-R
5 are H, and at least one of, and preferably at least three of R 1
'-R
6 ' are 13 C. The invention further includes water soluble substituted acetophenones, salts thereof, and solvates thereof, having the following formula: 25 WO 2004/046731 PCT/US2003/036739 -20 0 R1 X C' -1 R1 R2
CH
2 R' R Rs{ R 3 R1 4 3 R4 In some embodiments, X is any halogen, sulfonic ester, perchlorate ester or chlorosulfonate. In preferred embodiments, X is Br or I. 5 In other preferred embodiments, the water soluble substituted acetophenones are those in which R 1 is SO 2 H, R 2
-R
5 are H, and R 1 '- R 6 ' are 1 2 C; R 1 is H, R 2 is SO 2 H, R 3
-R
5 are H, and R 1 '- R 6 ' are "C; R1- 2 are H, R 3 is SO 2 H, Rh-R 5 are H, and R 1 '- R 6 ' are "C; R 1 is SO 3 H,
R
2
-R
5 are H, and R 1 '- R 6 ' are 12C; R 1 is H, R 2 is SO 3 H, R 3
-R
5 are H, and R 1 '- R 6 ' are "C; or
R
1
.
2 are H, R 3 is SO 3 H, R 4
-R
5 are H, and R 1 '- R 6 ' are "C. 10 The water soluble substituted acetophenone also can be isotopically encoded forms of those described herein, preferably wherein X is any halogen, sulfonic ester, perchlorate ester or chlorosulfonate. In particular, the isotopically encoded water soluble substituted acetophenones are those in which R 1 is SO 2 H, at least one of, and preferably at least three of R2 - R 5 are 2H, and Ri'-R6' are 1C; R 1 is SO 2 H, R 2
-R
5 are H, and at least one of, and 15 preferably at least three of R 1
'-R
6 ' are 13C; R 2 is SO 2 H, at least one of, and preferably at least three of R 1 and R 3
-R
5 are 2H, and R 1
'-R
6 ' are "C; R 2 is SO 2 H, R, and R 3
-R
5 are H, and at least one of, and preferably at least three of R 1
'-R
6 ' are 13C; R 3 is SO 2 H, at least one of, and preferably at least three of R1-R 2 and R 4 - R 5 are 2H, and R 1
'-R
6 ' are "C; R 3 is SO 2 H, R 1
-R
2 and R 4 - R 5 are H, and at least one of, and preferably at least three of R 1
'-R
6 ' are 13C; R 1 is 20 SO 3 H, at least one of, and preferably at least three of R 2 - R 5 are 2H, and R 1
'-R
6 ' are "C; R 1 is
SO
3 H, R 2
-R
5 are H, and at least one of, and preferably at least three of R 1
'-R
6 ' are 3C; R 2 is
SO
3 H, at least one of, and preferably at least three of R 1 and R3-R5 are 2 H, and Ri'-R 6 ' are "C; R 2 is SO 3 H, R, and R 3
-R
5 are H, and at least one of, and preferably at least three of R1' R6' are 3 C; R3 is SO 3 H, at least one of, and preferably at least three of Ri-R2 and R 4 - R5 are 25 2H, and R 1
'-R
6 ' are "C; R 3 is SO 3 H, R 1
-R
2 and R4-R5 are H, and at least one of, and preferably at least three of R 1
'-R
6 ' are 13C. In these molecules, it will be understood that other combinations and amounts of the listed substituents are possible and the invention embraces such combinations.
WO 2004/046731 PCT/US2003/036739 -21 The invention in another aspect also includes reagent kits for analysis of amino acids, peptides or proteins by mass spectrometry. The kits include one or more containers containing the substituted acetophenones described herein. Certain reagent kits for analysis of amino acids, peptides or proteins by mass 5 spectrometry of the invention include a container containing a substituted acetophenone, or a salt thereof, or a solvate thereof, having the following formula: 0 R1 x
,,R
2
CH
2
R
6 '
R
5 14 R4 10 wherein X is a halide, preferably Br or 1. Certain preferred substituted acetophenones in the kits are those in which R 1
-R
5 are H, and Ri'-R 6 ' are 1C. Other preferred substituted acetophenones in the kits are isotopically encoded substituted acetophenones, or salts thereof, or solvates thereof, preferably those in which at least one of, and preferably at least three of R1-R5 are 2H, and R 1
'-R
6 ' are 1 2 C, or in which R 1
-R
5 are H, and at least one of, and 15 preferably at least three of R2'-R 6 ' are 13C. The water soluble derivatives of the substituted acetophenones listed above also can be included in the kits. Instructions for use of the substituted acetophenones in the analysis of amino acids, peptides or proteins by mass spectrometry also may be included. The reagent kit further can contain one or more containers containing: cysteine 20 disulfide reducing agents, cysteine alkylating reagents, proteases or chemical cleavage agents, and/or solvents. The cysteine disulfide reducing agents preferably include dithiothreitol (DTT), 0-mercaptoethanol, tris-carboxyethyl phosphine (TCEP), and/or tributylphosphine (TBP). The cysteine alkylating reagents preferably include alkylhalides (e.g. iodoacetic acid, iodoacetamide), vinylpyridine and/or acrylamide. The proteases or 25 chemical cleavage agents preferably include trypsin, Endoproteinase Lys-C, Endoproteinase Asp-N, Endoproteinase Glu-C, pepsin, papain, thermolysin, cyanogen bromide, hydroxylamine hydrochloride, 2-[2'-nitrophenylsulfenyl]-3-methyl-3'-bromoindole
(BNPS
skatole), iodosobenzoic acid, pentafluoropropionic acid and/or dilute hydrochloric acid. The WO 2004/046731 PCT/US2003/036739 - 22 solvents preferably include urea, guanidine hydrochloride, acetonitrile, methanol and/or water. The invention also includes amino acids, or peptides comprising an amino acid, derivatized to include a side chain fixed-charge sulfonium ion, quaternary alkylannonium 5 ion or quaternary alkylphosphonium ion. Preferably the amino acid is derivatized using the substituted acetophenone described herein. In particularly preferred embodiments, the amino acid is derivatized using a substituted acetophenone, or a salt thereof, or a solvate thereof, having the following formula: 0 R1
CH
2
R
6 R2 I || R5{ R 3
R
5 4 ' 3 10 1 4 10 R4 wherein X is a halide, preferably Br or I. Certain preferred substituted acetophenones used to derivatize the amino acids or peptides are those in which RI-R 5 are H, and R 1 '-R' are 2c. Other preferred substituted acetophenones used to derivatize the amino acids or peptides are 15 isotopically encoded substituted acetophenones, or salts thereof, or solvates thereof, preferably those in which at least one of, and preferably at least three of R-R 5 are 2 H, and
R
1
'-R
6 ' are 1 2 C, or in which R 1
-R
5 are H, and at least one of, and preferably at least three of
R
1
'-R
6 ' are 13 C. The water soluble derivatives of the substituted acetophenones listed above also can be used to derivatize the amino acids or peptides. In preferred embodiments, the 20 amino acid derivative is isotopically encoded. The invention in another aspect provides methods for providing an internal standard in a mass spectrometer method comprising adding to a sample a predetermined quantity of the fixed charge derivatized amino acid or peptide described above. The present invention will now be described with reference to particular 25 embodiments, however, the method of the present invention is not limited to these particular embodiments.
WO 2004/046731 PCT/US2003/036739 -23 Brief Description of the Drawings Figure 1. Quadrupole ion trap tandem mass spectrometry of singly charged GAILMGAILA (SEQ ID NO:1) and its methionine side chain fixed-charge 5 derivative. (A) CID MS/MS of the [M+H]* ion. (B) CID MS/MS of the methionine side chain fixed-charge acetophenone (AP) sulfonium [M(AP)]* ion. (inset) CID MS 3 of the [M-CH 3 S(AP)]* product ion in Figure 1B. Figure 2. Quadrupole ion trap tandem mass spectrometry of doubly charged 10 GAILMGAILA (SEQ ID NO:1) and its methionine side chain fixed-charge derivative. (A) CID MS/MS of the [M+2H] 2 + ion. (B) CID MS/MS of the methionine side chain fixed-charge acetophenone (AP) sulfonium
[M(AP)+H]
2 + ion. (inset) CID MS 3 of the [M-CH 3
S(AP)]
2 + product ion in Figure 2B. 15 Figure 3. Quadrupole ion trap tandem mass spectrometry of singly charged GAILMGAILK (SEQ ID NO:2) and its methionine side chain fixed-charge derivative. (A) CID MS/MS of the [M+H]* ion. (B) CID MS/MS of the methionine side chain fixed-charge acetophenone (AP) sulfonium [M(AP)]* 20 ion. (inset) CID MS 3 of the [M-CH 3 S(AP)]* product ion in Figure 3B. Figure 4. Quadrupole ion trap tandem mass spectrometry of doubly charged GAILMGAILK (SEQ ID NO:2) and its methionine side chain fixed-charge derivative. (A) CID MS/MS of the [M+2H2+ ion. (B) CID MS/MS of the 25 methionine side chain fixed-charge acetophenone (AP) sulfonium
[M(AP)+H]
2 + ion. (inset) CID MS 3 of the [M-CH 3
S(AP)]
2 + product ion in Figure 4B. Figure 5. Quadrupole ion trap tandem mass spectrometry of the triply charged 30 acetophenone (AP) sulfonium ion derivative of GAILMGAILK (SEQ ID NO:2). (A) CID MS/MS of the [M(AP)+2H] 3 + ion. (inset) CID MS 3 of the [M
CH
3 S(AP)]'+ product ion.
WO 2004/046731 PCT/US2003/036739 - 24 Figure 6. Quadrupole ion trap tandem mass spectrometry of singly charged GAILMGAILR (SEQ ID NO:3) and its methionine side chain fixed-charge derivative. (A) CID MS/MS of the [M+H]* ion. (B) CID MS/MS of the methionine side chain fixed-charge acetophenone (AP) sulfonium [M(AP)]* 5 ion. (inset) CID MS 3 of the [M-CH 3 S(AP)]* product ion in Figure 6B. Figure 7. Quadrupole ion trap tandem mass spectrometry of doubly charged GAILMGAILR (SEQ ID NO:3) and its methionine side chain fixed-charge derivative. (A) CID MS/MS of the [M+2H] 2 + ion. (B) CID MS/MS of the 10 methionine side chain fixed-charge acetophenone (AP) sulfonium
[M(AP)+H]
2 + ion. (inset) CID MS 3 of the [M-CH 3
S(AP)]
2 + product ion in Figure 7B. Figure 8. Quadrupole ion trap tandem mass spectrometry of the triply charged 15 acetophenone (AP) sulfonium ion derivative of GAILMGAILR (SEQ ID NO:3). (A) CID MS/MS of the [M(AP)+2H] 3 * ion. (inset) CID MS 3 of the [M
CH
3 S(AP)]3+ product ion. Figure 9. (A) Energy resolved CID MS/MS of the doubly charged methionine side chain 20 fixed-charge acetophenone (AP) sulfonium [M(AP)+HJ2+ ion of GAILMGAILK (SEQ ID NO:2). (B) CID MS/MS product ion spectra of the
[M(AP)+H]
2 + ion obtained at a collision energy (laboratory frame) of 12V. (C) CID MS/MS product ion spectra of the [M(AP)+H] 2 + ion obtained at a collision energy (laboratory frame) of 22V. 25 Figure 10. Energy resolved CID MS/MS of the doubly charged [M+2H] 2 + ions of GAILAGAILK (SEQ ID NO:5) and GAILMGAILK (SEQ ID NO:2). Figure 11. (A) Total ion current trace from a 200 Rm I.D. capillary RP-HPLC - MS 30 analysis of 10 pmol of a reduced and S-carboxyamidomethylated tryptic digest of bovine serum albumin, following derivatization using bromoacetophenone. Individual mass spectra for two regions of the total ion current chromatogram, indicated by arrows, are shown in Figures 11 B and 11C.
WO 2004/046731 PCT/US2003/036739 -25 Figure 12. (A) Total ion current trace from a 200 ptm I.D. capillary RP-HPLC - neutral loss scan mode CID MS/MS analysis of 10 pmol of a reduced and S carboxyamidomethylated tryptic digest of bovine serum albumin, following 5 derivatization using bromoacetophenone. The peptides are IETMR (SEQ ID NO:9), ETYGDMADCCEK (SEQ ID NO:10), TVMENFVAFVDK (SEQ ID NO: 11), and MPCTEDYLSLILNR (SEQ ID NO:12). Individual neutral loss scan mode mass spectra for two regions of the total ion current chromatogram, indicated by arrows, are shown in Figures 12B and 12C. 10 Figure 13. (A) Mass spectrum and (B) neutral loss scan mode CID MS/MS spectra (neutral loss of 83.0 Da and 85.5 Da for the doubly charged do- and d 5 containing [M(AP)+H] 2 + ions, and neutral loss of 55.3 Da and 57.0 Da for the triply charged do- and d 5 - containing [M(AP)+2H] 3 * ions) of a 1.0 : 1.0 15 pmol/tL mixture of do and d 5 containing methionine side chain fixed-charge acetophenone (AP) sulfonium ion derivative of GAILMGAILR (SEQ ID NO:3). Figure 14. (A) Mass spectrum of a 1.0 : 0.1 pmol/[L mixture of do and d 5 containing 20 methionine side chain fixed-charge acetophenone sulfonium ion derivative of GAILMGAILR (SEQ ID NO:3). Expanded regions of the m/z range for the triply and doubly charged precursors are shown in panels (B) and (C) respectively. 25 Figure 15. (A) Summed neutral loss scan mode CID MS/MS spectrum of a 1.0 : 0.1 pmol/ L mixture of do and d 5 containing methionine side chain fixed-charge acetophenone sulfonium ion derivative of GAILMGAILR (SEQ ID NO:3). Individual neutral loss mass spectra for the triply and doubly charged precursors are shown in panels (B) and (C), respectively. 30 Figure 16. (A) Total ion current trace from a 75 pm I.D. capillary RP-HPLC - MS analysis of a 1.0 : 1.0 pmol mixture of do and d 5 containing methionine side chain fixed-charge acetophenone (AP) sulfonium ion derivative of WO 2004/046731 PCT/US2003/036739 - 26 GAILMGAILR (SEQ ID NO:3). (B) Summed mass spectra from the region indicated in panel (A). (C) Total ion current trace from a 75 [tm I.D. capillary RP-HPLC -neutral loss scan mode CID MS/MS analysis of a 1.0 : 1.0 pmol mixture of do and d 5 containing methionine side chain fixed-charge 5 acetophenone (AP) sulfonium ion derivative of GAILMGAILR (SEQ ID NO:3). (D) Summed mass spectra from the region indicated in panel (C). Embodiments of the Invention 10 Definitions Unless the context indicates otherwise, all technical and scientific terms used herein generally have the same meaning as commonly understood in the art to which the present invention belongs. "Fixed-charge", as used herein, includes any charge localised to a specific heteroatom 15 contained within the protein or peptide, or to a specific heteroatom contained within the derivatization reagent (e.g., in solution or in the gas-phase), by the attachment of any moiety. "Fixed charge derivatization", as used here, means the introduction of a fixed charge as defined above. For example, the fixed charge may be introduced either by introducing a neutral reagent to subsequently form the fixed charge at a specific site within the protein or 20 peptide, or by introduction of a reagent containing the fixed charge to a specific site within the protein or peptide. The fixed charge derivative thus formed preferably has a structure such that it allows the exclusive formation of a product ion upon dissociation that is characteristic of the fixed charge derivative. 25 "Protein", as used herein, means any protein, including, but not limited to peptides, enzymes, glycoproteins, hormones, receptors, antigens, antibodies, growth factors, etc., without limitation. Proteins may be endogenous, or produced from other proteins by chemical or proteolytic cleavage. Preferred proteins include those comprised of at least 15-20 amino acid residues. The term includes cross-linked proteins. 30 "Peptide" as used herein includes any substance comprising two or more amino acids and includes di-, tri-, oligo and polypeptides etc according to the number of amino acids linked by amide(s) bonds. Peptides may be endogenous, or produced from other peptides or WO 2004/046731 PCT/US2003/036739 - 27 proteins by chemical or proteolytic cleavage. Preferred peptides include those comprised of up to 15-20 amino acid residues. The term includes cross-linked peptides. When the amino acids are c'-amino acids, either the L-optical isomer or the D-optical isomer can be used. The L -isomers are generally preferred. For a general review, see, 5 [Spatola, A. F., in Chemistry and Biochemistry of amino acids, peptides and proteins. 1983, B. Weinstein, eds., Marcel Dekker, New York, p. 267.] The term "alkyl" is used herein to refer to a branched or unbranched, saturated or unsaturated, monovalent hydrocarbon. Suitable alkyl groups include, for example, structures containing one or more methylene, methine and/or methyne groups. Branched structures have 10 a branching motif similar to i-propyl, t-butyl, i-butyl, 2-ethylpropyl, etc. As used herein, the term encompasses "substituted alkyls," and "cyclic alkyl." "Substituted alkyl" refers to alkyl as just described including one or more substituents such as, for example, alkyl, aryl, acyl, halogen (i.e., alkylhalos, e.g., CF 3 ), hydroxy, amino, amide, alkoxy, alkylamino, acylamino, thioamido, acyloxy, aryloxy, aryloxyalkyl, ether, 15 ester, disulfide, mercapto, thia, aza, oxo, both saturated and unsaturated cyclic hydrocarbons, heterocycles and the like. These groups may be attached to any carbon or substituent of the alkyl moiety. Additionally, these groups may be pendent from, or integral to, the alkyl chain. For the purpose of covalent solid phase pre-enriclunent of fixed charge derivatives, the substituted alkyl group may be covalently attached to an insoluble bead or polymer, and 20 contain a chemical or photochemical cleavage site between the insoluble bead or polymer and the alkyl group. The tenn "aryl" is used herein to refer to an aromatic substituent, which may be a single aromatic ring or multiple aromatic rings which are fused together, linked covalently, or linked to a common group such as a methylene or ethylene moiety. The common linking 25 group may also be a carbonyl as in acetophenone. The aromatic ring(s) may include phenyl, naphthyl, biphenyl, diphenylmethyl and benzophenone among others. The term "aryl" encompasses "arylalkyl" and "substituted aryl." "Substituted aryl" refers to aryl as just described including one or more functional groups such as lower alkyl, acyl, halogen, alkylhalos (e.g. CF 3 ), hydroxy, amino, alkoxy, 30 alkylamino, acylamino, acyloxy, phenoxy, mercapto and both saturated and unsaturated cyclic hydrocarbons which are fused to the aromatic ring(s), linked covalently or linked to a common group such as a methylene or ethylene moiety The linking group may also be a WO 2004/046731 PCT/US2003/036739 -28 carbonyl such as in cyclohexyl phenyl ketone. The term "substituted aryl" encompasses "substituted arylalkyl." The term "arylalkyl" is used herein to refer to a subset of "aryl" in which the aryl group is attached to another group by an alkyl group as defined herein. 5 "Substituted arylalkyl" defines a subset of "substituted aryl" wherein the substituted aryl group is attached to another group by an alkyl group as defined herein. The term "acyl" is used to describe a ketone substituent, --(O)R, where R is alkyl or substituted alkyl, aryl or substituted aryl as defined herein. 10 Abbreviations CID Collision Induced Dissociation ESI Electrospray Ionization MALDI Matrix Assisted Laser Desorption Ionization MS Mass spectrometry 15 MS/MS Tandem mass spectrometry MS" Multistage tandem mass spectrometry, where n >2 Materials and Methods. 20 Materials The model synthetic peptides (GAILMGAILA (SEQ ID NO: 1), GAILMGAILK (SEQ ID NO:2), GAILMGAILR (SEQ ID NO:3), GAILAGAILA (SEQ ID NO:4), GAILAGAILK (SEQ ID NO:5) and GAILAGAILR (SEQ ID NO:6)) were obtained from 25 Auspep (Melbourne, Vic, Australia) and used directly without further purification. lodomethane, iodoethane, iodobenzene, iodomethylbenzene, d 5 -acetophenone and bromoacetophenone were purchased from Aldrich (Castle Hill, NSW, Australia). Polymer supported pyridyl bromide perbromide, iodoacetic acid and iodoacetamide were from Sigma (St. Louis, MI, USA). Methanol, and acetonitrile (Chromar grade) were purchased from 30 Mallinkrodt (Paris, KY, USA). Formic acid and acetic acid were obtained from BDH Laboratories (Poole, England). All solutions were prepared using deionised water purified by a tandem Milli-Q and Milli-RO system (Millipore, Bedford, MA., USA).
WO 2004/046731 PCT/US2003/036739 - 29 Synthesis of ds-bromoacetophenone. d 5 -bromoacetophenone was synthesized according to the method of Frechet et al [Frechet, J.M.J., Farrall, M.J. and Nuyens, L.J. J Macromol. Sci.-Chenz. 1977, All, 507 5 514.] Briefly, 1.25 mL of ds-acetophenone was added to 4.8 g Polymer supported pyridyl bromide perbromide (~3 meq Br3~ /g resin) in 30 mL methanol then allowed to react with stirring at room temperature for 4 hours. The resultant a-bromo ketone was obtained in pure form by filtration of the reaction mixture and evaporation of the solvent, followed by recrystallization prior to use. 10 Side chain fixed-charge derivatization of methionine containing peptides. Side chain fixed-charge sulfonium ion derivatives of methionine-containing peptides were produced by the addition of 10 AL of a IM solution of alkylating reagent to 100 pg of 15 peptide dissolved in 100 pL of aqueous 20% acetic acid containing 30% CH 3 CN. The reaction was allowed to proceed for 16 hours at room temperature after which the sample was diluted and then introduced to the mass spectrometer with no further purification. Mass Spectrometry 20 Mass spectrometric analysis was performed using either a (i) quadrupole ion trap (Finnigan-MAT model LCQ-DECA, San Jose, CA), (ii) quadrupole-time-of-flight (Micromass model Q-TOF2, Manchester, UK), or (ii) triple quadrupole (Finnigan model TSQ, San Jose, CA) mass spectrometer, all equipped with electrospray ionization interfaces. 25 Quadrupole Ion Trap Mass Spectrometry. Samples (10 pmol/IpL in 50:50:1 H 2 0:CH 3 CN:acetic acid) were introduced to the mass spectrometer at 2 pL/min. The ESI conditions were optimized to maximize the 30 intensity of the ion of interest. Typical conditions were: spray voltage -5 kV, Nitrogen sheath gas, 30 psi, heated capillary temperature 150'C. MS/MS and MS 3 experiments were performed on mass selected ions using standard isolation and excitation procedures. All spectra collected were the average of 10 scans.
WO 2004/046731 PCT/US2003/036739 -30 Quadrupole-time-of-flight mass spectromety. Spectra were acquired using the V-optics mode of the time-of-flight mass analyser 5 following infusion of samples (same concentration as solution composition as for ion trap experiments) at 1 yL/min. Electrospray interface conditions were optimized to maximize the intensity of the ion of interest. Typical conditions were: spray voltage, -4.5 kV; nitrogen source gas, 1 psi; cone gas, 100 (arbitrary units); source temperature, 50 C; desolvation temperature, 150'C; cone voltage, 50V (singly charged ions), 30V (doubly charged ions) and 10 20V (triply charged ions). Energy resolved CID MS/MS experiments were automatically acquired on an isolated isotopic envelope precursor ion population to allow determination of product ion charge states, using argon as the inert collision gas at a pressure of 10 psi. The collision energy was ramped from 4 to 58 V in 1V increments for singly charged ions, and from 4 to 20 V in 1V increments and from 20 to 46V in 2V increments for doubly and triply 15 charged ions. Spectra obtained for each collision energy value are the average of 10 scans. Triple Quadrupole Mass Spectrometry. Samples (1 pmol/pL in 50:50:1 H 2 0:CH 3 CN:acetic acid) were introduced to the mass 20 spectrometer by a home built nano-electrospray ionization source at a flow rate of 200 nL/min. The spray voltage was maintained at-1.8 kV. The heated capillary temperature was 150'C. The argon collision gas pressure was maintained at 1.5 mtorr. The instrument was operated under unit resolution conditions. Neutral loss mode MS/MS scans (neutral losses of 83 and 85.5 Da for do- and d 5 - containing doubly charged ions, and 55.3 and 57 Da for do 25 and d 5 - containing triply charged ions) were performed at collision energies of 18V and 13V, respectively. Product ion CID MS/MS spectra of peptide ions selectively identified by neutral loss scans were then acquired at 18V and 3 1V, and 13V and 18V for doubly and triply charged ions, respectively. All spectra shown are the average of 20 scans. Capillary RP-HPLC was performed using (i) a column (200 ptm I.D. x 150 mm O.D 30 fused silica), packed with Brownlee RP-300, 7pim dimethyloctyl silica, developed at a flow rate of 3.6 piL mini using a linear 60 minute gradient from 0-1 00%B, where solvent A was 0. 1M aqueous acetic acid, and solvent B was 0. 1M aqueous acetic acid containing 60% acetonitrile, or (ii) a column (75 pm I.D. x 150 mm O.D fused silica), packed with Vydac C4, WO 2004/046731 PCT/US2003/036739 - 31 developed at a flow rate of 200 nL min-' using a linear 60 minute gradient from 0-100%B, where solvent A was 0.1M aqueous acetic acid, and solvent B was 0.1M aqueous acetic acid containing 60% acetonitrile. 5 Results and Discussion Identification ofproteins by CID MS/IMS of their fixed-charge derivative containing peptides. 10 Side chain fixed-charge derivatization of peptides containing methionine has been employed here to demonstrate the general strategy for selective identification and quantitation of peptides by selective CID MS/MS dissociation. The rationale for using fixed charge derivatives of methionine containing peptides is based on the idea that approaches for selective identification and quantitation of differential protein expression should be specific 15 for the detection of peptides containing amino acids that are rare, thereby limiting the number of peptides required to be analyzed, yet providing comprehensive coverage of proteins in the sequence databases. Release 40.0 of the SWISS-PROT database contains a total of 101,602 protein entries. The occurrence of methionine in this database is 2.37%. In contrast, the occurrence of cysteine, the amino acid targeted in the popular ICAT and related methods for 20 selective mass spectrometric detection and quantitation, is 1.63%. Of the individual database entries, 85.33% contain cysteine while 96.9% contain methionine. 98.35% coverage could be attained if proteins containing both amino acids were targeted for analysis, as only 1.65% of entries do not contain either methionine or cysteine. Therefore, approaches directed toward the identification of proteins containing methionine (or both methionine and cysteine) should 25 cover a significantly greater fraction of the proteins represented in the database than approaches targeting for cysteine alone. The main strategy devised here for the selective identification and subsequent differential quantitation of methionine-containing peptides, as well as for S-alkyl cysteine, by fixed-charge sulfonium ion derivatization (i.e., derivatives in which the side chain of 30 methionine or S-alkyl cysteine has been alkylated) and tandem mass spectrometry, are outlined below. Comparable strategies for the selective identification and differential quantitation of cysteine containing peptides, as well as a strategy for the selective WO 2004/046731 PCT/US2003/036739 - 32 identification and differential quantitation of tryptophan (1.21% occurrence) containing peptides, via sulfonium or quaternary alkyl fixed charge derivatization are also discussed. Fixed-charge sulfonium ion derivatization ofpeptides containing methionine. 5 Identification of methionine-containing peptides via CID of its sulfonium ion fixed charge derivative is attractive for several reasons. Methionine is a rare amino acid (see above). The chemistry and biological applications of sulfonium ions are well documented, giving precedent for developing reagents to form these ions in applications involving specific 10 derivatization of peptides [Stirling, M.J. and Patai, S. The Chemistry of the Sulfonium Group, Wiley: New York, 1981.; Lundblad. Techniques in Protein Modification, Chapter 8 "The Modification of Methionine" CRC Press. Florida, 1995.; Liu T.-H. "The role of Sulfur in Proteins" In: The Proteins. Volune III, Neurath, Hill and Boeder. Eds., Academic NY, 1977, Chapter 3, pp 239-402.; Gundlach, H.G. Moore, S, and Stein, W.H. J Biol. Chem., 1959, 15 234, 1761-1764.; Lawson, W.B. Gross, E., Foltz, C.M. and Witkop, B. J Am. Chem. Soc., 1961, 83, 1509-15 10.; Lawson, W.B. Gross, E., Foltz, C.M. and Witkop, B. J. Am. Chem. Soc., 1962, 84, 1715-1718.; Gross E. Methods Enzymology 1967, 11, 238-255.; Degen, J. and Kyte, J Anal. Biochem. 1978, 89, 529-539.; Kyte, J., Degen, J. and Harkins, R.N. Methods Enzymology 1983, 91, 367-377.]. The alkylation of methionine residues in proteins and 20 peptides is specific at low pH regardless of the alkylating reagent used [Lundblad. Techniques in Protein Modification, Chapter 8 "The Modification of Methionine" CRC Press. Florida, 1995.; Liu T.-H. "The role of Sulfur in Proteins" In: The Proteins. Volume III, Neurath, Hill and Boeder. Eds., Academic NY, 1977, Chapter 3, pp 239-402.]. The reason for this selectivity is that the rate of alkylation of the thioether sulfur atom of the methionine side 25 chain is virtually independent of pH (even down to pH1) whereas the reactivity of all other nucleophilic functional groups (e.g. cysteine, lysine and histidine residues) decreases at low pH (due to protonation). Previously, sulfonium ions have been exploited to: (a) tag methionine sites in peptides and proteins [Lundblad. Techniques in Protein Modification, Chapter 8 "The Modification of Methionine" CRC Press. Florida, 1995.; Liu T.-H. "The role 30 of Sulfur in Proteins" In: The Proteins. Volume III, Neurath, Hill and Boeder. Eds., Academic NY, 1977, Chapter 3, pp 239-402.; Gundlach, H.G. Moore, S, and Stein, W.H. . Biol. Chem., 1959, 234, 1761-1764.; Rogers, G.A. Shaltiel, N, and Boyer, P.D. J. Biol. Chem., 1976, 251, 5711-571.7.; Toennies, G. and Kolb, J.J. J. Am. Chem. Soc., 1945, 67, 849- WO 2004/046731 PCT/US2003/036739 - 33 851.; Lavine, T.F. Floyd, N.F, and Cammaroti, M.S. J. Am. Chem. Soc., 1945, 67, 849-85 1.; Lawson, W.B. and Schramm, H.J. J. Am. Chem. Soc., 1962, 84, 2017-2018.]; (b) to induce cleavage of peptide bonds adjacent to methionine residues via the solution phase fragmentation of the sulfonium ions. [Lawson, W.B. Gross, E., Foltz, C.M. and Witkop, B. J 5 Am. Chem. Soc., 1961, 83, 1509-1510.; Lawson, W.B. Gross, E., Foltz, C.M. and Witkop, B. J. Am. Chem. Soc., 1962, 84, 1715-1718.; Jendrek, J.P., Barker, R.H. and Altschul, A.M. Biochim Biophys Acta, 1967, 136, 409-411.; Gross E. Methods Enzymology 1967, 11, 238 255.] The most famous and widely studied system is the cyanogen bromide reaction [Gross E. Methods Enzymology 1967, 11, 238-255.], although other alkylating reagents have been 10 used to similar effect [Tang, J.R. and Hartley, B.S. Biochem. J. 1967, 102, 593.; Tang, J.R. and Hartley, B.S. Biochem. J. 1970, 118, 611.]. Note that related reactions have been observed in peptide syntheses [Gain, M., Lloyd-Williams, P., Albericio, F. and Giralt, E. Tetrahedron Lett. 1994, 35, 175-178.], as an autolytic protein cleavage reaction [Taylor, KL.; Pohl, J; Kinkade, J.M., J. Biol. Chem. 1992, 267, 25282-25288.], in the purification of 15 peptides [Tang, J.R. and Hartley, B.S. Biochem. J. 1967, 102, 593.; Degen, J. and Kyte, J. Anal. Biochem. 1978, 89, 529-539.; Kyte, J., Degen, J. and Harkins, R.N. Methods Enzymology 1983, 91, 367-377.; Weinberger, S.R.; Viner, R.I. and Ho, P. Electrophoresis. 2002, 23, 3182-3192.], and to transform methionine residues to homocysteine residues [Chassaing, G; Lavielle, S; Marquet, A. J. Org. Chem. 1983, 48, 1757-60.]. Preliminary 20 experiments involving sulfonium ions derivatives of methionine have been performed, using substituted alkyl halides R 1 X (where X is the halogen and R is the substituted alkyl group) [O'Hair, R.A.J. and Reid, G.E. Eur. Mass Spectrom. 1999, 5, 325-334.]. The reagent used in these preliminary studies reacts specifically with the side chain of methionine in solution under acidic pH conditions to form a stable side chain sulfonium ion derivative ( 25 CH(CH 2
)
2
S(CH
3
)R
1 *) (step (i) of Scheme 1). Under these conditions, the reagent does not react with the free thiol of cysteine containing peptides. The reagent may however, be used for derivatization of cysteine-containing peptides after initial S-alkylation of the cysteine thiol side chain under basic pH conditions to yield the derivative (-CHCH 2
SR
2 ) (step (i) of Scheme 2). Further alkylation of these S-alkyl cysteine containing peptides under identical 30 conditions to those employed for alkylation of methionine can then be used to form stable sulfonium ion derivatives of cysteine (-CHCH 2
SRR
2 *) [Foti, S., Saletti, R. and Marletta, D. Org. Mass Spectrom. 1991, 26, 903-907.; Lapko, V.N., Smith, D.L. and Smith, J.B. Mass Spectrom. 2000, 35, 572-575.], provided that the R 2 substituent used for the initial cysteine WO 2004/046731 PCT/US2003/036739 - 34 alkylation enhances sulfonium ion stability (i.e., the charge is stable) (step (ii) of Scheme 2). Importantly, conditions could be chosen so that the initial S-alkyl cysteine derivative is not stable to sulfonium ion formation, allowing specific derivatization of methionine containing peptides in cases where cysteines have been previously reduced and S-alkylated to enable 5 efficient proteolysis during sample preparation. Neutral loss mode scan mode CID tandem mass spectrometry (MS/MS) scans, or post data acquisition neutral loss product ion detection software following conventional CID MS/MS, can be used to selectively identify methionine and cysteine containing peptides via the characteristic loss of CH 3
SR
1 from methionine and
R
1
SR
2 from cysteine, with formation of [M+nH-CH 3
SR
1 ]"**)+ and [M+nH-R 1
SR
2 ](n+D 10 product ions, respectively. Methionine and cysteine containing peptides can therefore be identified individually, through the use of an R 2 group that does not correspond to -CH 3 , (i.e., the side chain alkyl group of methionine) or simultaneously, via initial alkylation of cysteine where the R 2 group corresponds to -CH 3 .
H
3 C\ H 3 C R S S+
CH
2 CH2 () R X (iii) CID 0 CH 2 0 CH 2 - [M-nH-CH 2 SR]"* 11 1 H (ii) MS 11 H -CHSR 1 rC NCH, N C N CH'CN H H j 15 [M] [M+nH+R1("+)+ Scheme 1 SH R 2 , R 2
/R
1 I S S+ 0 CH 2 (i) O CH 2 O(ii) RCX (iv) CID CN, CH N H || 1 H '[M+nH -R2SR,]("*)+ N C- I CN"CH 'CN (iii)MS C N"CH'C N -R2SR, O H H 00 [M] [M+nH+R] Scheme 2 20 In order for this derivatization strategy to be successful, both the solution phase and gas phase chemistries of the resultant sulfonium ions must be considered. These solution and gas phase chemistries, as well as characterization of the fragmentation behaviour of the fixed charge sulfonium ion derivatives of methionine containing peptides have been examined 25 using a series of model peptides designed to resemble those resulting from tryptic digestion, WO 2004/046731 PCT/US2003/036739 -35 i.e., the class of peptides most commonly employed for protein identification by tandem mass spectrometry. The amino acid sequences of the model peptides (GAILX 1
GAILX
2 (SEQ ID NO:7)), where X 1 is either alanine or methionine, and X 2 is either alanine, lysine or arginine), were chosen to allow comprehensive examination of the effects of charge state and amino 5 acid composition on the observed fragmentation behaviour. Solution phase requirements for sulfonium ion formation: Reactions should be easy to perform using readily available reagents. Many of the 10 potentially useful reagents are commercially available. Other potentially useful reagents may be synthesized by simple methods described in the literature. Importantly, the ability to incorporate a suitable isotopic label into the selected derivative is an important consideration in allowing subsequent quantitation of differential protein expression using the same approach. Thus, isotopically labelled derivatives of these reagents should be commercially 15 available or readily synthesized. For example, isotopically labeled d 5 -bromoacetophenone has been readily prepared here from d 5 -acteophenone using the brominating reagent, poly(4 vinylpyridinium tribromide), in a one-step process (see Materials and Methods). Reactions should be fast, proceed to completion and the resultant sulfonium ions must be chemically and thermally stable in solution [Stirling, M.J. and Patai, S. The Chemistry of the Sulfonium 20 Group, Wiley: New York, 1981.; Stirling C.J.M. Sulfonium Salts in Organic Chemistry of Sulfur Oae, S. Ed. Plenum Press NY 1977, Chapter 9, pp 473-525.; Capozzi, G, and Modena, G. Stud. Org. Chem. 1985, 19, 246-298.]. Using the model peptide GAILMGAILK (SEQ ID NO:2), a range of potential alkylating reagents for methionine side chain fixed-charge sulfonium ion formation, iodomethane, iodoethane, iodobenzene, iodomethylbenzene, 25 iodoacetic acid, iodoacetamide and bromoacetophenone have been evaluated. The peptide was reacted for 4, 8, 16, and 32 hours with a 100 fold molar excess of each alkylating reagent then analysed by mass spectrometry to determine the extent of reaction. After 16 hours, only the acetophenone sulfonium ion derivative had reacted to completion [March J. "Advanced Organic Chemistry", 4th Ed., Wiley, New York, 1992, pp 343.; Halvorsen, S. J Chem. Soc. 30 Chem Comm, 1978, 327.; Yoh, L. Tetrahedron Leti, 1988, 29, 4431]. The ethyl derivative was found to have the least reactivity and stability in solution (only 1% reaction after 16 hours). All other reagents ranged in reactivity between these two extremes. The acetophenone derived sulfonium ion of the peptide was found to be stable in solution for several weeks WO 2004/046731 PCT/US2003/036739 -36 without significant degradation. As free SH groups of cysteine residues have been shown to effect the reversal of the alkylation of methionine residues [Scheitjter, A. and Aviram, I. FEBS Lett 1972, 21, 293-296.], and transfer of a methyl group from a sulfonium salt to a free sulfhydryl acceptor is a well known possible side reaction [Toennies, G. and Kolb, J.J. J Am. 5 Chem. Soc., 1945, 67, 1141-1144.; Naider, F. and Bohak, Z. Biochemistry 1972, 11, 3208 3211.; Cantoni, G.L. Comp. Biochem. 1960, 1, 18 1.], the commonly employed step involving S-alkylation of cysteines prior to proteolysis may be a useful but necessary reaction. Gas phase requirements for sulfonium ion fragmentation 10 All sulfonium ions examined, including those with poor reactivity in solution, were found to be stable in the gas-phase so that they could be mass selected and manipulated for MS/MS experiments. All ions were found to give a unique fingerprint by fragmenting via the neutral loss of CH 3
SR
1 (see Scheme 1). The loss of CH 3
SR
1 from fixed-charge derivatives has been 15 observed previously in small model systems [Reid, G.E., Simpson, R.J. and O'Hair, R.A.J. J Am. Soc. Mass Spectrom. 2000, 11, 1047-1060.; O'Hair, R.A.J. and Reid, G.E. Eur. Mass Spectrom. 1999, 5, 325-334.], peptides [Foti, S., Saletti, R. and Marletta, D. Org. Mass Spectrom. 1991, 26, 903-907.] and proteins [Lapko, V.N., Smith, D.L. and Smith, J.B. Mass Spectrom. 2000, 35, 572-575.]. Note that these gas phase reaction have solution phase 20 analogies as discussed above. Side reactions such as charge migration via intramolecular proton transfer, alkyl group transfer, as well as breaking of bonds to give stable alkyl cations or alkenes must be avoided [O'Hair, R.A.J., Freitas, M.A., Gronert, S., Schmidt, J.A.R. and Williams, T.D. J. Org. Chem., 1995, 60, 1990-1998.; Mudd, S.H. et. al. Biochem, 1966, 5, 1653.; Deakyne, C.A. et. al. J. Mol. Struct. 1999, 485/486, 33-41.; Markham, G.D. and Bock, 25 C.W. JPhys Chem, 1993, 97, 5562-5569.; Katritzky, A.R. et. al. Int JMass Spectrom, 1997, 165/166, 577-583.; Buckley, N. et. al. J Org. Chem. 1996, 61, 2753-62.; Mestdagh, H. et. al. Org. Mass Spectrom. 1986, 21, 321-327.; Mestdagh, H. et. al. Org. Mass Spectrom. 1988, 23, 246-251.]. Proton transfer (Scheme 3, pathway 1) would give an sulfur ylide. Although the acidity of the (CH 3
)
3 S* ion (or the proton affinity of the conjugate base (CH 3
)
2
S(CH
2 )) is 30 unknown, sulfur ylides may be formed in solution by deprotonating sulfonium ions with bases such as hydroxide ion (HO-) or alkyl lithium reagents [Trost, B.M. and Melvin, L.S. "Sulfur Ylides: Emerging Synthetic Intermediates". Academic Press: NY 1975.] Alkyl transfer (Scheme 3, pathway 2) is not likely to be a problem since intramolecular SN 2 rarely WO 2004/046731 PCT/US2003/036739 -37 occur due to high energy transition states (co-linear geometry) [O'Hair, R.A.J., Freitas, M.A., Gronert, S., Schmidt, J.A.R. and Williams, T.D. J. Org. Chem., 1995, 60, 1990-1998.]. The solution phase methyl cation affinities (MCAs) of some biological sulfur systems have been studied [Mudd, W.A., Klee P.D. Biochem, 1966, 5, 1653.]. Unfortunately, gas phase MCAs 5 are limited to experimental and theoretical estimates for dimethylsulfide [Deakyne, C. A.; Knuth, D. M.; Meot-Ner, M.; Breneman, C. M.; Liebman, J. F. J. Mol. Struct. 1999, 485/486, 33-41] and theoretical estimates of some other sulfides [Markham, G.D. and Bock, C.W. JPhys Chen, 1993, 97, 5562-5569.]. The bond strengths of other breaking alkyl groups appear to be unknown in the both the condensed phase and gas phase. There have however, 10 been several studies on the fragmentation reactions of sulfonium ions as studied by mass spectrometry [Katritzky, AR Shipkova, PA Watson, CH Eyler, JR. and Kevill DN. Int JMass Spectrom, 1997, 165/166, 577-583.; Buckley, N; Maltby, D; Burlingame, A L.; Oppenheimer, N J. J. Org. Chen. 1996, 61, 2753-62; (d) Mestdagh, H; Morin, N; Rolando, C. Org. Mass Spectrom. 1986, 21, 321-327.; Mestdagh, H; Morin, N; Rolando, C. Org. Mass 15 Spectrom. 1988, 23, 246-5 1.]. Scheme 3 (pathway 3) is highly unlikely given that the least stable carbocation is formed (CH 3 *). Problems associated with Scheme 3 (pathways 4 and 5) can be avoided by choosing R 1 groups which do not yield stable ions (e.g. avoid benzyl groups etc) and which do not form an alkene (e.g. avoid ethyl or substituted ethyl groups and higher homologues), respectively. Differentiation between the fragmentation reactions of 20 sulfonium ions of methionine and S-alkyl cysteine residues (i.e., loss of CH 3
SR
1 from methionine (Scheme 1) versus loss of R 1
SR
2 from cysteine (Scheme 2) is straightforward. T11is can be achieved here by using an alkyl group R 2 which is NOT a methyl group (i.e. R 2 #
CH
3
).
WO 2004/046731 PCT/US2003/036739 -38
H
2 C~ R 1 -+
CH
2 4 H
H
3 C R 1 0 CH, II.s I Ii I H + I -r-rC" N ",CHIC. N
CH
2 H 1j I - 0 o CH 2 || | H + N CH C N R H
CH
2 + CH 3 0 H 0 CH, 2 || | H C N CH C N H L 0 -CH3* 3 [M-CH 3 ]a
H
3 C ,S Ri -R 3 45 [M-R 1 0 CH,
I-H
3 C"' + O H 2 HCs 1 1 H -C' N CH'C N -[R 1 -H]o H2 +H H |C o 5 11 o J-rCI N -CHC-' NA H 1 0 Scheme 3 General structure of the sulfoniuin ion reagents suitable for use in the work described here. 5 Of the reagents examined here to date, the a-carbonyl containing alkylating reagents, iodoacetic acid, iodoacetamide and bromoacetophenone are those that exhibit the favourable solution and gas phase reaction properties discussed above. Of these, acetophenone allows ready incorporation of a suitable isotopic label (for example, via synthesis of d 5 10 bromoacetophenone or "C6-bromoacetophenone), to enable subsequent quantitation, and has therefore been the reagent used from here on. However, any reagent having the general structure XR, where X is any suitable leaving group consisting of, for example, halides, sulfonic esters, perchlorate esters or chlorosulfonates [March J. "Advanced Organic WO 2004/046731 PCT/US2003/036739 -39 Chemistry", 4th Ed., Wiley, New York, 1992, pp 352-357], and R is any substrate with a carbon adjacent to the leaving group, such as alkyl, allyl or ac-carbonyl groups, with the exception of those indicated above, and where an isotopic or structural label can be incorporated to allow its use for differential quantitation, would be of interest. 5 Examination of thefragmentation behaviour offixed-charge sulfonium ion derivatives of methionine: CID MS/MS and MS 3 in a quadrupole ion trap mass spectrometer. 10 The product ion spectra obtained by CID MS/MS of the singly and doubly charged ions of the unmodified and the acetophenone side chain derivatized sulfonium ion methionine containing peptides of GAILMGAILX (SEQ ID NO:8), where X is either alanine, lysine or arginine, and the triply charged lysine and arginine containing ions of the acetophenone side 15 chain derivatized sulfonium ion methionine containing peptides, in a quadrupole ion trap, are shown in Figures 1 to 8. The singly and doubly charged forms of the unmodified peptide ions, ionized by the addition of protons only, were all observed to form product ions resulting primarily from dissociations along the peptide backbone, with b- and y-type ions dominating. In contrast, the acetophenone side chain sulfonium methionine containing peptide ion 20 derivatives each contain a single fixed-charge resulting from the sulfonium ion, with 0, 1 or 2 protons making up the balance for the singly, doubly or triply charged ions, respectively. For these ions, in each case the overwhelmingly dominant fragmentation pathway following CID MS/MS of the isolated precursor ion corresponded to selective cleavage of the a-CH 2 -S bond at the site of the fixed-charge on the methionine side chain, to yield the neutral side chain loss 25 of S-methyl acetophenone (CH 3
SCH
2
COC
6
H
5 ) (labelled as -CH 3 S(AP)). Therefore, it is important to note that product ions formed by neutral loss from the side chain have the same charge state (and number of ionizing protons) as that of the precursor ion. The triply charged ion examples were the only partial exceptions, where some charged side chain loss of protonated S-methyl acetophenone (CH 3
SCH
2
COC
6
H
5 +H*), labeled as (-CH 3 S(AP)+H*) in 30 Figures 5A and 8A, (with the corresponding protonated S-methyl acetophenone
(CH
3 S(AP)+H*) ion also labeled), was also observed. It is expected that the charged side chain loss from the triply charged precursor ions occurs by transfer of a proton from the neutral loss product ion following dissociation of the precursor ions covalent -CH 2 -S bond, WO 2004/046731 PCT/US2003/036739 - 40 but before dissociation of the resultant ion-molecule complex to the individual products, due to columbic repulsion. A potential mechanism, with two possible pathways for the neutral loss of
CH
3
SCH
2
COC
6
H
5 from the fixed-charge derivatives is shown in Scheme 4. 5 83 Da
H
3 C CH 2 C O CHCH, | 1| -CH 3
SCH
2 COCH, 11 I H H CH 2 0 C NCH C 0 HC u l |HH Pathway 1 H C N C N O H Pathway 2a Pathway 2a CH, CH2 IC . H - H 3
SCHI
2
CO
6
H
5 C CH 1, 1o--3SHC C QCH N N C 0 CH 2 Pathway 2 1 1 H Ho C C N -- Nu, H Pathway 2b Pathway 2b CHI O CH2 H N C NuH 0 Scheme 4 Pathway 1 of Scheme 4 involves direct proton transfer by an intramolecular nucleophile 10 (Nu), such as the basic side chains of arginine or lysine, the N-terminal amino group or backbone amide carbonyl groups, with elimination of the neutral to yield a novel amino acid derivative, 3-amino- 1 -butenoic acid (vinyl glycine), with a residue mass of 83 Da. Alternatively, the fragmentation may be directed by a nucleophilic attack mechanism, whereby an adjacent amide carbonyl attacks the side chain to facilitate the neutral loss, 15 yielding a cyclic product (shown in Pathway 2 of Scheme 4 as the 6 membered cyclic product WO 2004/046731 PCT/US2003/036739 - 41 formed from nucleophilic attack by the amide carbonyl on the N-terminal side of the modified methionine residue). This mechanism is consistent with the growing recognition that nucleophilic attack processes are the major processes involved in the fragmentation of protonated peptide ions [O'Hair, R.A.J. J. Mass Spectrom. 2000, 35, 1377-1381. Schlosser, 5 A. and Lehmann, W.D J. Mass Spectron. 2000, 35, 1382-1390]. Intramolecular transfer of the acidic proton present in either the precursor ion (Pathway 1), or the product ion formed following methionine side chain cleavage (Pathway 2A or 2B) would result in "mobilization" of this proton, thereby allowing it to be involved in subsequent fragmentation reactions according to the mobile proton model developed for rationalizing the fragmentation of 10 protonated peptide ions [Wysocki, V.H., Tsaprailis, G., Smith, L.L. and Breci, L.A. J. Mass Spectrom. 2000, 35, 1399-1406.]. Note that if the reaction proceeds via pathway 2 of Scheme 4, the amide bond incorporated into the cyclic product would not be amenable to subsequent cleavage. However, ring opening of the cyclic product via intramolecular transfer (Pathway 2A) would yield the acyclic amino acid derivative, thereby allowing subsequent 15 fragmentation of the product ion to occur on both N- and C-terminal sides of the newly formed amino acid derivative. Given that the sulfonium ion derivatized peptides fragment almost exclusively via the neutral loss of CH 3
SCH
2
COC
6
H
5 to fonn a charged product ion, the neutral loss product ion for each peptide ion was mass selected and then subjected to a further stage of tandem mass 20 spectrometry (MS 3 ) to generate additional structural information to obtain evidence for the preferred fragmentation pathway and also to allow its identification by sequence analysis. The spectra obtained following CID MS 3 of each of the neutral loss product ions discussed above are shown in the insets to Figures 1B to 4B, the inset to Figure 5A, the insets to Figures 6B and 7B, and the inset to Figure 8A. Similar to that of the singly and doubly 25 protonated unmodified peptide ions, MS 3 of the [M+nH-CH 3
SCH
2
COC
6
H
5 s](+I 1 ) product ions formed by neutral loss from the fixed-charge derivatives resulted in the formation of products resulting primarily from dissociations along the peptide backbone, with b- and y-type ions dominating. A direct comparison of the MS/MS product ion spectra of the unmodified peptides and the MS3 spectra of the [M+nH-CHSCH 2
COC
6 H]J0 1 * product ions reveals a 20 number of interesting points; (i) in most cases, cleavage of the amide bonds on either side of the modified vinyl glycine residue formed by loss of the sulfonium ion side chain was observed following MS 3 , indicating that the fragmenting species did contain an acyclic vinyl glycine residue WO 2004/046731 PCT/US2003/036739 - 42 formed either by pathway 1 of Scheme 4 or by ring opening of the cyclic product formed by pathway 2 of Scheme 4; (ii) for MS 3 of the singly charged methionine side chain fixed-charge peptide ions (insets to Figures 1B, 3B and 6B), more extensive "sequence ion" fragmentation was observed 5 following MS 3 of the [M+nH-CH3SCH2C6Hs]("*0* product ions, with less "non sequence" ion cleavages such as loss of water or ammonia and formation of a-type product ions, compared to that obtained by MS/MS of the unmodified methionine containing peptides. This observation may be readily explained by the mobile proton model for peptide fragmentation [Wysocki, V.H., Tsaprailis, G., Smith, L.L. and Breci, 10 L.A. J. Mass Spectrom. 2000, 35, 1399-1406.]. In the case of the singly charged sulfonium ion peptide derivatives, the acidic proton formed upon loss of the side chain can be readily mobilized onto the peptide backbone by the direct proton transfer mechanism for cleavage in pathway 1 of Scheme 4, or by ring opening of the cyclic product ion shown in pathway 2A of Scheme 4, allowing subsequent fragmentation to 15 take place at sites along the backbone, whereas, in the case of the singly protonated ions formed directly by electrospray, where the ionising proton is initially located on a basic side chain such as arginine or lysine, significant energy to overcome the barriers to proton transfer must be initially supplied in order for subsequent fragmentation of the backbone to occur. As such, other processes involving the loss of small molecules from 20 the precursor and product ions such as water and ammonia, and formation of a-type product ions can therefore successfully compete with backbone cleavage. For the other peptides, the product ions formed by MS 3 from the doubly protonated lysine and arginine containing methionine side chain fixed-charge [M+nH
CH
3
SCH
2
COC
6 Hs]n'"*' MS/MS product ions (insets to Figures 4B and 7B) were 25 similar to those obtained from MS/MS of the unmodified doubly protonated peptide ions. However, MS 3 of the doubly protonated product ion formed from dissociation of the acetophenone modified GAILMGAILA (SEQ ID NO: 1) peptide (inset to Figure 2B) yielded less extensive products than those formed by MS/MS of the unmodified peptide, with b 9 and b 9 2 + ions dominating. This suggests that in this case, the proton 30 initially present in the precursor and the acidic proton formed by loss of the side chain from the doubly charged sulfonium ion derivative are not as "mobile" as those of the doubly protonated peptide where one proton is expected to reside on the amino terminal and one along the backbone; again consistent with an initial cyclic product formed from WO 2004/046731 PCT/US2003/036739 -43 the fixed-charge derivative. The triply protonated lysine and arginine containing methionine side chain fixed-charge [M+nH-CH3SCH 2
COC
6 H]("+)+ MS 3 product ions (insets to Figures 5A and 8A, respectively) were found to yield mainly y82+ product ions, with other y-type ions also evident at low abundance; 5 (iii) the collision energy required for MS 3 of the [M+nH-CH 3 S(AP)]"*+') product ions was consistently higher than that required for MS/MS of the unmodified peptide [M+nH]"* precursor ions for each of the charge states. This suggests that the fragmentation of the precursor ions formed by neutral loss have higher activation barriers compared to the precursors formed directly by electrospray ionization, and is again suggestive of the 10 initial formation of the cyclic product ion in pathway 2 in Scheme 4 from the fixed charge derivatives, with a necessity for ring opening prior to subsequent further dissociation. Energy resolved CID MS/MS and in a quadrupole-time-of-flight mass spectrometer. 15 In order to determine the relative energetics for the fragmentation processes observed from the fixed-charge derivatives compared to the unmodified methionine containing peptides as well as those containing an alanine residue at the equivalent position to methionine (alanine has the closest structure of the common acids to the vinyl glycine residue 20 formed by neutral loss from the fixed-charge derivatives and was therefore considered to be a better derivative than methionine for direct comparison of the relative energies required for fragmentation), and to obtain evidence for the structure of the resultant product ions, the fragmentation reactions of the sulfonium ion containing peptides described above were further examined by energy resolved CID in a quadrupole-time-of-flight (Q-TOF) mass 25 spectrometer. The energy resolved CID breakdown and appearance curves for the doubly charged sulfonium ion derivative of GAILMGAILK (SEQ ID NO:2) is shown in Figure 9A. The breakdown curve of the precursor [M(AP)+HJ2+ ion, the appearance/breakdown curve of the initial neutral loss product ion (-CH3S(AP)), as well as the appearance curves of the 30 individual product ions and that of the summed product ion abundances (sum of product ions) are indicated. It can be seen that there is essentially no overlap between the breakdown of the
[M(AP)+H]
2 + ion and the appearance of the individual product ions, indicating that these ions are not formed directly from the initial precursor. Rather, these ions are formed from the WO 2004/046731 PCT/US2003/036739 -44 initial neutral loss product ion (-CH 3 S(AP)). Representative product ion spectra taken under collision energies sufficient to obtain 70-90% reduction of the precursor ion abundance (12eV) and 70-90% reduction of the initial neutral loss (-CH 3 S(AP) product ion abundance (22eV) are shown in Figure 9B and C, respectively. The observed product ion spectra under 5 these "low" and "high" energy quadrupole CID conditions, respectively, were essentially identical to those observed in the quadrupole ion trap mass spectrometry MS/MS and MS 3 experiments, indicating that the time frame (millisecond versus microsecond time scales for the ion trap and quadrupole experiments, respectively), and mode of ion activation had no appreciable effect on the fragmentation process. Indeed, all of the peptides examined under 10 these "low" and "high" energy quadrupole CID conditions exhibited very similar product ion spectra compared to the equivalent ion trap derived MS/MS and MS 3 spectra (data not shown). Thus, the directed fragmentation of the fixed-charge derivatives gives the ability to use low energy CID conditions for formation of the initial neutral loss product ion and then to obtain a "pseudo MS3" spectrum under "high" energy CID conditions to obtain further 15 structural information. For comparison, the appearance curves for the formation of the summed product ion abundances from the doubly protonated unmodified methionine and alanine containing peptides are shown in Figure 10. A summary of the collision energy required to achieve 50% formation of product ions from each of the protonated unmodified peptides, and from the initial neutral loss product 20 ions from the fixed-charge derivatives is given in Table 1. These data can be used as a measure of the stabilities of the sulfonium ion derivative neutral loss product ions relative to their unmodified forms. The data indicates that the collision energies required for dissociation of the initial neutral loss product ions from the sulfonium ion derivatives are consistently higher than those of the alanine and unmodified methionine containing peptides, giving 25 further support for the formation of an initial cyclic product ion which is required to overcome a barrier to ring opening prior to subsequent dissociation (Scheme 4, pathway 2A).
WO 2004/046731 PCT/US2003/036739 - 45 Table 1. Collision Energy required to achieve 50% reduction in the abundance of the unmodified precursor ions, or the initial neutral loss product ions from the fixed-charge sulfonium ion derivatives, in a Q-TOF mass spectrometer. 5 Peptide Ion Charge State +1 +2 +3 Peptide Collision Energy (V) GAILAGAILA (SEQ ID NO:4) 21.7 8.1 NA GAILMGAILA (SEQ ID NO:1) 22.7 (+1.0)a 8.6 (+0.5)a NA GAILM(AP)GAILA (SEQ ID NO:1) 34.5 (+12.8)' 14.8 (+6.7)a NA GAILAGAILK (SEQ ID NO:5) 36.8 10.8 NA GAILMGAILK (SEQ ID NO:2) 38.4 (+1.6)a 11.4 (+0.6)a 11.4 GAILM(AP)GAILKb (SEQ ID NO:2) 43.0 (+6.2)a 21.5 (+10.7)a 11.6 GAILAGAILR (SEQ ID NO:6) 47.1 10.8 NA GATILMGAILR (SEQ ID NO:3) 48.6 (+1.5)a 11.6 (+0.8)a NA GAILM(AP)GAILRb (SEQ ID NO:3) 53.7 (+6.6)" 22.1 (+11.3)a 11.7 a Difference between the collision energy required for 50% dissociation of the peptide of interest and its corresponding unmodified alanine containing peptide ion. b (AP) = acetophenone side chain modification to methionine 10 Note that an alternative to performing an additional stage of CID MS/MS to determine the identities of those peptides found to contain a fixed charge derivative, would be the use of high resolution mass accuracies (an "accurate mass tag") coupled with database searching. Previously, in cases where an "accurate mass tag" approaching 1 ppm has been obtained, and 15 the presence of a particular amino acid is known (for example the presence of a cysteine residue), the specificity of database searching algorithms can be improved such that unambiguous identification of the protein from which the peptide is derived has been achieved from this information alone [Conrads, T.P., Anderson, G.A., Veenstra, T.D., Pasa Tolic, L. and Smith, R.D. Anal Chem. 2000, 72, 3349-3354.; Goodlet, D.R., Bruce, J.E., 20 Anderson, G.A., Rist, B., Pasa-Tolic, L., Fiehn, 0., Smith, R.D. and Aebersold, R. Anal. Chem. 2000, 72, 1112-1118.; Strittmatter, E.F., Ferguson, P.L., Tang, K. and Smith, R.D. J. Am. Soc. Mass Spectrom. 2003, 14, 980-991.].
WO 2004/046731 PCT/US2003/036739 - 46 Identification ofpeptides containing methionine by neutral loss scan mode CID MS/MS of their sulfonium ion fixed-charge derivatives in a triple quadrupole mass spectrometer. To demonstrate the utility of the fixed charge derivatives for selectively identifying 5 peptides containing certain amino acid resides in a relatively complex mixture by neutral loss scan mode CID MS/MS, 10 pmol of a reduced and S-carboxyamidomethylated tryptic digest of bovine serum albumin, derivatized using bromoacetophenone, was subjected to capillary RP-HPLC-MS (Figure 11). Then, in a duplicate experiment, those peptides giving rise to a neutral loss of m/z 83 (i.e., the loss of CH 3
SCH
2
COC
6
H
5 from doubly charged precursor 10 ions) were then identified in a neutral loss scan mode CID MS/MS experiment. The resultant total ion current trace for this experiment is shown in Figure 12A. Four main peaks were observed and the mass spectra obtained from two of these are shown in Figures 12B and 12C. The masses of these peaks were found to correspond to the four methionine containing tryptic peptides of bovine serum albumin. It can be observed by comparison of the data in Figures 15 11B and C (the mass spectra of the two regions of the total ion current chromatogram from the MS experiment indicated by arrows), and Figures 12B and C, that the ions detected in the neutral loss scan mode experiment are present at relatively low abundance (the ion at m/z 799.4 in Figure 12B is only the 8 th most abundant ion in Figure 11B, and the ion at m/z 759.8 in Figure 12C is only the 3 most abundant ion in Figure 1 IG). Therefore, these data 20 demonstrate the utility of the fixed charge derivatives for selectively identifying peptides containing certain amino acid resides by neutral loss scan mode CID MS/MS of their fixed charge derivatives (the four methionine containing tryptic peptides of bovine serum albumin were selectively identified from a total of approximately 100 peptides present in the digest). 25 Strategies for protein identification by neutral loss scan mode CID MS/MS ofpeptides containing other amino acid fixed-charge derivatives. Tryptophan and Cysteine 30 Modification of tryptophan, by electrophilic aromatic substitution, to yield a fixed charge sulfonium ion derivative could be achieved by initial reaction of tryptophan containing peptides with a sulfenylhalide (Ri SX) to yield a derivative with a thioether functional group in the 2 position of the indole side chain (Scheme 5A) [Scoffone, E., WO 2004/046731 PCT/US2003/036739 - 47 Fontana, A. and Rochhi, R. Biochemistry. 1968, 7, 971.], followed by the formation of the sulfonium ion derivative of tryptophan under conditions identical to those discussed above for methionine and S-alkyl cysteine sulfonium ion derivatives. Note that the sulfenylhalide would also react with cysteine containing peptides to yield an unsymmetrical disulfide 5 (Scheme 5B), however, specificity of the reaction to tryptophan may be imparted by initial S alkylation of cysteine, or reduction of the unsymmetrical disulfide prior to sulfonium ion formation. The solution phase and gas phase criteria for sulfoniuin ion formation from 2 indole S-alkyl tryptophan modified residues are the same as those discussed above for methionine and for S-alkyl cysteine sulfonium ion formation. As sulfonium ion derivatives of 10 the 2-indole S-alkyl derivatives of tryptophan will be formed under conditions identical to those discussed above for methionine and S-alkyl cysteine, both residues would be modified in this approach. However, for the reactions discussed above for methionine and S-alkyl cysteine sulfonium ion derivatives, methionine and tryptophan containing peptides could be identified individually, through the use of an R 1 group that does not correspond to -CH 3 , (i.e., 15 the side chain alkyl group of methionine) or simultaneously, via initial alkylation of tryptophan where the R 1 group corresponds to -CH 3 . Likewise, 2-indole S-alkyl tryptophan and S-alkyl cysteine containing peptides could be identified individually, through the introduction of an R 1 group to the S-alkyl tryptophan derivative that does not match that of the R 1 group used for initial alkylation of cysteine, or simultaneously, via initial alkylation of 20 tryptophan using the same R 1 group as that used for initial cysteine alkylation. Note that under appropriate conditions, similar electrophilic aromatic substitution reactions could also be used for the introduction of an S-alkyl group to the side chain of tyrosine residues, followed by subsequent formation of a sulfonium ion derivative.
WO 2004/046731 PCT/US2003/036739 -48 A 0 0 0
H
2 N-CH-C-OH H 2 N-CH-C--OH H 2 N-CH-C-OH
CH
2
R
1 SXi
CH
2 R2X2 R2 CH 2 S S0 + R R1 HN HN HN B 0
H
2 N-CH-C -OH I I
H
2 N-CH-C-OH RisX 1
CH
2
OH
2 SH Scheme 5 General structure of the fixed-charge ion reagents suitable for use in the work described 5 here. Any reagent having the general structure R 1
SX
1 , where X 1 is any suitable leaving group consisting of, for example, halides, sulfonic esters, perchlorate esters or chlorosulfonates [March J. "Advanced Organic Chemistry", 4th Ed., Wiley, New York, 1992, 10 pp 352-357], and R 1 is any substrate with a carbon adjacent to the leaving group, such as alkyl, allyl or a-carbonyl groups, with the exception of those indicated above, and where an isotopic or structural label can be incorporated to allow its use for differential quantitation, and R 2
X
2 , where X 2 is any suitable leaving group consisting of, for example, halides, sulfonic esters, perchlorate esters or chlorosulfonates [March J. "Advanced Organic Chemistry", 4th 15 Ed., Wiley, New York, 1992, pp 352-357], and R 2 is any substrate with a carbon adjacent to the leaving group, such as alkyl, allyl or a-carbonyl groups, with the exception of those indicated above, and where an isotopic or structural label can be incorporated to allow its use for differential quantitation. Sulfoniurn ions of tryptophan and cysteine containing peptides may also be fonned 20 directly, by initial reaction with a sulfenylhalide (X 2
RSX
1 ) containing a suitable leaving WO 2004/046731 PCT/US2003/036739 - 49 group X 2 , to subsequently yield a cyclic sulfonium ion derivative of tryptophan (Scheme 6A) or cysteine (Scheme 6B). A O 0 0
H
2 N-CH-C-OH H 2 N-CH-C-OH H2N--CH-C--OH
CH
2
X
2
RSX
1
CH
2
CH
2 S ,S / I T / {I, / HN X2 HN HN B 0 0 O H 2 N-CH-C-OH H2N
H
2 N-CH-C-OH I X 2
RSX
1 CH2 _ CH2 CH )P I C 2s s SH S + 1X 2 R 5 Scheme6 General structure of the fixed-charge ion reagents suitable for use in the work described here. 10 Any reagent having the general structure X 2
RSX
1 , where X1 is any suitable leaving group consisting of, for example, halides, sulfonic esters, perchlorate esters or chlorosulfonates [March J. "Advanced Organic Chemistry", 4th Ed., Wiley, New York, 1992, pp 352-357], R is any substrate with a carbon adjacent to the leaving group, such as alkyl, allyl or a-carbonyl groups, with the exception of those indicated above, and where an isotopic 15 or structural label can be incorporated to allow its use for differential quantitation, and X 2 is any suitable leaving group consisting of, for example, halides, sulfonic esters, perchlorate esters or chlorosulfonates [March J. "Advanced Organic Chemistry", 4th Ed., Wiley, New York, 1992, pp 352-357]. 20 WO 2004/046731 PCT/US2003/036739 -50 Cysteine. Cyclic sulfonium ions may also be formed specifically from cysteine containing peptides, by initial alkylation with a reagent (X 2
RX
1 ) (Scheme 7A) or by initial disulfide 5 exchange with a reagent (X 2
RSSX
1 ) (Scheme 7B), each containing a suitable leaving group
X
2 , with subsequent formation of a cyclic sulfonium ion derivative. A SH
RX
2 0R SHI o K H X 2 RXI 0 KX 0 + H 11 H 1 r H N C N'. CNNICHC N C N CHC N H H II 0 o 0 B R S S-RX SHI SH
X
2 RSSX1 0
-X
2 S H - 0 K C iCH N H || r H N N C'NCCH N C NCH' N H N C r N 'C '' H H H 10 Scheme 7 General structure of the fixed-charge ion reagents suitable for use in the work described here. 15 Any reagent having the general structure (i) X 2
RX
1 , where X 1 is any suitable leaving group consisting of, for example, halides, sulfonic esters, perchlorate esters or chlorosulfonates [March J. "Advanced Organic Chemistry", 4th Ed., Wiley, New York, 1992, pp 352-357], R is any substrate with a carbon adjacent to the leaving group, such as alkyl, allyl or a-carbonyl groups, with the exception of those indicated above, and where an isotopic 20 or structural label can be incorporated to allow its use for differential quantitation, and X 2 is any suitable leaving group consisting of, for example, halides, sulfonic esters, perchlorate esters or chlorosulfonates under neutral or basic pH conditions, or protonated amines or WO 2004/046731 PCT/US2003/036739 - 51 alcohols under acidic pH conditions [March J. "Advanced Organic Chemistry", 4th Ed., Wiley, New York, 1992, pp 352-357], or (ii) X 2 RSSXI, where X 1 is, for example, nitrobenzoic acid, R is any substrate with a carbon adjacent to the leaving group, such as alkyl, allyl or a-carbonyl groups, with the exception of those indicated above, and where an 5 isotopic or structural label can be incorporated to allow its use for differential quantitation, and X 2 is any suitable leaving group consisting of, for example, halides, sulfonic esters, perchlorate esters or chlorosulfonates [March J. "Advanced Organic Chemistry", 4th Ed., Wiley, New York, 1992, pp 352-357]. 10 Strategies for protein identification by precursor ion scan inode CID MS/MS ofpeptides containing other amino acid fixed-charge derivatives. In contrast to the fragmentation reactions discussed above, where neutral losses are 15 observed (note that the site of the fixed charge in these derivatives is directly adjacent to the site of bond cleavage), the fragmentation of peptides derivatized on their side chains where the fixed charge is located remote to the site of bond cleavage, may result in the loss of a charged product, which may be selectively detected using a precursor ion scan mode MS/MS experiment (Scheme 8). For example, fixed-charge sulfonium, quaternary alkylammonium or 20 quaternary alkylphosphonium ion reagents should result in charged loss of the side chain via cis 1,2 elimination, with formation of a dominant low mass product ion with formation of its complementary neutral peptide species (if the initial charge state of the precursor was one), or a charged peptide ion species with a charge state one lower than the precursor (if the initial charge state of the precursor was greater than one). These characteristic low mass product 25 ion(s) can be selectively detected using a parent ion scan mode MS/MS experiment. This side chain cleavage may be further enhanced by oxidation of the thioether [Steen, H. and Mann, M. J. Am. Soc. Mass Spectrom. 2002, 13, 996-1003.]. Note that the fragmentation of cysteine-containing peptides alkylated with reagents containing basic functionalities, such as iodoacetamide, acrylamide and vinylpyridine, have been observed previously to give rise to 30 some fragmentation of the side chain [Moritz, R.L. Eddes, J.S., Reid, G.E. and Simpson, R.J. Electrophoresis. 1995, 17, 907-917.]. The solution phase and gas phase requirements for quaternary alkylammonium or quaternary alkylphosphonium ion formation and fragmentation are the same as those for WO 2004/046731 PCT/US2003/036739 -52 sulfonium ion formation and fragmentation discussed above. Note that a fixed-charge derivative of cysteine (using (2-bromoethyl)trimethylammonium bromide (BrCH 2
CH
2
(CH
3
)
3 NBr) for introduction of the fixed charge to the side chain thiol of the cysteine residue), has been used previously to enable carboxypeptidase C-terminal peptide 5 sequence analysis, with mass spectrometry used for detection of the reaction products [Bonetto, V., Bergman, A-C., Jornvall, H. and Sillard, R. Anal. Chenz. 1997, 69, 1315-1319.]. , SH 0 CH2 . CH CN R | .,SH 0 0 CH 2 R+X CD 11 1 0 CH 2 ' C N CH NH M S C O CID || || 0 C NHCC [M+nH+R+ (n+l) )- d'C + HSR+ Scheme 8 10 General structure of the fixed-charge ion reagents suitable for use in the wIork described here. Any reagent having the general structure R*X, where X is any suitable leaving group 15 consisting of, for example, halides, sulfonic esters, perchlorate esters or chlososulfonates [March J. "Advanced Organic Chemistry", 4th Ed., Wiley, New York, 1992, pp 352-3571, and R* is any substrate with a carbon adjacent to the leaving group, such as alkyl, allyl or a carbonyl groups, and with a terminal sulfonium, quaternary alkylammonium or quaternary alkylphosphonitun ion, and where an isotopic or structural label can be incorporated to allow 20 its use for differential quantitation. Other mass spectrometrv instrumentation and ionization source considerations. It is important to recognise that the dissociation of fixed charge derivatives with neutral 25 loss of the side chain, results in a peptide product ion having the same charge state (and number of ionising protons) as that of the precursor ion. Hence, these derivatives are WO 2004/046731 PCT/US2003/036739 - 53 amenable to both ESI (where multiply charged ions are usually observed) and MALDI (where predominantly singly charged ions are observed) ionization methods when identification of the peptide sequence is to be achieved by "high" energy MS/MS or MS3. In contrast, the dissociation of fixed charge derivatives with charged loss of the side chain (i.e., 5 formation of a low mass product ion), results in a peptide product that carries one less charge than that of the precursor ion. Therefore, if identification of the peptide sequence is to be achieved by further dissociation, this approach is only applicable when using ESI methods, where a charged product ion containing the peptide would be formed in addition to the charged side chain cleavage low mass product ion. 10 Differential quantitation ofproteins by CID MS/MS of their fixed-charge derivative containing peptides. Neutral loss scan mode CID MS/MS in a triple quadrupole mass spectrometer. 15 In a general strategy for differential quantitation of protein expression using the fixed charge derivatization approach described here, a first sample is derivatized with either sulfonium or tertiary alkyl fixed-charge reagents containing naturally abundant (light) isotopes, then mixed with a second sample derivatized with the same reagent containing 20 isotopically enriched (heavy) isotopes, prior to their introduction to the mass spectrometer. Then, using either neutral loss or parent ion scan mode MS/MS, the abundance ratios of the product ions formed by neutral loss or parent ion scan mode MS/MS of the light isotope tag containing peptide ions, compared to those containing the heavy isotope tag can be determined, which is indicative of changes in the abundance levels of proteins between the 25 two samples. The product ion representing the peptide product can then be automatically selected for further structural interrogation by "high" energy MS/MS or MS 3 . Note that the approach described here could also be readily combined with a "global" labelling (i.e., N-or C-terminal labelling) strategy, whereby the isotopic label is introduced at a position remote to the site of the fixed-charge so that a common neutral loss could be used to identify both light 30 and heavy isotope containing derivatives. The ESI mass spectrum of a nominal 1.0 pmol/gL : 1.0 pmol/gL mixture of the GAILMGAILR (SEQ ID NO:3) peptide derivatized with either do- or d 5 - acetophenone and introduced by electrospray ionization to a triple quadrupole mass spectrometer is shown in WO 2004/046731 PCT/US2003/036739 - 54 Figure 13A. From this mass spectrum, a ratio of 1.04: 1 can be obtained for the doubly (m/z 567.2 and 569.7) charged, and 1.06 : 1 for the triply (m/z 378.4 and 380.1) charged states of the peptide. A 10 times expanded region of the spectrum around m/z 700-800 shows the level of chemical noise associated with the mass spectrum. In comparison, the neutral loss scan 5 mode MS/MS spectra of the same mixture (neutral losses of 83 Da and 85.5 Da, respectively for the doubly charged precursors, and neutral losses of 55.3 Da and 57 Da, respectively for the triply charged precursors) are shown in Figure 13B. Again, the m/z 700-800 regions for each spectrum have been expanded 10 times to indicate the level of chemical noise. While the observed ratios for the MS/MS neutral loss scan mode ion abundances in Figure 13B were 10 essentially identical to those observed for the MS ion abundances in Figure 13A (a ratio of 1.02 : 1 was obtained for both doubly and triply charged states of the peptide), the level of chemical noise is reduced by at least two orders of magnitude compared to that observed in the mass spectrum in Figure 13A. Indeed, with the exception of electronic noise, characterised by single data point noise "spikes", the neutral loss MS/MS spectra showed a 15 complete absence of non-specific chemical noise along the baseline. The mass spectrum of a nominal 1 pmol/ptL: 0.1 pmol/gL mixture of do- and d 5 containing methionine side chain fixed-charge acetophenone (AP) sulfonium ion derivative of GAILMGAILR (SEQ ID NO:3) is shown in Figure 14. The abundance ratios obtained for the doubly and triply charged ions were determined to be 10.12:1 and 9.72:1, respectively. 20 The ratios of the ions obtained by neutral loss scan mode MS/MS experiments (Figure 15) were determined to be 9.45 : 1 and 10.83 : 1. Once again, the level of chemical noise in the neutral loss scan mode experiment was found to be at least two orders of magnitude lower compared to that observed in the mass spectrum. For all neutral loss scan mode MS/MS experiments, (ratios of 0.1 : 1, 0.2: 1.0, 0.5 :1.0, 1.0: 1.0, 1.0 : 0.5, 1.0: 0.2 and 1.0 : 0.1), 25 errors were all within 10% of the expected values. Note that for the neutral loss product ions formed from the sulfonium ion derivatized peptide precursors, a common product is formed from both "heavy" and "light" ions, therefore either product may be selected for further structural analysis by "high" energy CID or MS 3 . 30 One of the additional advantages of the MS/MS method described here over those involving identification by MS peak abundances alone is that both ions (i.e., light and heavy) are not required to be present in order for detection of one of the pairs to be achieved. Also, WO 2004/046731 PCT/US2003/036739 -55 in these instances, the mass of the neutral loss from the parent ion indicates its origin (i.e., from the "light" or "heavy" sample). To demonstrate the utility of the isotopically labelled fixed charge derivatization 5 approach coupled with neutral loss scan mode MS/MS for differential quantitation of peptides containing certain amino acid resides, 1 pmol each of do- and d 5 -acetophenone derivatized GAILMGAILR (SEQ ID NO:3) peptide was subjected to capillary RP-HPLC-MS and neutral loss scan mode CID MS/MS (Figures 16A and 16C, respectively). The resultant mass spectra obtained from the MS (ratios of 1.06 : 1 and 1.04: 1 for the doubly and triply 10 charged ions) and neutral loss MS/MS scans (ratios of 1.02 : 1 and 0.99 : 1 for the doubly and triply charged ions) are shown in Figures 16B and 16C. The neutral loss scans for 83 Da and 85.5 Da for the doubly charged do- and d 5 - labelled precursors, and 55.3 Da and 57 Da for the triply charged do- and d 5 - labelled precursors have all been summed. Less than 10% resolution of the do- and d 5 - forms of the fixed charge derivatives was observed, indicating 15 that there is not a strong deuterium isotope effect on the chromatographic separation for these fixed charge derivatives. This is consistent with recent reports on the factors controlling deuterium isotope effects on chromatography [Zhang, R.; Sioma, C.S.; Wang, S.; Regnier, F.E. Anal. Chem. 2001, 73, 5142-5149.; Zhang, R., Sioma, C.S.; Thompson, R.A.; Xiong, L.; Regnier, F.E. Anal. Chem. 2002, 74, 3662-3 669.; Zhang, R. and Regnier, F.E. J. Proteome 20 Res. 2002, 1, 139-147. Identification and differential quantitation of 0-linked post translationally modified proteins by CID MS/MS or ion-ion reaction MS/MS of their peptides containing fixed charge derivatives: 25 If combined with the -elimination/Michael addition chemistry for forming mass spectrometry stable derivatives of serine and threonine O-linked post translationally modified amino acid containing peptides, the fixed-charge derivatization approach could be extended. toward the identification and quantitation of O-linked post translational modification. status in 30 proteins. Following 0-elimination of O-linker serine or threonine post translational modifications, a one or two step Michael addition process could be used for fixed-charge introduction. Scheme 9 illustrates a one step process whereby the fixed-charge derivative (for example, a quaternary ammonium or phosphonium ion derivative) is introduced directly.
WO 2004/046731 PCT/US2003/036739 -56 Alternatively, in a two step process, a neutral side chain derivative could be introduced (e.g., an alkyl thiol), followed by subsequent derivatization to form a fixed charge sulfonium ion. Therefore, sulfonium, quaternary alkylammonium, or quaternary alkylphosphonium ion chemistries as outlined above, could also be used for selective detection of peptides 5 containing the modified amino acid residue. A similar method has been described by Steen and Mann for the introduction of a dimethylamine-containing sulfenic acid derivative to # eliminated phospho-serine and phospho-threonine residues, thereby allowing a characteristic protonated low mass fragment ion at m/z 122.06 to be detected by precursor ion scan mode MS/MS upon low energy CID (Steen, H. and Mann, M. J. Am. Soc. Mass Spectrom. 2002, 10 13, 996-1003). However, this ion is not always observed as a dominant product as the formation of this ion can be competitive with backbone fragmentation processes. Therefore, a mixture of product ions without and with the modified side chain can be present simultaneously, resulting in a complex product ion spectrum, thereby making it potentially difficult to interpret without prior knowledge of the expected peptide sequence. The fixed 15 charge derivatization method described here however, would result in the exclusive loss of the modified side chain, allowing improved detection of these peptides at higher levels of sensitivity and with better control of the subsequent dissociation process for subsequent structural elucidation. When combined with the differential isotopic labelling strategy outlined above, the fixed charge derivatization method of the present invention would also 20 allow differential quantitation of the O-linked post translational modification status. OPhos X + 0 CH2 (i) p-elimination 0 CH2 (ii) Michael addition o CH2 (iv) CID 1 i H 1 i I1 H - 0 | I H - [M+nH-X]("++ N-Fc, CH'C N NLCH'C NT (iii)MS 'C" NCH I N, .X NN *CN'~ ~ H |I H I| H 0 00 [M+nH+X]("+D+ Scheme 9 25 Characterization of Protein-Protein Interactions by CID MS/MS or ion-ion reaction MS/MS Identification of cross-linked peptides containing fixed-charge derivatives. The fixed-charge derivatization approach could also be extended toward the improved 30 characterization of protein-protein interactions by incorporation of the fixed-charge WO 2004/046731 PCT/US2003/036739 -57 derivative into a suitable cross-linking reagent prior to cross-linking reactions, or by deritivatization of a cross-link contained between two proteins or peptides after a cross linking reaction which, upon CID MS/MS, fragments via the specific loss of a neutral or charged entity from the cross-link. While the incorporation of a labile MS/MS "tag" on a 5 cross linker has been used by Back et al. to identify cross linked peptides via the detection of a characteristic low mass product ion formed upon low energy CID [Back, J.W., Hartog, A.F., Dekker, H.L., Muijsers, A.O., de Koning, L.J. and de Jong, L. J. Am. Soc. Mass Spectrom. 2001, 12, 222-227.], the product ion thus formed is seen as only a relatively low abundance product ion. In contrast, in a similar manner to that employed for peptide 10 identification and quantitation using fixed-charge derivatives as outlined above, the method of the present invention could be employed here to direct the fragmentation of cross linked peptides toward exclusive cleavage of the fixed-charge, to yield either a characteristic neutral loss or low mass product ion (Scheme 10). The product ion containing the cross linked peptide may then be automatically selected for further structural interrogation by higher 15 energy MS/MS or by MS 3 . If quantitative comparison of cross linking between two different samples was required, an isotopic label could be also incorporated into the cross-link. Peptide 1 11 Peptide 1 H+ fixed charge cross-link CID Peptide 2 Peptide 2 H+ Scheme 10 20 If desired, any of the examples discussed above containing fixed-charge derivatives could also be selectively pre-enriched prior to mass spectrometric analysis by known chromatographic methods [Tang, J.R. and Hartley, B.S. Bioch em. J. 1967, 102, 593.; Degen, J. and Kyte, J. Anal. Biochem. 1978, 89, 529-539.; Kyte, J., Degen, J. and Harkins, R.N. 25 Methods Enzymology 1983, 91, 367-377.; Gevaert, K., Van Damme, J., Goethals, M., Thomas, G.R., Hoorelbeke, B., Demol, H., Martens, L., Puype, M., Staes, A. and Vandekerckhove, J. Mol. Cell. Proteomics. 2002, 1, 896-903.].
WO 2004/046731 PCT/US2003/036739 -58- Alternatively the fixed-charge derivatives could be selectively pre-enriched by solid phase capture methods, using fixed charge reagents covalently coupled to beads or insoluble polymers [Weinberger, S.R.; Viner, R.I. and Ho, P. Electrophoresis. 2002, 23, 3182-3192.; Zhou, H.; Ranish, J. A.; Watts, J. D.; Aebersold, R. Nature Biotechnol. 2002, 20, 512-515.; 5 Qiu, Y; Sousa, E. A.; Hewick, R. M.; Wang, J. H. Anal. Chemn. 2002, 74, 4969-4979.; Qian, W-J, Goshe, M.B., Camp D.G., Yu, L-R, Tang, K. and Smith. R.D. Anal. Chein. 2003, 75, 5441 - 5450.]. Chemical or photolytic cleavage would then be employed to release the fixed charge derivative of the peptide (shown for a sulfonium ion fixed charge derivative in Scheme 11), thereby reducing the sample complexity of the mixture to be analysed prior to its 10 introduction to the mass spectrometer. Bead Chemical or photo-labile linker R
R
1 couple cleave Leaving group X R 1 Peptide + 5R 2 Peptide + R2F Peptide
SR
2 Scheme 11 15 Preferred embodiments for Methionine fixed charge derivatization reagents The two structures immediately shown below are those that have been used in the work described here to date. These are bromoacetophenone and the isotopically encoded 20 deuterated (d 5 ) version. O H 0 D Br H D
CH
2
CH
2 H H D D H D bromoacetophenone d 5 - bromoacetophenone WO 2004/046731 PCT/US2003/036739 -59 A preferred isotopically encoded reagent of this form is a 1 3
C
6 labelled version, whose structure is shown below. 1 3 C labelling is preferred over deuterium labelling in order to minimise any chromatographic separation of the isotopically labelled versus unlabelled 5 reagents. This reagent would be expected by those of skill in the art to have the same properties as the unlabelled form. 0 H Br 131 Br CH C 1C C H 13 13
CH
2 c c 13C ;3C H C H 10 1C6 - bromoacetophenone A more preferred reagent is a water soluble form of the foregoing reagent, as this added property confers benefits such as allowing for more streamlined sample preparation workflows. Examples of water soluble fonns of the reagents are shown below, as well as 15 their 1 3
C
6 labelled versions. Note that the addition of a sulfinic or sulfonic acid functional group could be achieved at the ortho, meta- or para positions relative to the acetyl group. The synthetic strategy can be chosen to direct the synthesis toward the any one of these positions; the para substituted position is shown below. O H O H Br H Br 13 C H 11< Gil 2 1c"" ' 3c 13C 3C H SO2H H C SO 2 H 20 H H 4-bromoacetyl-benzenesulfinic acid 4-bromoacetyl- 13
C
6 -- benzenesulfinic acid WO 2004/046731 PCT/US2003/036739 -60 O H O H Br C H Br C 13 C H
CH
2
CH
2 '113C 13c 3 13 H SO 3 H H C SO 3 H H H 5 4-bromoacetyl-benzenesulfonic acid 4-bromoacetyl- 3
C
6 -benzenesulfonic acid A non-limiting method for the synthesis of these derivatives may be obtained using methods known to those skilled in the art [Buess, C.M. and Kharasch, N. J. Am. Chem. Soc. 1950, 72, 3529-3532.; Kharasch, N. and Swidler, R. J. Org.Chem. 1954, 19, 1704-1707.; 10 Vogels Textbook of Practical Organic Chemistry. 4th ed. 1978. p 773-778.; Szmant, H.H. and Irwin, D.A. J. Am. Chem. Soc. 1956, 78, 4386-4389.] via (i) reaction of benzene with 2,4 dinitrophenylsulfenyl chloride to yield phenyl 2,4-dinitrophenyl sulfide, (ii) further reaction with acetic anhydride to yield the p-acetylbenzene 2,4-dinitrophenyl sulfide (iii) oxidization to yield the p-acetylbenzene 2,4-dinitrophenyl sulfone (iv) cleaved with methanolic alkali to 15 yield the p-acetylbenzene 2,4-dinitrophenyl sulfinic acid, then either (v) further oxidised to the sulfonic acid and then brominated or (v) directly brominated. Alternatively, following steps (i) and (ii), the derivative may be (iii) cleaved by thiolysis to yield the p-acetylthiobenzene derivative (iv) oxidised to the sulfinic or sulfonic acids, then (v) brominated. 20 While a bromine atom is shown as the leaving group (i.e., X, of RX) in the foregoing structures, an iodine also preferred. Other leaving groups that could be used include: halides such as a chlorine atom, as well as sulfonic esters, perchlorate esters, chlorosulfonates, or protonated amines or alcohols [March J. "Advanced Organic Chemistry", 4th Ed., Wiley, New York, 1992, pp 352-357]. 25 In addition to the free acids shown above, these compounds may be prepared in their salt forms (e.g., Na or HCl), and can be prepared as solvates.
WO 2004/046731 PCT/US2003/036739 -61 The invention also includes reagent kits for analysis of amino acids, peptides or proteins by mass spectrometry. The reagent kits include one or more of the foregoing compounds and other reagents such as protein disulfide reducing agents (e.g., dithiothreitol (DTT), -mercaptoethanol, tris-carboxyethyl phosphine (TCEP), tributylphosphine (TBP)), 5 cysteine alkylating reagents (e.g., alkylhalides (e.g., iodoacetic acid, iodoacetamide) as well as vinylpyridine or acrylamide), proteases or chemical cleavage agents (e.g., trypsin, Lys-C, Asp-N, pepsin, thermolysin, cyanogen bromide, dilute acid). The reagent kits also can contain solvents. Typical solvents that are used include urea, guanidine hydrochloride, acetonitrile, methanol, water. 10 The various reagents can be used to perform the foregoing methods steps in solution, or on a solid phase, (e.g., by having the reagents immobilized on a column). Amino acids or peptides labelled using the method of the invention can be used, in addition to analysis of amino acids, peptides and proteins, for internal standards. For 15 example, the derivatized peptides described herein can be used in a manner analogous to several recent reports that used known quantities of an isotopically encoded peptide derivative spiked into a sample as an internal standard in order to perform absolute quantitation of the selected peptide of interest (see Gygi, Steven P.; Peng, Junmin. PCT Int. Appl. (2003) WO 2003078962; Gerber, Scott A.; Rush, John; Stemman, Olaf; Kirschner, 20 Marc W.; Gygi, Steven P. Proc. Natl. Acad. Sci. U.S.A. (2003), 100, 6940-6945.). Note that these reports do not employ the use of fixed charge derivatives amenable to tandem mass spectrometry detection as are describe herein. It will be appreciated by persons skilled in the art that numerous variations and/or 25 modifications may be made to the invention as shown in the specific embodiments without departing from the spirit or scope of the invention as broadly described. The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive. The disclosures of the references cited herein are hereby incorporated herein by reference. 30 Throughout this specification the word "comprise", ot variations such as "comprises" or "comprising", will be understood to imply the inclusion of a stated element, integer or step, or group of elements, integers or steps, but not the exclusion of any other element, integer or step, or group of elements, integers or steps.
WO 2004/046731 PCT/US2003/036739 - 62 Any discussion of documents, acts, materials, devices, articles or the like which has been included in the present specification is solely for the purpose of providing a context for the present invention. It is not to be taken as an admission that any or all of these matters form part of the prior art base or were common general knowledge in the field relevant to the 5 present invention as it existed before the priority date of each claim of this application.

Claims (89)

1. A method of analysis of amino acids, peptides or proteins, the method comprising: (1) derivatizing a mixture of amino acids, peptides or proteins, to form at least one amino acid, peptide or protein derivatized to contain a fixed-charge ion, other than at the C terminal or N-terminal end thereof; 5 (2) introducing the mixture of amino acids, peptides or proteins containing at least one amino acid, peptide or protein derivatized to contain a fixed-charge ion other than at the C-terminal or N-terminal end thereof, to a mass spectrometer; (3) passing the mixture of amino acids, peptides or proteins containing at least one amino acid, peptide or protein derivatized to contain a fixed-charge ion, other than at the C 10 terminal or N-terminal end thereof, through a first mass resolving spectrometer to select precursor protein or peptide ions having a first mass-to-charge ratio; (4) subjecting the precursor ions of the first mass-to-charge ratio to dissociation to form product ions having a second mass-to-charge ratio that is characteristic of a fragmentation occurring at a site adjacent to the fixed charge; and 15 (5) detecting the product ions having the second mass-to-charge ratio.
2. The method of claim 1, wherein the product ions having the second mass-to-charge ratio are product ions formed by neutral loss of the fixed charge from the precursor ions. 20
3. The method of claim 1, wherein the product ion having the second mass-to-charge ratio are product ions formed by charged loss of the fixed charge from the precursor ions.
4. The method of claim 1, further comprising the step of: (6) determining the identity of the derivatized peptide or protein. 25
5. The method of claim 4, wherein the step of determining the identity of the derivatized peptide or protein is performed by first repeating steps (1), (2), (3) and (4) and then subjecting the product ions having the second mass-to-charge ratio to dissociation to form a series of product ions having a range of mass-to-charge ratios, for the purpose of determining 30 the amino acid sequence of the peptide or protein. WO 2004/046731 PCT/US2003/036739 - 64
6. The method of claim 5, wherein the product ions having the second mass-to-charge ratio are formed by neutral loss from the precursor.
7. The method of claim 5, wherein the wherein the product ion having the second mass 5 to-charge ratio is formed by charged loss from the precursor ion.
8. The method of claim 4, wherein the step of determining the identity of the derivatized peptide or protein is performed by use of high resolution mass analyzers. 10
9. The method of claim 8, wherein the use of high resolution mass analyzers provides mass accuracies of approximately 1-5 ppm on the product ion detected in step (5), or its complementary product ion.
10. The method of claim 4, wherein the step of determining the identity of the derivatized 15 peptide or protein comprises database searching to identify those peptides found to contain a fixed charge derivative.
11. The method of claim 1, wherein the step of dissociation comprises a method selected from the group consisting of (i) collisions with an inert gas (collision-induced dissociation 20 (CID) or collisionally-activated dissociation (CAD)); (ii) collisions with a surface (surface induced dissociation (SID)); (iii) interaction with photons resulting in photodissociation, optionally using a laser; (iv) thermal/black body infrared radiative dissociation (BIRD); and (v) interaction with an electron beam, resulting in electron-induced dissociation for singly charged cations (EID), electron-capture dissociation (ECD) for multiply charged cations, or 25 combinations thereof.
12. The method of any of claims 1-11, wherein the method is used for identification of amino acids, peptides or proteins. 30
13. The method of any of claims 1- 11, wherein the method is used for quantitation of amino acids, peptides or proteins, WO 2004/046731 PCT/US2003/036739 - 65
14. The method of any of claims 1-11, wherein the method is used for amino acid, peptide or protein differential quantitation based on the incorporation of suitable isotopic or structural labels to the fixed charge. 5
15. The method of claim 14, wherein the isotopic labels are one or more of "C, 15N, and 2 H.
16. The method of any of claims 1-11, wherein the method is used for analysis of post translational modification status of amino acids, peptides or proteins. 10
17. The method of claim 16, wherein the analysis of post translational modification status of amino acids, peptides or proteins comprises incorporation of the fixed-charge derivative via a -elimination/Michael addition method for forming mass spectrometry stable derivatives of O-phosphorylated and 0-glycosylated seine, or 0-phosphorylated and 0 15 glycosylated threonine.
18. The method of any of claims 1-11, wherein the method is used for analysis of cross linking status of amino acids, peptides or proteins. 20
19. The method of any of claims 1-11, wherein the method is used for analysis of interaction of proteins.
20. The method of any of claims 1-19, wherein the fixed-charge derivative is contained on the side-chain of a selected amino acid residue or a side-chain of a selected amino acid 25 residue contained within a protein or peptide.
21. The method of claim 20, wherein the selected amino acid residue is that of a rare amino acid. 30
22. The method of claim 20, wherein the selected amino acid residue contains a S atom in the side chain thereof. WO 2004/046731 PCT/US2003/036739 -66
23. The method of claim 22, wherein the amino acid residue is methionine, cysteine, homocysteine or selenocysteine.
24. The method of claim 23, wherein the amino acid residue is methionine and wherein 5 the method is performed according to any of Schemes 1, 3 and 4.
25. The method of claim 23, wherein the amino acid residue is cysteine and wherein the method is performed according to any of Schemes 2, 5, 6, 7 and 8. 10
26. The method of claim 20, wherein the selected amino acid residue is tryptophan or tyrosine.
27. The method of claim 26, wherein the amino acid residue is tyrosine or tryptophan and wherein the method is performed according to any of Schemes 5 and 6. 15
28. The method of claim 20, wherein the side chain contains an S-alkyl group.
29. The method of claim 28, wherein the amino acid residue is methionine, S-alkyl cysteine, S-alkyl homocysteine, S-alkyl tryptophan or S-alkyl tyrosine. 20
30. The method of any of claims 1-19, wherein the fixed-charge derivative is contained on a side-chain of a post-translationally modified amino acid residue.
31. The method of claim 30, wherein the fixed-charge derivative is contained on an 0 25 linked post-translationally modified amino acid residue.
32. The method of claim 30, wherein the O-linked post-translationally modified amino acid residue is a dehydroalanine residue formed by f-elimination from an 0-linked post translationally modified serine amino acid residue (Scheme 9). 30
33. The method of claim 30, wherein the O-linked post-translationally modified amino acid residue is a dehydroamino-2-butyric acid residue formed by f-elimination from an 0 linked post-translationally modified threonine amino acid residue. WO 2004/046731 PCT/US2003/036739 - 67
34. The method of any of claims 1-19, wherein the fixed-charge derivative is contained on a cross-link contained between two amino acids, peptides or proteins (Scheme 10). 5
35. The method of any of claims 1-19, wherein the fixed-charge derivative is contained within a cross-linking reagent.
36. The method of any of claims 1-35, wherein the fixed-charge derivative is selectively pre-enriched by solid phase capture methods using fixed charge reagents covalently coupled 10 to beads or insoluble polymers (Scheme 11).
37. The method of any of claims 1-36, wherein the fixed-charge ion is a sulfonium ion, a quaternary alkylammonium or a quaternary alkylphosphonium ion. 15
38. The method of any of claims 1-37, wherein the analysis of the amino acid, peptide or protein ion is performed by tandem mass spectrometry.
39. The method of claim 38, wherein the tandem mass spectrometer is equipped with electrospray ionization (ESI) or matrix assisted laser desorption ionization (MALDI) 20 interfaces to transfer the protein or peptide ion into the gas-phase.
40. The method of claim 38, wherein the tandem mass spectrometer is a tandem-in-space mass spectrometer, a tandem-in-time mass spectrometer, or a combination thereof. 25
41. The method of claim 40, wherein the tandem-in-space mass spectrometer is a sector mass spectrometer, a time of flight mass spectrometer, a triple quadrupole mass spectrometer, or a hybrid mass spectrometer combining time of flight and quadrupole instruments.
42. The method of claim 41, wherein the sector mass spectrometer is a double focusing 30 sector mass spectrometer or a hybrid mass spectrometer combining sector and quadrupole instruments. WO 2004/046731 PCT/US2003/036739 -68
43. The method of claim 38, wherein the tandem-in-time mass spectrometer is a two dimensional quadrupole ion trap mass spectrometer, a three-dimensional quadrupole ion trap mass spectrometer or a Fourier-transform ion cyclotron resonance (FT-ICR) mass spectrometer. 5
44. The method of any of the foregoing claims, wherein the method further includes one or more steps of protein extraction, protein separation, reduction and alkylation of cysteine disulfides and/or digestion. 10
45. The method of any of the foregoing claims, wherein the amino acids, peptides or proteins are derivatized using a substituted acetophenone, or a salt thereof, or a solvate thereof, having the following formula: O R1 C H C R , 1{ 2 R 2 CH 2 R 6 Ra' ~R 5 '~ ' _"R 3 ' R 5 R 4 ' R R4 15
46. The method of claim 45, wherein X is any halogen, sulfonic ester, perchlorate ester or chlorosulfonate.
47. The method of claim 45, wherein Ri-Rz are H, and R' - R 6 ' are 12 C. 20
48. The method of claim 45, wherein the substituted acetophenone is an isotopically encoded substituted acetophenone, or a salt thereof, or a solvate thereof.
49. The method of claim 48, wherein X is any halogen, sulfonic ester, perchlorate ester or 25 chlorosulfonate.
50. The method of claim 49, wherein at least one of, and preferably at least three of R 1 -R 5 are 2 H, and Ri'-R 6 ' are 1 2 C. WO 2004/046731 PCT/US2003/036739 - 69
51. The method of claim 49, wherein R 1 -R 5 are H, and at least one of, and preferably at least three of R 1 '-R 6 ' are 1 3 C. 5
52. The method of any of claims 45-51, wherein at least one of RI-R 5 is a functional group containing an atom other than hydrogen or carbon.
53. The method of any of claims 45-52, wherein the substituted acetophenone is water soluble. 10
54. The method of claim 53, wherein R 1 is SO 2 H, and R 2 -R 5 are H, and R 1 '-R 6 ' are 12c; R 1 is H, R 2 is SO 2 H, and R 3 -R 5 are H, and R 1 '-R 6 ' are 12C; R1-2 are H, R 3 is SO 2 H, and R 4 .-R 5 are H, and R 1 '-R 6 ' are "C; 15 R 1 is SO 3 H, and R 2 -R 5 are H, and R 1 '-R 6 ' are 12C; R 1 is H, R 2 is SO 3 H, and R 3 -R 5 are H, and R 1 '-R 6 ' are 12C; or R1-2 are H, R 3 is SO 3 H, and R 4 -R 5 are H, and R 1 '-R 6 ' are 12C.
55. The method of claim 53, wherein 20 R 1 is SO 2 H, and at least one of, and preferably at least three of R 2 -R 5 are 2 H, and R 1 ' R6' are "C; R 1 is SO2H, and R 2 -R 5 are H, and at least one of, and preferably at least three of R 1 ' R 6 ' are 13C; R 2 is SO2H, and at least one of, and preferably at least three of R 1 and R 3 -R 5 are 2H, 25 and R'-R6' are 1C; R 2 is SO 2 H, and R 1 and R 3 -R 5 are H, and at least one of, and preferably at least three of R 1 '- R 6 ' are "C; R 3 is SO 2 H, and at least one of, and preferably at least three of R-R 2 and R 4 -R 5 are 2 H, and R 1 '- R 6 ' are 12C; 30 R 3 is SO2H, and R1-R 2 and R 4 -R 5 are H, and at least one of, and preferably at least three of R'- R 6 ' are "c; R 1 is SO 3 H, and at least one of, and preferably at least three of R 2 -R 5 are 2H, and R 1 ' R6' are "C; WO 2004/046731 PCT/US2003/036739 -70 R 1 is SO 3 H, and R 2 -R 5 are H, and at least one of, and preferably at least three of R 1 ' R 6 ' are 1 3 C; I R 2 is SO 3 H, and at least one of, and preferably at least three of R 1 and R 3 - R 5 are 2H, and R 1 '- R 6 ' are 12c; 5 R 2 is SO 3 H, and R 1 and R 3 -R 5 are H, and at least one of, and preferably at least three of Ri'- R 6 ' are 13c; R 3 is SO 3 H, and at least one of, and preferably at least three of R 1 -R 2 and R 4 - R 5 are 2 H, and R 1 '- R 6 ' are 12C; and R 3 is SO 3 H, and R 1 -R 2 and R 4 -R 5 are H, and at least one of, and preferably at least 10 three of R 1 '-are 13 C.
56. A substituted acetophenone, or a salt thereof, or a solvate thereof, having the following formula: 0 R1 x R, R CH 2 R 6 ' R R 5 R' R 3 i14 15 R4 wherein X is a sulfonic ester, perchlorate ester or chlorosulfonate.
57. The substituted acetophenone of claim 56, wherein R 1 -R 5 are H, and R 1 '-R 6 ' are 12C. 20
58. The substituted acetophenone of claim 56, wherein the substituted acetophenone is an isotopically encoded substituted acetophenone, or a salt thereof, or a solvate thereof.
59. The substituted acetophenone of claim 58, wherein at least one of, and preferably at 25 least three of R 1 -R 5 are 2H, and Rl'-R 6 ' are 12 C.
60. The substituted acetophenone of claim 58, wherein R 1 -R 5 are H, and at least one of, and preferably at least three of R 1 '-R 6 ' are 1 3 C. WO 2004/046731 PCT/US2003/036739 - 71
61. A water soluble substituted acetophenone, or a salt thereof, or a solvate thereof, having the following formula: 0 R X" C R, R2 CH 2 R,' R 2 5 R4
62. The water soluble substituted acetophenone of claim 61, wherein X is any halogen, sulfonic ester, perchlorate ester or chlorosulfonate. 10
63. The water soluble substituted acetophenone of claim 62, wherein X is Br or I.
64. The water soluble derivative of the substituted acetophenone of claim 63, wherein R 1 is SO 2 H, R 2 -R 5 are H, and R 1 '- R 6 ' are "C; R, is H, R 2 is SO 2 H, R 3 -R 5 are H, and R 1 '- R 6 ' are 12C; 15 R..2 are H, R 3 is SO 2 H, R 4 -R 5 are H, and R 1 '- R 6 ' are 12C; R1 is SO 3 H, R 2 -R 5 are H, and R 1 '- R 6 ' are 12C; RI is H, R 2 is SO 3 H, R 3 -R 5 are H, and R1'- R 6 ' are 12C; or R,- 2 are H, R 3 is SO 3 H, Re-R 5 are H, and R 1 '- R 6 ' are "C. 20
65. An isotopically encoded form of the water soluble substituted acetophenone of claim 61.
66. The isotopically encoded water soluble substituted acetophenone of claim 64, wherein X is any halogen, sulfonic ester, perchlorate ester or chlorosulfonate. 25
67. The isotopically encoded water soluble substituted acetophenone of claim 66, wherein R 1 is SO 2 H, at least one of, and preferably at least three of R 2 - R 5 are 2H, and R 1 '-R 6 ' are 2c; WO 2004/046731 PCT/US2003/036739 - 72 R 1 is SO 2 H, R 2 -R 5 are H, and at least one of, and preferably at least three of R 1 '-R 6 ' are 1c; R 2 is SO 2 H, at least one of, and preferably at least three of R 1 and R 3 -R 5 are 2 H, and R 1 '-R 6 ' are 12C; 5 R 2 is SO 2 H, R 1 and R 3 -R 5 are H, and at least one of, and preferably at least three of R 1 '-R 6 ' are 1C; R 3 is SO 2 H, at least one of, and preferably at least three of Ri-R 2 and R 4 - R 5 are 2H, and R 1 '-R 6 ' are "C; R 3 is SO 2 H, R 1 -R 2 and R 4 - R 5 are H, and at least one of, and preferably at least three 10 of R 1 '-R 6 ' are 13c; R 1 is SO 3 H, at least one of, and preferably at least three of R 2 - R 5 are 2H, and R 1 '-R 6 ' are 12c; R, is SO 3 H, R 2 -R 5 are H, and at least one of, and preferably at least three of R 1 '-R 6 ' are "C; 15 R 2 is SO 3 H, at least one of, and preferably at least three of R 1 and R 3 -R 5 are 2 H, and Rl'-R 6 ' are 1C; R 2 is SO 3 H, R 1 and R 3 -R 5 are H, and at least one of, and preferably at least three of R 1 '-R 6 ' are 1C; R 3 is SO 3 H, at least one of, and preferably at least three of R 1 -R 2 and R 4 - R 5 are 2H, 20 and R 1 '-R 6 ' are 12C; R 3 is SO 3 H, R 1 -R 2 and R 4 -R 5 are H, and at least one of, and preferably at least three of R 1 '-R 6 ' are 13 C.
68. A reagent kit for analysis of amino acids, peptides or proteins by mass spectrometry 25 comprising a container containing the substituted acetophenone of any of claims 56-67.
69. A reagent kit for analysis of amino acids, peptides or proteins by mass spectrometry comprising a container containing a substituted acetophenone, or a salt thereof, or a solvate thereof, having the following formula: 30 WO 2004/046731 PCT/US2003/036739 -73 0 R1 Il 1 X 'C R1' R2 CH 2 R 6 R 2 ~R 5 ' R 3 K R 5 R 4 R R 3 R4 wherein X is a halide. 5
70. The reagent kit of claim 69, wherein X is Br or I.
71. The reagent kit of claim 69 or 70, wherein R 1 -R 5 are H, and R 1 '-R 6 ' are 1 2 C.
72. The reagent kit of claim 69, wherein the substituted acetophenone is an isotopically 10 encoded substituted acetophenone, or a salt thereof, or a solvate thereof.
73. The reagent kit of claim 72, wherein at least one of, and preferably at least three of R1-Rs are 2 H, and R 1 '-R 6 ' are 1 2 C. 15
74. The reagent kit of claim 72, wherein R 1 -R 5 are H, and at least one of, and preferably at least three of R 1 '-R 6 ' are 1 3 C.
75. The reagent kit of claim 68-74, further comprising one or more containers containing: cysteine disulfide reducing agents, cysteine alkylating reagents, proteases or chemical 20 cleavage agents, and solvents.
76. The reagent kit of claim 75, wherein the cysteine disulfide reducing agents are: dithiothreitol (DTT), f-mercaptoethanol, tris-carboxyethyl phosphine (TCEP), and/or tributylphosphine (TBP). 25
77. The reagent kit of claim 75, wherein the cysteine alkylating reagents are alkylhalides (e.g. iodoacetic acid, iodoacetamide), vinylpyridine or acrylamide. WO 2004/046731 PCT/US2003/036739 -74
78. The reagent kit of claim 75, wherein the proteases or chemical cleavage agents are trypsin, Endoproteinase Lys-C, Endoproteinase Asp-N, Endoproteinase Glu-C, pepsin, papain, thermolysin, cyanogen bromide, hydroxylamine hydrochloride, 2-[2' nitrophenylsulfenyl]-3-methyl-3'-bromoindole (BNPS-skatole), iodosobenzoic acid, 5 pentafluoropropionic acid and/or dilute hydrochloric acid.
79. The reagent kit of claim 75, wherein the solvents are urea, guanidine hydrochloride, acetonitrile, methanol and/or water. 10
80. An amino acid or peptide comprising an amino acid derivatized to include a side chain fixed-charge sulfonium ion, quaternary alkylammonium ion or quaternary alkylphosphonium ion.
81. The amino acid or peptide of claim 80, wherein the amino acid is derivatized using 15 the substituted acetophenone of any of claims 55-66.
82. The amino acid or peptide of claim 80, wherein the amino acid is derivatized using a substituted acetophenone, or a salt thereof, or a solvate thereof, having the following formula: 0 R1 CH C R R'R I I I .x. R, R, R R R 4 ' R 3 20 R4 wherein X is a halide.
83. The amino acid or peptide of claim 82, wherein X is Br or I. 25
84. The amino acid or peptide of claim 82 or 83, wherein Rr--R 5 are H, and R 1 '--R 6 ' are 1 2 c. WO 2004/046731 PCT/US2003/036739 - 75
85. The amino acid or peptide of claim 82, wherein the substituted acetophenone is an isotopically encoded substituted acetophenone, or a salt thereof, or a solvate thereof.
86. The amino acid or peptide of claim 85, wherein at least one of, and preferably at least 5 three of R 1 -R 5 are 2H, and R 1 '-R 6 ' are "C.
87. The amino acid or peptide of claim 85, wherein R,-R 5 are H, and at least one of, and preferably at least three of R 1 '-R 6 ' are 13 C. 10
88. The amino acid or peptide of any of claims 80-87, wherein the amino acid derivative is isotopically encoded.
89. A method for providing an internal standard in a mass spectrometer method comprising adding to a sample a predetermined quantity of the fixed charge derivatized 15 amino acid or peptide claimed in claim 80-88.
AU2003287673A 2002-11-18 2003-11-18 Method for analysing amino acids, peptides and proteins using mass spectroscopy of fixed charge-modified derivatives Abandoned AU2003287673A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2003287673A AU2003287673A1 (en) 2002-11-18 2003-11-18 Method for analysing amino acids, peptides and proteins using mass spectroscopy of fixed charge-modified derivatives

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
AU2002952747 2002-11-18
AU2002952747A AU2002952747A0 (en) 2002-11-18 2002-11-18 Method for analysing peptides
AU2003287673A AU2003287673A1 (en) 2002-11-18 2003-11-18 Method for analysing amino acids, peptides and proteins using mass spectroscopy of fixed charge-modified derivatives
PCT/US2003/036739 WO2004046731A2 (en) 2002-11-18 2003-11-18 Method for analysing amino acids, peptides and proteins using mass spectroscopy of fixed charge-modified derivatives

Publications (1)

Publication Number Publication Date
AU2003287673A1 true AU2003287673A1 (en) 2004-06-15

Family

ID=34378192

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2003287673A Abandoned AU2003287673A1 (en) 2002-11-18 2003-11-18 Method for analysing amino acids, peptides and proteins using mass spectroscopy of fixed charge-modified derivatives

Country Status (1)

Country Link
AU (1) AU2003287673A1 (en)

Similar Documents

Publication Publication Date Title
US20060094121A1 (en) Method for analysing amino acids, peptides and proteins
Reid et al. Selective identification and quantitative analysis of methionine containing peptides by charge derivatization and tandem mass spectrometry
EP1565752A2 (en) Method for analysing amino acids, peptides and proteins using mass spectroscopy of fixed charge-modified derivates
JP4290003B2 (en) Mass label
EP1397686B1 (en) Method for characterizing polypeptides
US7250266B2 (en) Selective labeling agent for phosphoproteome analysis and phosphorylated site analysis
US20090053817A1 (en) Fixed charge reagents
Zhou et al. Electron transfer dissociation of modified peptides and proteins
WO2006023398A2 (en) Modular isotope labelled mass spectrometry reagents and methods for quantitation of amino acids, peptides and proteins
US20050042713A1 (en) Characterising polypeptides
EP1267170A1 (en) Method for characterising polypeptides
Czeszak et al. Identification of substituted sites on MUC5AC mucin motif peptides after enzymatic O‐glycosylation combining β‐elimination and fixed‐charge derivatization
US7629744B2 (en) In-gel tagging and in-gel digestion for phosphoproteins analysis and phosphorylation site identification
AU2003287673A1 (en) Method for analysing amino acids, peptides and proteins using mass spectroscopy of fixed charge-modified derivatives
AU2002310611B2 (en) Method for characterizing polypeptides
AU2002310611A1 (en) Method for characterizing polypeptides
AU2002310610A1 (en) Characterising polypeptides
AU2002302837A1 (en) Characterising polypeptides
Chen Development and Applications of Mass Spectrometric Methods for Proteome Analysis and Protein Sequence Characterization

Legal Events

Date Code Title Description
MK1 Application lapsed section 142(2)(a) - no request for examination in relevant period