EP3345003A1 - Identification de protéines en l'absence d'identifications de peptides - Google Patents

Identification de protéines en l'absence d'identifications de peptides

Info

Publication number
EP3345003A1
EP3345003A1 EP16840911.8A EP16840911A EP3345003A1 EP 3345003 A1 EP3345003 A1 EP 3345003A1 EP 16840911 A EP16840911 A EP 16840911A EP 3345003 A1 EP3345003 A1 EP 3345003A1
Authority
EP
European Patent Office
Prior art keywords
theoretical
protein
product ions
ion
product
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP16840911.8A
Other languages
German (de)
English (en)
Other versions
EP3345003A4 (fr
Inventor
James Andrew SHERMAN
Stephen A. Tate
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
DH Technologies Development Pte Ltd
Original Assignee
DH Technologies Development Pte Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by DH Technologies Development Pte Ltd filed Critical DH Technologies Development Pte Ltd
Publication of EP3345003A1 publication Critical patent/EP3345003A1/fr
Publication of EP3345003A4 publication Critical patent/EP3345003A4/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • G01N33/6848Methods of protein analysis involving mass spectrometry
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/10Signal processing, e.g. from mass spectrometry [MS] or from PCR
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01JELECTRIC DISCHARGE TUBES OR DISCHARGE LAMPS
    • H01J49/00Particle spectrometers or separator tubes
    • H01J49/0027Methods for using particle spectrometers
    • H01J49/0036Step by step routines describing the handling of the data generated during a measurement

Definitions

  • Various embodiments relate generally to tandem mass spectrometry. More
  • DIA data independent acquisition
  • a common problem in proteomics is determining the identity of protein present in the sample.
  • proteins are identified in a sample using a two-step tandem mass spectrometry process.
  • the proteins in the sample are digested using an enzyme such a trypsin, producing one or more peptides for each proteins.
  • a peptide as used herein, is a digested portion of a protein.
  • Some proteins can be digested intact, so a peptide can also be the entire protein. However, in most cases peptides are digested portions of proteins.
  • the peptides digested from proteins are then separated from the sample over time using a sample introduction device or separation device.
  • the separated peptides are then ionized using an ion source.
  • the ionized peptides, or peptide precursor ions are selected by mass-to-charge ratio (m/z), the selected precursor ions are fragmented, and the resulting product ions are mass analyzed using a tandem mass spectrometer.
  • the result of the first step is a collection of one or more product ion mass spectra measured at one or more different times.
  • the known proteins are obtained from a database, and are computationally digested using the same enzyme used in the tandem mass spectrometry experiment, producing one or more theoretical peptides for each known protein.
  • Theoretical peptides are computationally selected and fragmented, producing theoretical product ions for each known protein.
  • the resulting theoretical product ions are then compared to each of the one or more measured product ion mass spectra at each of the one or more different times.
  • known proteins are scored based on how well their theoretical product ions match the one or more measured product ion mass spectra.
  • the proteins in the sample are then identified from the highest scoring known proteins.
  • a second problem with the traditional method of identifying proteins in a sample is that there is no way to establish increasing quantifiable levels of confidence (since traditional peptide ID is non-deterministic).
  • a known protein may be identified in comparison to other known proteins with a 95% confidence level.
  • Such levels of confidence do not allow tandem mass spectrometry identifications to be used in clinical analysis. Error rates required for clinical use are typically on the order of 0.01% (1 in 10,000) or 0.001% (1 in 100,000), for example.
  • a third problem with the traditional method of identifying proteins in a sample is that, for DIA workflows, the traditional identification method typically requires additional precursor ion information to disambiguate convolved experimental product ions. In other words, the traditional method cannot reliably identify proteins directly from DIA product ion data without peptide precursor ion identification information.
  • DIA is a tandem mass spectrometry workflow. In general, tandem mass
  • tandem mass spectrometry involves ionization of one or more compounds from a sample, selection of one or more precursor ions of the one or more compounds, fragmentation of the one or more precursor ions into product ions, and mass analysis of the product ions. Tandem mass spectrometry can provide both qualitative and quantitative information.
  • the product ion spectrum can be used to identify a molecule of interest.
  • the intensity of one or more product ions can be used to quantitate the amount of the compound present in a sample.
  • a large number of different types of experimental methods or workflows can be performed using a tandem mass spectrometer.
  • Three broad categories of these workflows are, targeted acquisition, information dependent acquisition (IDA) or data dependent acquisition (DDA), and DIA.
  • one or more transitions of a peptide precursor ion to a product ion are predefined for one or more proteins.
  • the one or more transitions are interrogated during each time period or cycle of a plurality of time periods or cycles.
  • the mass spectrometer selects and fragment the peptide precursor ion of each transitions and performs a targeted mass analysis for the product ion of the transition.
  • a mass spectrum is produced for each transitions.
  • Targeted acquisition methods include, but are not limited to, multiple reaction monitoring (MRM) and selected reaction monitoring (SRM).
  • IDA is a flexible tandem mass spectrometry method in which a user can specify criteria for performing targeted or untargeted mass analysis of product ions while a sample is being introduced into the tandem mass spectrometer.
  • a precursor ion or mass spectrometry (MS) survey scan is performed to generate a precursor ion peak list.
  • the user can select criteria to filter the peak list for a subset of the precursor ions on the peak list.
  • MS/MS is then performed on each precursor ion of the subset of precursor ions.
  • a product ion spectrum is produced for each precursor ion.
  • MS/MS is repeatedly performed on the precursor ions of the subset of precursor ions as the sample is being introduced into the tandem mass spectrometer.
  • DIA methods have been used to increase the reproducibility and comprehensiveness of data collection from complex samples.
  • DIA methods can also be called non-specific fragmentation methods.
  • the actions of the tandem mass spectrometer are not varied among MS/MS scans based on data acquired in a previous precursor or product ion scan. Instead a precursor ion mass range is selected.
  • a precursor ion mass selection window is then stepped across the precursor ion mass range. All precursor ions in the precursor ion mass selection window are fragmented and all of the product ions of all of the precursor ions in the precursor ion mass selection window are mass analyzed.
  • the precursor ion mass selection window used to scan the mass range can be very narrow so that the likelihood of multiple precursors within the window is small.
  • This type of DIA method is called, for example, MS/MS ⁇ .
  • a precursor ion mass selection window of about 1 amu is scanned or stepped across an entire mass range.
  • a product ion spectrum is produced for each 1 amu precursor mass window.
  • a product ion spectrum for the entire precursor ion mass range is produced by combining the product ion spectra for each mass selection window.
  • the time it takes to analyze or scan the entire mass range once is referred to as one scan cycle. Scanning a narrow precursor ion mass selection window across a wide precursor ion mass range during each cycle, however, is not practical for some instruments and experiments.
  • a larger precursor ion mass selection window, or selection window with a greater width is stepped across the entire precursor mass range.
  • This type of DIA method is called, for example, SWATHTM acquisition.
  • SWATHTM acquisition the precursor ion mass selection window stepped across the precursor mass range in each cycle may have a width of 5-25 amu, or even larger.
  • all the precursor ions in each precursor ion mass selection window are fragmented, and all of the product ions of all of the precursor ions in each mass isolation window are mass analyzed.
  • the cycle time can be significantly reduced in comparison to the cycle time of the MS/MS ALL method.
  • U.S. Patent No. 8,809,770 describes how SWATHTM acquisition can be used to provide quantitative and qualitative information about the precursor ions of compounds of interest.
  • the product ions found from fragmenting a precursor ion mass selection window are compared to a database of known product ions of compounds of interest.
  • ion traces or extracted ion chromatograms (XICs) of the product ions found from fragmenting a precursor ion mass selection window are analyzed to provide quantitative and qualitative information.
  • SWATHTM acquisition can be difficult, because there is no peptide precursor ion information provided with a precursor ion mass selection window to help determine the precursor ion that produces each product ion. In addition, because there is no peptide precursor ion information provided with a precursor ion mass selection window, it is also difficult to determine if a product ion is convolved with or includes contributions from multiple precursor ions within the precursor ion mass selection window.
  • the mass spectrometry industry currently is unable to deterministically identify proteins in a sample from experimental product ion data alone. Also, the mass spectrometry industry currently is unable to perform identifications of proteins from any type of experimental data with increasing levels of confidence. Finally, the mass spectrometry industry currently is unable to reliably identify proteins in a sample produced by a DIA tandem mass spectrometry method from product ion data without the aid of any precursor ion data.
  • a system for deterministically identifying to a desired confidence probability a known protein of a sample from other known proteins of the sample using a combination of two or more exclusive combinations of theoretical product ions as an error detection and correction code in a tandem mass spectrometry data independent acquisition (DIA) method that does not provide peptide identifying information.
  • the system includes a separation device, an ion source, a tandem mass spectrometer, and a processor.
  • the separation device separates peptides of two or more known proteins from a sample over time.
  • the ion source receives the peptides from the separation device and ionizes the peptides, producing an ion beam of precursor ions.
  • the tandem mass spectrometer receives the ion beam.
  • the tandem mass spectrometer divides a mass-to-charge ratio (m/z) range of the ion beam into two or more precursor ion mass selection windows.
  • the tandem mass spectrometer selects and fragments the two or more precursor ion mass selection windows during each cycle of a plurality of cycles, producing a plurality of measured product ion spectra.
  • the processor receives the plurality of measured product ion spectra from the tandem mass spectrometer.
  • the processor receives a desired confidence probability for the identification of at least one known protein of two or more known proteins.
  • the processor calculates two or more combinations of N theoretical product ions that are exclusive to the at least one known protein.
  • the processor identifies the at least one known protein by finding product ions in the plurality of measured product ion spectra that match two or more of the two or more combinations of N theoretical product ions that provide a combined confidence probability greater than or equal to the desired confidence probability.
  • a method is disclosed for deterministically identifying a known protein of a sample from other known proteins of the sample using a tandem mass spectrometry DIA method that does not provide peptide identifying information.
  • a plurality of measured product ion spectra are received from a tandem mass spectrometer using a processor.
  • the plurality of measured product ion spectra are produced by the tandem mass spectrometer by dividing an m/z range of an ion beam into two or more precursor ion mass selection windows and selecting and fragmenting the two or more precursor ion mass selection windows during each cycle of a plurality of cycles.
  • the ion beam is produced by an ion source that ionizes peptides of two or more known proteins, producing an ion beam of precursor ions.
  • the peptides are separated from a sample by a separation device.
  • a desired confidence probability for the identification of at least one known protein of two or more known proteins is received using the processor.
  • Two or more combinations of N theoretical product ions that are exclusive to the at least one known protein are calculated using the processor.
  • the at least one known protein is identified by finding product ions in the plurality of measured product ion spectra that match two or more of the two or more combinations of N theoretical product ions that provide a combined confidence probability greater than or equal to the desired confidence probability using the processor.
  • a computer program product includes a non-transitory and tangible computer-readable storage medium whose contents include a program with instructions being executed on a processor so as to perform a method for deterministically identifying to a desired confidence probability a known protein of a sample from other known proteins of the sample using a combination of two or more exclusive combinations of theoretical product ions as an error detection and correction code in a tandem mass spectrometry DIA method that does not provide peptide identifying information.
  • the method includes providing a system, wherein the system comprises one or more distinct software modules, and wherein the distinct software modules comprise a measurement module and an analysis module.
  • the measurement module receives a plurality of measured product ion spectra from a tandem mass spectrometer.
  • the plurality of measured product ion spectra are produced by the tandem mass spectrometer by dividing an m/z range of an ion beam into two or more precursor ion mass selection windows and selecting and fragmenting the two or more precursor ion mass selection windows during each cycle of a plurality of cycles.
  • the ion beam is produced by an ion source that ionizes peptides of two or more known proteins, producing an ion beam of precursor ions.
  • the peptides are separated from a sample by a separation device.
  • the analysis module receives a desired confidence probability for the identification of at least one known protein of two or more known proteins.
  • the analysis module calculates two or more combinations of N theoretical product ions that are exclusive to the at least one known protein.
  • the analysis module identifies the at least one known protein by finding product ions in the plurality of measured product ion spectra that match two or more of the two or more combinations of N theoretical product ions that provide a combined confidence probability greater than or equal to the desired confidence probability.
  • Figure 1 is a block diagram that illustrates a computer system, upon which
  • Figure 2 is an exemplary diagram of a precursor ion mass-to-charge ratio (m/z) range that is divided into ten precursor ion mass selection windows for a data independent acquisition (DIA) workflow, in accordance with various embodiments.
  • m/z precursor ion mass-to-charge ratio
  • Figure 3 is an exemplary diagram that graphically depicts the steps for obtaining product ion traces or XICs from each precursor ion mass selection window during each cycle of a DIA workflow, in accordance with various embodiments.
  • Figure 4 is an exemplary diagram that shows the three-dimensionality of an XIC obtained for a precursor ion mass selection window over time, in accordance with various embodiments.
  • Figure 5 is an exemplary diagram that graphically depicts the steps for calculating exclusive addresses for known proteins in a sample, in accordance with various embodiments.
  • Figure 6 is an exemplary diagram that graphically depicts the step for identifying known proteins in a sample by using their exclusive addresses, in accordance with various embodiments.
  • Figure 7 is a schematic diagram of a system for deterministically identifying to a desired confidence probability a known protein of a sample from other known proteins of the sample using a combination of two or more exclusive combinations of theoretical product ions as an error detection and correction code in a tandem mass spectrometry DIA method that does not provide peptide identifying information, in accordance with various embodiments.
  • Figure 8 is a flowchart showing a method for deterministically identifying to a desired confidence probability a known protein of a sample from other known proteins of the sample using a combination of two or more exclusive combinations of theoretical product ions as an error detection and correction code in a tandem mass spectrometry DIA method that does not provide peptide identifying information, in accordance with various embodiments.
  • Figure 9 is a schematic diagram of a system that includes one or more distinct software modules that performs a method for deterministically identifying to a desired confidence probability a known protein of a sample from other known proteins of the sample using a combination of two or more exclusive combinations of theoretical product ions as an error detection and correction code in a tandem mass spectrometry DIA method that does not provide peptide identifying information, in accordance with various embodiments.
  • FIG. 1 is a block diagram that illustrates a computer system 100, upon which embodiments of the present teachings may be implemented.
  • Computer system 100 includes a bus 102 or other communication mechanism for communicating information, and a processor 104 coupled with bus 102 for processing information.
  • Computer system 100 also includes a memory 106, which can be a random access memory (RAM) or other dynamic storage device, coupled to bus 102 for storing instructions to be executed by processor 104.
  • Memory 106 also may be used for storing temporary variables or other intermediate information during execution of instructions to be executed by processor 104.
  • Computer system 100 further includes a read only memory (ROM) 108 or other static storage device coupled to bus 102 for storing static information and instructions for processor 104.
  • ROM read only memory
  • a storage device 110 such as a magnetic disk or optical disk, is provided and coupled to bus 102 for storing information and instructions.
  • Computer system 100 may be coupled via bus 102 to a display 112, such as a cathode ray tube (CRT) or liquid crystal display (LCD), for displaying information to a computer user.
  • a display 112 such as a cathode ray tube (CRT) or liquid crystal display (LCD)
  • An input device 114 is coupled to bus 102 for communicating information and command selections to processor 104.
  • cursor control 116 is Another type of user input device, such as a mouse, a trackball or cursor direction keys for communicating direction information and command selections to processor 104 and for controlling cursor movement on display 112.
  • This input device typically has two degrees of freedom in two axes, a first axis (i.e., x) and a second axis (i.e., y), that allows the device to specify positions in a plane.
  • a computer system 100 can perform the present teachings. Consistent with
  • results are provided by computer system 100 in response to processor 104 executing one or more sequences of one or more instructions contained in memory 106. Such instructions may be read into memory 106 from another computer-readable medium, such as storage device 110. Execution of the sequences of instructions contained in memory 106 causes processor 104 to perform the process described herein. Alternatively hard-wired circuitry may be used in place of or in combination with software instructions to implement the present teachings. Thus implementations of the present teachings are not limited to any specific combination of hardware circuitry and software.
  • computer system 100 can be connected to one or more other computer systems, like computer system 100, across a network to form a networked system.
  • the network can include a private network or a public network such as the Internet.
  • one or more computer systems can store and serve the data to other computer systems.
  • the one or more computer systems that store and serve the data can be referred to as servers or the cloud, in a cloud computing scenario.
  • the one or more computer systems can include one or more web servers, for example.
  • the other computer systems that send and receive data to and from the servers or the cloud can be referred to as client or cloud devices, for example.
  • Non-volatile media includes, for example, optical or magnetic disks, such as storage device 110.
  • Volatile media includes dynamic memory, such as memory 106.
  • Transmission media includes coaxial cables, copper wire, and fiber optics, including the wires that comprise bus 102.
  • a floppy disk a flexible disk, hard disk, magnetic tape, or any other magnetic medium
  • a CD-ROM digital video disc (DVD), a Blu-ray Disc, any other optical medium
  • a thumb drive a memory card, a RAM, PROM, and EPROM, a FLASH-EPROM, any other memory chip or cartridge, or any other tangible medium from which a computer can read.
  • Various forms of computer readable media may be involved in carrying one or more sequences of one or more instructions to processor 104 for execution.
  • the instructions may initially be carried on the magnetic disk of a remote computer.
  • the remote computer can load the instructions into its dynamic memory and send the instructions over a telephone line using a modem.
  • a modem local to computer system 100 can receive the data on the telephone line and use an infra-red transmitter to convert the data to an infra-red signal.
  • An infra-red detector coupled to bus 102 can receive the data carried in the infra-red signal and place the data on bus 102.
  • Bus 102 carries the data to memory 106, from which processor 104 retrieves and executes the instructions.
  • the instructions received by memory 106 may optionally be stored on storage device 110 either before or after execution by processor 104.
  • instructions configured to be executed by a processor to perform a method are stored on a computer-readable medium.
  • the computer-readable medium can be a device that stores digital information.
  • a computer-readable medium includes a compact disc read-only memory (CD-ROM) as is known in the art for storing software.
  • CD-ROM compact disc read-only memory
  • the computer- readable medium is accessed by a processor suitable for executing instructions configured to be executed.
  • proteins are deterministically identified in a sample from experimental product ion data alone. This is done by calculating for each protein in the sample one or more combinations of two or more product ions that are exclusive to the protein and comparing the one or more combinations to the experimental product ion data. For each protein, each of the one or more combinations can be thought of as an address or code for the protein. The terms address and code are used interchangeably herein. As a result, one or more unique addresses or codes are calculated for each known protein in the sample.
  • proteins are deterministically identified in a sample with increasing confidence by manipulating the addresses or codes exclusive to each protein according to information theory.
  • An address or code is deterministic if has enough information to differentiate the protein from all other proteins in the group.
  • deterministic addresses can further be manipulated to include more information to increase the confidence in the identification.
  • each address can include more product ions, or two or more addresses can be used together.
  • proteins are reliably and deterministically identified with increasing levels of confidence from a sample produced by a DIA tandem mass spectrometry method from product ion data without the aid of any precursor ion data.
  • DIA tandem mass spectrometry methods increase the reproducibility and comprehensiveness of data collection from complex samples. They collect product ion spectra for all precursor ions within a large m/z range.
  • the proteins in a sample are reliably and deterministically identified with increasing levels of confidence and without the aid of any precursor ion data or precursor ion identification.
  • SRM selected reaction monitoring
  • Sherman et al. calculated addresses or codes exclusive to peptides and not to proteins. In other words, they found sets of precursor ion to product ion transitions are exclusive to peptides. The various embodiments described herein calculate sets of precursor ion to product ion transitions that are exclusive to proteins. As a result, an address of protein can include precursor ion to product ion transitions that are not exclusive to any of it peptides.
  • Sherman et al. do not describe any specific solutions adapted from the field of signal analysis. They also do not provide how such solutions can be adapted to peptide identification let alone to protein identification.
  • DIA methods provide a larger amount of data than targeted acquisition methods like SRM.
  • DIA methods do not provide information about the specific precursor ions from which each product ion is produced.
  • FIG. 2 is an exemplary diagram 200 of a precursor ion mass-to-charge ratio (m/z) range that is divided into ten precursor ion mass selection windows for a data independent acquisition (DIA) workflow, in accordance with various embodiments.
  • the m/z range shown in Figure 2 is 200 m/z. Note that the terms “mass” and “m/z” are used interchangeably herein. Generally, mass spectrometry measurements are made in m/z and converted to mass by multiplying charge.
  • Each of the ten precursor ion mass selection or isolation windows spans or has a width of 20 m/z. Three of the ten precursor ion mass selection windows, windows 201, 202, and 210, are shown in Figure 2.
  • Precursor ion mass selection windows 201, 202, and 210 are shown as non-overlapping windows with the same width.
  • precursor ion mass selection windows can overlap and/or can have variable widths.
  • U.S. Patent Application No. 14/401,032 describes using overlapping precursor ion mass selection windows in a single cycle of SWATHTM acquisition, for example.
  • U.S. Patent No. 8,809,772 describes using precursor ion mass selection windows with variable widths in a single cycle of SWATHTM acquisition using variable precursor ion mass selection windows in SWATHTM acquisition, for example.
  • each of the ten precursor ion mass selection windows is selected and then fragmented, producing ten product ion spectra for the entire m/z range shown in Figure 2.
  • FIG. 2 depicts non-variable and non-overlapping precursor ion mass selection windows used in a single cycle of an exemplary SWATHTM acquisition.
  • a tandem mass spectrometer that can perform a SWATHTM acquisition method can further be coupled with a sample introduction device.
  • the proteins of a sample are typically digested using an enzyme, such as trypsin before the sample is introduced into the tandem mass spectrometer.
  • the sample introduction device separates one or more proteins digested proteins, or peptides, from the sample over time, for example.
  • a sample introduction device can introduce a sample to the tandem mass spectrometer using a technique that includes, but is not limited to, injection, liquid chromatography, gas
  • the separated one or more peptides are ionized by an ion source, producing an ion beam of precursor ions of the one or more proteins that are selected and fragmented by the tandem mass spectrometer.
  • each of the ten precursor ion mass selection windows is selected and then fragmented, producing ten product ion spectra for the entire m/z range.
  • each of the ten precursor ion mass selection windows is selected and then fragmented during each cycle of a plurality of cycles.
  • Figure 3 is an exemplary diagram 300 that graphically depicts the steps for obtaining product ion traces or XICs from each precursor ion mass selection window during each cycle of a DIA workflow, in accordance with various embodiments. For example, ten precursor ion mass selection windows, represented by precursor ion mass selection windows 201, 202, and 210 in Figure 3, are selected and fragmented during each cycle of a total of 1000 cycles.
  • product ion spectrum 311 is obtained by fragmenting precursor ion mass selection window 201 during cycle 1
  • product ion spectrum 312 is obtained by fragmenting precursor ion mass selection window 201 during cycle 2
  • product ion spectrum 313 is obtained by fragmenting precursor ion mass selection window 201 during cycle 1000.
  • XICs are obtained for each precursor ion mass selection window.
  • XIC 320 is calculated from the 1000 product ion spectra of precursor ion mass selection window 201.
  • XIC 320 includes XIC peaks or traces for all of the product ions that are produced from fragmenting precursor ion mass selection window 201 during the 1000 cycles. Note that XICs can be plotted in terms of time or cycles.
  • XIC 320 is shown plotted in two dimensions in Figure 3. However, each XIC of each precursor ion mass selection window is actually three-dimensional, because the different XIC peaks represent different m/z values.
  • Figure 4 is an exemplary diagram 400 that shows the three-dimensionality of an
  • XIC obtained for a precursor ion mass selection window over time in accordance with various embodiments.
  • the x axis is time or cycle number
  • the y axis is product ion intensity
  • the z axis is m/z. From this three-dimensional plot, more information is obtained.
  • XIC peaks 410 and 420 both have the same shape and occur at the same time, or same retention time. However, XIC peaks 410 and 420 have different m/z values. This may mean that XIC peaks 410 and 420 are isotopic peaks or represent different product ions from the same precursor ion. Similarly, XIC peaks 430 and 440 have the same m/z value, but occur at different times. This may mean that XIC peaks 430 and 440 are the same product ion, but they are from two different precursor ions.
  • proteins Due to the three-dimensional nature of DIA XICs, proteins can also be identified based on the elution order and time between product ions. As a result, in various embodiments, addresses or codes exclusive to proteins can also include timing information.
  • the XIC for each precursor ion mass selection window can be filtered so that it includes only product ion XIC peaks produced from peptides. For example, XIC peaks with intensities below predetermined threshold intensity and non-isotopic XIC peaks can be excluded from each XIC.
  • the charge state of product ions of peptides can be determined through high resolution mass spectrometry. This is done, for example, by determining the m/z difference between isotopic peaks. As a result, the mass or molecular weight of product ions of peptides can be determined from high resolution mass spectrometry. In various embodiments, therefore, product ion data can be expressed as ion mass or molecular weight rather than m/z.
  • proteins in a sample are identified by comparing theoretical product ions of known proteins to the product ion experimental data.
  • Theoretical product ions are computationally generated from stored information about the one or more known proteins expected to be the sample. This stored information can be stored in many different forms including, but not limited to, databases and flat files.
  • stored information about known proteins or peptides is obtained from a FASTA file.
  • the FASTA file is parsed.
  • the proteins parsed from the FASTA file are then computationally digested using the same enzyme used to digest the sample in the experiments.
  • Computational digestion of the one or more known proteins produces one or more theoretical peptides, or one or more peptide precursor ions, for each protein.
  • Theoretical product ions for each protein are obtained by computationally fragmenting theoretical peptide precursor ions of each protein. For example, theoretical product ions are obtained by selecting the b and j fragments of theoretical peptide precursor ions.
  • theoretical peptides and product ions of each protein are compared with theoretical peptides and product ions of every other protein expected in the sample in order to determine an exclusive combination of theoretical peptides and product ions for each protein.
  • a sample may include three known proteins, Pi, Pi, and j.
  • the z 'th protein is denoted as Pi.
  • the / h theoretical peptide is denoted as pj.
  • Theoretical peptide pj has fragments or product ions denoted as f(j,i) ... fo,nj . Protein and peptide indexes are independent.
  • Pi cannot be identified, because it is composed entirely of shared theoretical peptides pi, pi, and p3. Also, if there is insufficient information to confidently identify theoretical peptides p4 and ps, proteins Pi and j also cannot be identified. Peptide identifications cannot be used to identify proteins Pi and j, because there is no way to determine that one source of pi was Pi and the other source of pi was Pi. In addition, if there is also only partial fragmentation, the peptide may not be identifiable. Differentiating from greater than one million peptides is a significantly more difficult problem that differentiating one protein from 20,000 possible proteins.
  • the product ions for each theoretical peptide are: pi ⁇ ⁇ f(i,i),f(i,i) ⁇ ; pi ⁇ ⁇ f(i,i),f(i,i) ⁇ ; P3 ⁇ ⁇ f(3,i),f(3,i) ⁇ ; p4 ⁇ ps ⁇ ⁇ f(5,i),f(5,i) ⁇ .
  • a product ion combination length for the proteins is chosen. The length can be two or more product ions. In this example, the length is three.
  • choosing the length of the product ion combinations can be an iterative process. A short length can be chosen first, and the length can be increased until exclusive combinations can be found for the proteins of interest. [0079] In this example, the length is chosen to be three product ions. Every permutation of three product ions for each of proteins P2 and j is then calculated. Each combination is then compared against every other combination. Shared combinations or addresses are discarded.
  • Table 1 shows some of the permutations or combinations, Uk, calculated for proteins P2 and j that are exclusive to proteins P2 and j.
  • Figure 5 is an exemplary diagram 500 that graphically depicts the steps for
  • Step 510 the sequences of known or expected proteins in a sample are obtained from stored information 501 about proteins.
  • Stored information 501 can be, but is not limited to, a flat file or database. As show in Figure 5, and like the example described above, three protein sequences, Pi,P2, andP3 are obtained, parsed, or queried from stored information 501.
  • each of the three protein sequences are computationally digested based on the same enzyme used in the mass spectrometry experiment.
  • the three sequences ⁇ , ⁇ , and j are digested into different combinations five peptides, pi,
  • each of the peptides of each protein is computationally fragmented. As shown in Figure 5, each peptide fragments into two product ions. For example, peptide pi fragments into f(i,i) and f(i,2).
  • step 540 every permutation of combinations of three product ions of each protein are calculated. These are the addresses or codes of each protein. The use of three product ions in each combination or address is arbitrary. Generally the number of product ions, or the length of the address, can be increased iteratively.
  • step 550 each combination or address of the known proteins is compared to every other combination or address of the known proteins. Redundant combinations or addresses are removed. The remaining addresses are the exclusive addressed for each protein. Note that Pi has no exclusive addresses. This is because it is composed entirely of shared theoretical peptides pi, p2, and p3, as described above. As a result, Pi cannot be identified.
  • one or more of the addresses are compared to the sample product ion spectra. If an exclusive address of a protein matches the sample product ion spectra, a protein is identified in the sample.
  • FIG 6 is an exemplary diagram 600 that graphically depicts the step for identifying known proteins in a sample by using their exclusive addresses, in accordance with various embodiments.
  • Step 610 for example, one or more of the exclusive addresses of proteins P2 and j are compared to the product ion spectral data collected for a sample using a DIA method.
  • the exclusive addresses of proteins P2 and P3 are the same exclusive addresses shown in Figure 5, and the product ion spectral data is the same data shown in Figure 3, for example.
  • exclusive addresses calculated for known proteins are used to provide clinical levels of confidence for protein identifications. More specifically exclusive addresses calculated for known proteins are used as error correcting codes to increase the levels of confidence for protein identifications.
  • Error correcting codes are a part of information theory that has long been used in the field of communications. More specifically, error correcting codes have been used to improve the communication of digital signals between a transmitter and a receiver through a channel that includes noise. At some point the digital ones and zeros sent through the channel get corrupted along the way.
  • An exemplary error correcting code is simply to send each digital packet from the transmitter two or more times through the channel. The redundancy of the packets is increased until the uncertainty of the digital message is driven below some small level, for example. Error correcting codes could be applied to protein identification using mass spectrometry by, for example, repeatedly rerunning the same sample. This is like increasing the redundancy of the transmitted message in the field of
  • error correcting codes are applied in tandem mass spectrometry by increasing the redundancy of the identifications. In the field of communications, this is like increasing the number of times a message is read at the receiver.
  • the tandem mass spectrometer is like a transmitter that sends product ion spectra rather than messages.
  • a processor in communication with the tandem mass spectrometer is like the receiver that compares multiple exclusive combinations of product ions of a particular protein to the product ion spectra.
  • Error correcting codes that can transform long lists of exclusive addresses of known proteins into high confidence identifications include, but are not limited to, best-N out of M, Reed-Solomon codes to mass spectrometry (MS)-data, low density parity check (LDPC) codes, convolutional codes, and the Bahl, Cocke, Jelinek and Raviv (BCJR) algorithm.
  • MS mass spectrometry
  • LDPC low density parity check
  • BCJR Raviv
  • the error correcting codes used in protein identification are simply the addresses of exclusive product ions for each known protein.
  • the level of confidence for these codes can be established in a number of ways. For, example, a large set of data can be used and the likelihood of each product ion occurring at random is calculated directly. Alternatively, a likelihood of random occurrence is estimated that errors on the safe side.
  • the likelihood that three product ions are all true is the product of the likelihood of each product ion.
  • the probability of exclusive address Uoi of protein P2 matching the measure product ion spectra is Pr(Uoi), where
  • the likelihood of a protein is increased by identifying more exclusive addresses in the sample data. For example, if addresses Uoi and U04 of protein P2 are identified, the confidence in the identification is given by the probability of Uoi OR Uo4 (logical inclusive disjunction), PrfUoi V U04), where
  • PrfUoi V U04) Pr(Uoi) Pr(U 04 ) + Pr(Uoi) Q(U 04 ) + Q(Uoi) Pr(U 04 )
  • PrfUoi V U04 1 - ⁇ 2 .
  • Table 2 shows how the confidence level of protein identification is increased through the use of increasing numbers of exclusive addresses or codes.
  • each element of an address can be a single product ion from a different peptide of a protein. So, each element has a different m/z value.
  • each m/z value of the address In order for an address to match the measured mass spectra, each m/z value of the address must have an intensity in the measured mass spectra above a certain threshold value. In other words, an intensity threshold level is used to determine if each element of an address is present, if each element of an address is a single product ion from a different peptide of a protein.
  • each element of an address is a pair of product ions from the same peptide of the protein. If pairs of product ions from the same peptide are used as an element of the address, XIC curve subtraction can be used to determine if each element of address is present. XIC curve subtraction is described in U.S. Provisional Patent Application No. 62,112,212, which is incorporated herein by reference. Essentially, XIC curve subtraction allows the presence of a pair of product ions of the peptide to be confirmed by calculating a measure of the difference between the XICs of the two product ions rather than by their measured intensities.
  • every permutation of product ions for the length of address chosen is calculated for each of the known proteins. Then every permutation or address of a protein is compared against every permutation or address of the other known proteins to find the exclusive permutations or addresses for each protein. This method of calculating the exclusive addresses is independent of the measured data.
  • the exclusive addresses are more efficiently found by using the measured data in their calculation. For example, every product ion for each of the known proteins is still computationally generated. However, the list of product ions for each protein is shortened by determining if each product ion is present in the measured data first. Product ions are identified if they are above a certain threshold intensity level, for example. All the permutations of identified product ions for the length of address chosen are then calculated for each of the known proteins, and every permutation or address of identified product ions of a protein is compared against every permutation or address of identified product ions of the other known proteins to find the exclusive permutations or addresses for each protein.
  • Figure 7 is a schematic diagram of a system 700 for deterministically identifying to a desired confidence probability a known protein of a sample from other known proteins of the sample using a combination of two or more exclusive combinations of theoretical product ions as an error detection and correction code in a tandem mass spectrometry DIA method that does not provide peptide identifying information, in accordance with various embodiments.
  • System 700 includes ion source 710, tandem mass spectrometer 720, and processor 730.
  • system 700 can also include separation device 740.
  • Separation device 740 can separate peptides of two or more known proteins from a sample over time using one of a variety of techniques. These techniques include, but are not limited to, ion mobility, gas chromatography (GC), liquid chromatography (LC), capillary electrophoresis (CE), or flow injection analysis (FIA).
  • GC gas chromatography
  • LC liquid chromatography
  • CE capillary electrophoresis
  • FOA flow injection analysis
  • Ion source 710 can be part of tandem mass spectrometer 720, or can be a separate device. Ion source 710 receives the peptides from separation device 740 and ionizes the peptides, producing an ion beam of precursor ions.
  • Tandem mass spectrometer 720 can include, for example, one or more physical mass filters and one or more physical mass analyzers.
  • a mass analyzer of tandem mass spectrometer 720 can include, but is not limited to, a time-of-flight (TOF), quadrupole, an ion trap, a linear ion trap, an orbitrap, or a Fourier transform mass analyzer.
  • Tandem mass spectrometer 720 receives the ion beam from ion source 710.
  • Tandem mass spectrometer 720 divides an m/z range of the ion beam into two or more precursor ion mass selection windows and selects and fragments the two or more precursor ion mass selection windows during each cycle of a plurality of cycles, producing a plurality of measured product ion spectra.
  • Processor 730 can be, but is not limited to, a computer, microprocessor, or any device capable of sending and receiving control signals and data from tandem mass spectrometer 720 and processing data.
  • Processor 730 can be, for example, computer system 100 of Figure 1.
  • processor 730 is in communication with tandem mass spectrometer 720 and separation device 740.
  • Processor 730 performs a number of steps.
  • processor 730 receives the plurality of measured product ion spectra from tandem mass spectrometer 720.
  • processor 730 receives a desired confidence probability for the identification of at least one known protein of two or more known proteins.
  • processor 730 calculates two or more combinations of N theoretical product ions that are exclusive to the at least one known protein.
  • processor 730 identifies the at least one known protein by finding product ions in the plurality of measured product ion spectra that match two or more of the two or more combinations of N theoretical product ions that provide a combined confidence probability greater than or equal to the desired confidence probability.
  • processor 730 identifies the at least one known protein by finding product ions in the plurality of measured product ion spectra the match combinations the two or more combinations of N theoretical product ions.
  • Confidence probabilities are then calculated for the matching combinations. Two or more matching combinations are then combined to provide a combined confidence probability greater than or equal to the desired confidence probability.
  • a confidence probability is calculated for each combination of the two or more of the two or more combinations of N theoretical product ions. Combinations of two or more of the two or more combinations of N theoretical product ions that provide a combined confidence probability greater than or equal to the desired confidence probability are then compared to the plurality of measured product ion spectra to identify the at least one known protein.
  • the two or more of the two or more combinations of N theoretical product ions that match the product ions in the plurality of measured product ion spectra represent an error detection and correction code.
  • processor 730 calculates two or more combinations of N theoretical product ions that are exclusive to the at least one known protein in the set of two of more proteins by performing a number of additional steps.
  • processor 730 retrieves from a memory a sequence for each protein of the two or more known proteins.
  • processor 730 calculates for each sequence of the two or more known proteins one or more theoretical peptides and computationally selects and fragments each theoretical peptide of the one or more theoretical peptides, producing a plurality of theoretical product ions for each protein of the two or more known proteins.
  • processor 730 selects a number, N, of theoretical product ions to be used to identify known proteins.
  • step (civ) from the plurality of theoretical product ions for each protein of the two or more known proteins, processor 730 calculates every different combination of N theoretical product ions, producing one or more combinations for each protein.
  • processor 730 compares each combination of each protein of the two or more known proteins to each combination of every other protein of the two or more known proteins to determine two or more combinations of N theoretical product ions that exclusively identify the at least one known protein of the two or more known proteins.
  • processor 730 further calculates a confidence probability for each of the two or more combinations of N theoretical product ions by directly calculating a likelihood that each product ion of each combination occurs at random from data stored about the product ions.
  • processor 730 further calculates a confidence probability for each of the two or more combinations of N theoretical product ions by estimating a likelihood that any product ion occurs at random, Y, calculating a confidence probability for any product ion as (1 - Y), and calculating a confidence probability for each of the two or more combinations of N theoretical product ions as a product of the confidence probabilities of the product ions of the at least one exclusive combination, (1 - Y) N .
  • processor 730 further iteratively executes steps (ciii)-(cv) and increases the number N in each iteration until the processor further determines in step (cv) two or more combinations of N theoretical product ions that exclusively identify the at least one known protein of the two or more known proteins.
  • step (d) processor 730 finds product ions in the
  • processor 730 further selects a number, N, of theoretical product ion pairs to be used to identify known proteins.
  • processor 730 further, from the plurality of theoretical product ions for each protein of the two or more known proteins, calculates every different combination of N theoretical product ion pairs, producing one or more combinations for each protein, wherein each theoretical ion pair is from the same theoretical peptide.
  • processor 730 further finds product ion pairs in the plurality of measured product ion spectra that match two or more of the two or more combinations of N theoretical product ion pairs that provide a combined confidence probability greater than or equal to the desired confidence probability by performing curve subtraction on two ion extracted chromatograms (XICs) calculated from the plurality of measured product ion spectra for each pair of product ions of two or more of the two or more combinations of N theoretical product ion pairs.
  • XICs ion extracted chromatograms
  • processor 730 compares each product ion of each plurality of theoretical product ions for each protein of the two or more known proteins to the plurality of measured product ion spectra and removes the product ion from the plurality of theoretical product ions if the product ion is not present in the plurality of measured product ion spectra.
  • processor 730 further calculates an elution order for each theoretical peptide of each known protein and stores the elution order with each theoretical product ion calculated from theoretical peptide.
  • processor 730 further uses elution order in comparing each combination of each protein of the two or more known proteins to each combination of every other protein of the two or more known proteins to determine two or more combinations of N theoretical product ions that exclusively identify the at least one known protein of the two or more known proteins.
  • processor 730 finds product ion pairs in the plurality of measured product ion spectra that match two or more of the two or more combinations of N theoretical product ion pairs that provide a combined confidence probability greater than or equal to the desired confidence probability by calculating extracted ion chromatograms (XICs) for product ions of the plurality of measured product ion spectra, comparing the measured intensity levels of the XICs of the product ions of the plurality of measured product ion spectra at the m/z values of the product ions of two or more of the two or more combinations of N theoretical product ions that provide a combined confidence probability greater than or equal to the desired confidence probability to a threshold level that indicates the presence of a product ion, and comparing the retention times of the XICs of the product ions of the plurality of measured product ion spectra at the m/z values of the product ions of the two or more of the two or more combinations of N theoretical product ions to the e
  • XICs extracted i
  • Figure 8 is a flowchart showing a method 800 for deterministically identifying to a desired confidence probability a known protein of a sample from other known proteins of the sample using a combination of two or more exclusive combinations of theoretical product ions as an error detection and correction code in a tandem mass spectrometry DIA method that does not provide peptide identifying information, in accordance with various embodiments.
  • step 810 of method 800 a plurality of measured product ion spectra are
  • the plurality of measured product ion spectra are produced by the tandem mass spectrometer by dividing an m/z range of an ion beam into two or more precursor ion mass selection windows and selecting and fragmenting the two or more precursor ion mass selection windows during each cycle of a plurality of cycles.
  • the ion beam is produced by an ion source that ionizes peptides of two or more known proteins, producing an ion beam of precursor ions.
  • the peptides separated from a sample by a separation device.
  • a desired confidence probability for the identification of at least one known protein of two or more known proteins is received using the processor.
  • step 830 two or more combinations of N theoretical product ions that are
  • step 840 the at least one known protein is identified by finding product ions in the plurality of measured product ion spectra that match two or more of the two or more combinations of N theoretical product ions that provide a combined confidence probability greater than or equal to the desired confidence probability using the processor.
  • computer program products include a tangible computer- readable storage medium whose contents include a program with instructions being executed on a processor so as to perform a method for deterministically identifying to a desired confidence probability a known protein of a sample from other known proteins of the sample using a combination of two or more exclusive combinations of theoretical product ions as an error detection and correction code in a tandem mass spectrometry DIA method that does not provide peptide identifying information.
  • This method is performed by a system that includes one or more distinct software modules.
  • Figure 9 is a schematic diagram of a system 900 that includes one or more distinct software modules that performs a method for deterministically identifying to a desired confidence probability a known protein of a sample from other known proteins of the sample using a combination of two or more exclusive combinations of theoretical product ions as an error detection and correction code in a tandem mass spectrometry DIA method that does not provide peptide identifying information, in accordance with various embodiments.
  • System 900 includes measurement module 910 and analysis module 920.
  • Measurement module 910 receives a plurality of measured product ion spectra from a tandem mass spectrometer.
  • the plurality of measured product ion spectra are produced by the tandem mass spectrometer by dividing an m/z range of an ion beam into two or more precursor ion mass selection windows and selecting and fragmenting the two or more precursor ion mass selection windows during each cycle of a plurality of cycles.
  • the ion beam is produced by an ion source that ionizes peptides of two or more known proteins, producing an ion beam of precursor ions.
  • the peptides are separated from a sample by a separation device.
  • Analysis module 920 receives a desired confidence probability for the
  • Analysis module 920 calculates two or more combinations of N theoretical product ions that are exclusive to the at least one known protein. Analysis module 920 identifies the at least one known protein by finding product ions in the plurality of measured product ion spectra that match two or more of the two or more combinations of N theoretical product ions that provide a combined confidence probability greater than or equal to the desired confidence probability.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Molecular Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Medical Informatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • General Health & Medical Sciences (AREA)
  • Public Health (AREA)
  • Epidemiology (AREA)
  • Evolutionary Computation (AREA)
  • Bioethics (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Databases & Information Systems (AREA)
  • Evolutionary Biology (AREA)
  • Data Mining & Analysis (AREA)
  • Chemical & Material Sciences (AREA)
  • Urology & Nephrology (AREA)
  • Biomedical Technology (AREA)
  • Signal Processing (AREA)
  • Hematology (AREA)
  • Immunology (AREA)
  • Microbiology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Cell Biology (AREA)
  • Food Science & Technology (AREA)
  • Medicinal Chemistry (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • General Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Other Investigation Or Analysis Of Materials By Electrical Means (AREA)

Abstract

La présente invention concerne une pluralité de spectres ioniques de produit mesurés produits au moyen d'un procédé de spectrométrie de masse en tandem avec acquisition indépendante des données pour un échantillon de deux protéines connues ou plus. Une probabilité de confiance souhaitée pour l'identification d'au moins une protéine connue sur deux protéines connues ou plus est reçue. Deux combinaisons ou plus de N ions de produit théoriques qui sont exclusifs à la ou les protéine(s) connue(s) sont calculées. La ou les protéine(s) connue(s) est/sont identifiée(s) en retrouvant les ions de produit dans la pluralité de spectres ioniques de produit mesuré qui correspondent à deux combinaisons ou plus des deux combinaisons ou plus de N ions de produit théoriques qui donnent une probabilité de confiance combinée supérieure ou égale à la probabilité de confiance souhaitée.
EP16840911.8A 2015-08-31 2016-08-19 Identification de protéines en l'absence d'identifications de peptides Withdrawn EP3345003A4 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201562212282P 2015-08-31 2015-08-31
PCT/IB2016/054990 WO2017037563A1 (fr) 2015-08-31 2016-08-19 Identification de protéines en l'absence d'identifications de peptides

Publications (2)

Publication Number Publication Date
EP3345003A1 true EP3345003A1 (fr) 2018-07-11
EP3345003A4 EP3345003A4 (fr) 2019-05-15

Family

ID=58188507

Family Applications (1)

Application Number Title Priority Date Filing Date
EP16840911.8A Withdrawn EP3345003A4 (fr) 2015-08-31 2016-08-19 Identification de protéines en l'absence d'identifications de peptides

Country Status (2)

Country Link
EP (1) EP3345003A4 (fr)
WO (1) WO2017037563A1 (fr)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112534267A (zh) * 2018-08-31 2021-03-19 Dh科技发展私人贸易有限公司 复杂样本中相关化合物的识别和评分
KR20200087700A (ko) * 2019-01-11 2020-07-21 난토믹스, 엘엘씨 질량 분석법에 의한 sdhb 단백질 발현의 평가

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005057208A1 (fr) * 2003-12-03 2005-06-23 Prolexys Pharmaceuticals, Inc. Procede d'identification de peptides et de proteines
US8063359B2 (en) * 2007-10-08 2011-11-22 University Of Central Florida Research Foundation, Inc. Systems and methods for identifying substances contained in a material
EP2427844A4 (fr) * 2009-05-08 2016-11-02 Thermo Finnigan Llc Procédés et systèmes de mise en correspondre de productions et d'ions de précurseur
CA2810473C (fr) * 2010-09-15 2018-06-26 Dh Technologies Development Pte. Ltd. Acquisition independante des donnees de spectre d'ion de produit et concordance de bibliotheque de spectres de reference
US8530831B1 (en) * 2012-03-13 2013-09-10 Wisconsin Alumni Research Foundation Probability-based mass spectrometry data acquisition
JP6400086B2 (ja) * 2013-06-05 2018-10-03 ディーエイチ テクノロジーズ デベロップメント プライベート リミテッド バイオ医薬品タンパク質製品における宿主細胞タンパク質夾雑物の検出のためのswathデータ独立取得テクノロジ

Also Published As

Publication number Publication date
EP3345003A4 (fr) 2019-05-15
WO2017037563A1 (fr) 2017-03-09

Similar Documents

Publication Publication Date Title
US11456164B2 (en) Systems and methods for identifying compounds from MS/MS data without precursor ion information
CN109643633B (zh) 自动化质谱库保留时间校正
CN109526237B (zh) 使用swath方法扩展动态范围
US10163613B2 (en) Deconvolution of mixed spectra
EP3345003A1 (fr) Identification de protéines en l'absence d'identifications de peptides
US20220365046A1 (en) Identification and Scoring of Related Compounds in Complex Samples
US11181511B2 (en) Rapid scoring of LC-MS/MS peptide data
EP3249678B1 (fr) Systèmes et procédés pour regrouper des transitions ms/ms
US10141169B2 (en) Systems and methods for identifying compounds from MS/MS data without precursor ion information
US10825668B2 (en) Library search tolerant to isotopes
US20230005727A1 (en) Method of Mass Analysis - SWATH with Orthogonal Fragmentation Methodology
EP3482211B1 (fr) Analyse itérative de données d'andain en fonction des résultats
US10429364B2 (en) Detecting low level LCMS components by chromatographic reconstruction
US20220392758A1 (en) Threshold-based IDA Exclusion List
WO2023148663A1 (fr) Identification de composé à l'aide d'un spectre de masse

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20180322

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)
A4 Supplementary search report drawn up and despatched

Effective date: 20190411

RIC1 Information provided on ipc code assigned before grant

Ipc: G01N 33/68 20060101AFI20190405BHEP

Ipc: G16B 40/00 20190101ALI20190405BHEP

Ipc: H01J 49/00 20060101ALI20190405BHEP

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20210302