WO2017081454A1 - Method and apparatus for determining a composition of a spectrum - Google Patents

Method and apparatus for determining a composition of a spectrum Download PDF

Info

Publication number
WO2017081454A1
WO2017081454A1 PCT/GB2016/053490 GB2016053490W WO2017081454A1 WO 2017081454 A1 WO2017081454 A1 WO 2017081454A1 GB 2016053490 W GB2016053490 W GB 2016053490W WO 2017081454 A1 WO2017081454 A1 WO 2017081454A1
Authority
WO
WIPO (PCT)
Prior art keywords
spectrum
estimate
determining
detector
peaks
Prior art date
Application number
PCT/GB2016/053490
Other languages
French (fr)
Inventor
Simon Maher
Simon MASKELL
Sarfaraz Uddin Ahmed Hashmi SYED
Stephen Taylor
Original Assignee
The University Of Liverpool
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by The University Of Liverpool filed Critical The University Of Liverpool
Publication of WO2017081454A1 publication Critical patent/WO2017081454A1/en

Links

Classifications

    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01JELECTRIC DISCHARGE TUBES OR DISCHARGE LAMPS
    • H01J49/00Particle spectrometers or separator tubes
    • H01J49/0027Methods for using particle spectrometers
    • H01J49/0036Step by step routines describing the handling of the data generated during a measurement
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/12Classification; Matching
    • G06F2218/14Classification; Matching by matching peak patterns

Definitions

  • spectroscopy is a range over which one or more measurable properties of a physical phenomenon, such as the frequency of sound or electromagnetic radiation, or the mass of specific kinds of particles, can vary.
  • the study of spectra is termed spectroscopy.
  • An example apparatus for mass spectrometry is a quadrupole mass spectrometer (QMS), although it will be realised that other MS apparatus exist.
  • QMS quadrupole mass spectrometer
  • ions from a sample are injected into a quadrupole mass filter to which direct and alternating voltages are applied.
  • Stable ions are able to pass through the filter to reach a detector.
  • a spectral peak at a given mass is output by the detector.
  • An amplitude or height of the peak is proportional to a concentration of ions in the sample at the given mass.
  • problems can arise in spectroscopy when ions of similar mass arrive at the detector.
  • the output of the detector is indicative of two or more spectral peaks at similar mass which at least partly overlap.
  • problems can arise in determining whether a peak at a single mass exists at very low amplitude in relation to noise present in the signal output by the detector.
  • a computer- implemented method of determining a composition of a spectrum recorded at a detector comprising determining a first estimate of at least one characteristic of a spectrum, simulating the spectrum at a detector based on the first estimate, wherein in said simulation an amplitude of one or more peaks in the spectrum is constrained to be positive, comparing the simulated spectrum with a spectrum recorded at the detector, and determining an updated estimate of the spectrum based on the comparison.
  • the spectrum may comprise zero or more peaks.
  • the spectrum comprises one or more peaks and the amplitude of each peak is constrained to be positive.
  • the method may comprise determining an initial estimate at least one initial characteristic of the spectrum.
  • the method may comprise iteratively performing the steps of simulating the spectrum, comparing the simulated spectrum and determining the updated estimate of the spectrum.
  • the method may comprise determining a second estimate of at least one characteristic of the spectrum, simulating the spectra at the detector based on the first and second estimates, selecting one of the first and second estimates based on the comparison between the simulated spectra and the spectrum recorded at the detector.
  • the selecting of one of the first and second estimates may be further based on one or both of a probability of moving from the first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
  • the updated estimate of the spectrum may be based upon one or more statistical rules.
  • the one or more statistical rules may define one or more of a likelihood of adding a peak to the spectrum, a likelihood of removing a peak from the spectrum and a likelihood of modifying one or more attributes of peaks forming the spectrum.
  • a computer-implemented method of determining a composition of a spectrum recorded at a detector comprising providing a first estimate of at least one characteristic of a spectrum, determining second estimate of the at least one characteristic of the spectrum, simulating spectra at a detector corresponding to the first estimate and the second estimate, wherein said simulation includes injecting ions into a simulation of a detection apparatus comprising the detector according to a Monte-Carlo method, and selecting one of the first and second estimates based on the simulation and a spectrum recorded at a detector.
  • the ions may be injected into the detection apparatus randomised in one or both of space and time.
  • the method may comprise determining an initial estimate at least one initial characteristic of the spectrum, and iteratively performing the steps of determining the second estimate, simulating the spectrum, and selecting one of the first and second estimates.
  • the selected estimate may be utilised as the first estimate in a following iteration of the method.
  • the method may comprise comparing the simulated spectra with the spectrum recorded at the detector, and selecting one of the first and second estimates based on the comparison.
  • the selecting is optionally further based on one or both of a probability of moving from the first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
  • an amplitude of the one or more peaks may be constrained to be positive
  • One or both of the first and second estimates may comprises one or more peaks present in the spectrum and an attribute of the one or more peaks.
  • computer software which, when executed by a computer, is arranged to perform a method according to an aspect of the invention.
  • the computer software may be stored on a computer readable medium.
  • the computer software may be tangibly stored on a computer readable medium.
  • a computing apparatus comprising a memory and at least one processor, the memory storing computer executable instructions which, when executed by the at least one processor, perform a method according to an aspect of the invention.
  • Figure 1 shows a method according to an embodiment of the invention
  • Figure 2 is a schematic illustration of a quadrupole mass spectrometer
  • Figure 3 is an illustration of experimental and simulated spectra for hydrogen and helium according to an embodiment of the invention.
  • FIG. 1 illustrates a method 100 according to an embodiment of the invention.
  • the method 100 is a computer-implemented method of determining one or more aspects of a measured spectrum.
  • the spectrum may be that output by a quadrupole mass spectrometer (QMS), although it will be realised that embodiments of the invention are not limited in this respect and that the spectrum may be determined in other ways such as Mass Spectrometry (MS), Nuclear Magnetic Resonance (NMR), Raman Spectroscopy, etc. However, for the purpose of illustration, embodiments will be described in relation to QMS.
  • a QMS 200 is schematically illustrated in Figure 2.
  • the QMS 200 comprises a source of ions 210 which are injected into the QMS 200, as the skilled person will appreciate.
  • the QMS comprises a number of electrodes, which are excited to provide a quadrupole electric field, which functions as a mass filter via application of a suitable combination of variable amplitude direct (U) and variable amplitude alternating (V) voltages from a voltage generator 230.
  • U variable amplitude direct
  • V variable amplitude alternating
  • ions of a given mass are stable (resonant) and others not.
  • Stable ions reach a detector 240 where they form a spectral peak at the given mass (mass spectrum). The peak height is proportional to the concentration of ions in the sample at that mass.
  • step 1 10 comprises determining a set of parameters.
  • Step 110 comprises determining an estimate of the spectrum.
  • the spectrum may comprise one or more peaks, although it will be realised that the spectrum may not comprise any peaks. That is, the spectrum may only comprise noise.
  • the spectrum is parameterised by a vector ⁇ .
  • the vector ⁇ has a dimension of zero or more. In the case of the vector having a dimension of zero the spectrum does not comprise any peaks.
  • the vector ⁇ defines parameters for each peak forming the spectrum. For each peak in the spectrum one or more parameters define the respective peaks.
  • the one or more parameters per peak may comprise one or both of a position and width of each peak.
  • the vector ⁇ may be determined not to include parameters defining an amplitude of each peak, such as a relative or absolute amplitude of the respective peak. In this case, the amplitude of each peak is determined in a subsequent step, as will be explained.
  • the vector ⁇ may define a ratio of mass over charge for each peak. Thus, given ⁇ , a shape of each peak forming the spectrum is defined.
  • the vector ⁇ is a random set of initial parameters ⁇ wherein the subscript 0 is indicative of the initial nature of the parameters i.e. for a zero-th iteration.
  • the initial parameters may be sampled from an initial probability density function (pdf) 3 ⁇ 4(3 ⁇ 4) ⁇ I n an example embodiment, samples from the initial pdf are constrained to consist of one peak. However it will be realised that other numbers of initial peaks may be selected.
  • the position of the initial peak may be determined to be distributed across an extent of the mass over charge range for the spectrum, such that wherein ⁇ is an interval of valid values for the peak position.
  • one of more attributes of each peak are determined. For example, a position of each peak is selected in step 1 10.
  • each peak in the spectrum.
  • the shape of each peak is related to parameters that include a U/V voltage ratio of the detector, as will be appreciated by the skilled person.
  • other attributes of each peak may be selected, such as the width of each peak.
  • the number of peaks and the respective position of each peak may be selected as a respective random, or pseudo-random, value, as will be appreciated.
  • a further set of parameters is determined.
  • the further set of parameters may be referred to as a new set of parameters.
  • the new parameters may represent an updated estimate, or guess, as to a composition of peaks forming a spectrum.
  • the first set of parameters ⁇ may represent a current set of parameters.
  • the new parameters may be defined by a second, updated, vector, ⁇ .
  • the second vector ⁇ may comprise more or less parameters than the first vector ⁇ i.e. more or less peaks may be defined by the new set of parameters.
  • the new set of parameters may be based upon the current set of parameters ⁇ .
  • new set of parameters may be determined according to one or more statistical
  • a peak is removed from the current set of parameters ⁇ with a predetermined probability, which may be 10% although it will be realised that other probabilities may be used.
  • a peak is added to the current set of parameters ⁇ with a predetermined probability, which may be 10% although it will be realised that other probabilities may be used.
  • one or more attributes of the peaks forming the current parameters ⁇ are modified with a predetermined probability, which may be 80% although it will be realised that other probabilities may be used.
  • the total probabilities for all rules equals 100% as will be appreciated. It will be realised that the removal of peaks from the current set of parameters ⁇ is performed with respect to a minimum number of peaks.
  • the minimum number of peaks may be zero such that a peak may not be removed from a spectrum that does not comprise any peaks.
  • the one or more attribute(s) of that added peak are determined in step 120. For example, the position of the new peak is determined.
  • a predetermined modification may be applied.
  • the predetermined modification may be an addition of noise to the current parameters ⁇ .
  • the noise may be zero-mean Gaussian noise having a predetermined variance, such as 1.
  • step 120 a probability of moving to the new set of parameters given the current ⁇ set of parameters is determined. That is, a value of is determined in step
  • the probability may be determined as:
  • P is the predetermined probability of adding, removing or retaining the same number of peaks, such as 10%, as discussed above, and N is the number of peaks prior to the add or remove.
  • spectra are generated. The spectra are generated based on each of the current ⁇ and new sets of parameters In embodiments where each vector
  • a QMS mass spectrum consists of an ion current (y axis) plotted against a mass (m) to charge (q) position (x-axis).
  • ions of a given mass to charge ratio which may be selected by a user, are injected by simulation into a QMS mass filter model.
  • a large number of ions e.g. 100000, although it will be realised that other numbers may be used) are injected at each mass point (x axis position), thereby simulating the action of an ion source in a real QMS instrument.
  • Ions are determined to start from a random spatial position over a disc corresponding to an ion source aperture, and their injection is randomised in time (i.e. occurs at any point of an applied voltage waveform).
  • the random injection of ions in space and time justifies the term 'Monte Carlo ' to describe the nature of the simulation.
  • E may be calculated from a geometry of the instrument by field plotting routines (e.g. finite difference methods).
  • an acceleration a is determined at each point in space, and v (ion velocity) by numerical integration, trajectory s as a function of time by numerical integration of v.
  • v ion velocity
  • trajectory s as a function of time by numerical integration of v.
  • the model provides a method of computation to determine the individual trajectories of large numbers of ions injected from the ion source 210 into the quadrupole mass filter (QMF) 220.
  • a simulated mass scan is produced, which may comprise at least 10 5 ions, injected into the quadrupole model at, at least some, or each point on the mass scale.
  • the model provides an accurate, physics based, forward model of mass filter behaviour and is able to predict the mass spectral peak shapes.
  • Ions from the source 210 are assumed to originate at any point on a circular disk centred on a quadrupole axis and set at right angles to the axis.
  • a quadrupole field starts immediately when the ions leave the source 210.
  • the radii of both source and exit disks may be different and varied freely. In the simulation each ion is generated at a point in the source disk selected at random with no correlation between the points used for successive ions; all positions are equally probable, this corresponds to uniform source illumination.
  • the time of origin in the source is also selected randomly; that is ions enter the quadrupole at random values of the phase of the radio frequency voltage, the alternating voltage, used to operate the filter.
  • the simulation may be considered a Monte-Carlo type simulation.
  • a result of such simulation is illustrated in Figure 3.
  • Ions may be simulated to travel through the filter 220 with constant velocity in the z direction. This is because the fringe fields may be ignored and all the electric fields experienced by ions are at right angles to the z direction; therefore there is no component of force to change the velocity in the z direction.
  • rO filter radius
  • Ions that pass through an exit aperture form the received signal at the detector 240. Ions may be traced through the filter 220 by determining their motion in the hyperbolic field.
  • Their travel may be divided into small time intervals and their motion over each small time interval computed using the local field they experience -the field is function of time because the applied voltage may include an AC component.
  • the motion may be approximated using a fourth order Runge-Kutta algorithm.
  • step 140 the two spectra are each compared against a measured spectrum 150.
  • the measured spectrum 150 is that measured by the QMS detector.
  • the comparison in step 140 is based upon a General Linear Model (GLM).
  • GLM General Linear Model
  • an amplitude of the peaks, A is constrained to be positive.
  • Figure 3 illustrates a simulated spectrum 310 comprising three peaks produced by Monte-Carlo simulation as described above according to an embodiment of the invention and a measured spectrum 310 comprising two peaks measured experimentally.
  • Reference numerals in Figure 3 specifically indicate a peak in the spectrum corresponding to helium. In embodiments of the invention the following is calculated:
  • GLM is defined as:
  • M - 1 is the dimensionality of ⁇ representing the number of peaks that ⁇ hypothesises
  • d is the measured spectral data of length N
  • Equation 3 may be approximated via numerical integration, wherein a sample is taken a predetermined number of times from a student-T pdf defined in Equation (3). The predetermined number of times may be 1000, although other values may be used. A fraction of the samples that satisfy the above-mentioned constraint that all peaks have positive amplitude is then calculated. Note that the bottom most element of A relates to an offset which is allowed to be negative.
  • step 160 it is determined whether to accept the new set of parameters
  • step 160 is based upon the measured spectrum 150 and the current and new sets of parameters.
  • the determination in step 160 is based upon a probability that the measured spectrum 150 corresponds to each of the current and new sets of parameters respectively, and a probability of
  • Equation 6 The determination may be made in embodiment of the invention according to Equation 6 :
  • the new set of parameters ⁇ is accepted, thus becoming the current parameters for a future iteration of the method, if ⁇ is greater than a threshold value.
  • the threshold value may vary between each iteration of the method in some embodiments.
  • the threshold may be a random number drawn from a uniform distribution between zero and one. If ⁇ is less than or equal to the threshold value then the current ⁇ set of parameters is retained in step 170 i.e. the new set of parameters ⁇ is discarded.
  • the new set of parameters, ⁇ are not accepted then the current set of parameters, ⁇ , are logged or stored in step 180. Alternatively, if the new set of parameters, ⁇ , are accepted, then the new set of parameters are logged or stored in step 180.
  • a set of parameters is stored at each iteration of step 180. As indicated by arrow between steps 180 and 120, the new set of parameters is provided to step 120 as an input for a next iteration of the method 100.
  • the diversity of the sets of stored parameters is indicative of uncertainty of the parameters of the measured spectrum, including uncertainty related to whether the measured spectrum contains a peak at a given position or not. This captures uncertainty relating to whether a low amplitude peak is present and whether a peak is present in close proximity to another, such that embodiments of the invention can achieve enhanced detection of low amplitude peaks and improved resolution of closely located peaks.
  • a most likely number of peaks present in the measured spectrum can be identified by identifying a number of peaks that occurs most frequently in the set of stored parameters. To produce a single estimated output, an average may then be determined across the samples with that number of peaks once the list of peak positions has been sorted in order of ascending mass over charge. We can also manipulate the samples to derive an estimate of the amplitude corresponding to each peak (190). It is innovative that, in steps 190 and 150, we enforce the constraint that the amplitude relates to a physical abundance and is therefore positive.
  • Embodiments of the present invention provide a method of determining a composition of a spectrum by iteratively simulating a spectrum and comparing the simulated spectrum with a measured spectrum.
  • the composition of the measured spectrum may be determined with increased accuracy.
  • embodiments of the present invention can be realised in the form of hardware, software or a combination of hardware and software. Any such software may be stored in the form of volatile or non-volatile storage such as, for example, a storage device like a ROM, whether erasable or rewritable or not, or in the form of memory such as, for example, RAM, memory chips, device or integrated circuits or on an optically or magnetically readable medium such as, for example, a CD, DVD, magnetic disk or magnetic tape. It will be appreciated that the storage devices and storage media are embodiments of machine-readable storage that are suitable for storing a program or programs that, when executed, implement embodiments of the present invention.
  • embodiments provide a program comprising code for implementing a system or method as claimed in any preceding claim and a machine readable storage storing such a program. Still further, embodiments of the present invention may be conveyed electronically via any medium such as a communication signal carried over a wired or wireless connection and embodiments suitably encompass the same.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Other Investigation Or Analysis Of Materials By Electrical Means (AREA)

Abstract

Embodiments of the present invention provide a computer-implemented method of determining a composition of a spectrum recorded at a detector, comprising determining a first estimate of at least one characteristic of a spectrum, simulating the spectrum at a detector based on the first estimate, wherein in said simulation the spectrum comprises one or more peaks and the amplitude of each peak is constrained to be positive, comparing the simulated spectrum with a spectrum recorded at the detector, and determining an updated estimate of the spectrum based on the comparison.

Description

Method and Apparatus for Determining a Composition of a Spectrum
Background A spectrum is a range over which one or more measurable properties of a physical phenomenon, such as the frequency of sound or electromagnetic radiation, or the mass of specific kinds of particles, can vary. The study of spectra is termed spectroscopy. Various spectroscopy techniques exist which include, Mass Spectrometry (MS), Nuclear Magnetic Resonance (NMR), Raman Spectroscopy, etc.
An example apparatus for mass spectrometry is a quadrupole mass spectrometer (QMS), although it will be realised that other MS apparatus exist. In the QMS, ions from a sample are injected into a quadrupole mass filter to which direct and alternating voltages are applied. Stable ions are able to pass through the filter to reach a detector. A spectral peak at a given mass is output by the detector. An amplitude or height of the peak is proportional to a concentration of ions in the sample at the given mass.
Problems can arise in spectroscopy when ions of similar mass arrive at the detector. In this case the output of the detector is indicative of two or more spectral peaks at similar mass which at least partly overlap. Alternatively or additionally, problems can arise in determining whether a peak at a single mass exists at very low amplitude in relation to noise present in the signal output by the detector. Although improvements have been made in producing instruments with very low noise and reducing overlap between peaks at a given separation, these instruments tend to have a large size and/or high cost.
It is an object of embodiments of the invention to at least mitigate one or more of the problems of the prior art.
Statements of Invention
According to an aspect of the present invention there is provided methods and apparatus as set forth in the appended claims. According to an aspect of the present invention there is provided a computer- implemented method of determining a composition of a spectrum recorded at a detector, comprising determining a first estimate of at least one characteristic of a spectrum, simulating the spectrum at a detector based on the first estimate, wherein in said simulation an amplitude of one or more peaks in the spectrum is constrained to be positive, comparing the simulated spectrum with a spectrum recorded at the detector, and determining an updated estimate of the spectrum based on the comparison.
The spectrum may comprise zero or more peaks. Optionally the spectrum comprises one or more peaks and the amplitude of each peak is constrained to be positive.
The method may comprise determining an initial estimate at least one initial characteristic of the spectrum. The method may comprise iteratively performing the steps of simulating the spectrum, comparing the simulated spectrum and determining the updated estimate of the spectrum.
The method may comprise determining a second estimate of at least one characteristic of the spectrum, simulating the spectra at the detector based on the first and second estimates, selecting one of the first and second estimates based on the comparison between the simulated spectra and the spectrum recorded at the detector.
The selecting of one of the first and second estimates may be further based on one or both of a probability of moving from the first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
The updated estimate of the spectrum may be based upon one or more statistical rules. The one or more statistical rules may define one or more of a likelihood of adding a peak to the spectrum, a likelihood of removing a peak from the spectrum and a likelihood of modifying one or more attributes of peaks forming the spectrum. A computer-implemented method of determining a composition of a spectrum recorded at a detector, comprising providing a first estimate of at least one characteristic of a spectrum, determining second estimate of the at least one characteristic of the spectrum, simulating spectra at a detector corresponding to the first estimate and the second estimate, wherein said simulation includes injecting ions into a simulation of a detection apparatus comprising the detector according to a Monte-Carlo method, and selecting one of the first and second estimates based on the simulation and a spectrum recorded at a detector. In said Monte-Carlo method the ions may be injected into the detection apparatus randomised in one or both of space and time.
The method may comprise determining an initial estimate at least one initial characteristic of the spectrum, and iteratively performing the steps of determining the second estimate, simulating the spectrum, and selecting one of the first and second estimates.
The selected estimate may be utilised as the first estimate in a following iteration of the method.
The method may comprise comparing the simulated spectra with the spectrum recorded at the detector, and selecting one of the first and second estimates based on the comparison. The selecting is optionally further based on one or both of a probability of moving from the first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
In said simulation an amplitude of the one or more peaks may be constrained to be positive;
One or both of the first and second estimates may comprises one or more peaks present in the spectrum and an attribute of the one or more peaks. According to another aspect of the present invention there is provided computer software which, when executed by a computer, is arranged to perform a method according to an aspect of the invention. The computer software may be stored on a computer readable medium. The computer software may be tangibly stored on a computer readable medium.
A computing apparatus comprising a memory and at least one processor, the memory storing computer executable instructions which, when executed by the at least one processor, perform a method according to an aspect of the invention.
Brief Description of the Drawings
Embodiments of the invention will now be described by way of example only, with reference to the accompanying figures, in which:
Figure 1 shows a method according to an embodiment of the invention;
Figure 2 is a schematic illustration of a quadrupole mass spectrometer; and
Figure 3 is an illustration of experimental and simulated spectra for hydrogen and helium according to an embodiment of the invention.
Detailed Description of Embodiments of the Invention
Figure 1 illustrates a method 100 according to an embodiment of the invention. The method 100 is a computer-implemented method of determining one or more aspects of a measured spectrum. The spectrum may be that output by a quadrupole mass spectrometer (QMS), although it will be realised that embodiments of the invention are not limited in this respect and that the spectrum may be determined in other ways such as Mass Spectrometry (MS), Nuclear Magnetic Resonance (NMR), Raman Spectroscopy, etc. However, for the purpose of illustration, embodiments will be described in relation to QMS. A QMS 200 is schematically illustrated in Figure 2. The QMS 200 comprises a source of ions 210 which are injected into the QMS 200, as the skilled person will appreciate. The QMS comprises a number of electrodes, which are excited to provide a quadrupole electric field, which functions as a mass filter via application of a suitable combination of variable amplitude direct (U) and variable amplitude alternating (V) voltages from a voltage generator 230. For a given combination of U, V, frequency and electrode geometry ions of a given mass are stable (resonant) and others not. Stable ions reach a detector 240 where they form a spectral peak at the given mass (mass spectrum). The peak height is proportional to the concentration of ions in the sample at that mass.
Returning to Figure 1, step 1 10 comprises determining a set of parameters. Step 110 comprises determining an estimate of the spectrum. The spectrum may comprise one or more peaks, although it will be realised that the spectrum may not comprise any peaks. That is, the spectrum may only comprise noise. The spectrum is parameterised by a vector Θ. The vector Θ has a dimension of zero or more. In the case of the vector having a dimension of zero the spectrum does not comprise any peaks. The vector Θ defines parameters for each peak forming the spectrum. For each peak in the spectrum one or more parameters define the respective peaks. The one or more parameters per peak may comprise one or both of a position and width of each peak. The vector Θ may be determined not to include parameters defining an amplitude of each peak, such as a relative or absolute amplitude of the respective peak. In this case, the amplitude of each peak is determined in a subsequent step, as will be explained. In embodiments related to mass spectrometry, such as the exemplary explained embodiment, the vector Θ may define a ratio of mass over charge for each peak. Thus, given Θ, a shape of each peak forming the spectrum is defined.
In some embodiments of step 110, the vector Θ is a random set of initial parameters θο wherein the subscript 0 is indicative of the initial nature of the parameters i.e. for a zero-th iteration. The initial parameters may be sampled from an initial probability density function (pdf) ¾(¾) · In an example embodiment, samples from the initial pdf are constrained to consist of one peak. However it will be realised that other numbers of initial peaks may be selected. The position of the initial peak may be determined to be distributed across an extent of the mass over charge range for the spectrum, such that wherein Δ is an interval of valid values for the peak position.
Figure imgf000007_0001
In step 110 one of more attributes of each peak are determined. For example, a position of each peak is selected in step 1 10. As the described embodiment relates to QMS, it is only necessary to select the position of each peak in the spectrum. The shape of each peak is related to parameters that include a U/V voltage ratio of the detector, as will be appreciated by the skilled person. However, in other non-QMS embodiments, other attributes of each peak may be selected, such as the width of each peak. The number of peaks and the respective position of each peak may be selected as a respective random, or pseudo-random, value, as will be appreciated.
In step 120 a further set of parameters is determined. The further set of parameters may be referred to as a new set of parameters. The new parameters may represent an updated estimate, or guess, as to a composition of peaks forming a spectrum. In this sense, the first set of parameters Θ may represent a current set of parameters. The new parameters may be defined by a second, updated, vector, θ . The second vector Θ may comprise more or less parameters than the first vector Θ i.e. more or less peaks may be defined by the new set of parameters.
The new set of parameters may be based upon the current set of parameters Θ. The
Figure imgf000007_0002
new set of parameters may be determined according to one or more statistical
rules. In one embodiment, a peak is removed from the current set of parameters Θ with a predetermined probability, which may be 10% although it will be realised that other probabilities may be used. In one embodiment, a peak is added to the current set of parameters Θ with a predetermined probability, which may be 10% although it will be realised that other probabilities may be used. Furthermore, in one embodiment, one or more attributes of the peaks forming the current parameters Θ are modified with a predetermined probability, which may be 80% although it will be realised that other probabilities may be used. The total probabilities for all rules equals 100% as will be appreciated. It will be realised that the removal of peaks from the current set of parameters Θ is performed with respect to a minimum number of peaks. The minimum number of peaks may be zero such that a peak may not be removed from a spectrum that does not comprise any peaks. In the case of a peak being added, the one or more attribute(s) of that added peak are determined in step 120. For example, the position of the new peak is determined. Similarly, where the attributes of the current peaks are modified a predetermined modification may be applied. The predetermined modification may be an addition of noise to the current parameters Θ. The noise may be zero-mean Gaussian noise having a predetermined variance, such as 1. Thus, following step 120 the new set of parameters θ is determined.
In step 120 a probability of moving to the new set of parameters
Figure imgf000008_0004
given the current Θ set of parameters is determined. That is, a value of is determined in step
Figure imgf000008_0003
120. The probability may be determined as:
Figure imgf000008_0002
where p is the probability, P is the predetermined probability of adding, removing or retaining the same number of peaks, such as 10%, as discussed above, and N is the number of peaks prior to the add or remove.
Similarly, a reverse probability of moving from the new set of parameters to the
Figure imgf000008_0006
old set of parameters is determined.
Figure imgf000008_0001
In step 130 spectra are generated. The spectra are generated based on each of the current Θ and new sets of parameters In embodiments where each vector
Figure imgf000008_0005
comprises elements each defining a respective peak, a single peak is generated for the rth element of each of Θ and where i may be between 1 and the number of peaks
Figure imgf000008_0008
defined by each set of parameters I. The single peak spectrum for the rth element of the new set of parameters is In step 130 an output of Monte-Carlo simulation is
Figure imgf000008_0007
used to determine one or more templates for predicting a shape of each peak given at least some of the parameters. The Monte-Carlo simulation may be performed offline. As will be appreciated, a QMS mass spectrum consists of an ion current (y axis) plotted against a mass (m) to charge (q) position (x-axis). In the simulation of step 140, ions of a given mass to charge ratio, which may be selected by a user, are injected by simulation into a QMS mass filter model. A large number of ions (e.g. 100000, although it will be realised that other numbers may be used) are injected at each mass point (x axis position), thereby simulating the action of an ion source in a real QMS instrument. Ions are determined to start from a random spatial position over a disc corresponding to an ion source aperture, and their injection is randomised in time (i.e. occurs at any point of an applied voltage waveform). The random injection of ions in space and time justifies the term 'Monte Carlo ' to describe the nature of the simulation. For each individual ion, the force (F) on the ion is calculated from a knowledge of the electric field (E) at each point according to F = q E where q = charge on the ion. E may be calculated from a geometry of the instrument by field plotting routines (e.g. finite difference methods). From F/m an acceleration a is determined at each point in space, and v (ion velocity) by numerical integration, trajectory s as a function of time by numerical integration of v. Hence, for each individual ion its trajectory in space and time is calculated and it may be determined whether said ion is stable (and forms part of the mass spectrum) or unstable (and is rejected from the QMS). Repeating for each of the ions (e.g. 100000 ions) allows a mass spectrum to be simulated for a given set of instrument conditions, geometry, electrode size and spacings, voltage excitation on electrodes and input ion energy.
The model provides a method of computation to determine the individual trajectories of large numbers of ions injected from the ion source 210 into the quadrupole mass filter (QMF) 220. A simulated mass scan is produced, which may comprise at least 105 ions, injected into the quadrupole model at, at least some, or each point on the mass scale. The model provides an accurate, physics based, forward model of mass filter behaviour and is able to predict the mass spectral peak shapes.
Ions from the source 210 are assumed to originate at any point on a circular disk centred on a quadrupole axis and set at right angles to the axis. A quadrupole field starts immediately when the ions leave the source 210. The radii of both source and exit disks may be different and varied freely. In the simulation each ion is generated at a point in the source disk selected at random with no correlation between the points used for successive ions; all positions are equally probable, this corresponds to uniform source illumination. The time of origin in the source is also selected randomly; that is ions enter the quadrupole at random values of the phase of the radio frequency voltage, the alternating voltage, used to operate the filter. Because of the random nature of the ion injection in space and time, the simulation may be considered a Monte-Carlo type simulation. A result of such simulation is illustrated in Figure 3. Ions may be simulated to travel through the filter 220 with constant velocity in the z direction. This is because the fringe fields may be ignored and all the electric fields experienced by ions are at right angles to the z direction; therefore there is no component of force to change the velocity in the z direction. At any time when the magnitude of either the x or y coordinate of an ion exceeds a filter radius, rO, it is rejected. Ions that pass through an exit aperture form the received signal at the detector 240. Ions may be traced through the filter 220 by determining their motion in the hyperbolic field. Their travel may be divided into small time intervals and their motion over each small time interval computed using the local field they experience -the field is function of time because the applied voltage may include an AC component. In some embodiments the motion may be approximated using a fourth order Runge-Kutta algorithm.
In step 140 the two spectra are each compared against a measured spectrum 150. The measured spectrum 150 is that measured by the QMS detector. Thus the measured spectrum 150 is provided as an input to step 140. The comparison in step 140 is based upon a General Linear Model (GLM). However, in embodiments of the invention, an amplitude of the peaks, A, is constrained to be positive.
Figure 3 illustrates a simulated spectrum 310 comprising three peaks produced by Monte-Carlo simulation as described above according to an embodiment of the invention and a measured spectrum 310 comprising two peaks measured experimentally. Reference numerals in Figure 3 specifically indicate a peak in the spectrum corresponding to helium. In embodiments of the invention the following is calculated:
Figure imgf000011_0002
Where the GLM is defined as:
Figure imgf000011_0001
and
M - 1 is the dimensionality of Θ representing the number of peaks that Θ hypothesises;
d is the measured spectral data of length N;
is a matrix formed as where 1 is a vector of the same size as
Figure imgf000011_0003
d with all the elements equal to unity and used to model any offset present, although the skilled person will realise other values can be used; and γο and δ are hyper-parameters, which may be assumed to be equal to unity. The integral in Equation 3 may be approximated via numerical integration, wherein a sample is taken a predetermined number of times from a student-T pdf defined in Equation (3). The predetermined number of times may be 1000, although other values may be used. A fraction of the samples that satisfy the above-mentioned constraint that all peaks have positive amplitude is then calculated. Note that the bottom most element of A relates to an offset which is allowed to be negative.
In step 160 it is determined whether to accept the new set of parameters
Figure imgf000012_0003
determination in step 160 is based upon the measured spectrum 150 and the current and new sets of parameters. In particular, the determination in step 160 is based upon a probability that the measured spectrum 150 corresponds to each of the current and new sets of parameters respectively, and a probability of
Figure imgf000012_0001
moving to the new set of parameters given the current Θ set of parameters, and
Figure imgf000012_0002
vice-versa. The determination may be made in embodiment of the invention according to Equation 6 :
Figure imgf000012_0004
Eqn. 6 The new set of parameters θ is accepted, thus becoming the current parameters for a future iteration of the method, if η is greater than a threshold value. The threshold value may vary between each iteration of the method in some embodiments. The threshold may be a random number drawn from a uniform distribution between zero and one. If η is less than or equal to the threshold value then the current Θ set of parameters is retained in step 170 i.e. the new set of parameters θ is discarded.
If the new set of parameters, Θ, are not accepted then the current set of parameters, Θ, are logged or stored in step 180. Alternatively, if the new set of parameters, Θ, are accepted, then the new set of parameters are logged or stored in step 180. Thus, over repeated iterations of the method 100, a set of parameters is stored at each iteration of step 180. As indicated by arrow between steps 180 and 120, the new set of parameters is provided to step 120 as an input for a next iteration of the method 100. The diversity of the sets of stored parameters is indicative of uncertainty of the parameters of the measured spectrum, including uncertainty related to whether the measured spectrum contains a peak at a given position or not. This captures uncertainty relating to whether a low amplitude peak is present and whether a peak is present in close proximity to another, such that embodiments of the invention can achieve enhanced detection of low amplitude peaks and improved resolution of closely located peaks.
A most likely number of peaks present in the measured spectrum can be identified by identifying a number of peaks that occurs most frequently in the set of stored parameters. To produce a single estimated output, an average may then be determined across the samples with that number of peaks once the list of peak positions has been sorted in order of ascending mass over charge. We can also manipulate the samples to derive an estimate of the amplitude corresponding to each peak (190). It is innovative that, in steps 190 and 150, we enforce the constraint that the amplitude relates to a physical abundance and is therefore positive.
Embodiments of the present invention provide a method of determining a composition of a spectrum by iteratively simulating a spectrum and comparing the simulated spectrum with a measured spectrum. Advantageously the composition of the measured spectrum may be determined with increased accuracy.
It will be appreciated that embodiments of the present invention can be realised in the form of hardware, software or a combination of hardware and software. Any such software may be stored in the form of volatile or non-volatile storage such as, for example, a storage device like a ROM, whether erasable or rewritable or not, or in the form of memory such as, for example, RAM, memory chips, device or integrated circuits or on an optically or magnetically readable medium such as, for example, a CD, DVD, magnetic disk or magnetic tape. It will be appreciated that the storage devices and storage media are embodiments of machine-readable storage that are suitable for storing a program or programs that, when executed, implement embodiments of the present invention. Accordingly, embodiments provide a program comprising code for implementing a system or method as claimed in any preceding claim and a machine readable storage storing such a program. Still further, embodiments of the present invention may be conveyed electronically via any medium such as a communication signal carried over a wired or wireless connection and embodiments suitably encompass the same.
All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and/or all of the steps of any method or process so disclosed, may be combined in any combination, except combinations where at least some of such features and/or steps are mutually exclusive.
Each feature disclosed in this specification (including any accompanying claims, abstract and drawings), may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise. Thus, unless expressly stated otherwise, each feature disclosed is one example only of a generic series of equivalent or similar features.
The invention is not restricted to the details of any foregoing embodiments. The invention extends to any novel one, or any novel combination, of the features disclosed in this specification (including any accompanying claims, abstract and drawings), or to any novel one, or any novel combination, of the steps of any method or process so disclosed. The claims should not be construed to cover merely the foregoing embodiments, but also any embodiments which fall within the scope of the claims.

Claims

CLAIMS 1. A computer-implemented method of determining a composition of a spectrum recorded at a detector, comprising: determining a first estimate of at least one characteristic of a spectrum; simulating the spectrum at a detector based on the first estimate, wherein in said simulation the spectrum comprises one or more peaks and the amplitude of each peak is constrained to be positive; comparing the simulated spectrum with a spectrum recorded at the detector; and determining an updated estimate of the spectrum based on the comparison.
2. The method of claim 1, comprising determining an initial estimate at least one initial characteristic of the spectrum, and iteratively performing the steps of simulating the spectrum, comparing the simulated spectrum and determining the updated estimate of the spectrum.
3. The method of claim 1 or 2, comprising: determining a second estimate of at least one characteristic of the spectrum; simulating the spectra at the detector based on the first and second estimates; selecting one of the first and second estimates based on the comparison between the simulated spectra and the spectrum recorded at the detector.
4. The method of claim 3, wherein the selecting of one of the first and second estimates is further based on one or both of a probability of moving from the first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
5. The method of any preceding claim, wherein determining the updated estimate of the spectrum is based upon one or more statistical rules.
6. The method of claim 5, wherein the one or more statistical rules define one or more of a likelihood of adding a peak to the spectrum, a likelihood of removing a peak from the spectrum and a likelihood of modifying one or more attributes of peaks forming the spectrum.
7. A computer-implemented method of determining a composition of a spectrum recorded at a detector, comprising: providing a first estimate of at least one characteristic of a spectrum; determining second estimate of the at least one characteristic of the spectrum; simulating spectra at a detector corresponding to the first estimate and the second estimate, wherein said simulation includes injecting ions into a simulation of a detection apparatus comprising the detector according to a Monte-Carlo method; and selecting one of the first and second estimates based on the simulation and a spectrum recorded at a detector.
8. The method of claim 7, wherein in said Monte-Carlo method the ions are injected into the detection apparatus randomised in one or both of space and time.
9. The method of claim 7 or 8, comprising determining an initial estimate at least one initial characteristic of the spectrum, and iteratively performing the steps of determining the second estimate, simulating the spectrum, and selecting one of the first and second estimates.
10. The method of claim 9, wherein the selected estimate is utilised as the first estimate in a following iteration of the method.
11. The method of any of claims 7 to 10, comprising: comparing the simulated spectra with the spectrum recorded at the detector; and selecting one of the first and second estimates based on the comparison.
12. The method of claim 11, wherein the selecting is further based on one or both of a probability of moving from the first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
13. The method of any of claims 7 to 12, wherein in said simulation an amplitude of the one or more peaks is constrained to be positive;
14. The method of any of claims 7 to 13, wherein one or both of the first and second estimates comprises one or more peaks present in the spectrum and an attribute of the one or more peaks.
15. Computer software which, when executed by a computer, is arranged to perform a method according to any preceding claim.
16. The computer software of claim 15 stored on a computer readable medium.
17. A computing apparatus comprising a memory and at least one processor, the memory storing computer executable instructions which, when executed by the at least one processor, perform a method according to any of claims 1 to 6 or 7 to 14.
PCT/GB2016/053490 2015-11-09 2016-11-08 Method and apparatus for determining a composition of a spectrum WO2017081454A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB1519736.1 2015-11-09
GBGB1519736.1A GB201519736D0 (en) 2015-11-09 2015-11-09 Method and apparatus for determining a composition of a spectrum

Publications (1)

Publication Number Publication Date
WO2017081454A1 true WO2017081454A1 (en) 2017-05-18

Family

ID=55132506

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2016/053490 WO2017081454A1 (en) 2015-11-09 2016-11-08 Method and apparatus for determining a composition of a spectrum

Country Status (2)

Country Link
GB (1) GB201519736D0 (en)
WO (1) WO2017081454A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006050226A2 (en) * 2004-10-28 2006-05-11 Cerno Bioscience Llc Qualitative and quantitative mass spectral analysis
WO2011128702A1 (en) * 2010-04-15 2011-10-20 Micromass Uk Limited Method and system of identifying a sample by analyising a mass spectrum by the use of a bayesian inference technique
US20140358451A1 (en) * 2013-06-04 2014-12-04 Arizona Board Of Regents On Behalf Of Arizona State University Fractional Abundance Estimation from Electrospray Ionization Time-of-Flight Mass Spectrum

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006050226A2 (en) * 2004-10-28 2006-05-11 Cerno Bioscience Llc Qualitative and quantitative mass spectral analysis
WO2011128702A1 (en) * 2010-04-15 2011-10-20 Micromass Uk Limited Method and system of identifying a sample by analyising a mass spectrum by the use of a bayesian inference technique
US20140358451A1 (en) * 2013-06-04 2014-12-04 Arizona Board Of Regents On Behalf Of Arizona State University Fractional Abundance Estimation from Electrospray Ionization Time-of-Flight Mass Spectrum

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LI ET AL.: "Accurate Identification of Mass Peaks for Tandem Mass Spectra Using MCMC Model", TSINGHUA SCIENCE AND TECHNOLOGY, vol. 20, no. 5, October 2015 (2015-10-01), pages 453 - 459, XP002767171 *
S. U. A. H. SYED ET AL: "Quadrupole mass filter operation under the influence of magnetic field", JOURNAL OF MASS SPECTROMETRY., vol. 48, no. 12, 1 December 2013 (2013-12-01), GB, pages 1325 - 1339, XP055344700, ISSN: 1076-5174, DOI: 10.1002/jms.3293 *

Also Published As

Publication number Publication date
GB201519736D0 (en) 2015-12-23

Similar Documents

Publication Publication Date Title
CMS collaboration Identification of b-quark jets with the CMS experiment
Chatrchyan et al. Identification of b-quark jets with the CMS experiment
Heinrich et al. NLO QCD corrections to $${W}^{+}{W}^{-} b\overline {b} $$ production with leptonic decays in the light of top quark mass and asymmetry measurements
Nachman et al. Significance variables
Carbone Signals of the Giant Pairing Vibration in 14 C and 15 C nuclei populated by (18 O, 16 O) two-neutron transfer reactions
Beaujean et al. P-values for model evaluation
Leigh et al. $\nu $-flows: Conditional neutrino regression
Grote Pattern recognition in high-energy physics
Haley et al. Processing APT spectral backgrounds for improved quantification
CN101488456B (en) Etching amount calculating method and etching amount calculating apparatus
US20130197861A1 (en) Method for spectrometric analysis and related device
US7072772B2 (en) Method and apparatus for modeling mass spectrometer lineshapes
JP6318722B2 (en) Environmental load molecule generation source evaluation method, environmental load molecule generation source evaluation system, and computer program
WO2017081454A1 (en) Method and apparatus for determining a composition of a spectrum
Maxeiner et al. Simulation of ion beam scattering in a gas stripper
Jusko et al. Cold CAS ion trap–22 pole trap with ring electrodes for astrochemistry
Cheng et al. Measuring invisible particle masses using a single short decay chain
Lamparth et al. Gaussian processes and bayesian optimization for high precision experiments
Zinser Double differential cross section for Drell-Yan production of high-mass $ e^+ e^-$-pairs in $ pp $ collisions at $\sqrt {s}= 8$ TeV with the ATLAS experiment
EP4105672A1 (en) Systems and methods for provisioning training data to enable neural networks to analyze signals in nmr measurements
Peters Top Quark Pair Production Cross Section at the Tevatron
Ibrahimi et al. Accelerated time-of-flight mass spectrometry
Petrović et al. Expert System for threshold spectra analysis of nitrogen molecules
US20140358451A1 (en) Fractional Abundance Estimation from Electrospray Ionization Time-of-Flight Mass Spectrum
Gosz et al. Application of deep neural network in finding of repulsive part of molecular potential based on dispersed emission spectra

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16797601

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16797601

Country of ref document: EP

Kind code of ref document: A1