WO2017081454A1  Method and apparatus for determining a composition of a spectrum  Google Patents
Method and apparatus for determining a composition of a spectrum Download PDFInfo
 Publication number
 WO2017081454A1 WO2017081454A1 PCT/GB2016/053490 GB2016053490W WO2017081454A1 WO 2017081454 A1 WO2017081454 A1 WO 2017081454A1 GB 2016053490 W GB2016053490 W GB 2016053490W WO 2017081454 A1 WO2017081454 A1 WO 2017081454A1
 Authority
 WO
 WIPO (PCT)
 Prior art keywords
 spectrum
 estimate
 determining
 detector
 peaks
 Prior art date
Links
Classifications

 H—ELECTRICITY
 H01—ELECTRIC ELEMENTS
 H01J—ELECTRIC DISCHARGE TUBES OR DISCHARGE LAMPS
 H01J49/00—Particle spectrometers or separator tubes
 H01J49/0027—Methods for using particle spectrometers
 H01J49/0036—Step by step routines describing the handling of the data generated during a measurement

 G—PHYSICS
 G06—COMPUTING; CALCULATING OR COUNTING
 G06F—ELECTRIC DIGITAL DATA PROCESSING
 G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
 G06F2218/12—Classification; Matching
 G06F2218/14—Classification; Matching by matching peak patterns
Definitions
 spectroscopy is a range over which one or more measurable properties of a physical phenomenon, such as the frequency of sound or electromagnetic radiation, or the mass of specific kinds of particles, can vary.
 the study of spectra is termed spectroscopy.
 An example apparatus for mass spectrometry is a quadrupole mass spectrometer (QMS), although it will be realised that other MS apparatus exist.
 QMS quadrupole mass spectrometer
 ions from a sample are injected into a quadrupole mass filter to which direct and alternating voltages are applied.
 Stable ions are able to pass through the filter to reach a detector.
 a spectral peak at a given mass is output by the detector.
 An amplitude or height of the peak is proportional to a concentration of ions in the sample at the given mass.
 problems can arise in spectroscopy when ions of similar mass arrive at the detector.
 the output of the detector is indicative of two or more spectral peaks at similar mass which at least partly overlap.
 problems can arise in determining whether a peak at a single mass exists at very low amplitude in relation to noise present in the signal output by the detector.
 a computer implemented method of determining a composition of a spectrum recorded at a detector comprising determining a first estimate of at least one characteristic of a spectrum, simulating the spectrum at a detector based on the first estimate, wherein in said simulation an amplitude of one or more peaks in the spectrum is constrained to be positive, comparing the simulated spectrum with a spectrum recorded at the detector, and determining an updated estimate of the spectrum based on the comparison.
 the spectrum may comprise zero or more peaks.
 the spectrum comprises one or more peaks and the amplitude of each peak is constrained to be positive.
 the method may comprise determining an initial estimate at least one initial characteristic of the spectrum.
 the method may comprise iteratively performing the steps of simulating the spectrum, comparing the simulated spectrum and determining the updated estimate of the spectrum.
 the method may comprise determining a second estimate of at least one characteristic of the spectrum, simulating the spectra at the detector based on the first and second estimates, selecting one of the first and second estimates based on the comparison between the simulated spectra and the spectrum recorded at the detector.
 the selecting of one of the first and second estimates may be further based on one or both of a probability of moving from the first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
 the updated estimate of the spectrum may be based upon one or more statistical rules.
 the one or more statistical rules may define one or more of a likelihood of adding a peak to the spectrum, a likelihood of removing a peak from the spectrum and a likelihood of modifying one or more attributes of peaks forming the spectrum.
 a computerimplemented method of determining a composition of a spectrum recorded at a detector comprising providing a first estimate of at least one characteristic of a spectrum, determining second estimate of the at least one characteristic of the spectrum, simulating spectra at a detector corresponding to the first estimate and the second estimate, wherein said simulation includes injecting ions into a simulation of a detection apparatus comprising the detector according to a MonteCarlo method, and selecting one of the first and second estimates based on the simulation and a spectrum recorded at a detector.
 the ions may be injected into the detection apparatus randomised in one or both of space and time.
 the method may comprise determining an initial estimate at least one initial characteristic of the spectrum, and iteratively performing the steps of determining the second estimate, simulating the spectrum, and selecting one of the first and second estimates.
 the selected estimate may be utilised as the first estimate in a following iteration of the method.
 the method may comprise comparing the simulated spectra with the spectrum recorded at the detector, and selecting one of the first and second estimates based on the comparison.
 the selecting is optionally further based on one or both of a probability of moving from the first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
 an amplitude of the one or more peaks may be constrained to be positive
 One or both of the first and second estimates may comprises one or more peaks present in the spectrum and an attribute of the one or more peaks.
 computer software which, when executed by a computer, is arranged to perform a method according to an aspect of the invention.
 the computer software may be stored on a computer readable medium.
 the computer software may be tangibly stored on a computer readable medium.
 a computing apparatus comprising a memory and at least one processor, the memory storing computer executable instructions which, when executed by the at least one processor, perform a method according to an aspect of the invention.
 Figure 1 shows a method according to an embodiment of the invention
 Figure 2 is a schematic illustration of a quadrupole mass spectrometer
 Figure 3 is an illustration of experimental and simulated spectra for hydrogen and helium according to an embodiment of the invention.
 FIG. 1 illustrates a method 100 according to an embodiment of the invention.
 the method 100 is a computerimplemented method of determining one or more aspects of a measured spectrum.
 the spectrum may be that output by a quadrupole mass spectrometer (QMS), although it will be realised that embodiments of the invention are not limited in this respect and that the spectrum may be determined in other ways such as Mass Spectrometry (MS), Nuclear Magnetic Resonance (NMR), Raman Spectroscopy, etc. However, for the purpose of illustration, embodiments will be described in relation to QMS.
 a QMS 200 is schematically illustrated in Figure 2.
 the QMS 200 comprises a source of ions 210 which are injected into the QMS 200, as the skilled person will appreciate.
 the QMS comprises a number of electrodes, which are excited to provide a quadrupole electric field, which functions as a mass filter via application of a suitable combination of variable amplitude direct (U) and variable amplitude alternating (V) voltages from a voltage generator 230.
 U variable amplitude direct
 V variable amplitude alternating
 ions of a given mass are stable (resonant) and others not.
 Stable ions reach a detector 240 where they form a spectral peak at the given mass (mass spectrum). The peak height is proportional to the concentration of ions in the sample at that mass.
 step 1 10 comprises determining a set of parameters.
 Step 110 comprises determining an estimate of the spectrum.
 the spectrum may comprise one or more peaks, although it will be realised that the spectrum may not comprise any peaks. That is, the spectrum may only comprise noise.
 the spectrum is parameterised by a vector ⁇ .
 the vector ⁇ has a dimension of zero or more. In the case of the vector having a dimension of zero the spectrum does not comprise any peaks.
 the vector ⁇ defines parameters for each peak forming the spectrum. For each peak in the spectrum one or more parameters define the respective peaks.
 the one or more parameters per peak may comprise one or both of a position and width of each peak.
 the vector ⁇ may be determined not to include parameters defining an amplitude of each peak, such as a relative or absolute amplitude of the respective peak. In this case, the amplitude of each peak is determined in a subsequent step, as will be explained.
 the vector ⁇ may define a ratio of mass over charge for each peak. Thus, given ⁇ , a shape of each peak forming the spectrum is defined.
 the vector ⁇ is a random set of initial parameters ⁇ wherein the subscript 0 is indicative of the initial nature of the parameters i.e. for a zeroth iteration.
 the initial parameters may be sampled from an initial probability density function (pdf) 3 ⁇ 4(3 ⁇ 4) ⁇ I n an example embodiment, samples from the initial pdf are constrained to consist of one peak. However it will be realised that other numbers of initial peaks may be selected.
 the position of the initial peak may be determined to be distributed across an extent of the mass over charge range for the spectrum, such that wherein ⁇ is an interval of valid values for the peak position.
 one of more attributes of each peak are determined. For example, a position of each peak is selected in step 1 10.
 each peak in the spectrum.
 the shape of each peak is related to parameters that include a U/V voltage ratio of the detector, as will be appreciated by the skilled person.
 other attributes of each peak may be selected, such as the width of each peak.
 the number of peaks and the respective position of each peak may be selected as a respective random, or pseudorandom, value, as will be appreciated.
 a further set of parameters is determined.
 the further set of parameters may be referred to as a new set of parameters.
 the new parameters may represent an updated estimate, or guess, as to a composition of peaks forming a spectrum.
 the first set of parameters ⁇ may represent a current set of parameters.
 the new parameters may be defined by a second, updated, vector, ⁇ .
 the second vector ⁇ may comprise more or less parameters than the first vector ⁇ i.e. more or less peaks may be defined by the new set of parameters.
 the new set of parameters may be based upon the current set of parameters ⁇ .
 new set of parameters may be determined according to one or more statistical
 a peak is removed from the current set of parameters ⁇ with a predetermined probability, which may be 10% although it will be realised that other probabilities may be used.
 a peak is added to the current set of parameters ⁇ with a predetermined probability, which may be 10% although it will be realised that other probabilities may be used.
 one or more attributes of the peaks forming the current parameters ⁇ are modified with a predetermined probability, which may be 80% although it will be realised that other probabilities may be used.
 the total probabilities for all rules equals 100% as will be appreciated. It will be realised that the removal of peaks from the current set of parameters ⁇ is performed with respect to a minimum number of peaks.
 the minimum number of peaks may be zero such that a peak may not be removed from a spectrum that does not comprise any peaks.
 the one or more attribute(s) of that added peak are determined in step 120. For example, the position of the new peak is determined.
 a predetermined modification may be applied.
 the predetermined modification may be an addition of noise to the current parameters ⁇ .
 the noise may be zeromean Gaussian noise having a predetermined variance, such as 1.
 step 120 a probability of moving to the new set of parameters given the current ⁇ set of parameters is determined. That is, a value of is determined in step
 the probability may be determined as:
 P is the predetermined probability of adding, removing or retaining the same number of peaks, such as 10%, as discussed above, and N is the number of peaks prior to the add or remove.
 spectra are generated. The spectra are generated based on each of the current ⁇ and new sets of parameters In embodiments where each vector
 a QMS mass spectrum consists of an ion current (y axis) plotted against a mass (m) to charge (q) position (xaxis).
 ions of a given mass to charge ratio which may be selected by a user, are injected by simulation into a QMS mass filter model.
 a large number of ions e.g. 100000, although it will be realised that other numbers may be used) are injected at each mass point (x axis position), thereby simulating the action of an ion source in a real QMS instrument.
 Ions are determined to start from a random spatial position over a disc corresponding to an ion source aperture, and their injection is randomised in time (i.e. occurs at any point of an applied voltage waveform).
 the random injection of ions in space and time justifies the term 'Monte Carlo ' to describe the nature of the simulation.
 E may be calculated from a geometry of the instrument by field plotting routines (e.g. finite difference methods).
 an acceleration a is determined at each point in space, and v (ion velocity) by numerical integration, trajectory s as a function of time by numerical integration of v.
 v ion velocity
 trajectory s as a function of time by numerical integration of v.
 the model provides a method of computation to determine the individual trajectories of large numbers of ions injected from the ion source 210 into the quadrupole mass filter (QMF) 220.
 a simulated mass scan is produced, which may comprise at least 10 5 ions, injected into the quadrupole model at, at least some, or each point on the mass scale.
 the model provides an accurate, physics based, forward model of mass filter behaviour and is able to predict the mass spectral peak shapes.
 Ions from the source 210 are assumed to originate at any point on a circular disk centred on a quadrupole axis and set at right angles to the axis.
 a quadrupole field starts immediately when the ions leave the source 210.
 the radii of both source and exit disks may be different and varied freely. In the simulation each ion is generated at a point in the source disk selected at random with no correlation between the points used for successive ions; all positions are equally probable, this corresponds to uniform source illumination.
 the time of origin in the source is also selected randomly; that is ions enter the quadrupole at random values of the phase of the radio frequency voltage, the alternating voltage, used to operate the filter.
 the simulation may be considered a MonteCarlo type simulation.
 a result of such simulation is illustrated in Figure 3.
 Ions may be simulated to travel through the filter 220 with constant velocity in the z direction. This is because the fringe fields may be ignored and all the electric fields experienced by ions are at right angles to the z direction; therefore there is no component of force to change the velocity in the z direction.
 rO filter radius
 Ions that pass through an exit aperture form the received signal at the detector 240. Ions may be traced through the filter 220 by determining their motion in the hyperbolic field.
 Their travel may be divided into small time intervals and their motion over each small time interval computed using the local field they experience the field is function of time because the applied voltage may include an AC component.
 the motion may be approximated using a fourth order RungeKutta algorithm.
 step 140 the two spectra are each compared against a measured spectrum 150.
 the measured spectrum 150 is that measured by the QMS detector.
 the comparison in step 140 is based upon a General Linear Model (GLM).
 GLM General Linear Model
 an amplitude of the peaks, A is constrained to be positive.
 Figure 3 illustrates a simulated spectrum 310 comprising three peaks produced by MonteCarlo simulation as described above according to an embodiment of the invention and a measured spectrum 310 comprising two peaks measured experimentally.
 Reference numerals in Figure 3 specifically indicate a peak in the spectrum corresponding to helium. In embodiments of the invention the following is calculated:
 GLM is defined as:
 M  1 is the dimensionality of ⁇ representing the number of peaks that ⁇ hypothesises
 d is the measured spectral data of length N
 Equation 3 may be approximated via numerical integration, wherein a sample is taken a predetermined number of times from a studentT pdf defined in Equation (3). The predetermined number of times may be 1000, although other values may be used. A fraction of the samples that satisfy the abovementioned constraint that all peaks have positive amplitude is then calculated. Note that the bottom most element of A relates to an offset which is allowed to be negative.
 step 160 it is determined whether to accept the new set of parameters
 step 160 is based upon the measured spectrum 150 and the current and new sets of parameters.
 the determination in step 160 is based upon a probability that the measured spectrum 150 corresponds to each of the current and new sets of parameters respectively, and a probability of
 Equation 6 The determination may be made in embodiment of the invention according to Equation 6 :
 the new set of parameters ⁇ is accepted, thus becoming the current parameters for a future iteration of the method, if ⁇ is greater than a threshold value.
 the threshold value may vary between each iteration of the method in some embodiments.
 the threshold may be a random number drawn from a uniform distribution between zero and one. If ⁇ is less than or equal to the threshold value then the current ⁇ set of parameters is retained in step 170 i.e. the new set of parameters ⁇ is discarded.
 the new set of parameters, ⁇ are not accepted then the current set of parameters, ⁇ , are logged or stored in step 180. Alternatively, if the new set of parameters, ⁇ , are accepted, then the new set of parameters are logged or stored in step 180.
 a set of parameters is stored at each iteration of step 180. As indicated by arrow between steps 180 and 120, the new set of parameters is provided to step 120 as an input for a next iteration of the method 100.
 the diversity of the sets of stored parameters is indicative of uncertainty of the parameters of the measured spectrum, including uncertainty related to whether the measured spectrum contains a peak at a given position or not. This captures uncertainty relating to whether a low amplitude peak is present and whether a peak is present in close proximity to another, such that embodiments of the invention can achieve enhanced detection of low amplitude peaks and improved resolution of closely located peaks.
 a most likely number of peaks present in the measured spectrum can be identified by identifying a number of peaks that occurs most frequently in the set of stored parameters. To produce a single estimated output, an average may then be determined across the samples with that number of peaks once the list of peak positions has been sorted in order of ascending mass over charge. We can also manipulate the samples to derive an estimate of the amplitude corresponding to each peak (190). It is innovative that, in steps 190 and 150, we enforce the constraint that the amplitude relates to a physical abundance and is therefore positive.
 Embodiments of the present invention provide a method of determining a composition of a spectrum by iteratively simulating a spectrum and comparing the simulated spectrum with a measured spectrum.
 the composition of the measured spectrum may be determined with increased accuracy.
 embodiments of the present invention can be realised in the form of hardware, software or a combination of hardware and software. Any such software may be stored in the form of volatile or nonvolatile storage such as, for example, a storage device like a ROM, whether erasable or rewritable or not, or in the form of memory such as, for example, RAM, memory chips, device or integrated circuits or on an optically or magnetically readable medium such as, for example, a CD, DVD, magnetic disk or magnetic tape. It will be appreciated that the storage devices and storage media are embodiments of machinereadable storage that are suitable for storing a program or programs that, when executed, implement embodiments of the present invention.
 embodiments provide a program comprising code for implementing a system or method as claimed in any preceding claim and a machine readable storage storing such a program. Still further, embodiments of the present invention may be conveyed electronically via any medium such as a communication signal carried over a wired or wireless connection and embodiments suitably encompass the same.
Abstract
Embodiments of the present invention provide a computerimplemented method of determining a composition of a spectrum recorded at a detector, comprising determining a first estimate of at least one characteristic of a spectrum, simulating the spectrum at a detector based on the first estimate, wherein in said simulation the spectrum comprises one or more peaks and the amplitude of each peak is constrained to be positive, comparing the simulated spectrum with a spectrum recorded at the detector, and determining an updated estimate of the spectrum based on the comparison.
Description
Method and Apparatus for Determining a Composition of a Spectrum
Background A spectrum is a range over which one or more measurable properties of a physical phenomenon, such as the frequency of sound or electromagnetic radiation, or the mass of specific kinds of particles, can vary. The study of spectra is termed spectroscopy. Various spectroscopy techniques exist which include, Mass Spectrometry (MS), Nuclear Magnetic Resonance (NMR), Raman Spectroscopy, etc.
An example apparatus for mass spectrometry is a quadrupole mass spectrometer (QMS), although it will be realised that other MS apparatus exist. In the QMS, ions from a sample are injected into a quadrupole mass filter to which direct and alternating voltages are applied. Stable ions are able to pass through the filter to reach a detector. A spectral peak at a given mass is output by the detector. An amplitude or height of the peak is proportional to a concentration of ions in the sample at the given mass.
Problems can arise in spectroscopy when ions of similar mass arrive at the detector. In this case the output of the detector is indicative of two or more spectral peaks at similar mass which at least partly overlap. Alternatively or additionally, problems can arise in determining whether a peak at a single mass exists at very low amplitude in relation to noise present in the signal output by the detector. Although improvements have been made in producing instruments with very low noise and reducing overlap between peaks at a given separation, these instruments tend to have a large size and/or high cost.
It is an object of embodiments of the invention to at least mitigate one or more of the problems of the prior art.
Statements of Invention
According to an aspect of the present invention there is provided methods and apparatus as set forth in the appended claims.
According to an aspect of the present invention there is provided a computer implemented method of determining a composition of a spectrum recorded at a detector, comprising determining a first estimate of at least one characteristic of a spectrum, simulating the spectrum at a detector based on the first estimate, wherein in said simulation an amplitude of one or more peaks in the spectrum is constrained to be positive, comparing the simulated spectrum with a spectrum recorded at the detector, and determining an updated estimate of the spectrum based on the comparison.
The spectrum may comprise zero or more peaks. Optionally the spectrum comprises one or more peaks and the amplitude of each peak is constrained to be positive.
The method may comprise determining an initial estimate at least one initial characteristic of the spectrum. The method may comprise iteratively performing the steps of simulating the spectrum, comparing the simulated spectrum and determining the updated estimate of the spectrum.
The method may comprise determining a second estimate of at least one characteristic of the spectrum, simulating the spectra at the detector based on the first and second estimates, selecting one of the first and second estimates based on the comparison between the simulated spectra and the spectrum recorded at the detector.
The selecting of one of the first and second estimates may be further based on one or both of a probability of moving from the first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
The updated estimate of the spectrum may be based upon one or more statistical rules. The one or more statistical rules may define one or more of a likelihood of adding a peak to the spectrum, a likelihood of removing a peak from the spectrum and a likelihood of modifying one or more attributes of peaks forming the spectrum.
A computerimplemented method of determining a composition of a spectrum recorded at a detector, comprising providing a first estimate of at least one characteristic of a spectrum, determining second estimate of the at least one characteristic of the spectrum, simulating spectra at a detector corresponding to the first estimate and the second estimate, wherein said simulation includes injecting ions into a simulation of a detection apparatus comprising the detector according to a MonteCarlo method, and selecting one of the first and second estimates based on the simulation and a spectrum recorded at a detector. In said MonteCarlo method the ions may be injected into the detection apparatus randomised in one or both of space and time.
The method may comprise determining an initial estimate at least one initial characteristic of the spectrum, and iteratively performing the steps of determining the second estimate, simulating the spectrum, and selecting one of the first and second estimates.
The selected estimate may be utilised as the first estimate in a following iteration of the method.
The method may comprise comparing the simulated spectra with the spectrum recorded at the detector, and selecting one of the first and second estimates based on the comparison. The selecting is optionally further based on one or both of a probability of moving from the first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
In said simulation an amplitude of the one or more peaks may be constrained to be positive;
One or both of the first and second estimates may comprises one or more peaks present in the spectrum and an attribute of the one or more peaks.
According to another aspect of the present invention there is provided computer software which, when executed by a computer, is arranged to perform a method according to an aspect of the invention. The computer software may be stored on a computer readable medium. The computer software may be tangibly stored on a computer readable medium.
A computing apparatus comprising a memory and at least one processor, the memory storing computer executable instructions which, when executed by the at least one processor, perform a method according to an aspect of the invention.
Brief Description of the Drawings
Embodiments of the invention will now be described by way of example only, with reference to the accompanying figures, in which:
Figure 1 shows a method according to an embodiment of the invention;
Figure 2 is a schematic illustration of a quadrupole mass spectrometer; and
Figure 3 is an illustration of experimental and simulated spectra for hydrogen and helium according to an embodiment of the invention.
Detailed Description of Embodiments of the Invention
Figure 1 illustrates a method 100 according to an embodiment of the invention. The method 100 is a computerimplemented method of determining one or more aspects of a measured spectrum. The spectrum may be that output by a quadrupole mass spectrometer (QMS), although it will be realised that embodiments of the invention are not limited in this respect and that the spectrum may be determined in other ways such as Mass Spectrometry (MS), Nuclear Magnetic Resonance (NMR), Raman Spectroscopy, etc. However, for the purpose of illustration, embodiments will be described in relation to QMS.
A QMS 200 is schematically illustrated in Figure 2. The QMS 200 comprises a source of ions 210 which are injected into the QMS 200, as the skilled person will appreciate. The QMS comprises a number of electrodes, which are excited to provide a quadrupole electric field, which functions as a mass filter via application of a suitable combination of variable amplitude direct (U) and variable amplitude alternating (V) voltages from a voltage generator 230. For a given combination of U, V, frequency and electrode geometry ions of a given mass are stable (resonant) and others not. Stable ions reach a detector 240 where they form a spectral peak at the given mass (mass spectrum). The peak height is proportional to the concentration of ions in the sample at that mass.
Returning to Figure 1, step 1 10 comprises determining a set of parameters. Step 110 comprises determining an estimate of the spectrum. The spectrum may comprise one or more peaks, although it will be realised that the spectrum may not comprise any peaks. That is, the spectrum may only comprise noise. The spectrum is parameterised by a vector Θ. The vector Θ has a dimension of zero or more. In the case of the vector having a dimension of zero the spectrum does not comprise any peaks. The vector Θ defines parameters for each peak forming the spectrum. For each peak in the spectrum one or more parameters define the respective peaks. The one or more parameters per peak may comprise one or both of a position and width of each peak. The vector Θ may be determined not to include parameters defining an amplitude of each peak, such as a relative or absolute amplitude of the respective peak. In this case, the amplitude of each peak is determined in a subsequent step, as will be explained. In embodiments related to mass spectrometry, such as the exemplary explained embodiment, the vector Θ may define a ratio of mass over charge for each peak. Thus, given Θ, a shape of each peak forming the spectrum is defined.
In some embodiments of step 110, the vector Θ is a random set of initial parameters θο wherein the subscript 0 is indicative of the initial nature of the parameters i.e. for a zeroth iteration. The initial parameters may be sampled from an initial probability density function (pdf) ¾(¾) · I^{n an} example embodiment, samples from the initial pdf are constrained to consist of one peak. However it will be realised that other numbers of initial peaks may be selected. The position of the initial peak may be
determined to be distributed across an extent of the mass over charge range for the spectrum, such that wherein Δ is an interval of valid values for the peak position.
In step 110 one of more attributes of each peak are determined. For example, a position of each peak is selected in step 1 10. As the described embodiment relates to QMS, it is only necessary to select the position of each peak in the spectrum. The shape of each peak is related to parameters that include a U/V voltage ratio of the detector, as will be appreciated by the skilled person. However, in other nonQMS embodiments, other attributes of each peak may be selected, such as the width of each peak. The number of peaks and the respective position of each peak may be selected as a respective random, or pseudorandom, value, as will be appreciated.
In step 120 a further set of parameters is determined. The further set of parameters may be referred to as a new set of parameters. The new parameters may represent an updated estimate, or guess, as to a composition of peaks forming a spectrum. In this sense, the first set of parameters Θ may represent a current set of parameters. The new parameters may be defined by a second, updated, vector, θ . The second vector Θ may comprise more or less parameters than the first vector Θ i.e. more or less peaks may be defined by the new set of parameters.
new set of parameters may be determined according to one or more statistical
rules. In one embodiment, a peak is removed from the current set of parameters Θ with a predetermined probability, which may be 10% although it will be realised that other probabilities may be used. In one embodiment, a peak is added to the current set of parameters Θ with a predetermined probability, which may be 10% although it will be realised that other probabilities may be used. Furthermore, in one embodiment, one or more attributes of the peaks forming the current parameters Θ are modified with a predetermined probability, which may be 80% although it will be realised that other probabilities may be used. The total probabilities for all rules equals 100% as will be appreciated. It will be realised that the removal of peaks from the current set of parameters Θ is performed with respect to a minimum number of
peaks. The minimum number of peaks may be zero such that a peak may not be removed from a spectrum that does not comprise any peaks. In the case of a peak being added, the one or more attribute(s) of that added peak are determined in step 120. For example, the position of the new peak is determined. Similarly, where the attributes of the current peaks are modified a predetermined modification may be applied. The predetermined modification may be an addition of noise to the current parameters Θ. The noise may be zeromean Gaussian noise having a predetermined variance, such as 1. Thus, following step 120 the new set of parameters θ is determined.
In step 120 a probability of moving to the new set of parameters
given the current Θ set of parameters is determined. That is, a value of is determined in step
120. The probability may be determined as:
where p is the probability, P is the predetermined probability of adding, removing or retaining the same number of peaks, such as 10%, as discussed above, and N is the number of peaks prior to the add or remove.
Similarly, a reverse probability of moving from the new set of parameters to the
old set of parameters is determined.
In step 130 spectra are generated. The spectra are generated based on each of the current Θ and new sets of parameters In embodiments where each vector
comprises elements each defining a respective peak, a single peak is generated for the rth element of each of Θ and where i may be between 1 and the number of peaks
defined by each set of parameters I. The single peak spectrum for the rth element of the new set of parameters is In step 130 an output of MonteCarlo simulation is
used to determine one or more templates for predicting a shape of each peak given at least some of the parameters. The MonteCarlo simulation may be performed offline.
As will be appreciated, a QMS mass spectrum consists of an ion current (y axis) plotted against a mass (m) to charge (q) position (xaxis). In the simulation of step 140, ions of a given mass to charge ratio, which may be selected by a user, are injected by simulation into a QMS mass filter model. A large number of ions (e.g. 100000, although it will be realised that other numbers may be used) are injected at each mass point (x axis position), thereby simulating the action of an ion source in a real QMS instrument. Ions are determined to start from a random spatial position over a disc corresponding to an ion source aperture, and their injection is randomised in time (i.e. occurs at any point of an applied voltage waveform). The random injection of ions in space and time justifies the term 'Monte Carlo ' to describe the nature of the simulation. For each individual ion, the force (F) on the ion is calculated from a knowledge of the electric field (E) at each point according to F = q E where q = charge on the ion. E may be calculated from a geometry of the instrument by field plotting routines (e.g. finite difference methods). From F/m an acceleration a is determined at each point in space, and v (ion velocity) by numerical integration, trajectory s as a function of time by numerical integration of v. Hence, for each individual ion its trajectory in space and time is calculated and it may be determined whether said ion is stable (and forms part of the mass spectrum) or unstable (and is rejected from the QMS). Repeating for each of the ions (e.g. 100000 ions) allows a mass spectrum to be simulated for a given set of instrument conditions, geometry, electrode size and spacings, voltage excitation on electrodes and input ion energy.
The model provides a method of computation to determine the individual trajectories of large numbers of ions injected from the ion source 210 into the quadrupole mass filter (QMF) 220. A simulated mass scan is produced, which may comprise at least 10^{5} ions, injected into the quadrupole model at, at least some, or each point on the mass scale. The model provides an accurate, physics based, forward model of mass filter behaviour and is able to predict the mass spectral peak shapes.
Ions from the source 210 are assumed to originate at any point on a circular disk centred on a quadrupole axis and set at right angles to the axis. A quadrupole field starts immediately when the ions leave the source 210. The radii of both source and exit disks may be different and varied freely. In the simulation each ion is generated at
a point in the source disk selected at random with no correlation between the points used for successive ions; all positions are equally probable, this corresponds to uniform source illumination. The time of origin in the source is also selected randomly; that is ions enter the quadrupole at random values of the phase of the radio frequency voltage, the alternating voltage, used to operate the filter. Because of the random nature of the ion injection in space and time, the simulation may be considered a MonteCarlo type simulation. A result of such simulation is illustrated in Figure 3. Ions may be simulated to travel through the filter 220 with constant velocity in the z direction. This is because the fringe fields may be ignored and all the electric fields experienced by ions are at right angles to the z direction; therefore there is no component of force to change the velocity in the z direction. At any time when the magnitude of either the x or y coordinate of an ion exceeds a filter radius, rO, it is rejected. Ions that pass through an exit aperture form the received signal at the detector 240. Ions may be traced through the filter 220 by determining their motion in the hyperbolic field. Their travel may be divided into small time intervals and their motion over each small time interval computed using the local field they experience the field is function of time because the applied voltage may include an AC component. In some embodiments the motion may be approximated using a fourth order RungeKutta algorithm.
In step 140 the two spectra are each compared against a measured spectrum 150. The measured spectrum 150 is that measured by the QMS detector. Thus the measured spectrum 150 is provided as an input to step 140. The comparison in step 140 is based upon a General Linear Model (GLM). However, in embodiments of the invention, an amplitude of the peaks, A, is constrained to be positive.
Figure 3 illustrates a simulated spectrum 310 comprising three peaks produced by MonteCarlo simulation as described above according to an embodiment of the invention and a measured spectrum 310 comprising two peaks measured experimentally. Reference numerals in Figure 3 specifically indicate a peak in the spectrum corresponding to helium.
In embodiments of the invention the following is calculated:
Where the GLM is defined as:
and
M  1 is the dimensionality of Θ representing the number of peaks that Θ hypothesises;
d is the measured spectral data of length N;
d with all the elements equal to unity and used to model any offset present, although the skilled person will realise other values can be used; and γο and δ are hyperparameters, which may be assumed to be equal to unity.
The integral in Equation 3 may be approximated via numerical integration, wherein a sample is taken a predetermined number of times from a studentT pdf defined in Equation (3). The predetermined number of times may be 1000, although other values may be used. A fraction of the samples that satisfy the abovementioned constraint that all peaks have positive amplitude is then calculated. Note that the bottom most element of A relates to an offset which is allowed to be negative.
In step 160 it is determined whether to accept the new set of parameters
determination in step 160 is based upon the measured spectrum 150 and the current and new sets of parameters. In particular, the determination in step 160 is based upon a probability that the measured spectrum 150 corresponds to each of the current and new sets of parameters respectively, and a probability of
viceversa. The determination may be made in embodiment of the invention according to Equation 6 :
Eqn. 6 The new set of parameters θ is accepted, thus becoming the current parameters for a future iteration of the method, if η is greater than a threshold value. The threshold value may vary between each iteration of the method in some embodiments. The threshold may be a random number drawn from a uniform distribution between zero and one. If η is less than or equal to the threshold value then the current Θ set of parameters is retained in step 170 i.e. the new set of parameters θ is discarded.
If the new set of parameters, Θ, are not accepted then the current set of parameters, Θ, are logged or stored in step 180. Alternatively, if the new set of parameters, Θ, are accepted, then the new set of parameters are logged or stored in step 180. Thus, over repeated iterations of the method 100, a set of parameters is stored at each iteration of step 180. As indicated by arrow between steps 180 and 120, the new set of parameters is provided to step 120 as an input for a next iteration of the method 100.
The diversity of the sets of stored parameters is indicative of uncertainty of the parameters of the measured spectrum, including uncertainty related to whether the measured spectrum contains a peak at a given position or not. This captures uncertainty relating to whether a low amplitude peak is present and whether a peak is present in close proximity to another, such that embodiments of the invention can achieve enhanced detection of low amplitude peaks and improved resolution of closely located peaks.
A most likely number of peaks present in the measured spectrum can be identified by identifying a number of peaks that occurs most frequently in the set of stored parameters. To produce a single estimated output, an average may then be determined across the samples with that number of peaks once the list of peak positions has been sorted in order of ascending mass over charge. We can also manipulate the samples to derive an estimate of the amplitude corresponding to each peak (190). It is innovative that, in steps 190 and 150, we enforce the constraint that the amplitude relates to a physical abundance and is therefore positive.
Embodiments of the present invention provide a method of determining a composition of a spectrum by iteratively simulating a spectrum and comparing the simulated spectrum with a measured spectrum. Advantageously the composition of the measured spectrum may be determined with increased accuracy.
It will be appreciated that embodiments of the present invention can be realised in the form of hardware, software or a combination of hardware and software. Any such software may be stored in the form of volatile or nonvolatile storage such as, for example, a storage device like a ROM, whether erasable or rewritable or not, or in the form of memory such as, for example, RAM, memory chips, device or integrated circuits or on an optically or magnetically readable medium such as, for example, a CD, DVD, magnetic disk or magnetic tape. It will be appreciated that the storage devices and storage media are embodiments of machinereadable storage that are suitable for storing a program or programs that, when executed, implement embodiments of the present invention. Accordingly, embodiments provide a program comprising code for implementing a system or method as claimed in any preceding claim and a machine readable storage storing such a program. Still further,
embodiments of the present invention may be conveyed electronically via any medium such as a communication signal carried over a wired or wireless connection and embodiments suitably encompass the same.
All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and/or all of the steps of any method or process so disclosed, may be combined in any combination, except combinations where at least some of such features and/or steps are mutually exclusive.
Each feature disclosed in this specification (including any accompanying claims, abstract and drawings), may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise. Thus, unless expressly stated otherwise, each feature disclosed is one example only of a generic series of equivalent or similar features.
The invention is not restricted to the details of any foregoing embodiments. The invention extends to any novel one, or any novel combination, of the features disclosed in this specification (including any accompanying claims, abstract and drawings), or to any novel one, or any novel combination, of the steps of any method or process so disclosed. The claims should not be construed to cover merely the foregoing embodiments, but also any embodiments which fall within the scope of the claims.
Claims
CLAIMS 1. A computerimplemented method of determining a composition of a spectrum recorded at a detector, comprising: determining a first estimate of at least one characteristic of a spectrum; simulating the spectrum at a detector based on the first estimate, wherein in said simulation the spectrum comprises one or more peaks and the amplitude of each peak is constrained to be positive; comparing the simulated spectrum with a spectrum recorded at the detector; and determining an updated estimate of the spectrum based on the comparison.
2. The method of claim 1, comprising determining an initial estimate at least one initial characteristic of the spectrum, and iteratively performing the steps of simulating the spectrum, comparing the simulated spectrum and determining the updated estimate of the spectrum.
3. The method of claim 1 or 2, comprising: determining a second estimate of at least one characteristic of the spectrum; simulating the spectra at the detector based on the first and second estimates; selecting one of the first and second estimates based on the comparison between the simulated spectra and the spectrum recorded at the detector.
4. The method of claim 3, wherein the selecting of one of the first and second estimates is further based on one or both of a probability of moving from the
first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
5. The method of any preceding claim, wherein determining the updated estimate of the spectrum is based upon one or more statistical rules.
6. The method of claim 5, wherein the one or more statistical rules define one or more of a likelihood of adding a peak to the spectrum, a likelihood of removing a peak from the spectrum and a likelihood of modifying one or more attributes of peaks forming the spectrum.
7. A computerimplemented method of determining a composition of a spectrum recorded at a detector, comprising: providing a first estimate of at least one characteristic of a spectrum; determining second estimate of the at least one characteristic of the spectrum; simulating spectra at a detector corresponding to the first estimate and the second estimate, wherein said simulation includes injecting ions into a simulation of a detection apparatus comprising the detector according to a MonteCarlo method; and selecting one of the first and second estimates based on the simulation and a spectrum recorded at a detector.
8. The method of claim 7, wherein in said MonteCarlo method the ions are injected into the detection apparatus randomised in one or both of space and time.
9. The method of claim 7 or 8, comprising determining an initial estimate at least one initial characteristic of the spectrum, and iteratively performing the steps
of determining the second estimate, simulating the spectrum, and selecting one of the first and second estimates.
10. The method of claim 9, wherein the selected estimate is utilised as the first estimate in a following iteration of the method.
11. The method of any of claims 7 to 10, comprising: comparing the simulated spectra with the spectrum recorded at the detector; and selecting one of the first and second estimates based on the comparison.
12. The method of claim 11, wherein the selecting is further based on one or both of a probability of moving from the first estimate to the second estimate and from the second estimate to the first estimate of the spectrum.
13. The method of any of claims 7 to 12, wherein in said simulation an amplitude of the one or more peaks is constrained to be positive;
14. The method of any of claims 7 to 13, wherein one or both of the first and second estimates comprises one or more peaks present in the spectrum and an attribute of the one or more peaks.
15. Computer software which, when executed by a computer, is arranged to perform a method according to any preceding claim.
16. The computer software of claim 15 stored on a computer readable medium.
17. A computing apparatus comprising a memory and at least one processor, the memory storing computer executable instructions which, when executed by the at least one processor, perform a method according to any of claims 1 to 6 or 7 to 14.
Applications Claiming Priority (2)
Application Number  Priority Date  Filing Date  Title 

GBGB1519736.1A GB201519736D0 (en)  20151109  20151109  Method and apparatus for determining a composition of a spectrum 
GB1519736.1  20151109 
Publications (1)
Publication Number  Publication Date 

WO2017081454A1 true WO2017081454A1 (en)  20170518 
Family
ID=55132506
Family Applications (1)
Application Number  Title  Priority Date  Filing Date 

PCT/GB2016/053490 WO2017081454A1 (en)  20151109  20161108  Method and apparatus for determining a composition of a spectrum 
Country Status (2)
Country  Link 

GB (1)  GB201519736D0 (en) 
WO (1)  WO2017081454A1 (en) 
Citations (3)
Publication number  Priority date  Publication date  Assignee  Title 

WO2006050226A2 (en) *  20041028  20060511  Cerno Bioscience Llc  Qualitative and quantitative mass spectral analysis 
WO2011128702A1 (en) *  20100415  20111020  Micromass Uk Limited  Method and system of identifying a sample by analyising a mass spectrum by the use of a bayesian inference technique 
US20140358451A1 (en) *  20130604  20141204  Arizona Board Of Regents On Behalf Of Arizona State University  Fractional Abundance Estimation from Electrospray Ionization TimeofFlight Mass Spectrum 

2015
 20151109 GB GBGB1519736.1A patent/GB201519736D0/en not_active Ceased

2016
 20161108 WO PCT/GB2016/053490 patent/WO2017081454A1/en active Application Filing
Patent Citations (3)
Publication number  Priority date  Publication date  Assignee  Title 

WO2006050226A2 (en) *  20041028  20060511  Cerno Bioscience Llc  Qualitative and quantitative mass spectral analysis 
WO2011128702A1 (en) *  20100415  20111020  Micromass Uk Limited  Method and system of identifying a sample by analyising a mass spectrum by the use of a bayesian inference technique 
US20140358451A1 (en) *  20130604  20141204  Arizona Board Of Regents On Behalf Of Arizona State University  Fractional Abundance Estimation from Electrospray Ionization TimeofFlight Mass Spectrum 
NonPatent Citations (2)
Title 

LI ET AL.: "Accurate Identification of Mass Peaks for Tandem Mass Spectra Using MCMC Model", TSINGHUA SCIENCE AND TECHNOLOGY, vol. 20, no. 5, October 2015 (20151001), pages 453  459, XP002767171 * 
S. U. A. H. SYED ET AL: "Quadrupole mass filter operation under the influence of magnetic field", JOURNAL OF MASS SPECTROMETRY., vol. 48, no. 12, 1 December 2013 (20131201), GB, pages 1325  1339, XP055344700, ISSN: 10765174, DOI: 10.1002/jms.3293 * 
Also Published As
Publication number  Publication date 

GB201519736D0 (en)  20151223 
Similar Documents
Publication  Publication Date  Title 

CMS collaboration  Identification of bquark jets with the CMS experiment  
Heinrich et al.  NLO QCD corrections to $${W}^{+}{W}^{} b\overline {b} $$ production with leptonic decays in the light of top quark mass and asymmetry measurements  
Nachman et al.  Significance variables  
Carbone  Signals of the Giant Pairing Vibration in 14 C and 15 C nuclei populated by (18 O, 16 O) twoneutron transfer reactions  
Beaujean et al.  Pvalues for model evaluation  
Leigh et al.  $\nu $flows: Conditional neutrino regression  
Grote  Pattern recognition in highenergy physics  
Haley et al.  Processing APT spectral backgrounds for improved quantification  
CN101488456B (en)  Etching amount calculating method and etching amount calculating apparatus  
US20130197861A1 (en)  Method for spectrometric analysis and related device  
Werner  Simulation of electron spectra for surface analysis using the partial‐intensity approach (PIA)  
US7072772B2 (en)  Method and apparatus for modeling mass spectrometer lineshapes  
JP6318722B2 (en)  Environmental load molecule generation source evaluation method, environmental load molecule generation source evaluation system, and computer program  
WO2017081454A1 (en)  Method and apparatus for determining a composition of a spectrum  
Maxeiner et al.  Simulation of ion beam scattering in a gas stripper  
Lamparth et al.  Gaussian processes and bayesian optimization for high precision experiments  
Cheng et al.  Measuring invisible particle masses using a single short decay chain  
Zinser  Double differential cross section for DrellYan production of highmass $ e^+ e^$pairs in $ pp $ collisions at $\sqrt {s}= 8$ TeV with the ATLAS experiment  
Stillings  Search for the associated production of a W boson and a top quark with the ATLAS detector at 7 TeV  
EP4105672A1 (en)  Systems and methods for provisioning training data to enable neural networks to analyze signals in nmr measurements  
Ibrahimi et al.  Accelerated timeofflight mass spectrometry  
Petrović et al.  Expert System for threshold spectra analysis of nitrogen molecules  
US20140358451A1 (en)  Fractional Abundance Estimation from Electrospray Ionization TimeofFlight Mass Spectrum  
Gosz et al.  Application of deep neural network in finding of repulsive part of molecular potential based on dispersed emission spectra  
Foppiani  Identifying Neutrinos: Tracks and Showers 
Legal Events
Date  Code  Title  Description 

121  Ep: the epo has been informed by wipo that ep was designated in this application 
Ref document number: 16797601 Country of ref document: EP Kind code of ref document: A1 

NENP  Nonentry into the national phase 
Ref country code: DE 

122  Ep: pct application nonentry in european phase 
Ref document number: 16797601 Country of ref document: EP Kind code of ref document: A1 