WO2016178033A1 - Mixture identification - Google Patents

Mixture identification Download PDF

Info

Publication number
WO2016178033A1
WO2016178033A1 PCT/GB2016/051315 GB2016051315W WO2016178033A1 WO 2016178033 A1 WO2016178033 A1 WO 2016178033A1 GB 2016051315 W GB2016051315 W GB 2016051315W WO 2016178033 A1 WO2016178033 A1 WO 2016178033A1
Authority
WO
WIPO (PCT)
Prior art keywords
mixture
isotope
compound
data
sample mixture
Prior art date
Application number
PCT/GB2016/051315
Other languages
French (fr)
Inventor
Michael Andrew SUDNIK
Original Assignee
Isoprime Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Isoprime Limited filed Critical Isoprime Limited
Priority to EP16728070.0A priority Critical patent/EP3292402A1/en
Publication of WO2016178033A1 publication Critical patent/WO2016178033A1/en

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/86Signal analysis
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/86Signal analysis
    • G01N30/8675Evaluation, i.e. decoding of the signal into analytical information
    • G01N30/8679Target compound analysis, i.e. whereby a limited number of peaks is analysed
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/86Signal analysis
    • G01N30/8675Evaluation, i.e. decoding of the signal into analytical information
    • G01N30/8686Fingerprinting, e.g. without prior knowledge of the sample components
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/86Signal analysis
    • G01N30/8696Details of Software
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/20Identification of molecular entities, parts thereof or of chemical compositions
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01JELECTRIC DISCHARGE TUBES OR DISCHARGE LAMPS
    • H01J49/00Particle spectrometers or separator tubes
    • H01J49/0027Methods for using particle spectrometers
    • H01J49/0036Step by step routines describing the handling of the data generated during a measurement
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/84Preparation of the fraction to be distributed
    • G01N2030/8405Preparation of the fraction to be distributed using pyrolysis
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/88Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86
    • G01N2030/8809Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86 analysis specially adapted for the sample
    • G01N2030/8868Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86 analysis specially adapted for the sample elemental analysis, e.g. isotope dilution analysis

Definitions

  • the present invention relates to an apparatus and method for identifying mixtures of compounds and/or compounds within a mixture.
  • the present invention relates to an apparatus and method for processing isotope ratio mass spectrometry data resulting from chemical analysis of a sample and identifying mixtures of compounds and/or compounds within a mixture for said sample.
  • GC-MS gas chromatography mass spectrometry
  • the output chromatograms can be used to analyze compound retention time in the gas chromatography apparatus; each mixture should have a unique chromatogram profile and therefore, by comparison with reference spectra, the mixture is identifiable.
  • retention time is highly susceptible to environmental variations and so very careful control over the conditions during GC-MS is required.
  • a computer- implemented method of identifying a sample mixture based on relative isotope composition for a plurality of compounds in the sample mixture comprising: receiving, at a computing device, first data comprising a set of three or more isotope ratios associated with a sample mixture comprising a plurality of unknown compounds, wherein each isotope ratio in the set of three or more isotope ratios corresponds to an isotope ratio for a respective unknown compound in the plurality of unknown compounds measured using an uncalibrated spectroscopy process; receiving, at the computing device, second data indicative of a first set of three or more delta values associated with a first reference mixture comprising a plurality of known compounds, wherein each delta value in the first set of three or more delta values is indicative of an isotope ratio for a respective known compound in the plurality of known compounds relative to a first isotope ratio for a standard compound measured using a calibrated spectroscopy
  • each known compound in the plurality of known compounds is associated with a respective compound identifier in a plurality of compound identifiers
  • the method further comprises generating, at the computing device, data associating at least one compound identifier in the plurality of compound identifiers with at least one unknown compound in the plurality of unknown compounds in the sample mixture, when the correlation measure satisfies the mapping condition.
  • the method further comprises, at the computing device, associating data representing a peak in a chromatograph for the sample mixture with the at least one compound identifier in the plurality of compound identifiers, wherein the peak corresponds to the at least one unknown compound in the plurality of unknown compounds.
  • the first data is generated by application of isotope ratio spectrometry, isotope ratio mass spectrometry, cavity ring-down spectroscopy, infrared isotope spectroscopy, integrated cavity output spectroscopy, and/or quantum cascade laser spectroscopy to one or more analytes separated from the sample mixture.
  • the one or more analytes are separated from the sample mixture by application of gas chromatography, liquid chromatography, ion chromatography, and/or supercritical fluid chromatography to the sample mixture.
  • the correlation measure is determined based on a regression analysis performed on the set of three or more isotope ratios associated with the sample mixture and the first set of three or more delta values associated with the first reference mixture.
  • the correlation measure is an R-squared measure determined based on the regression analysis.
  • the mapping condition specifies a threshold correlation measure.
  • a non-transitory computer-readable medium comprising computer-executable instructions that, when executed by a processor in a computing device, cause the computing device to perform a method of identifying a sample mixture based on relative isotope composition for a plurality of compounds in the sample mixtures, the method comprising: receiving, at the computing apparatus, first data indicative of a set of three or more isotope ratios associated with a sample mixture comprising a plurality of unknown compounds, wherein each isotope ratio in the set of three or more isotope ratios is indicative of an isotope ratio for a respective unknown compound in the plurality of unknown compounds; receiving, at the computing device, second data indicative of a first set of three or more delta values associated with a first reference mixture comprising a plurality of known compounds, wherein each delta value in the first set of three or more delta values is indicative of an isotope ratio for a respective known compound in the plurality of known compounds relative to
  • a system for identifying a sample mixture based on relative isotope composition for a plurality of compounds in the sample mixture comprising at least one memory including computer program code and at least one processor in data communication with the at least one memory, wherein the at least one processor is configured to execute the computer program code and cause the system to: receive first data comprising a set of three or more isotope ratios associated with a sample mixture comprising a plurality of unknown compounds, wherein each isotope ratio in the set of three or more isotope ratios corresponds to an isotope ratio for a respective unknown compound in the plurality of unknown compounds measured using an uncalibrated spectroscopy process; receive second data indicative of a first set of three or more delta values associated with a first reference mixture comprising a plurality of known compounds, wherein each delta value in the first set of three or more delta values is indicative of an isotope ratio for a respective known compound in the plurality of known compounds relative
  • Figure 1 is a schematic diagram showing a system for analyzing an unidentified mixture in accordance with an embodiment.
  • Figure 2 is a flow chart showing a method for analyzing the unidentified mixture of compounds in accordance with an embodiment.
  • Figure 3 is a flow chart showing a method for identifying compounds in the unidentified mixture in accordance with an embodiment.
  • Figure 4 is a chart showing a chromatogram for the unidentified mixture in accordance with an embodiment.
  • Figure 5 is a chart showing a scatter plot of reference delta values plotted against a first set of sampled delta values for compounds in the unidentified mixture in accordance with an embodiment.
  • Figure 6 is a chart showing the chromatogram for the unidentified mixture with peaks labelled according to an embodiment.
  • Figure 7 is a chart showing a scatter plot of reference delta values plotted against a second set of sampled delta values for compounds in the unidentified mixture in accordance with an embodiment.
  • Figure 8 is a schematic diagram showing a computer system for analyzing an unidentified mixture in accordance with an embodiment.
  • the isotope ratio of a given compound is affected by the route by which the compound is made, and in some cases, by the environmental conditions under which the compound is made. However, for stable isotopes such as 13 C and deuterium, the isotope ratio will not normally change once the compound has been made. Thus, in a mixture of compounds, each compound will have a specific and typically invariable isotope ratio.
  • the isotope ratio of a given compound may be quantified using a "delta value" or ⁇ , as is known in the art.
  • the delta value provides a measure of an isotope ratio for the given compound, relative to a standard isotope ratio.
  • the delta value for a given compound is calculated according to the following formula:
  • the standard isotope ratio featuring in the Equation 1 is that of a standard material and is selected depending on the element in question.
  • Examples of standard isotope ratios used for calculation of delta values are the standard isotope ratios defined and published by the International Atomic Energy Agency (IAEA) and the National Institute of Standards and Technology (NIST).
  • the isotope ratio of a compound may be measured using an isotope ratio spectrometer, such as an isotope ratio mass spectrometer.
  • an isotope ratio spectrometer such as an isotope ratio mass spectrometer.
  • the compound of interest is converted (e.g., through catalytic combustion) to obtain one or more products, which are provided to the isotope ratio spectrometer as "analytes" for isotope ratio analysis.
  • the analytes are quantitative conversion products of the compound of interest (whose isotopic compositions cannot be determined through spectroscopic means).
  • Examples of typical analytes include CO2 (where the isotope ratio measured is 13 C/ 12 C), SO2 (where the isotope ratio measured is 34 S/ 32 S), N2 (where the isotope ratio measured is 15 N/ 14 N), CO (where the isotope ratio measured is 18 0/ 16 0) H2 (where the isotope ratio measured is 2 H/3 ⁇ 4, i.e. D/H) and H2O (where the isotope ratio measured is 2 H/3 ⁇ 4, i.e. D/H, and 18 0/ 16 0).
  • the corresponding delta values for these example analytes are 5 13 C, 5 34 S, ⁇ 15 ⁇ , ⁇ 18 0 and 5D respectively.
  • the delta values for an unknown compound are determined relative to the isotope ratios measured for monitoring gas (e.g. CO2), which has known delta values.
  • the delta values for an unknown compound may calculated according to the following formula:
  • the accuracy of delta values measured according Equation 2 are sensitive to factors such as isotopic drift at different delta value magnitudes and transient changes to the delta values of the monitoring gas.
  • the standard or reference compound is analyzed using an uncalibrated isotope ratio spectrometer, it is likely that the measured delta value will be different to those specified according to the published standard.
  • Calibration of the isotope ratio spectrometer is typically performed by simultaneously analyzing the sample compound and a standard compound for which the delta values are known. The delta values obtained for the standard compound are compared to the known or expected delta values to enable calculation of a correction factor which can be applied to the delta values measure for the sample. Once calibrated, the corrected delta values for the sample can be used to for identification purposes by, for example, performing a direct comparison with the delta values for a plurality of compounds with known delta values. Typically, this calibration process can be time consuming and requires specialist knowledge of the underlying isotopic ratio spectrometer being used.
  • embodiments of the present invention facilitate identification of a mixture of unknown or unidentified compounds by utilizing the relative differences between their respective isotope ratios as a basis for correlation with one or more reference mixtures.
  • embodiments of the invention utilize the plurality of isotope ratios measured for a given mixture of compounds as a "relative isotope ratio signature" which is compared or correlated against the relative delta value signature of one or more reference mixtures, thereby facilitating identification of the given mixture.
  • This approach is advantageous in that it does not require calibration of the monitoring gas to a particular standard and is therefore less susceptible to environmental variations when compared to prior art techniques for mixture identification.
  • Figure 1 shows a system 100 for analyzing a sample mixture 102 comprising a plurality of compounds in accordance with an embodiment.
  • the system 100 comprises a gas chromatograph 104, an isotope ratio mass spectrometer 108 and an analysis engine 112, which are configured to analyze and identify the sample mixture 102 based on the particular isotope ratios of its constituent compounds.
  • the gas chromatograph 104 separates the constituent compounds of the sample mixture 102 using gas chromatography, as is known in the art.
  • the separated compounds exit the gas chromatograph 104 temporally spaced according to their respective retention times.
  • the temporal spacing between the separated compounds introduced by the gas chromatography is maintained throughout the subsequent isotope ratio analysis to allow the isotope ratios for each separated compound to be determined individually, as discussed below.
  • the separated compounds output by the gas chromatograph 104 are converted (i.e. oxidized or reduced) to form one of more respective analytes 106, which are in turn delivered to the isotope ratio mass spectrometer 108 for analysis.
  • the isotope ratio mass spectrometer 108 for analysis, only a subset of the analytes 106 obtained from the combustion process is provided to the isotope ratio mass spectrometer 108 for analysis, depending on the particular isotope ratios to be measured.
  • all analytes 106 are provided to the isotope ratio mass spectrometer 108 but only a subset of the analytes 106 are measured.
  • the subset of analytes 106 provided to or measured by the isotope ratio mass spectrometer 108 may comprise CO2, H2 and/or H2O, thereby enabling the measurement of the CI C and/or D/H isotope ratios, and calculation of the respective 5 13 C and 5D delta values for each separated compound of the sample mixture.
  • the isotope ratio mass spectrometer 108 is arranged to measure to the isotope ratios of the received analytes 106 and to calculate the corresponding delta values for the respective compounds of the sample mixture 102 based on an appropriate monitoring gas.
  • the isotope ratio mass spectrometer 108 outputs data 1 10 indicative of the determined delta values for each compound of the sample mixture, and transmits data 1 10 to the analysis engine 1 12 for further processing and identification of the sample mixture 102.
  • the isotope ratio mass spectrometer 108 may output and transmit data 1 10 on a per-compound basis to account for the temporal spacing between compounds and their respective analytes introduced by the gas chromatograph 104.
  • the isotope ratio mass spectrometer 108 may delay output and transmission of data 1 10 until the analytes for a predetermined number of separated compounds have been analyzed.
  • the isotope ratio mass spectrometer 108 may be uncalibrated or calibrated to an arbitrary standard, and as a result the delta values obtained for mixture of compounds may be considered as being essentially uncalibrated delta values.
  • the analysis engine 1 12 is a computing device provided with appropriate software for processing of the data 1 10 received from the isotope ratio mass spectrometer 108 in accordance with the embodiments described below.
  • the analysis engine 1 12 may be integrated with the isotope ratio mass spectrometer 108 in a single physical unit.
  • the analysis engine may be a standalone computing device which is configured to receive data 1 10 over a communications network or via a data storage medium, such as a CD-ROM or a USB storage device.
  • the analysis engine 1 12 is configured to perform a correlation process based on the calculated delta values for the sample mixture, represented by data 1 10, and delta values for one or more reference mixtures of known identity and composition.
  • Data 1 14 indicative of delta values for the one or more reference mixtures is received and/or stored by the analysis engine 1 12.
  • the delta values for the one or more reference mixtures may be provided to the analysis engine 1 12 by a user via a graphical user interface of the analysis engine 1 12.
  • data 1 14 may be received by the analysis engine 1 12 via a communications network or a data storage medium, such as a CD-ROM or a USB storage device.
  • the reference delta values represented by data 114 are those published by the Biogeochemical Laboratories, Indiana University, United States of America, for one or more "Hydrogen and Carbon Stable Isotope Reference Materials".
  • a specific example of such a reference material is the "Fatty Acid Ester Mixture F8".
  • the reference delta values may be obtained from a sample mixture analyzed using system 100 and stored for comparison with subsequent sample mixtures.
  • the analysis engine 112 is configured to perform a correlation process on the basis of data 110 and data 114 to determine the identity of the sample mixture 102.
  • the analysis engine 112 is configured to output the result of the correlation process as data 116, which is indicative of an association between the sample mixture 102 and the determined identity of the sample mixture 102. Further details of the correlation process performed by the analysis engine 112 and the nature of data 116 resulting from the correlation process are discussed below with reference to Figures 2 to 7.
  • FIG. 2 shows a method 200 for identifying a sample mixture in accordance with an embodiment.
  • the sample mixture 102 is received by the analysis system 100 for identification (step S202).
  • the sample mixture 102 is separated into its constituent compounds by gas chromatography performed by the gas chromatograph 112 (step S204).
  • Each separated compound is separately evolved by combustion to produce one or more respective analytes 106 (step S206).
  • the analytes 106 resulting from step S206 will comprise Fh, FhO and/or CO2.
  • step S206 the analytes 106 resulting from step S206 (or a subset thereof) are fed to the isotope ratio mass spectrometer 108 on a per-compound basis, and the isotope abundance is measured relative to those measured for the particular monitoring gas being used [step S208].
  • the relative isotope ratios for a given compound of the sample mixture has been measured, the corresponding delta value(s) for the given compound are calculated, based on the relative isotope ratios measured for the particular monitoring gas [step S210].
  • data 110 indicative of the calculated delta values for compounds in the sample mixture 102 is sent to the analysis engine 112 and a correlation process is performed based on data 110 and data 114 to identify the sample mixture 102 [step S212].
  • Figure 3 shows a correlation process 300 performed by the analysis engine 112 to identify the sample mixture 110 in accordance with an embodiment.
  • the analysis engine 112 receives data 110 indicative of the delta values calculated for a plurality of compounds in the sample mixture 102 [step S302].
  • the analysis engine 112 receives or retrieves data 114 indicative of the delta values for a plurality of known compounds in a reference mixture [step S304].
  • a correlation measure for the respective delta values is calculated [step S306].
  • the calculated correlation quantifies the statistical relationship between the delta values for the sample mixture 102 and the delta values for the reference mixture, and thus indicates the likelihood that the sample mixture 102 corresponds to the reference mixture.
  • the correlation measure may correspond to the well-known R-squared value calculated as part of a regression analysis performed on the respective delta values.
  • the analysis engine 112 determines whether the calculated correlation measure satisfies one or more criteria defined by a mapping condition [step S308].
  • the mapping condition may specify that the calculated correlation measure must satisfy a predetermined threshold correlation measure for the reference material.
  • the analysis engine 112 upon determining that the calculated correlation measure satisfies the mapping condition, the analysis engine 112 generates data associating the sample mixture 102 with the identity of the reference mixture [step S310]. For example, the analysis engine 112 may generate data indicating the association between the sample mixture and the identity of the reference mixture, and optionally the calculated correlation measure to provide an indication of "certainty" for the association. According to some embodiments, once the identity of the sample mixture 102 has been determined, the analysis engine 112 may optionally utilize the determined correlation between the delta values of the sample mixture 102 and the delta values of the reference mixture to identify one or more peaks in a chromatogram for the sample mixture [step S312]. This latter step is explained further below with reference to Figures 4 to 7.
  • the method 300 of Figure 3 may be repeated for a plurality of reference mixtures to determine a plurality of respective correlation measures.
  • the sample mixture 102 may be identified as this reference mixture which is associated with the "best" correlation as indicated by the respective correlation measure.
  • the correlation measure may by calculated on a subset of the delta values for the sample mixture and/or may be recalculated as additional delta value data becomes available as a result of the temporal spacing between the separated compounds introduced by the gas chromatography.
  • a plurality of correlation measures may be calculated for different respective subsets of the delta value data to enable identification of anomalous delta values (e.g. due to noise) and to ensure that such anomalous delta values are disregarded for the purpose of identifying the sample mixture.
  • Figure 4 show an example of a chromatogram 400 generated by the gas chromatograph 104 for the sample mixture 102.
  • the chromatogram 400 is a plot of retention time 402 against intensity 404 and comprises a set of six peaks 406 which occur at different respective retention times.
  • Each peak 406 in the chromatogram 400 corresponds to a respective compound separated from the sample mixture 110, but the identity of the respective compounds is initially unknown.
  • Figure 5 is a scatter chart 500 illustrating a technique for identifying compounds in the sample mixture 102 in accordance with an embodiment.
  • the scatter chart 500 is a plot of delta values 502 (6 re f) for a given reference mixture against a first set of sampled delta values 504 (6 sa mi) for the unidentified mixture.
  • the scatter chart 500 shows six data points 506 corresponding to the six unidentified peaks appearing in the corresponding chromatogram 400 of Figure 4, and a line of best fit 508 determined using a linear regression analysis during the correlation process of Figure 3.
  • the calculated correlation measure is the R-squared value with respect to the line of best fit 508, and it is assumed that the correlation measure satisfies the mapping condition specified in step S308 of Figure 3.
  • the unidentified mixture 102 corresponds to the reference mixture
  • the data point 506-3 corresponds to the sampled delta value determined for a compound corresponding to the fourth peak 406-4 in the chromatogram 400 of Figure 4 and also to "compound A" of the reference mixture.
  • the compound corresponding to the fourth peak in chromatogram 400 may be identified as “compound A” and the analysis engine 112 may generate data indicating an association between the fourth peak 406-4 in chromatogram 400 and "compound A".
  • the analysis engine 112 may generate data indicating an association between the fourth peak 406-4 in chromatogram 400 and "compound A”.
  • this identification of the unidentified mixture and individual peaks in the associated chromatogram is independent of monitoring gas used to obtain the sample delta values.
  • identification of the unidentified mixture and the individual peaks in the associated chromatogram can be achieved without requiring calibration of the monitoring gas to a recognized isotopic standard.
  • the monitoring gas isotope ratio simply serves to scale the measured isotope ratios obtained for the sample mixture by an arbitrary and uncalibrated amount, and does not have a substantive bearing on the correlation process described above. Nevertheless, use of a monitoring gas may be desirable in situations where it is necessary or desirable to monitor for transient variations in delta value due to environmental conditions or the like.
  • Figure 7 illustrates the correlation between the same sample mixture 102 but, due to a different calibration of the isotope ratio spectrometer 108 (or use of a different isotope ratio spectrometer), a second set of sampled delta values 704 (6 sa m2) is produced, different to the first set of delta values 504 (6 sam i) shown in Figure 5. Specifically, the second set of sampled delta values 704 (6 sa m2) is scaled by an unknown amount relative to the first set of sampled delta values 704 (6 sam i) and remains uncalibrated with respect to the delta values 702 (6 re f) for the reference mixture.
  • the delta values 702 (6 re f) for the reference mixture are unaffected by the calibration of the isotope ratio spectrometer 108 and thus are unchanged from the values 502 (6 re f) shown in Figure 5.
  • the scales for the x and y axes shown in Figures 5 and 7 are the same.
  • the scatter chart 700 of Figure 7 shows six data points 706 corresponding to the six unidentified peaks appearing in the corresponding chromatogram 400 of Figure 4, and a line of best fit 708 determined using a linear regression analysis during the correlation process of Figure 3.
  • the calculated correlation measure e.g. the R-squared value
  • the second set of sampled delta values 704 (6 sa m2) with respect to the line of best fit 508 will be the same as the correlation measure calculated for the first set of sampled delta values 504 (6 sam i), irrespective of the different calibrations used.
  • identification of the sample may be performed by direct correlation between the associated sample isotope rations and the reference delta values, without the involvement of a monitoring gas, calibration or otherwise.
  • the analysis engine 112 may be embedded as an embedded computer device in the isotope ratio mass spectrometer 108 of Figure 1.
  • the combined analysis engine 112 and isotope ratio mass spectrometer 108 function as a standalone apparatus for identification of unidentified sample mixtures.
  • the standalone apparatus may be provided with a display means (e.g. an LCD display panel), and input means (e.g. a keyboard and mouse) which enables a user of the apparatus to control the analysis of the sample mixture and view results of the identification and correlation process described above.
  • At least parts of the methods discussed above with reference to Figures 1 to 6 may be implemented using software instructions stored on a computer useable storage medium for execution by a computing device.
  • a computer program product includes a computer useable storage medium to store a computer readable program that, when executed on a computing device, causes the computing device to perform operations, as described hereinbefore.
  • embodiments of the invention can be embodied in the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computing device or any instruction execution system.
  • a computer-usable or computer-readable medium can be any apparatus that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
  • the apparatus may be a transitory or a non-transitory computer- readable medium.
  • the computer-useable or computer-readable medium can be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device), or a propagation medium.
  • Examples of a computer-readable medium include a semiconductor or solid state memory, magnetic tape, a removable computer diskette, a random access memory (RAM), a read-only memory (ROM), a rigid magnetic disk, and an optical disk.
  • FIG. 8 depicts schematically an example of a suitable computer 800 that includes a processor 802, a memory 804, a storage device 806 and a network interface 808.
  • the processor 802 may include a multifunction processor and/or an application-specific processor, examples of which include the PowerPCTM family of processors by IBMTM and the x86 and x86-64 family of processors by INTELTM.
  • the memory 804 within the computer is typically RAM and storage device 806 is typically a large capacity permanent storage device such as a magnetic hard disk drive or solid state memory device.
  • the network interface 808 enables communications with other computers in a network using as suitable protocol, such as the Internet Protocol (IP) and the processor 802 executes computer readable instructions stored in storage 806 to implement embodiments of the invention as described hereinbefore with reference to Figures 1 to 7.
  • IP Internet Protocol
  • the embodiments described above have employed a gas chromatography isotope ratio mass spectrometer to determine the isotope ratios of the analytes 106, it will be appreciated that other techniques may be employed.
  • the separation may be carried out using a thermogravimetric analyzer, gas chromatography, liquid chromatography, ion chromatography, supercritical fluid chromatography, based apparatus and the like.
  • the isotope ratio may be measured using a spectroscopy-based system, such as cavity ring- down spectroscopy (CRDS), infrared isotope spectroscopy, integrated cavity output spectroscopy, quantum cascade laser spectroscopy, based apparatus and the like.
  • the technique used to obtain the isotope ratios for the sample material is not limited to be the same technique as used to obtain the isotope ratios for the reference material.
  • the delta values for more than one analytes may be determined for each separated compound.
  • the analytes 106 can be formed for a given separated compound by the same processing step. Measurement of two or more sets of delta values allows two or more corresponding comparisons to be made with reference data values, further enhancing the reliability of the mixture identification.
  • the sample induced into the isotope ratio mass spectrometer 108 is substantially all analyte for each separated compound.
  • the analyte per se may be present in the compounds prior to separation by the gas chromatograph 104, but at least some of the mixed compounds will be processed to form the analyte.
  • the mixture compounds may be processed into the analyte using an oxidation or reduction reaction. In some embodiments, the mixture compounds may be processed into the analyte using combustion.
  • the compounds of the sample mixture are separated in time by the gas chromatograph 104.
  • the compounds may physically be separated into separate containers (e.g. fractional collection).
  • separation may be effected by chromatography, suitably gas chromatography, or by other methods such as thermogravimetric separation.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Pathology (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Computing Systems (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Theoretical Computer Science (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Other Investigation Or Analysis Of Materials By Electrical Means (AREA)

Abstract

Methods, systems and computer-readable media for identifying a sample mixture based on relative isotope ratios for a plurality of compounds in the sample mixture by correlating a set of three of more relative isotope ratios for a sample mixture with a set of three or more delta values for one of more reference mixtures, wherein the delta values are indicative of a relative isotope composition for a plurality of compounds in the reference mixtures. In this manner, the set of three or more isotope ratios for the sample mixture serves as an isotope ratio signature which is independent of any underlying calibration of the apparatus used to obtain the relative isotope ratio values.

Description

MIXTURE IDENTIFICATION
Technical Field
[0001] The present invention relates to an apparatus and method for identifying mixtures of compounds and/or compounds within a mixture. In particular, but not exclusively, the present invention relates to an apparatus and method for processing isotope ratio mass spectrometry data resulting from chemical analysis of a sample and identifying mixtures of compounds and/or compounds within a mixture for said sample.
Background
[0002] In many technical fields, it is useful to be able to identify mixtures of compounds. In some cases, it is useful to be able to identify compounds within a mixture. It is also useful to be able to identify the source of a mixture; this may involve differentiating between a synthetic mixture and a naturally occurring mixture or between two synthetic mixtures. For example, such techniques can be useful in detecting blood doping and the like.
[0003] Known methods, such as chromatography and thermogravimetric analysis, allow compounds within a mixture to be separated and analyzed. In some cases, the relative output of the analysis allows the overall mixture to be identified without specific identification of the compounds per se. In other cases, the compounds are identified.
[0004] One such known method is gas chromatography mass spectrometry (GC-MS). The output chromatograms can be used to analyze compound retention time in the gas chromatography apparatus; each mixture should have a unique chromatogram profile and therefore, by comparison with reference spectra, the mixture is identifiable. However, retention time is highly susceptible to environmental variations and so very careful control over the conditions during GC-MS is required.
Summary
[0005] According to a first aspect of the present inventions, there is provided a computer- implemented method of identifying a sample mixture based on relative isotope composition for a plurality of compounds in the sample mixture, the method comprising: receiving, at a computing device, first data comprising a set of three or more isotope ratios associated with a sample mixture comprising a plurality of unknown compounds, wherein each isotope ratio in the set of three or more isotope ratios corresponds to an isotope ratio for a respective unknown compound in the plurality of unknown compounds measured using an uncalibrated spectroscopy process; receiving, at the computing device, second data indicative of a first set of three or more delta values associated with a first reference mixture comprising a plurality of known compounds, wherein each delta value in the first set of three or more delta values is indicative of an isotope ratio for a respective known compound in the plurality of known compounds relative to a first isotope ratio for a standard compound measured using a calibrated spectroscopy process; determining, at the computing device, a correlation measure indicative of a degree of correlation between the ratios between the set of three or more isotope ratios associated with the sample mixture and the ratios between the set of three or more delta values associated with the first reference mixture, based on the first data and the second data; and generating, at the computing device, data associating the sample mixture with a mixture identifier associated with the reference mixture, when the correlation measure satisfies a mapping condition.
[0006] According to some embodiments, each known compound in the plurality of known compounds is associated with a respective compound identifier in a plurality of compound identifiers, and the method further comprises generating, at the computing device, data associating at least one compound identifier in the plurality of compound identifiers with at least one unknown compound in the plurality of unknown compounds in the sample mixture, when the correlation measure satisfies the mapping condition.
[0007] According to some embodiments, the method further comprises, at the computing device, associating data representing a peak in a chromatograph for the sample mixture with the at least one compound identifier in the plurality of compound identifiers, wherein the peak corresponds to the at least one unknown compound in the plurality of unknown compounds.
[0008] According to some embodiments, the first data is generated by application of isotope ratio spectrometry, isotope ratio mass spectrometry, cavity ring-down spectroscopy, infrared isotope spectroscopy, integrated cavity output spectroscopy, and/or quantum cascade laser spectroscopy to one or more analytes separated from the sample mixture.
[0009] According to some embodiments, the one or more analytes are separated from the sample mixture by application of gas chromatography, liquid chromatography, ion chromatography, and/or supercritical fluid chromatography to the sample mixture. [0010] According to some embodiments, the correlation measure is determined based on a regression analysis performed on the set of three or more isotope ratios associated with the sample mixture and the first set of three or more delta values associated with the first reference mixture.
[0011] According to some embodiments, the correlation measure is an R-squared measure determined based on the regression analysis.
[0012] According to some embodiments, the mapping condition specifies a threshold correlation measure.
[0013] According to some embodiments, the threshold correlation measure is indicative of a degree of correlation between the ratios between the set of three or more isotope ratios associated with the sample mixture and the ratios between a second set of three or more delta values associated with a second reference mixture.
[0014] According to a second aspect of the present invention, there is provided a non-transitory computer-readable medium comprising computer-executable instructions that, when executed by a processor in a computing device, cause the computing device to perform a method of identifying a sample mixture based on relative isotope composition for a plurality of compounds in the sample mixtures, the method comprising: receiving, at the computing apparatus, first data indicative of a set of three or more isotope ratios associated with a sample mixture comprising a plurality of unknown compounds, wherein each isotope ratio in the set of three or more isotope ratios is indicative of an isotope ratio for a respective unknown compound in the plurality of unknown compounds; receiving, at the computing device, second data indicative of a first set of three or more delta values associated with a first reference mixture comprising a plurality of known compounds, wherein each delta value in the first set of three or more delta values is indicative of an isotope ratio for a respective known compound in the plurality of known compounds relative to a first measured isotope ratio for a standard compound; determining, at the computing device, a correlation measure indicative of a degree of correlation between the set of three or more isotope ratios associated with the sample mixture and the set of three or more delta values associated with the first reference mixture, based on the first data and the second data; and generating, at the computing device, data associating the sample mixture with a mixture identifier associated with the reference mixture, when the correlation measure satisfies a mapping condition. [0015] According to a third aspect of the present invention, there is provided a system for identifying a sample mixture based on relative isotope composition for a plurality of compounds in the sample mixture, comprising at least one memory including computer program code and at least one processor in data communication with the at least one memory, wherein the at least one processor is configured to execute the computer program code and cause the system to: receive first data comprising a set of three or more isotope ratios associated with a sample mixture comprising a plurality of unknown compounds, wherein each isotope ratio in the set of three or more isotope ratios corresponds to an isotope ratio for a respective unknown compound in the plurality of unknown compounds measured using an uncalibrated spectroscopy process; receive second data indicative of a first set of three or more delta values associated with a first reference mixture comprising a plurality of known compounds, wherein each delta value in the first set of three or more delta values is indicative of an isotope ratio for a respective known compound in the plurality of known compounds relative to a first measured isotope ratio for a standard compound measured using a calibrated spectroscopy process; determine a correlation measure indicative of a degree of correlation between the ratios between the set of three or more isotope ratios associated with the sample mixture and the ratios between the set of three or more delta values associated with the first reference mixture, based on the first data and the second data; and generate data associating the sample mixture with a mixture identifier associated with the reference mixture, when the correlation measure satisfies a mapping condition.
Brief Description of the Drawings
[0016] Further features and advantages of the invention will become apparent from the following description of preferred embodiments of the invention, given by way of example only, which is made with reference to the accompanying drawings, of which:
[0017] Figure 1 is a schematic diagram showing a system for analyzing an unidentified mixture in accordance with an embodiment.
[0018] Figure 2 is a flow chart showing a method for analyzing the unidentified mixture of compounds in accordance with an embodiment.
[0019] Figure 3 is a flow chart showing a method for identifying compounds in the unidentified mixture in accordance with an embodiment. [0020] Figure 4 is a chart showing a chromatogram for the unidentified mixture in accordance with an embodiment.
[0021] Figure 5 is a chart showing a scatter plot of reference delta values plotted against a first set of sampled delta values for compounds in the unidentified mixture in accordance with an embodiment.
[0022] Figure 6 is a chart showing the chromatogram for the unidentified mixture with peaks labelled according to an embodiment.
[0023] Figure 7 is a chart showing a scatter plot of reference delta values plotted against a second set of sampled delta values for compounds in the unidentified mixture in accordance with an embodiment.
[0024] Figure 8 is a schematic diagram showing a computer system for analyzing an unidentified mixture in accordance with an embodiment.
Detailed Description
[0025] The isotope ratio of a given compound is affected by the route by which the compound is made, and in some cases, by the environmental conditions under which the compound is made. However, for stable isotopes such as 13C and deuterium, the isotope ratio will not normally change once the compound has been made. Thus, in a mixture of compounds, each compound will have a specific and typically invariable isotope ratio.
[0026] The isotope ratio of a given compound may be quantified using a "delta value" or δ, as is known in the art. The delta value provides a measure of an isotope ratio for the given compound, relative to a standard isotope ratio. The delta value for a given compound is calculated according to the following formula:
Figure imgf000006_0001
[0028] The standard isotope ratio featuring in the Equation 1 is that of a standard material and is selected depending on the element in question. Examples of standard isotope ratios used for calculation of delta values are the standard isotope ratios defined and published by the International Atomic Energy Agency (IAEA) and the National Institute of Standards and Technology (NIST).
[0029] The isotope ratio of a compound may be measured using an isotope ratio spectrometer, such as an isotope ratio mass spectrometer. In this case, the compound of interest is converted (e.g., through catalytic combustion) to obtain one or more products, which are provided to the isotope ratio spectrometer as "analytes" for isotope ratio analysis. In this context, the analytes are quantitative conversion products of the compound of interest (whose isotopic compositions cannot be determined through spectroscopic means). Examples of typical analytes include CO2 (where the isotope ratio measured is 13C/12C), SO2 (where the isotope ratio measured is 34S/32S), N2 (where the isotope ratio measured is 15N/14N), CO (where the isotope ratio measured is 180/160) H2 (where the isotope ratio measured is 2H/¾, i.e. D/H) and H2O (where the isotope ratio measured is 2H/¾, i.e. D/H, and 180/160). The corresponding delta values for these example analytes are 513C, 534S, δ15Ν, δ180 and 5D respectively.
[0030] In a laboratory setting, the delta values for an unknown compound are determined relative to the isotope ratios measured for monitoring gas (e.g. CO2), which has known delta values. Thus, the delta values for an unknown compound may calculated according to the following formula:
Figure imgf000007_0001
[0032] However, the accuracy of delta values measured according Equation 2 are sensitive to factors such as isotopic drift at different delta value magnitudes and transient changes to the delta values of the monitoring gas. Thus, in order to enable a meaningful comparison of measured delta values of the unknown sample to the known delta values of a standard or reference compound, it is necessary to calibrate the isotope ratio spectrometer to ensure that the measured isotope ratio of the monitoring gas is equal to that used for the standard or reference compound. To illustrate this problem, if the standard or reference compound is analyzed using an uncalibrated isotope ratio spectrometer, it is likely that the measured delta value will be different to those specified according to the published standard.
[0033] Calibration of the isotope ratio spectrometer is typically performed by simultaneously analyzing the sample compound and a standard compound for which the delta values are known. The delta values obtained for the standard compound are compared to the known or expected delta values to enable calculation of a correction factor which can be applied to the delta values measure for the sample. Once calibrated, the corrected delta values for the sample can be used to for identification purposes by, for example, performing a direct comparison with the delta values for a plurality of compounds with known delta values. Typically, this calibration process can be time consuming and requires specialist knowledge of the underlying isotopic ratio spectrometer being used.
[0034] In order to obviate this calibration requirement, embodiments of the present invention facilitate identification of a mixture of unknown or unidentified compounds by utilizing the relative differences between their respective isotope ratios as a basis for correlation with one or more reference mixtures. In other words, embodiments of the invention utilize the plurality of isotope ratios measured for a given mixture of compounds as a "relative isotope ratio signature" which is compared or correlated against the relative delta value signature of one or more reference mixtures, thereby facilitating identification of the given mixture. This approach is advantageous in that it does not require calibration of the monitoring gas to a particular standard and is therefore less susceptible to environmental variations when compared to prior art techniques for mixture identification.
[0035] Figure 1 shows a system 100 for analyzing a sample mixture 102 comprising a plurality of compounds in accordance with an embodiment. In this example, the identity of the sample mixture 102 and its constituent compounds are initially unknown or unidentified. The system 100 comprises a gas chromatograph 104, an isotope ratio mass spectrometer 108 and an analysis engine 112, which are configured to analyze and identify the sample mixture 102 based on the particular isotope ratios of its constituent compounds.
[0036] The gas chromatograph 104 separates the constituent compounds of the sample mixture 102 using gas chromatography, as is known in the art. The separated compounds exit the gas chromatograph 104 temporally spaced according to their respective retention times. The temporal spacing between the separated compounds introduced by the gas chromatography is maintained throughout the subsequent isotope ratio analysis to allow the isotope ratios for each separated compound to be determined individually, as discussed below.
[0037] The separated compounds output by the gas chromatograph 104 are converted (i.e. oxidized or reduced) to form one of more respective analytes 106, which are in turn delivered to the isotope ratio mass spectrometer 108 for analysis. According some embodiments, only a subset of the analytes 106 obtained from the combustion process is provided to the isotope ratio mass spectrometer 108 for analysis, depending on the particular isotope ratios to be measured. According to alternative embodiments, all analytes 106 are provided to the isotope ratio mass spectrometer 108 but only a subset of the analytes 106 are measured. In a typical example, the subset of analytes 106 provided to or measured by the isotope ratio mass spectrometer 108 may comprise CO2, H2 and/or H2O, thereby enabling the measurement of the CI C and/or D/H isotope ratios, and calculation of the respective 513C and 5D delta values for each separated compound of the sample mixture.
[0038] The isotope ratio mass spectrometer 108 is arranged to measure to the isotope ratios of the received analytes 106 and to calculate the corresponding delta values for the respective compounds of the sample mixture 102 based on an appropriate monitoring gas. The isotope ratio mass spectrometer 108 outputs data 1 10 indicative of the determined delta values for each compound of the sample mixture, and transmits data 1 10 to the analysis engine 1 12 for further processing and identification of the sample mixture 102. In some embodiments, the isotope ratio mass spectrometer 108 may output and transmit data 1 10 on a per-compound basis to account for the temporal spacing between compounds and their respective analytes introduced by the gas chromatograph 104. In an alternative embodiment, the isotope ratio mass spectrometer 108 may delay output and transmission of data 1 10 until the analytes for a predetermined number of separated compounds have been analyzed. In some embodiments, the isotope ratio mass spectrometer 108 may be uncalibrated or calibrated to an arbitrary standard, and as a result the delta values obtained for mixture of compounds may be considered as being essentially uncalibrated delta values.
[0039] According to some embodiments, the analysis engine 1 12 is a computing device provided with appropriate software for processing of the data 1 10 received from the isotope ratio mass spectrometer 108 in accordance with the embodiments described below. In some embodiments, the analysis engine 1 12 may be integrated with the isotope ratio mass spectrometer 108 in a single physical unit. In alternative embodiments, the analysis engine may be a standalone computing device which is configured to receive data 1 10 over a communications network or via a data storage medium, such as a CD-ROM or a USB storage device.
[0040] The analysis engine 1 12 is configured to perform a correlation process based on the calculated delta values for the sample mixture, represented by data 1 10, and delta values for one or more reference mixtures of known identity and composition. Data 1 14 indicative of delta values for the one or more reference mixtures is received and/or stored by the analysis engine 1 12. For example, in some embodiments, the delta values for the one or more reference mixtures may be provided to the analysis engine 1 12 by a user via a graphical user interface of the analysis engine 1 12. Alternatively, data 1 14 may be received by the analysis engine 1 12 via a communications network or a data storage medium, such as a CD-ROM or a USB storage device. In a typical example, the reference delta values represented by data 114 are those published by the Biogeochemical Laboratories, Indiana University, United States of America, for one or more "Hydrogen and Carbon Stable Isotope Reference Materials". A specific example of such a reference material is the "Fatty Acid Ester Mixture F8". In an alternative example, the reference delta values may be obtained from a sample mixture analyzed using system 100 and stored for comparison with subsequent sample mixtures.
[0041] As discussed above, the analysis engine 112 is configured to perform a correlation process on the basis of data 110 and data 114 to determine the identity of the sample mixture 102. The analysis engine 112 is configured to output the result of the correlation process as data 116, which is indicative of an association between the sample mixture 102 and the determined identity of the sample mixture 102. Further details of the correlation process performed by the analysis engine 112 and the nature of data 116 resulting from the correlation process are discussed below with reference to Figures 2 to 7.
[0042] Further details of the steps involved in identification of the sample mixture 102 using the analysis system 100 are illustrated by Figure 2, which shows a method 200 for identifying a sample mixture in accordance with an embodiment. In a first step, the sample mixture 102 is received by the analysis system 100 for identification (step S202). Next, the sample mixture 102 is separated into its constituent compounds by gas chromatography performed by the gas chromatograph 112 (step S204). Each separated compound is separately evolved by combustion to produce one or more respective analytes 106 (step S206). For example, where the compounds are organic in nature (i.e. hydrocarbons) the analytes 106 resulting from step S206 will comprise Fh, FhO and/or CO2. Next, the analytes 106 resulting from step S206 (or a subset thereof) are fed to the isotope ratio mass spectrometer 108 on a per-compound basis, and the isotope abundance is measured relative to those measured for the particular monitoring gas being used [step S208]. Once the relative isotope ratios for a given compound of the sample mixture has been measured, the corresponding delta value(s) for the given compound are calculated, based on the relative isotope ratios measured for the particular monitoring gas [step S210]. Finally, data 110 indicative of the calculated delta values for compounds in the sample mixture 102 is sent to the analysis engine 112 and a correlation process is performed based on data 110 and data 114 to identify the sample mixture 102 [step S212]. [0043] Figure 3 shows a correlation process 300 performed by the analysis engine 112 to identify the sample mixture 110 in accordance with an embodiment. In a first step, the analysis engine 112 receives data 110 indicative of the delta values calculated for a plurality of compounds in the sample mixture 102 [step S302]. Next, the analysis engine 112 receives or retrieves data 114 indicative of the delta values for a plurality of known compounds in a reference mixture [step S304]. Once the data 110 for the sample mixture 102 and the data 114 for the reference mixture is available at the analysis engine 112, a correlation measure for the respective delta values is calculated [step S306]. The calculated correlation quantifies the statistical relationship between the delta values for the sample mixture 102 and the delta values for the reference mixture, and thus indicates the likelihood that the sample mixture 102 corresponds to the reference mixture. In a typical example, the correlation measure may correspond to the well-known R-squared value calculated as part of a regression analysis performed on the respective delta values. Next, the analysis engine 112 determines whether the calculated correlation measure satisfies one or more criteria defined by a mapping condition [step S308]. In this context, when the calculated correlation measure satisfies the mapping condition, it indicates that there is a reasonable likelihood that the sample mixture is a match to the reference material and can therefore be identified as such. For example, the mapping condition may specify that the calculated correlation measure must satisfy a predetermined threshold correlation measure for the reference material. In the present embodiment, upon determining that the calculated correlation measure satisfies the mapping condition, the analysis engine 112 generates data associating the sample mixture 102 with the identity of the reference mixture [step S310]. For example, the analysis engine 112 may generate data indicating the association between the sample mixture and the identity of the reference mixture, and optionally the calculated correlation measure to provide an indication of "certainty" for the association. According to some embodiments, once the identity of the sample mixture 102 has been determined, the analysis engine 112 may optionally utilize the determined correlation between the delta values of the sample mixture 102 and the delta values of the reference mixture to identify one or more peaks in a chromatogram for the sample mixture [step S312]. This latter step is explained further below with reference to Figures 4 to 7.
[0044] According to some embodiments, the method 300 of Figure 3 may be repeated for a plurality of reference mixtures to determine a plurality of respective correlation measures. According to this approach, the sample mixture 102 may be identified as this reference mixture which is associated with the "best" correlation as indicated by the respective correlation measure.
[0045] With regard to the regression analysis discussed above with reference to step S306 of Figure 3, it will be appreciated that when a R-squared measure is employed, a minimum of three delta values will be required to calculated the R-squared correlation measure (i.e. the R- squared value for two points is always equal to 1). Consequently, for embodiments where the R-squared correlation measure is employed, it is necessary to obtain the delta values for at least three compounds in the mixture to be identified.
[0046] According to some embodiments, the correlation measure may by calculated on a subset of the delta values for the sample mixture and/or may be recalculated as additional delta value data becomes available as a result of the temporal spacing between the separated compounds introduced by the gas chromatography. Moreover, according to some embodiments, a plurality of correlation measures may be calculated for different respective subsets of the delta value data to enable identification of anomalous delta values (e.g. due to noise) and to ensure that such anomalous delta values are disregarded for the purpose of identifying the sample mixture.
[0047] Figure 4 show an example of a chromatogram 400 generated by the gas chromatograph 104 for the sample mixture 102. The chromatogram 400 is a plot of retention time 402 against intensity 404 and comprises a set of six peaks 406 which occur at different respective retention times. Each peak 406 in the chromatogram 400 corresponds to a respective compound separated from the sample mixture 110, but the identity of the respective compounds is initially unknown.
[0048] Figure 5 is a scatter chart 500 illustrating a technique for identifying compounds in the sample mixture 102 in accordance with an embodiment. In the present embodiment, the scatter chart 500 is a plot of delta values 502 (6ref) for a given reference mixture against a first set of sampled delta values 504 (6sami) for the unidentified mixture. In this respect, the scatter chart 500 shows six data points 506 corresponding to the six unidentified peaks appearing in the corresponding chromatogram 400 of Figure 4, and a line of best fit 508 determined using a linear regression analysis during the correlation process of Figure 3. In the present example, the calculated correlation measure is the R-squared value with respect to the line of best fit 508, and it is assumed that the correlation measure satisfies the mapping condition specified in step S308 of Figure 3. Thus, based on the premise that the unidentified mixture 102 corresponds to the reference mixture, it is possible to map the unidentified peaks in the chromatogram 400 of Figure 4 to known compounds in the reference mixture shown in Figure 5. To provide an example, from Figure 5 it is known that the data point 506-3 corresponds to the sampled delta value determined for a compound corresponding to the fourth peak 406-4 in the chromatogram 400 of Figure 4 and also to "compound A" of the reference mixture. On this basis, the compound corresponding to the fourth peak in chromatogram 400 may be identified as "compound A" and the analysis engine 112 may generate data indicating an association between the fourth peak 406-4 in chromatogram 400 and "compound A". In this respect, it will be noted that there is no correlation between the order of the peaks 406 in the chromatogram 400 and the magnitude of the associated sampled delta values for the unidentified mixture.
[0049] The result of the mapping described above with Figure 5 is illustrated by the chromatogram 600 of Figure 6, which has been updated to identify or "label" the fourth peak 406-4 as "compound A". Although only one peak has been identified in the present example, it will be appreciated that the technique described above with reference to Figure 5 can be employed to identity any number of peaks in the chromatogram 400 of Figure 4. Thus, the present embodiment enables identification of individual peaks in a chromatogram for the sample mixture 102, in addition to identification of the sample mixture 102 itself. This identification is based solely on the relative delta value signature of the sample mixture 102 (i.e. the relative ratios between delta values obtained for the sample mixture 102) and is therefore independent of any potential variation caused by compound retention time or other systematic variations. In particular, this identification of the unidentified mixture and individual peaks in the associated chromatogram is independent of monitoring gas used to obtain the sample delta values. Thus, identification of the unidentified mixture and the individual peaks in the associated chromatogram can be achieved without requiring calibration of the monitoring gas to a recognized isotopic standard.
[0050] Further, according so some embodiments, it may be sufficient to perform the correlation process described above based directly on the isotope ratios obtained for the sample, without converting to delta values relative to a monitoring gas. This is because the isotope ratios are essentially proportional to the delta values, so the correlation with respect to the delta values of the standard mixture is unaffected. Thus, according to these embodiments, the monitoring gas isotope ratio simply serves to scale the measured isotope ratios obtained for the sample mixture by an arbitrary and uncalibrated amount, and does not have a substantive bearing on the correlation process described above. Nevertheless, use of a monitoring gas may be desirable in situations where it is necessary or desirable to monitor for transient variations in delta value due to environmental conditions or the like.
[0051] Advantageously, application of the R-squared technique in the embodiments described above with reference Figures 2-6 ensures that the correlation measure is essentially independent of any variation in delta value magnitude and scaling between the sample delta values and the reference delta values due to calibration error or lack of calibration. Thus, in an example where mixture identification is performed on a set of sample delta values of 1, 2, 3, 4 & 5 with respect of a reference mixture with delta values 20, 30, 40, 50 & 60, an R-squared value of 1 would result irrespective of the translation and scaling between the sample and reference delta values.
[0052] To illustrate this aspect, Figure 7 illustrates the correlation between the same sample mixture 102 but, due to a different calibration of the isotope ratio spectrometer 108 (or use of a different isotope ratio spectrometer), a second set of sampled delta values 704 (6sam2) is produced, different to the first set of delta values 504 (6sami) shown in Figure 5. Specifically, the second set of sampled delta values 704 (6sam2) is scaled by an unknown amount relative to the first set of sampled delta values 704 (6sami) and remains uncalibrated with respect to the delta values 702 (6ref) for the reference mixture. In contrast, the delta values 702 (6ref) for the reference mixture are unaffected by the calibration of the isotope ratio spectrometer 108 and thus are unchanged from the values 502 (6ref) shown in Figure 5. In this respect, the scales for the x and y axes shown in Figures 5 and 7 are the same.
[0053] The scatter chart 700 of Figure 7 shows six data points 706 corresponding to the six unidentified peaks appearing in the corresponding chromatogram 400 of Figure 4, and a line of best fit 708 determined using a linear regression analysis during the correlation process of Figure 3. In this respect, it is important to note that the calculated correlation measure (e.g. the R-squared value) for the second set of sampled delta values 704 (6sam2) with respect to the line of best fit 508 will be the same as the correlation measure calculated for the first set of sampled delta values 504 (6sami), irrespective of the different calibrations used. Therefore, by relying on the relative ratios between delta values for compounds in the unidentified mixture rather than a direct comparison of absolute delta values, it is possible to perform mixture identification without requiring calibration of the underlying isotope ratio spectrometer. Indeed, according to some embodiments, identification of the sample may be performed by direct correlation between the associated sample isotope rations and the reference delta values, without the involvement of a monitoring gas, calibration or otherwise.
[0054] According to some embodiments, the analysis engine 112 may be embedded as an embedded computer device in the isotope ratio mass spectrometer 108 of Figure 1. In such embodiments, the combined analysis engine 112 and isotope ratio mass spectrometer 108 function as a standalone apparatus for identification of unidentified sample mixtures. For example, the standalone apparatus may be provided with a display means (e.g. an LCD display panel), and input means (e.g. a keyboard and mouse) which enables a user of the apparatus to control the analysis of the sample mixture and view results of the identification and correlation process described above.
[0055] At least parts of the methods discussed above with reference to Figures 1 to 6 may be implemented using software instructions stored on a computer useable storage medium for execution by a computing device. As an example, an embodiment of a computer program product includes a computer useable storage medium to store a computer readable program that, when executed on a computing device, causes the computing device to perform operations, as described hereinbefore. Furthermore, embodiments of the invention can be embodied in the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computing device or any instruction execution system. For the purposes of this description, a computer-usable or computer-readable medium can be any apparatus that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. The apparatus may be a transitory or a non-transitory computer- readable medium. For example, the computer-useable or computer-readable medium can be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device), or a propagation medium. Examples of a computer-readable medium include a semiconductor or solid state memory, magnetic tape, a removable computer diskette, a random access memory (RAM), a read-only memory (ROM), a rigid magnetic disk, and an optical disk. Current examples of optical disks include a compact disk with read only memory (CD-ROM), a compact disk with read/write (CD-R/W), and a digital versatile disk (DVD). [0056] Embodiments of the analysis engine 112 described above with reference to Figures 1 to 7 are typically performed by a computer that executes computer readable instructions. Figure 8 depicts schematically an example of a suitable computer 800 that includes a processor 802, a memory 804, a storage device 806 and a network interface 808. The processor 802 may include a multifunction processor and/or an application-specific processor, examples of which include the PowerPC™ family of processors by IBM™ and the x86 and x86-64 family of processors by INTEL™. The memory 804 within the computer is typically RAM and storage device 806 is typically a large capacity permanent storage device such as a magnetic hard disk drive or solid state memory device. The network interface 808 enables communications with other computers in a network using as suitable protocol, such as the Internet Protocol (IP) and the processor 802 executes computer readable instructions stored in storage 806 to implement embodiments of the invention as described hereinbefore with reference to Figures 1 to 7.
[0057] Although the embodiments described above have employed a gas chromatography isotope ratio mass spectrometer to determine the isotope ratios of the analytes 106, it will be appreciated that other techniques may be employed. For example the separation may be carried out using a thermogravimetric analyzer, gas chromatography, liquid chromatography, ion chromatography, supercritical fluid chromatography, based apparatus and the like. In addition, the isotope ratio may be measured using a spectroscopy-based system, such as cavity ring- down spectroscopy (CRDS), infrared isotope spectroscopy, integrated cavity output spectroscopy, quantum cascade laser spectroscopy, based apparatus and the like. Similarly, the technique used to obtain the isotope ratios for the sample material is not limited to be the same technique as used to obtain the isotope ratios for the reference material.
[0058] As discussed above, according to some embodiments the delta values for more than one analytes may be determined for each separated compound. In particularly convenient cases, the analytes 106 can be formed for a given separated compound by the same processing step. Measurement of two or more sets of delta values allows two or more corresponding comparisons to be made with reference data values, further enhancing the reliability of the mixture identification.
[0059] In some embodiments, the sample induced into the isotope ratio mass spectrometer 108 is substantially all analyte for each separated compound. The analyte per se may be present in the compounds prior to separation by the gas chromatograph 104, but at least some of the mixed compounds will be processed to form the analyte. In some embodiments, the mixture compounds may be processed into the analyte using an oxidation or reduction reaction. In some embodiments, the mixture compounds may be processed into the analyte using combustion.
[0060] According to the embodiments described above, the compounds of the sample mixture are separated in time by the gas chromatograph 104. In other embodiments, the compounds may physically be separated into separate containers (e.g. fractional collection). In some embodiments, separation may be effected by chromatography, suitably gas chromatography, or by other methods such as thermogravimetric separation.
[0061] The above embodiments are to be understood as illustrative examples of the invention. Further embodiments of the invention are envisaged. It is to be understood that any feature described in relation to any one embodiment may be used alone, or in combination with other features described, and may also be used in combination with one or more features of any other of the embodiments, or any combination of any other of the embodiments. Furthermore, equivalents and modifications not described above may also be employed without departing from the scope of the invention, which is defined in the accompanying claims.

Claims

CLAIMS What is claimed is:
1. A computer-implemented method of identifying a sample mixture based on relative isotope composition for a plurality of compounds in the sample mixture, the method comprising:
receiving, at a computing device, first data comprising a set of three or more isotope ratios associated with a sample mixture comprising a plurality of unknown compounds, wherein each isotope ratio in the set of three or more isotope ratios corresponds to an isotope ratio for a respective unknown compound in the plurality of unknown compounds measured using an uncalibrated spectroscopy process;
receiving, at the computing device, second data indicative of a first set of three or more delta values associated with a first reference mixture comprising a plurality of known compounds, wherein each delta value in the first set of three or more delta values is indicative of an isotope ratio for a respective known compound in the plurality of known compounds relative to a first isotope ratio for a standard compound measured using a calibrated spectroscopy process;
determining, at the computing device, a correlation measure indicative of a degree of correlation between the ratios between the set of three or more isotope ratios associated with the sample mixture and the ratios between the set of three or more delta values associated with the first reference mixture, based on the first data and the second data; and
generating, at the computing device, data associating the sample mixture with a mixture identifier associated with the reference mixture, when the correlation measure satisfies a mapping condition.
2. A computer-implemented method according to claim 1 , wherein each known compound in the plurality of known compounds is associated with a respective compound identifier in a plurality of compound identifiers, and the method further comprises generating, at the computing device, data associating at least one compound identifier in the plurality of compound identifiers with at least one unknown compound in the plurality of unknown compounds in the sample mixture, when the correlation measure satisfies the mapping condition.
3. A computer-implemented method according to claim 1, further comprising, at the computing device, associating data representing a peak in a chromatograph for the sample mixture with the at least one compound identifier in the plurality of compound identifiers, wherein the peak corresponds to the at least one unknown compound in the plurality of unknown compounds.
4. A computer-implemented method according to any one of the preceding claims, wherein the first data is generated by application of isotope ratio spectrometry, isotope ratio mass spectrometry, cavity ring-down spectroscopy, infrared isotope spectroscopy, integrated cavity output spectroscopy, and/or quantum cascade laser spectroscopy to one or more analytes separated from the sample mixture.
5. A computer-implemented method according to claim 4, wherein the one or more analytes are separated from the sample mixture by application of gas chromatography, liquid chromatography, ion chromatography, and/or supercritical fluid chromatography to the sample mixture.
6. A computer-implemented method according to any one of the preceding claims, wherein the correlation measure is determined based on a regression analysis performed on the set of three or more isotope ratios associated with the sample mixture and the first set of three or more delta values associated with the first reference mixture.
7. A computer-implemented method according to claim 6, wherein the correlation measure is an R-squared measure determined based on the regression analysis.
8. A computer-implemented method according to any one of the preceding claims, wherein the mapping condition specifies a threshold correlation measure.
9. A computer-implemented method according to claim 8, wherein the threshold correlation measure is indicative of a degree of correlation between the ratios between the set of three or more isotope ratios associated with the sample mixture and the ratios between a second set of three or more delta values associated with a second reference mixture.
10. A non-transitory computer-readable medium comprising computer-executable instructions that, when executed by a processor in a computing device, cause the computing device to perform a method of identifying a sample mixture based on relative isotope composition for a plurality of compounds in the sample mixtures, the method comprising: receiving, at the computing apparatus, first data indicative of a set of three or more isotope ratios associated with a sample mixture comprising a plurality of unknown compounds, wherein each isotope ratio in the set of three or more isotope ratios is indicative of an isotope ratio for a respective unknown compound in the plurality of unknown compounds;
receiving, at the computing device, second data indicative of a first set of three or more delta values associated with a first reference mixture comprising a plurality of known compounds, wherein each delta value in the first set of three or more delta values is indicative of an isotope ratio for a respective known compound in the plurality of known compounds relative to a first measured isotope ratio for a standard compound;
determining, at the computing device, a correlation measure indicative of a degree of correlation between the set of three or more isotope ratios associated with the sample mixture and the set of three or more delta values associated with the first reference mixture, based on the first data and the second data; and
generating, at the computing device, data associating the sample mixture with a mixture identifier associated with the reference mixture, when the correlation measure satisfies a mapping condition.
11. A system for identifying a sample mixture based on relative isotope composition for a plurality of compounds in the sample mixture, comprising at least one memory including computer program code and at least one processor in data communication with the at least one memory, wherein the at least one processor is configured to execute the computer program code and cause the system to:
receive first data comprising a set of three or more isotope ratios associated with a sample mixture comprising a plurality of unknown compounds, wherein each isotope ratio in the set of three or more isotope ratios corresponds to an isotope ratio for a respective unknown compound in the plurality of unknown compounds measured using an uncalibrated spectroscopy process;
receive second data indicative of a first set of three or more delta values associated with a first reference mixture comprising a plurality of known compounds, wherein each delta value in the first set of three or more delta values is indicative of an isotope ratio for a respective known compound in the plurality of known compounds relative to a first measured isotope ratio for a standard compound measured using a calibrated spectroscopy process;
determine a correlation measure indicative of a degree of correlation between the ratios between the set of three or more isotope ratios associated with the sample mixture and the ratios between the set of three or more delta values associated with the first reference mixture, based on the first data and the second data; and
generate data associating the sample mixture with a mixture identifier associated with the reference mixture, when the correlation measure satisfies a mapping condition.
PCT/GB2016/051315 2015-05-07 2016-05-09 Mixture identification WO2016178033A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP16728070.0A EP3292402A1 (en) 2015-05-07 2016-05-09 Mixture identification

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB1507841.3 2015-05-07
GB1507841.3A GB2538232A (en) 2015-05-07 2015-05-07 Mixture identification

Publications (1)

Publication Number Publication Date
WO2016178033A1 true WO2016178033A1 (en) 2016-11-10

Family

ID=53489281

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2016/051315 WO2016178033A1 (en) 2015-05-07 2016-05-09 Mixture identification

Country Status (3)

Country Link
EP (1) EP3292402A1 (en)
GB (1) GB2538232A (en)
WO (1) WO2016178033A1 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1067488A1 (en) * 1999-07-09 2001-01-10 John P. Jasper A stable isotopic identification and method for identifying products by isotopic concentration

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007142945A2 (en) * 2006-05-30 2007-12-13 Jasper John P Tracing processes between precursors and products by utilizing isotopic relationships
GB0710113D0 (en) * 2007-05-25 2007-07-04 Mass Spec Analytical Ltd Authentication of articles
US9625436B2 (en) * 2013-04-29 2017-04-18 Hewlett-Packard Development Company, L.P. Determining the geographic origin of metals

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1067488A1 (en) * 1999-07-09 2001-01-10 John P. Jasper A stable isotopic identification and method for identifying products by isotopic concentration

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
DEBAJYOTI PAUL ET AL: "Normalization of measured stable isotopic compositions to isotope reference scales - a review", RAPID COMMUNICATIONS IN MASS SPECTROMETRY., vol. 21, no. 18, 17 August 2007 (2007-08-17) - 30 September 2007 (2007-09-30), GB, pages 3006 - 3014, XP055299968, ISSN: 0951-4198, DOI: 10.1002/rcm.3185 *
MAROULA K KIOUSSI ET AL: "External calibration in Gas ChromatographyCombustionIsotope Ratio Mass Spectrometry measurements of endogenous androgenic anabolic steroids in sports doping control", JOURNAL OF CHROMATOGRAPHY, ELSEVIER SCIENCE PUBLISHERS B.V, NL, vol. 1218, no. 33, 5 June 2011 (2011-06-05), pages 5675 - 5682, XP028247422, ISSN: 0021-9673, [retrieved on 20110617], DOI: 10.1016/J.CHROMA.2011.06.014 *
YING ZHANG ET AL: "Calibration and data processing in gas chromatography combustion isotope ratio mass spectrometry", DRUG TESTING AND ANALYSIS, vol. 4, no. 12, 24 February 2012 (2012-02-24), GB, pages 912 - 922, XP055299967, ISSN: 1942-7603, DOI: 10.1002/dta.394 *

Also Published As

Publication number Publication date
GB2538232A (en) 2016-11-16
GB201507841D0 (en) 2015-06-17
EP3292402A1 (en) 2018-03-14

Similar Documents

Publication Publication Date Title
JP5427962B2 (en) Mass spectrometer, analytical method and calibration sample
JP5375411B2 (en) Chromatograph mass spectrometry data analysis method and apparatus
Blum et al. Reporting of variations in the natural isotopic composition of mercury
US20160025691A1 (en) Chromatography/mass spectrometry data processing device
Isaacman-VanWertz et al. Using advanced mass spectrometry techniques to fully characterize atmospheric organic carbon: current capabilities and remaining gaps
JPWO2008053530A1 (en) Quantitative measurement method
US10539540B2 (en) Liquid chromatograph and method for correcting detector output value fluctuation of liquid chromatograph
US20140067304A1 (en) Gas chromatograph data processing device, recording medium recording data processing program, and data processing method
US10598639B2 (en) Three-dimensional spectral data processing device and processing method
US20190086374A1 (en) Chromatographic Data System Processing Apparatus
US11592398B2 (en) Augmented Raman analysis of a gas mixture
JP4953175B2 (en) Method for improving quantitative accuracy in chromatograph / mass spectrometer
Childs et al. The use of carbon dioxide as the reaction cell gas for the separation of uranium and plutonium in quadrupole inductively coupled plasma mass spectrometry (ICP-MS) for nuclear forensic samples
Kolotov et al. iPlasmaProQuad: A Computer System Based on a Relational DBMS for Processing and Monitoring the Results of Routine Analysis by the ICP-MS Method
Hategan et al. The improvement of honey recognition models built on 1H NMR fingerprint through a new proposed approach for feature selection
EP3292402A1 (en) Mixture identification
Fernandes et al. Algorithm for automatic peak detection and quantification for GC-IMS spectra
Gucciardi et al. Simplified method for microlitre deuterium measurements in water and urine by gas chromatography–high‐temperature conversion–isotope ratio mass spectrometry
EP3276343B1 (en) Method and device for characterising an analyte
Mahani et al. A novel inductively coupled plasma atomic emission spectrometry method for uranium isotope ratio measurements using chemometric techniques
Kaňa et al. Software Solution for Post-Column Isotope Dilution Liquid Chromatography–Inductively Coupled Plasma Mass Spectrometry
Filimonov et al. Manual method of visually identifying candidate signals for a targeted peptide
Lee et al. Application of Skyline software for detecting prohibited substances in doping control analysis
Jakubowska et al. Deviations from bilinearity in multivariate voltammetric calibration models
Klus et al. Multivariate Approach to Detection of Uranium Using LIBS

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16728070

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2016728070

Country of ref document: EP