CN110501294A - A kind of multivariate calibration methods based on information fusion - Google Patents

A kind of multivariate calibration methods based on information fusion Download PDF

Info

Publication number
CN110501294A
CN110501294A CN201910725286.2A CN201910725286A CN110501294A CN 110501294 A CN110501294 A CN 110501294A CN 201910725286 A CN201910725286 A CN 201910725286A CN 110501294 A CN110501294 A CN 110501294A
Authority
CN
China
Prior art keywords
matrix
column
vector
prediction
row
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910725286.2A
Other languages
Chinese (zh)
Other versions
CN110501294B (en
Inventor
吴雪梅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian University
Original Assignee
Xian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian University filed Critical Xian University
Priority to CN201910725286.2A priority Critical patent/CN110501294B/en
Publication of CN110501294A publication Critical patent/CN110501294A/en
Application granted granted Critical
Publication of CN110501294B publication Critical patent/CN110501294B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/17Systems in which incident light is modified in accordance with the properties of the material investigated
    • G01N21/25Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
    • G01N21/31Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/17Systems in which incident light is modified in accordance with the properties of the material investigated
    • G01N21/25Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
    • G01N21/31Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
    • G01N21/35Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light
    • G01N21/359Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light using near infrared light
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis

Abstract

The present invention relates to chemometric techniques fields, and in particular to a kind of multivariate calibration methods based on information fusion.A kind of multivariate calibration methods based on information fusion, including first part: model is established;Second part: predicted portions.The present invention proposes a kind of multivariate calibration methods based on information fusion, and after being merged by the method that this method provides to information, predicted root mean square error, which is significantly less than, to be fusedKThe predicted root mean square error of a method;Fusion method of the present invention can offset the prediction error of each algorithm, to reduce total prediction error.

Description

A kind of multivariate calibration methods based on information fusion
Technical field
The present invention relates to chemometric techniques fields, and in particular to a kind of multivariate calibration methods based on information fusion.
Background technique
Multivariate Correction is widely used in Chemical Measurement, and main function is that founding mathematical models describe dependent variable (such as group score value) with independent variable (such as spectrum) between relationship, then utilize established mathematical model to unknown sample dependent variable into Row prediction.By taking near-infrared spectrum analysis as an example, the process of Multivariate Correction is generally as follows: first being carried out experiment by certain rule and is set Meter is selected or is prepared corresponding multicomponent sample by experimental design;Sample is put near infrared spectrometer again to measure, is obtained The independent variable (spectrum) of each sample;Then using multivariate calibration methods (such as offset minimum binary) to independent variable (spectrum) and dependent variable (group score value) is modeled;The independent variable of unknown sample (spectrum) is finally substituted into established model, obtains unknown sample Dependent variable (group score value).
Currently used multivariate calibration methods mainly have classical least square (CLS), main composition to return (PCR), partially minimum Two multiply (PLS) etc..Every kind of multivariate calibration methods have the advantage and disadvantage of its own, are difficult to determine which kind of algorithm can obtain optimum prediction As a result.In practical applications, all there is error in every kind of multivariate calibration methods prediction result.If by multiple Multivariate Correction algorithms Prediction result is merged, and the prediction error of component is made to cancel out each other, then more accurate prediction result can be obtained.
Summary of the invention
The present invention is directed in view of the above-mentioned problems, proposing a kind of Multivariate Correction algorithm based on information fusion, fusion treatment is each The prediction result of algorithm makes it predict that error is cancelled out each other, to improve prediction accuracy.
Technical program of the present invention lies in:
A kind of multivariate calibration methods based on information fusion,
(1) model is established:
Dependent variable matrix Yc is pcRow m column;It indicates: total pcA sample, each sample have m dependent variable;
Independent variable matrix Xc is pcRow q column;It indicates: total pcA sample, each sample have q channel measurements;
Dependent variable matrix Yv is pvRow m column;Independent variable matrix Xv is pvRow q column;
Unknown sample independent variable xuFor q member row vector;
Detailed process is as follows:
1) the K Multivariate Correction algorithm used is determined;
2) independent variable matrix Xc, dependent variable matrix Yc are modeled using each Multivariate Correction algorithm, obtains K prediction model;
3) it is utilized respectively each prediction model and acts on independent variable matrix Xv, obtain i-th (i=1,2,3 ..., m) a dependent variable Predict column vector ypvk, and calculate the prediction error vector of a i-th of dependent variable of prediction model of kth (k=1,2 ..., K)
ek=ypvk-yvi (1)
Wherein, ypvkPrediction result for kth (k=1,2 ..., K) a prediction model to i-th of dependent variable, yviFor matrix Yv's I-th column vector, i.e. the i-th column of dependent variable matrix Yv;ekFor the prediction error vector of k-th of prediction model.
4) the prediction error vector is formed into pvRow K column matrix e
E=[e1 e2 … eK] (2)
5) the autocorrelation matrix R of matrix e is calculated
6) column vector w is calculatedi
In formula, a is the row vector of K member, and each element is [1,1 ..., 1] 1, a=;
(2) predicted portions:
7) unknown sample independent variable x is acted on K prediction model respectivelyu, obtain the prediction result y of component iuk(k=1,2 ..., K), and vector y=[y is formedu1,yu2,…,yuK]T
Calculate fusion forecasting result
The technical effects of the invention are that:
The present invention proposes a kind of multivariate calibration methods based on information fusion, is melted by the method that this method provides to information After conjunction, predicted root mean square error is significantly less than the predicted root mean square error for K method being fused;Fusion method of the present invention The prediction error of each algorithm can be offset, to reduce total prediction error.
Detailed description of the invention
Fig. 1 is a kind of flow chart of the multivariate calibration methods based on information fusion of the present invention.
Specific embodiment
Present embodiment is tested with analogue data, and sample is mixed by 4 kinds of pure components in test, measure spectrum wave Number range is 10000~12500cm-1, sampling interval 10cm-1(i.e. each sample has q=251 channel measurements), it is single The spectrum of position pure component is respectively as follows:
In formula, n (n=1,2,3 ..., N) is wave number serial number, λnFor n-th of wave number value, it can use 10000,10010 respectively, 10020 ..., 12500;
The spectrum of pure component is denoted as 4 row q column matrix Xs
If modeling group sub-matrix used is random quantity matrix Yc (the i.e. p of 100 rows 4 columnc=100, m=4), it establishes Independent variable column matrix Xc used in model is
Xc=YcXs+Ec
Wherein, the line number p of Xcc=100, each element of columns q=251, Yc are obeyed (0,1) and are uniformly distributed;
Ec is spectral measurement error matrix, each of which element all obeys zero-mean normal distribution, and variance is 0.0001.
For determining that the group score matrix of fusion parameters is pv=100 rows, the random matrix Yv of m=4 column;
For determining that the independent variable matrix Xv of fusion parameters is
Xv=YvXs+Ev
Wherein, the line number p of Xvc=100, each element of columns q=251, Yv are obeyed (0,1) and are uniformly distributed;
Ev is spectral measurement error matrix, each of which element all obeys zero-mean normal distribution, and variance is 0.0001.
The group score value of unknown sample is m (=4) column random vector yu, the independent variable of the unknown sample for prediction is vector xuFor
xu=yuXs+eu
Wherein, xuFor q (=251) column vector, yuEach element obey (0,1) be uniformly distributed;
euFor spectral measurement error vector, each of which element all obeys zero-mean normal distribution, and variance is 0.0001.
In the present embodiment, the method that is fused is PLS, PCR, CLS tri-, i.e. K=3;Repeat to generate 100 in prediction in advance Survey spectral vector xu, each prediction spectrum is predicted using pre- flow gauge, obtains the prediction result of each component, and calculate The predicted root mean square error of each group score value.
A kind of process of the multivariate calibration methods based on information fusion is as follows:
(1) model is established:
Dependent variable matrix Yc is pcRow m column;It indicates: total pcA sample, each sample have m dependent variable;
Independent variable matrix Xc is pcRow q column;It indicates: total pcA sample, each sample have q channel measurements;
Dependent variable matrix Yv is pvRow m column;Independent variable matrix Xv is pvRow q column;
Unknown sample independent variable xuFor q member row vector.
1) tri- Multivariate Correction algorithms of PLS, PCR, CLS, i.e. K=3 are determined;
2) above-mentioned 3 Multivariate Correction algorithms are utilized respectively to model independent variable matrix Xc and dependent variable matrix Yc, obtain 3 predictions Model;
3) above-mentioned 3 prediction models act on independent variable matrix Xv, obtain the prediction column of i-th (i=1,2 ..., m) a dependent variable Vector ypvk, and calculate the prediction error vector of a i-th of dependent variable of prediction model of kth (k=1,2 ..., K)
ek=ypvk-yvi (1)
Wherein, ypvkPrediction result for kth (k=1,2 ..., K) a prediction model to i-th of dependent variable, yviFor matrix Yv's I-th column vector, i.e. the i-th column of dependent variable matrix Yv;ekFor the prediction error vector of k-th of prediction model.
4) the prediction error vector is formed into pvRow K column matrix e
E=[e1 e2 … eK] (2)
5) the autocorrelation matrix R of matrix e is calculated
In formula, subscript T representing matrix transposition;
6) column vector w is calculatedi
In formula, a is the row vector of K member, and each element is [1,1 ..., 1] 1, a=.
(2) pre- flow gauge:
7) unknown sample independent variable x is acted on 3 prediction models respectivelyu, obtain the prediction result y of component iuk(k=1, 2 ..., K), and form vector y=[yu1,yu2,…,yuK]T
Calculate fusion forecasting result
It is evaluated with performance of the predicted root mean square error to each method.The calculation formula of predicted root mean square error is as follows:
In formula, riFor the true value of dependent variable, correspond to each y in embodimentuIn element (i.e. each time prediction true component Value), rpiFor the prediction result of multivariate calibration methods, I is the sample size of forecast set, the present embodiment 100.
After the completion of modeling, the calculated fusion vector of step 6) is as shown in table 1.
Table 1 merges vector value
Method Component 1 Component 2 Component 3 Component 4
PLS 0.0126 -0.0741 -0.0090 0.1920
PCR -0.9084 -0.7617 -0.8648 -0.9943
CLS 1.8958 1.8358 1.8738 1.8024
In table, each column indicate that a fusion vector, i.e., the i-th column (i=1,2,3,4) indicate wiValue.
The predicted root mean square error of each method is as shown in table 2.
2 each method predicted root mean square error of table
As can be seen from Table 2, after fusion treatment through the invention, predicted root mean square error be significantly less than PLS, PCR with The predicted root mean square error of CLS, the fusion method can be missed with the prediction error of each algorithm of partial offset to reduce total prediction Difference, this blending algorithm are very effective.

Claims (1)

1. a kind of multivariate calibration methods based on information fusion, this method are as follows:
(1) model is established:
Dependent variable matrix Yc is pcRow m column;It indicates: total pcA sample, each sample have m dependent variable;
Independent variable matrix Xc is pcRow q column;It indicates: total pcA sample, each sample have q channel measurements;
Dependent variable matrix Yv is pvRow m column;Independent variable matrix Xv is pvRow q column;
Unknown sample independent variable xuFor q member row vector;
It is characterized by:
1) the K Multivariate Correction algorithm used is determined;
2) independent variable matrix Xc, dependent variable matrix Yc are modeled using each Multivariate Correction algorithm, obtains K prediction model;
3) it is utilized respectively each prediction model and acts on independent variable matrix Xv, obtain i-th (i=1,2,3 ..., m) a dependent variable Predict column vector ypvk, and calculate the prediction error vector of a i-th of dependent variable of prediction model of kth (k=1,2 ..., K)
ek=ypvk-yvi (1)
Wherein, ypvkPrediction result for kth (k=1,2 ..., K) a prediction model to i-th of dependent variable, yviFor matrix Yv's I-th column vector, i.e. the i-th column of dependent variable matrix Yv;ekFor the prediction error vector of k-th of prediction model;
4) the prediction error vector is formed into pvRow K column matrix e
E=[e1 e2 … eK] (2)
5) the autocorrelation matrix R of matrix e is calculated
6) column vector w is calculatedi
In formula, a is the row vector of K member, and each element is [1,1 ..., 1] 1, a=;
(2) it predicts:
7) unknown sample independent variable x is acted on K prediction model respectivelyu, obtain the prediction result y of component iuk(k=1,2 ..., K), and vector y=[y is formedu1,yu2,…,yuK]T
Calculate fusion forecasting result
CN201910725286.2A 2019-08-07 2019-08-07 Multivariate correction method based on information fusion Active CN110501294B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910725286.2A CN110501294B (en) 2019-08-07 2019-08-07 Multivariate correction method based on information fusion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910725286.2A CN110501294B (en) 2019-08-07 2019-08-07 Multivariate correction method based on information fusion

Publications (2)

Publication Number Publication Date
CN110501294A true CN110501294A (en) 2019-11-26
CN110501294B CN110501294B (en) 2021-09-28

Family

ID=68586859

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910725286.2A Active CN110501294B (en) 2019-08-07 2019-08-07 Multivariate correction method based on information fusion

Country Status (1)

Country Link
CN (1) CN110501294B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111024229A (en) * 2019-11-28 2020-04-17 天津津航技术物理研究所 Single-chip integrated spectral imaging micro-system spectral data correction method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101608914A (en) * 2009-07-23 2009-12-23 武汉大学 RPC parametric optimization method based on multi-collinearity analysis
JP2012113676A (en) * 2010-11-29 2012-06-14 Internatl Business Mach Corp <Ibm> Configuring method, system, and program for controller
CN104089911A (en) * 2014-06-27 2014-10-08 桂林电子科技大学 Spectral model transmission method based on unary linear regression
CN104123451A (en) * 2014-07-16 2014-10-29 河海大学常州校区 Dredging operation yield prediction model building method based on partial least squares regression
CN105004278A (en) * 2015-07-10 2015-10-28 东南大学 Real-time base line and denoising processing method based on distributed sensing and wavelet analyzing technologies
CN107958292A (en) * 2017-10-19 2018-04-24 山东科技大学 Transformer based on cost sensitive learning obscures careful reasoning method for diagnosing faults

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101608914A (en) * 2009-07-23 2009-12-23 武汉大学 RPC parametric optimization method based on multi-collinearity analysis
JP2012113676A (en) * 2010-11-29 2012-06-14 Internatl Business Mach Corp <Ibm> Configuring method, system, and program for controller
CN104089911A (en) * 2014-06-27 2014-10-08 桂林电子科技大学 Spectral model transmission method based on unary linear regression
CN104123451A (en) * 2014-07-16 2014-10-29 河海大学常州校区 Dredging operation yield prediction model building method based on partial least squares regression
CN105004278A (en) * 2015-07-10 2015-10-28 东南大学 Real-time base line and denoising processing method based on distributed sensing and wavelet analyzing technologies
CN107958292A (en) * 2017-10-19 2018-04-24 山东科技大学 Transformer based on cost sensitive learning obscures careful reasoning method for diagnosing faults

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111024229A (en) * 2019-11-28 2020-04-17 天津津航技术物理研究所 Single-chip integrated spectral imaging micro-system spectral data correction method

Also Published As

Publication number Publication date
CN110501294B (en) 2021-09-28

Similar Documents

Publication Publication Date Title
CN102944583B (en) Metal-oxide gas sensor array concentration detecting method based on drift compensation
Olivieri et al. Standard error of prediction in parallel factor analysis of three-way data
CN110455722A (en) Rubber tree blade phosphorus content EO-1 hyperion inversion method and system
CN101750404B (en) Method for correcting plasma emission spectral line self-absorption effect
CN107271382A (en) A kind of different growing rape leaf SPAD value remote sensing estimation methods
CN107958267B (en) Oil product property prediction method based on spectral linear representation
Sakowska et al. Monitoring of carbon dioxide fluxes in a subalpine grassland ecosystem of the Italian Alps using a multispectral sensor
Dillner et al. Predicting ambient aerosol thermal-optical reflectance (TOR) measurements from infrared spectra: organic carbon
CN104990895A (en) Near infrared spectral signal standard normal correction method based on local area
Wang et al. Nonlinear decline-rate dependence and intrinsic variation of type Ia supernova luminosities
Mattern et al. Improving variational data assimilation through background and observation error adjustments
Cysneiros et al. Modeling of tree height–diameter relationships in the Atlantic Forest: effect of forest type on tree allometry
CN107966499B (en) Method for predicting crude oil carbon number distribution by near infrared spectrum
CN110501294A (en) A kind of multivariate calibration methods based on information fusion
Xiang et al. Simultaneous identification of geographical origin and grade of flue-cured tobacco using NIR spectroscopy
CN111896497B (en) Spectral data correction method based on predicted value
CN104865228A (en) Quantitative laser-induced breakdown spectroscopy (LIBS) detecting method based on fusion entropy optimization
CN105787518B (en) A kind of near infrared spectrum preprocess method based on kernel projection
Hjelkrem et al. Sensitivity analysis and Bayesian calibration for testing robustness of the BASGRA model in different environments
CN105092509A (en) Sample component measurement method based on PCR-ELM algorithm
Kalivas et al. Selectivity‐relaxed classical and inverse least squares calibration and selectivity measures with a unified selectivity coefficient
刘秀英 et al. Hyperspectral model for estimation of soil potassium content in loessal soil
US11754539B2 (en) System and computer-implemented method for extrapolating calibration spectra
CN103837484B (en) A kind of angular multivariable technique for eliminating the spectrum property taken advantage of random error
CN106872397A (en) A kind of method based on existing calibration model quick detection agricultural product chemical constituent

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant