CN110672552B - Confidence coefficient estimation method for vehicle fuel oil near infrared spectrum detection result - Google Patents
Confidence coefficient estimation method for vehicle fuel oil near infrared spectrum detection result Download PDFInfo
- Publication number
- CN110672552B CN110672552B CN201910910971.2A CN201910910971A CN110672552B CN 110672552 B CN110672552 B CN 110672552B CN 201910910971 A CN201910910971 A CN 201910910971A CN 110672552 B CN110672552 B CN 110672552B
- Authority
- CN
- China
- Prior art keywords
- test
- sample
- confidence
- near infrared
- detection result
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 29
- 238000000034 method Methods 0.000 title claims abstract description 27
- 238000002329 infrared spectrum Methods 0.000 title claims abstract description 26
- 239000000295 fuel oil Substances 0.000 title description 2
- 238000009826 distribution Methods 0.000 claims abstract description 18
- 239000000446 fuel Substances 0.000 claims abstract description 6
- 238000012360 testing method Methods 0.000 claims description 68
- 238000012937 correction Methods 0.000 claims description 51
- 239000011159 matrix material Substances 0.000 claims description 50
- 230000002159 abnormal effect Effects 0.000 claims description 33
- 238000001228 spectrum Methods 0.000 claims description 32
- 230000003595 spectral effect Effects 0.000 claims description 28
- 238000000354 decomposition reaction Methods 0.000 claims description 16
- 238000012545 processing Methods 0.000 claims description 7
- 238000000513 principal component analysis Methods 0.000 abstract description 22
- 238000004458 analytical method Methods 0.000 abstract description 8
- 238000004364 calculation method Methods 0.000 abstract description 4
- 238000005516 engineering process Methods 0.000 abstract description 3
- 238000004445 quantitative analysis Methods 0.000 abstract description 3
- 238000003745 diagnosis Methods 0.000 abstract description 2
- 238000011156 evaluation Methods 0.000 abstract description 2
- 239000002283 diesel fuel Substances 0.000 description 20
- 239000003502 gasoline Substances 0.000 description 20
- 239000003921 oil Substances 0.000 description 12
- 230000001186 cumulative effect Effects 0.000 description 9
- 230000008859 change Effects 0.000 description 6
- 230000008569 process Effects 0.000 description 4
- 230000007423 decrease Effects 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 238000004088 simulation Methods 0.000 description 3
- 238000004497 NIR spectroscopy Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 230000002547 anomalous effect Effects 0.000 description 1
- 238000002790 cross-validation Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000004886 process control Methods 0.000 description 1
- 238000012797 qualification Methods 0.000 description 1
- 238000004451 qualitative analysis Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N21/00—Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
- G01N21/17—Systems in which incident light is modified in accordance with the properties of the material investigated
- G01N21/25—Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
- G01N21/31—Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry
- G01N21/35—Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light
- G01N21/359—Investigating relative effect of material at wavelengths characteristic of specific elements or molecules, e.g. atomic absorption spectrometry using infrared light using near infrared light
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
Abstract
The invention provides a confidence estimation method for a vehicle fuel near infrared spectrum detection result, which is used for completing the calculation of Mahalanobis distance based on principal component analysis, obtaining a significance level according to F distribution on the basis and completing the confidence estimation of the detection result of a sample to be detected. And according to the Mahalanobis distance and the statistic obtained by the Mahalanobis distance, obeying to F distribution, confidence coefficient estimation on the reliability of the detection result is provided for the practical application of the near infrared spectrum analysis technology, and a quantitative basis is provided for the next qualitative diagnosis of the analysis object or the effectiveness evaluation of the quantitative analysis result.
Description
Technical Field
The invention relates to near infrared spectrum anomaly detection of vehicle fuel oil, in particular to a near infrared detection result confidence degree estimation method based on Mahalanobis distance.
Background
Based on group frequency and frequency multiplication absorption of hydrogen group stretching vibration in organic molecules, the near infrared spectrum can establish a linear or nonlinear relation between the spectrum and a quality index through a chemometrics method, quickly and efficiently complete qualitative and quantitative analysis of a sample, and can overcome the defects of complicated process, high cost, low efficiency and the like in the traditional oil analysis technology.
In recent years, the near infrared spectroscopy is widely and more mature to be applied to the measurement of the content of various components of oil products so as to improve the production management and quality supervision level of the oil products. In the acquisition process of the near infrared spectrum, abnormal spectrum data can be generated due to factors such as change of sample properties, change of experimental conditions, measurement errors of instruments, artificial measurement errors and the like; the presence of abnormal spectra affects the data characteristic performance, and further reduces the reliability of the spectrum detection result. Therefore, identifying and rejecting abnormal samples is a necessary condition for building a reliable near-infrared analysis model. Common abnormal sample point elimination methods include Mahalanobis Distance (MD), a lever method, monte carlo cross validation, and the like. However, in the actual process of oil product rapid detection, the difference of the same oil product spectral data is often unavoidable in consideration of different production processes and adulteration possibility existing in different oil refineries. Therefore, simple abnormal sample rejection is often not desirable, and enterprises need to provide a suitable judgment standard (for example, the confidence level is not less than 80%) to complete the judgment and screening of samples, which is important for the rapid detection of oil quality indexes and further diagnostic analysis.
However, in the field of near infrared spectroscopy, there is no mature confidence estimation method for the detection result of the spectral data. In the field of process control, the quadratic calculation of mahalanobis distance (PCA-MD) due to Principal Component Analysis (PCA) is equivalent to hotelling T2PCA-MD is commonly used for T2Checking; by T2The comparison of the control limit and the square of the PCA-MD value can judge whether the sample to be tested is in a normal state. On the basis of the above, T is combined2And the statistic accords with the F distribution, so that the calculation of the significance level of the sample can be completed. Therefore, by means of the data distribution idea, the significance level of the sample can be calculated by calculating the square value of near infrared spectrum data PCA-MD, and the confidence degree of the detection result can be further estimated.
Disclosure of Invention
Aiming at the rapid detection of oil products, the invention provides a near-infrared detection result confidence coefficient estimation method based on Mahalanobis distance, so that the qualification judgment of a sample can be conveniently and effectively completed according to the confidence coefficient in practical application, and the reliability of an analysis result can be ensured.
The method completes the calculation of the Mahalanobis distance based on the principal component analysis, obtains the significance level according to the F distribution on the basis, and completes the confidence estimation of the detection result of the sample to be detected.
The implementation of the process specifically comprises the following steps:
s1, carrying out standardization processing on spectral data to obtain correction set spectral data X;
s2, adopting PCA-MD T2Detecting and removing abnormal samples in the correction set to ensure that the spectral data X in the correction set are all normal samples;
s3, carrying out PCA decomposition on the spectrum data X of the correction set, and combining the spectrum data X of the test settestCalculating the squared value of the Mahalanobis distance between the sample to be measured and the sample in the correction set
S4, according toCalculating the significance level alpha according to the F distributiontestThen obtaining confidence coefficient c of the near infrared spectrum detection resulttest。
Step S2 includes:
s21: the PCA decomposition of the calibration set spectral data can be expressed as:
in the formula, T is belonged to Rn×pFor the score matrix, n represents the number of samples, P represents the number of principal components, and P belongs to Rm×pM represents the number of variables, which is a load matrix;
s22: the square of the PCA-MD value of the ith sample of the calibration set spectral data can be expressed as:
wherein, tiRepresenting the ith row vector of the scoring matrix T, and sigma being the covariance matrix of T;
S23:T2the control limit may be expressed as:
where α is the significance level and the confidence in the control limits is 1- α. At this time, ifIf the value is less than the control limit, judging the sample to be a normal sample; if it isIf the value is larger than the control limit, the abnormal sample is judged.
Step S3 includes:
s31: after the abnormal samples are removed, PCA decomposition is carried out on the spectrum data X of the correction set according to a formula (1), and the covariance matrix sigma of the load matrix P and the scoring matrix T is updated;
s32: calculating a score matrix of the spectral data of the sample set to be detected:
Ttest=XtestP (4)
in the formula, XtestA sample set to be detected and a correction set load matrix P are obtained;
s33: the square value of the Mahalanobis distance between the ith sample to be detected and the calibration setCan be expressed as:
in the formula, ttest-iScore matrix T representing sample set to be testedtestThe ith row vector of (1).
Step S4 includes:
s41: significance level α is achieved based on F distribution with p and n-p degrees of freedomtestFor the ith test set sample, the significance level αtest-iCan be obtained according to the following formula:
s42: confidence level c for the ith test set sampletest-iCan be expressed as:
ctest-i=1-αtest-i (7)
the method has the advantages that according to the Mahalanobis distance and the statistic obtained by the Mahalanobis distance, the F distribution obeys, confidence degree estimation of the reliability of the detection result is provided for the practical application of the near infrared spectrum analysis technology, and a quantitative basis is provided for the next qualitative diagnosis of an analysis object or the effectiveness evaluation of the quantitative analysis result.
Drawings
FIG. 1 is a flow chart of a PCA-MD based near infrared anomalous spectral confidence quantification method;
FIG. 2 is T taken from PCA-MD2Checking and eliminating a line graph of abnormal samples in the correction set;
FIG. 3 is a sample confidence estimate line plot of a near infrared spectrum of diesel blended gasoline;
FIG. 4 is a sample confidence estimate line graph for simulation case 1;
fig. 5 is a line graph of confidence estimates for simulation case 2.
Detailed description of the preferred embodiments
The technical scheme adopted by the method for performing confidence estimation on the oil product near infrared spectrum detection result is as follows:
s1, carrying out standardization processing on spectral data to obtain correction set spectral data X;
s2, adopting PCA-MD T2Detecting and removing abnormal samples in the correction set to ensure that the spectral data X in the correction set are all normal samples;
s3, carrying out PCA decomposition on the spectrum data X of the correction set, and combining the spectrum data X of the test settestCalculating the squared value of the Mahalanobis distance between the sample to be measured and the sample in the correction set
S4, according toCalculating the significance level alpha by following the F distributiontestThen obtaining confidence coefficient c of the near infrared spectrum detection resulttest。
Step S2 includes:
s21: the PCA decomposition of the calibration set spectral data can be expressed as:
in the formula, T is belonged to Rn×pFor the score matrix, n represents the number of samples, P represents the number of principal components, and P belongs to Rm×pM represents the number of variables as a load matrix;
s22: the square of the PCA-MD value of the ith sample of the calibration set spectral data can be expressed as:
Wherein, tiRepresenting the ith row vector of the scoring matrix T, and sigma being the covariance matrix of T;
S23:T2the control limit may be expressed as:
where α is the significance level (typically set at 0.01 or 0.05) and the confidence of the control limit is 1- α. At this time, ifIf the value is less than the control limit, judging the sample to be a normal sample; if it isIf the value is larger than the control limit, the abnormal sample is judged.
Step S3 includes:
s31: after the abnormal samples are removed, PCA decomposition is carried out on the spectrum data X of the correction set according to a formula (1), and the covariance matrix sigma of the load matrix P and the scoring matrix T is updated;
s32: calculating a score matrix of the spectral data of the sample set to be detected:
Ttest=XtestP (4)
in the formula, XtestA sample set to be detected and a correction set load matrix P are obtained;
s33: the square value of the Mahalanobis distance between the ith sample to be detected and the calibration setCan be expressed as:
in the formula, ttest-iScore matrix T representing sample set to be testedtestThe ith row vector of (1).
Step S4 includes:
s41: significance level α is achieved based on F distribution with degrees of freedom p and n-ptestFor the ith test set sample, the significance level αtest-iCan be obtained according to the following formula:
s42: confidence level c for the ith test set sampletest-iCan be expressed as:
ctest-i=1-αtest-i (7)
example 1:
taking the detection of a gasoline sample doped with a certain percentage of diesel oil as an example. The method comprises the steps of carrying out spectrum collection on diesel oil and gasoline samples provided by main oil refineries in south-johnson of Shandong through a near-infrared spectrometer with the model number of Thermo Fisher Antaris II, and fitting as a correction set. And simultaneously collecting the near infrared spectrum of the doped gasoline as a test set.
S1, carrying out standardization processing on spectral data to obtain correction set spectral data X;
s2, adopting PCA-MD T2Inspection rejectionCorrecting the concentrated abnormal samples to ensure that the spectral data X of the correction set are all normal samples;
s3, carrying out PCA decomposition on the spectrum data X of the correction set, and combining the spectrum data X of the test settestCalculating the squared value of the Mahalanobis distance between the sample to be measured and the sample in the correction set
S4, according toCalculating the significance level alpha by following the F distributiontestThen obtaining confidence coefficient c of the near infrared spectrum detection resulttest。
The present invention is further detailed by simulating the above method by MATLAB in conjunction with fig. 1:
the first step is as follows: and completing the sample division and data standardization processing of the correction set and the test set. The calibration set comprises 81 pure gasoline near infrared spectrum samples, the test set comprises 1 pure gasoline spectrum sample and 10 gasoline spectrum samples respectively doped with diesel oil with different contents, and the diesel oil contents respectively account for 5.26%, 5.88%, 8.33%, 9.09%, 10%, 11.11%, 12.5%, 14.29%, 16.67% and 20%.
The second step is that: carrying out PCA model decomposition on the spectrum data of the correction set so as to calculate the square value of the Mahalanobis distanceThen according to T2The control limit (α set to 0.05) determines the presence or absence of an abnormal sample point. Since the gasoline near infrared spectrum data has too many variables (wavelength points), the first 6 principal components are selected here to analyze the differential contribution rate. As can be seen from table 1, after the number of principal components exceeds 3 by PCA decomposition, the cumulative variance contribution rate does not increase significantly, so 3 principal components are selected to calculate MD values and abnormal values are eliminated. As can be seen from fig. 2, the red dotted line represents the 95% confidence control limit, and the samples 53 and 54 are significantly out of the control limit range, and therefore are determined to be abnormal samples. After the abnormal samples are eliminated, the number of the samples in the correction set is 79。
TABLE 1 influence of the number of principal components of the PCA model on the contribution rate and the cumulative contribution rate
The third step: carrying out PCA decomposition on the correction set without the abnormal samples and updating the covariance matrix sigma of the load matrix P and the score matrix T, and then combining the test spectrum data XtestCompleting the test set spectral data scoring matrix TtestAnd the squared value of the mahalanobis distance between the test set and the correction set samplesAs can be seen from table 2, after the abnormal samples in the correction set are removed, the variance contribution rate and the cumulative variance contribution rate of the principal component of the PCA model slightly change, and according to the criterion that the cumulative variance contribution rate does not significantly rise, 3 principal components are still selected to update the covariance matrix Σ of the load matrix P and the score matrix T.
TABLE 2 influence of PCA model principal component number on contribution rate and cumulative contribution rate
The fourth step: according toLevel of significance achieved for F distributions with set and degrees of freedom 3 and 76testAnd confidence ctestAnd (4) calculating. As shown in fig. 3, the confidence of the pure gasoline near infrared spectrum sample in the test set is above 90%; after 5.26% of diesel oil is doped, the confidence coefficient of the sample is rapidly reduced to about 20%; when 5.88% -11.11% of diesel oil is doped, the confidence coefficient reduction trend is not obvious, but is between 20% -35%, and the condition can be caused by uneven mixing of samples or partial volatilization of gasoline in actual operation; when the content of the doped diesel oil exceeds 11.11 percent, the confidence of the sample is close to 30 percent Gradually decreases to about 1%.
Example 2:
the mixture of diesel oil and gasoline is simulated by taking the single spectrum as an example. A near-infrared spectrometer with the model number of Thermo Fisher Antaris II is adopted to carry out spectrum collection on diesel oil and gasoline samples provided by main oil refineries in south and Ji of Shandong. The calibration set is 81 pure gasoline near infrared spectrum samples in the embodiment 1, 11 samples in the test set are respectively formed by adding 1 gasoline spectrum and 1 diesel spectrum according to a specific proportion, and the diesel content respectively accounts for 0 percent (pure gasoline), 2 percent, 4 percent, 6 percent, 8 percent, 10 percent, 12 percent, 14 percent, 16 percent, 18 percent and 20 percent.
The present invention is further detailed by simulating the above method by MATLAB in conjunction with fig. 1:
the first step is as follows: and completing the data standardization processing of the correction set and the test set.
The second step is that: carrying out PCA model decomposition on the spectrum data of the correction set so as to calculate the square value of the Mahalanobis distanceThen according to T2The control limit (α set to 0.05) determines the presence or absence of an abnormal sample point. Since the correction set is consistent with the correction set of embodiment 1, 3 principal components are still selected to calculate the MD values of the correction set and to remove the abnormal values, and after the abnormal samples are removed, the number of samples in the correction set is 79.
The third step: carrying out PCA decomposition on the correction set after the abnormal sample is removed, updating the covariance matrix sigma of the load matrix P and the score matrix T, and then combining the test collected spectrum data XtestCompleting the test set spectral data scoring matrix TtestAnd the squared value of the mahalanobis distance between the test set and the correction set samplesAs can be seen from Table 2, after the abnormal samples in the correction set are eliminated, the variance contribution rate and the cumulative variance contribution rate of the principal components of the PCA model slightly change, and according to the criterion that the cumulative variance contribution rate does not obviously rise, 3 principal components are still selected to update the load matrix P and the scoring momentCovariance matrix sigma of array T.
The fourth step: according toLevel of significance achieved for F distributions with set and degrees of freedom 3 and 76testAnd confidence ctestAnd (4) calculating. As shown in fig. 4, as the proportion of doped diesel increases, the sample confidence of the simulation test set shows a smooth decreasing curve; when the proportion of the blended diesel oil is 2-10%, the confidence coefficient is reduced to be lower than 50% quickly, and when the proportion of the blended diesel oil is 6%; as the proportion of blended diesel oil continues to increase, the confidence rate of decline gradually slows down.
Example 3:
the mixture of diesel oil and gasoline is simulated by the mixture ratio of a plurality of spectra. Firstly, a near-infrared spectrometer with the model number of Thermo Fisher Antaris II is adopted to carry out spectrum collection on diesel oil and gasoline samples provided by main oil refineries in Shandong Jinan. The calibration set is 81 pure gasoline near infrared spectrum samples in the example 1, 11 samples in the test set are obtained by averaging after adding a plurality of gasoline spectrums and diesel oil spectrums, and the diesel oil contents respectively account for 0%, 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45% and 50%.
The present invention is further detailed by simulating the above method by MATLAB and referring to fig. 1:
the first step is as follows: and completing the data standardization processing of the correction set and the test set.
The second step: carrying out PCA model decomposition on the spectrum data of the correction set so as to calculate the square value of the Mahalanobis distanceThen according to T2The control limit (α set to 0.05) determines the presence or absence of an abnormal sample point. Since the correction set is consistent with the correction set of embodiment 1, 3 principal components are still selected to calculate the MD values of the correction set and to remove the abnormal values, and after the abnormal samples are removed, the number of samples in the correction set is 79.
The third step: the correction set after the abnormal sample is removed is carried outPCA decomposes and updates covariance matrix sigma of load matrix P and score matrix T, and combines test spectrum data XtestCompleting the test set spectral data scoring matrix TtestAnd the squared value of the mahalanobis distance between the test set and the correction set samplesAs can be seen from table 2, after the abnormal samples in the correction set are removed, the variance contribution rate and the cumulative variance contribution rate of the principal component of the PCA model slightly change, and according to the criterion that the cumulative variance contribution rate does not significantly rise, 3 principal components are still selected to update the covariance matrix Σ of the load matrix P and the score matrix T.
The fourth step: according toLevel of significance achieved for F distributions with set and degrees of freedom 3 and 76testAnd confidence ctestAnd (4) calculating. As can be seen from FIG. 5, as the proportion of blended diesel increases, the confidence of the test set samples decreases overall. When the proportion of the doped diesel oil is 5-15%, the confidence coefficient is reduced at the fastest speed; when the diesel oil blending ratio is 15-25%, the confidence coefficient descending speed gradually slows down; when the diesel oil blending ratio is 25% -50%, the confidence coefficient is close to 0, and no obvious change exists.
According to 3 implementation cases, the confidence coefficient of a detected sample integrally falls along with the increase of the proportion of diesel oil mixed in gasoline, which shows the effectiveness of judging whether a near infrared detection result is abnormal or not by adopting data distribution in the method. The confidence coefficient can be compared with the judgment standard of an enterprise through the sample significance level and the confidence coefficient estimation provided by the Mahalanobis distance and the F distribution; if the confidence coefficient is not less than the judgment standard, the near infrared detection result of the sample is considered to be normal, and if the confidence coefficient is less than the judgment standard, the sample is considered to be suspicious, and the quality index needs to be further determined. Therefore, the method effectively guarantees the reliability of the near infrared spectrum detection result.
The above embodiments are provided only for illustrating the present invention and not for limiting the present invention, and those skilled in the art should make various changes or modifications without departing from the spirit and scope of the present invention.
The foregoing shows and describes the general principles, principal features, and advantages of the invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the foregoing description only for the purpose of illustrating the principles of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the invention as defined by the appended claims, specification and equivalents thereof.
Claims (4)
1. A confidence degree estimation method for a vehicle fuel near infrared spectrum detection result is characterized by comprising the following steps:
s1, carrying out standardization processing on spectral data to obtain correction set spectral data X;
s2, adopting PCA-MD T2Detecting and removing abnormal samples in the correction set spectral data X to ensure that the correction set spectral data X are all normal samples;
s3, carrying out PCA decomposition on the spectrum data X of the correction set, and combining the spectrum data X of the test settestCalculating the squared value of the Mahalanobis distance between the sample to be measured and the sample in the correction setThe step S3 includes:
s31: after the abnormal samples are removed, PCA decomposition is carried out on the spectrum data X of the correction set according to a formula (1), and the covariance matrix sigma of the load matrix P and the scoring matrix T is updated;
S32: calculating a score matrix of the spectral data of the sample set to be detected:
Ttest=XtestP (4)
in the formula, XtestA sample set to be detected and a correction set load matrix P are obtained;
s33: the square value of the Mahalanobis distance between the ith sample to be detected and the calibration setExpressed as:
in the formula, ttest-iScore matrix T representing sample set to be testedtestThe ith row vector of (1);
2. The method for estimating the confidence of the detection result of the near infrared spectrum of the vehicle fuel according to claim 1, wherein the step S2 includes:
s21: the PCA decomposition of the calibration set spectral data can be expressed as:
in the formula, T is belonged to Rn×pFor the scoring matrix, n represents the number of samples, P represents the number of principal components, and P belongs to Rm×pM represents the number of variables, which is a load matrix;
s22: the square of the PCA-MD value of the ith sample of the calibration set spectral data can be expressed as:
wherein, tiRepresenting the ith row vector of the scoring matrix T, and sigma being the covariance matrix of T;
S23:T2the control limit is expressed as:
3. The method for estimating the confidence of the detection result of the near infrared spectrum of the vehicle fuel according to claim 2, wherein the step S4 includes:
s41: significance level α is achieved based on F distribution with degrees of freedom p and n-ptestFor the ith test set sample, the significance level αtest-iObtained according to the following formula:
s42: confidence level c for the ith test set sampletest-iCan be expressed as:
ctest-i=1-αtest-i (7)。
4. the method for estimating the confidence of the detection result of the near infrared spectrum of the vehicle fuel according to claim 2 or 3, wherein the significance level α is set to be 0.01 or 0.05.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910910971.2A CN110672552B (en) | 2019-09-25 | 2019-09-25 | Confidence coefficient estimation method for vehicle fuel oil near infrared spectrum detection result |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910910971.2A CN110672552B (en) | 2019-09-25 | 2019-09-25 | Confidence coefficient estimation method for vehicle fuel oil near infrared spectrum detection result |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110672552A CN110672552A (en) | 2020-01-10 |
CN110672552B true CN110672552B (en) | 2022-06-07 |
Family
ID=69079401
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910910971.2A Active CN110672552B (en) | 2019-09-25 | 2019-09-25 | Confidence coefficient estimation method for vehicle fuel oil near infrared spectrum detection result |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110672552B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113740294B (en) * | 2021-07-29 | 2024-03-08 | 北京易兴元石化科技有限公司 | Near infrared modeling-based gasoline/diesel oil detection and analysis method and device |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5606164A (en) * | 1996-01-16 | 1997-02-25 | Boehringer Mannheim Corporation | Method and apparatus for biological fluid analyte concentration measurement using generalized distance outlier detection |
CN104713846A (en) * | 2015-02-03 | 2015-06-17 | 贵州省烟草科学研究院 | Modeling method for rapidly detecting content of starch in tobacco by using near infrared spectroscopy |
CN107748146A (en) * | 2017-10-20 | 2018-03-02 | 华东理工大学 | A kind of crude oil attribute method for quick predicting based near infrared spectrum detection |
CN109060711A (en) * | 2018-08-28 | 2018-12-21 | 中蓝晨光成都检测技术有限公司 | A method of calculating white oil content in white oil-doped organosilicon product |
-
2019
- 2019-09-25 CN CN201910910971.2A patent/CN110672552B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5606164A (en) * | 1996-01-16 | 1997-02-25 | Boehringer Mannheim Corporation | Method and apparatus for biological fluid analyte concentration measurement using generalized distance outlier detection |
CN104713846A (en) * | 2015-02-03 | 2015-06-17 | 贵州省烟草科学研究院 | Modeling method for rapidly detecting content of starch in tobacco by using near infrared spectroscopy |
CN107748146A (en) * | 2017-10-20 | 2018-03-02 | 华东理工大学 | A kind of crude oil attribute method for quick predicting based near infrared spectrum detection |
CN109060711A (en) * | 2018-08-28 | 2018-12-21 | 中蓝晨光成都检测技术有限公司 | A method of calculating white oil content in white oil-doped organosilicon product |
Non-Patent Citations (4)
Title |
---|
On-line monitoring of powder blending with near-infrared spectroscopy;De Maesschalck, R;《APPLIED SPECTROSCOPY》;19980531;第52卷(第5期);第725-731页 * |
一种基于简化正交距离的近红外异常光谱判断方法;孟丹蕊;《光谱学与光谱分析》;20180430;第38卷(第4期);第1053-1058页 * |
制浆材木质素含量近红外分析模型传递研究;熊智新;《中国造纸学报》;20190915;第34卷(第3期);第43-49页 * |
近红外光谱法分析油页岩含油率中异常样品识别和剔除方法的研究;赵振英;《光谱学与光谱分析》;20140630;第34卷(第6期);第1707-1710页 * |
Also Published As
Publication number | Publication date |
---|---|
CN110672552A (en) | 2020-01-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111123188A (en) | Electric energy meter comprehensive verification method and system based on improved least square method | |
CN107067100B (en) | Wind power abnormal data identification method and identification device | |
CN105117550A (en) | Product multidimensional correlation-oriented degradation failure modeling method | |
CN111367959B (en) | Zero-time-lag nonlinear expansion Granger causal analysis method | |
CN111537845A (en) | Method for identifying aging state of oil paper insulation equipment based on Raman spectrum cluster analysis | |
CN110672552B (en) | Confidence coefficient estimation method for vehicle fuel oil near infrared spectrum detection result | |
CN115453064B (en) | Fine particulate matter air pollution cause analysis method and system | |
CN111222095B (en) | Rough difference judging method, device and system in dam deformation monitoring | |
CN109978059B (en) | Early warning method for tower flushing faults of primary distillation tower in crude oil distillation process | |
CN108508860B (en) | Process industrial production system data monitoring method based on coupling relation | |
CN102542284B (en) | Method for identifying spectrum | |
CN114038501A (en) | Background bacterium judgment method based on machine learning | |
CN112630343A (en) | Method for testing, tracing and rectifying hydrocarbon pollutants of automobile tire | |
CN112949680A (en) | Pollution source identification method based on corresponding analysis and multiple linear regression | |
CN108267422B (en) | Abnormal sample removing method based on near infrared spectrum analysis | |
CN107941758B (en) | Coal quality heat value measuring method based on spectrum pretreatment | |
CN112949169B (en) | Coal sample test value prediction method based on spectral analysis | |
CN112685912B (en) | Multivariate generalized Wiener process performance degradation reliability analysis method | |
CN112711911B (en) | Rapid pollution tracing method applied to boundary observation based on pollution source spectrum library | |
US7970551B2 (en) | Analysis of DNA | |
WO2019041732A1 (en) | Evaluation method and apparatus for manufacturing process capability | |
CN113297722A (en) | Coastal soil salinity assessment method and system | |
CN107727597B (en) | Ultraviolet differential concentration inversion and interference suppression method for mixed gas | |
CN106776485A (en) | A kind of method that expert's conclusion based on probability calculation quantifies | |
CN113655050B (en) | Method for improving Raman spectrum detection limit of trace crude oil in light oil |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |