CN103398971A - Chemometrics method for determining cetane number of diesel oil - Google Patents
Chemometrics method for determining cetane number of diesel oil Download PDFInfo
- Publication number
- CN103398971A CN103398971A CN2013103180872A CN201310318087A CN103398971A CN 103398971 A CN103398971 A CN 103398971A CN 2013103180872 A CN2013103180872 A CN 2013103180872A CN 201310318087 A CN201310318087 A CN 201310318087A CN 103398971 A CN103398971 A CN 103398971A
- Authority
- CN
- China
- Prior art keywords
- diesel oil
- cetane number
- model
- prediction
- models
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 47
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 title claims abstract description 34
- 239000002283 diesel fuel Substances 0.000 title claims abstract description 31
- 238000002329 infrared spectrum Methods 0.000 claims abstract description 14
- 238000012216 screening Methods 0.000 claims abstract description 7
- 238000001228 spectrum Methods 0.000 claims abstract description 6
- 230000008030 elimination Effects 0.000 claims abstract description 5
- 238000003379 elimination reaction Methods 0.000 claims abstract description 5
- 238000012935 Averaging Methods 0.000 claims abstract description 4
- 238000012569 chemometric method Methods 0.000 claims description 8
- 230000000717 retained effect Effects 0.000 claims description 3
- 239000000654 additive Substances 0.000 claims 1
- 230000000996 additive effect Effects 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 3
- 239000011159 matrix material Substances 0.000 abstract description 3
- 238000001320 near-infrared absorption spectroscopy Methods 0.000 abstract 1
- 238000012549 training Methods 0.000 description 5
- 238000005457 optimization Methods 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 238000001134 F-test Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 238000002790 cross-validation Methods 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 238000000611 regression analysis Methods 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 238000012706 support-vector machine Methods 0.000 description 2
- 239000004215 Carbon black (E152) Substances 0.000 description 1
- 238000002485 combustion reaction Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- 150000002430 hydrocarbons Chemical class 0.000 description 1
- 238000012417 linear regression Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 239000003209 petroleum derivative Substances 0.000 description 1
Images
Landscapes
- Investigating Or Analysing Materials By Optical Means (AREA)
Abstract
The invention discloses a chemometrics method for determining a cetane number of diesel oil. The chemometrics method comprises the following steps of firstly, carrying out standardized treatment on determined original near infrared spectrum data of the diesel oil and screening variables (wavelengths) in the data by adopting an uninformative variable elimination UVE method; then, establishing a partial least squares PLS model between a known diesel oil spectrum matrix containing different variable numbers and the corresponding cetane number; predicating the cetane number of an unknown sample by adopting a multi-model consensus method; calculating a total predicated value by carrying out weighting method calculation on a predicated value of a single model; finally, calculating the predicated values by an averaging method through repeating a predication process for a plurality of times so as to obtain the final predicated result. The method disclosed by the invention is applied to the determination for the cetane number of an actual diesel oil sample to obtain a good effect; the performance for predicating the cetane number of the diesel oil by combining the PLS linear model with a near-infrared spectrometry is improved in the aspects of accuracy and stability; the cetane number of the diesel oil can be detected accurately, rapidly and simply.
Description
Technical Field
The invention relates to a method for measuring the cetane number of diesel oil by combining chemometrics with a near infrared spectrum technology.
Background
The structural groups that absorb in the near infrared spectral region (NIRs) are predominantly X-H, and the various compounds that make up the petroleum product have this structure. Therefore, the NIRs are almost used in each link of petrochemical engineering detection by the characteristics of simplicity, rapidness, no damage, good reproducibility and the like, and play an important role. As a complex hydrocarbon mixture, the diesel oil has a very complex near infrared spectrum structure and a serious spectrogram overlapping phenomenon. Therefore, the characteristic information extraction of the near infrared spectrum data of the diesel oil is carried out, and a relation model between indexes (cetane number, density, condensation point and the like) and spectrum data is established, so that the method is a key step for ensuring the quality of the oil product.
The Cetane Number (CN) of diesel oil is a main index for measuring the combustion performance of diesel oil. The traditional cetane number measuring method adopts an ASTM D613 experimental method internationally, and GB/T386 established in China also refers to the ASTM D613 method. The method needs a large amount of oil samples, takes long time and has high measuring cost. At present, the near infrared spectrum technology is combined with a chemometric method for measuring the cetane number of diesel oil, and the method is reported in documents. The cetane number of diesel oil and a spectrum signal have a nonlinear relation, and the modeling is better than linear algorithms such as Multiple Linear Regression (MLR) and Partial Least Squares (PLS) by adopting a nonlinear algorithm Support Vector Machine (SVM), an Artificial Neural Network (ANN) and the like.
The invention adopts the non-information variable elimination (UVE) method combined with the multi-model consensus PLS method to establish a novel quantitative model, improves the performance of predicting the cetane number of diesel oil by a PLS linear model, and obtains good effect.
Disclosure of Invention
The invention aims to provide a method for rapidly and accurately detecting the cetane number of diesel oil, and provides a new correction model for the quantitative regression analysis of near infrared spectrum and the determination of the cetane number of the diesel oil.
The principle of the method is that firstly, variables contained in the diesel near infrared spectrum data after standardized processing are screened by a non-information variable elimination (UVE) method, and then a plurality of PLS models containing different variable numbers are used for jointly predicting the cetane number of an unknown diesel sample, so that higher prediction accuracy and stability than those of a single PLS model are obtained.
The method comprises the following specific steps:
1. the original near infrared spectrum of the diesel oil is subjected to standardization treatment, and a data set is randomly divided into a training set, a checking set and a prediction set.
2. The number of factors for the PLS model was established according to the Monte Carlo cross validation in combination with the F-test method. The Root Mean Square Error (RMSEP) of the test set predicted values is used as an optimization criterion for all parameters in the model.
Wherein,denotes the predicted value of the i-th sample, yiRepresenting the true value of the ith sample, and n is the number of predicted samples.
3. According to the size of a stability value of a variable (wavelength) in a UVE algorithm, different variable screening thresholds are established, wavelength optimization screening is carried out, and therefore a plurality of PLS models with different variable numbers are established.
4. Randomly selecting a certain number of models from the models established in the step 3, and optimizing the number of the selected models through predicting a test set sample; and predicting all training set samples by using the selected fixed number of models, calculating the prediction error of each model and the weight of the corresponding model, and finally calculating the prediction result of each model by a weighting method to obtain a total prediction value.
5. The prediction set samples are predicted using the same prediction method as in step 4. And then, setting the times of repeated prediction, and combining the predicted values of the prediction set in the multiple prediction processes by an averaging method to obtain the final prediction result of the prediction set. And calculating the mean and standard deviation (sigma) of the Root Mean Square Error (RMSEP) of the prediction results of the prediction set.
The chemometric method improves the performance of predicting the cetane number of the diesel oil by combining a PLS linear model with near infrared spectrum from the aspects of accuracy and stability, and can accurately, quickly and simply detect the cetane number in the diesel oil.
Drawings
FIG. 1 is a flow chart of the method of the present invention
FIG. 2 is a near infrared spectrum of diesel oil in the example
Detailed Description
Example (b):
NIR spectra and corresponding cetane number profiles for 245 diesel samples were downloaded from http:// software. The near infrared spectrum scanning range is 750-1550nm, the scanning interval is 2nm, and the scanning interval comprises 401 wavelength points (variables). The method comprises the following specific steps:
1. and carrying out standardization processing on the original NIR spectral data to obtain a standardized data matrix. The normalized matrix has a mean value of 0 and a variance of 1 for each variable in the variable space.
2. The original data set was randomly divided into 155 samples as a training set, 45 samples as a test set, and the remaining 45 samples as a prediction set.
3. A Partial Least Squares (PLS) correlation model was established between NIR spectra of diesel training set samples and corresponding cetane number data according to Monte Carlo (Monte Carlo) cross validation in combination with the F-test method. The number of factors for determining the PLS predictive model method is 13, taking Root Mean Square Error (RMSEP) of the predicted values of the test set samples as an optimization criterion.
4. And obtaining the stability value of each variable by a non-information variable UVE (ultraviolet ray) elimination method, and setting K times of the maximum value in the absolute values of the stability values of the added noise as a threshold value for variable screening. The K value is in the interval from 0 to 0.8, and the value is taken at intervals of 0.02. In the normalized data obtained in step 1, variables having stability values with absolute values greater than or equal to the size of the screening threshold are retained. Thus 40 PLS models were obtained which retained a different number of variables.
5. And (4) randomly selecting 25 models (the number of the models is determined after optimization) from the 40 models established in the step (4), and respectively modeling by using a PLS method. Calculating all training set samples x for each model predictioniRelative error δ between predicted value and actual value of (i ═ 1.. multidot.n)i,t
Then delta will bei,tThe normalization processing is carried out, and the normalization processing is carried out,
then delta will bei,tThe average value is taken to obtain the prediction error beta of the modeltThereby calculating each model CtIs/are as follows
Then calculate each model CtThe weight of (a) is determined,
calculating the results of 25 PLS model prediction set samples by a weighting method to obtain a total prediction value,
6. and (5) repeating the operation 100 times to predict the cetane number of the prediction set sample. The mean of the root mean square error RMSEP and the relative standard deviation (σ) of RMSEP of the predicted results are calculated.
7. The method of the present invention having the above steps is named multi-model BPLS method. Each model C in the step 5 is processedtThe weights of (a) are set to be equal, namely, a total predicted value is obtained through a simple averaging method, other steps are not changed, and the method is named as a multi-model CPLS method. The operation is repeated 100 times, and the cetane number of the prediction set sample is predicted. The results of the two methods are compared and are shown in table 1. The result shows that the chemometric method provided by the invention has good effect on the accuracy and stability of prediction, and provides a novel and practical correction model for the diesel cetane number determination and the quantitative regression analysis of the near infrared spectrum.
TABLE 1 comparison of results of different methods for predicting cetane number of diesel oil samples
Claims (5)
1. A chemometric method for determining the cetane number of diesel oil, which is characterized by comprising the following steps:
(1) measuring to obtain near infrared spectrum data of diesel oil, and carrying out standardization treatment on the data;
(2) screening variables contained in the data obtained in the step (1) by adopting a non-information variable UVE elimination method;
(3) establishing a Partial Least Squares (PLS) correlation model between diesel spectrum matrixes containing different variable quantities and corresponding diesel cetane numbers;
(4) and (4) randomly extracting a certain number of models established in the step (3) and jointly predicting the cetane number in the unknown diesel oil sample. Then, calculating the total predicted value of the prediction results of the models by a weighting method;
(5) and (4) repeatedly operating the step (4) for multiple times, and combining the prediction results in the prediction processes through an averaging method to finally give a prediction result of the cetane number of the unknown diesel oil sample.
2. The chemometric method for determining the cetane number of diesel oil as claimed in claim 1, wherein, in the step (1), the wavelength range of the characteristic region in the near infrared spectrum is 750-1550 nm.
3. The chemometric method for determining the cetane number of diesel oil according to claim 1, characterized in that in step (2), K times the maximum value of the absolute values of the stability values of the additive noise (K is in the interval from 0 to 0.8 and is 0.02 per interval) in the UVE method is taken as the threshold value of the variable screening, and the variables with the absolute values of the stability values greater than or equal to the threshold value are retained.
4. The chemometric method for determining the cetane number of diesel oil according to claim 1, wherein in the step (4), 25 models are randomly selected from the 40 PLS models established in the step (3) in a ratio of 5/8.
5. The chemometric method for measuring the cetane number of diesel oil according to claim 1, wherein in the step (4), the prediction error of each model and the weight of the model are calculated, and the total prediction result is calculated from the prediction results of these models by a weighting method:
Wherein t (t ═ 1.., 25) is a model number;
each model CtHas a weight of
The final prediction result is
Wherein x isi(i 1.., n) is a prediction sample.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013103180872A CN103398971A (en) | 2013-07-19 | 2013-07-19 | Chemometrics method for determining cetane number of diesel oil |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013103180872A CN103398971A (en) | 2013-07-19 | 2013-07-19 | Chemometrics method for determining cetane number of diesel oil |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103398971A true CN103398971A (en) | 2013-11-20 |
Family
ID=49562637
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2013103180872A Pending CN103398971A (en) | 2013-07-19 | 2013-07-19 | Chemometrics method for determining cetane number of diesel oil |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103398971A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104091089A (en) * | 2014-07-28 | 2014-10-08 | 温州大学 | Infrared spectrum data PLS modeling method |
CN104964943A (en) * | 2015-05-28 | 2015-10-07 | 中北大学 | Self-adaptive Group Lasso-based infrared spectrum wavelength selection method |
CN106468708A (en) * | 2015-08-19 | 2017-03-01 | 中国石化扬子石油化工有限公司 | A kind of Forecasting Methodology of Lubricity of Low-Sulfur Diesel Fuels |
CN106645012A (en) * | 2016-12-19 | 2017-05-10 | 中国石油化工股份有限公司 | Method for carrying out rapid quantitative analysis on ester compounds in finished product gasoline and diesel |
CN109358021A (en) * | 2018-12-13 | 2019-02-19 | 黄河三角洲京博化工研究院有限公司 | A kind of method of tert-butyl peroxide ether content in infrared spectroscopic determination diesel oil |
CN113740294A (en) * | 2021-07-29 | 2021-12-03 | 北京易兴元石化科技有限公司 | Gasoline/diesel oil detection and analysis method and device based on near infrared modeling |
-
2013
- 2013-07-19 CN CN2013103180872A patent/CN103398971A/en active Pending
Non-Patent Citations (1)
Title |
---|
LI YAN-KUN: "《Determination of diesel cetane number by consensus modeling based on uninformative variable elimination》", 《ANALYTICAL METHODS》 * |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104091089A (en) * | 2014-07-28 | 2014-10-08 | 温州大学 | Infrared spectrum data PLS modeling method |
CN104091089B (en) * | 2014-07-28 | 2016-04-27 | 温州大学 | A kind of ir data PLS modeling method |
CN104964943A (en) * | 2015-05-28 | 2015-10-07 | 中北大学 | Self-adaptive Group Lasso-based infrared spectrum wavelength selection method |
CN104964943B (en) * | 2015-05-28 | 2017-07-18 | 中北大学 | A kind of infrared spectrum Wavelength selecting method based on self adaptation Group Lasso |
CN106468708A (en) * | 2015-08-19 | 2017-03-01 | 中国石化扬子石油化工有限公司 | A kind of Forecasting Methodology of Lubricity of Low-Sulfur Diesel Fuels |
CN106645012A (en) * | 2016-12-19 | 2017-05-10 | 中国石油化工股份有限公司 | Method for carrying out rapid quantitative analysis on ester compounds in finished product gasoline and diesel |
CN109358021A (en) * | 2018-12-13 | 2019-02-19 | 黄河三角洲京博化工研究院有限公司 | A kind of method of tert-butyl peroxide ether content in infrared spectroscopic determination diesel oil |
CN113740294A (en) * | 2021-07-29 | 2021-12-03 | 北京易兴元石化科技有限公司 | Gasoline/diesel oil detection and analysis method and device based on near infrared modeling |
CN113740294B (en) * | 2021-07-29 | 2024-03-08 | 北京易兴元石化科技有限公司 | Near infrared modeling-based gasoline/diesel oil detection and analysis method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103398971A (en) | Chemometrics method for determining cetane number of diesel oil | |
CN105388123B (en) | A kind of method by near infrared spectrum prediction oil property | |
CN104990894B (en) | A kind of gasoline property detection method based on weighting absorbance and similar sample | |
CN106918567B (en) | A kind of method and apparatus measuring trace metal ion concentration | |
CN102305772A (en) | Method for screening characteristic wavelength of near infrared spectrum features based on heredity kernel partial least square method | |
CN105466884B (en) | It is a kind of by near infrared light spectrum discrimination crude oil species and its method for property | |
Li et al. | A consensus PLS method based on diverse wavelength variables models for analysis of near-infrared spectra | |
CN106529008B (en) | A kind of double integrated offset minimum binary modeling methods based on Monte Carlo and LASSO | |
CN108830253B (en) | Screening model establishing method, spectrum screening device and method | |
CN107632010B (en) | Method for quantifying steel sample by combining laser-induced breakdown spectroscopy | |
CN102998276B (en) | By the method for infrared spectrum measurement true boiling point curve of crude oil | |
CN102128805A (en) | Method and device for near infrared spectrum wavelength selection and quick quantitative analysis of fruit | |
Yan-kun | Determination of diesel cetane number by consensus modeling based on uninformative variable elimination | |
CN103134763B (en) | The method of oil density is predicted by infrared spectrum | |
CN113504198A (en) | Amaranth pigment concentration detection method and device and storage medium | |
CN113903407A (en) | Component identification method, component identification device, electronic equipment and storage medium | |
Tian et al. | Application of nir spectral standardization based on principal component score evaluation in wheat flour crude protein model sharing | |
Huang et al. | Optimal wavelength selection for hyperspectral scattering prediction of apple firmness and soluble solids content | |
Riau et al. | Selection of compound group to identify the authenticity one of jamu product using the group lasso for logistic regression | |
CN114414524A (en) | Method for rapidly detecting properties of aviation kerosene | |
CN103134762B (en) | The method of crude oil nitrogen content is predicted by infrared spectrum | |
CN110646371A (en) | Method for measuring water content of tobacco essence perfume | |
CN109724939B (en) | Method for predicting hydrogenated tail oil property by near infrared spectrum | |
CN111027025A (en) | Method for screening infrared spectrum characteristic wavelength to predict wine quality parameters | |
CN110646373A (en) | Method for measuring sugar content of tobacco flavor and fragrance |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20131120 |