CN103398971A - Chemometrics method for determining cetane number of diesel oil - Google Patents

Chemometrics method for determining cetane number of diesel oil Download PDF

Info

Publication number
CN103398971A
CN103398971A CN2013103180872A CN201310318087A CN103398971A CN 103398971 A CN103398971 A CN 103398971A CN 2013103180872 A CN2013103180872 A CN 2013103180872A CN 201310318087 A CN201310318087 A CN 201310318087A CN 103398971 A CN103398971 A CN 103398971A
Authority
CN
China
Prior art keywords
diesel oil
cetane number
model
prediction
models
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2013103180872A
Other languages
Chinese (zh)
Inventor
李艳坤
孙伟
景璟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
North China Electric Power University
Original Assignee
North China Electric Power University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by North China Electric Power University filed Critical North China Electric Power University
Priority to CN2013103180872A priority Critical patent/CN103398971A/en
Publication of CN103398971A publication Critical patent/CN103398971A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Investigating Or Analysing Materials By Optical Means (AREA)

Abstract

The invention discloses a chemometrics method for determining a cetane number of diesel oil. The chemometrics method comprises the following steps of firstly, carrying out standardized treatment on determined original near infrared spectrum data of the diesel oil and screening variables (wavelengths) in the data by adopting an uninformative variable elimination UVE method; then, establishing a partial least squares PLS model between a known diesel oil spectrum matrix containing different variable numbers and the corresponding cetane number; predicating the cetane number of an unknown sample by adopting a multi-model consensus method; calculating a total predicated value by carrying out weighting method calculation on a predicated value of a single model; finally, calculating the predicated values by an averaging method through repeating a predication process for a plurality of times so as to obtain the final predicated result. The method disclosed by the invention is applied to the determination for the cetane number of an actual diesel oil sample to obtain a good effect; the performance for predicating the cetane number of the diesel oil by combining the PLS linear model with a near-infrared spectrometry is improved in the aspects of accuracy and stability; the cetane number of the diesel oil can be detected accurately, rapidly and simply.

Description

Chemometrics method for determining cetane number of diesel oil
Technical Field
The invention relates to a method for measuring the cetane number of diesel oil by combining chemometrics with a near infrared spectrum technology.
Background
The structural groups that absorb in the near infrared spectral region (NIRs) are predominantly X-H, and the various compounds that make up the petroleum product have this structure. Therefore, the NIRs are almost used in each link of petrochemical engineering detection by the characteristics of simplicity, rapidness, no damage, good reproducibility and the like, and play an important role. As a complex hydrocarbon mixture, the diesel oil has a very complex near infrared spectrum structure and a serious spectrogram overlapping phenomenon. Therefore, the characteristic information extraction of the near infrared spectrum data of the diesel oil is carried out, and a relation model between indexes (cetane number, density, condensation point and the like) and spectrum data is established, so that the method is a key step for ensuring the quality of the oil product.
The Cetane Number (CN) of diesel oil is a main index for measuring the combustion performance of diesel oil. The traditional cetane number measuring method adopts an ASTM D613 experimental method internationally, and GB/T386 established in China also refers to the ASTM D613 method. The method needs a large amount of oil samples, takes long time and has high measuring cost. At present, the near infrared spectrum technology is combined with a chemometric method for measuring the cetane number of diesel oil, and the method is reported in documents. The cetane number of diesel oil and a spectrum signal have a nonlinear relation, and the modeling is better than linear algorithms such as Multiple Linear Regression (MLR) and Partial Least Squares (PLS) by adopting a nonlinear algorithm Support Vector Machine (SVM), an Artificial Neural Network (ANN) and the like.
The invention adopts the non-information variable elimination (UVE) method combined with the multi-model consensus PLS method to establish a novel quantitative model, improves the performance of predicting the cetane number of diesel oil by a PLS linear model, and obtains good effect.
Disclosure of Invention
The invention aims to provide a method for rapidly and accurately detecting the cetane number of diesel oil, and provides a new correction model for the quantitative regression analysis of near infrared spectrum and the determination of the cetane number of the diesel oil.
The principle of the method is that firstly, variables contained in the diesel near infrared spectrum data after standardized processing are screened by a non-information variable elimination (UVE) method, and then a plurality of PLS models containing different variable numbers are used for jointly predicting the cetane number of an unknown diesel sample, so that higher prediction accuracy and stability than those of a single PLS model are obtained.
The method comprises the following specific steps:
1. the original near infrared spectrum of the diesel oil is subjected to standardization treatment, and a data set is randomly divided into a training set, a checking set and a prediction set.
2. The number of factors for the PLS model was established according to the Monte Carlo cross validation in combination with the F-test method. The Root Mean Square Error (RMSEP) of the test set predicted values is used as an optimization criterion for all parameters in the model.
RMSEP = [ 1 n Σ i = 1 n ( y i - y ^ i ) 2 ] 1 / 2 , - - - ( 1 )
Wherein,
Figure BSA0000093093060000022
denotes the predicted value of the i-th sample, yiRepresenting the true value of the ith sample, and n is the number of predicted samples.
3. According to the size of a stability value of a variable (wavelength) in a UVE algorithm, different variable screening thresholds are established, wavelength optimization screening is carried out, and therefore a plurality of PLS models with different variable numbers are established.
4. Randomly selecting a certain number of models from the models established in the step 3, and optimizing the number of the selected models through predicting a test set sample; and predicting all training set samples by using the selected fixed number of models, calculating the prediction error of each model and the weight of the corresponding model, and finally calculating the prediction result of each model by a weighting method to obtain a total prediction value.
5. The prediction set samples are predicted using the same prediction method as in step 4. And then, setting the times of repeated prediction, and combining the predicted values of the prediction set in the multiple prediction processes by an averaging method to obtain the final prediction result of the prediction set. And calculating the mean and standard deviation (sigma) of the Root Mean Square Error (RMSEP) of the prediction results of the prediction set.
The chemometric method improves the performance of predicting the cetane number of the diesel oil by combining a PLS linear model with near infrared spectrum from the aspects of accuracy and stability, and can accurately, quickly and simply detect the cetane number in the diesel oil.
Drawings
FIG. 1 is a flow chart of the method of the present invention
FIG. 2 is a near infrared spectrum of diesel oil in the example
Detailed Description
Example (b):
NIR spectra and corresponding cetane number profiles for 245 diesel samples were downloaded from http:// software. The near infrared spectrum scanning range is 750-1550nm, the scanning interval is 2nm, and the scanning interval comprises 401 wavelength points (variables). The method comprises the following specific steps:
1. and carrying out standardization processing on the original NIR spectral data to obtain a standardized data matrix. The normalized matrix has a mean value of 0 and a variance of 1 for each variable in the variable space.
2. The original data set was randomly divided into 155 samples as a training set, 45 samples as a test set, and the remaining 45 samples as a prediction set.
3. A Partial Least Squares (PLS) correlation model was established between NIR spectra of diesel training set samples and corresponding cetane number data according to Monte Carlo (Monte Carlo) cross validation in combination with the F-test method. The number of factors for determining the PLS predictive model method is 13, taking Root Mean Square Error (RMSEP) of the predicted values of the test set samples as an optimization criterion.
4. And obtaining the stability value of each variable by a non-information variable UVE (ultraviolet ray) elimination method, and setting K times of the maximum value in the absolute values of the stability values of the added noise as a threshold value for variable screening. The K value is in the interval from 0 to 0.8, and the value is taken at intervals of 0.02. In the normalized data obtained in step 1, variables having stability values with absolute values greater than or equal to the size of the screening threshold are retained. Thus 40 PLS models were obtained which retained a different number of variables.
5. And (4) randomly selecting 25 models (the number of the models is determined after optimization) from the 40 models established in the step (4), and respectively modeling by using a PLS method. Calculating all training set samples x for each model predictioniRelative error δ between predicted value and actual value of (i ═ 1.. multidot.n)i,t
δ i , t = e i , t y i - - - ( 2 )
Wherein,
Figure BSA0000093093060000032
t (t ═ 1.., 25) is the model number.
Then delta will bei,tThe normalization processing is carried out, and the normalization processing is carried out,
δ i , t = δ i , t max ( δ i , t ) ; - - - ( 3 )
then delta will bei,tThe average value is taken to obtain the prediction error beta of the modeltThereby calculating each model CtIs/are as follows
α t = 1 / 21 g 1 - β t β t ; - - - ( 4 )
Then calculate each model CtThe weight of (a) is determined,
w t = α t / Σ t = 1 25 α t ; - - - ( 5 )
calculating the results of 25 PLS model prediction set samples by a weighting method to obtain a total prediction value,
C ( x i ) = Σ t = 1 25 w t C t ( x i ) . - - - ( 6 )
6. and (5) repeating the operation 100 times to predict the cetane number of the prediction set sample. The mean of the root mean square error RMSEP and the relative standard deviation (σ) of RMSEP of the predicted results are calculated.
7. The method of the present invention having the above steps is named multi-model BPLS method. Each model C in the step 5 is processedtThe weights of (a) are set to be equal, namely, a total predicted value is obtained through a simple averaging method, other steps are not changed, and the method is named as a multi-model CPLS method. The operation is repeated 100 times, and the cetane number of the prediction set sample is predicted. The results of the two methods are compared and are shown in table 1. The result shows that the chemometric method provided by the invention has good effect on the accuracy and stability of prediction, and provides a novel and practical correction model for the diesel cetane number determination and the quantitative regression analysis of the near infrared spectrum.
TABLE 1 comparison of results of different methods for predicting cetane number of diesel oil samples
Figure BSA0000093093060000041

Claims (5)

1. A chemometric method for determining the cetane number of diesel oil, which is characterized by comprising the following steps:
(1) measuring to obtain near infrared spectrum data of diesel oil, and carrying out standardization treatment on the data;
(2) screening variables contained in the data obtained in the step (1) by adopting a non-information variable UVE elimination method;
(3) establishing a Partial Least Squares (PLS) correlation model between diesel spectrum matrixes containing different variable quantities and corresponding diesel cetane numbers;
(4) and (4) randomly extracting a certain number of models established in the step (3) and jointly predicting the cetane number in the unknown diesel oil sample. Then, calculating the total predicted value of the prediction results of the models by a weighting method;
(5) and (4) repeatedly operating the step (4) for multiple times, and combining the prediction results in the prediction processes through an averaging method to finally give a prediction result of the cetane number of the unknown diesel oil sample.
2. The chemometric method for determining the cetane number of diesel oil as claimed in claim 1, wherein, in the step (1), the wavelength range of the characteristic region in the near infrared spectrum is 750-1550 nm.
3. The chemometric method for determining the cetane number of diesel oil according to claim 1, characterized in that in step (2), K times the maximum value of the absolute values of the stability values of the additive noise (K is in the interval from 0 to 0.8 and is 0.02 per interval) in the UVE method is taken as the threshold value of the variable screening, and the variables with the absolute values of the stability values greater than or equal to the threshold value are retained.
4. The chemometric method for determining the cetane number of diesel oil according to claim 1, wherein in the step (4), 25 models are randomly selected from the 40 PLS models established in the step (3) in a ratio of 5/8.
5. The chemometric method for measuring the cetane number of diesel oil according to claim 1, wherein in the step (4), the prediction error of each model and the weight of the model are calculated, and the total prediction result is calculated from the prediction results of these models by a weighting method:
according to each model CtIs predicted by the prediction error betatCalculating
Figure FSA0000093093050000011
Wherein t (t ═ 1.., 25) is a model number;
each model CtHas a weight of w t = α t / Σ t = 1 25 α t - - - ( 2 )
The final prediction result is C ( x i ) = Σ t = 1 25 w t C t ( x i ) - - - ( 3 )
Wherein x isi(i 1.., n) is a prediction sample.
CN2013103180872A 2013-07-19 2013-07-19 Chemometrics method for determining cetane number of diesel oil Pending CN103398971A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2013103180872A CN103398971A (en) 2013-07-19 2013-07-19 Chemometrics method for determining cetane number of diesel oil

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2013103180872A CN103398971A (en) 2013-07-19 2013-07-19 Chemometrics method for determining cetane number of diesel oil

Publications (1)

Publication Number Publication Date
CN103398971A true CN103398971A (en) 2013-11-20

Family

ID=49562637

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2013103180872A Pending CN103398971A (en) 2013-07-19 2013-07-19 Chemometrics method for determining cetane number of diesel oil

Country Status (1)

Country Link
CN (1) CN103398971A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104091089A (en) * 2014-07-28 2014-10-08 温州大学 Infrared spectrum data PLS modeling method
CN104964943A (en) * 2015-05-28 2015-10-07 中北大学 Self-adaptive Group Lasso-based infrared spectrum wavelength selection method
CN106468708A (en) * 2015-08-19 2017-03-01 中国石化扬子石油化工有限公司 A kind of Forecasting Methodology of Lubricity of Low-Sulfur Diesel Fuels
CN106645012A (en) * 2016-12-19 2017-05-10 中国石油化工股份有限公司 Method for carrying out rapid quantitative analysis on ester compounds in finished product gasoline and diesel
CN109358021A (en) * 2018-12-13 2019-02-19 黄河三角洲京博化工研究院有限公司 A kind of method of tert-butyl peroxide ether content in infrared spectroscopic determination diesel oil
CN113740294A (en) * 2021-07-29 2021-12-03 北京易兴元石化科技有限公司 Gasoline/diesel oil detection and analysis method and device based on near infrared modeling

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
LI YAN-KUN: "《Determination of diesel cetane number by consensus modeling based on uninformative variable elimination》", 《ANALYTICAL METHODS》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104091089A (en) * 2014-07-28 2014-10-08 温州大学 Infrared spectrum data PLS modeling method
CN104091089B (en) * 2014-07-28 2016-04-27 温州大学 A kind of ir data PLS modeling method
CN104964943A (en) * 2015-05-28 2015-10-07 中北大学 Self-adaptive Group Lasso-based infrared spectrum wavelength selection method
CN104964943B (en) * 2015-05-28 2017-07-18 中北大学 A kind of infrared spectrum Wavelength selecting method based on self adaptation Group Lasso
CN106468708A (en) * 2015-08-19 2017-03-01 中国石化扬子石油化工有限公司 A kind of Forecasting Methodology of Lubricity of Low-Sulfur Diesel Fuels
CN106645012A (en) * 2016-12-19 2017-05-10 中国石油化工股份有限公司 Method for carrying out rapid quantitative analysis on ester compounds in finished product gasoline and diesel
CN109358021A (en) * 2018-12-13 2019-02-19 黄河三角洲京博化工研究院有限公司 A kind of method of tert-butyl peroxide ether content in infrared spectroscopic determination diesel oil
CN113740294A (en) * 2021-07-29 2021-12-03 北京易兴元石化科技有限公司 Gasoline/diesel oil detection and analysis method and device based on near infrared modeling
CN113740294B (en) * 2021-07-29 2024-03-08 北京易兴元石化科技有限公司 Near infrared modeling-based gasoline/diesel oil detection and analysis method and device

Similar Documents

Publication Publication Date Title
CN103398971A (en) Chemometrics method for determining cetane number of diesel oil
CN105388123B (en) A kind of method by near infrared spectrum prediction oil property
CN104990894B (en) A kind of gasoline property detection method based on weighting absorbance and similar sample
CN106918567B (en) A kind of method and apparatus measuring trace metal ion concentration
CN102305772A (en) Method for screening characteristic wavelength of near infrared spectrum features based on heredity kernel partial least square method
CN105466884B (en) It is a kind of by near infrared light spectrum discrimination crude oil species and its method for property
Li et al. A consensus PLS method based on diverse wavelength variables models for analysis of near-infrared spectra
CN106529008B (en) A kind of double integrated offset minimum binary modeling methods based on Monte Carlo and LASSO
CN108830253B (en) Screening model establishing method, spectrum screening device and method
CN107632010B (en) Method for quantifying steel sample by combining laser-induced breakdown spectroscopy
CN102998276B (en) By the method for infrared spectrum measurement true boiling point curve of crude oil
CN102128805A (en) Method and device for near infrared spectrum wavelength selection and quick quantitative analysis of fruit
Yan-kun Determination of diesel cetane number by consensus modeling based on uninformative variable elimination
CN103134763B (en) The method of oil density is predicted by infrared spectrum
CN113504198A (en) Amaranth pigment concentration detection method and device and storage medium
CN113903407A (en) Component identification method, component identification device, electronic equipment and storage medium
Tian et al. Application of nir spectral standardization based on principal component score evaluation in wheat flour crude protein model sharing
Huang et al. Optimal wavelength selection for hyperspectral scattering prediction of apple firmness and soluble solids content
Riau et al. Selection of compound group to identify the authenticity one of jamu product using the group lasso for logistic regression
CN114414524A (en) Method for rapidly detecting properties of aviation kerosene
CN103134762B (en) The method of crude oil nitrogen content is predicted by infrared spectrum
CN110646371A (en) Method for measuring water content of tobacco essence perfume
CN109724939B (en) Method for predicting hydrogenated tail oil property by near infrared spectrum
CN111027025A (en) Method for screening infrared spectrum characteristic wavelength to predict wine quality parameters
CN110646373A (en) Method for measuring sugar content of tobacco flavor and fragrance

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20131120