CN112420132A

CN112420132A - Product quality optimization control method in gasoline catalytic cracking process

Info

Publication number: CN112420132A
Application number: CN202011180154.5A
Authority: CN
Inventors: 毛永芳; 黄秋娟; 柴毅; 张镠; 曾建学
Original assignee: Chongqing University
Current assignee: Chongqing University
Priority date: 2020-10-29
Filing date: 2020-10-29
Publication date: 2021-02-26

Abstract

A gasoline catalytic cracking process product quality optimization control method comprises the steps of collecting historical data of M times of gasoline catalytic cracking processes to obtain original data containing M samples; performing K-means clustering on the M original data to obtain K data sets, preprocessing the data of each data set by adopting the same data cleaning method to obtain processed data, and constructing an optimization control model through the processed data; the method comprises the steps of collecting gasoline to be catalytically cracked as verification data, matching the verification data into a data set of an optimization control model, obtaining technological parameters needing to be optimized in the gasoline catalytic cracking process through the optimization control model, adjusting the technological parameters and carrying out gasoline catalytic cracking to obtain a final optimized product.

Description

Product quality optimization control method in gasoline catalytic cracking process

Technical Field

The invention relates to the field of gasoline refining, in particular to a product quality optimization control method in a gasoline catalytic cracking process.

Background

About 70 percent of the Chinese imported crude oil is intermediate base or naphthenic base crude oil, and the crude oil is characterized by high sulfur content. The quality of gasoline has important influence on the performance and emission of automobiles, and it is very critical to reduce atmospheric pollution, produce high-cleanness gasoline and reduce the sulfur content in the gasoline. Octane number is the most important index for reflecting the combustion performance of gasoline and is used as a commercial brand of gasoline, and a series of operations in the process of carrying out desulfurization and olefin reduction on catalytic cracking gasoline in the prior art can cause the reduction of the octane number in crude oil, so that the quality of the gasoline is reduced. However, when the catalytic cracking gasoline is refined, the whole operation process is controllable, the prior art carries out desulfurization and olefin reduction on the whole process according to certain specifications, and the adjustment target is in accordance with the safety range of the current operation. Thus, although the target product can be obtained, the octane number loss of the product has large fluctuation, and the influence on the quality of the gasoline is large. In addition, although the traditional octane number measurement is accurate, the time consumption is too long, the operation is complicated, and the control of related enterprises on the quality of gasoline is directly influenced. According to relevant documents and a large amount of enterprise measured data, the loss of about 150 yuan/ton is equivalent to the loss of each 1 unit of octane number. Taking a 100 ten thousand ton/year catalytic cracking gasoline refining device as an example, if the octane number loss can be reduced by 0.3 unit, the economic benefit can reach four thousand five million yuan. In the catalytic cracking desulfurization process, on the premise of ensuring that the sulfur content of the finished product is lower than a certain index, the method has very important significance in improving the octane value content as much as possible. However, due to the complexity of the oil refining process and the diversity of equipment, the related influence variables have the characteristics of high dimensionality, high coupling, nonlinearity and cluster distribution, and the catalytic cracking process has great hysteresis, so that the process parameters cannot be adjusted in time according to the quality of the output product to optimize the product quality.

Disclosure of Invention

The invention aims to provide a product quality optimization control method in a gasoline catalytic cracking process,

the invention aims to realize the technical scheme that 1) the original data is acquired: collecting historical data of M times of gasoline catalytic cracking processes to obtain original data T { (X) containing M samples₁,Y₁,S₁),(X₂,Y₂,S₂),...,(X_M,Y_M,S_M) In which X is_i＝(x₁,x₂,...,x_N)i＝[1～M]Containing N in gasoline catalytic cracking process₁Characteristic variables of individual material parameters and N₂Characteristic variables of the process operating parameters; y is_iIs the measured octane number loss value, S_iIs the measured sulfur content of the product;

2) data processing: performing K-means clustering on the M original data T acquired in the step 1) to obtain K data sets, preprocessing the data of each data set by adopting the same data cleaning method to obtain processed data, and then performing data processing on each data set according to the following steps of 9: 1 is randomly divided into a training group and a testing group;

3) constructing an optimization control model: aiming at a training group of a single data set, constructing Q octane number loss prediction models and Q sulfur content prediction models based on a recursive characteristic variable elimination algorithm and a random forest, bringing test set data into the Q octane number loss prediction models and the Q sulfur content prediction models, respectively calculating loss functions of the models, selecting the optimal octane number loss prediction model and the optimal sulfur content prediction model of the data set according to the loss functions, and obtaining the octane number loss prediction models and the sulfur content prediction models of K data sets by the same method to obtain an optimized control model;

4) optimizing and controlling the product quality: acquiring verification data Z of gasoline to be catalytically cracked, matching the verification data into the data set of the optimization control model obtained in the step 3) according to the Euclidean distance minimum principle from a clustering center, taking characteristic variables belonging to operation parameters in the verification data Z as decision variables, and obtaining the optimal solution of the decision variables through the optimization control model; and adjusting each process parameter of the gasoline catalytic cracking process according to the optimal solution of the decision variable, and performing gasoline catalytic cracking to obtain a final optimized product.

Further, the specific steps of data processing in step 2) are as follows:

2-1) original data diversity: randomly selecting K samples from the raw data T collected in the step 1) as an initial mean vector mu₁,μ₂,...,μ_k}; calculate each sample X_iWith each mean vector mu_jA distance of X_iInscribe the nearest mu_jCorresponding data set C_jPerforming the following steps; computing a data set C_jNew mean vector μ'_j: mu.s of_j≠μ′_jThen mu's'_jIs given to mu_jIteratively updating the mean vector; if: mu.s_j＝μ′_jThen the output data set C ═ C₁,C₂,...,C_K}；

2-2) data preprocessing: sample t of the single dataset obtained in step 2-1)_iPerforming longitudinal processing to remove the sample t_iFor variable x_LPerforming transverse processing to remove variables with high abnormal incidence rate to obtain a preprocessed data set D_j{(X_j,Y_j,S_j)}(j＝1,2,...,k)；

2-3) data classification: for a single data set D after the pre-processing of step 2-2)_j{(X_j,Y_j,S_j) Data of (j ═ 1, 2.., k) are randomly sorted, where: using 90% of samples as training population and the rest 10% of samples as testing population to obtain training set data D_tr{X_tr,Y_tr,S_trTest set data D_te{X_te,Y_te,S_te}；

Wherein X_tr{(X₁₁,X₁₂),(X₂₁,X₂₂),...,(X_k1,X_k2)},X_j1,

X_j1,X_j2Respectively representing variables belonging to the material properties and the operating parameters in the characteristic variables.

Further, the specific steps of the original data diversity in step 2-1) are as follows:

2-1-1) randomly selecting K samples from the raw data T collected in step 1) as an initial mean vector [ mu ]₁,μ₂,...,μ_k}；

2-1-2) calculating Each sample X in the raw data T_i(i 1-M) and each mean vector mu_j(j is 1 to k):

d_ij＝||X_i-μ_j||₂ (1)

mixing X_iInscribe the nearest mu_jCorresponding data set C_jPerforming the following steps;

2-1-3) computing the data set C_jNew mean vector μ'_j：

If: mu.s_j≠μ′_jThen mu's'_jIs given to mu_jReturning to the step 2-1-2) to iteratively update the mean vector;

if: mu.s_j＝μ′_jThen the output data set C ═ C₁,C₂,...,C_KTherein of

m_jIs the number of samples of the jth class of data set, where each sample t_i(i＝1,2,...,m_j) Containing N characteristic variables x_L(L＝1,2,...,N)。

Further, the specific steps of preprocessing the data in the step 2-2) are as follows:

2-2-1) samples t on a single dataset_iPerforming longitudinal processing to calculate single data set sample t_iArithmetic mean of

And residual error v of a single sample_iAccording to Bessel formulaThe standard error σ is calculated:

if a certain measured value t_bIs of a residual error v_bSatisfy the requirement of

Then consider t_bThe error value is a bad value containing a large error value, and the marking position in the corresponding homotype all-zero matrix is 1;

and calculating the abnormal occurrence rate p of the sample variable according to the mark matrix_eiAnd rate of sample anomaly measurement p_xe:

In the formula (7), P represents a sample x_LThe number of variables in the abnormal sampling value;

adjusting the measurement rate p_xeThe threshold value is directly removed, and the abnormal data of the samples which do not exceed the threshold value are replaced by adopting a mean value interpolation method;

2-2-2) for variable x_LPerforming transverse processing, and measuring each variable x by using Pearson correlation coefficient rho_L(L ═ 1, 2.., N) correlation with the key target variable y/s of the band study, the formula calculated is as follows:

correlation coefficient rho for each variable simultaneously_xy、ρ_xsAnd abnormal incidence p of variables_eiSetting a threshold value, and eliminating variables with higher abnormal incidence rate through the threshold value;

2-2-3) carrying out normalization treatment on the data processed in the step 2-2-1) and the step 2-2-2):

obtaining a pre-processed data set D_j{(X_j,Y_j,S_j)}(j＝1,2,...,k)。

Further, the specific steps of constructing the optimal control model in the step 3) are as follows:

3-1) utilizing N characteristic variables X in training population data_trAnd octane number loss Y_trConstructing a random forest-based octane number loss prediction model f, and combining characteristic variables x of the model_Nj(ii) a Using test set data X_teAnd Y_teCalculating the loss function root mean square error of the octane number loss prediction model:

in formula (11), y_iThe true value of octane number loss for the ith sample,

the corresponding model estimated value is taken as the model estimated value;

3-2) sequentially calculating the importance of N characteristic variables in the training set data and the characteristic variable X of the training population_trAnd octane number loss Y_trObeying a gaussian distribution:

in formula (12), τ ═ τ (τ)₁,τ₂,...,τ_N)^T,τ_i＝C(X_i,Y),C＝[C(X_i,Y_j)]I.e. X_trThe covariance matrix of (a);

the importance of the ith feature variable:

in the formula (13), α_i＝[C^-1τ]V () represents a calculation variable variance function;

3-3) importance of N characteristic variables I (x)_i) Sorting, and deleting c characteristic variables with low importance to obtain a new combination of N 'characteristic variables, wherein N' ═ N-c;

3-4) returning the new characteristic variable combination N' to the step 3-1), and repeating the steps 3-1) and 3-3) until the number of the characteristic variables N is 0, so as to obtain Q octane number loss prediction models F ═ (F ═ 0)₁,f₂,...,f_Q) And Q characteristic variable combinations v ═ v (v)₁,v₂,...,v_Q)；

3-5) comparing the root mean square error of the loss functions of the Q prediction models, selecting the octane number loss prediction model with the minimum root mean square error of the loss function as the octane number loss prediction model of the data set and obtaining the characteristic variable combination of the models, wherein the variables belonging to the material properties in the characteristic variable combination are recorded as v_y1In the combination of characteristic variables, the variable belonging to an operating parameter is denoted v_y2；

3-6) repeating the steps 3-1) -3-5) aiming at the k data sets, so as to obtain octane number loss prediction models of the k data sets and corresponding characteristic variable combinations thereof;

3-7) D Using training set data_tr＝{X_tr,S_trTest set data D_te＝{X_te,S_teObtaining a product sulfur content prediction model S of K data sets according to the methods of the steps 3-1) to 3-6)_jAnd its corresponding variable v which is a property of the feedstock_s1And variables v belonging to the operating parameters_s2。

Further, the specific steps of performing product quality optimization control on the gasoline catalytic cracking process by using the octane number loss prediction model and the sulfur content prediction model in the step 4) are as follows:

4-1) collecting data Z of gasoline to be catalytically cracked, and calculating Euclidean distance d between verification data Z and each subset clustering center_j(j ═ 1, 2.. once, k), according to the Euclidean distance minimum principle from the verification data to the clustering center, matching the verification data Z to the data set corresponding to the optimization control model constructed in the step 3), and obtaining a corresponding octane number loss prediction model f (Z) (Z₁,Z₂) And sulfur content prediction model S (Z)_S1,Z_S2) Wherein Z is₁For characteristic variables, Z, belonging to the properties of the feedstock in the model for the prediction of octane number loss₂For the characteristic variables belonging to the operating parameters in the model for predicting octane number loss, Z_S1For the characteristic variable, Z, of the nature of the feedstock in the model for predicting the sulphur content of the finished product_S2Characteristic variables belonging to the operation parameters in the finished product sulfur content prediction model are obtained;

4-2) predicting model f (Z) according to octane number loss in step 4-1)₁,Z₂) And sulfur content prediction model S (Z)_S1,Z_S2) Fixing the characteristic variable Z belonging to the nature of the feedstock₁And Z_S1Unchanged as a characteristic variable Z belonging to the operating parameter₂And Z_S2Simultaneously optimizing an octane number loss prediction model and a finished product sulfur content prediction model for decision variables, wherein an optimization objective function of the multi-objective optimization problem is as follows:

in the formula (14), the Δ set represents the adjustable range of each operating parameterAnd using genetic algorithm to obtain global optimum solution

And

the decision variable for optimizing the quality of gasoline catalytic cracking products is a characteristic variable Z belonging to an operating parameter in an octane number loss prediction model₂And the characteristic variable Z belonging to the operation parameter in the finished product sulfur content prediction model_S2Union of (1), i.e. optimal solution of decision variables

4-3) adjusting process parameters: solution Z obtained according to the optimization in step 4-2)^*Adjusting technological parameters of the gasoline catalytic cracking process, and performing gasoline catalytic cracking to obtain a final optimized product.

Due to the adoption of the technical scheme, the invention has the following advantages:

1. the method is different from the traditional data association or mechanism modeling, and the factors most relevant to the octane number content of the finished product are mined from a large amount of data, so that the influence of data noise on model construction and prediction is weakened to the maximum extent;

2. clustering is carried out on the data sets which are distributed in a cluster shape and have more complex distribution, the complexity of data distribution is reduced, a prediction model is established for each type, and then the fitting pressure of the prediction model is reduced;

3. the product quality can be predicted in time according to the product quality prediction model, the defect of production flow delay is avoided, and the process parameters of the optimal target are solved by utilizing a multi-target optimization algorithm and the prediction result, so that the process parameters can be adjusted in time to achieve the purpose of optimal product quality.

Additional advantages, objects, and features of the invention will be set forth in part in the description which follows and in part will become apparent to those having ordinary skill in the art upon examination of the following or may be learned from practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof.

Drawings

The drawings of the present invention are described below.

FIG. 1 is a flow chart of the present invention.

Detailed Description

The invention is further illustrated by the following figures and examples.

A method for optimizing and controlling the quality of products in the gasoline catalytic cracking process comprises the following steps:

1) collecting original data: collecting historical data of M times of gasoline catalytic cracking processes to obtain original data T { (X) containing M samples₁,Y₁,S₁),(X₂,Y₂,S₂),...,(X_M,Y_M,S_M) In which X is_i＝(x₁,x₂,...,x_N)i＝[1～M]Containing N in gasoline catalytic cracking process₁Characteristic variables of individual material parameters and N₂Characteristic variables of the process operating parameters; y is_iIs the measured octane number loss value, S_iIs the measured sulfur content of the product;

2) data processing: performing K-means clustering on the M original data T acquired in the step 1) to obtain K data sets, preprocessing the data of each data set by adopting the same data cleaning method to obtain processed data, and then performing data processing on each data set according to the following steps of 9: 1 is randomly divided into a training group and a testing group; the method comprises the following specific steps:

2-1) original data diversity: randomly selecting K samples from the raw data T collected in the step 1) as an initial mean vector mu₁,μ₂,...,μ_k}; calculate each sample X_iWith each mean vector mu_jA distance of X_iInscribe the nearest mu_jCorresponding data set C_jPerforming the following steps; computing a data set C_jNew mean vector μ'_j: mu.s of_j≠μ′_jThen mu's'_jIs given to mu_jTo, forIteratively updating the mean vector; if: mu.s_j＝μ′_jThen the output data set C ═ C₁,C₂,...,C_KThe method comprises the following specific steps:

d_ij＝||X_i-μ_j||₂ (15)

2-1-3) computing the data set C_jNew mean vector μ'_j：

if: mu.s_j＝μ′_jThen the output data set C ═ C₁,C₂,...,C_KTherein of

2-2) data preprocessing: sample t of the single dataset obtained in step 2-1)_iPerforming longitudinal processing to remove the sample t_iFor variable x_LPerforming transverse processing to remove variables with high abnormal incidence rate to obtain a preprocessed data set D_j{(X_j,Y_j,S_j)}(j＝1,2,...,k)；The method comprises the following specific steps:

And residual error v of a single sample_iAnd calculating the standard error sigma according to a Bessel formula:

In the formula (21), P represents a sample x_LThe number of variables in the abnormal sampling value;

adjusting the measurement rate p_xeThreshold value, for exceeding threshold valueDirectly removing samples, and replacing abnormal data of the samples which do not exceed the threshold value by adopting a mean value interpolation method;

obtaining a pre-processed data set D_j{(X_j,Y_j,S_j)}(j＝1,2,...,k)。

Wherein X_tr{(X₁₁,X₁₂),(X₂₁,X₂₂),...,(X_k1,X_k2)},

，X_j1,X_j2And respectively representing variables belonging to the material property and the operation parameter in the characteristic variables, and the data of the test set are the same.

3) Constructing an optimization control model: aiming at a training group of a single data set, constructing Q octane number loss prediction models and Q sulfur content prediction models based on a recursive characteristic variable elimination algorithm and a random forest, bringing test set data into the Q octane number loss prediction models and the Q sulfur content prediction models, respectively calculating loss functions of the models, selecting the optimal octane number loss prediction model and the optimal sulfur content prediction model of the data set according to the loss functions, and obtaining the octane number loss prediction models and the sulfur content prediction models of K data sets by the same method to obtain an optimized control model; the method comprises the following specific steps:

in the formula (25), y_iThe true value of octane number loss for the ith sample,

the corresponding model estimated value is taken as the model estimated value;

in the formula (26), τ＝(τ₁,τ₂,...,τ_N)^T,τ_i＝C(X_i,Y),C＝[C(X_i,Y_j)]I.e. X_trThe covariance matrix of (a);

the importance of the ith feature variable:

in the formula (27), α_i＝[C^-1τ]V () represents a calculation variable variance function;

4) Optimizing and controlling the product quality: acquiring verification data Z of gasoline to be catalytically cracked, matching the verification data into the data set of the optimization control model obtained in the step 3) according to the Euclidean distance minimum principle from a clustering center, taking characteristic variables belonging to operation parameters in the verification data Z as decision variables, and obtaining the optimal solution of the decision variables through the optimization control model; adjusting each process parameter of the gasoline catalytic cracking process according to the optimal solution of the decision variable, and performing gasoline catalytic cracking to obtain a final optimized product; the method comprises the following specific steps:

in the formula (28), each set of Δ representsThe adjustable range of each operation parameter is used for solving the global optimal solution by utilizing a genetic algorithm

And

As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.

The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

Finally, it should be noted that: the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting the same, and although the present invention is described in detail with reference to the above embodiments, those of ordinary skill in the art should understand that: modifications and equivalents may be made to the embodiments of the invention without departing from the spirit and scope of the invention, which is to be covered by the claims.

Claims

1. A method for optimizing and controlling the quality of products in the gasoline catalytic cracking process is characterized by comprising the following specific steps:

1) collecting original data: collecting historical data of M times of gasoline catalytic cracking processes to obtain original data T { (X) containing M samples₁,Y₁,S₁),(X₂,Y₂,S₂),...,(X_M,Y_M,S_M) In which X is_i＝(x₁,x₂,...,x_N)i＝[1～M]Comprising gasoline catalytic crackingIn the process N₁Characteristic variables of individual material parameters and N₂Characteristic variables of the process operating parameters; y is_iIs the measured octane number loss value, S_iIs the measured sulfur content of the product;

2. The method for optimizing and controlling the product quality of the gasoline catalytic cracking process as claimed in claim 1, wherein the data processing in the step 2) comprises the following steps:

2-1) original data diversity: randomly selecting K samples from the raw data T collected in the step 1) as an initial mean vector mu₁,μ₂,...,μ_k}; calculate each sampleX_iWith each mean vector mu_jA distance of X_iInscribe the nearest mu_jCorresponding data set C_jPerforming the following steps; computing a data set C_jNew mean vector μ'_j: mu.s of_j≠μ′_jThen mu's'_jIs given to mu_jIteratively updating the mean vector; if: mu.s_j＝μ′_jThen the output data set C ═ C₁,C₂,...,C_K}；

Wherein X_tr{(X₁₁,X₁₂),(X₂₁,X₂₂),...,(X_k1,X_k2)},

3. The method for optimizing and controlling the product quality of the gasoline catalytic cracking process as claimed in claim 2, wherein the step 2-1) comprises the following steps:

2-1-1) randomly selecting K samples from the raw data T collected in the step 1) as an initial mean valueVector mu₁,μ₂,...,μ_k}；

d_ij＝||X_i-μ_j||₂ (1)

2-1-3) computing the data set C_jNew mean vector μ'_j：

if: mu.s_j＝μ′_jThen the output data set C ═ C₁,C₂,...,C_KTherein of

4. The method for optimizing and controlling the product quality of the gasoline catalytic cracking process as claimed in claim 2, wherein the step 2-2) of preprocessing the data comprises the following steps:

2-2-1) samples t on a single dataset_iPerforming longitudinal processing to calculate single data set sample t_iThe calculated average value t and the residual error v of a single sample_iAnd calculating the standard error sigma according to a Bessel formula:

obtaining a pre-processed data set D_j{(X_j,Y_j,S_j)}(j＝1,2,...,k)。

5. The method for optimizing and controlling the product quality of the gasoline catalytic cracking process according to claim 1, wherein the specific steps of constructing the optimization control model in the step 3) are as follows:

in formula (11), y_iThe true value of octane number loss for the ith sample,

the corresponding model estimated value is taken as the model estimated value;

the importance of the ith feature variable:

6. The method for optimizing and controlling the product quality in the gasoline catalytic cracking process according to claim 1, wherein the step 4) of performing the product quality optimization control in the gasoline catalytic cracking process by using the octane number loss prediction model and the sulfur content prediction model comprises the following specific steps:

4-2) predicting model f (Z) according to octane number loss in step 4-1)₁,Z₂) And sulfur content prediction model S (Z)_S1,Z_S2) Fixing the characteristic variable Z belonging to the nature of the feedstock₁And Z_S1Unchanged as a characteristic variable Z belonging to the operating parameter₂And Z_S2Optimizing octane number loss prediction model and finished product sulfur content prediction model simultaneously for decision variables, and performing multi-objective optimizationThe optimization objective function of the problem is:

in the formula (14), the Δ set represents the adjustable range of each operation parameter, and a global optimal solution is obtained by using a genetic algorithm

And