CN110264079B

CN110264079B - Hot-rolled product quality prediction method based on CNN algorithm and Lasso regression model

Info

Publication number: CN110264079B
Application number: CN201910538744.1A
Authority: CN
Inventors: 徐林; 李丛丛
Original assignee: Northeastern University China
Current assignee: Northeastern University China
Priority date: 2019-06-20
Filing date: 2019-06-20
Publication date: 2022-12-06
Anticipated expiration: 2039-06-20
Also published as: CN110264079A

Abstract

The invention belongs to the technical field of steel rolling product quality prediction, and particularly relates to a hot rolling product quality prediction method based on a CNN algorithm and a Lasso regression model. The method comprises the following steps of S1: obtaining sample data for modeling, wherein the sample data comprises training data, and determining a key input variable; s2: training the CNN by using key input variables of the sample data to obtain a feature vector model; s3: substituting key input variables of the training data into the feature vector model to obtain input variables substituted into the Lasso regression model; s4: determining an optimal regularization factor of the Lasso regression model, training the Lasso regression model by using the input variables in the S3 to obtain an uncorrected mixed prediction model, and correcting the uncorrected mixed prediction model to obtain a corrected mixed prediction model; s5: and inputting the production data of the future time period into the modified hybrid prediction model to obtain a prediction result of the production data. The method can improve the prediction precision of the model.

Description

Hot-rolled product quality prediction method based on CNN algorithm and Lasso regression model

Technical Field

The invention belongs to the technical field of steel rolling product quality prediction, and particularly relates to a hot rolling product quality prediction method based on a CNN algorithm and a Lasso regression model.

Background

In recent years, although steel enterprises have solved the situation of insufficient production of steel products, no proper control model exists for the product quality, so that the resource utilization rate in the production process is higher than the average level, the steel rolling production energy consumption ratio is too much, and compared with the foreign advanced technology, the method has a great progress space. At present, steel products are mainly used in various aspects of construction industry, aerospace industry, automobile manufacturing industry and the like, and the requirements of the industries on the quality of steel are very strict, so that a proper and accurate steel rolling product quality prediction model is established, and the method has great significance for product quality performance prediction.

The production process of steel is composed of a plurality of links, each link affects the product quality, in order to guarantee the product quality, a relation between production variables and product performance indexes needs to be established, and the aim of controlling the product quality is achieved by changing production parameters under the existing production conditions. After years of production experience, steel enterprises accumulate a great deal of production data which is easily polluted by factory noise, so how to process the data and use the data in modeling is preparation work before product quality modeling. At present, the method for establishing a prediction model for product quality mainly comprises two major categories, namely a regression algorithm and a neural network algorithm: the regression model has a simple structure, and although the generation of the over-fitting problem can be avoided, the regression model has the defect that the prediction result is greatly different from the expectation; all the nodes of the fully-connected neural network model are connected with each other, so that the quantity of extracted parameters is increased, and the computational complexity of the model is increased. Therefore, considering the above problems of the rolled steel product data and the modeling process, a proper modeling method needs to be selected to complete the product quality prediction.

Disclosure of Invention

Technical problem to be solved

Aiming at the existing technical problems, the invention provides the hot-rolled product quality prediction method based on the CNN algorithm and the Lasso regression model, which can relieve the overfitting problem of the model to a certain extent and improve the prediction precision of the model.

(II) technical scheme

In order to achieve the purpose, the invention adopts the main technical scheme that:

a hot-rolled product quality prediction method based on a CNN algorithm and a Lasso regression model comprises the following steps,

s1: obtaining sample data for modeling in historical data about the performance of the hot rolled product, wherein the sample data comprises training data, and determining a key input variable;

s2: training the CNN by using key input variables of the sample data to obtain a feature vector model;

s3: substituting key input variables of the training data into the feature vector model to obtain input variables substituted into the Lasso regression model;

s4: determining an optimal regularization factor of the Lasso regression model, training the Lasso regression model by using the input variables obtained in the step S3 to obtain an uncorrected hybrid prediction model, and correcting the uncorrected hybrid prediction model at least once to obtain a corrected hybrid prediction model for predicting production data of a future time period;

s5: and inputting the production data of the future time period into the modified hybrid prediction model to obtain a prediction result of the production data.

Specifically, in step S1, sample data for modeling in relation to the history data of the properties of the hot rolled product is acquired, including,

s11: acquiring historical data about the performance of hot rolling products in a preset time period, cleaning the historical data, and acquiring complete cleaned first-time data;

s12: sampling the cleaned first-time data to obtain sampled second-time data;

s13: performing dimensionality reduction on the sampled second data to obtain third data subjected to dimensionality reduction;

s14: and carrying out normalization processing on the third time data after dimension reduction to finally obtain sample data for modeling.

Specifically, after step S14, further comprising,

s15: determining the proportion of training data and test data in sample data;

s16: a key output variable is selected.

In particular, in step S4, the unmodified hybrid prediction model is modified at least once, including,

s41: substituting the key input variables of the test data into the feature vector model to obtain input variables for substituting into the unmodified hybrid prediction model;

s42: and substituting the input variables obtained in the step S41 into the unmodified hybrid prediction model to obtain a predicted value, comparing the predicted value with the key output variables of the test data to modify the hybrid prediction model for multiple times until the error between the predicted value and the key output variables in the test data is within a preset range, and obtaining the modified hybrid prediction model for predicting the production data of the current time period.

Specifically, the step S11 of obtaining the complete cleaned first data includes performing a cleaning operation on data having an abnormal value and a missing value by using a direct elimination processing mode;

the step S12 of obtaining the sampled second data includes sampling the data by using a system random sampling method.

Specifically, the data dimensionality reduction in S13 includes,

determining the degree of correlation between the input variable and the output variable in the sampled secondary data by using a correlation analysis method, calculating a correlation coefficient between the input variable and the output variable, determining a key input variable of the model according to the importance degree of the variables, and reducing the dimensionality of the input variable of the production data in the model, wherein the calculation mode of the correlation coefficient is as follows:

wherein n is the number of sample data sets, x _i ，y _i (i =1,2, … n) as input variables and output variables,

is the average of the input variable and the output variable.

Specifically, the specific method of the data normalization process in S14 is,

let x _i ＝(x _i1 ,x _i2 ,…,x _ip ) I ∈ 2313, p is the total number of features of the input variable, and the formula for normalizing the data is as follows:

wherein the content of the first and second substances,

is a value distributed between 0 and 1 after normalization treatment, k _min Corresponding to the minimum value of the production variable, k _max Corresponding to the maximum value of the production variable.

Specifically, in S15, the ratio of training data to test data in the sample data is determined, including,

modeling by using a regression method and a neural network method respectively, and comparing average relative errors of model predicted values and actual values in different proportions to obtain the proportion of training data and test data;

and in S16, three mechanical performance indexes of elongation at break, yield strength and tensile strength are selected as key output variables.

Specifically, the CNN is trained in S2 to obtain a feature vector model, including,

the process of establishing the feature vector model includes both forward propagation and backward propagation,

A. forward propagation

Integrating one-dimensional input variables into 6 × 6 two-dimensional data, using 32 convolution kernels with the size of 3 × 3 for the first convolution layer, using 64 convolution kernels with the same size in the design of the second convolution layer, and extracting layer by layer through the two convolution layers to finally obtain representative local data characteristics, wherein the convolution output calculation formula is as follows:

wherein the content of the first and second substances,

representing the output of the convolutional layer after adding an activation function, the activation function being a ReLU function, F _j The number of the characteristic diagrams is shown,

is the input of the convolution characteristic of the previous layer,

in order to convolve the kernel matrix with the desired pattern,

is an offset;

after the convolution, the characteristic sampling after the convolution is carried out again, all data characteristics that draw the convolution layer promptly carry out statistical processing, choose for use maximum value pooling mode to sample, and two pooling layer windows all set to 2 x 2 size, and every process pooling is handled, and the data size all can reduce half, and the computational formula is:

wherein the content of the first and second substances,

is the output of the pond layer after the action,

pooling weight, pooling is pooling operation;

B. counter-propagating

The back propagation is a process of carrying out error calculation on the predicted value and the actual value, carrying out training through back transmission, and updating the weight, and the error objective function is defined as follows:

wherein E represents a back propagation error, n is the number of samples, y _n Which represents the actual output of the device,

representing the prediction output.

Specifically, the optimal regularization factor of the Lasso regression model is determined in S4, which includes,

let X _i (x _i1 ，x _i2 ，…，x _ip ) ^T Is an input value of the model, wherein i =1,2, …, N is the number of samples, and p is a characteristic number of the production variable; y is _i Is the output value of the model. Then the objective function optimized after the Lasso regression adds the L1 regularization term is:

wherein j =1,2, …, N, β = (β) ₀ ，β ₁ ，…，β _j ) And λ is L1 regularization term coefficient for unknown parameters of dimension j × 1.

(III) advantageous effects

The invention has the beneficial effects that: according to the hot-rolled product quality prediction method based on the CNN algorithm and the Lasso regression model, the advantage of CNN extraction data characteristics is utilized, and Lasso regression is used for replacing the single-layer perceptron prediction output of the CNN, so that the problem of model overfitting can be relieved to a certain extent, and the prediction precision of the model is improved.

Drawings

FIG. 1 is a model diagram of a hot rolled product quality prediction method based on a CNN algorithm and a Lasso regression model;

FIG. 2 is a network architecture diagram for training a CNN;

FIG. 3 is a flow chart of a hot rolled product quality prediction method based on CNN algorithm and Lasso regression model.

Detailed Description

For the purpose of better explaining the present invention and to facilitate understanding, the present invention will be described in detail by way of specific embodiments with reference to the accompanying drawings.

The invention discloses a hot-rolled product quality prediction method based on a CNN algorithm and a Lasso regression model, which comprises the following steps,

s1: obtaining sample data for modeling in historical data of hot-rolled product performance, wherein the sample data comprises training data, and determining key input variables;

Specifically, in step S1, obtaining sample data for modeling in the history data on the properties of the hot rolled product includes:

s12: sampling the cleaned first data to obtain sampled second data;

s13: carrying out dimensionality reduction on the sampled second data to obtain third data subjected to dimensionality reduction;

Specifically, after step S14, further comprising,

s15: determining the proportion of training data and test data in sample data;

s16: a key output variable is selected.

Specifically, in step S4, the at least one correction of the unmodified hybrid prediction model includes:

Specifically, the specific method for acquiring the complete cleaned first data in step S11 includes: for data with abnormal values and missing values, a direct elimination processing mode is adopted to carry out cleaning work on the data;

the step S12 of obtaining the sampled second-time data includes: and sampling the data by using a systematic random sampling method.

Specifically, the data dimensionality reduction in S13 includes:

is the average of the input variable and the output variable.

In order to reduce the difference of the magnitude of the sample data, all variables need to be unified in the same magnitude, so that the data needs to be normalized, and the production variables can be converted from dimensional to non-dimensional. Specifically, the specific method of the data normalization process in S14 is,

let x _i ＝(x _i1 ,x _i2 ,…,x _ip ) I ∈ 2313, p is the special of the input variableAnd (4) characterizing the total number, wherein a formula for normalizing the data is as follows:

wherein the content of the first and second substances,

is a value distributed between 0 and 1 after normalization processing, k _min Corresponding to the minimum value of the production variable, k _max Corresponding to the maximum value of the production variable.

Specifically, the determining the ratio of the training data to the test data in the sample data in S15 includes:

modeling is carried out by a regression method and a neural network method respectively, and the average relative errors of the model predicted values and the model actual values under different proportions are compared to obtain the proportion of 8:2 between the training data and the test data, so 1850 groups of sample data are selected as the training data, and the rest 463 groups are selected as the test data.

Determining the factors influencing the mechanical properties of hot rolled products in production as 32 key input variables, and selecting three mechanical property indexes of elongation at break and yield strength) and tensile strength as key output variables.

the process of establishing the feature vector model comprises two aspects of forward propagation and backward propagation,

A. forward propagation

wherein the content of the first and second substances,

is the input of the convolution characteristic of the previous layer,

in order to convolve the kernel matrix with the desired pattern,

is an offset;

wherein, the first and the second end of the pipe are connected with each other,

is the output of the pond layer after the action,

pooling weight, pooling is pooling operation;

B. counter-propagating

representing the prediction output.

The feature vector model obtained by training the CNN is finally input by using a single-layer perceptron, a predicted value is output through a full-connection layer, the output mode can cause the over-fitting problem and further influences the prediction precision, and the Lasso regression method can relieve the over-fitting problem by changing the regularization coefficient, so that Lasso regression can be used for replacing the CNN single-layer perceptron output layer for prediction, and the improvement of the feature vector model obtained by training the CNN is realized. Lasso regression, i.e., minimum absolute value shrinkage and selection operators, alleviates the problem of overfitting of the model by adding L1 regularization.

With the increase of the regularization term coefficient lambda, some model coefficients can be reduced to 0 even under the condition of keeping data characteristics, and the effect of reducing sum and selecting variables is achieved, so that the problem of model overfitting is effectively relieved.

The CNN is used as a special artificial neural network, has the ability of autonomous learning, and has the advantages of two functions of local connection and weight sharing. The function of replacing the whole characteristic by partial characteristic is realized by the local connection of the neuron and the neuron of the previous layer; the weight sharing means that the weights of the neurons in the same layer are the same, and the number of parameters processed by the model can be reduced. In addition, the Lasso regression can compress variables with smaller estimation parameters to zero by adding a regularization coefficient, so that the complexity and instability of a prediction model are reduced, and the over-fitting problem is avoided. According to the method, CNN is introduced to replace a full-connection neural network to model the quality prediction of the rolled steel product, and the CNN is easy to generate overfitting by using a mode of outputting a predicted value by a single-layer perceptron, so that the prediction precision of a test set is reduced, so that the hot rolled product quality prediction method based on the CNN algorithm and the Lasso regression model is provided, the advantage of CNN extraction data characteristics is kept, the advantage of the Lasso regression model for relieving the overfitting problem is also utilized, and the complexity and the instability of the model are reduced.

The technical principles of the present invention have been described above in connection with specific embodiments, which are intended to explain the principles of the present invention and should not be construed as limiting the scope of the present invention in any way. Based on the explanations herein, those skilled in the art will be able to conceive of other embodiments of the present invention without inventive efforts, which shall fall within the scope of the present invention.

Claims

1. A hot-rolled product quality prediction method based on a CNN algorithm and a Lasso regression model is characterized by comprising the following steps,

2. The CNN algorithm and Lasso regression model based hot rolled product quality prediction method according to claim 1, wherein said step S1 of obtaining sample data for modeling in historical data about hot rolled product properties comprises:

s12: sampling the cleaned first-time data to obtain sampled second-time data;

s14: and carrying out normalization processing on the third time data after dimensionality reduction to finally obtain sample data for modeling.

3. The CNN algorithm and Lasso regression model based hot rolled product quality prediction method according to claim 2, further comprising, after step S14,

s15: determining the proportion of training data and test data in sample data;

s16: a key output variable is selected.

4. The CNN algorithm and Lasso regression model based hot rolled product quality prediction method according to claim 3, wherein the step S4 of at least one correction of the uncorrected hybrid prediction model comprises:

5. The hot rolled product quality prediction method based on CNN algorithm and Lasso regression model according to claim 4,

step S11 of acquiring complete cleaned first data includes: for data with abnormal values and missing values, a direct elimination processing mode is adopted to carry out cleaning work on the data;

6. The CNN algorithm and Lasso regression model based hot rolled product quality prediction method according to claim 5, wherein the data dimensionality reduction in step S13 comprises,

is the average of the input variable and the output variable.

7. The hot rolled product quality prediction method based on CNN algorithm and Lasso regression model according to claim 6, wherein the data normalization in S14 is performed by,

wherein the content of the first and second substances,

8. The hot rolled product quality prediction method based on CNN algorithm and Lasso regression model according to claim 7,

in S15, the ratio of training data to test data in the sample data is determined, including,

9. The CNN algorithm and Lasso regression model based hot rolled product quality prediction method according to claim 8, wherein the CNN is trained in step S2 to obtain a feature vector model, comprising,

A. forward propagation

wherein the content of the first and second substances,

is the input of the convolution characteristic of the previous layer,

in order to convolve the kernel matrix with the desired pattern,

is an offset;

wherein the content of the first and second substances,

is the output of the pond layer after the action,

pooling weight, pooling is pooling operation;

B. counter-propagating

representing the prediction output.

10. The CNN algorithm and Lasso regression model based hot rolled product quality prediction method according to claim 9, characterized in that the optimal regularization factor of the Lasso regression model determined in step S4 comprises,

let X _i (x _i1 ，x _i2 ，…，x _ip ) ^T Is an input value of the model, wherein i =1,2, …, N is the number of samples, and p is a characteristic number of the production variable; y is _i Is the output value of the model; then the objective function optimized after the Lasso regression adds the L1 regularization term is: