CN112215351B - Enhanced multi-scale convolution neural network soft measurement method - Google Patents

Enhanced multi-scale convolution neural network soft measurement method

Info

Publication number
CN112215351B
Authority
CN
China
Prior art keywords
data
sample
convolution
dimensional
neural network
Prior art date
Legal status
Active
Application number
CN202010994589.7A
Other languages
Chinese (zh)
Other versions
CN112215351A (en
Inventor
葛志强 (Ge Zhiqiang)
江肖禹 (Jiang Xiaoyu)
Current Assignee
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date
Filing date
Publication date
Application filed by Zhejiang University (ZJU)
Priority to CN202010994589.7A
Publication of CN112215351A
Application granted
Publication of CN112215351B
Legal status: Active

Classifications

    • G06N 3/045 — Physics; Computing; Computing arrangements based on biological models; Neural networks; Architecture, e.g. interconnection topology; Combinations of networks
    • G06F 16/90335 — Electric digital data processing; Information retrieval; Details of database functions; Querying; Query processing
    • G06F 17/16 — Digital computing or data processing equipment or methods; Complex mathematical operations; Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • G06N 3/084 — Computing arrangements based on biological models; Neural networks; Learning methods; Backpropagation, e.g. using gradient descent


Abstract

The invention discloses an enhanced multi-scale convolutional neural network soft measurement method, which performs data enhancement by reconstructing coarse-grained features and builds a regression model with a convolutional neural network. Process variable data from adjacent time periods are spliced into a data matrix that replaces the original fine-grained features as the sample input, addressing the unbalanced sampling of process variables and the inexact correspondence between sample inputs and outputs. A generative adversarial network expands the sample set, and a controlled quantity of generated samples augments the original data set to compensate for locally missing process data. One-dimensional and two-dimensional convolution kernels extract the coarse-grained features simultaneously, taking into account both the dynamics of individual variables and the time differences between variables, which addresses the time-delay problem between process variables. Alternating convolution and pooling operations reduce the data to one dimension, and a multilayer perceptron establishes the prediction model, enabling real-time soft measurement of the target quality variable.

Description

Enhanced multi-scale convolution neural network soft measurement method
Technical Field
The invention belongs to the field of industrial process soft measurement, and particularly relates to an enhanced multi-scale convolution neural network soft measurement method.
Background
In the era of industrial big data, data-driven soft sensors have become an important means of guiding industrial production and control. In actual industry, however, different kinds of data are acquired in different ways: most process variables in production can be measured directly by a distributed control system (DCS), whereas some quality variables must be determined by chemical analysis or specialized instruments. The process variable data and the quality variable data serve, respectively, as the input and output values of a sample. Because of its high measurement cost, the quality variable data of an industrial product are far scarcer than the process variable data. This unbalanced sampling of the different variables breaks the basic premise of traditional soft sensor modeling that every training sample must have both an input value and an output value, and a large amount of process variable data is wasted.
In addition, because the measurement points of the DCS are distributed at different locations in the production equipment, data collected at the same moment often do not correspond to the same batch of material, which causes time delays between the variable data. Likewise, the time difference between sampling the quality variable and sampling the process variables makes the correspondence between a sample's input and output values inexact. Both problems further limit the performance of soft sensors.
Because of these three problems, fine-grained features (process variable data at a single moment) cannot provide effective process information for soft sensor modeling, so a good soft measurement result cannot be obtained. Moreover, when the already small amount of quality variable data suffers local losses for whatever reason, the model is degraded further. A soft measurement method that specifically addresses these practical problems is therefore very valuable in industrial practice.
Disclosure of Invention
The object of the invention is to provide an enhanced multi-scale convolutional neural network soft measurement method that addresses the shortcomings of the prior art. The invention predicts industrial quality variable data based on a generative adversarial network and a convolutional neural network (CNN) structure.
The purpose of the invention is realized by the following technical scheme: an enhanced multi-scale convolution neural network soft measurement method comprises the following steps:
step one, data preprocessing: let the quality variable be Y and the process variables be X_i, i = 1, 2, …, m, where m is the number of process variables. The quality variable value collected at time t is taken as the output value y_t of sample S_t, t = 1, 2, …. Let the residence time of the material in the whole process be n; the process variable values x_{i·j}, j = 1, 2, …, n, collected over the period [t−n, t] are spliced together as the input value of sample S_t:

$$D_t=\begin{bmatrix}x_{1\cdot 1} & x_{1\cdot 2} & \cdots & x_{1\cdot n}\\ x_{2\cdot 1} & x_{2\cdot 2} & \cdots & x_{2\cdot n}\\ \vdots & \vdots & \ddots & \vdots\\ x_{m\cdot 1} & x_{m\cdot 2} & \cdots & x_{m\cdot n}\end{bmatrix}$$

referred to as the coarse-grained feature. The process variables are then normalized by min-max normalization to eliminate dimensional differences between the variables.
Step two, data enhancement: a generative adversarial network is used to augment the samples S_t, obtaining new generated samples S*.
Step three, feature extraction: the convolution operation employs a 1×U one-dimensional convolution kernel and a V×W two-dimensional convolution kernel simultaneously to extract features from the normalized coarse-grained features of samples S_t and S*. The feature maps extracted by the convolution kernels at different scales are then fused through a 1×1 convolution, and the extracted feature maps are reduced in dimension by a pooling layer. Through repeated convolution and pooling operations, the coarse-grained features are completely converted into serialized one-dimensional features.
Step four, regression modeling: regression modeling is carried out on the serialized one-dimensional features through a multilayer perceptron containing a single hidden layer. The model constructed in steps three and four is trained through error back-propagation, and the network parameters are corrected.
Step five, real-time prediction: for the real-time process variable data collected in actual production, the model trained in step four is used to predict the quality variable corresponding to the real-time process variables.
Further, in step two, because the input value D_t of a sample is a data matrix while its output value is a scalar, the input data of the generative adversarial network are first serialized and spliced into a one-dimensional vector. The generative adversarial network is then trained to obtain one-dimensional generated data. Finally, the generated data are split and reconstructed into complete samples S*. The generated samples S* compensate for sample data S_t that are locally missing over a certain period of time.
Further, in step three, the convolutional layers are followed by average pooling, whose result is

$$p_l = \frac{1}{b}\sum_{k\in R_l,\; r_k\le b} a_k$$

where b denotes the ordinal threshold of the activation values involved in the pooling, R_l denotes the pooling region in the l-th feature map, k indexes the activation values in the region, and r_k and a_k are the ordinal (rank) and the value of activation k, respectively.
Further, in the model constructed in steps three and four, the activation function between the convolutional and pooling layers is a rectified linear function, defined as:
f(x)=max(0,x)
where x represents the input from the previous layer of the network.
The invention has the following beneficial effects:
1. By splicing the process variable data of adjacent time periods, process variable data that previously could not participate in modeling are put to use. At the same time, the fine-grained features in the sample are converted into coarse-grained features, which better preserve the information in the process data and resolve the problems of variable time delay and the non-correspondence between sample inputs and outputs.
2. The generative adversarial network is used to augment the samples, expanding the absolute number of samples and mitigating the effect of locally missing data on soft measurement modeling.
3. The multi-scale convolution kernels extract the coarse-grained features in a targeted manner: the one-dimensional and two-dimensional kernels address, respectively, the temporal dynamics of individual variables and the time delays between variables, which better matches the characteristics of process data.
Drawings
FIG. 1 is a schematic diagram of data preprocessing;
FIG. 2 is a schematic diagram of data enhancement;
FIG. 3 is a schematic diagram of a soft measurement model structure;
FIG. 4 is a schematic diagram of a multi-scale convolution kernel.
Detailed Description
The enhanced multi-scale convolution kernel CNN soft measurement method of the present invention is further described in detail below with reference to specific embodiments.
The invention discloses an enhanced multi-scale convolutional neural network soft measurement method that predicts industrial quality variable data based on a generative adversarial network and a convolutional neural network (CNN) structure, and comprises the following steps:
step one (data preprocessing): the data preprocessing process is shown in FIG. 1. Let the quality variable be Y and the process variables be X_i, i = 1, 2, …, m, where m is the number of process variables. The quality variable value collected at time t is taken as the output value y_t of sample S_t, t = 1, 2, …. Let the residence time of the material in the whole process be n; the process variable values x_{i·j}, i = 1, 2, …, m, j = 1, 2, …, n, collected over the period [t−n, t] are spliced together as the input value of sample S_t:

$$D_t=\begin{bmatrix}x_{1\cdot 1} & x_{1\cdot 2} & \cdots & x_{1\cdot n}\\ x_{2\cdot 1} & x_{2\cdot 2} & \cdots & x_{2\cdot n}\\ \vdots & \vdots & \ddots & \vdots\\ x_{m\cdot 1} & x_{m\cdot 2} & \cdots & x_{m\cdot n}\end{bmatrix}$$

The sample input obtained by this reconstruction is called the coarse-grained feature D_t. The process variables are then normalized by min-max normalization to eliminate dimensional differences between the variables.
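To make this step concrete, a minimal Python/NumPy sketch of the splicing and min-max normalization is given below; the function name, the exact alignment of the window [t−n, t], and the handling of the normalization constants are illustrative assumptions rather than details taken from the patent.

```python
import numpy as np

def build_coarse_grained_samples(X, y_by_time, n):
    """X: array of shape (T, m) with the process-variable readings over time.
    y_by_time: dict mapping a sampling time t to the measured quality value y_t.
    n: residence time of the material, in sampling steps.
    Returns a list of (D_t, y_t) pairs, D_t being the m x n coarse-grained
    feature spliced from the readings in [t-n, t)."""
    # min-max normalization per process variable removes dimensional differences
    x_min, x_max = X.min(axis=0), X.max(axis=0)
    Xn = (X - x_min) / (x_max - x_min + 1e-12)

    samples = []
    for t, y_t in sorted(y_by_time.items()):
        if t >= n:                       # need a full window of history
            D_t = Xn[t - n:t].T          # shape (m, n): one row per variable
            samples.append((D_t, y_t))
    return samples
```

With m = 11 variables and n = 30 readings, this yields the 30 × 11 coarse-grained matrices used in the embodiment below.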
Step two (data enhancement): data enhancement procedure As shown in FIG. 2, the samples S are paired with the generation countermeasure networktCarrying out data enhancement to obtain a new generated sample S*. Due to the input value D of the sampletIs a data matrix and the data value is a value, so that the input data D of the countermeasure network is generated firsttAnd carrying out serialization and splicing treatment to obtain a one-dimensional vector. And then, one-dimensional generated data is obtained by training the generation confrontation network. Finally, the generated data is divided and reconstructed into a complete sample S*. Generating a sample S*Will compensate for the sample StData is locally missing for a certain period of time.
Step three (feature extraction): as shown in FIG. 4, the multiscale convolution operation uses both a 1 × U one-dimensional convolution kernel and a V × W two-dimensional convolution kernel for the sample StAnd sample S*Extracting the characteristics of the medium and coarse granularity characteristics; u, V, W are selected in combination with the actual data situation. Unlike ordinary image data, the coarse-grained features of process variable data have different physical meanings on different coordinate axes. Therefore, we want to use multi-scale features to extract the coarse-grained features sufficiently to obtain complete process information. The one-dimensional convolution core extracts dynamic characteristics of the time sequence of each process variable, and the two-dimensional convolution core extracts time delay relation characteristics of the time sequence matrix of the adjacent process variable. And then fusing feature maps extracted by convolution kernels under different scales through convolution of 1x 1.
After the convolution operation, the feature maps are reduced in dimension by a pooling layer. As shown in FIG. 3, the coarse-grained feature D_t is completely converted into serialized one-dimensional features through repeated convolution and pooling operations. To make full use of the available information, the convolutional layers are followed by average pooling, whose result is

$$p_l = \frac{1}{b}\sum_{k\in R_l,\; r_k\le b} a_k$$

where b denotes the ordinal threshold of the activation values involved in the pooling, R_l denotes the pooling region in the l-th feature map, k indexes the activation values in the region, and r_k and a_k are the ordinal (rank) and the value of activation k, respectively.
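A small NumPy sketch of this pooling rule applied to a single pooling region follows; it reads the ordinal threshold b as "average the b highest-ranked activations", so that choosing b equal to the region size recovers plain average pooling. The function name and this reading are illustrative assumptions.

```python
import numpy as np

def ranked_average_pool(region, b):
    """region: the activations a_k inside one pooling region R_l.
    b: ordinal threshold, i.e. how many top-ranked activations are averaged."""
    a_sorted = np.sort(region.ravel())[::-1]   # activations ordered by rank r_k
    b = min(b, a_sorted.size)
    return a_sorted[:b].mean()

# ranked_average_pool(np.array([[0.2, 0.9], [0.4, 0.7]]), b=4) == 0.55 (plain average)
```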
Step four (regression modeling): and carrying out regression modeling on the serialized one-dimensional features through a multilayer perceptron comprising a single hidden layer. And training the model through error back propagation, and correcting network parameters.
Step three and step four constitute the basic structure of the multi-scale convolution CNN soft measurement model shown in FIG. 3. The activation function between convolutional layer and pooling uses a linear rectification function (ReLU), whose formula is as follows:
f(x)=max(0,x)
where x represents the input from the previous layer of the network. The ReLU lets the network perform gradient descent and back-propagation more efficiently, avoiding the problems of exploding and vanishing gradients.
Step five (real-time prediction): and aiming at the fact that in actual production, collected real-time process variable data form coarse granularity characteristics, and the multi-scale convolution CNN soft measurement model trained in the third step, the fourth step is utilized to predict the coarse granularity characteristics in real time.
The enhanced multi-scale convolutional neural network soft measurement method is illustrated with a specific industrial example, a CO2 absorption tower. The CO2 absorption tower is an important part of the ammonia synthesis process, and the CO2 content of the mixed gas sent to the ammonia synthesis tower can adversely affect the product. The CO2 absorption tower process involves 11 variables, as shown in Table 1. By modeling these variables, the CO2 content of the mixed gas is predicted.
Table 1: CO2 absorption tower process variables
[The table of the 11 process variables is provided as an image in the original document.]
In the data preprocessing stage, 30 consecutive sets of process variable data are spliced to obtain a coarse-grained feature of size 30 × 11. The generative adversarial network is a Wasserstein GAN with gradient penalty (WGAN-GP), which generates 500 new samples. The convolutional layers use one-dimensional kernels of size (1 × 3) and (1 × 5) and two-dimensional kernels of size (3 × 5) and (3 × 7), with a stride of 1. The pooling layers have size (2 × 2) with a stride of 2. The optimization algorithm is stochastic gradient descent with a learning rate of 0.01.
In the experiment, 1000 samples are collected; the earliest 500 are used as training samples and the remaining 500 as test samples, and 20% of the training samples are randomly deleted to simulate the problem of local data loss. Table 2 shows the prediction results of this method and other models; in terms of RMSE, this method achieves the best result.
Table 2: soft measurement model comparison
Soft measurement model          RMSE
Multilayer perceptron           0.01598
Convolutional neural network    0.01298
The present invention           0.01054
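For reference, the RMSE values in Table 2 correspond to the usual root-mean-square error; a short NumPy sketch of the metric and of the random deletion of 20% of the training labels used to simulate local data loss is given below (the random seed and variable names are assumptions).

```python
import numpy as np

def rmse(y_true, y_pred):
    """Root-mean-square error between measured and predicted quality values."""
    return float(np.sqrt(np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2)))

rng = np.random.default_rng(0)
train_idx = np.arange(500)                                         # first 500 samples train
keep = rng.choice(train_idx, size=int(0.8 * 500), replace=False)   # drop 20% of the labels
```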

Claims (4)

1. An enhanced multi-scale convolution neural network soft measurement method is characterized by comprising the following steps:
step one, data preprocessing: let the quality variable be Y and the process variables be X_i, i = 1, 2, …, m, where m is the number of process variables; taking the quality variable value collected at time t as the output value y_t of sample S_t, t = 1, 2, …; setting the residence time of the material in the whole process as n, and splicing the process variable values x_{i·j}, j = 1, 2, …, n, collected over the period [t−n, t] into the input value of sample S_t,

$$D_t=\begin{bmatrix}x_{1\cdot 1} & x_{1\cdot 2} & \cdots & x_{1\cdot n}\\ x_{2\cdot 1} & x_{2\cdot 2} & \cdots & x_{2\cdot n}\\ \vdots & \vdots & \ddots & \vdots\\ x_{m\cdot 1} & x_{m\cdot 2} & \cdots & x_{m\cdot n}\end{bmatrix}$$

referred to as the coarse-grained feature; then normalizing the process variables by min-max normalization to eliminate dimensional differences between the variables;
step two, data enhancement: performing data enhancement on the samples S_t with a generative adversarial network to obtain new generated samples S*;
Step three, feature extraction: the convolution operation simultaneously adopts a 1 XU one-dimensional convolution kernel and a V XW two-dimensional convolution kernel to perform convolution on the sample StAnd sample S*Performing feature extraction on the coarse-grained features after medium normalization; then fusing feature maps extracted by convolution kernels under different scales through convolution of 1x 1; then, performing dimensionality reduction on the extracted feature map by using a pooling layer; completely converting the coarse grain characteristics into serialized one-dimensional characteristics through repeated convolution and pooling operations;
step four, regression modeling: carrying out regression modeling on the serialized one-dimensional features through a multilayer perceptron containing a single hidden layer; training the model constructed in steps three and four through error back-propagation, and correcting the network parameters;
step five, real-time prediction: for the real-time process variable data collected in actual production, using the model trained in step four to predict the quality variable corresponding to the real-time process variables.
2. The enhanced multi-scale convolutional neural network soft measurement method of claim 1, wherein: in step two, because the input value D_t of a sample is a data matrix while its output value is a scalar, the input data of the generative adversarial network are first serialized and spliced into a one-dimensional vector; the generative adversarial network is then trained to obtain one-dimensional generated data; finally, the generated data are split and reconstructed into complete samples S*; the generated samples S* compensate for sample data S_t that are locally missing over a certain period of time.
3. The enhanced multi-scale convolutional neural network soft measurement method of claim 1, wherein: in step three, the convolutional layers are followed by average pooling, whose result is

$$p_l = \frac{1}{b}\sum_{k\in R_l,\; r_k\le b} a_k$$

where b denotes the ordinal threshold of the activation values involved in the pooling, R_l denotes the pooling region in the l-th feature map, k indexes the activation values in the region, and r_k and a_k are the ordinal (rank) and the value of activation k, respectively.
4. The enhanced multi-scale convolutional neural network soft measurement method of claim 1, wherein: in the model constructed in steps three and four, the activation function between the convolutional and pooling layers is a rectified linear function, defined as:
f(x)=max(0,x)
where x represents the input from the previous layer of the network.
CN202010994589.7A 2020-09-21 2020-09-21 Enhanced multi-scale convolution neural network soft measurement method Active CN112215351B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010994589.7A CN112215351B (en) 2020-09-21 2020-09-21 Enhanced multi-scale convolution neural network soft measurement method


Publications (2)

Publication Number Publication Date
CN112215351A CN112215351A (en) 2021-01-12
CN112215351B (en) 2022-05-03

Family

ID=74050184

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010994589.7A Active CN112215351B (en) 2020-09-21 2020-09-21 Enhanced multi-scale convolution neural network soft measurement method

Country Status (1)

Country Link
CN (1) CN112215351B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113408578B (en) * 2021-05-13 2022-10-25 华南理工大学 Harmonic reducer fault diagnosis method and system based on generative countermeasure network

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108960415B (en) * 2017-05-23 2021-04-20 上海寒武纪信息科技有限公司 Processing apparatus and processing system
CN109188903A (en) * 2018-08-20 2019-01-11 北京化工大学 The flexible measurement method of CNN furnace operation variable based on memory-enhancing effect optimization
CN110322077B (en) * 2019-07-10 2022-08-02 燕山大学 Cement raw material vertical mill raw material fineness index prediction method based on convolutional neural network
CN110378044B (en) * 2019-07-23 2021-06-11 燕山大学 Multi-time scale convolution neural network soft measurement method based on attention mechanism

Also Published As

Publication number Publication date
CN112215351A (en) 2021-01-12


Legal Events

Code  Description
PB01  Publication
SE01  Entry into force of request for substantive examination
GR01  Patent grant