CN112070156A

CN112070156A - Gas emission concentration prediction method and system based on GRU network

Info

Publication number: CN112070156A
Application number: CN202010928729.0A
Authority: CN
Inventors: 李德波; 廖宏楷; 周杰联; 陈拓; 成明涛; 冯永新
Original assignee: Guangdong Electric Power Science Research Institute Energy Technology Co Ltd
Current assignee: Guangdong Electric Power Science Research Institute Energy Technology Co Ltd
Priority date: 2020-09-07
Filing date: 2020-09-07
Publication date: 2020-12-11
Anticipated expiration: 2040-09-07
Also published as: CN112070156B

Abstract

The application discloses a gas emission concentration prediction method and a gas emission concentration prediction system of a GRU network, wherein the method comprises the steps of obtaining a first parameter data set for predicting the gas emission concentration; performing feature selection by using the first parameter data set to obtain a plurality of feature vectors for inputting to a GRU network for network training; dividing a first parameter data set into a training set and a test set according to the obtained multiple feature vectors; taking the training set as the input of the GRU network to be trained, training the GRU network, and outputting a pre-training network until an iterative convergence condition is met; and taking the test set as the input of the pre-training network, and obtaining a trained gas emission concentration prediction model by measuring the deviation between a predicted value output by the pre-training network and a given actual value when the deviation value is smaller than a preset deviation threshold value.

Description

Gas emission concentration prediction method and system based on GRU network

Technical Field

The application relates to the technical field of environmental engineering detection, in particular to a gas emission concentration prediction method and system based on a GRU network.

Background

With the increasingly prominent problem of atmospheric pollution, the emission of atmospheric pollutants of thermal power generating units in China is strictly regulated. The traditional low NOx combustion control is difficult to meet the emission standard, and the SCR flue gas denitration technology is widely adopted at home and abroad by matching with a tail flue gas denitration device.

In the current implementation scheme, a PCA method is utilized to extract the characteristics of original data, and a NOx emission prediction model of a denitration system is established by an RNN neural network. However, principal components subjected to PCA dimension reduction analysis are fuzzy when explaining the meanings of the principal components, and cannot be clear and accurate as those described in original variables; meanwhile, the data size after dimension reduction is small, so that the reference significance in practical industrial application is not large.

The traditional artificial neural network RBF model has high spatial complexity, and meanwhile, the prediction result does not meet the requirement of practical application. The decision coefficient R2 of the LSSVM model is better than that of the RBF model, but the root mean square error and the average absolute error of the LSSVM model are higher than those of the RBF model, so that the stability of predicting the gas emission concentration by using the LSSVM model is insufficient.

Disclosure of Invention

The application provides a gas emission concentration prediction method and system based on a GRU network, and the advantages of the GRU network in model accuracy, generalization capability and training speed are utilized, so that the conditions of lag and inaccurate measured value in gas concentration measurement are avoided, and the prediction accuracy is effectively improved.

In view of the above, a first aspect of the present application provides a method for predicting a gas emission concentration from a GRU network, the method comprising:

acquiring a first parameter data set for predicting gas emission concentration;

performing feature selection on the first parameter data set to obtain a plurality of feature vectors for inputting to a GRU network for network training;

dividing the first parameter data set into a training set and a test set according to the obtained multiple feature vectors;

taking a training set as the input of the GRU network, training the GRU network, and outputting a pre-training network until an iterative convergence condition is met;

and taking the test set as the input of the pre-training network, calculating the deviation between a predicted value output by the pre-training network and a given actual value, and if the deviation value is smaller than a preset deviation threshold value, obtaining a gas emission concentration prediction model.

Optionally, before performing feature selection on the first parameter data set, the method further includes:

filtering abnormal operation data from the first parameter data set by adopting a singular value inspection method or a threshold inspection method, wherein the abnormal operation data filtering by adopting the singular value inspection method comprises the following steps:

performing a singular value check on items of data included in the first parameter data set;

defining an abnormal value according to a singular value inspection result;

removing abnormal values from the first parameter dataset or defining abnormal values as missing values and performing completion of the missing values;

the threshold value checking method comprises the step of selecting abnormal operation data with parameter values lower than a preset detection threshold value from the first parameter data set.

Optionally, the filtering, by using a singular value inspection method or a threshold inspection method, the abnormal operation data from the first parameter data set, and then further includes:

and adopting a Z-score data standardization method to carry out standardization processing on the second parameter data set with the filtered abnormal operation data.

Optionally, the feature selection performed by the first parameter data set specifically includes:

the characteristic selection of the normalized second parameter data set by using a mutual information characteristic selection algorithm comprises the following steps:

combining the data included in the second parameter data set in pairs in sequence to obtain a plurality of combination sequences, and calculating mutual information among elements aiming at each combination sequence;

sequencing a plurality of items of mutual information according to the value of the mutual information to obtain a mutual information sequencing sequence;

selecting top k elements from the mutual information sorting sequence, selecting corresponding data items from the second parameter data set based on the selected k elements, and using the corresponding data items as the input of the GRU network; k is 1 or more.

Optionally, the normalizing the second parameter data set by using the Z-score data normalization method specifically includes:

calculating the mean value of the overall data in the second parameter data set by using the formula (1)

Calculating to obtain a standard deviation s of the overall data in the second parameter data set by using a formula (2);

combining the mean value and standard deviation of the overall data, and obtaining a final standardized processing result Z by using a formula (3)_i：

Where n denotes the total number of feature vectors in the second parameter dataset, x_iAnd representing the mean value corresponding to the ith feature vector.

Optionally, the hidden layer of the GRU network builds a 4-layer recurrent neural network by using GRU cells, wherein:

the nodes contained in each layer are respectively: 128. 128, 64 and 32.

Optionally, the calculating a deviation between the predicted value output by the pre-training network and the given actual value specifically includes:

after a first deviation value, a second deviation value and a third deviation value between the predicted value and the given actual value are respectively calculated through a root mean square error calculation formula, an average absolute value error calculation formula and a decision coefficient calculation formula, the first deviation value, the second deviation value and the third deviation value are averaged or accumulated to obtain the deviation between the predicted value and the given actual value.

A second aspect of the present application provides a gas emission concentration prediction system by a GRU network, the system comprising:

a first parameter data set acquisition unit for acquiring a first parameter data set for predicting a gas emission concentration;

the characteristic selection unit is used for carrying out characteristic selection on the first parameter data set by utilizing a mutual information characteristic selection algorithm to obtain a plurality of characteristic vectors for inputting to a GRU network for network training; the characteristic selection by utilizing a mutual information characteristic selection algorithm comprises the following steps:

combining the data included in the first parameter data set in pairs in sequence to obtain a plurality of combination sequences, and calculating mutual information among elements aiming at each combination sequence;

selecting top k elements from the mutual information sorting sequence, selecting corresponding data items from the first parameter data set based on the selected k elements, and using the corresponding data items as the input of the GRU network; k is greater than or equal to 1;

the data dividing unit is used for dividing the first parameter data set into a training set and a test set according to the obtained multiple feature vectors;

the GRU network training unit is used for taking a training set as the input of the GRU network, training the GRU network and outputting a pre-training network until an iteration convergence condition is met;

and the prediction result output unit is used for taking the test set as the input of the pre-training network, calculating the deviation between a predicted value output by the pre-training network and a given actual value, and obtaining a gas emission concentration prediction model if the deviation value is smaller than a preset deviation threshold value.

Optionally, the system further comprises a data filtering unit; wherein:

the data filtering unit is used for filtering abnormal operation data from the first parameter data set by adopting a singular value inspection method or a threshold inspection method; when the abnormal operation data filtering is carried out by adopting a singular value inspection method, the method comprises the following steps:

defining an abnormal value according to a singular value inspection result;

when the threshold value checking method is adopted, abnormal operation data with a parameter value lower than a detection threshold value are filtered from the first parameter data set by setting the detection threshold value.

Optionally, the system further comprises a normalization processing unit, wherein:

the normalization processing unit is configured to, after filtering the abnormal operation data from the first parameter data set by using the singular value inspection method or the threshold inspection method, perform normalization processing on the second parameter data set with the filtered abnormal operation data by using a Z-score data normalization method.

The application provides a gas emission concentration prediction method and a gas emission concentration prediction system of a GRU network, and the method comprises the steps of obtaining a first parameter data set for predicting the gas emission concentration; performing feature selection by using the first parameter data set to obtain a plurality of feature vectors for inputting to a GRU network for network training; dividing a first parameter data set into a training set and a test set according to the obtained multiple feature vectors; taking the training set as the input of the GRU network to be trained, training the GRU network, and outputting a pre-training network until an iterative convergence condition is met; and taking the test set as the input of the pre-training network, and obtaining a trained gas emission concentration prediction model by measuring the deviation between a predicted value output by the pre-training network and a given actual value when the deviation value is smaller than a preset deviation threshold value.

The method and the device utilize the advantages of the GRU network in model precision, generalization capability and training speed, and avoid the situation that gas concentration measurement lags; in order to improve the accuracy of the measured value, filtering abnormal operation data is performed in sequence before feature selection is performed on the acquired data set, and the filtered data set is subjected to standardized processing, so that the acquired data meet the actual field requirements, and the prediction precision is effectively improved; the GRU neural network structure with 4 layers is adopted, the space complexity of the network is reduced, and the requirement of practical application is met.

Drawings

FIG. 1 is a method flow diagram of one embodiment of a method of predicting a concentration of a gas emission by a GRU network of the present application;

FIG. 2 is a method flow diagram of a second embodiment of a method of predicting gas emission concentration from a GRU network of the present application;

fig. 3 is a schematic structural diagram of a first embodiment of a gas emission concentration prediction system of a GRU network according to the present application.

Detailed Description

In order to make the technical solutions of the present application better understood, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art from the embodiments in the present application without any creative effort belong to the protection scope of the present application.

Example 1:

referring to fig. 1, fig. 1 is a flowchart illustrating a method for predicting a gas emission concentration from a GRU network according to an embodiment of the present application, as shown in fig. 1, specifically:

s100, acquiring a first parameter data set for predicting gas emission concentration;

s200, performing feature selection by using the first parameter data set to obtain a plurality of feature vectors for inputting to a GRU network for network training;

wherein, to SCR flue gas denitration system NO_XFor example, when data acquisition in an earlier stage is performed, an existing flue gas pollution source emission process (working condition) monitoring system is used for monitoring a denitration system in real time, and then data samples of parameters related to the training of a GRU network model are acquired from a power station DCS, wherein the data samples comprise parameter variables such as ammonia injection mass flow, boiler load, SCR inlet flue gas temperature, SCR inlet flue gas oxygen content, SCR inlet NOx mass concentration and SCR outlet NOx concentration.

Specifically, when feature selection is performed on the first parameter data set, a mutual information feature selection algorithm is considered, and measurement of interdependence between variables is performed based on mutual information or transfer information between different random variables.

S300, carrying out data division according to the obtained multiple characteristic vectors to construct a training set and a test set;

it should be noted that, the training set and the test set are constructed by including sample set time series data trained by the NOx emission prediction model of the NOx removal system according to the data structure required by the network model.

S400, taking the training set as the input of the GRU network, training the GRU network, and outputting a pre-training network until an iteration convergence condition is met. In the training process, when the loss function value meets the iterative convergence condition, a pre-training network is output; specifically, the network prediction adopts an iterative point-by-point prediction method.

In order to reduce the spatial complexity of the network and meet the requirements of practical application, the hidden layer of the GRU network in this embodiment adopts GRU cells, and a 4-layer recurrent neural network is built, wherein:

the nodes contained in each layer are respectively: 128. 128, 64 and 32.

It should be noted that deep learning often requires a lot of time and computer resources for training, which is also a significant reason for the development of deep learning algorithms. Although learning of the model can be accelerated by adopting distributed parallel training, the required computing resources are not reduced. Based on the fact that the learning speed and the learning effect of the machine can be fundamentally accelerated only by an optimization algorithm which needs fewer resources and enables the model to be converged faster, an Adam optimization algorithm is adopted when model optimization is conducted, the Adam optimization algorithm is an extended form of a random gradient descent algorithm, and the Adam optimization algorithm can update the weight of the neural network based on iteration of training data.

When the Loss function value is judged to meet the iterative convergence condition, the output value and the input value of each unit of the hidden layer are generally calculated first, then the output value of each unit of the output layer is calculated, and finally the Loss function Loss value is calculated, wherein:

the loss function is used for measuring the inconsistency degree between the predicted value f (x) and the true value Y of the network model, is a non-negative real value function and is generally expressed by L (Y, f (x)), and the smaller the loss function is, the better the robustness of the model is.

And when the Loss value does not meet the iteration convergence condition, adjusting the connection weight of each layer of the network. In the iteration process, by continuously updating the training parameters, when the iteration is completed, namely when the loss function value meets the iteration convergence condition, the convergent GRU pre-training network is obtained.

S500, taking the test set as the input of a pre-training network, calculating the deviation between a predicted value output by the pre-training network and a given actual value, and if the deviation value is smaller than a preset deviation threshold value, obtaining a gas emission concentration prediction model; and under the condition that the deviation value is greater than a preset deviation threshold value, returning to the GRU network iterative training process until the pre-training network meeting the preset deviation threshold value is output.

Specifically, when measuring the deviation between the predicted value and the given actual value, in this embodiment, after the first deviation value, the second deviation value, and the third deviation value between the predicted value and the given actual value are respectively calculated by the root mean square error calculation formula, the average absolute value error calculation formula, and the decision coefficient calculation formula, the first deviation value, the second deviation value, and the third deviation value are averaged or accumulated to obtain the deviation between the predicted value and the given actual value.

It should be noted that, when the actual deviation solution is performed, the actual deviation value may be determined according to any one of the first, second, or third deviation values, or the actual deviation may be calculated by averaging the first, second, and third deviation values or by using any other index that reflects the trend in the data set.

In the method for predicting the gas emission concentration by the GRU network disclosed in this embodiment 1, the advantages of the GRU network in terms of model accuracy, generalization ability and training speed are utilized to avoid the occurrence of the condition that the gas concentration measurement is delayed; the GRU neural network structure with 4 layers is adopted, the space complexity of the network is reduced, and the requirement of practical application is met.

Example 2:

in order to make the input training data and the test data more close to the actual field requirement and improve the prediction accuracy, please refer to fig. 2, which is a flowchart of a gas emission concentration prediction method of a GRU network according to a second embodiment of the present application, in which, on the basis of embodiment 1, step S200 includes, before performing feature selection on the first parameter data set:

s210, filtering abnormal operation data from the first parameter data set by adopting a singular value inspection method or a threshold value inspection method;

specifically, when the singular value inspection method is used for filtering abnormal operation data, the method comprises the following steps:

performing singular value inspection on each item of data included in the first parameter data set;

defining an abnormal value according to a singular value inspection result;

removing abnormal values from the first parameter dataset, or defining abnormal values as missing values and performing completion of the missing values; when removing the abnormal value, direct deletion by a deletion method, a substitution method (substitution of a continuous variable mean, substitution of a discrete variable with a mode and a median) or an interpolation method (regression interpolation, multiple interpolation) may be selected. When the missing values are supplemented, if the missing values are continuous variables, calculating the mean value of the continuous variables to be supplemented can be selected subsequently; if the variable is a discrete variable, a mode or a median may be selected for completion. Alternatively, a regression model may be fitted to the non-missing data and then used to predict the missing data.

Specifically, when the threshold value checking method is adopted, abnormal operation data with a parameter value lower than the detection threshold value is filtered from the first parameter data set by setting the detection threshold value.

If it is required to ensure that the parameters in the first parameter data set are uniform in magnitude and dimension, after step S210 is performed, the following steps may be performed:

s220, carrying out standardization processing on the second parameter data set with the filtered abnormal operation data by adopting a Z-score data standardization method; specifically, the normalizing the second parameter data set by the Z-score data normalization method includes:

Finally, in this embodiment, a mutual information feature selection algorithm is considered to be used, feature selection is performed based on the normalized second parameter data set or the normalized first parameter data set with the abnormal operating data filtered out, wherein data in the data sets are combined pairwise, mutual information between the two combined data is calculated, and a mutual information calculation formula is as follows:

(4) in the formula: p (X, Y) is the joint probability distribution of X and Y; p (X) and p (Y) are the edge probability distributions for X and Y, respectively. In the embodiment, mutual information is used for quantifying the correlation degree among 2 random variables, and when the mutual information is 0, the variable X and the variable Y are in an independent relationship; and when the mutual information value is larger, the higher the correlation degree between the variable X and the variable Y is.

After the mutual information of all pairwise combined data is calculated, sorting can be performed according to the value size of the obtained multiple items of mutual information, and the front k items of feature subsets are selected as the input of the model to complete feature selection.

In the method for predicting the gas emission concentration of the GRU network disclosed in this embodiment 2, in order to improve the accuracy of the measured value, before feature selection is performed on the acquired data set, abnormal operation data is sequentially filtered, and the filtered data set is subjected to standardized processing, so that the acquired data meets the actual field requirements, and the prediction precision is effectively improved.

Example 3:

for easy understanding, please refer to fig. 3, fig. 3 is a system structural diagram of a gas emission concentration prediction system of a GRU network according to an embodiment of the present application, and as shown in fig. 3, the system structural diagram specifically includes:

the first parameter data set acquisition unit 10 is for acquiring a first parameter data set for predicting a gas emission concentration;

the feature selection unit 20 is configured to perform feature selection on the first parameter data set by using a mutual information feature selection algorithm to obtain a plurality of feature vectors for inputting to the GRU network for network training; the characteristic selection by utilizing a mutual information characteristic selection algorithm comprises the following steps:

selecting front k items of elements from the mutual information sequencing sequence, selecting corresponding data items from the first parameter data set based on the selected k items of elements, and taking the data items as the input of the GRU network; k is 1 or more.

The data dividing unit 30 is configured to divide the first parameter data set into a training set and a test set according to the obtained multiple feature vectors;

the GRU network training unit 40 is configured to use the training set as an input of the GRU network, train the GRU network, and output a pre-training network until an iterative convergence condition is satisfied;

the prediction result output unit 50 is configured to use the test set as an input of the pre-training network, calculate a deviation between a predicted value output by the pre-training network and a given actual value, and obtain a gas emission concentration prediction model if the deviation value is smaller than a preset deviation threshold.

Example 4:

in order to be able to filter out abnormal operation data from the first parameter data set, the system further comprises a data filtering unit 60; wherein:

the data filtering unit 60 is configured to filter abnormal operation data from the first parameter data set by using a singular value inspection method or a threshold inspection method; when the abnormal operation data filtering is carried out by adopting a singular value inspection method, the method comprises the following steps:

defining an abnormal value according to a singular value inspection result;

removing abnormal values from the first parameter dataset, or defining abnormal values as missing values and performing completion of the missing values;

It should be noted that, in order to ensure that the parameters in the first parameter data set are uniform in magnitude and dimension; the system further comprises a normalization processing unit 70, wherein:

the normalization processing unit 70 is configured to, after filtering the abnormal operation data from the first parameter data set by using a singular value inspection method or a threshold inspection method, perform normalization processing on the second parameter data set with the filtered abnormal operation data by using a Z-score data normalization method.

According to the gas emission concentration prediction method and system based on the GRU network, the advantages of the GRU network in model accuracy, generalization capability and training speed are utilized, and the situation that gas concentration measurement is delayed is avoided; in order to improve the accuracy of the measured value, filtering abnormal operation data is performed in sequence before feature selection is performed on the acquired data set, and the filtered data set is subjected to standardized processing, so that the acquired data meet the actual field requirements, and the prediction precision is effectively improved; the GRU neural network structure with 4 layers is adopted, the space complexity of the network is reduced, and the requirement of practical application is met.

It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.

The terms "first," "second," "third," "fourth," and the like in the description of the present application and in the above-described drawings are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the application described herein are, for example, capable of operation in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.

It should be understood that in the present application, "at least one" means one or more, "a plurality" means two or more. "and/or" for describing an association relationship of associated objects, indicating that there may be three relationships, e.g., "a and/or B" may indicate: only A, only B and both A and B are present, wherein A and B may be singular or plural. The character "/" generally indicates that the former and latter associated objects are in an "or" relationship. "at least one of the following" or similar expressions refer to any combination of these items, including any combination of single item(s) or plural items. For example, at least one (one) of a, b, or c, may represent: a, b, c, "a and b", "a and c", "b and c", or "a and b and c", wherein a, b, c may be single or plural.

The above embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions in the embodiments of the present application.

Claims

1. A gas emission concentration prediction method based on a GRU network is characterized by comprising the following steps:

acquiring a first parameter data set for predicting gas emission concentration;

taking the training set as the input of the GRU network, training the GRU network, and outputting a pre-training network until an iteration convergence condition is met;

2. The gas emission concentration prediction method according to claim 1, further comprising, before performing the feature selection on the first parameter data set:

defining an abnormal value according to a singular value inspection result;

3. The method of predicting a gas emission concentration of claim 2, wherein after filtering abnormal operation data from the first parameter data set using a singular value inspection method or a threshold inspection method, further comprising:

4. The method of predicting a concentration of a gas emission according to claim 3, wherein the characteristic selection from the first parameter data set is specifically:

5. The method of predicting gas emission concentration according to claim 3, wherein the normalizing the second parameter data set using the Z-score data normalization method comprises:

6. The method according to claim 1, wherein the hidden layer of the GRU network builds a 4-layer recurrent neural network with GRU cells, wherein:

the nodes contained in each layer are respectively: 128. 128, 64 and 32.

7. The method according to claim 1, wherein the calculating of the deviation between the predicted value of the pre-trained network output and the given actual value is specifically:

8. A GRU network-based gas emission concentration prediction system, comprising:

9. The system of claim 8, further comprising a data filtering unit; wherein:

defining an abnormal value according to a singular value inspection result;

10. The gas emission concentration prediction system of claim 9, further comprising a normalization processing unit, wherein: