CN111597175B

CN111597175B - Filling method of sensor missing value fusing time-space information

Info

Publication number: CN111597175B
Application number: CN202010374180.5A
Authority: CN
Inventors: 胡清华; 李东
Original assignee: Tianjin University
Current assignee: Tianjin University
Priority date: 2020-05-06
Filing date: 2020-05-06
Publication date: 2023-06-02
Anticipated expiration: 2040-05-06
Also published as: CN111597175A

Abstract

The invention provides a filling method of sensor missing values fusing space-time information, which comprises the following steps: inputting N pieces of historical data X and M pieces of missing data X _missing The method comprises the steps of carrying out a first treatment on the surface of the Wherein M, N is greater than the input timing length T; filling a threshold eta, inputting the history data into the trained LSTM-AE _S After that, η=std (X-X'); obtaining a trained model LSTM-AE _S The method comprises the steps of carrying out a first treatment on the surface of the Repaired data X ^repaired The method comprises the steps of carrying out a first treatment on the surface of the Dividing the original data into time sequence data sets; initializing LSTM-AE _S The method comprises the steps of carrying out a first treatment on the surface of the Then using Tensorflow to initialize the network; updating LSTM-AE using a back propagation algorithm commonly used by neural networks _S Weight W of (2); and filling the missing value. The method and the device consider the space-time information at the same time, can be relatively robust when a large number of sensors are simultaneously missing, can train a single model to process different types of missing, and can meet the real-time requirement of filling of missing values of the sensors.

Description

Filling method of sensor missing value fusing time-space information

Technical Field

The invention belongs to the field of equipment health management, and in particular relates to a filling method of sensor missing values fused with space-time information.

Background

Previously, the missing value filling method utilized only the association in the data space, and did not use the timing information. They do not perform well when there is a multidimensional loss of data, and even cannot be used in the presence of a block loss. Furthermore, previous work has first assumed that the location of the missing values is known, modeling for a single missing type. But the location of the missing values is not known in real-time systems. In this case, in order to be able to cope with various types of missing in real time, the number of models to be trained grows exponentially with the number of sensors, which is disadvantageous for practical application of the missing value filling method.

Disclosure of Invention

In order to solve the problems, the invention provides a filling method of sensor missing values fused with space-time information, which combines a depth automatic encoder and a long-short time neural network.

The invention provides a filling method of sensor missing values fusing space-time information, which comprises the following steps:

inputting N pieces of historical data X and M pieces of missing data X _missing The method comprises the steps of carrying out a first treatment on the surface of the Wherein M, N is greater than the input timing length T;

filling a threshold eta, inputting the history data into the trained LSTM-AE _S Then η=std (X-X '), where X is the test data, X' is the model output data, std is the standard deviation; obtaining a trained model LSTM-AE _S The method comprises the steps of carrying out a first treatment on the surface of the Repaired data X ^repaired ；

Dividing the original data into time sequence data sets;

initializing LSTM-AE _S : constructing a multi-layer self-coding neural network by using a Tensorflow deep learning framework, wherein the neuron types use LSTM, the number of neurons of a first layer is consistent with the number of sensors, and the number of neurons of an intermediate coding layer is the minimum dimension when the information retention rate of the historical data X after the dimension reduction is more than 99% by using a principal component analysis method; then using Tensorflow to initialize the network;

updating LSTM-AE using a back propagation algorithm commonly used by neural networks _S Weight W of (2);

and filling the missing value.

In the above method, wherein, in initializing LSTM-AE _S And before updating the weight W, further comprising calculating a reconstruction error:

wherein->

Respectively X ^j 、X ^j Sensor data at time T in'.

In the above method, wherein, in initializing LSTM-AE _S And before updating the weight W, further comprising calculating a regularization term error:

wherein->

Respectively represent X ^j '、X ^j+1 '、X ^j-1 Data at time T in'.

In the above method, wherein, in initializing LSTM-AE _S And before updating the weight W, further comprising calculating a penalty term:

wherein->

Is to solve the partial derivative, the whole neural network is regarded as a weight W, a bias term b and an input X _T By h _W,b (X _T ) Referring to θ, the regularization parameter.

The method has the advantages that the space-time information in the data can be simultaneously mined for missing value filling, and filling precision in multidimensional missing is greatly improved. Moreover, since the automatic encoder recovers all data simultaneously, the model can be trained for different deletion types, and the model training complexity and the data requirement are greatly reduced. Meanwhile, smoothness regularization is introduced, so that the prediction accuracy and robustness of the model are further improved. The use of the shared weight strategy enables the model to converge more quickly, and reduces the training complexity of the algorithm.

The existence of the missing values is a great hidden trouble for the safe and stable operation of a plurality of large-scale equipment, particularly power plants, and the precision and the robustness of the existing methods can not be practically applied to the practical application of the missing value filling, and the algorithm provided by the invention can improve the accuracy of the classical algorithm by more than 60% on the data of the actually operated gas turbine and has stronger robustness under the condition of multidimensional missing.

The invention can be widely applied to the health management system of the large-scale power device so as to realize the stable operation of the health management system.

Drawings

FIG. 1 shows a training flow diagram for LSTM-AE and LSTM-AEs.

Detailed Description

The following examples will enable those skilled in the art to more fully understand the present invention and are not intended to limit the same in any way.

In order to be able to use the timing information and the spatial information in the sensor data, neurons in the automatic encoder are replaced with cell structures in the long-term neural network, so that the automatic encoder can mine the timing and the spatial information simultaneously.

In addition, in the training process, the sensor data at the current moment is recovered as a main target, the middle characteristic layer can be free from a long-short-time neural network layer and only needs to use common neurons, so that the complexity of the model is reduced. The input model in the training process is data in a matrix form, the corresponding sensor data are in the transverse direction, and the time axis is in the longitudinal direction.

Because the time series data has smooth characteristic, namely the value change between adjacent records is not too large, smoothness regularization is introduced into the model, and meanwhile, the risk of over-fitting of the model is reduced. In addition, the model becomes more complex and difficult to solve after the regular term is added, so that a strategy of sharing the weight is introduced to avoid unified solving of the loss of the regular term. And simultaneously inputting the data before and after the current moment into the model to obtain a reconstruction value, and directly solving the loss of the regularization term through the three reconstruction values.

The determination of the missing value is performed by a threshold determination method. In general, the difference between the reconstructed values corresponding to the missing values is large. If the difference value exceeds the threshold value, the model reconstruction value can be judged to replace the existing value, so that the purpose of automatic filling of the missing value is achieved.

The invention will be better understood by reference to the following examples.

Input: n pieces of history data X; m pieces of missing data X _missing The method comprises the steps of carrying out a first treatment on the surface of the Wherein M, N must be greater than the input timing length T; for sensor data, the T value is generally more than 200; filling threshold: η, input the history data into trained LSTM-AE _S After that, η=std(X-X '), wherein X is test data, X' is model output data, and std is standard deviation.

And (3) outputting: trained model LSTM-AE _S The method comprises the steps of carrying out a first treatment on the surface of the Repaired data X ^repaired 。

Training:

1) Dividing the raw data into time series data sets:

for i＝0 to N-T do

|X ⁱ ＝X(X _i+1 ,....X _i+T )

end

X _train ＝{X ¹ ,X ² ,...X ^N-T }

2) Training a model:

initializing LSTM-AE _S : constructing a multi-layer self-coding neural network by using a Tensorflow deep learning framework, wherein the neuron types use LSTM, the number of neurons of a first layer is consistent with the number of sensors, and the number of neurons of an intermediate coding layer is the minimum dimension when the information retention rate of the historical data X after the dimension reduction is more than 99% by using a principal component analysis method; network initialization is then performed using Tensorflow.

for j＝1to N-T do

X is to be ^j-1 ,X ^j ,X ^j+1 Input LSTM-AE _S And obtain output X ^j-1′ ,X ^j′ ,X ^j+1′ ；

Calculating a reconstruction error:

wherein->

Respectively X ^j 、X ^j'

Sensor data at time T in (b);

calculating a regularization term error:

wherein->

Respectively represent X ^j '、X ^j+1' 、X ^j-1' Data at time T in (b);

calculating a loss term:

wherein->

The partial derivative is solved, and the whole neural network can be regarded as a weight W, a bias term b and an input X _T Is herein h _W,b (X _T ) In the method, θ is a regularization parameter, and the value is generally smaller than 0.1, and larger values can lead to too smooth prediction results of an algorithm obtained through training, so that the results are poor;

updating LSTM-AE using a back propagation algorithm commonly used by neural networks _S Weight W of (a).

Missing value filling:

for k＝1 to M-T do

X ^k ＝X _missing (X _k+1 ,...X _k+T )；

x is to be ^k Inputting trained LSTM-AE _s And obtain output X ^k '；

Wherein the method comprises the steps of

Refers to output X ^k ' predicted value at time T +.>

Is the predicted value on its p-th sensor; the eta takes the value of the difference between the predicted value and the true value in the training processStandard deviation of values;

end。

FIG. 1 shows a model training flow diagram of the present invention. The algorithm flow disclosed by the invention may be better understood with reference to fig. 1.

The sensor missing value filling algorithm fused with the space-time information combines a depth automatic encoder and a long-short time neural network, integrates the feature extraction capability and the time sequence feature mining capability of the depth automatic encoder and the long-short time neural network, combines the depth automatic encoder and the long-short time neural network into the same depth network for optimization, and can more accurately fill the missing value through the combination of the space-time information. Meanwhile, the method utilizes the smoothness characteristic of the time sequence data of the sensor, namely that adjacent records should change smoothly, and smoothness regularization is added into the model, so that the filling precision is further improved. In addition, by introducing a weight sharing mechanism, the calculation of the regular term is simplified, and the training speed of the model is also improved to a certain extent. The method and the device consider the space-time information at the same time, can be relatively robust when a large number of sensors are simultaneously missing, have higher filling precision compared with the prior art, can simultaneously process different types of missing when one model is trained, and can meet the real-time requirement of filling the missing values of the sensors.

In addition, the method of the invention fuses the depth automatic encoder and the long and short time neural network, and combines the two into the same depth network for optimization. In addition, by adding smoothness regularization into the model, the accuracy of filling missing values by the algorithm can be further improved. In addition, data at different moments are input into a shared network through a weight sharing mechanism, and then the regular term loss is directly calculated, so that the calculation of the regular term is simplified, and the training speed of an algorithm is accelerated.

It should be understood by those skilled in the art that the above embodiments are exemplary embodiments only and that various changes, substitutions, and alterations can be made hereto without departing from the spirit and scope of the present application.

Claims

1. A method of filling sensor missing values that fuse spatio-temporal information, comprising:

Dividing N pieces of historical data X into time sequence data sets;

m pieces of missing data X _missing Inputting trained LSTM-AE _S And filling the missing value.

2. The method of claim 1, wherein, at initialization of LSTM-AE _S And before updating the weight W, further comprising calculating a reconstruction error:

wherein->

Respectively X ^j 、X ^j' The value range of j is 1 to N-T.

3. The method of claim 2, wherein, at initialization of LSTM-AE _S And before updating the weight W, further comprising calculating a regularization term error:

wherein->

Respectively represent X ^j' 、X ^j+1' 、X ^j-1' The value range of j is 1 to N-T.

4. The method of claim 3, wherein, in initializing LSTM-AE _S And before updating the weight W, further comprising calculating a penalty term:

wherein->

Is to solve the partial derivative, the whole neural network is regarded as a weight W, a bias term b and an input X _T By h _W,b (X _T ) Referring to θ, the regularization parameter. />